Balliu, Brunilda; Tsonaka, Roula; Boehringer, Stefan; Houwing-Duistermaat, Jeanine
2015-03-01
Integrative omics, the joint analysis of outcome and multiple types of omics data, such as genomics, epigenomics, and transcriptomics data, constitute a promising approach for powerful and biologically relevant association studies. These studies often employ a case-control design, and often include nonomics covariates, such as age and gender, that may modify the underlying omics risk factors. An open question is how to best integrate multiple omics and nonomics information to maximize statistical power in case-control studies that ascertain individuals based on the phenotype. Recent work on integrative omics have used prospective approaches, modeling case-control status conditional on omics, and nonomics risk factors. Compared to univariate approaches, jointly analyzing multiple risk factors with a prospective approach increases power in nonascertained cohorts. However, these prospective approaches often lose power in case-control studies. In this article, we propose a novel statistical method for integrating multiple omics and nonomics factors in case-control association studies. Our method is based on a retrospective likelihood function that models the joint distribution of omics and nonomics factors conditional on case-control status. The new method provides accurate control of Type I error rate and has increased efficiency over prospective approaches in both simulated and real data. © 2015 Wiley Periodicals, Inc.
Palazzotto, Emilia; Weber, Tilmann
2018-04-12
Natural products produced by microorganisms represent the main source of bioactive molecules. The development of high-throughput (omics) techniques have importantly contributed to the renaissance of new antibiotic discovery increasing our understanding of complex mechanisms controlling the expression of biosynthetic gene clusters (BGCs) encoding secondary metabolites. In this context this review highlights recent progress in the use and integration of 'omics' approaches with focuses on genomics, transcriptomics, proteomics metabolomics meta-omics and combined omics as powerful strategy to discover new antibiotics. Copyright © 2018 Elsevier Ltd. All rights reserved.
Saw, Woei-Yuh; Tantoso, Erwin; Begum, Husna; Zhou, Lihan; Zou, Ruiyang; He, Cheng; Chan, Sze Ling; Tan, Linda Wei-Lin; Wong, Lai-Ping; Xu, Wenting; Moong, Don Kyin Nwe; Lim, Yenly; Li, Bowen; Pillai, Nisha Esakimuthu; Peterson, Trevor A; Bielawny, Tomasz; Meikle, Peter J; Mundra, Piyushkumar A; Lim, Wei-Yen; Luo, Ma; Chia, Kee-Seng; Ong, Rick Twee-Hee; Brunham, Liam R; Khor, Chiea-Chuen; Too, Heng Phon; Soong, Richie; Wenk, Markus R; Little, Peter; Teo, Yik-Ying
2017-09-21
The Singapore Integrative Omics Study provides valuable insights on establishing population reference measurement in 364 Chinese, Malay, and Indian individuals. These measurements include > 2.5 millions genetic variants, 21,649 transcripts expression, 282 lipid species quantification, and 284 clinical, lifestyle, and dietary variables. This concept paper introduces the depth of the data resource, and investigates the extent of ethnic variation at these omics and non-omics biomarkers. It is evident that there are specific biomarkers in each of these platforms to differentiate between the ethnicities, and intra-population analyses suggest that Chinese and Indians are the most biologically homogeneous and heterogeneous, respectively, of the three groups. Consistent patterns of correlations between lipid species also suggest the possibility of lipid tagging to simplify future lipidomics assays. The Singapore Integrative Omics Study is expected to allow the characterization of intra-omic and inter-omic correlations within and across all three ethnic groups through a systems biology approach.The Singapore Genome Variation projects characterized the genetics of Singapore's Chinese, Malay, and Indian populations. The Singapore Integrative Omics Study introduced here goes further in providing multi-omic measurements in individuals from these populations, including genetic, transcriptome, lipidome, and lifestyle data, and will facilitate the study of common diseases in Asian communities.
Jalili, Mahdi; Gebhardt, Tom; Wolkenhauer, Olaf; Salehzadeh-Yazdi, Ali
2018-06-01
Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease related genes, gene prioritization, clustering protein interactions, developing the modules, extract active subnetworks and static protein complexes or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers. Copyright © 2018 Elsevier B.V. All rights reserved.
Poly-Omic Prediction of Complex Traits: OmicKriging
Wheeler, Heather E.; Aquino-Michaels, Keston; Gamazon, Eric R.; Trubetskoy, Vassily V.; Dolan, M. Eileen; Huang, R. Stephanie; Cox, Nancy J.; Im, Hae Kyung
2014-01-01
High-confidence prediction of complex traits such as disease risk or drug response is an ultimate goal of personalized medicine. Although genome-wide association studies have discovered thousands of well-replicated polymorphisms associated with a broad spectrum of complex traits, the combined predictive power of these associations for any given trait is generally too low to be of clinical relevance. We propose a novel systems approach to complex trait prediction, which leverages and integrates similarity in genetic, transcriptomic, or other omics-level data. We translate the omic similarity into phenotypic similarity using a method called Kriging, commonly used in geostatistics and machine learning. Our method called OmicKriging emphasizes the use of a wide variety of systems-level data, such as those increasingly made available by comprehensive surveys of the genome, transcriptome, and epigenome, for complex trait prediction. Furthermore, our OmicKriging framework allows easy integration of prior information on the function of subsets of omics-level data from heterogeneous sources without the sometimes heavy computational burden of Bayesian approaches. Using seven disease datasets from the Wellcome Trust Case Control Consortium (WTCCC), we show that OmicKriging allows simple integration of sparse and highly polygenic components yielding comparable performance at a fraction of the computing time of a recently published Bayesian sparse linear mixed model method. Using a cellular growth phenotype, we show that integrating mRNA and microRNA expression data substantially increases performance over either dataset alone. Using clinical statin response, we show improved prediction over existing methods. PMID:24799323
Integrated omics analysis of specialized metabolism in medicinal plants.
Rai, Amit; Saito, Kazuki; Yamazaki, Mami
2017-05-01
Medicinal plants are a rich source of highly diverse specialized metabolites with important pharmacological properties. Until recently, plant biologists were limited in their ability to explore the biosynthetic pathways of these metabolites, mainly due to the scarcity of plant genomics resources. However, recent advances in high-throughput large-scale analytical methods have enabled plant biologists to discover biosynthetic pathways for important plant-based medicinal metabolites. The reduced cost of generating omics datasets and the development of computational tools for their analysis and integration have led to the elucidation of biosynthetic pathways of several bioactive metabolites of plant origin. These discoveries have inspired synthetic biology approaches to develop microbial systems to produce bioactive metabolites originating from plants, an alternative sustainable source of medicinally important chemicals. Since the demand for medicinal compounds are increasing with the world's population, understanding the complete biosynthesis of specialized metabolites becomes important to identify or develop reliable sources in the future. Here, we review the contributions of major omics approaches and their integration to our understanding of the biosynthetic pathways of bioactive metabolites. We briefly discuss different approaches for integrating omics datasets to extract biologically relevant knowledge and the application of omics datasets in the construction and reconstruction of metabolic models. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Integrative FourD omics approach profiles the target network of the carbon storage regulatory system
Sowa, Steven W.; Gelderman, Grant; Leistra, Abigail N.; Buvanendiran, Aishwarya; Lipp, Sarah; Pitaktong, Areen; Vakulskas, Christopher A.; Romeo, Tony; Baldea, Michael
2017-01-01
Abstract Multi-target regulators represent a largely untapped area for metabolic engineering and anti-bacterial development. These regulators are complex to characterize because they often act at multiple levels, affecting proteins, transcripts and metabolites. Therefore, single omics experiments cannot profile their underlying targets and mechanisms. In this work, we used an Integrative FourD omics approach (INFO) that consists of collecting and analyzing systems data throughout multiple time points, using multiple genetic backgrounds, and multiple omics approaches (transcriptomics, proteomics and high throughput sequencing crosslinking immunoprecipitation) to evaluate simultaneous changes in gene expression after imposing an environmental stress that accentuates the regulatory features of a network. Using this approach, we profiled the targets and potential regulatory mechanisms of a global regulatory system, the well-studied carbon storage regulatory (Csr) system of Escherichia coli, which is widespread among bacteria. Using 126 sets of proteomics and transcriptomics data, we identified 136 potential direct CsrA targets, including 50 novel ones, categorized their behaviors into distinct regulatory patterns, and performed in vivo fluorescence-based follow up experiments. The results of this work validate 17 novel mRNAs as authentic direct CsrA targets and demonstrate a generalizable strategy to integrate multiple lines of omics data to identify a core pool of regulator targets. PMID:28126921
Modelling plankton ecosystems in the meta-omics era. Are we ready?
Stec, Krzysztof Franciszek; Caputi, Luigi; Buttigieg, Pier Luigi; D'Alelio, Domenico; Ibarbalz, Federico Matias; Sullivan, Matthew B; Chaffron, Samuel; Bowler, Chris; Ribera d'Alcalà, Maurizio; Iudicone, Daniele
2017-04-01
Recent progress in applying meta-omics approaches to the study of marine ecosystems potentially allows scientists to study the genetic and functional diversity of plankton at an unprecedented depth and with enhanced precision. However, while a range of persistent technical issues still need to be resolved, a much greater obstacle currently preventing a complete and integrated view of the marine ecosystem is the absence of a clear conceptual framework. Herein, we discuss the knowledge that has thus far been derived from conceptual and statistical modelling of marine plankton ecosystems, and illustrate the potential power of integrated meta-omics approaches in the field. We then propose the use of a semantic framework is necessary to support integrative ecological modelling in the meta-omics era, particularly when having to face the increased interdisciplinarity needed to address global issues related to climate change. Copyright © 2017. Published by Elsevier B.V.
2016-12-01
acids (PUFA) on cerebral neurobiology: an integrated omics approach with epigenomic focus Nabarun Chakrabortya,b, Seid Muhiea,b, Raina Kumara,c, Aarti...C57BL/6j mice fed on any of these three diets from their neonatal age to midlife. Integrating the multiomics data, we focused on the genes encoding both...been evaluated in the context of a wide variety of health issues [23]. The escalated risks of pathological and psychological disease have been
Sowa, Steven W; Gelderman, Grant; Leistra, Abigail N; Buvanendiran, Aishwarya; Lipp, Sarah; Pitaktong, Areen; Vakulskas, Christopher A; Romeo, Tony; Baldea, Michael; Contreras, Lydia M
2017-02-28
Multi-target regulators represent a largely untapped area for metabolic engineering and anti-bacterial development. These regulators are complex to characterize because they often act at multiple levels, affecting proteins, transcripts and metabolites. Therefore, single omics experiments cannot profile their underlying targets and mechanisms. In this work, we used an Integrative FourD omics approach (INFO) that consists of collecting and analyzing systems data throughout multiple time points, using multiple genetic backgrounds, and multiple omics approaches (transcriptomics, proteomics and high throughput sequencing crosslinking immunoprecipitation) to evaluate simultaneous changes in gene expression after imposing an environmental stress that accentuates the regulatory features of a network. Using this approach, we profiled the targets and potential regulatory mechanisms of a global regulatory system, the well-studied carbon storage regulatory (Csr) system of Escherichia coli, which is widespread among bacteria. Using 126 sets of proteomics and transcriptomics data, we identified 136 potential direct CsrA targets, including 50 novel ones, categorized their behaviors into distinct regulatory patterns, and performed in vivo fluorescence-based follow up experiments. The results of this work validate 17 novel mRNAs as authentic direct CsrA targets and demonstrate a generalizable strategy to integrate multiple lines of omics data to identify a core pool of regulator targets. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Integrated omics dissection of proteome dynamics during cardiac remodeling.
Lau, Edward; Cao, Quan; Lam, Maggie P Y; Wang, Jie; Ng, Dominic C M; Bleakley, Brian J; Lee, Jessica M; Liem, David A; Wang, Ding; Hermjakob, Henning; Ping, Peipei
2018-01-09
Transcript abundance and protein abundance show modest correlation in many biological models, but how this impacts disease signature discovery in omics experiments is rarely explored. Here we report an integrated omics approach, incorporating measurements of transcript abundance, protein abundance, and protein turnover to map the landscape of proteome remodeling in a mouse model of pathological cardiac hypertrophy. Analyzing the hypertrophy signatures that are reproducibly discovered from each omics data type across six genetic strains of mice, we find that the integration of transcript abundance, protein abundance, and protein turnover data leads to 75% gain in discovered disease gene candidates. Moreover, the inclusion of protein turnover measurements allows discovery of post-transcriptional regulations across diverse pathways, and implicates distinct disease proteins not found in steady-state transcript and protein abundance data. Our results suggest that multi-omics investigations of proteome dynamics provide important insights into disease pathogenesis in vivo.
Reinhold, William C
2015-12-10
There is currently a split within the cancer research community between traditional molecular biological hypothesis-driven and the more recent "omic" forms or research. While the molecular biological approach employs the tried and true single alteration-single response formulations of experimentation, the omic employs broad-based assay or sample collection approaches that generate large volumes of data. How to integrate the benefits of these two approaches in an efficient and productive fashion remains an outstanding issue. Ideally, one would merge the understandability, exactness, simplicity, and testability of the molecular biological approach, with the larger amounts of data, simultaneous consideration of multiple alterations, consideration of genes both of known interest along with the novel, cross-sample comparisons among cell lines and patient samples, and consideration of directed questions while simultaneously gaining exposure to the novel provided by the omic approach. While at the current time integration of the two disciplines remains problematic, attempts to do so are ongoing, and will be necessary for the understanding of the large cell line screens including the Developmental Therapeutics Program's NCI-60, the Broad Institute's Cancer Cell Line Encyclopedia, and the Wellcome Trust Sanger Institute's Cancer Genome Project, as well as the the Cancer Genome Atlas clinical samples project. Going forward there is significant benefit to be had from the integration of the molecular biological and the omic forms or research, with the desired goal being improved translational understanding and application.
Rebollar, Eria A; Antwis, Rachael E; Becker, Matthew H; Belden, Lisa K; Bletz, Molly C; Brucker, Robert M; Harrison, Xavier A; Hughey, Myra C; Kueneman, Jordan G; Loudon, Andrew H; McKenzie, Valerie; Medina, Daniel; Minbiole, Kevin P C; Rollins-Smith, Louise A; Walke, Jenifer B; Weiss, Sophie; Woodhams, Douglas C; Harris, Reid N
2016-01-01
Emerging infectious diseases in wildlife are responsible for massive population declines. In amphibians, chytridiomycosis caused by Batrachochytrium dendrobatidis, Bd, has severely affected many amphibian populations and species around the world. One promising management strategy is probiotic bioaugmentation of antifungal bacteria on amphibian skin. In vivo experimental trials using bioaugmentation strategies have had mixed results, and therefore a more informed strategy is needed to select successful probiotic candidates. Metagenomic, transcriptomic, and metabolomic methods, colloquially called "omics," are approaches that can better inform probiotic selection and optimize selection protocols. The integration of multiple omic data using bioinformatic and statistical tools and in silico models that link bacterial community structure with bacterial defensive function can allow the identification of species involved in pathogen inhibition. We recommend using 16S rRNA gene amplicon sequencing and methods such as indicator species analysis, the Kolmogorov-Smirnov Measure, and co-occurrence networks to identify bacteria that are associated with pathogen resistance in field surveys and experimental trials. In addition to 16S amplicon sequencing, we recommend approaches that give insight into symbiont function such as shotgun metagenomics, metatranscriptomics, or metabolomics to maximize the probability of finding effective probiotic candidates, which can then be isolated in culture and tested in persistence and clinical trials. An effective mitigation strategy to ameliorate chytridiomycosis and other emerging infectious diseases is necessary; the advancement of omic methods and the integration of multiple omic data provide a promising avenue toward conservation of imperiled species.
From data to knowledge: The future of multi-omics data analysis for the rhizosphere
DOE Office of Scientific and Technical Information (OSTI.GOV)
Allen White, Richard; Borkum, Mark I.; Rivas-Ubach, Albert
The rhizosphere is the interface between a plant's roots and its surrounding soil. The rhizosphere microbiome, a complex microbial ecosystem, nourishes the terrestrial biosphere. Integrated multi-omics is a modern approach to systems biology that analyzes and interprets the datasets of multiple -omes of both individual organisms and multi-organism communities and consortia. The successful usage and application of integrated multi-omics to rhizospheric science is predicated upon the availability of rhizosphere-specific data, metadata and software. This review analyzes the availability of multi-omics data, metadata and software for rhizospheric science, identifying potential issues, challenges and opportunities.
Integrative Analysis of Omics Big Data.
Yu, Xiang-Tian; Zeng, Tao
2018-01-01
The diversity and huge omics data take biology and biomedicine research and application into a big data era, just like that popular in human society a decade ago. They are opening a new challenge from horizontal data ensemble (e.g., the similar types of data collected from different labs or companies) to vertical data ensemble (e.g., the different types of data collected for a group of person with match information), which requires the integrative analysis in biology and biomedicine and also asks for emergent development of data integration to address the great changes from previous population-guided to newly individual-guided investigations.Data integration is an effective concept to solve the complex problem or understand the complicate system. Several benchmark studies have revealed the heterogeneity and trade-off that existed in the analysis of omics data. Integrative analysis can combine and investigate many datasets in a cost-effective reproducible way. Current integration approaches on biological data have two modes: one is "bottom-up integration" mode with follow-up manual integration, and the other one is "top-down integration" mode with follow-up in silico integration.This paper will firstly summarize the combinatory analysis approaches to give candidate protocol on biological experiment design for effectively integrative study on genomics and then survey the data fusion approaches to give helpful instruction on computational model development for biological significance detection, which have also provided newly data resources and analysis tools to support the precision medicine dependent on the big biomedical data. Finally, the problems and future directions are highlighted for integrative analysis of omics big data.
Single Cell Multi-Omics Technology: Methodology and Application.
Hu, Youjin; An, Qin; Sheu, Katherine; Trejo, Brandon; Fan, Shuxin; Guo, Ying
2018-01-01
In the era of precision medicine, multi-omics approaches enable the integration of data from diverse omics platforms, providing multi-faceted insight into the interrelation of these omics layers on disease processes. Single cell sequencing technology can dissect the genotypic and phenotypic heterogeneity of bulk tissue and promises to deepen our understanding of the underlying mechanisms governing both health and disease. Through modification and combination of single cell assays available for transcriptome, genome, epigenome, and proteome profiling, single cell multi-omics approaches have been developed to simultaneously and comprehensively study not only the unique genotypic and phenotypic characteristics of single cells, but also the combined regulatory mechanisms evident only at single cell resolution. In this review, we summarize the state-of-the-art single cell multi-omics methods and discuss their applications, challenges, and future directions.
Single Cell Multi-Omics Technology: Methodology and Application
Hu, Youjin; An, Qin; Sheu, Katherine; Trejo, Brandon; Fan, Shuxin; Guo, Ying
2018-01-01
In the era of precision medicine, multi-omics approaches enable the integration of data from diverse omics platforms, providing multi-faceted insight into the interrelation of these omics layers on disease processes. Single cell sequencing technology can dissect the genotypic and phenotypic heterogeneity of bulk tissue and promises to deepen our understanding of the underlying mechanisms governing both health and disease. Through modification and combination of single cell assays available for transcriptome, genome, epigenome, and proteome profiling, single cell multi-omics approaches have been developed to simultaneously and comprehensively study not only the unique genotypic and phenotypic characteristics of single cells, but also the combined regulatory mechanisms evident only at single cell resolution. In this review, we summarize the state-of-the-art single cell multi-omics methods and discuss their applications, challenges, and future directions. PMID:29732369
Higdon, Roger; Earl, Rachel K.; Stanberry, Larissa; Hudac, Caitlin M.; Montague, Elizabeth; Stewart, Elizabeth; Janko, Imre; Choiniere, John; Broomall, William; Kolker, Natali
2015-01-01
Abstract Complex diseases are caused by a combination of genetic and environmental factors, creating a difficult challenge for diagnosis and defining subtypes. This review article describes how distinct disease subtypes can be identified through integration and analysis of clinical and multi-omics data. A broad shift toward molecular subtyping of disease using genetic and omics data has yielded successful results in cancer and other complex diseases. To determine molecular subtypes, patients are first classified by applying clustering methods to different types of omics data, then these results are integrated with clinical data to characterize distinct disease subtypes. An example of this molecular-data-first approach is in research on Autism Spectrum Disorder (ASD), a spectrum of social communication disorders marked by tremendous etiological and phenotypic heterogeneity. In the case of ASD, omics data such as exome sequences and gene and protein expression data are combined with clinical data such as psychometric testing and imaging to enable subtype identification. Novel ASD subtypes have been proposed, such as CHD8, using this molecular subtyping approach. Broader use of molecular subtyping in complex disease research is impeded by data heterogeneity, diversity of standards, and ineffective analysis tools. The future of molecular subtyping for ASD and other complex diseases calls for an integrated resource to identify disease mechanisms, classify new patients, and inform effective treatment options. This in turn will empower and accelerate precision medicine and personalized healthcare. PMID:25831060
Sarcoidosis Related Novel Candidate Genes Identified by Multi-Omics Integrative Analyses.
Hočevar, Keli; Maver, Aleš; Kunej, Tanja; Peterlin, Borut
2018-05-01
Sarcoidosis is a multifactorial systemic disease characterized by granulomatous inflammation and greatly impacting on global public health. The etiology and mechanisms of sarcoidosis are not fully understood. Recent high-throughput biological research has generated vast amounts of multi-omics big data on sarcoidosis, but their significance remains to be determined. We sought to identify novel candidate regions, and genes consistently altered in heterogeneous omics studies so as to reveal the underlying molecular mechanisms. We conducted a comprehensive integrative literature analysis on global data on sarcoidosis, including genomic, transcriptomic, proteomic, and phenomic studies. We performed positional integration analysis of 38 eligible datasets originating from 17 different biological layers. Using the integration interval length of 50 kb, we identified 54 regions reaching significance value p ≤ 0.0001 and 15 regions with significance value p ≤ 0.00001, when applying more stringent criteria. Secondary literature analysis of the top 20 regions, with the most significant accumulation of signals, revealed several novel candidate genes for which associations with sarcoidosis have not yet been established, but have considerable support for their involvement based on omic data. These new plausible candidate genes include NELFE, CFB, EGFL7, AGPAT2, FKBPL, NRC3, and NEU1. Furthermore, annotated data were prepared to enable custom visualization and browsing of these sarcoidosis related omics evidence in the University of California Santa Cruz (UCSC) Genome Browser. Further multi-omics approaches are called for sarcoidosis biomarkers and diagnostic and therapeutic innovation. Our approach for harnessing multi-omics data and the findings presented herein reflect important steps toward understanding the etiology and underlying pathological mechanisms of sarcoidosis.
Studying Cellular Signal Transduction with OMIC Technologies.
Landry, Benjamin D; Clarke, David C; Lee, Michael J
2015-10-23
In the gulf between genotype and phenotype exists proteins and, in particular, protein signal transduction systems. These systems use a relatively limited parts list to respond to a much longer list of extracellular, environmental, and/or mechanical cues with rapidity and specificity. Most signaling networks function in a highly non-linear and often contextual manner. Furthermore, these processes occur dynamically across space and time. Because of these complexities, systems and "OMIC" approaches are essential for the study of signal transduction. One challenge in using OMIC-scale approaches to study signaling is that the "signal" can take different forms in different situations. Signals are encoded in diverse ways such as protein-protein interactions, enzyme activities, localizations, or post-translational modifications to proteins. Furthermore, in some cases, signals may be encoded only in the dynamics, duration, or rates of change of these features. Accordingly, systems-level analyses of signaling may need to integrate multiple experimental and/or computational approaches. As the field has progressed, the non-triviality of integrating experimental and computational analyses has become apparent. Successful use of OMIC methods to study signaling will require the "right" experiments and the "right" modeling approaches, and it is critical to consider both in the design phase of the project. In this review, we discuss common OMIC and modeling approaches for studying signaling, emphasizing the philosophical and practical considerations for effectively merging these two types of approaches to maximize the probability of obtaining reliable and novel insights into signaling biology. Copyright © 2015 Elsevier Ltd. All rights reserved.
Resource recovery from wastewater: application of meta-omics to phosphorus and carbon management.
Sales, Christopher M; Lee, Patrick K H
2015-06-01
A growing trend at wastewater treatment plants is the recovery of resources and energy from wastewater. Enhanced biological phosphorus removal and anaerobic digestion are two established biotechnology approaches for the recovery of phosphorus and carbon, respectively. Meta-omics approaches (meta-genomics, transcriptomics, proteomics, and metabolomics) are providing novel biological insights into these complex biological systems. In particular, genome-centric metagenomics analyses are revealing the function and physiology of individual community members. Querying transcripts, proteins and metabolites are emerging techniques that can inform the cellular responses under different conditions. Overall, meta-omics approaches are shedding light into complex microbial communities once regarded as 'blackboxes', but challenges remain to integrate information from meta-omics into engineering design and operation guidelines. Copyright © 2015 Elsevier Ltd. All rights reserved.
Zeng, Irene Sui Lan; Lumley, Thomas
2018-01-01
Integrated omics is becoming a new channel for investigating the complex molecular system in modern biological science and sets a foundation for systematic learning for precision medicine. The statistical/machine learning methods that have emerged in the past decade for integrated omics are not only innovative but also multidisciplinary with integrated knowledge in biology, medicine, statistics, machine learning, and artificial intelligence. Here, we review the nontrivial classes of learning methods from the statistical aspects and streamline these learning methods within the statistical learning framework. The intriguing findings from the review are that the methods used are generalizable to other disciplines with complex systematic structure, and the integrated omics is part of an integrated information science which has collated and integrated different types of information for inferences and decision making. We review the statistical learning methods of exploratory and supervised learning from 42 publications. We also discuss the strengths and limitations of the extended principal component analysis, cluster analysis, network analysis, and regression methods. Statistical techniques such as penalization for sparsity induction when there are fewer observations than the number of features and using Bayesian approach when there are prior knowledge to be integrated are also included in the commentary. For the completeness of the review, a table of currently available software and packages from 23 publications for omics are summarized in the appendix.
MinOmics, an Integrative and Immersive Tool for Multi-Omics Analysis.
Maes, Alexandre; Martinez, Xavier; Druart, Karen; Laurent, Benoist; Guégan, Sean; Marchand, Christophe H; Lemaire, Stéphane D; Baaden, Marc
2018-06-21
Proteomic and transcriptomic technologies resulted in massive biological datasets, their interpretation requiring sophisticated computational strategies. Efficient and intuitive real-time analysis remains challenging. We use proteomic data on 1417 proteins of the green microalga Chlamydomonas reinhardtii to investigate physicochemical parameters governing selectivity of three cysteine-based redox post translational modifications (PTM): glutathionylation (SSG), nitrosylation (SNO) and disulphide bonds (SS) reduced by thioredoxins. We aim to understand underlying molecular mechanisms and structural determinants through integration of redox proteome data from gene- to structural level. Our interactive visual analytics approach on an 8.3 m2 display wall of 25 MPixel resolution features stereoscopic three dimensions (3D) representation performed by UnityMol WebGL. Virtual reality headsets complement the range of usage configurations for fully immersive tasks. Our experiments confirm that fast access to a rich cross-linked database is necessary for immersive analysis of structural data. We emphasize the possibility to display complex data structures and relationships in 3D, intrinsic to molecular structure visualization, but less common for omics-network analysis. Our setup is powered by MinOmics, an integrated analysis pipeline and visualization framework dedicated to multi-omics analysis. MinOmics integrates data from various sources into a materialized physical repository. We evaluate its performance, a design criterion for the framework.
Integrative eQTL analysis of tumor and host omics data in individuals with bladder cancer.
Pineda, Silvia; Van Steen, Kristel; Malats, Núria
2017-09-01
Integrative analyses of several omics data are emerging. The data are usually generated from the same source material (i.e., tumor sample) representing one level of regulation. However, integrating different regulatory levels (i.e., blood) with those from tumor may also reveal important knowledge about the human genetic architecture. To model this multilevel structure, an integrative-expression quantitative trait loci (eQTL) analysis applying two-stage regression (2SR) was proposed. This approach first regressed tumor gene expression levels with tumor markers and the adjusted residuals from the previous model were then regressed with the germline genotypes measured in blood. Previously, we demonstrated that penalized regression methods in combination with a permutation-based MaxT method (Global-LASSO) is a promising tool to fix some of the challenges that high-throughput omics data analysis imposes. Here, we assessed whether Global-LASSO can also be applied when tumor and blood omics data are integrated. We further compared our strategy with two 2SR-approaches, one using multiple linear regression (2SR-MLR) and other using LASSO (2SR-LASSO). We applied the three models to integrate genomic, epigenomic, and transcriptomic data from tumor tissue with blood germline genotypes from 181 individuals with bladder cancer included in the TCGA Consortium. Global-LASSO provided a larger list of eQTLs than the 2SR methods, identified a previously reported eQTLs in prostate stem cell antigen (PSCA), and provided further clues on the complexity of APBEC3B loci, with a minimal false-positive rate not achieved by 2SR-MLR. It also represents an important contribution for omics integrative analysis because it is easy to apply and adaptable to any type of data. © 2017 WILEY PERIODICALS, INC.
Dimension reduction techniques for the integrative analysis of multi-omics data
Zeleznik, Oana A.; Thallinger, Gerhard G.; Kuster, Bernhard; Gholami, Amin M.
2016-01-01
State-of-the-art next-generation sequencing, transcriptomics, proteomics and other high-throughput ‘omics' technologies enable the efficient generation of large experimental data sets. These data may yield unprecedented knowledge about molecular pathways in cells and their role in disease. Dimension reduction approaches have been widely used in exploratory analysis of single omics data sets. This review will focus on dimension reduction approaches for simultaneous exploratory analyses of multiple data sets. These methods extract the linear relationships that best explain the correlated structure across data sets, the variability both within and between variables (or observations) and may highlight data issues such as batch effects or outliers. We explore dimension reduction techniques as one of the emerging approaches for data integration, and how these can be applied to increase our understanding of biological systems in normal physiological function and disease. PMID:26969681
Kim, Dokyoon; Joung, Je-Gun; Sohn, Kyung-Ah; Shin, Hyunjung; Park, Yu Rang; Ritchie, Marylyn D; Kim, Ju Han
2015-01-01
Objective Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approach that combines multi-omics data including copy number alteration, methylation, miRNA, and gene expression data for predicting clinical outcome in cancer. However, genomic features likely interact with other genomic features in complex signaling or regulatory networks, since cancer is caused by alterations in pathways or complete processes. Methods Here we propose a new graph-based framework for integrating multi-omics data and genomic knowledge to improve power in predicting clinical outcomes and elucidate interplay between different levels. To highlight the validity of our proposed framework, we used an ovarian cancer dataset from The Cancer Genome Atlas for predicting stage, grade, and survival outcomes. Results Integrating multi-omics data with genomic knowledge to construct pre-defined features resulted in higher performance in clinical outcome prediction and higher stability. For the grade outcome, the model with gene expression data produced an area under the receiver operating characteristic curve (AUC) of 0.7866. However, models of the integration with pathway, Gene Ontology, chromosomal gene set, and motif gene set consistently outperformed the model with genomic data only, attaining AUCs of 0.7873, 0.8433, 0.8254, and 0.8179, respectively. Conclusions Integrating multi-omics data and genomic knowledge to improve understanding of molecular pathogenesis and underlying biology in cancer should improve diagnostic and prognostic indicators and the effectiveness of therapies. PMID:25002459
Kim, Dokyoon; Joung, Je-Gun; Sohn, Kyung-Ah; Shin, Hyunjung; Park, Yu Rang; Ritchie, Marylyn D; Kim, Ju Han
2015-01-01
Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approach that combines multi-omics data including copy number alteration, methylation, miRNA, and gene expression data for predicting clinical outcome in cancer. However, genomic features likely interact with other genomic features in complex signaling or regulatory networks, since cancer is caused by alterations in pathways or complete processes. Here we propose a new graph-based framework for integrating multi-omics data and genomic knowledge to improve power in predicting clinical outcomes and elucidate interplay between different levels. To highlight the validity of our proposed framework, we used an ovarian cancer dataset from The Cancer Genome Atlas for predicting stage, grade, and survival outcomes. Integrating multi-omics data with genomic knowledge to construct pre-defined features resulted in higher performance in clinical outcome prediction and higher stability. For the grade outcome, the model with gene expression data produced an area under the receiver operating characteristic curve (AUC) of 0.7866. However, models of the integration with pathway, Gene Ontology, chromosomal gene set, and motif gene set consistently outperformed the model with genomic data only, attaining AUCs of 0.7873, 0.8433, 0.8254, and 0.8179, respectively. Integrating multi-omics data and genomic knowledge to improve understanding of molecular pathogenesis and underlying biology in cancer should improve diagnostic and prognostic indicators and the effectiveness of therapies. © The Author 2014. Published by Oxford University Press on behalf of the American Medical Informatics Association.
Integrating Omics Technologies to Study Pulmonary Physiology and Pathology at the Systems Level
Pathak, Ravi Ramesh; Davé, Vrushank
2014-01-01
Assimilation and integration of “omics” technologies, including genomics, epigenomics, proteomics, and metabolomics has readily altered the landscape of medical research in the last decade. The vast and complex nature of omics data can only be interpreted by linking molecular information at the organismic level, forming the foundation of systems biology. Research in pulmonary biology/medicine has necessitated integration of omics, network, systems and computational biology data to differentially diagnose, interpret, and prognosticate pulmonary diseases, facilitating improvement in therapy and treatment modalities. This review describes how to leverage this emerging technology in understanding pulmonary diseases at the systems level –called a “systomic” approach. Considering the operational wholeness of cellular and organ systems, diseased genome, proteome, and the metabolome needs to be conceptualized at the systems level to understand disease pathogenesis and progression. Currently available omics technology and resources require a certain degree of training and proficiency in addition to dedicated hardware and applications, making them relatively less user friendly for the pulmonary biologist and clinicians. Herein, we discuss the various strategies, computational tools and approaches required to study pulmonary diseases at the systems level for biomedical scientists and clinical researchers. PMID:24802001
Multi-omics facilitated variable selection in Cox-regression model for cancer prognosis prediction.
Liu, Cong; Wang, Xujun; Genchev, Georgi Z; Lu, Hui
2017-07-15
New developments in high-throughput genomic technologies have enabled the measurement of diverse types of omics biomarkers in a cost-efficient and clinically-feasible manner. Developing computational methods and tools for analysis and translation of such genomic data into clinically-relevant information is an ongoing and active area of investigation. For example, several studies have utilized an unsupervised learning framework to cluster patients by integrating omics data. Despite such recent advances, predicting cancer prognosis using integrated omics biomarkers remains a challenge. There is also a shortage of computational tools for predicting cancer prognosis by using supervised learning methods. The current standard approach is to fit a Cox regression model by concatenating the different types of omics data in a linear manner, while penalty could be added for feature selection. A more powerful approach, however, would be to incorporate data by considering relationships among omics datatypes. Here we developed two methods: a SKI-Cox method and a wLASSO-Cox method to incorporate the association among different types of omics data. Both methods fit the Cox proportional hazards model and predict a risk score based on mRNA expression profiles. SKI-Cox borrows the information generated by these additional types of omics data to guide variable selection, while wLASSO-Cox incorporates this information as a penalty factor during model fitting. We show that SKI-Cox and wLASSO-Cox models select more true variables than a LASSO-Cox model in simulation studies. We assess the performance of SKI-Cox and wLASSO-Cox using TCGA glioblastoma multiforme and lung adenocarcinoma data. In each case, mRNA expression, methylation, and copy number variation data are integrated to predict the overall survival time of cancer patients. Our methods achieve better performance in predicting patients' survival in glioblastoma and lung adenocarcinoma. Copyright © 2017. Published by Elsevier Inc.
PaintOmics 3: a web resource for the pathway analysis and visualization of multi-omics data.
Hernández-de-Diego, Rafael; Tarazona, Sonia; Martínez-Mira, Carlos; Balzano-Nogueira, Leandro; Furió-Tarí, Pedro; Pappas, Georgios J; Conesa, Ana
2018-05-25
The increasing availability of multi-omic platforms poses new challenges to data analysis. Joint visualization of multi-omics data is instrumental in better understanding interconnections across molecular layers and in fully utilizing the multi-omic resources available to make biological discoveries. We present here PaintOmics 3, a web-based resource for the integrated visualization of multiple omic data types onto KEGG pathway diagrams. PaintOmics 3 combines server-end capabilities for data analysis with the potential of modern web resources for data visualization, providing researchers with a powerful framework for interactive exploration of their multi-omics information. Unlike other visualization tools, PaintOmics 3 covers a comprehensive pathway analysis workflow, including automatic feature name/identifier conversion, multi-layered feature matching, pathway enrichment, network analysis, interactive heatmaps, trend charts, and more. It accepts a wide variety of omic types, including transcriptomics, proteomics and metabolomics, as well as region-based approaches such as ATAC-seq or ChIP-seq data. The tool is freely available at www.paintomics.org.
Advances in Omics and Bioinformatics Tools for Systems Analyses of Plant Functions
Mochida, Keiichi; Shinozaki, Kazuo
2011-01-01
Omics and bioinformatics are essential to understanding the molecular systems that underlie various plant functions. Recent game-changing sequencing technologies have revitalized sequencing approaches in genomics and have produced opportunities for various emerging analytical applications. Driven by technological advances, several new omics layers such as the interactome, epigenome and hormonome have emerged. Furthermore, in several plant species, the development of omics resources has progressed to address particular biological properties of individual species. Integration of knowledge from omics-based research is an emerging issue as researchers seek to identify significance, gain biological insights and promote translational research. From these perspectives, we provide this review of the emerging aspects of plant systems research based on omics and bioinformatics analyses together with their associated resources and technological advances. PMID:22156726
Pirih, Nina; Kunej, Tanja
2018-05-01
The volume of publications and the type of research approaches used in omics system sciences are vast and continue to expand rapidly. This increased complexity and heterogeneity of omics data are challenging data extraction, sensemaking, analyses, knowledge translation, and interpretation. An extended and dynamic taxonomy for the classification and summary of omics studies are essential. We present an updated taxonomy for classification of omics research studies based on four criteria: (1) type and number of genomic loci in a research study, (2) number of species and biological samples, (3) the type of omics technology (e.g., genomics, transcriptomics, and proteomics) and omics technology application type (e.g., pharmacogenomics and nutrigenomics), and (4) phenotypes. In addition, we present a graphical summary approach that enables the researchers to define the main characteristics of their study in a single figure, and offers the readers to rapidly grasp the published study and omics data. We searched the PubMed and the Web of Science from 09/2002 to 02/2018, including research and review articles, and identified 90 scientific publications. We propose a call toward omics studies' standardization for reporting in scientific literature. We anticipate the proposed classification scheme will usefully contribute to improved classification of published reports in genomics and other omics fields, and help data extraction from publications for future multiomics data integration.
Unsupervised multiple kernel learning for heterogeneous data integration.
Mariette, Jérôme; Villa-Vialaneix, Nathalie
2018-03-15
Recent high-throughput sequencing advances have expanded the breadth of available omics datasets and the integrated analysis of multiple datasets obtained on the same samples has allowed to gain important insights in a wide range of applications. However, the integration of various sources of information remains a challenge for systems biology since produced datasets are often of heterogeneous types, with the need of developing generic methods to take their different specificities into account. We propose a multiple kernel framework that allows to integrate multiple datasets of various types into a single exploratory analysis. Several solutions are provided to learn either a consensus meta-kernel or a meta-kernel that preserves the original topology of the datasets. We applied our framework to analyse two public multi-omics datasets. First, the multiple metagenomic datasets, collected during the TARA Oceans expedition, was explored to demonstrate that our method is able to retrieve previous findings in a single kernel PCA as well as to provide a new image of the sample structures when a larger number of datasets are included in the analysis. To perform this analysis, a generic procedure is also proposed to improve the interpretability of the kernel PCA in regards with the original data. Second, the multi-omics breast cancer datasets, provided by The Cancer Genome Atlas, is analysed using a kernel Self-Organizing Maps with both single and multi-omics strategies. The comparison of these two approaches demonstrates the benefit of our integration method to improve the representation of the studied biological system. Proposed methods are available in the R package mixKernel, released on CRAN. It is fully compatible with the mixOmics package and a tutorial describing the approach can be found on mixOmics web site http://mixomics.org/mixkernel/. jerome.mariette@inra.fr or nathalie.villa-vialaneix@inra.fr. Supplementary data are available at Bioinformatics online.
Multi -omics and metabolic modelling pipelines: challenges and tools for systems microbiology.
Fondi, Marco; Liò, Pietro
2015-02-01
Integrated -omics approaches are quickly spreading across microbiology research labs, leading to (i) the possibility of detecting previously hidden features of microbial cells like multi-scale spatial organization and (ii) tracing molecular components across multiple cellular functional states. This promises to reduce the knowledge gap between genotype and phenotype and poses new challenges for computational microbiologists. We underline how the capability to unravel the complexity of microbial life will strongly depend on the integration of the huge and diverse amount of information that can be derived today from -omics experiments. In this work, we present opportunities and challenges of multi -omics data integration in current systems biology pipelines. We here discuss which layers of biological information are important for biotechnological and clinical purposes, with a special focus on bacterial metabolism and modelling procedures. A general review of the most recent computational tools for performing large-scale datasets integration is also presented, together with a possible framework to guide the design of systems biology experiments by microbiologists. Copyright © 2015. Published by Elsevier GmbH.
From Genomics to Omics Landscapes of Parkinson's Disease: Revealing the Molecular Mechanisms
Redenšek, Sara; Dolžan, Vita
2018-01-01
Abstract Molecular mechanisms of Parkinson's disease (PD) have already been investigated in various different omics landscapes. We reviewed the literature about different omics approaches between November 2005 and November 2017 to depict the main pathological pathways for PD development. In total, 107 articles exploring different layers of omics data associated with PD were retrieved. The studies were grouped into 13 omics layers: genomics–DNA level, transcriptomics, epigenomics, proteomics, ncRNomics, interactomics, metabolomics, glycomics, lipidomics, phenomics, environmental omics, pharmacogenomics, and integromics. We discussed characteristics of studies from different landscapes, such as main findings, number of participants, sample type, methodology, and outcome. We also performed curation and preliminary synthesis of multiple omics data, and identified overlapping results, which could lead toward selection of biomarkers for further validation of PD risk loci. Biomarkers could support the development of targeted prognostic/diagnostic panels as a tool for early diagnosis and prediction of progression rate and prognosis. This review presents an example of a comprehensive approach to revealing the underlying processes and risk factors of a complex disease. It urges scientists to structure the already known data and integrate it into a meaningful context. PMID:29356624
"Omics" of maize stress response for sustainable food production: opportunities and challenges.
Gong, Fangping; Yang, Le; Tai, Fuju; Hu, Xiuli; Wang, Wei
2014-12-01
Maize originated in the highlands of Mexico approximately 8700 years ago and is one of the most commonly grown cereal crops worldwide, followed by wheat and rice. Abiotic stresses (primarily drought, salinity, and high and low temperatures), together with biotic stresses (primarily fungi, viruses, and pests), negatively affect maize growth, development, and eventually production. To understand the response of maize to abiotic and biotic stresses and its mechanism of stress tolerance, high-throughput omics approaches have been used in maize stress studies. Integrated omics approaches are crucial for dissecting the temporal and spatial system-level changes that occur in maize under various stresses. In this comprehensive analysis, we review the primary types of stresses that threaten sustainable maize production; underscore the recent advances in maize stress omics, especially proteomics; and discuss the opportunities, challenges, and future directions of maize stress omics, with a view to sustainable food production. The knowledge gained from studying maize stress omics is instrumental for improving maize to cope with various stresses and to meet the food demands of the exponentially growing global population. Omics systems science offers actionable potential solutions for sustainable food production, and we present maize as a notable case study.
A practical data processing workflow for multi-OMICS projects.
Kohl, Michael; Megger, Dominik A; Trippler, Martin; Meckel, Hagen; Ahrens, Maike; Bracht, Thilo; Weber, Frank; Hoffmann, Andreas-Claudius; Baba, Hideo A; Sitek, Barbara; Schlaak, Jörg F; Meyer, Helmut E; Stephan, Christian; Eisenacher, Martin
2014-01-01
Multi-OMICS approaches aim on the integration of quantitative data obtained for different biological molecules in order to understand their interrelation and the functioning of larger systems. This paper deals with several data integration and data processing issues that frequently occur within this context. To this end, the data processing workflow within the PROFILE project is presented, a multi-OMICS project that aims on identification of novel biomarkers and the development of new therapeutic targets for seven important liver diseases. Furthermore, a software called CrossPlatformCommander is sketched, which facilitates several steps of the proposed workflow in a semi-automatic manner. Application of the software is presented for the detection of novel biomarkers, their ranking and annotation with existing knowledge using the example of corresponding Transcriptomics and Proteomics data sets obtained from patients suffering from hepatocellular carcinoma. Additionally, a linear regression analysis of Transcriptomics vs. Proteomics data is presented and its performance assessed. It was shown, that for capturing profound relations between Transcriptomics and Proteomics data, a simple linear regression analysis is not sufficient and implementation and evaluation of alternative statistical approaches are needed. Additionally, the integration of multivariate variable selection and classification approaches is intended for further development of the software. Although this paper focuses only on the combination of data obtained from quantitative Proteomics and Transcriptomics experiments, several approaches and data integration steps are also applicable for other OMICS technologies. Keeping specific restrictions in mind the suggested workflow (or at least parts of it) may be used as a template for similar projects that make use of different high throughput techniques. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.
Clinical multi-omics strategies for the effective cancer management.
Yoo, Byong Chul; Kim, Kyung-Hee; Woo, Sang Myung; Myung, Jae Kyung
2017-08-15
Cancer is a global health issue as a multi-factorial complex disease, and early detection and novel therapeutic strategies are required for more effective cancer management. With the development of systemic analytical -omics strategies, the therapeutic approach and study of the molecular mechanisms of carcinogenesis and cancer progression have moved from hypothesis-driven targeted investigations to data-driven untargeted investigations focusing on the integrated diagnosis, treatment, and prevention of cancer in individual patients. Predictive, preventive, and personalized medicine (PPPM) is a promising new approach to reduce the burden of cancer and facilitate more accurate prognosis, diagnosis, as well as effective treatment. Here we review the fundamentals of, and new developments in, -omics technologies, together with the key role of a variety of practical -omics strategies in PPPM for cancer treatment and diagnosis. In this review, a comprehensive and critical overview of the systematic strategy for predictive, preventive, and personalized medicine (PPPM) for cancer disease was described in a view of cancer prognostic prediction, diagnostics, and prevention as well as cancer therapy and drug responses. We have discussed multi-dimensional data obtained from various resources and integration of multisciplinary -omics strategies with computational method which could contribute the more effective PPPM for cancer. This review has provided the novel insights of the current applications of each and combined -omics technologies, which showed their powerful potential for the establishment of PPPM for cancer. Copyright © 2017 Elsevier B.V. All rights reserved.
Alzheimer's disease in the omics era.
Sancesario, Giulia M; Bernardini, Sergio
2018-06-18
Recent progresses in high-throughput technologies have led to a new scenario in investigating pathologies, named the "Omics era", which integrate the opportunity to collect large amounts of data and information at the molecular and protein levels together with the development of novel computational and statistical tools that are able to analyze and filter such data. Subsequently, advances in genotyping arrays, next generation sequencing, mass spectrometry technology, and bioinformatics allowed for the simultaneous large-scale study of thousands of genes (genomics), epigenetics factors (epigenomics), RNA (transcriptomics), metabolites (metabolomics) and proteins(proteomics), with the possibility of integrating multiple types of omics data ("multi -omics"). All of these technological innovations have modified the approach to the study of complex diseases, such as Alzheimer's Disease (AD), thus representing a promising tool to investigate the relationship between several molecular pathways in AD as well as other pathologies. This review focuses on the current knowledge on the pathology of AD, the recent findings from Omics sciences, and the challenge of the use of Big Data. We then focus on future perspectives for Omics sciences, such as the discovery of novel diagnostic biomarkers or drugs. Copyright © 2018. Published by Elsevier Inc.
Malla, Muneer A; Dubey, Anamika; Yadav, Shweta; Kumar, Ashwani; Hashem, Abeer; Abd Allah, Elsayed Fathi
2018-01-01
Rapid industrialization and population explosion has resulted in the generation and dumping of various contaminants into the environment. These harmful compounds deteriorate the human health as well as the surrounding environments. Current research aims to harness and enhance the natural ability of different microbes to metabolize these toxic compounds. Microbial-mediated bioremediation offers great potential to reinstate the contaminated environments in an ecologically acceptable approach. However, the lack of the knowledge regarding the factors controlling and regulating the growth, metabolism, and dynamics of diverse microbial communities in the contaminated environments often limits its execution. In recent years the importance of advanced tools such as genomics, proteomics, transcriptomics, metabolomics, and fluxomics has increased to design the strategies to treat these contaminants in ecofriendly manner. Previously researchers has largely focused on the environmental remediation using single omics-approach, however the present review specifically addresses the integrative role of the multi-omics approaches in microbial-mediated bioremediation. Additionally, we discussed how the multi-omics approaches help to comprehend and explore the structural and functional aspects of the microbial consortia in response to the different environmental pollutants and presented some success stories by using these approaches.
Malla, Muneer A.; Dubey, Anamika; Yadav, Shweta; Kumar, Ashwani; Hashem, Abeer; Abd_Allah, Elsayed Fathi
2018-01-01
Rapid industrialization and population explosion has resulted in the generation and dumping of various contaminants into the environment. These harmful compounds deteriorate the human health as well as the surrounding environments. Current research aims to harness and enhance the natural ability of different microbes to metabolize these toxic compounds. Microbial-mediated bioremediation offers great potential to reinstate the contaminated environments in an ecologically acceptable approach. However, the lack of the knowledge regarding the factors controlling and regulating the growth, metabolism, and dynamics of diverse microbial communities in the contaminated environments often limits its execution. In recent years the importance of advanced tools such as genomics, proteomics, transcriptomics, metabolomics, and fluxomics has increased to design the strategies to treat these contaminants in ecofriendly manner. Previously researchers has largely focused on the environmental remediation using single omics-approach, however the present review specifically addresses the integrative role of the multi-omics approaches in microbial-mediated bioremediation. Additionally, we discussed how the multi-omics approaches help to comprehend and explore the structural and functional aspects of the microbial consortia in response to the different environmental pollutants and presented some success stories by using these approaches. PMID:29915565
“Omics” of Maize Stress Response for Sustainable Food Production: Opportunities and Challenges
Gong, Fangping; Yang, Le; Tai, Fuju; Hu, Xiuli
2014-01-01
Abstract Maize originated in the highlands of Mexico approximately 8700 years ago and is one of the most commonly grown cereal crops worldwide, followed by wheat and rice. Abiotic stresses (primarily drought, salinity, and high and low temperatures), together with biotic stresses (primarily fungi, viruses, and pests), negatively affect maize growth, development, and eventually production. To understand the response of maize to abiotic and biotic stresses and its mechanism of stress tolerance, high-throughput omics approaches have been used in maize stress studies. Integrated omics approaches are crucial for dissecting the temporal and spatial system-level changes that occur in maize under various stresses. In this comprehensive analysis, we review the primary types of stresses that threaten sustainable maize production; underscore the recent advances in maize stress omics, especially proteomics; and discuss the opportunities, challenges, and future directions of maize stress omics, with a view to sustainable food production. The knowledge gained from studying maize stress omics is instrumental for improving maize to cope with various stresses and to meet the food demands of the exponentially growing global population. Omics systems science offers actionable potential solutions for sustainable food production, and we present maize as a notable case study. PMID:25401749
Zhang, Bo; Fu, Yingxue; Huang, Chao; Zheng, Chunli; Wu, Ziyin; Zhang, Wenjuan; Yang, Xiaoyan; Gong, Fukai; Li, Yuerong; Chen, Xiaoyu; Gao, Shuo; Chen, Xuetong; Li, Yan; Lu, Aiping; Wang, Yonghua
2016-02-25
The development of modern omics technology has not significantly improved the efficiency of drug development. Rather precise and targeted drug discovery remains unsolved. Here a large-scale cross-species molecular network association (CSMNA) approach for targeted drug screening from natural sources is presented. The algorithm integrates molecular network omics data from humans and 267 plants and microbes, establishing the biological relationships between them and extracting evolutionarily convergent chemicals. This technique allows the researcher to assess targeted drugs for specific human diseases based on specific plant or microbe pathways. In a perspective validation, connections between the plant Halliwell-Asada (HA) cycle and the human Nrf2-ARE pathway were verified and the manner by which the HA cycle molecules act on the human Nrf2-ARE pathway as antioxidants was determined. This shows the potential applicability of this approach in drug discovery. The current method integrates disparate evolutionary species into chemico-biologically coherent circuits, suggesting a new cross-species omics analysis strategy for rational drug development.
Portet, Anaïs; Pinaud, Silvain; Tetreau, Guillaume; Galinier, Richard; Cosseau, Céline; Duval, David; Grunau, Christoph; Mitta, Guillaume; Gourbal, Benjamin
2017-10-01
The fresh water snail Biomphalaria glabrata is one of the vectors of the trematode pathogen Schistosoma mansoni, which is one of the agents responsible of human schistosomiasis. In this host-parasite interaction, co-evolutionary dynamic results into an infectivity mosaic known as compatibility polymorphism. Integrative approaches including large scale molecular approaches have been conducted in recent years to improve our understanding of the mechanisms underlying compatibility. This review presents the combination of integrated Multi-Omic approaches leading to the discovery of two repertoires of polymorphic and/or diversified interacting molecules: the parasite antigens S. mansoni polymorphic mucins (SmPoMucs) and the B. glabrata immune receptors fibrinogen-related proteins (FREPs). We argue that their interactions may be major components for defining the compatible/incompatible status of a specific snail/schistosome combination. Copyright © 2017 Elsevier Ltd. All rights reserved.
Integrated Approaches to Drug Discovery for Oxidative Stress-Related Retinal Diseases.
Nishimura, Yuhei; Hara, Hideaki
2016-01-01
Excessive oxidative stress induces dysregulation of functional networks in the retina, resulting in retinal diseases such as glaucoma, age-related macular degeneration, and diabetic retinopathy. Although various therapies have been developed to reduce oxidative stress in retinal diseases, most have failed to show efficacy in clinical trials. This may be due to oversimplification of target selection for such a complex network as oxidative stress. Recent advances in high-throughput technologies have facilitated the collection of multilevel omics data, which has driven growth in public databases and in the development of bioinformatics tools. Integration of the knowledge gained from omics databases can be used to generate disease-related biological networks and to identify potential therapeutic targets within the networks. Here, we provide an overview of integrative approaches in the drug discovery process and provide simple examples of how the approaches can be exploited to identify oxidative stress-related targets for retinal diseases.
Integrated Approaches to Drug Discovery for Oxidative Stress-Related Retinal Diseases
Hara, Hideaki
2016-01-01
Excessive oxidative stress induces dysregulation of functional networks in the retina, resulting in retinal diseases such as glaucoma, age-related macular degeneration, and diabetic retinopathy. Although various therapies have been developed to reduce oxidative stress in retinal diseases, most have failed to show efficacy in clinical trials. This may be due to oversimplification of target selection for such a complex network as oxidative stress. Recent advances in high-throughput technologies have facilitated the collection of multilevel omics data, which has driven growth in public databases and in the development of bioinformatics tools. Integration of the knowledge gained from omics databases can be used to generate disease-related biological networks and to identify potential therapeutic targets within the networks. Here, we provide an overview of integrative approaches in the drug discovery process and provide simple examples of how the approaches can be exploited to identify oxidative stress-related targets for retinal diseases. PMID:28053689
Making the Most of Omics for Symbiosis Research
Chaston, J.; Douglas, A.E.
2012-01-01
Omics, including genomics, proteomics and metabolomics, enable us to explain symbioses in terms of the underlying molecules and their interactions. The central task is to transform molecular catalogs of genes, metabolites etc. into a dynamic understanding of symbiosis function. We review four exemplars of omics studies that achieve this goal, through defined biological questions relating to metabolic integration and regulation of animal-microbial symbioses, the genetic autonomy of bacterial symbionts, and symbiotic protection of animal hosts from pathogens. As omic datasets become increasingly complex, computationally-sophisticated downstream analyses are essential to reveal interactions not evident to visual inspection of the data. We discuss two approaches, phylogenomics and transcriptional clustering, that can divide the primary output of omics studies – long lists of factors – into manageable subsets, and we describe how they have been applied to analyze large datasets and generate testable hypotheses. PMID:22983030
Sehgal, Vasudha; Seviour, Elena G; Moss, Tyler J; Mills, Gordon B; Azencott, Robert; Ram, Prahlad T
2015-01-01
MicroRNAs (miRNAs) play a crucial role in the maintenance of cellular homeostasis by regulating the expression of their target genes. As such, the dysregulation of miRNA expression has been frequently linked to cancer. With rapidly accumulating molecular data linked to patient outcome, the need for identification of robust multi-omic molecular markers is critical in order to provide clinical impact. While previous bioinformatic tools have been developed to identify potential biomarkers in cancer, these methods do not allow for rapid classification of oncogenes versus tumor suppressors taking into account robust differential expression, cutoffs, p-values and non-normality of the data. Here, we propose a methodology, Robust Selection Algorithm (RSA) that addresses these important problems in big data omics analysis. The robustness of the survival analysis is ensured by identification of optimal cutoff values of omics expression, strengthened by p-value computed through intensive random resampling taking into account any non-normality in the data and integration into multi-omic functional networks. Here we have analyzed pan-cancer miRNA patient data to identify functional pathways involved in cancer progression that are associated with selected miRNA identified by RSA. Our approach demonstrates the way in which existing survival analysis techniques can be integrated with a functional network analysis framework to efficiently identify promising biomarkers and novel therapeutic candidates across diseases.
An Integrative Bioinformatics Approach for Knowledge Discovery
NASA Astrophysics Data System (ADS)
Peña-Castillo, Lourdes; Phan, Sieu; Famili, Fazel
The vast amount of data being generated by large scale omics projects and the computational approaches developed to deal with this data have the potential to accelerate the advancement of our understanding of the molecular basis of genetic diseases. This better understanding may have profound clinical implications and transform the medical practice; for instance, therapeutic management could be prescribed based on the patient’s genetic profile instead of being based on aggregate data. Current efforts have established the feasibility and utility of integrating and analysing heterogeneous genomic data to identify molecular associations to pathogenesis. However, since these initiatives are data-centric, they either restrict the research community to specific data sets or to a certain application domain, or force researchers to develop their own analysis tools. To fully exploit the potential of omics technologies, robust computational approaches need to be developed and made available to the community. This research addresses such challenge and proposes an integrative approach to facilitate knowledge discovery from diverse datasets and contribute to the advancement of genomic medicine.
Raja, Kalpana; Patrick, Matthew; Gao, Yilin; Madu, Desmond; Yang, Yuyang
2017-01-01
In the past decade, the volume of “omics” data generated by the different high-throughput technologies has expanded exponentially. The managing, storing, and analyzing of this big data have been a great challenge for the researchers, especially when moving towards the goal of generating testable data-driven hypotheses, which has been the promise of the high-throughput experimental techniques. Different bioinformatics approaches have been developed to streamline the downstream analyzes by providing independent information to interpret and provide biological inference. Text mining (also known as literature mining) is one of the commonly used approaches for automated generation of biological knowledge from the huge number of published articles. In this review paper, we discuss the recent advancement in approaches that integrate results from omics data and information generated from text mining approaches to uncover novel biomedical information. PMID:28331849
From data to knowledge: The future of multi-omics data analysis for the rhizosphere
Allen White III, Richard; Borkum, Mark I.; Rivas-Ubach, Albert; ...
2017-05-04
The rhizosphere is the interface between the root system of a plant and its surrounding soil. The microbiome of the rhizosphere, which is the totality of all microbes present there, represents a complex microbial ecosystem that nourishes the terrestrial biosphere. In order to untangle the complexity of the rhizosphere, and of the rhizospheric microbiome in particular, an integrated multi-omics approach can be applied to reveal the composition of the rhizospheric microbiome (through 16S ribosomal amplicons and metagenomics), the functional properties of the microbiome (through metatranscriptomics and metaproteomics), and the signaling network within the rhizosphere (through metametabolomics). The successful application ofmore » integrated multi-omics to rhizospheric science depends on the availability of rhizosphere-specific data and on the appropriate software used to analyze omics data from the rhizosphere. Here, we analyze the availability of software suites that are normally applied to surrogate disciplines (e.g., soil and plants) but which can be used for rhizospheric science. We also identify potential issues, challenges, and opportunities for rhizosphere science.« less
Großkinsky, Dominik K; Syaifullah, Syahnada Jaya; Roitsch, Thomas
2018-02-12
The study of senescence in plants is complicated by diverse levels of temporal and spatial dynamics as well as the impact of external biotic and abiotic factors and crop plant management. Whereas the molecular mechanisms involved in developmentally regulated leaf senescence are very well understood, in particular in the annual model plant species Arabidopsis, senescence of other organs such as the flower, fruit, and root is much less studied as well as senescence in perennials such as trees. This review addresses the need for the integration of multi-omics techniques and physiological phenotyping into holistic phenomics approaches to dissect the complex phenomenon of senescence. That became feasible through major advances in the establishment of various, complementary 'omics' technologies. Such an interdisciplinary approach will also need to consider knowledge from the animal field, in particular in relation to novel regulators such as small, non-coding RNAs, epigenetic control and telomere length. Such a characterization of phenotypes via the acquisition of high-dimensional datasets within a systems biology approach will allow us to systematically characterize the various programmes governing senescence beyond leaf senescence in Arabidopsis and to elucidate the underlying molecular processes. Such a multi-omics approach is expected to also spur the application of results from model plants to agriculture and their verification for sustainable and environmentally friendly improvement of crop plant stress resilience and productivity and contribute to improvements based on postharvest physiology for the food industry and the benefit of its customers. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Techniques for integrating ‐omics data
Akula, Siva Prasad; Miriyala, Raghava Naidu; Thota, Hanuman; Rao, Allam Appa; Gedela, Srinubabu
2009-01-01
The challenge for -omics research is to tackle the problem of fragmentation of knowledge by integrating several sources of heterogeneous information into a coherent entity. It is widely recognized that successful data integration is one of the keys to improve productivity for stored data. Through proper data integration tools and algorithms, researchers may correlate relationships that enable them to make better and faster decisions. The need for data integration is essential for present ‐omics community, because ‐omics data is currently spread world wide in wide variety of formats. These formats can be integrated and migrated across platforms through different techniques and one of the important techniques often used is XML. XML is used to provide a document markup language that is easier to learn, retrieve, store and transmit. It is semantically richer than HTML. Here, we describe bio warehousing, database federation, controlled vocabularies and highlighting the XML application to store, migrate and validate -omics data. PMID:19255651
Techniques for integrating -omics data.
Akula, Siva Prasad; Miriyala, Raghava Naidu; Thota, Hanuman; Rao, Allam Appa; Gedela, Srinubabu
2009-01-01
The challenge for -omics research is to tackle the problem of fragmentation of knowledge by integrating several sources of heterogeneous information into a coherent entity. It is widely recognized that successful data integration is one of the keys to improve productivity for stored data. Through proper data integration tools and algorithms, researchers may correlate relationships that enable them to make better and faster decisions. The need for data integration is essential for present -omics community, because -omics data is currently spread world wide in wide variety of formats. These formats can be integrated and migrated across platforms through different techniques and one of the important techniques often used is XML. XML is used to provide a document markup language that is easier to learn, retrieve, store and transmit. It is semantically richer than HTML. Here, we describe bio warehousing, database federation, controlled vocabularies and highlighting the XML application to store, migrate and validate -omics data.
Cambiaghi, Alice; Ferrario, Manuela; Masseroli, Marco
2017-05-01
Metabolomics is a rapidly growing field consisting of the analysis of a large number of metabolites at a system scale. The two major goals of metabolomics are the identification of the metabolites characterizing each organism state and the measurement of their dynamics under different situations (e.g. pathological conditions, environmental factors). Knowledge about metabolites is crucial for the understanding of most cellular phenomena, but this information alone is not sufficient to gain a comprehensive view of all the biological processes involved. Integrated approaches combining metabolomics with transcriptomics and proteomics are thus required to obtain much deeper insights than any of these techniques alone. Although this information is available, multilevel integration of different 'omics' data is still a challenge. The handling, processing, analysis and integration of these data require specialized mathematical, statistical and bioinformatics tools, and several technical problems hampering a rapid progress in the field exist. Here, we review four main tools for number of users or provided features (MetaCoreTM, MetaboAnalyst, InCroMAP and 3Omics) out of the several available for metabolomic data analysis and integration with other 'omics' data, highlighting their strong and weak aspects; a number of related issues affecting data analysis and integration are also identified and discussed. Overall, we provide an objective description of how some of the main currently available software packages work, which may help the experimental practitioner in the choice of a robust pipeline for metabolomic data analysis and integration. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
CyanOmics: an integrated database of omics for the model cyanobacterium Synechococcus sp. PCC 7002.
Yang, Yaohua; Feng, Jie; Li, Tao; Ge, Feng; Zhao, Jindong
2015-01-01
Cyanobacteria are an important group of organisms that carry out oxygenic photosynthesis and play vital roles in both the carbon and nitrogen cycles of the Earth. The annotated genome of Synechococcus sp. PCC 7002, as an ideal model cyanobacterium, is available. A series of transcriptomic and proteomic studies of Synechococcus sp. PCC 7002 cells grown under different conditions have been reported. However, no database of such integrated omics studies has been constructed. Here we present CyanOmics, a database based on the results of Synechococcus sp. PCC 7002 omics studies. CyanOmics comprises one genomic dataset, 29 transcriptomic datasets and one proteomic dataset and should prove useful for systematic and comprehensive analysis of all those data. Powerful browsing and searching tools are integrated to help users directly access information of interest with enhanced visualization of the analytical results. Furthermore, Blast is included for sequence-based similarity searching and Cluster 3.0, as well as the R hclust function is provided for cluster analyses, to increase CyanOmics's usefulness. To the best of our knowledge, it is the first integrated omics analysis database for cyanobacteria. This database should further understanding of the transcriptional patterns, and proteomic profiling of Synechococcus sp. PCC 7002 and other cyanobacteria. Additionally, the entire database framework is applicable to any sequenced prokaryotic genome and could be applied to other integrated omics analysis projects. Database URL: http://lag.ihb.ac.cn/cyanomics. © The Author(s) 2015. Published by Oxford University Press.
Childhood obesity: a systems medicine approach.
Stone, William L; Schetzina, Karen; Stuart, Charles
2016-06-01
Childhood obesity and its sequelae are a major public health problem in both the USA and globally. This review will focus on a systems medicine approach to obesity. Systems medicine is an integrative approach utilizing the vast amount of data garnered from "omics" technology and integrating these data with conventional pathophysiology as well as diverse environmental factors such as diet, exercise, community dynamics and the intestinal microbiome. Omics technology includes genomics, epigenomics, metagenomics, metabolomics and proteomics. In addition to unraveling etiology, the goals of a systems medicine approach are to provide actionable and evidenced-based clinical approaches. In the case of childhood obesity, an additional goal is characterizing measureable risk factors/biomarkers for obesity at the earliest possible age and devising age-appropriate optimal intervention strategies. It is also important to establish the age at which interventions could be critical. As discussed below, it is possible that some of the pathophysiological and epigenetic changes resulting from childhood obesity could become more irreversible the longer the obesity remains untreated.
Omics Integration in Biology and Medicine Workshop | Office of Cancer Clinical Proteomics Research
The focus of this meeting will be on the emerging field of integrating disparate omic data from genomics, proteomics, glycomics, etc. in order to better understand key biological processes and also improve clinical practice. Discussants will focus on identifying the technical and biological barriers in omic integration, with solutions to build a consensus towards data integration in bioscience and to better define phenotypes.
Data integration in the era of omics: current and future challenges
2014-01-01
To integrate heterogeneous and large omics data constitutes not only a conceptual challenge but a practical hurdle in the daily analysis of omics data. With the rise of novel omics technologies and through large-scale consortia projects, biological systems are being further investigated at an unprecedented scale generating heterogeneous and often large data sets. These data-sets encourage researchers to develop novel data integration methodologies. In this introduction we review the definition and characterize current efforts on data integration in the life sciences. We have used a web-survey to assess current research projects on data-integration to tap into the views, needs and challenges as currently perceived by parts of the research community. PMID:25032990
Improved bacteriophage genome data is necessary for integrating viral and bacterial ecology.
Bibby, Kyle
2014-02-01
The recent rise in "omics"-enabled approaches has lead to improved understanding in many areas of microbial ecology. However, despite the importance that viruses play in a broad microbial ecology context, viral ecology remains largely not integrated into high-throughput microbial ecology studies. A fundamental hindrance to the integration of viral ecology into omics-enabled microbial ecology studies is the lack of suitable reference bacteriophage genomes in reference databases-currently, only 0.001% of bacteriophage diversity is represented in genome sequence databases. This commentary serves to highlight this issue and to promote bacteriophage genome sequencing as a valuable scientific undertaking to both better understand bacteriophage diversity and move towards a more holistic view of microbial ecology.
Wang, Peng; Chen, Zhen
2013-09-01
Traditional Chinese medicine (TCM) is a comprehensive system of medical practice that has been used to diagnose, treat, and prevent illnesses for more than 3000 years. ZHENG (also known as "syndrome") differentiation remains the essence of TCM. In China, TCM shares equal status, and integrated with Western medicine in the healthcare system to treat many types of diseases. Yet, compared to biomolecular science and Western medicine, the ZHENG/TCM approach to diagnostics might appear unobjective, but offers at the same time long-standing clinical and phenotypic-rich insights. With the current globalization of life sciences and the arrival of "Big Data" research and development, these two silos of medical lore are rapidly coalescing. The applications of multi-omics strategies to TCM have begun to provide novel insights into the essence and molecular basis of TCM ZHENG. We searched the Chinese electronic databases and PubMed for published articles related to "Omics" and "TCM ZHENG" and observed a dramatic increase in studies over the past few years. In this article, we provide a timely synthesis of the lessons learned, and the emerging applications of omics science in TCM ZHENG research. We suggest that the global health scholarship and the field of "developing world Omics" can usefully draw from TCM, and vice versa.
Fluxomics - connecting 'omics analysis and phenotypes.
Winter, Gal; Krömer, Jens O
2013-07-01
In our modern 'omics era, metabolic flux analysis (fluxomics) represents the physiological counterpart of its siblings transcriptomics, proteomics and metabolomics. Fluxomics integrates in vivo measurements of metabolic fluxes with stoichiometric network models to allow the determination of absolute flux through large networks of the central carbon metabolism. There are many approaches to implement fluxomics including flux balance analysis (FBA), (13) C fluxomics and (13) C-constrained FBA as well as many experimental settings for flux measurement including dynamic, stationary and semi-stationary. Here we outline the principles of the different approaches and their relative advantages. We demonstrate the unique contribution of flux analysis for phenotype elucidation using a thoroughly studied metabolic reaction as a case study, the microbial aerobic/anaerobic shift, highlighting the importance of flux analysis as a single layer of data as well as interlaced in multi-omics studies. © 2012 John Wiley & Sons Ltd and Society for Applied Microbiology.
EU Framework 6 Project: Predictive Toxicology (PredTox)-overview and outcome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Suter, Laura, E-mail: Laura.suter-dick@roche.com; Schroeder, Susanne; Meyer, Kirstin
2011-04-15
In this publication, we report the outcome of the integrated EU Framework 6 Project: Predictive Toxicology (PredTox), including methodological aspects and overall conclusions. Specific details including data analysis and interpretation are reported in separate articles in this issue. The project, partly funded by the EU, was carried out by a consortium of 15 pharmaceutical companies, 2 SMEs, and 3 universities. The effects of 16 test compounds were characterized using conventional toxicological parameters and 'omics' technologies. The three major observed toxicities, liver hypertrophy, bile duct necrosis and/or cholestasis, and kidney proximal tubular damage were analyzed in detail. The combined approach ofmore » 'omics' and conventional toxicology proved a useful tool for mechanistic investigations and the identification of putative biomarkers. In our hands and in combination with histopathological assessment, target organ transcriptomics was the most prolific approach for the generation of mechanistic hypotheses. Proteomics approaches were relatively time-consuming and required careful standardization. NMR-based metabolomics detected metabolite changes accompanying histopathological findings, providing limited additional mechanistic information. Conversely, targeted metabolite profiling with LC/GC-MS was very useful for the investigation of bile duct necrosis/cholestasis. In general, both proteomics and metabolomics were supportive of other findings. Thus, the outcome of this program indicates that 'omics' technologies can help toxicologists to make better informed decisions during exploratory toxicological studies. The data support that hypothesis on mode of action and discovery of putative biomarkers are tangible outcomes of integrated 'omics' analysis. Qualification of biomarkers remains challenging, in particular in terms of identification, mechanistic anchoring, appropriate specificity, and sensitivity.« less
Tebani, Abdellah; Afonso, Carlos; Marret, Stéphane; Bekri, Soumeya
2016-01-01
The rise of technologies that simultaneously measure thousands of data points represents the heart of systems biology. These technologies have had a huge impact on the discovery of next-generation diagnostics, biomarkers, and drugs in the precision medicine era. Systems biology aims to achieve systemic exploration of complex interactions in biological systems. Driven by high-throughput omics technologies and the computational surge, it enables multi-scale and insightful overviews of cells, organisms, and populations. Precision medicine capitalizes on these conceptual and technological advancements and stands on two main pillars: data generation and data modeling. High-throughput omics technologies allow the retrieval of comprehensive and holistic biological information, whereas computational capabilities enable high-dimensional data modeling and, therefore, accessible and user-friendly visualization. Furthermore, bioinformatics has enabled comprehensive multi-omics and clinical data integration for insightful interpretation. Despite their promise, the translation of these technologies into clinically actionable tools has been slow. In this review, we present state-of-the-art multi-omics data analysis strategies in a clinical context. The challenges of omics-based biomarker translation are discussed. Perspectives regarding the use of multi-omics approaches for inborn errors of metabolism (IEM) are presented by introducing a new paradigm shift in addressing IEM investigations in the post-genomic era. PMID:27649151
Tebani, Abdellah; Afonso, Carlos; Marret, Stéphane; Bekri, Soumeya
2016-09-14
The rise of technologies that simultaneously measure thousands of data points represents the heart of systems biology. These technologies have had a huge impact on the discovery of next-generation diagnostics, biomarkers, and drugs in the precision medicine era. Systems biology aims to achieve systemic exploration of complex interactions in biological systems. Driven by high-throughput omics technologies and the computational surge, it enables multi-scale and insightful overviews of cells, organisms, and populations. Precision medicine capitalizes on these conceptual and technological advancements and stands on two main pillars: data generation and data modeling. High-throughput omics technologies allow the retrieval of comprehensive and holistic biological information, whereas computational capabilities enable high-dimensional data modeling and, therefore, accessible and user-friendly visualization. Furthermore, bioinformatics has enabled comprehensive multi-omics and clinical data integration for insightful interpretation. Despite their promise, the translation of these technologies into clinically actionable tools has been slow. In this review, we present state-of-the-art multi-omics data analysis strategies in a clinical context. The challenges of omics-based biomarker translation are discussed. Perspectives regarding the use of multi-omics approaches for inborn errors of metabolism (IEM) are presented by introducing a new paradigm shift in addressing IEM investigations in the post-genomic era.
Wolters, Jarno E J; van Breda, Simone G J; Grossmann, Jonas; Fortes, Claudia; Caiment, Florian; Kleinjans, Jos C S
2018-06-01
We performed a multiple 'omics study by integrating data on epigenomic, transcriptomic, and proteomic perturbations associated with mitochondrial dysfunction in primary human hepatocytes caused by the liver toxicant valproic acid (VPA), to deeper understand downstream events following epigenetic alterations in the mitochondrial genome. Furthermore, we investigated persistence of cross-omics changes after terminating drug treatment. Upon transient methylation changes of mitochondrial genes during VPA-treatment, increasing complexities of gene-interaction networks across time were demonstrated, which normalized during washout. Furthermore, co-expression between genes and their corresponding proteins increased across time. Additionally, in relation to persistently decreased ATP production, we observed decreased expression of mitochondrial complex I and III-V genes. Persistent transcripts and proteins were related to citric acid cycle and β-oxidation. In particular, we identified a potential novel mitochondrial-nuclear signaling axis, MT-CO2-FN1-MYC-CPT1. In summary, this cross-omics study revealed dynamic responses of the mitochondrial epigenome to an impulse toxicant challenge resulting in persistent mitochondrial dysfunctioning. Moreover, this approach allowed for discriminating between the toxic effect of VPA and adaptation. Copyright © 2018 Elsevier B.V. All rights reserved.
Karathanasis, Nestoras; Tsamardinos, Ioannis
2016-01-01
Background The advance of omics technologies has made possible to measure several data modalities on a system of interest. In this work, we illustrate how the Non-Parametric Combination methodology, namely NPC, can be used for simultaneously assessing the association of different molecular quantities with an outcome of interest. We argue that NPC methods have several potential applications in integrating heterogeneous omics technologies, as for example identifying genes whose methylation and transcriptional levels are jointly deregulated, or finding proteins whose abundance shows the same trends of the expression of their encoding genes. Results We implemented the NPC methodology within “omicsNPC”, an R function specifically tailored for the characteristics of omics data. We compare omicsNPC against a range of alternative methods on simulated as well as on real data. Comparisons on simulated data point out that omicsNPC produces unbiased / calibrated p-values and performs equally or significantly better than the other methods included in the study; furthermore, the analysis of real data show that omicsNPC (a) exhibits higher statistical power than other methods, (b) it is easily applicable in a number of different scenarios, and (c) its results have improved biological interpretability. Conclusions The omicsNPC function competitively behaves in all comparisons conducted in this study. Taking into account that the method (i) requires minimal assumptions, (ii) it can be used on different studies designs and (iii) it captures the dependences among heterogeneous data modalities, omicsNPC provides a flexible and statistically powerful solution for the integrative analysis of different omics data. PMID:27812137
Integrated data analysis for genome-wide research.
Steinfath, Matthias; Repsilber, Dirk; Scholz, Matthias; Walther, Dirk; Selbig, Joachim
2007-01-01
Integrated data analysis is introduced as the intermediate level of a systems biology approach to analyse different 'omics' datasets, i.e., genome-wide measurements of transcripts, protein levels or protein-protein interactions, and metabolite levels aiming at generating a coherent understanding of biological function. In this chapter we focus on different methods of correlation analyses ranging from simple pairwise correlation to kernel canonical correlation which were recently applied in molecular biology. Several examples are presented to illustrate their application. The input data for this analysis frequently originate from different experimental platforms. Therefore, preprocessing steps such as data normalisation and missing value estimation are inherent to this approach. The corresponding procedures, potential pitfalls and biases, and available software solutions are reviewed. The multiplicity of observations obtained in omics-profiling experiments necessitates the application of multiple testing correction techniques.
Integrative Exploratory Analysis of Two or More Genomic Datasets.
Meng, Chen; Culhane, Aedin
2016-01-01
Exploratory analysis is an essential step in the analysis of high throughput data. Multivariate approaches such as correspondence analysis (CA), principal component analysis, and multidimensional scaling are widely used in the exploratory analysis of single dataset. Modern biological studies often assay multiple types of biological molecules (e.g., mRNA, protein, phosphoproteins) on a same set of biological samples, thereby creating multiple different types of omics data or multiassay data. Integrative exploratory analysis of these multiple omics data is required to leverage the potential of multiple omics studies. In this chapter, we describe the application of co-inertia analysis (CIA; for analyzing two datasets) and multiple co-inertia analysis (MCIA; for three or more datasets) to address this problem. These methods are powerful yet simple multivariate approaches that represent samples using a lower number of variables, allowing a more easily identification of the correlated structure in and between multiple high dimensional datasets. Graphical representations can be employed to this purpose. In addition, the methods simultaneously project samples and variables (genes, proteins) onto the same lower dimensional space, so the most variant variables from each dataset can be selected and associated with samples, which can be further used to facilitate biological interpretation and pathway analysis. We applied CIA to explore the concordance between mRNA and protein expression in a panel of 60 tumor cell lines from the National Cancer Institute. In the same 60 cell lines, we used MCIA to perform a cross-platform comparison of mRNA gene expression profiles obtained on four different microarray platforms. Last, as an example of integrative analysis of multiassay or multi-omics data we analyzed transcriptomic, proteomic, and phosphoproteomic data from pluripotent (iPS) and embryonic stem (ES) cell lines.
The MPLEx Protocol for Multi-omic Analyses of Soil Samples
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nicora, Carrie D.; Burnum-Johnson, Kristin E.; Nakayasu, Ernesto S.
Mass spectrometry (MS)-based integrated metaproteomic, metabolomic and lipidomic (multi-omic) studies are transforming our ability to understand and characterize microbial communities in environmental and biological systems. These measurements are even enabling enhanced analyses of complex soil microbial communities, which are the most complex microbial systems known to date. Multi-omic analyses, however, do have sample preparation challenges since separate extractions are typically needed for each omic study, thereby greatly amplifying the preparation time and amount of sample required. To address this limitation, a 3-in-1 method for simultaneous metabolite, protein, and lipid extraction (MPLEx) from the exact same soil sample was created bymore » adapting a solvent-based approach. This MPLEx protocol has proven to be simple yet robust for many sample types and even when utilized for limited quantities of complex soil samples. The MPLEx method also greatly enabled the rapid multi-omic measurements needed to gain a better understanding of the members of each microbial community, while evaluating the changes taking place upon biological and environmental perturbations.« less
Next generation sequencing and its applications in HPVassociated cancers
Tuna, Musaffe; Amos, Christopher I.
2017-01-01
Approximately 18% of all human cancers have a viral etiology, and human papillomavirus (HPV) has been identified as one of the most prevalent viruses that plays causative role in nearly all cervical cancers and, in addition, in subset of head and neck, anal, penile and vulvar cancers. The recent introduction of next generation sequencing (NGS) and other omics approaches have resulted in comprehensive knowledge on the pathogenesis of HPV-driven tumors. Specifically, these approaches have provided detailed information on genomic HPV integration sites, disrupted genes and pathways, and common and distinct genetic and epigenetic alterations in different human HPV-associated cancers. This review focuses on HPV integration sites, its concomitantly disrupted genes and pathways and its functional consequences in both cervical and head and neck cancers. Integration of NGS data with other omics and clinical data is crucial to better understand the pathophysiology of each individual malignancy and, based on this, to select targets and to design effective personalized treatment options. PMID:27784002
Next generation sequencing and its applications in HPV-associated cancers.
Tuna, Musaffe; Amos, Christopher I
2017-01-31
Approximately 18% of all human cancers have a viral etiology, and human papillomavirus (HPV) has been identified as one of the most prevalent viruses that plays causative role in nearly all cervical cancers and, in addition, in subset of head and neck, anal, penile and vulvar cancers. The recent introduction of next generation sequencing (NGS) and other 'omics' approaches have resulted in comprehensive knowledge on the pathogenesis of HPV-driven tumors. Specifically, these approaches have provided detailed information on genomic HPV integration sites, disrupted genes and pathways, and common and distinct genetic and epigenetic alterations in different human HPV-associated cancers. This review focuses on HPV integration sites, its concomitantly disrupted genes and pathways and its functional consequences in both cervical and head and neck cancers. Integration of NGS data with other 'omics' and clinical data is crucial to better understand the pathophysiology of each individual malignancy and, based on this, to select targets and to design effective personalized treatment options.
Montague, Elizabeth; Stanberry, Larissa; Higdon, Roger; Janko, Imre; Lee, Elaine; Anderson, Nathaniel; Choiniere, John; Stewart, Elizabeth; Yandl, Gregory; Broomall, William; Kolker, Natali
2014-01-01
Abstract Multi-omics data-driven scientific discovery crucially rests on high-throughput technologies and data sharing. Currently, data are scattered across single omics repositories, stored in varying raw and processed formats, and are often accompanied by limited or no metadata. The Multi-Omics Profiling Expression Database (MOPED, http://moped.proteinspire.org) version 2.5 is a freely accessible multi-omics expression database. Continual improvement and expansion of MOPED is driven by feedback from the Life Sciences Community. In order to meet the emergent need for an integrated multi-omics data resource, MOPED 2.5 now includes gene relative expression data in addition to protein absolute and relative expression data from over 250 large-scale experiments. To facilitate accurate integration of experiments and increase reproducibility, MOPED provides extensive metadata through the Data-Enabled Life Sciences Alliance (DELSA Global, http://delsaglobal.org) metadata checklist. MOPED 2.5 has greatly increased the number of proteomics absolute and relative expression records to over 500,000, in addition to adding more than four million transcriptomics relative expression records. MOPED has an intuitive user interface with tabs for querying different types of omics expression data and new tools for data visualization. Summary information including expression data, pathway mappings, and direct connection between proteins and genes can be viewed on Protein and Gene Details pages. These connections in MOPED provide a context for multi-omics expression data exploration. Researchers are encouraged to submit omics data which will be consistently processed into expression summaries. MOPED as a multi-omics data resource is a pivotal public database, interdisciplinary knowledge resource, and platform for multi-omics understanding. PMID:24910945
Integration of multi-omics data for integrative gene regulatory network inference.
Zarayeneh, Neda; Ko, Euiseong; Oh, Jung Hun; Suh, Sang; Liu, Chunyu; Gao, Jean; Kim, Donghyun; Kang, Mingon
2017-01-01
Gene regulatory networks provide comprehensive insights and indepth understanding of complex biological processes. The molecular interactions of gene regulatory networks are inferred from a single type of genomic data, e.g., gene expression data in most research. However, gene expression is a product of sequential interactions of multiple biological processes, such as DNA sequence variations, copy number variations, histone modifications, transcription factors, and DNA methylations. The recent rapid advances of high-throughput omics technologies enable one to measure multiple types of omics data, called 'multi-omics data', that represent the various biological processes. In this paper, we propose an Integrative Gene Regulatory Network inference method (iGRN) that incorporates multi-omics data and their interactions in gene regulatory networks. In addition to gene expressions, copy number variations and DNA methylations were considered for multi-omics data in this paper. The intensive experiments were carried out with simulation data, where iGRN's capability that infers the integrative gene regulatory network is assessed. Through the experiments, iGRN shows its better performance on model representation and interpretation than other integrative methods in gene regulatory network inference. iGRN was also applied to a human brain dataset of psychiatric disorders, and the biological network of psychiatric disorders was analysed.
Integration of multi-omics data for integrative gene regulatory network inference
Zarayeneh, Neda; Ko, Euiseong; Oh, Jung Hun; Suh, Sang; Liu, Chunyu; Gao, Jean; Kim, Donghyun
2017-01-01
Gene regulatory networks provide comprehensive insights and indepth understanding of complex biological processes. The molecular interactions of gene regulatory networks are inferred from a single type of genomic data, e.g., gene expression data in most research. However, gene expression is a product of sequential interactions of multiple biological processes, such as DNA sequence variations, copy number variations, histone modifications, transcription factors, and DNA methylations. The recent rapid advances of high-throughput omics technologies enable one to measure multiple types of omics data, called ‘multi-omics data’, that represent the various biological processes. In this paper, we propose an Integrative Gene Regulatory Network inference method (iGRN) that incorporates multi-omics data and their interactions in gene regulatory networks. In addition to gene expressions, copy number variations and DNA methylations were considered for multi-omics data in this paper. The intensive experiments were carried out with simulation data, where iGRN’s capability that infers the integrative gene regulatory network is assessed. Through the experiments, iGRN shows its better performance on model representation and interpretation than other integrative methods in gene regulatory network inference. iGRN was also applied to a human brain dataset of psychiatric disorders, and the biological network of psychiatric disorders was analysed. PMID:29354189
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakayasu, Ernesto S.; Nicora, Carrie D.; Sims, Amy C.
2016-05-03
ABSTRACT Integrative multi-omics analyses can empower more effective investigation and complete understanding of complex biological systems. Despite recent advances in a range of omics analyses, multi-omic measurements of the same sample are still challenging and current methods have not been well evaluated in terms of reproducibility and broad applicability. Here we adapted a solvent-based method, widely applied for extracting lipids and metabolites, to add proteomics to mass spectrometry-based multi-omics measurements. Themetabolite,protein, andlipidextraction (MPLEx) protocol proved to be robust and applicable to a diverse set of sample types, including cell cultures, microbial communities, and tissues. To illustrate the utility of thismore » protocol, an integrative multi-omics analysis was performed using a lung epithelial cell line infected with Middle East respiratory syndrome coronavirus, which showed the impact of this virus on the host glycolytic pathway and also suggested a role for lipids during infection. The MPLEx method is a simple, fast, and robust protocol that can be applied for integrative multi-omic measurements from diverse sample types (e.g., environmental,in vitro, and clinical). IMPORTANCEIn systems biology studies, the integration of multiple omics measurements (i.e., genomics, transcriptomics, proteomics, metabolomics, and lipidomics) has been shown to provide a more complete and informative view of biological pathways. Thus, the prospect of extracting different types of molecules (e.g., DNAs, RNAs, proteins, and metabolites) and performing multiple omics measurements on single samples is very attractive, but such studies are challenging due to the fact that the extraction conditions differ according to the molecule type. Here, we adapted an organic solvent-based extraction method that demonstrated broad applicability and robustness, which enabled comprehensive proteomics, metabolomics, and lipidomics analyses from the same sample.« less
Omics Workshop Videocast Available | Office of Cancer Clinical Proteomics Research
The Omics Integration in Biology and Medicine Workshop, held on June 19th and 20th is now available for viewing on NIH Videocast: Day 1 and Day 2. The workshop focused on the emerging field of integrating disparate omic data from genomics, proteomics, glycomics, etc. in order to better understand key biological processes and also improve clinical practice.
[Proteomics and personalized medicine].
Rocchiccioli, Silvia; Tedeschi, Lorena; Citti, Lorenzo; Cecchettini, Antonella
2013-05-01
With the disclosure of the human genome a new era for bio-medicine has arisen, characterized by the challenge to investigate pathogenic mechanisms, studying simultaneously metabolites, DNA, RNA, and proteins. As a result, the "omics" revolution boomed, giving birth to a new medicine named "omics-based medicine". Among the other "omics", proteomics has been widely used in medicine, since it can produce a more "holistic" overview of a disease and provide a "constellation" of possible specific markers, a molecular fingerprinting that defines the clinical condition of an individual. Endpoint of this comprehensive and detailed analysis is the "diagnostic-omics", i.e. the achievement of personalized diagnoses with obvious benefits for prevention and therapy and this goal can be reached only with a perfect integration between clinicians and proteomists. To impact on the possible key factors involved in the pathological processes, oligonucleotide-based knock-down strategies can be helpful. They exploit omics-derived molecular tools (antisense, siRNA, ribozymes, decoys, and aptamers) that can be used to inhibit, at transcriptional or post-transcriptional levels, the events leading to protein synthesis, thus decreasing its expression. The identification of the pivotal mechanisms involved in diseases using global, "scenic" approaches such as the "omics" ones, and the subsequent validation and detailed description of the processes by specific molecular tools, can result in a more preventive, predictive and personalized medicine.
Analysis Commons, A Team Approach to Discovery in a Big-Data Environment for Genetic Epidemiology
Brody, Jennifer A.; Morrison, Alanna C.; Bis, Joshua C.; O'Connell, Jeffrey R.; Brown, Michael R.; Huffman, Jennifer E.; Ames, Darren C.; Carroll, Andrew; Conomos, Matthew P.; Gabriel, Stacey; Gibbs, Richard A.; Gogarten, Stephanie M.; Gupta, Namrata; Jaquish, Cashell E.; Johnson, Andrew D.; Lewis, Joshua P.; Liu, Xiaoming; Manning, Alisa K.; Papanicolaou, George J.; Pitsillides, Achilleas N.; Rice, Kenneth M.; Salerno, William; Sitlani, Colleen M.; Smith, Nicholas L.; Heckbert, Susan R.; Laurie, Cathy C.; Mitchell, Braxton D.; Vasan, Ramachandran S.; Rich, Stephen S.; Rotter, Jerome I.; Wilson, James G.; Boerwinkle, Eric; Psaty, Bruce M.; Cupples, L. Adrienne
2017-01-01
Summary paragraph The exploding volume of whole-genome sequence (WGS) and multi-omics data requires new approaches for analysis. As one solution, we have created a cloud-based Analysis Commons, which brings together genotype and phenotype data from multiple studies in a setting that is accessible by multiple investigators. This framework addresses many of the challenges of multi-center WGS analyses, including data sharing mechanisms, phenotype harmonization, integrated multi-omics analyses, annotation, and computational flexibility. In this setting, the computational pipeline facilitates a sequence-to-discovery analysis workflow illustrated here by an analysis of plasma fibrinogen levels in 3996 individuals from the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) WGS program. The Analysis Commons represents a novel model for transforming WGS resources from a massive quantity of phenotypic and genomic data into knowledge of the determinants of health and disease risk in diverse human populations. PMID:29074945
Huo, Zhiguang; Tseng, George
2017-01-01
Cancer subtypes discovery is the first step to deliver personalized medicine to cancer patients. With the accumulation of massive multi-level omics datasets and established biological knowledge databases, omics data integration with incorporation of rich existing biological knowledge is essential for deciphering a biological mechanism behind the complex diseases. In this manuscript, we propose an integrative sparse K-means (is-K means) approach to discover disease subtypes with the guidance of prior biological knowledge via sparse overlapping group lasso. An algorithm using an alternating direction method of multiplier (ADMM) will be applied for fast optimization. Simulation and three real applications in breast cancer and leukemia will be used to compare is-K means with existing methods and demonstrate its superior clustering accuracy, feature selection, functional annotation of detected molecular features and computing efficiency. PMID:28959370
Huo, Zhiguang; Tseng, George
2017-06-01
Cancer subtypes discovery is the first step to deliver personalized medicine to cancer patients. With the accumulation of massive multi-level omics datasets and established biological knowledge databases, omics data integration with incorporation of rich existing biological knowledge is essential for deciphering a biological mechanism behind the complex diseases. In this manuscript, we propose an integrative sparse K -means (is- K means) approach to discover disease subtypes with the guidance of prior biological knowledge via sparse overlapping group lasso. An algorithm using an alternating direction method of multiplier (ADMM) will be applied for fast optimization. Simulation and three real applications in breast cancer and leukemia will be used to compare is- K means with existing methods and demonstrate its superior clustering accuracy, feature selection, functional annotation of detected molecular features and computing efficiency.
Metabolic engineering with plants for a sustainable biobased economy.
Yoon, Jong Moon; Zhao, Le; Shanks, Jacqueline V
2013-01-01
Plants are bona fide sustainable organisms because they accumulate carbon and synthesize beneficial metabolites from photosynthesis. To meet the challenges to food security and health threatened by increasing population growth and depletion of nonrenewable natural resources, recent metabolic engineering efforts have shifted from single pathways to holistic approaches with multiple genes owing to integration of omics technologies. Successful engineering of plants results in the high yield of biomass components for primary food sources and biofuel feedstocks, pharmaceuticals, and platform chemicals through synthetic biology and systems biology strategies. Further discovery of undefined biosynthesis pathways in plants, integrative analysis of discrete omics data, and diversified process developments for production of platform chemicals are essential to overcome the hurdles for sustainable production of value-added biomolecules from plants.
Research from the NASA Twins Study and Omics in Support of Mars Missions
NASA Technical Reports Server (NTRS)
Kundrot, C.; Shelhamer, M.; Scott, G.
2015-01-01
The NASA Twins Study, NASA's first foray into integrated omic studies in humans, illustrates how an integrated omics approach can be brought to bear on the challenges to human health and performance on a Mars mission. The NASA Twins Study involves US Astronaut Scott Kelly and his identical twin brother, Mark Kelly, a retired US Astronaut. No other opportunity to study a twin pair for a prolonged period with one subject in space and one on the ground is available for the foreseeable future. A team of 10 principal investigators are conducting the Twins Study, examining a very broad range of biological functions including the genome, epigenome, transcriptome, proteome, metabolome, gut microbiome, immunological response to vaccinations, indicators of atherosclerosis, physiological fluid shifts, and cognition. A novel aspect of the study is the integrated study of molecular, physiological, cognitive, and microbiological properties. Major sample and data collection from both subjects for this study began approximately six months before Scott Kelly's one year mission on the ISS, continue while Scott Kelly is in flight and will conclude approximately six months after his return to Earth. Mark Kelly will remain on Earth during this study, in a lifestyle unconstrained by this study, thereby providing a measure of normal variation in the properties being studied. An overview of initial results and the future plans will be described as well as the technological and ethical issues raised for spaceflight studies involving omics.
Identifying novel glioma associated pathways based on systems biology level meta-analysis.
Hu, Yangfan; Li, Jinquan; Yan, Wenying; Chen, Jiajia; Li, Yin; Hu, Guang; Shen, Bairong
2013-01-01
With recent advances in microarray technology, including genomics, proteomics, and metabolomics, it brings a great challenge for integrating this "-omics" data to analysis complex disease. Glioma is an extremely aggressive and lethal form of brain tumor, and thus the study of the molecule mechanism underlying glioma remains very important. To date, most studies focus on detecting the differentially expressed genes in glioma. However, the meta-analysis for pathway analysis based on multiple microarray datasets has not been systematically pursued. In this study, we therefore developed a systems biology based approach by integrating three types of omics data to identify common pathways in glioma. Firstly, the meta-analysis has been performed to study the overlapping of signatures at different levels based on the microarray gene expression data of glioma. Among these gene expression datasets, 12 pathways were found in GeneGO database that shared by four stages. Then, microRNA expression profiles and ChIP-seq data were integrated for the further pathway enrichment analysis. As a result, we suggest 5 of these pathways could be served as putative pathways in glioma. Among them, the pathway of TGF-beta-dependent induction of EMT via SMAD is of particular importance. Our results demonstrate that the meta-analysis based on systems biology level provide a more useful approach to study the molecule mechanism of complex disease. The integration of different types of omics data, including gene expression microarrays, microRNA and ChIP-seq data, suggest some common pathways correlated with glioma. These findings will offer useful potential candidates for targeted therapeutic intervention of glioma.
CircadiOmics: circadian omic web portal.
Ceglia, Nicholas; Liu, Yu; Chen, Siwei; Agostinelli, Forest; Eckel-Mahan, Kristin; Sassone-Corsi, Paolo; Baldi, Pierre
2018-06-15
Circadian rhythms play a fundamental role at all levels of biological organization. Understanding the mechanisms and implications of circadian oscillations continues to be the focus of intense research. However, there has been no comprehensive and integrated way for accessing and mining all circadian omic datasets. The latest release of CircadiOmics (http://circadiomics.ics.uci.edu) fills this gap for providing the most comprehensive web server for studying circadian data. The newly updated version contains high-throughput 227 omic datasets corresponding to over 74 million measurements sampled over 24 h cycles. Users can visualize and compare oscillatory trajectories across species, tissues and conditions. Periodicity statistics (e.g. period, amplitude, phase, P-value, q-value etc.) obtained from BIO_CYCLE and other methods are provided for all samples in the repository and can easily be downloaded in the form of publication-ready figures and tables. New features and substantial improvements in performance and data volume make CircadiOmics a powerful web portal for integrated analysis of circadian omic data.
Vitali, Francesca; Li, Qike; Schissler, A Grant; Berghout, Joanne; Kenost, Colleen; Lussier, Yves A
2017-12-18
The development of computational methods capable of analyzing -omics data at the individual level is critical for the success of precision medicine. Although unprecedented opportunities now exist to gather data on an individual's -omics profile ('personalome'), interpreting and extracting meaningful information from single-subject -omics remain underdeveloped, particularly for quantitative non-sequence measurements, including complete transcriptome or proteome expression and metabolite abundance. Conventional bioinformatics approaches have largely been designed for making population-level inferences about 'average' disease processes; thus, they may not adequately capture and describe individual variability. Novel approaches intended to exploit a variety of -omics data are required for identifying individualized signals for meaningful interpretation. In this review-intended for biomedical researchers, computational biologists and bioinformaticians-we survey emerging computational and translational informatics methods capable of constructing a single subject's 'personalome' for predicting clinical outcomes or therapeutic responses, with an emphasis on methods that provide interpretable readouts. (i) the single-subject analytics of the transcriptome shows the greatest development to date and, (ii) the methods were all validated in simulations, cross-validations or independent retrospective data sets. This survey uncovers a growing field that offers numerous opportunities for the development of novel validation methods and opens the door for future studies focusing on the interpretation of comprehensive 'personalomes' through the integration of multiple -omics, providing valuable insights into individual patient outcomes and treatments. © The Author 2017. Published by Oxford University Press.
MONGKIE: an integrated tool for network analysis and visualization for multi-omics data.
Jang, Yeongjun; Yu, Namhee; Seo, Jihae; Kim, Sun; Lee, Sanghyuk
2016-03-18
Network-based integrative analysis is a powerful technique for extracting biological insights from multilayered omics data such as somatic mutations, copy number variations, and gene expression data. However, integrated analysis of multi-omics data is quite complicated and can hardly be done in an automated way. Thus, a powerful interactive visual mining tool supporting diverse analysis algorithms for identification of driver genes and regulatory modules is much needed. Here, we present a software platform that integrates network visualization with omics data analysis tools seamlessly. The visualization unit supports various options for displaying multi-omics data as well as unique network models for describing sophisticated biological networks such as complex biomolecular reactions. In addition, we implemented diverse in-house algorithms for network analysis including network clustering and over-representation analysis. Novel functions include facile definition and optimized visualization of subgroups, comparison of a series of data sets in an identical network by data-to-visual mapping and subsequent overlaying function, and management of custom interaction networks. Utility of MONGKIE for network-based visual data mining of multi-omics data was demonstrated by analysis of the TCGA glioblastoma data. MONGKIE was developed in Java based on the NetBeans plugin architecture, thus being OS-independent with intrinsic support of module extension by third-party developers. We believe that MONGKIE would be a valuable addition to network analysis software by supporting many unique features and visualization options, especially for analysing multi-omics data sets in cancer and other diseases. .
Guirro, Maria; Costa, Andrea; Gual-Grau, Andreu; Mayneris-Perxachs, Jordi; Torrell, Helena; Herrero, Pol; Canela, Núria; Arola, Lluís
2018-02-10
Over the last few years, the application of high-throughput meta-omics methods has provided great progress in improving the knowledge of the gut ecosystem and linking its biodiversity to host health conditions, offering complementary support to classical microbiology. Gut microbiota plays a crucial role in relevant diseases such as obesity or cardiovascular disease (CVD), and its regulation is closely influenced by several factors, such as dietary composition. In fact, polyphenol-rich diets are the most palatable treatment to prevent hypertension associated with CVD, although the polyphenol-microbiota interactions have not been completely elucidated. For this reason, the aim of this study was to evaluate microbiota effect in obese rats supplemented by hesperidin, after being fed with cafeteria or standard diet, using a multi meta-omics approaches combining strategy of metagenomics and metaproteomics analysis. We reported that cafeteria diet induces obesity, resulting in changes in the microbiota composition, which are related to functional alterations at proteome level. In addition, hesperidin supplementation alters microbiota diversity and also proteins involved in important metabolic pathways. Overall, going deeper into strategies to integrate omics sciences is necessary to understand the complex relationships between the host, gut microbiota, and diet. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Barah, Pankaj; Bones, Atle M
2015-02-01
The biggest challenge for modern biology is to integrate multidisciplinary approaches towards understanding the organizational and functional complexity of biological systems at different hierarchies, starting from the subcellular molecular mechanisms (microscopic) to the functional interactions of ecological communities (macroscopic). The plant-insect interaction is a good model for this purpose with the availability of an enormous amount of information at the molecular and the ecosystem levels. Changing global climatic conditions are abruptly resetting plant-insect interactions. Integration of discretely located heterogeneous information from the ecosystem to genes and pathways will be an advantage to understand the complexity of plant-insect interactions. This review will present the recent developments in omics-based high-throughput experimental approaches, with particular emphasis on studying plant defence responses against insect attack. The review highlights the importance of using integrative systems approaches to study plant-insect interactions from the macroscopic to the microscopic level. We analyse the current efforts in generating, integrating and modelling multiomics data to understand plant-insect interaction at a systems level. As a future prospect, we highlight the growing interest in utilizing the synthetic biology platform for engineering insect-resistant plants. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Nakayasu, Ernesto S.; Nicora, Carrie D.; Sims, Amy C.; Burnum-Johnson, Kristin E.; Kim, Young-Mo; Kyle, Jennifer E.; Matzke, Melissa M.; Shukla, Anil K.; Chu, Rosalie K.; Schepmoes, Athena A.; Jacobs, Jon M.; Baric, Ralph S.; Webb-Robertson, Bobbie-Jo; Smith, Richard D.
2016-01-01
ABSTRACT Integrative multi-omics analyses can empower more effective investigation and complete understanding of complex biological systems. Despite recent advances in a range of omics analyses, multi-omic measurements of the same sample are still challenging and current methods have not been well evaluated in terms of reproducibility and broad applicability. Here we adapted a solvent-based method, widely applied for extracting lipids and metabolites, to add proteomics to mass spectrometry-based multi-omics measurements. The metabolite, protein, and lipid extraction (MPLEx) protocol proved to be robust and applicable to a diverse set of sample types, including cell cultures, microbial communities, and tissues. To illustrate the utility of this protocol, an integrative multi-omics analysis was performed using a lung epithelial cell line infected with Middle East respiratory syndrome coronavirus, which showed the impact of this virus on the host glycolytic pathway and also suggested a role for lipids during infection. The MPLEx method is a simple, fast, and robust protocol that can be applied for integrative multi-omic measurements from diverse sample types (e.g., environmental, in vitro, and clinical). IMPORTANCE In systems biology studies, the integration of multiple omics measurements (i.e., genomics, transcriptomics, proteomics, metabolomics, and lipidomics) has been shown to provide a more complete and informative view of biological pathways. Thus, the prospect of extracting different types of molecules (e.g., DNAs, RNAs, proteins, and metabolites) and performing multiple omics measurements on single samples is very attractive, but such studies are challenging due to the fact that the extraction conditions differ according to the molecule type. Here, we adapted an organic solvent-based extraction method that demonstrated broad applicability and robustness, which enabled comprehensive proteomics, metabolomics, and lipidomics analyses from the same sample. Author Video: An author video summary of this article is available. PMID:27822525
Validation of a novel biomarker panel for the detection of ovarian cancer
Leung, Felix; Bernardini, Marcus Q.; Brown, Marshall D.; Zheng, Yingye; Molina, Rafael; Bast, Robert C.; Davis, Gerard; Serra, Stefano; Diamandis, Eleftherios P.; Kulasingam, Vathany
2016-01-01
Background Ovarian cancer (OvCa) is the most lethal gynecological malignancy. Our integrated -omics approach to OvCa biomarker discovery has identified kallikrein 6 (KLK6) and folate-receptor 1 (FOLR1) as promising candidates but these markers require further validation. Methods KLK6, FOLR1 CA125 and HE4 were investigated in three independent serum cohorts with a total of 20 healthy controls, 150 benign controls and 216 OvCa patients. The serum biomarker levels were determined by ELISA or automated immunoassay. Results All biomarkers demonstrated elevations in the sera of OvCa patients compared to controls (p<0.01). Overall, CA125 and HE4 displayed the strongest ability (AUC 0.80 and 0.82, respectively) to identify OvCa patients and the addition of HE4 to CA125 improved the sensitivity from 36% to 67% at a set specificity of 95%. As well, the combination of HE4 and FOLR1 was a strong predictor of OvCa diagnosis, displaying comparable sensitivity (65%) to the best performing CA125-based models (67%) at a set specificity of 95%. Conclusions The markers identified through our integrated –omics approach performed similarly to the clinically-approved markers CA125 and HE4. Furthermore, HE4 represents a powerful diagnostic marker for OvCa and should be used more routinely in a clinical setting. Impact The implications of our study are two-fold: (1) we have demonstrated the strengths of HE4 alone and in combination with CA125, lending credence to increasing its usage in the clinic; and (2) we have demonstrated the clinical utility of our integrated –omics approach to identifying novel serum markers with comparable performance to clinical markers. PMID:27448593
Omics Data Complementarity Underlines Functional Cross-Communication in Yeast.
Malod-Dognin, Noël; Pržulj, Nataša
2017-06-10
Mapping the complete functional layout of a cell and understanding the cross-talk between different processes are fundamental challenges. They elude us because of the incompleteness and noisiness of molecular data and because of the computational intractability of finding the exact answer. We perform a simple integration of three types of baker's yeast omics data to elucidate the functional organization and lines of cross-functional communication. We examine protein-protein interaction (PPI), co-expression (COEX) and genetic interaction (GI) data, and explore their relationship with the gold standard of functional organization, the Gene Ontology (GO). We utilize a simple framework that identifies functional cross-communication lines in each of the three data types, in GO, and collectively in the integrated model of the three omics data types; we present each of them in our new Functional Organization Map (FOM) model. We compare the FOMs of the three omics datasets with the FOM of GO and find that GI is in best agreement with GO, followed COEX and PPI. We integrate the three FOMs into a unified FOM and find that it is in better agreement with the FOM of GO than those of any omics dataset alone, demonstrating functional complementarity of different omics data.
Omics methods for probing the mode of action of natural and synthetic phytotoxins.
Duke, Stephen O; Bajsa, Joanna; Pan, Zhiqiang
2013-02-01
For a little over a decade, omics methods (transcriptomics, proteomics, metabolomics, and physionomics) have been used to discover and probe the mode of action of both synthetic and natural phytotoxins. For mode of action discovery, the strategy for each of these approaches is to generate an omics profile for phytotoxins with known molecular targets and to compare this library of responses to the responses of compounds with unknown modes of action. Using more than one omics approach enhances the probability of success. Generally, compounds with the same mode of action generate similar responses with a particular omics method. Stress and detoxification responses to phytotoxins can be much clearer than effects directly related to the target site. Clues to new modes of action must be validated with in vitro enzyme effects or genetic approaches. Thus far, the only new phytotoxin target site discovered with omics approaches (metabolomics and physionomics) is that of cinmethylin and structurally related 5-benzyloxymethyl-1,2-isoxazolines. These omics approaches pointed to tyrosine amino-transferase as the target, which was verified by enzyme assays and genetic methods. In addition to being a useful tool of mode of action discovery, omics methods provide detailed information on genetic and biochemical impacts of phytotoxins. Such information can be useful in understanding the full impact of natural phytotoxins in both agricultural and natural ecosystems.
Jiang, Xiaoyu; Fuchs, Mathias
2017-01-01
As modern biotechnologies advance, it has become increasingly frequent that different modalities of high-dimensional molecular data (termed “omics” data in this paper), such as gene expression, methylation, and copy number, are collected from the same patient cohort to predict the clinical outcome. While prediction based on omics data has been widely studied in the last fifteen years, little has been done in the statistical literature on the integration of multiple omics modalities to select a subset of variables for prediction, which is a critical task in personalized medicine. In this paper, we propose a simple penalized regression method to address this problem by assigning different penalty factors to different data modalities for feature selection and prediction. The penalty factors can be chosen in a fully data-driven fashion by cross-validation or by taking practical considerations into account. In simulation studies, we compare the prediction performance of our approach, called IPF-LASSO (Integrative LASSO with Penalty Factors) and implemented in the R package ipflasso, with the standard LASSO and sparse group LASSO. The use of IPF-LASSO is also illustrated through applications to two real-life cancer datasets. All data and codes are available on the companion website to ensure reproducibility. PMID:28546826
de Oliveira Dal'Molin, Cristiana G; Orellana, Camila; Gebbie, Leigh; Steen, Jennifer; Hodson, Mark P; Chrysanthopoulos, Panagiotis; Plan, Manuel R; McQualter, Richard; Palfreyman, Robin W; Nielsen, Lars K
2016-01-01
The urgent need for major gains in industrial crops productivity and in biofuel production from bioenergy grasses have reinforced attention on understanding C4 photosynthesis. Systems biology studies of C4 model plants may reveal important features of C4 metabolism. Here we chose foxtail millet (Setaria italica), as a C4 model plant and developed protocols to perform systems biology studies. As part of the systems approach, we have developed and used a genome-scale metabolic reconstruction in combination with the use of multi-omics technologies to gain more insights into the metabolism of S. italica. mRNA, protein, and metabolite abundances, were measured in mature and immature stem/leaf phytomers, and the multi-omics data were integrated into the metabolic reconstruction framework to capture key metabolic features in different developmental stages of the plant. RNA-Seq reads were mapped to the S. italica resulting for 83% coverage of the protein coding genes of S. italica. Besides revealing similarities and differences in central metabolism of mature and immature tissues, transcriptome analysis indicates significant gene expression of two malic enzyme isoforms (NADP- ME and NAD-ME). Although much greater expression levels of NADP-ME genes are observed and confirmed by the correspondent protein abundances in the samples, the expression of multiple genes combined to the significant abundance of metabolites that participates in C4 metabolism of NAD-ME and NADP-ME subtypes suggest that S. italica may use mixed decarboxylation modes of C4 photosynthetic pathways under different plant developmental stages. The overall analysis also indicates different levels of regulation in mature and immature tissues in carbon fixation, glycolysis, TCA cycle, amino acids, fatty acids, lignin, and cellulose syntheses. Altogether, the multi-omics analysis reveals different biological entities and their interrelation and regulation over plant development. With this study, we demonstrated that this systems approach is powerful enough to complement the functional metabolic annotation of bioenergy grasses.
de Oliveira Dal'Molin, Cristiana G.; Orellana, Camila; Gebbie, Leigh; Steen, Jennifer; Hodson, Mark P.; Chrysanthopoulos, Panagiotis; Plan, Manuel R.; McQualter, Richard; Palfreyman, Robin W.; Nielsen, Lars K.
2016-01-01
The urgent need for major gains in industrial crops productivity and in biofuel production from bioenergy grasses have reinforced attention on understanding C4 photosynthesis. Systems biology studies of C4 model plants may reveal important features of C4 metabolism. Here we chose foxtail millet (Setaria italica), as a C4 model plant and developed protocols to perform systems biology studies. As part of the systems approach, we have developed and used a genome-scale metabolic reconstruction in combination with the use of multi-omics technologies to gain more insights into the metabolism of S. italica. mRNA, protein, and metabolite abundances, were measured in mature and immature stem/leaf phytomers, and the multi-omics data were integrated into the metabolic reconstruction framework to capture key metabolic features in different developmental stages of the plant. RNA-Seq reads were mapped to the S. italica resulting for 83% coverage of the protein coding genes of S. italica. Besides revealing similarities and differences in central metabolism of mature and immature tissues, transcriptome analysis indicates significant gene expression of two malic enzyme isoforms (NADP- ME and NAD-ME). Although much greater expression levels of NADP-ME genes are observed and confirmed by the correspondent protein abundances in the samples, the expression of multiple genes combined to the significant abundance of metabolites that participates in C4 metabolism of NAD-ME and NADP-ME subtypes suggest that S. italica may use mixed decarboxylation modes of C4 photosynthetic pathways under different plant developmental stages. The overall analysis also indicates different levels of regulation in mature and immature tissues in carbon fixation, glycolysis, TCA cycle, amino acids, fatty acids, lignin, and cellulose syntheses. Altogether, the multi-omics analysis reveals different biological entities and their interrelation and regulation over plant development. With this study, we demonstrated that this systems approach is powerful enough to complement the functional metabolic annotation of bioenergy grasses. PMID:27559337
You, Qi; Yan, Hengyu; Liu, Yue; Yi, Xin; Zhang, Kang; Xu, Wenying; Su, Zhen
2017-05-01
The 22-nucleotide non-coding microRNAs (miRNAs) are mostly transcribed by RNA polymerase II and are similar to protein-coding genes. Unlike the clear process from stem-loop precursors to mature miRNAs, the primary transcriptional regulation of miRNA, especially in plants, still needs to be further clarified, including the original transcription start site, functional cis-elements and primary transcript structures. Due to several well-characterized transcription signals in the promoter region, we proposed a systemic approach integrating multidimensional "omics" (including genomics, transcriptomics, and epigenomics) data to improve the genome-wide identification of primary miRNA transcripts. Here, we used the model plant Arabidopsis thaliana to improve the ability to identify candidate promoter locations in intergenic miRNAs and to determine rules for identifying primary transcription start sites of miRNAs by integrating high-throughput omics data, such as the DNase I hypersensitive sites, chromatin immunoprecipitation-sequencing of polymerase II and H3K4me3, as well as high throughput transcriptomic data. As a result, 93% of refined primary transcripts could be confirmed by the primer pairs from a previous study. Cis-element and secondary structure analyses also supported the feasibility of our results. This work will contribute to the primary transcriptional regulatory analysis of miRNAs, and the conserved regulatory pattern may be a suitable miRNA characteristic in other plant species.
Omics Research on the International Space Station
NASA Technical Reports Server (NTRS)
Love, John
2015-01-01
The International Space Station (ISS) is an orbiting laboratory whose goals include advancing science and technology research. Completion of ISS assembly ushered a new era focused on utilization, encompassing multiple disciplines such as Biology and Biotechnology, Physical Sciences, Technology Development and Demonstration, Human Research, Earth and Space Sciences, and Educational Activities. The research complement planned for upcoming ISS Expeditions 45&46 includes several investigations in the new field of omics, which aims to collectively characterize sets of biomolecules (e.g., genomic, epigenomic, transcriptomic, proteomic, and metabolomic products) that translate into organismic structure and function. For example, Multi-Omics is a JAXA investigation that analyzes human microbial metabolic cross-talk in the space ecosystem by evaluating data from immune dysregulation biomarkers, metabolic profiles, and microbiota composition. The NASA OsteoOmics investigation studies gravitational regulation of osteoblast genomics and metabolism. Tissue Regeneration uses pan-omics approaches with cells cultured in bioreactors to characterize factors involved in mammalian bone tissue regeneration in microgravity. Rodent Research-3 includes an experiment that implements pan-omics to evaluate therapeutically significant molecular circuits, markers, and biomaterials associated with microgravity wound healing and tissue regeneration in bone defective rodents. The JAXA Mouse Epigenetics investigation examines molecular alterations in organ specific gene expression patterns and epigenetic modifications, and analyzes murine germ cell development during long term spaceflight. Lastly, Twins Study ("Differential effects of homozygous twin astronauts associated with differences in exposure to spaceflight factors"), NASA's first foray into human omics research, applies integrated analyses to assess biomolecular responses to physical, physiological, and environmental stressors associated with spaceflight.
Abiotic Stress Responses and Microbe-Mediated Mitigation in Plants: The Omics Strategies
Meena, Kamlesh K.; Sorty, Ajay M.; Bitla, Utkarsh M.; Choudhary, Khushboo; Gupta, Priyanka; Pareek, Ashwani; Singh, Dhananjaya P.; Prabha, Ratna; Sahu, Pramod K.; Gupta, Vijai K.; Singh, Harikesh B.; Krishanani, Kishor K.; Minhas, Paramjit S.
2017-01-01
Abiotic stresses are the foremost limiting factors for agricultural productivity. Crop plants need to cope up adverse external pressure created by environmental and edaphic conditions with their intrinsic biological mechanisms, failing which their growth, development, and productivity suffer. Microorganisms, the most natural inhabitants of diverse environments exhibit enormous metabolic capabilities to mitigate abiotic stresses. Since microbial interactions with plants are an integral part of the living ecosystem, they are believed to be the natural partners that modulate local and systemic mechanisms in plants to offer defense under adverse external conditions. Plant-microbe interactions comprise complex mechanisms within the plant cellular system. Biochemical, molecular and physiological studies are paving the way in understanding the complex but integrated cellular processes. Under the continuous pressure of increasing climatic alterations, it now becomes more imperative to define and interpret plant-microbe relationships in terms of protection against abiotic stresses. At the same time, it also becomes essential to generate deeper insights into the stress-mitigating mechanisms in crop plants for their translation in higher productivity. Multi-omics approaches comprising genomics, transcriptomics, proteomics, metabolomics and phenomics integrate studies on the interaction of plants with microbes and their external environment and generate multi-layered information that can answer what is happening in real-time within the cells. Integration, analysis and decipherization of the big-data can lead to a massive outcome that has significant chance for implementation in the fields. This review summarizes abiotic stresses responses in plants in-terms of biochemical and molecular mechanisms followed by the microbe-mediated stress mitigation phenomenon. We describe the role of multi-omics approaches in generating multi-pronged information to provide a better understanding of plant–microbe interactions that modulate cellular mechanisms in plants under extreme external conditions and help to optimize abiotic stresses. Vigilant amalgamation of these high-throughput approaches supports a higher level of knowledge generation about root-level mechanisms involved in the alleviation of abiotic stresses in organisms. PMID:28232845
Abiotic Stress Responses and Microbe-Mediated Mitigation in Plants: The Omics Strategies.
Meena, Kamlesh K; Sorty, Ajay M; Bitla, Utkarsh M; Choudhary, Khushboo; Gupta, Priyanka; Pareek, Ashwani; Singh, Dhananjaya P; Prabha, Ratna; Sahu, Pramod K; Gupta, Vijai K; Singh, Harikesh B; Krishanani, Kishor K; Minhas, Paramjit S
2017-01-01
Abiotic stresses are the foremost limiting factors for agricultural productivity. Crop plants need to cope up adverse external pressure created by environmental and edaphic conditions with their intrinsic biological mechanisms, failing which their growth, development, and productivity suffer. Microorganisms, the most natural inhabitants of diverse environments exhibit enormous metabolic capabilities to mitigate abiotic stresses. Since microbial interactions with plants are an integral part of the living ecosystem, they are believed to be the natural partners that modulate local and systemic mechanisms in plants to offer defense under adverse external conditions. Plant-microbe interactions comprise complex mechanisms within the plant cellular system. Biochemical, molecular and physiological studies are paving the way in understanding the complex but integrated cellular processes. Under the continuous pressure of increasing climatic alterations, it now becomes more imperative to define and interpret plant-microbe relationships in terms of protection against abiotic stresses. At the same time, it also becomes essential to generate deeper insights into the stress-mitigating mechanisms in crop plants for their translation in higher productivity. Multi-omics approaches comprising genomics, transcriptomics, proteomics, metabolomics and phenomics integrate studies on the interaction of plants with microbes and their external environment and generate multi-layered information that can answer what is happening in real-time within the cells. Integration, analysis and decipherization of the big-data can lead to a massive outcome that has significant chance for implementation in the fields. This review summarizes abiotic stresses responses in plants in-terms of biochemical and molecular mechanisms followed by the microbe-mediated stress mitigation phenomenon. We describe the role of multi-omics approaches in generating multi-pronged information to provide a better understanding of plant-microbe interactions that modulate cellular mechanisms in plants under extreme external conditions and help to optimize abiotic stresses. Vigilant amalgamation of these high-throughput approaches supports a higher level of knowledge generation about root-level mechanisms involved in the alleviation of abiotic stresses in organisms.
Integrative Genomics and Computational Systems Medicine
DOE Office of Scientific and Technical Information (OSTI.GOV)
McDermott, Jason E.; Huang, Yufei; Zhang, Bing
The exponential growth in generation of large amounts of genomic data from biological samples has driven the emerging field of systems medicine. This field is promising because it improves our understanding of disease processes at the systems level. However, the field is still in its young stage. There exists a great need for novel computational methods and approaches to effectively utilize and integrate various omics data.
Incorporating zebrafish omics into chemical biology and toxicology.
Sukardi, Hendrian; Ung, Choong Yong; Gong, Zhiyuan; Lam, Siew Hong
2010-03-01
In this communication, we describe the general aspects of omics approaches for analyses of transcriptome, proteome, and metabolome, and how they can be strategically incorporated into chemical screening and perturbation studies using the zebrafish system. Pharmacological efficacy and selectivity of chemicals can be evaluated based on chemical-induced phenotypic effects; however, phenotypic observation has limitations in identifying mechanistic action of chemicals. We suggest adapting gene-expression-based high-throughput screening as a complementary strategy to zebrafish-phenotype-based screening for mechanistic insights about the mode of action and toxicity of a chemical, large-scale predictive applications and comparative analysis of chemical-induced omics signatures, which are useful to identify conserved biological responses, signaling pathways, and biomarkers. The potential mechanistic, predictive, and comparative applications of omics approaches can be implemented in the zebrafish system. Examples of these using the omics approaches in zebrafish, including data of ours and others, are presented and discussed. Omics also facilitates the translatability of zebrafish studies across species through comparison of conserved chemical-induced responses. This review is intended to update interested readers with the current omics approaches that have been applied in chemical studies on zebrafish and their potential in enhancing discovery in chemical biology.
Zhai, Peng; Yang, Longshu; Guo, Xiao; Wang, Zhe; Guo, Jiangtao; Wang, Xiaoqi; Zhu, Huaiqiu
2017-10-02
During the past decade, the development of high throughput nucleic sequencing and mass spectrometry analysis techniques have enabled the characterization of microbial communities through metagenomics, metatranscriptomics, metaproteomics and metabolomics data. To reveal the diversity of microbial communities and interactions between living conditions and microbes, it is necessary to introduce comparative analysis based upon integration of all four types of data mentioned above. Comparative meta-omics, especially comparative metageomics, has been established as a routine process to highlight the significant differences in taxon composition and functional gene abundance among microbiota samples. Meanwhile, biologists are increasingly concerning about the correlations between meta-omics features and environmental factors, which may further decipher the adaptation strategy of a microbial community. We developed a graphical comprehensive analysis software named MetaComp comprising a series of statistical analysis approaches with visualized results for metagenomics and other meta-omics data comparison. This software is capable to read files generated by a variety of upstream programs. After data loading, analyses such as multivariate statistics, hypothesis testing of two-sample, multi-sample as well as two-group sample and a novel function-regression analysis of environmental factors are offered. Here, regression analysis regards meta-omic features as independent variable and environmental factors as dependent variables. Moreover, MetaComp is capable to automatically choose an appropriate two-group sample test based upon the traits of input abundance profiles. We further evaluate the performance of its choice, and exhibit applications for metagenomics, metaproteomics and metabolomics samples. MetaComp, an integrative software capable for applying to all meta-omics data, originally distills the influence of living environment on microbial community by regression analysis. Moreover, since the automatically chosen two-group sample test is verified to be outperformed, MetaComp is friendly to users without adequate statistical training. These improvements are aiming to overcome the new challenges under big data era for all meta-omics data. MetaComp is available at: http://cqb.pku.edu.cn/ZhuLab/MetaComp/ and https://github.com/pzhaipku/MetaComp/ .
Spicer, John I
2014-01-01
Hypoxia (low O2) is a common and natural feature of many marine environments. However, human-induced hypoxia has been on the rise over the past half century and is now recognised as a major problem in the world's seas and oceans. Whilst we have information on how marine invertebrates respond physiologically to hypoxia in the laboratory, we still lack understanding of how they respond to such stress in the wild (now and in the future). Consequently, here the question 'what can an ecophysiological approach tell us about physiological responses of marine invertebrates to hypoxia' is addressed. How marine invertebrates work in the wild when challenged with hypoxia is explored using four case studies centred on different hypoxic environments. The recent integration of the various -omics into ecophysiology is discussed, and a number of advantages of, and challenges to, successful integration are suggested. The case studies and -omic/physiology integration data are used to inform the concluding part of the review, where it is suggested that physiological responses to hypoxia in the wild are not always the same as those predicted from laboratory experiments. This is due to behaviour in the wild modifying responses, and therefore more than one type of 'experimental' approach is essential to reliably determine the actual response. It is also suggested that assuming it is known what a measured response is 'for' can be misleading and that taking parodies of ecophysiology seriously may impede research progress. This review finishes with the suggestion that an -omics approach is, and is becoming, a powerful method of understanding the response of marine invertebrates to environmental hypoxia and may be an ideal way of studying hypoxic responses in the wild. Despite centring on physiological responses to hypoxia, the review hopefully serves as a contribution to the discussion of what (animal) ecophysiology looks like (or should look like) in the 21st century.
Advantages and Pitfalls of Mass Spectrometry Based Metabolome Profiling in Systems Biology.
Aretz, Ina; Meierhofer, David
2016-04-27
Mass spectrometry-based metabolome profiling became the method of choice in systems biology approaches and aims to enhance biological understanding of complex biological systems. Genomics, transcriptomics, and proteomics are well established technologies and are commonly used by many scientists. In comparison, metabolomics is an emerging field and has not reached such high-throughput, routine and coverage than other omics technologies. Nevertheless, substantial improvements were achieved during the last years. Integrated data derived from multi-omics approaches will provide a deeper understanding of entire biological systems. Metabolome profiling is mainly hampered by its diversity, variation of metabolite concentration by several orders of magnitude and biological data interpretation. Thus, multiple approaches are required to cover most of the metabolites. No software tool is capable of comprehensively translating all the data into a biologically meaningful context yet. In this review, we discuss the advantages of metabolome profiling and main obstacles limiting progress in systems biology.
Advantages and Pitfalls of Mass Spectrometry Based Metabolome Profiling in Systems Biology
Aretz, Ina; Meierhofer, David
2016-01-01
Mass spectrometry-based metabolome profiling became the method of choice in systems biology approaches and aims to enhance biological understanding of complex biological systems. Genomics, transcriptomics, and proteomics are well established technologies and are commonly used by many scientists. In comparison, metabolomics is an emerging field and has not reached such high-throughput, routine and coverage than other omics technologies. Nevertheless, substantial improvements were achieved during the last years. Integrated data derived from multi-omics approaches will provide a deeper understanding of entire biological systems. Metabolome profiling is mainly hampered by its diversity, variation of metabolite concentration by several orders of magnitude and biological data interpretation. Thus, multiple approaches are required to cover most of the metabolites. No software tool is capable of comprehensively translating all the data into a biologically meaningful context yet. In this review, we discuss the advantages of metabolome profiling and main obstacles limiting progress in systems biology. PMID:27128910
Kolker, Eugene; Özdemir, Vural; Martens, Lennart; Hancock, William; Anderson, Gordon; Anderson, Nathaniel; Aynacioglu, Sukru; Baranova, Ancha; Campagna, Shawn R; Chen, Rui; Choiniere, John; Dearth, Stephen P; Feng, Wu-Chun; Ferguson, Lynnette; Fox, Geoffrey; Frishman, Dmitrij; Grossman, Robert; Heath, Allison; Higdon, Roger; Hutz, Mara H; Janko, Imre; Jiang, Lihua; Joshi, Sanjay; Kel, Alexander; Kemnitz, Joseph W; Kohane, Isaac S; Kolker, Natali; Lancet, Doron; Lee, Elaine; Li, Weizhong; Lisitsa, Andrey; Llerena, Adrian; Macnealy-Koch, Courtney; Marshall, Jean-Claude; Masuzzo, Paola; May, Amanda; Mias, George; Monroe, Matthew; Montague, Elizabeth; Mooney, Sean; Nesvizhskii, Alexey; Noronha, Santosh; Omenn, Gilbert; Rajasimha, Harsha; Ramamoorthy, Preveen; Sheehan, Jerry; Smarr, Larry; Smith, Charles V; Smith, Todd; Snyder, Michael; Rapole, Srikanth; Srivastava, Sanjeeva; Stanberry, Larissa; Stewart, Elizabeth; Toppo, Stefano; Uetz, Peter; Verheggen, Kenneth; Voy, Brynn H; Warnich, Louise; Wilhelm, Steven W; Yandl, Gregory
2014-01-01
Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.
Kolker, Eugene; Özdemir, Vural; Martens, Lennart; Hancock, William; Anderson, Gordon; Anderson, Nathaniel; Aynacioglu, Sukru; Baranova, Ancha; Campagna, Shawn R; Chen, Rui; Choiniere, John; Dearth, Stephen P; Feng, Wu-Chun; Ferguson, Lynnette; Fox, Geoffrey; Frishman, Dmitrij; Grossman, Robert; Heath, Allison; Higdon, Roger; Hutz, Mara H; Janko, Imre; Jiang, Lihua; Joshi, Sanjay; Kel, Alexander; Kemnitz, Joseph W; Kohane, Isaac S; Kolker, Natali; Lancet, Doron; Lee, Elaine; Li, Weizhong; Lisitsa, Andrey; Llerena, Adrian; MacNealy-Koch, Courtney; Marshall, Jean-Claude; Masuzzo, Paola; May, Amanda; Mias, George; Monroe, Matthew; Montague, Elizabeth; Mooney, Sean; Nesvizhskii, Alexey; Noronha, Santosh; Omenn, Gilbert; Rajasimha, Harsha; Ramamoorthy, Preveen; Sheehan, Jerry; Smarr, Larry; Smith, Charles V; Smith, Todd; Snyder, Michael; Rapole, Srikanth; Srivastava, Sanjeeva; Stanberry, Larissa; Stewart, Elizabeth; Toppo, Stefano; Uetz, Peter; Verheggen, Kenneth; Voy, Brynn H; Warnich, Louise; Wilhelm, Steven W; Yandl, Gregory
2013-12-01
Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.
Integration, warehousing, and analysis strategies of Omics data.
Gedela, Srinubabu
2011-01-01
"-Omics" is a current suffix for numerous types of large-scale biological data generation procedures, which naturally demand the development of novel algorithms for data storage and analysis. With next generation genome sequencing burgeoning, it is pivotal to decipher a coding site on the genome, a gene's function, and information on transcripts next to the pure availability of sequence information. To explore a genome and downstream molecular processes, we need umpteen results at the various levels of cellular organization by utilizing different experimental designs, data analysis strategies and methodologies. Here comes the need for controlled vocabularies and data integration to annotate, store, and update the flow of experimental data. This chapter explores key methodologies to merge Omics data by semantic data carriers, discusses controlled vocabularies as eXtensible Markup Languages (XML), and provides practical guidance, databases, and software links supporting the integration of Omics data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less
Reverse engineering biomolecular systems using -omic data: challenges, progress and opportunities.
Quo, Chang F; Kaddi, Chanchala; Phan, John H; Zollanvari, Amin; Xu, Mingqing; Wang, May D; Alterovitz, Gil
2012-07-01
Recent advances in high-throughput biotechnologies have led to the rapid growing research interest in reverse engineering of biomolecular systems (REBMS). 'Data-driven' approaches, i.e. data mining, can be used to extract patterns from large volumes of biochemical data at molecular-level resolution while 'design-driven' approaches, i.e. systems modeling, can be used to simulate emergent system properties. Consequently, both data- and design-driven approaches applied to -omic data may lead to novel insights in reverse engineering biological systems that could not be expected before using low-throughput platforms. However, there exist several challenges in this fast growing field of reverse engineering biomolecular systems: (i) to integrate heterogeneous biochemical data for data mining, (ii) to combine top-down and bottom-up approaches for systems modeling and (iii) to validate system models experimentally. In addition to reviewing progress made by the community and opportunities encountered in addressing these challenges, we explore the emerging field of synthetic biology, which is an exciting approach to validate and analyze theoretical system models directly through experimental synthesis, i.e. analysis-by-synthesis. The ultimate goal is to address the present and future challenges in reverse engineering biomolecular systems (REBMS) using integrated workflow of data mining, systems modeling and synthetic biology.
Wu, Dingming; Wang, Dongfang; Zhang, Michael Q; Gu, Jin
2015-12-01
One major goal of large-scale cancer omics study is to identify molecular subtypes for more accurate cancer diagnoses and treatments. To deal with high-dimensional cancer multi-omics data, a promising strategy is to find an effective low-dimensional subspace of the original data and then cluster cancer samples in the reduced subspace. However, due to data-type diversity and big data volume, few methods can integrative and efficiently find the principal low-dimensional manifold of the high-dimensional cancer multi-omics data. In this study, we proposed a novel low-rank approximation based integrative probabilistic model to fast find the shared principal subspace across multiple data types: the convexity of the low-rank regularized likelihood function of the probabilistic model ensures efficient and stable model fitting. Candidate molecular subtypes can be identified by unsupervised clustering hundreds of cancer samples in the reduced low-dimensional subspace. On testing datasets, our method LRAcluster (low-rank approximation based multi-omics data clustering) runs much faster with better clustering performances than the existing method. Then, we applied LRAcluster on large-scale cancer multi-omics data from TCGA. The pan-cancer analysis results show that the cancers of different tissue origins are generally grouped as independent clusters, except squamous-like carcinomas. While the single cancer type analysis suggests that the omics data have different subtyping abilities for different cancer types. LRAcluster is a very useful method for fast dimension reduction and unsupervised clustering of large-scale multi-omics data. LRAcluster is implemented in R and freely available via http://bioinfo.au.tsinghua.edu.cn/software/lracluster/ .
The KUPNetViz: a biological network viewer for multiple -omics datasets in kidney diseases.
Moulos, Panagiotis; Klein, Julie; Jupp, Simon; Stevens, Robert; Bascands, Jean-Loup; Schanstra, Joost P
2013-07-24
Constant technological advances have allowed scientists in biology to migrate from conventional single-omics to multi-omics experimental approaches, challenging bioinformatics to bridge this multi-tiered information. Ongoing research in renal biology is no exception. The results of large-scale and/or high throughput experiments, presenting a wealth of information on kidney disease are scattered across the web. To tackle this problem, we recently presented the KUPKB, a multi-omics data repository for renal diseases. In this article, we describe KUPNetViz, a biological graph exploration tool allowing the exploration of KUPKB data through the visualization of biomolecule interactions. KUPNetViz enables the integration of multi-layered experimental data over different species, renal locations and renal diseases to protein-protein interaction networks and allows association with biological functions, biochemical pathways and other functional elements such as miRNAs. KUPNetViz focuses on the simplicity of its usage and the clarity of resulting networks by reducing and/or automating advanced functionalities present in other biological network visualization packages. In addition, it allows the extrapolation of biomolecule interactions across different species, leading to the formulations of new plausible hypotheses, adequate experiment design and to the suggestion of novel biological mechanisms. We demonstrate the value of KUPNetViz by two usage examples: the integration of calreticulin as a key player in a larger interaction network in renal graft rejection and the novel observation of the strong association of interleukin-6 with polycystic kidney disease. The KUPNetViz is an interactive and flexible biological network visualization and exploration tool. It provides renal biologists with biological network snapshots of the complex integrated data of the KUPKB allowing the formulation of new hypotheses in a user friendly manner.
The KUPNetViz: a biological network viewer for multiple -omics datasets in kidney diseases
2013-01-01
Background Constant technological advances have allowed scientists in biology to migrate from conventional single-omics to multi-omics experimental approaches, challenging bioinformatics to bridge this multi-tiered information. Ongoing research in renal biology is no exception. The results of large-scale and/or high throughput experiments, presenting a wealth of information on kidney disease are scattered across the web. To tackle this problem, we recently presented the KUPKB, a multi-omics data repository for renal diseases. Results In this article, we describe KUPNetViz, a biological graph exploration tool allowing the exploration of KUPKB data through the visualization of biomolecule interactions. KUPNetViz enables the integration of multi-layered experimental data over different species, renal locations and renal diseases to protein-protein interaction networks and allows association with biological functions, biochemical pathways and other functional elements such as miRNAs. KUPNetViz focuses on the simplicity of its usage and the clarity of resulting networks by reducing and/or automating advanced functionalities present in other biological network visualization packages. In addition, it allows the extrapolation of biomolecule interactions across different species, leading to the formulations of new plausible hypotheses, adequate experiment design and to the suggestion of novel biological mechanisms. We demonstrate the value of KUPNetViz by two usage examples: the integration of calreticulin as a key player in a larger interaction network in renal graft rejection and the novel observation of the strong association of interleukin-6 with polycystic kidney disease. Conclusions The KUPNetViz is an interactive and flexible biological network visualization and exploration tool. It provides renal biologists with biological network snapshots of the complex integrated data of the KUPKB allowing the formulation of new hypotheses in a user friendly manner. PMID:23883183
Multiomics Data Triangulation for Asthma Candidate Biomarkers and Precision Medicine.
Pecak, Matija; Korošec, Peter; Kunej, Tanja
2018-06-01
Asthma is a common complex disorder and has been subject to intensive omics research for disease susceptibility and therapeutic innovation. Candidate biomarkers of asthma and its precision treatment demand that they stand the test of multiomics data triangulation before they can be prioritized for clinical applications. We classified the biomarkers of asthma after a search of the literature and based on whether or not a given biomarker candidate is reported in multiple omics platforms and methodologies, using PubMed and Web of Science, we identified omics studies of asthma conducted on diverse platforms using keywords, such as asthma, genomics, metabolomics, and epigenomics. We extracted data about asthma candidate biomarkers from 73 articles and developed a catalog of 190 potential asthma biomarkers (167 human, 23 animal data), comprising DNA loci, transcripts, proteins, metabolites, epimutations, and noncoding RNAs. The data were sorted according to 13 omics types: genomics, epigenomics, transcriptomics, proteomics, interactomics, metabolomics, ncRNAomics, glycomics, lipidomics, environmental omics, pharmacogenomics, phenomics, and integrative omics. Importantly, we found that 10 candidate biomarkers were apparent in at least two or more omics levels, thus promising potential for further biomarker research and development and precision medicine applications. This multiomics catalog reported herein for the first time contributes to future decision-making on prioritization of biomarkers and validation efforts for precision medicine in asthma. The findings may also facilitate meta-analyses and integrative omics studies in the future.
Gea, Guerriero; Kjell, Sergeant; Jean-François, Hausman
2013-01-01
Lignin and cellulose represent the two main components of plant secondary walls and the most abundant polymers on Earth. Quantitatively one of the principal products of the phenylpropanoid pathway, lignin confers high mechanical strength and hydrophobicity to plant walls, thus enabling erect growth and high-pressure water transport in the vessels. Lignin is characterized by a high natural heterogeneity in its composition and abundance in plant secondary cell walls, even in the different tissues of the same plant. A typical example is the stem of fibre crops, which shows a lignified core enveloped by a cellulosic, lignin-poor cortex. Despite the great value of fibre crops for humanity, however, still little is known on the mechanisms controlling their cell wall biogenesis, and particularly, what regulates their spatially-defined lignification pattern. Given the chemical complexity and the heterogeneous composition of fibre crops’ secondary walls, only the use of multidisciplinary approaches can convey an integrated picture and provide exhaustive information covering different levels of biological complexity. The present review highlights the importance of combining high throughput -omics approaches to get a complete understanding of the factors regulating the lignification heterogeneity typical of fibre crops. PMID:23708098
Omics of the marine medaka (Oryzias melastigma) and its relevance to marine environmental research.
Kim, Bo-Mi; Kim, Jaebum; Choi, Ik-Young; Raisuddin, Sheikh; Au, Doris W T; Leung, Kenneth M Y; Wu, Rudolf S S; Rhee, Jae-Sung; Lee, Jae-Seong
2016-02-01
In recent years, the marine medaka (Oryzias melastigma), also known as the Indian medaka or brackish medaka, has been recognized as a model fish species for ecotoxicology and environmental research in the Asian region. O. melastigma has several promising features for research, which include a short generation period (3-4 months), daily spawning, small size (3-4 cm), transparent embryos, sexual dimorphism, and ease of mass culture in the laboratory. There have been extensive transcriptome and genome studies on the marine medaka in the past decade. Such omics data can be useful in understanding the signal transduction pathways of small teleosts in response to environmental stressors. An omics-integrated approach in the study of the marine medaka is important for strengthening its role as a small fish model for marine environmental studies. In this review, we present current omics information about the marine medaka and discuss its potential applications in the study of various molecular pathways that can be targets of marine environmental stressors, such as chemical pollutants. We believe that this review will encourage the use of this small fish as a model species in marine environmental research. Copyright © 2015 Elsevier Ltd. All rights reserved.
Computational biology for cardiovascular biomarker discovery.
Azuaje, Francisco; Devaux, Yvan; Wagner, Daniel
2009-07-01
Computational biology is essential in the process of translating biological knowledge into clinical practice, as well as in the understanding of biological phenomena based on the resources and technologies originating from the clinical environment. One such key contribution of computational biology is the discovery of biomarkers for predicting clinical outcomes using 'omic' information. This process involves the predictive modelling and integration of different types of data and knowledge for screening, diagnostic or prognostic purposes. Moreover, this requires the design and combination of different methodologies based on statistical analysis and machine learning. This article introduces key computational approaches and applications to biomarker discovery based on different types of 'omic' data. Although we emphasize applications in cardiovascular research, the computational requirements and advances discussed here are also relevant to other domains. We will start by introducing some of the contributions of computational biology to translational research, followed by an overview of methods and technologies used for the identification of biomarkers with predictive or classification value. The main types of 'omic' approaches to biomarker discovery will be presented with specific examples from cardiovascular research. This will include a review of computational methodologies for single-source and integrative data applications. Major computational methods for model evaluation will be described together with recommendations for reporting models and results. We will present recent advances in cardiovascular biomarker discovery based on the combination of gene expression and functional network analyses. The review will conclude with a discussion of key challenges for computational biology, including perspectives from the biosciences and clinical areas.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kolker, Eugene; Ozdemir, Vural; Martens , Lennart
Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies omics studies are becoming increasingly prevalent yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research,. These three essential steps require consistent generation, capture, and distribution of the metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologiesmore » and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. This omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.« less
Özdemir, Vural; Martens, Lennart; Hancock, William; Anderson, Gordon; Anderson, Nathaniel; Aynacioglu, Sukru; Baranova, Ancha; Campagna, Shawn R.; Chen, Rui; Choiniere, John; Dearth, Stephen P.; Feng, Wu-Chun; Ferguson, Lynnette; Fox, Geoffrey; Frishman, Dmitrij; Grossman, Robert; Heath, Allison; Higdon, Roger; Hutz, Mara H.; Janko, Imre; Jiang, Lihua; Joshi, Sanjay; Kel, Alexander; Kemnitz, Joseph W.; Kohane, Isaac S.; Kolker, Natali; Lancet, Doron; Lee, Elaine; Li, Weizhong; Lisitsa, Andrey; Llerena, Adrian; MacNealy-Koch, Courtney; Marshall, Jean-Claude; Masuzzo, Paola; May, Amanda; Mias, George; Monroe, Matthew; Montague, Elizabeth; Mooney, Sean; Nesvizhskii, Alexey; Noronha, Santosh; Omenn, Gilbert; Rajasimha, Harsha; Ramamoorthy, Preveen; Sheehan, Jerry; Smarr, Larry; Smith, Charles V.; Smith, Todd; Snyder, Michael; Rapole, Srikanth; Srivastava, Sanjeeva; Stanberry, Larissa; Stewart, Elizabeth; Toppo, Stefano; Uetz, Peter; Verheggen, Kenneth; Voy, Brynn H.; Warnich, Louise; Wilhelm, Steven W.; Yandl, Gregory
2014-01-01
Abstract Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement. PMID:24456465
Integrative Analysis of Longitudinal Metabolomics Data from a Personal Multi-Omics Profile
Stanberry, Larissa; Mias, George I.; Haynes, Winston; Higdon, Roger; Snyder, Michael; Kolker, Eugene
2013-01-01
The integrative personal omics profile (iPOP) is a pioneering study that combines genomics, transcriptomics, proteomics, metabolomics and autoantibody profiles from a single individual over a 14-month period. The observation period includes two episodes of viral infection: a human rhinovirus and a respiratory syncytial virus. The profile studies give an informative snapshot into the biological functioning of an organism. We hypothesize that pathway expression levels are associated with disease status. To test this hypothesis, we use biological pathways to integrate metabolomics and proteomics iPOP data. The approach computes the pathways’ differential expression levels at each time point, while taking into account the pathway structure and the longitudinal design. The resulting pathway levels show strong association with the disease status. Further, we identify temporal patterns in metabolite expression levels. The changes in metabolite expression levels also appear to be consistent with the disease status. The results of the integrative analysis suggest that changes in biological pathways may be used to predict and monitor the disease. The iPOP experimental design, data acquisition and analysis issues are discussed within the broader context of personal profiling. PMID:24958148
HARNESSING BIG DATA FOR PRECISION MEDICINE: INFRASTRUCTURES AND APPLICATIONS.
Yu, Kun-Hsing; Hart, Steven N; Goldfeder, Rachel; Zhang, Qiangfeng Cliff; Parker, Stephen C J; Snyder, Michael
2017-01-01
Precision medicine is a health management approach that accounts for individual differences in genetic backgrounds and environmental exposures. With the recent advancements in high-throughput omics profiling technologies, collections of large study cohorts, and the developments of data mining algorithms, big data in biomedicine is expected to provide novel insights into health and disease states, which can be translated into personalized disease prevention and treatment plans. However, petabytes of biomedical data generated by multiple measurement modalities poses a significant challenge for data analysis, integration, storage, and result interpretation. In addition, patient privacy preservation, coordination between participating medical centers and data analysis working groups, as well as discrepancies in data sharing policies remain important topics of discussion. In this workshop, we invite experts in omics integration, biobank research, and data management to share their perspectives on leveraging big data to enable precision medicine.Workshop website: http://tinyurl.com/PSB17BigData; HashTag: #PSB17BigData.
Karapiperis, Christos; Kempf, Stefan J; Quintens, Roel; Azimzadeh, Omid; Vidal, Victoria Linares; Pazzaglia, Simonetta; Bazyka, Dimitry; Mastroberardino, Pier G; Scouras, Zacharias G; Tapio, Soile; Benotmane, Mohammed Abderrafi; Ouzounis, Christos A
2016-05-11
The underlying molecular processes representing stress responses to low-dose ionising radiation (LDIR) in mammals are just beginning to be understood. In particular, LDIR effects on the brain and their possible association with neurodegenerative disease are currently being explored using omics technologies. We describe a light-weight approach for the storage, analysis and distribution of relevant LDIR omics datasets. The data integration platform, called BRIDE, contains information from the literature as well as experimental information from transcriptomics and proteomics studies. It deploys a hybrid, distributed solution using both local storage and cloud technology. BRIDE can act as a knowledge broker for LDIR researchers, to facilitate molecular research on the systems biology of LDIR response in mammals. Its flexible design can capture a range of experimental information for genomics, epigenomics, transcriptomics, and proteomics. The data collection is available at:
USDA-ARS?s Scientific Manuscript database
Bovine mastitis is an inflammation-driven disease of the bovine mammary gland that costs the global dairy industry several billion dollars per annum. Because disease susceptibility is a multi-factorial complex phenotype, a multi-omic integrative biology approach is required to dissect the multilayer...
An Integrated Systems Genetics and Omics Toolkit to Probe Gene Function.
Li, Hao; Wang, Xu; Rukina, Daria; Huang, Qingyao; Lin, Tao; Sorrentino, Vincenzo; Zhang, Hongbo; Bou Sleiman, Maroun; Arends, Danny; McDaid, Aaron; Luan, Peiling; Ziari, Naveed; Velázquez-Villegas, Laura A; Gariani, Karim; Kutalik, Zoltan; Schoonjans, Kristina; Radcliffe, Richard A; Prins, Pjotr; Morgenthaler, Stephan; Williams, Robert W; Auwerx, Johan
2018-01-24
Identifying genetic and environmental factors that impact complex traits and common diseases is a high biomedical priority. Here, we developed, validated, and implemented a series of multi-layered systems approaches, including (expression-based) phenome-wide association, transcriptome-/proteome-wide association, and (reverse-) mediation analysis, in an open-access web server (systems-genetics.org) to expedite the systems dissection of gene function. We applied these approaches to multi-omics datasets from the BXD mouse genetic reference population, and identified and validated associations between genes and clinical and molecular phenotypes, including previously unreported links between Rpl26 and body weight, and Cpt1a and lipid metabolism. Furthermore, through mediation and reverse-mediation analysis we established regulatory relations between genes, such as the co-regulation of BCKDHA and BCKDHB protein levels, and identified targets of transcription factors E2F6, ZFP277, and ZKSCAN1. Our multifaceted toolkit enabled the identification of gene-gene and gene-phenotype links that are robust and that translate well across populations and species, and can be universally applied to any populations with multi-omics datasets. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
[Progress in omics research of Aspergillus niger].
Sui, Yufei; Ouyang, Liming; Lu, Hongzhong; Zhuang, Yingping; Zhang, Siliang
2016-08-25
Aspergillus niger, as an important industrial fermentation strain, is widely applied in the production of organic acids and industrial enzymes. With the development of diverse omics technologies, the data of genome, transcriptome, proteome and metabolome of A. niger are increasing continuously, which declared the coming era of big data for the research in fermentation process of A. niger. The data analysis from single omics and the comparison of multi-omics, to the integrations of multi-omics based on the genome-scale metabolic network model largely extends the intensive and systematic understanding of the efficient production mechanism of A. niger. It also provides possibilities for the reasonable global optimization of strain performance by genetic modification and process regulation. We reviewed and summarized progress in omics research of A. niger, and proposed the development direction of omics research on this cell factory.
Protein Interactome of Muscle Invasive Bladder Cancer
Bhat, Akshay; Heinzel, Andreas; Mayer, Bernd; Perco, Paul; Mühlberger, Irmgard; Husi, Holger; Merseburger, Axel S.; Zoidakis, Jerome; Vlahou, Antonia; Schanstra, Joost P.; Mischak, Harald; Jankowski, Vera
2015-01-01
Muscle invasive bladder carcinoma is a complex, multifactorial disease caused by disruptions and alterations of several molecular pathways that result in heterogeneous phenotypes and variable disease outcome. Combining this disparate knowledge may offer insights for deciphering relevant molecular processes regarding targeted therapeutic approaches guided by molecular signatures allowing improved phenotype profiling. The aim of the study is to characterize muscle invasive bladder carcinoma on a molecular level by incorporating scientific literature screening and signatures from omics profiling. Public domain omics signatures together with molecular features associated with muscle invasive bladder cancer were derived from literature mining to provide 286 unique protein-coding genes. These were integrated in a protein-interaction network to obtain a molecular functional map of the phenotype. This feature map educated on three novel disease-associated pathways with plausible involvement in bladder cancer, namely Regulation of actin cytoskeleton, Neurotrophin signalling pathway and Endocytosis. Systematic integration approaches allow to study the molecular context of individual features reported as associated with a clinical phenotype and could potentially help to improve the molecular mechanistic description of the disorder. PMID:25569276
[Applications of meta-analysis in multi-omics].
Han, Mingfei; Zhu, Yunping
2014-07-01
As a statistical method integrating multi-features and multi-data, meta-analysis was introduced to the field of life science in the 1990s. With the rapid advances in high-throughput technologies, life omics, the core of which are genomics, transcriptomics and proteomics, is becoming the new hot spot of life science. Although the fast output of massive data has promoted the development of omics study, it results in excessive data that are difficult to integrate systematically. In this case, meta-analysis is frequently applied to analyze different types of data and is improved continuously. Here, we first summarize the representative meta-analysis methods systematically, and then study the current applications of meta-analysis in various omics fields, finally we discuss the still-existing problems and the future development of meta-analysis.
Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets.
Argelaguet, Ricard; Velten, Britta; Arnol, Damien; Dietrich, Sascha; Zenz, Thorsten; Marioni, John C; Buettner, Florian; Huber, Wolfgang; Stegle, Oliver
2018-06-20
Multi-omics studies promise the improved characterization of biological processes across molecular layers. However, methods for the unsupervised integration of the resulting heterogeneous data sets are lacking. We present Multi-Omics Factor Analysis (MOFA), a computational method for discovering the principal sources of variation in multi-omics data sets. MOFA infers a set of (hidden) factors that capture biological and technical sources of variability. It disentangles axes of heterogeneity that are shared across multiple modalities and those specific to individual data modalities. The learnt factors enable a variety of downstream analyses, including identification of sample subgroups, data imputation and the detection of outlier samples. We applied MOFA to a cohort of 200 patient samples of chronic lymphocytic leukaemia, profiled for somatic mutations, RNA expression, DNA methylation and ex vivo drug responses. MOFA identified major dimensions of disease heterogeneity, including immunoglobulin heavy-chain variable region status, trisomy of chromosome 12 and previously underappreciated drivers, such as response to oxidative stress. In a second application, we used MOFA to analyse single-cell multi-omics data, identifying coordinated transcriptional and epigenetic changes along cell differentiation. © 2018 The Authors. Published under the terms of the CC BY 4.0 license.
Visualising associations between paired ‘omics’ data sets
2012-01-01
Background Each omics platform is now able to generate a large amount of data. Genomics, proteomics, metabolomics, interactomics are compiled at an ever increasing pace and now form a core part of the fundamental systems biology framework. Recently, several integrative approaches have been proposed to extract meaningful information. However, these approaches lack of visualisation outputs to fully unravel the complex associations between different biological entities. Results The multivariate statistical approaches ‘regularized Canonical Correlation Analysis’ and ‘sparse Partial Least Squares regression’ were recently developed to integrate two types of highly dimensional ‘omics’ data and to select relevant information. Using the results of these methods, we propose to revisit few graphical outputs to better understand the relationships between two ‘omics’ data and to better visualise the correlation structure between the different biological entities. These graphical outputs include Correlation Circle plots, Relevance Networks and Clustered Image Maps. We demonstrate the usefulness of such graphical outputs on several biological data sets and further assess their biological relevance using gene ontology analysis. Conclusions Such graphical outputs are undoubtedly useful to aid the interpretation of these promising integrative analysis tools and will certainly help in addressing fundamental biological questions and understanding systems as a whole. Availability The graphical tools described in this paper are implemented in the freely available R package mixOmics and in its associated web application. PMID:23148523
The evolution of analytical chemistry methods in foodomics.
Gallo, Monica; Ferranti, Pasquale
2016-01-08
The methodologies of food analysis have greatly evolved over the past 100 years, from basic assays based on solution chemistry to those relying on the modern instrumental platforms. Today, the development and optimization of integrated analytical approaches based on different techniques to study at molecular level the chemical composition of a food may allow to define a 'food fingerprint', valuable to assess nutritional value, safety and quality, authenticity and security of foods. This comprehensive strategy, defined foodomics, includes emerging work areas such as food chemistry, phytochemistry, advanced analytical techniques, biosensors and bioinformatics. Integrated approaches can help to elucidate some critical issues in food analysis, but also to face the new challenges of a globalized world: security, sustainability and food productions in response to environmental world-wide changes. They include the development of powerful analytical methods to ensure the origin and quality of food, as well as the discovery of biomarkers to identify potential food safety problems. In the area of nutrition, the future challenge is to identify, through specific biomarkers, individual peculiarities that allow early diagnosis and then a personalized prognosis and diet for patients with food-related disorders. Far from the aim of an exhaustive review of the abundant literature dedicated to the applications of omic sciences in food analysis, we will explore how classical approaches, such as those used in chemistry and biochemistry, have evolved to intersect with the new omics technologies to produce a progress in our understanding of the complexity of foods. Perhaps most importantly, a key objective of the review will be to explore the development of simple and robust methods for a fully applied use of omics data in food science. Copyright © 2015 Elsevier B.V. All rights reserved.
Laufer, Vincent A; Chen, Jake Y; Langefeld, Carl D; Bridges, S Louis
2017-08-01
The use of high-throughput omics may help to understand the contribution of genetic variants to the pathogenesis of rheumatic diseases. We discuss the concept of missing heritability: that genetic variants do not explain the heritability of rheumatoid arthritis and related rheumatologic conditions. In addition to an overview of how integrative data analysis can lead to novel insights into mechanisms of rheumatic diseases, we describe statistical approaches to prioritizing genetic variants for future functional analyses. We illustrate how analyses of large datasets provide hope for improved approaches to the diagnosis, treatment, and prevention of rheumatic diseases. Copyright © 2017 Elsevier Inc. All rights reserved.
Omics studies of citrus, grape and rosaceae fruit trees
Shiratake, Katsuhiro; Suzuki, Mami
2016-01-01
Recent advance of bioinformatics and analytical apparatuses such as next generation DNA sequencer (NGS) and mass spectrometer (MS) has brought a big wave of comprehensive study to biology. Comprehensive study targeting all genes, transcripts (RNAs), proteins, metabolites, hormones, ions or phenotypes is called genomics, transcriptomics, proteomics, metabolomics, hormonomics, ionomics or phenomics, respectively. These omics are powerful approaches to identify key genes for important traits, to clarify events of physiological mechanisms and to reveal unknown metabolic pathways in crops. Recently, the use of omics approach has increased dramatically in fruit tree research. Although the most reported omics studies on fruit trees are transcriptomics, proteomics and metabolomics, and a few is reported on hormonomics and ionomics. In this article, we reviewed recent omics studies of major fruit trees, i.e. citrus, grapevine and rosaceae fruit trees. The effectiveness and prospects of omics in fruit tree research will as well be highlighted. PMID:27069397
Omics studies of citrus, grape and rosaceae fruit trees.
Shiratake, Katsuhiro; Suzuki, Mami
2016-01-01
Recent advance of bioinformatics and analytical apparatuses such as next generation DNA sequencer (NGS) and mass spectrometer (MS) has brought a big wave of comprehensive study to biology. Comprehensive study targeting all genes, transcripts (RNAs), proteins, metabolites, hormones, ions or phenotypes is called genomics, transcriptomics, proteomics, metabolomics, hormonomics, ionomics or phenomics, respectively. These omics are powerful approaches to identify key genes for important traits, to clarify events of physiological mechanisms and to reveal unknown metabolic pathways in crops. Recently, the use of omics approach has increased dramatically in fruit tree research. Although the most reported omics studies on fruit trees are transcriptomics, proteomics and metabolomics, and a few is reported on hormonomics and ionomics. In this article, we reviewed recent omics studies of major fruit trees, i.e. citrus, grapevine and rosaceae fruit trees. The effectiveness and prospects of omics in fruit tree research will as well be highlighted.
Principles of proteome allocation are revealed using proteomic data and genome-scale models
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; Ebrahim, Ali; Saunders, Michael A.; Palsson, Bernhard O.
2016-01-01
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models. PMID:27857205
Principles of proteome allocation are revealed using proteomic data and genome-scale models
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; ...
2016-11-18
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less
Personalized Medicine: What's in it for Rare Diseases?
Schee Genannt Halfmann, Sebastian; Mählmann, Laura; Leyens, Lada; Reumann, Matthias; Brand, Angela
2017-01-01
Personalised Medicine has become a reality over the last years. The emergence of 'omics' and big data has started revolutionizing healthcare. New 'omics' technologies lead to a better molecular characterization of diseases and a new understanding of the complexity of diseases. The approach of PM is already successfully applied in different healthcare areas such as oncology, cardiology, nutrition and for rare diseases. However, health systems across the EU are often still promoting the 'one-size fits all' approach, even if it is known that patients do greatly vary in their molecular characteristics and response to drugs and other interventions. To make use of the full potentials of PM in the next years ahead several challenges need to be addressed such as the integration of big data, patient empowerment, translation of basic to clinical research, bringing the innovation to the market and shaping sustainable healthcare systems.
[Alternatives to animal testing].
Fabre, Isabelle
2009-11-01
The use of alternative methods to animal testing are an integral part of the 3Rs concept (refine, reduce, replace) defined by Russel & Burch in 1959. These approaches include in silico methods (databases and computer models), in vitro physicochemical analysis, biological methods using bacteria or isolated cells, reconstructed enzyme systems, and reconstructed tissues. Emerging "omic" methods used in integrated approaches further help to reduce animal use, while stem cells offer promising approaches to toxicologic and pathophysiologic studies, along with organotypic cultures and bio-artificial organs. Only a few alternative methods can so far be used in stand-alone tests as substitutes for animal testing. The best way to use these methods is to integrate them in tiered testing strategies (ITS), in which animals are only used as a last resort.
Wilmes, Anja; Bielow, Chris; Ranninger, Christina; Bellwon, Patricia; Aschauer, Lydia; Limonciel, Alice; Chassaigne, Hubert; Kristl, Theresa; Aiche, Stephan; Huber, Christian G; Guillou, Claude; Hewitt, Philipp; Leonard, Martin O; Dekant, Wolfgang; Bois, Frederic; Jennings, Paul
2015-12-25
Cisplatin is one of the most widely used chemotherapeutic agents for the treatment of solid tumours. The major dose-limiting factor is nephrotoxicity, in particular in the proximal tubule. Here, we use an integrated omics approach, including transcriptomics, proteomics and metabolomics coupled to biokinetics to identify cell stress response pathways induced by cisplatin. The human renal proximal tubular cell line RPTEC/TERT1 was treated with sub-cytotoxic concentrations of cisplatin (0.5 and 2 μM) in a daily repeat dose treating regime for up to 14 days. Biokinetic analysis showed that cisplatin was taken up from the basolateral compartment, transported to the apical compartment, and accumulated in cells over time. This is in line with basolateral uptake of cisplatin via organic cation transporter 2 and bioactivation via gamma-glutamyl transpeptidase located on the apical side of proximal tubular cells. Cisplatin affected several pathways including, p53 signalling, Nrf2 mediated oxidative stress response, mitochondrial processes, mTOR and AMPK signalling. In addition, we identified novel pathways changed by cisplatin, including eIF2 signalling, actin nucleation via the ARP/WASP complex and regulation of cell polarization. In conclusion, using an integrated omic approach together with biokinetics we have identified both novel and established mechanisms of cisplatin toxicity. Copyright © 2014 Elsevier Ltd. All rights reserved.
Omics strategies for revealing Yersinia pestis virulence
Yang, Ruifu; Du, Zongmin; Han, Yanping; Zhou, Lei; Song, Yajun; Zhou, Dongsheng; Cui, Yujun
2012-01-01
Omics has remarkably changed the way we investigate and understand life. Omics differs from traditional hypothesis-driven research because it is a discovery-driven approach. Mass datasets produced from omics-based studies require experts from different fields to reveal the salient features behind these data. In this review, we summarize omics-driven studies to reveal the virulence features of Yersinia pestis through genomics, trascriptomics, proteomics, interactomics, etc. These studies serve as foundations for further hypothesis-driven research and help us gain insight into Y. pestis pathogenesis. PMID:23248778
Time to "go large" on biofilm research: advantages of an omics approach.
Azevedo, Nuno F; Lopes, Susana P; Keevil, Charles W; Pereira, Maria O; Vieira, Maria J
2009-04-01
In nature, the biofilm mode of life is of great importance in the cell cycle for many microorganisms. Perhaps because of biofilm complexity and variability, the characterization of a given microbial system, in terms of biofilm formation potential, structure and associated physiological activity, in a large-scale, standardized and systematic manner has been hindered by the absence of high-throughput methods. This outlook is now starting to change as new methods involving the utilization of microtiter-plates and automated spectrophotometry and microscopy systems are being developed to perform large-scale testing of microbial biofilms. Here, we evaluate if the time is ripe to start an integrated omics approach, i.e., the generation and interrogation of large datasets, to biofilms--"biofomics". This omics approach would bring much needed insight into how biofilm formation ability is affected by a number of environmental, physiological and mutational factors and how these factors interplay between themselves in a standardized manner. This could then lead to the creation of a database where biofilm signatures are identified and interrogated. Nevertheless, and before embarking on such an enterprise, the selection of a versatile, robust, high-throughput biofilm growing device and of appropriate methods for biofilm analysis will have to be performed. Whether such device and analytical methods are already available, particularly for complex heterotrophic biofilms is, however, very debatable.
Integrating multi-omic features exploiting Chromosome Conformation Capture data.
Merelli, Ivan; Tordini, Fabio; Drocco, Maurizio; Aldinucci, Marco; Liò, Pietro; Milanesi, Luciano
2015-01-01
The representation, integration, and interpretation of omic data is a complex task, in particular considering the huge amount of information that is daily produced in molecular biology laboratories all around the world. The reason is that sequencing data regarding expression profiles, methylation patterns, and chromatin domains is difficult to harmonize in a systems biology view, since genome browsers only allow coordinate-based representations, discarding functional clusters created by the spatial conformation of the DNA in the nucleus. In this context, recent progresses in high throughput molecular biology techniques and bioinformatics have provided insights into chromatin interactions on a larger scale and offer a formidable support for the interpretation of multi-omic data. In particular, a novel sequencing technique called Chromosome Conformation Capture allows the analysis of the chromosome organization in the cell's natural state. While performed genome wide, this technique is usually called Hi-C. Inspired by service applications such as Google Maps, we developed NuChart, an R package that integrates Hi-C data to describe the chromosomal neighborhood starting from the information about gene positions, with the possibility of mapping on the achieved graphs genomic features such as methylation patterns and histone modifications, along with expression profiles. In this paper we show the importance of the NuChart application for the integration of multi-omic data in a systems biology fashion, with particular interest in cytogenetic applications of these techniques. Moreover, we demonstrate how the integration of multi-omic data can provide useful information in understanding why genes are in certain specific positions inside the nucleus and how epigenetic patterns correlate with their expression.
Vijayakumar, Supreeta; Conway, Max; Lió, Pietro; Angione, Claudio
2017-05-30
Metabolic modelling has entered a mature phase with dozens of methods and software implementations available to the practitioner and the theoretician. It is not easy for a modeller to be able to see the wood (or the forest) for the trees. Driven by this analogy, we here present a 'forest' of principal methods used for constraint-based modelling in systems biology. This provides a tree-based view of methods available to prospective modellers, also available in interactive version at http://modellingmetabolism.net, where it will be kept updated with new methods after the publication of the present manuscript. Our updated classification of existing methods and tools highlights the most promising in the different branches, with the aim to develop a vision of how existing methods could hybridize and become more complex. We then provide the first hands-on tutorial for multi-objective optimization of metabolic models in R. We finally discuss the implementation of multi-view machine learning approaches in poly-omic integration. Throughout this work, we demonstrate the optimization of trade-offs between multiple metabolic objectives, with a focus on omic data integration through machine learning. We anticipate that the combination of a survey, a perspective on multi-view machine learning and a step-by-step R tutorial should be of interest for both the beginner and the advanced user. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Second Era of OMICS in Caries Research: Moving Past the Phase of Disillusionment.
Nascimento, M M; Zaura, E; Mira, A; Takahashi, N; Ten Cate, J M
2017-07-01
Novel approaches using OMICS techniques enable a collective assessment of multiple related biological units, including genes, gene expression, proteins, and metabolites. In the past decade, next-generation sequencing ( NGS) technologies were improved by longer sequence reads and the development of genome databases and user-friendly pipelines for data analysis, all accessible at lower cost. This has generated an outburst of high-throughput data. The application of OMICS has provided more depth to existing hypotheses as well as new insights in the etiology of dental caries. For example, the determination of complete bacterial microbiomes of oral samples rather than selected species, together with oral metatranscriptome and metabolome analyses, supports the viewpoint of dysbiosis of the supragingival biofilms. In addition, metabolome studies have been instrumental in disclosing the contributions of major pathways for central carbon and amino acid metabolisms to biofilm pH homeostasis. New, often noncultured, oral streptococci have been identified, and their phenotypic characterization has revealed candidates for probiotic therapy. Although findings from OMICS research have been greatly informative, problems related to study design, data quality, integration, and reproducibility still need to be addressed. Also, the emergence and continuous updates of these computationally demanding technologies require expertise in advanced bioinformatics for reliable interpretation of data. Despite the obstacles cited above, OMICS research is expected to encourage the discovery of novel caries biomarkers and the development of next-generation diagnostics and therapies for caries control. These observations apply equally to the study of other oral diseases.
Metabolic Network Modeling of Microbial Communities
Biggs, Matthew B.; Medlock, Gregory L.; Kolling, Glynis L.
2015-01-01
Genome-scale metabolic network reconstructions and constraint-based analysis are powerful methods that have the potential to make functional predictions about microbial communities. Current use of genome-scale metabolic networks to characterize the metabolic functions of microbial communities includes species compartmentalization, separating species-level and community-level objectives, dynamic analysis, the “enzyme-soup” approach, multi-scale modeling, and others. There are many challenges inherent to the field, including a need for tools that accurately assign high-level omics signals to individual community members, new automated reconstruction methods that rival manual curation, and novel algorithms for integrating omics data and engineering communities. As technologies and modeling frameworks improve, we expect that there will be proportional advances in the fields of ecology, health science, and microbial community engineering. PMID:26109480
Biomedical data integration in computational drug design and bioinformatics.
Seoane, Jose A; Aguiar-Pulido, Vanessa; Munteanu, Cristian R; Rivero, Daniel; Rabunal, Juan R; Dorado, Julian; Pazos, Alejandro
2013-03-01
In recent years, in the post genomic era, more and more data is being generated by biological high throughput technologies, such as proteomics and transcriptomics. This omics data can be very useful, but the real challenge is to analyze all this data, as a whole, after integrating it. Biomedical data integration enables making queries to different, heterogeneous and distributed biomedical data sources. Data integration solutions can be very useful not only in the context of drug design, but also in biomedical information retrieval, clinical diagnosis, system biology, etc. In this review, we analyze the most common approaches to biomedical data integration, such as federated databases, data warehousing, multi-agent systems and semantic technology, as well as the solutions developed using these approaches in the past few years.
Morioka, Yusuke; Everroad, R. Craig; Shino, Amiu; Matsushima, Akihiro; Haruna, Hideaki; Moriya, Shigeharu; Toyoda, Tetsuro; Kikuchi, Jun
2012-01-01
Ecosystems can be conceptually thought of as interconnected environmental and metabolic systems, in which small molecules to macro-molecules interact through diverse networks. State-of-the-art technologies in post-genomic science offer ways to inspect and analyze this biomolecular web using omics-based approaches. Exploring useful genes and enzymes, as well as biomass resources responsible for anabolism and catabolism within ecosystems will contribute to a better understanding of environmental functions and their application to biotechnology. Here we present ECOMICS, a suite of web-based tools for ECosystem trans-OMICS investigation that target metagenomic, metatranscriptomic, and meta-metabolomic systems, including biomacromolecular mixtures derived from biomass. ECOMICS is made of four integrated webtools. E-class allows for the sequence-based taxonomic classification of eukaryotic and prokaryotic ribosomal data and the functional classification of selected enzymes. FT2B allows for the digital processing of NMR spectra for downstream metabolic or chemical phenotyping. Bm-Char allows for statistical assignment of specific compounds found in lignocellulose-based biomass, and HetMap is a data matrix generator and correlation calculator that can be applied to trans-omics datasets as analyzed by these and other web tools. This web suite is unique in that it allows for the monitoring of biomass metabolism in a particular environment, i.e., from macromolecular complexes (FT2DB and Bm-Char) to microbial composition and degradation (E-class), and makes possible the understanding of relationships between molecular and microbial elements (HetMap). This website is available to the public domain at: https://database.riken.jp/ecomics/. PMID:22319563
LinkedOmics: analyzing multi-omics data within and across 32 cancer types.
Vasaikar, Suhas V; Straub, Peter; Wang, Jing; Zhang, Bing
2018-01-04
The LinkedOmics database contains multi-omics data and clinical data for 32 cancer types and a total of 11 158 patients from The Cancer Genome Atlas (TCGA) project. It is also the first multi-omics database that integrates mass spectrometry (MS)-based global proteomics data generated by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) on selected TCGA tumor samples. In total, LinkedOmics has more than a billion data points. To allow comprehensive analysis of these data, we developed three analysis modules in the LinkedOmics web application. The LinkFinder module allows flexible exploration of associations between a molecular or clinical attribute of interest and all other attributes, providing the opportunity to analyze and visualize associations between billions of attribute pairs for each cancer cohort. The LinkCompare module enables easy comparison of the associations identified by LinkFinder, which is particularly useful in multi-omics and pan-cancer analyses. The LinkInterpreter module transforms identified associations into biological understanding through pathway and network analysis. Using five case studies, we demonstrate that LinkedOmics provides a unique platform for biologists and clinicians to access, analyze and compare cancer multi-omics data within and across tumor types. LinkedOmics is freely available at http://www.linkedomics.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Predicting clinical outcome of neuroblastoma patients using an integrative network-based approach.
Tranchevent, Léon-Charles; Nazarov, Petr V; Kaoma, Tony; Schmartz, Georges P; Muller, Arnaud; Kim, Sang-Yoon; Rajapakse, Jagath C; Azuaje, Francisco
2018-06-07
One of the main current challenges in computational biology is to make sense of the huge amounts of multidimensional experimental data that are being produced. For instance, large cohorts of patients are often screened using different high-throughput technologies, effectively producing multiple patient-specific molecular profiles for hundreds or thousands of patients. We propose and implement a network-based method that integrates such patient omics data into Patient Similarity Networks. Topological features derived from these networks were then used to predict relevant clinical features. As part of the 2017 CAMDA challenge, we have successfully applied this strategy to a neuroblastoma dataset, consisting of genomic and transcriptomic data. In particular, we observe that models built on our network-based approach perform at least as well as state of the art models. We furthermore explore the effectiveness of various topological features and observe, for instance, that redundant centrality metrics can be combined to build more powerful models. We demonstrate that the networks inferred from omics data contain clinically relevant information and that patient clinical outcomes can be predicted using only network topological data. This article was reviewed by Yang-Yu Liu, Tomislav Smuc and Isabel Nepomuceno.
Multilayered Genetic and Omics Dissection of Mitochondrial Activity in a Mouse Reference Population
Wu, Yibo; Williams, Evan G.; Dubuis, Sébastien; Mottis, Adrienne; Jovaisaite, Virginija; Houten, Sander M.; Argmann, Carmen A.; Faridi, Pouya; Wolski, Witold; Kutalik, Zoltán; Zamboni, Nicola; Auwerx, Johan; Aebersold, Ruedi
2014-01-01
SUMMARY The manner by which genotype and environment affect complex phenotypes is one of the fundamental questions in biology. In this study, we quantified the transcriptome—a subset of the metabolome—and, using targeted proteomics, quantified a subset of the liver proteome from 40 strains of the BXD mouse genetic reference population on two diverse diets. We discovered dozens of transcript, protein, and metabolite QTLs, several of which linked to metabolic phenotypes. Most prominently, Dhtkd1 was identified as a primary regulator of 2-aminoadipate, explaining variance in fasted glucose and diabetes status in both mice and humans. These integrated molecular profiles also allowed further characterization of complex pathways, particularly the mitochondrial unfolded protein response (UPRmt). UPRmt shows strikingly variant responses at the transcript and protein level that are remarkably conserved among C. elegans, mice, and humans. Overall, these examples demonstrate the value of an integrated multilayered omics approach to characterize complex metabolic phenotypes. PMID:25215496
Morphomics: An integral part of systems biology of the human placenta.
Mayhew, T M
2015-04-01
The placenta is a transient organ the functioning of which has health consequences far beyond the embryo/fetus. Understanding the biology of any system (organ, organism, single cell, etc) requires a comprehensive and inclusive approach which embraces all the biomedical disciplines and 'omic' technologies and then integrates information obtained from all of them. Among the latest 'omics' is morphomics. The terms morphome and morphomics have been applied incoherently in biology and biomedicine but, recently, they have been given clear and widescale definitions. Morphomics is placed in the context of other 'omics' and its pertinent technologies and tools for sampling and quantitation are reviewed. Emphasis is accorded to the importance of random sampling principles in systems biology and the value of combining 3D quantification with alternative imaging techniques to advance knowledge and understanding of the human placental morphome. By analogy to other 'omes', the morphome is the totality of morphological features within a system and morphomics is the systematic study of those structures. Information about structure is required at multiple levels of resolution in order to understand better the processes by which a given system alters with time, experimental treatment or environmental insult. Therefore, morphomics research includes all imaging techniques at all levels of achievable resolution from gross anatomy and medical imaging, via optical and electron microscopy, to molecular characterisation. Quantification is an important element of all 'omics' studies and, because biological systems exist and operate in 3-dimensional (3D) space, precise descriptions of form, content and spatial relationships require the quantification of structure in 3D. These considerations are relevant to future study contributions to the Human Placenta Project. Copyright © 2015 Elsevier Ltd. All rights reserved.
Putignani, Lorenza; Del Chierico, Federica; Vernocchi, Pamela; Cicala, Michele; Cucchiara, Salvatore; Dallapiccola, Bruno
2016-02-01
Gastrointestinal disorders, although clinically heterogeneous, share pathogenic mechanisms, including genetic susceptibility, impaired gut barrier function, altered microbiota, and environmental triggers (infections, social and behavioral factors, epigenetic control, and diet). Gut microbiota has been studied for inflammatory bowel disease (IBD) and irritable bowel syndrome (IBS) in either children or adults, while modifiable gut microbiota features, acting as risk and premorbid factors along the childhood-adulthood transition, have not been thoroughly investigated so far. Indeed, the relationship between variations of the entire host/microbiota/environmental scenario and clinical phenotypes is still not fully understood. In this respect, tracking gut dysbiosis grading may help deciphering host phenotype-genotype associations and microbiota shifts in an integrated top-down omics-based approach within large-scale pediatric and adult case-control cohorts. Large-scale gut microbiota signatures and host inflammation patterns may be integrated with dietary habits, under genetic and epigenetic constraints, providing gut dysbiosis profiles acting as risk predictors of IBD or IBS in preclinical cases. Tracking dysbiosis supports new personalized/stratified IBD and IBS prevention programmes, generating Decision Support System tools. They include (1) high risk or flare-up recurrence -omics-based dysbiosis profiles; (2) microbial and molecular biomarkers of health and disease; (3) -omics-based pipelines for laboratory medicine diagnostics; (4) health apps for self-management of score-based dietary profiles, which can be shared with clinicians for nutritional habit and lifestyle amendment; (5) -omics profiling data warehousing and public repositories for IBD and IBS profile consultation. Dysbiosis-related indexes can represent novel laboratory and clinical medicine tools preventing or postponing the disease, finally interfering with its natural history.
Reproducibility and Transparency of Omics Research - Impacts on Human Health Risk Assessment
Omics technologies are becoming more widely used in toxicology, necessitating their consideration in human health hazard and risk assessment programs. Today, risk assessors in the United States Environmental Protection Agency’s Integrated Risk Information System (IRIS) Toxicologi...
Omics methods for probing the mode of action of natural phytotoxins
USDA-ARS?s Scientific Manuscript database
For a little over a decade, omics methods (transcriptomics, proteomics, metabolomics, and physionomics) have been used to discover and probe the mode of action of both synthetic and natural phytotoxins. For mode of action discovery, the strategy for each of these approaches is to generate an omics...
Bjornson, Marta; Balcke, Gerd Ulrich; Xiao, Yanmei; de Souza, Amancio; Wang, Jin-Zheng; Zhabinskaya, Dina; Tagkopoulos, Ilias; Tissier, Alain; Dehesh, Katayoon
2017-07-01
To maintain homeostasis in the face of intrinsic and extrinsic insults, cells have evolved elaborate quality control networks to resolve damage at multiple levels. Interorganellar communication is a key requirement for this maintenance, however the underlying mechanisms of this communication have remained an enigma. Here we integrate the outcome of transcriptomic, proteomic, and metabolomics analyses of genotypes including ceh1, a mutant with constitutively elevated levels of both the stress-specific plastidial retrograde signaling metabolite methyl-erythritol cyclodiphosphate (MEcPP) and the defense hormone salicylic acid (SA), as well as the high MEcPP but SA deficient genotype ceh1/eds16, along with corresponding controls. Integration of multi-omic analyses enabled us to delineate the function of MEcPP from SA, and expose the compartmentalized role of this retrograde signaling metabolite in induction of distinct but interdependent signaling cascades instrumental in adaptive responses. Specifically, here we identify strata of MEcPP-sensitive stress-response cascades, among which we focus on selected pathways including organelle-specific regulation of jasmonate biosynthesis; simultaneous induction of synthesis and breakdown of SA; and MEcPP-mediated alteration of cellular redox status in particular glutathione redox balance. Collectively, these integrated multi-omic analyses provided a vehicle to gain an in-depth knowledge of genome-metabolism interactions, and to further probe the extent of these interactions and delineate their functional contributions. Through this approach we were able to pinpoint stress-mediated transcriptional and metabolic signatures and identify the downstream processes modulated by the independent or overlapping functions of MEcPP and SA in adaptive responses. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Spotlight on environmental omics and toxicology: a long way in a short time.
Martyniuk, Christopher J; Simmons, Denina B
2016-09-01
The applications for high throughput omics technologies in environmental science have increased dramatically in recent years. Transcriptomics, proteomics, and metabolomics have been used to study how chemicals in our environment affect both aquatic and terrestrial organisms, and the characterization of molecular initiating events is a significant goal in toxicology to better predict adverse responses to toxicants. This special journal edition demonstrates the scope of the science that leverages omics-based methods in both laboratory and wild populations within the context of environmental toxicology, ranging from fish to mammals. It is important to recognize that the environment comprises one axis of the One Health concept - the idea that human health is unequivocally intertwined to our environment and to the organisms that inhabit that environment. We have much to learn from a comparative approach, and studies that integrate the transcriptome, proteome, and the metabolome are expected to offer the most detailed mechanism-based adverse outcome pathways that are applicable for use in both environmental monitoring and risk assessment. Copyright © 2016 Elsevier Inc. All rights reserved.
Narayanasamy, Shaman; Muller, Emilie E L; Sheik, Abdul R; Wilmes, Paul
2015-05-01
Biological wastewater treatment plants harbour diverse and complex microbial communities which prominently serve as models for microbial ecology and mixed culture biotechnological processes. Integrated omic analyses (combined metagenomics, metatranscriptomics, metaproteomics and metabolomics) are currently gaining momentum towards providing enhanced understanding of community structure, function and dynamics in situ as well as offering the potential to discover novel biological functionalities within the framework of Eco-Systems Biology. The integration of information from genome to metabolome allows the establishment of associations between genetic potential and final phenotype, a feature not realizable by only considering single 'omes'. Therefore, in our opinion, integrated omics will become the future standard for large-scale characterization of microbial consortia including those underpinning biological wastewater treatment processes. Systematically obtained time and space-resolved omic datasets will allow deconvolution of structure-function relationships by identifying key members and functions. Such knowledge will form the foundation for discovering novel genes on a much larger scale compared with previous efforts. In general, these insights will allow us to optimize microbial biotechnological processes either through better control of mixed culture processes or by use of more efficient enzymes in bioengineering applications. © 2015 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
McDermott, Jason E.; Wang, Jing; Mitchell, Hugh; Webb-Robertson, Bobbie-Jo; Hafen, Ryan; Ramey, John; Rodland, Karin D.
2012-01-01
Introduction The advent of high throughput technologies capable of comprehensive analysis of genes, transcripts, proteins and other significant biological molecules has provided an unprecedented opportunity for the identification of molecular markers of disease processes. However, it has simultaneously complicated the problem of extracting meaningful molecular signatures of biological processes from these complex datasets. The process of biomarker discovery and characterization provides opportunities for more sophisticated approaches to integrating purely statistical and expert knowledge-based approaches. Areas covered In this review we will present examples of current practices for biomarker discovery from complex omic datasets and the challenges that have been encountered in deriving valid and useful signatures of disease. We will then present a high-level review of data-driven (statistical) and knowledge-based methods applied to biomarker discovery, highlighting some current efforts to combine the two distinct approaches. Expert opinion Effective, reproducible and objective tools for combining data-driven and knowledge-based approaches to identify predictive signatures of disease are key to future success in the biomarker field. We will describe our recommendations for possible approaches to this problem including metrics for the evaluation of biomarkers. PMID:23335946
EXPOsOMICS: final policy workshop and stakeholder consultation.
Turner, Michelle C; Vineis, Paolo; Seleiro, Eduardo; Dijmarescu, Michaela; Balshaw, David; Bertollini, Roberto; Chadeau-Hyam, Marc; Gant, Timothy; Gulliver, John; Jeong, Ayoung; Kyrtopoulos, Soterios; Martuzzi, Marco; Miller, Gary W; Nawrot, Timothy; Nieuwenhuijsen, Mark; Phillips, David H; Probst-Hensch, Nicole; Samet, Jonathan; Vermeulen, Roel; Vlaanderen, Jelle; Vrijheid, Martine; Wild, Christopher; Kogevinas, Manolis
2018-02-15
The final meeting of the EXPOsOMICS project "Final Policy Workshop and Stakeholder Consultation" took place 28-29 March 2017 to present the main results of the project and discuss their implications both for future research and for regulatory and policy activities. This paper summarizes presentations and discussions at the meeting related with the main results and advances in exposome research achieved through the EXPOsOMICS project; on other parallel research initiatives on the study of the exposome in Europe and in the United States and their complementarity to EXPOsOMICS; lessons learned from these early studies on the exposome and how they may shape the future of research on environmental exposure assessment; and finally the broader implications of exposome research for risk assessment and policy development on environmental exposures. The main results of EXPOsOMICS in relation to studies of the external exposome and internal exposome in relation to both air pollution and water contaminants were presented as well as new technologies for environmental health research (adductomics) and advances in statistical methods. Although exposome research strengthens the scientific basis for policy development, there is a need in terms of showing added value for public health to: improve communication of research results to non-scientific audiences; target research to the broader landscape of societal challenges; and draw applicable conclusions. Priorities for future work include the development and standardization of methodologies and technologies for assessing the external and internal exposome, improved data sharing and integration, and the demonstration of the added value of exposome science over conventional approaches in answering priority policy questions.
2014-01-01
In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false positive conclusions, and it can also mask many true associations that might otherwise advance our understanding of pathology. These issues beg the question: How can we increase the amount of knowledge gained from high throughput genetic data? To address this challenge, we present an approach that complements standard statistical validation methods by drawing attention to both potential false negative and false positive conclusions, as well as providing broad information for directing future research. The Diverse Convergent Evidence approach (DiCE) we propose integrates information from multiple sources (omics, informatics, and laboratory experiments) to estimate the strength of the available corroborating evidence supporting a given association. This process is designed to yield an evidence metric that has utility when etiologic heterogeneity, variable risk factor frequencies, and a variety of observational data imperfections might lead to false conclusions. We provide proof of principle examples in which DiCE identified strong evidence for associations that have established biological importance, when standard validation methods alone did not provide support. If used as an adjunct to standard validation methods this approach can leverage multiple distinct data types to improve genetic risk factor discovery/validation, promote effective science communication, and guide future research directions. PMID:25071867
Amiour, Nardjis; Imbaud, Sandrine; Clément, Gilles; Agier, Nicolas; Zivy, Michel; Valot, Benoît; Balliau, Thierry; Quilleré, Isabelle; Tercé-Laforgue, Thérèse; Dargel-Graffin, Céline; Hirel, Bertrand
2014-11-20
To identify the key elements controlling grain production in maize, it is essential to have an integrated view of the responses to alterations in the main steps of nitrogen assimilation by modification of gene expression. Two maize mutant lines (gln1.3 and gln1.4), deficient in two genes encoding cytosolic glutamine synthetase, a key enzyme involved in nitrogen assimilation, were previously characterized by a reduction of kernel size in the gln1.4 mutant and by a reduction of kernel number in the gln1.3 mutant. In this work, the differences in leaf gene transcripts, proteins and metabolite accumulation in gln1.3 and gln1.4 mutants were studied at two key stages of plant development, in order to identify putative candidate genes, proteins and metabolic pathways contributing on one hand to the control of plant development and on the other to grain production. The most interesting finding in this study is that a number of key plant processes were altered in the gln1.3 and gln1.4 mutants, including a number of major biological processes such as carbon metabolism and transport, cell wall metabolism, and several metabolic pathways and stress responsive and regulatory elements. We also found that the two mutants share common or specific characteristics across at least two or even three of the "omics" considered at the vegetative stage of plant development, or during the grain filling period. This is the first comprehensive molecular and physiological characterization of two cytosolic glutamine synthetase maize mutants using a combined transcriptomic, proteomic and metabolomic approach. We find that the integration of the three "omics" procedures is not straight forward, since developmental and mutant-specific levels of regulation seem to occur from gene expression to metabolite accumulation. However, their potential use is discussed with a view to improving our understanding of nitrogen assimilation and partitioning and its impact on grain production.
Listeriomics: an Interactive Web Platform for Systems Biology of Listeria
Koutero, Mikael; Tchitchek, Nicolas; Cerutti, Franck; Lechat, Pierre; Maillet, Nicolas; Hoede, Claire; Chiapello, Hélène; Gaspin, Christine
2017-01-01
ABSTRACT As for many model organisms, the amount of Listeria omics data produced has recently increased exponentially. There are now >80 published complete Listeria genomes, around 350 different transcriptomic data sets, and 25 proteomic data sets available. The analysis of these data sets through a systems biology approach and the generation of tools for biologists to browse these various data are a challenge for bioinformaticians. We have developed a web-based platform, named Listeriomics, that integrates different tools for omics data analyses, i.e., (i) an interactive genome viewer to display gene expression arrays, tiling arrays, and sequencing data sets along with proteomics and genomics data sets; (ii) an expression and protein atlas that connects every gene, small RNA, antisense RNA, or protein with the most relevant omics data; (iii) a specific tool for exploring protein conservation through the Listeria phylogenomic tree; and (iv) a coexpression network tool for the discovery of potential new regulations. Our platform integrates all the complete Listeria species genomes, transcriptomes, and proteomes published to date. This website allows navigation among all these data sets with enriched metadata in a user-friendly format and can be used as a central database for systems biology analysis. IMPORTANCE In the last decades, Listeria has become a key model organism for the study of host-pathogen interactions, noncoding RNA regulation, and bacterial adaptation to stress. To study these mechanisms, several genomics, transcriptomics, and proteomics data sets have been produced. We have developed Listeriomics, an interactive web platform to browse and correlate these heterogeneous sources of information. Our website will allow listeriologists and microbiologists to decipher key regulation mechanism by using a systems biology approach. PMID:28317029
Voillet, Valentin; Besse, Philippe; Liaubet, Laurence; San Cristobal, Magali; González, Ignacio
2016-10-03
In omics data integration studies, it is common, for a variety of reasons, for some individuals to not be present in all data tables. Missing row values are challenging to deal with because most statistical methods cannot be directly applied to incomplete datasets. To overcome this issue, we propose a multiple imputation (MI) approach in a multivariate framework. In this study, we focus on multiple factor analysis (MFA) as a tool to compare and integrate multiple layers of information. MI involves filling the missing rows with plausible values, resulting in M completed datasets. MFA is then applied to each completed dataset to produce M different configurations (the matrices of coordinates of individuals). Finally, the M configurations are combined to yield a single consensus solution. We assessed the performance of our method, named MI-MFA, on two real omics datasets. Incomplete artificial datasets with different patterns of missingness were created from these data. The MI-MFA results were compared with two other approaches i.e., regularized iterative MFA (RI-MFA) and mean variable imputation (MVI-MFA). For each configuration resulting from these three strategies, the suitability of the solution was determined against the true MFA configuration obtained from the original data and a comprehensive graphical comparison showing how the MI-, RI- or MVI-MFA configurations diverge from the true configuration was produced. Two approaches i.e., confidence ellipses and convex hulls, to visualize and assess the uncertainty due to missing values were also described. We showed how the areas of ellipses and convex hulls increased with the number of missing individuals. A free and easy-to-use code was proposed to implement the MI-MFA method in the R statistical environment. We believe that MI-MFA provides a useful and attractive method for estimating the coordinates of individuals on the first MFA components despite missing rows. MI-MFA configurations were close to the true configuration even when many individuals were missing in several data tables. This method takes into account the uncertainty of MI-MFA configurations induced by the missing rows, thereby allowing the reliability of the results to be evaluated.
Active Interaction Mapping as a tool to elucidate hierarchical functions of biological processes.
Farré, Jean-Claude; Kramer, Michael; Ideker, Trey; Subramani, Suresh
2017-07-03
Increasingly, various 'omics data are contributing significantly to our understanding of novel biological processes, but it has not been possible to iteratively elucidate hierarchical functions in complex phenomena. We describe a general systems biology approach called Active Interaction Mapping (AI-MAP), which elucidates the hierarchy of functions for any biological process. Existing and new 'omics data sets can be iteratively added to create and improve hierarchical models which enhance our understanding of particular biological processes. The best datatypes to further improve an AI-MAP model are predicted computationally. We applied this approach to our understanding of general and selective autophagy, which are conserved in most eukaryotes, setting the stage for the broader application to other cellular processes of interest. In the particular application to autophagy-related processes, we uncovered and validated new autophagy and autophagy-related processes, expanded known autophagy processes with new components, integrated known non-autophagic processes with autophagy and predict other unexplored connections.
Metabolomics through the lens of precision cardiovascular medicine.
Lam, Sin Man; Wang, Yuan; Li, Bowen; Du, Jie; Shui, Guanghou
2017-03-20
Metabolomics, which targets at the extensive characterization and quantitation of global metabolites from both endogenous and exogenous sources, has emerged as a novel technological avenue to advance the field of precision medicine principally driven by genomics-oriented approaches. In particular, metabolomics has revealed the cardinal roles that the environment exerts in driving the progression of major diseases threatening public health. Herein, the existent and potential applications of metabolomics in two key areas of precision cardiovascular medicine will be critically discussed: 1) the use of metabolomics in unveiling novel disease biomarkers and pathological pathways; 2) the contribution of metabolomics in cardiovascular drug development. Major issues concerning the statistical handling of big data generated by metabolomics, as well as its interpretation, will be briefly addressed. Finally, the need for integration of various omics branches and adopting a multi-omics approach to precision medicine will be discussed. Copyright © 2017 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.
Oulas, Anastasis; Minadakis, George; Zachariou, Margarita; Sokratous, Kleitos; Bourdakou, Marilena M; Spyrou, George M
2017-11-27
Systems Bioinformatics is a relatively new approach, which lies in the intersection of systems biology and classical bioinformatics. It focuses on integrating information across different levels using a bottom-up approach as in systems biology with a data-driven top-down approach as in bioinformatics. The advent of omics technologies has provided the stepping-stone for the emergence of Systems Bioinformatics. These technologies provide a spectrum of information ranging from genomics, transcriptomics and proteomics to epigenomics, pharmacogenomics, metagenomics and metabolomics. Systems Bioinformatics is the framework in which systems approaches are applied to such data, setting the level of resolution as well as the boundary of the system of interest and studying the emerging properties of the system as a whole rather than the sum of the properties derived from the system's individual components. A key approach in Systems Bioinformatics is the construction of multiple networks representing each level of the omics spectrum and their integration in a layered network that exchanges information within and between layers. Here, we provide evidence on how Systems Bioinformatics enhances computational therapeutics and diagnostics, hence paving the way to precision medicine. The aim of this review is to familiarize the reader with the emerging field of Systems Bioinformatics and to provide a comprehensive overview of its current state-of-the-art methods and technologies. Moreover, we provide examples of success stories and case studies that utilize such methods and tools to significantly advance research in the fields of systems biology and systems medicine. © The Author 2017. Published by Oxford University Press.
In this commentary we present the findings from an international consortium on fish toxicogenomics sponsored by the UK Natural Environment Research Council (NERC) with an objective of moving omic technologies into chemical risk assessment and environmental monitoring. Objectiv...
Background: In this commentary we present the findings from an international consortium on fish toxicogenomics sponsored by the UK Natural Environment Research Council (NERC) with a remit of moving omic technologies into chemical risk assessment and environmental monitoring. Obj...
Loziuk, Philip; Meier, Florian; Johnson, Caroline
2016-01-01
Quantitative methods for detection of biological molecules are needed more than ever before in the emerging age of “omics” and “big data.” Here, we provide an integrated approach for systematic analysis of the “lipidome” in tissue. To test our approach in a biological context, we utilized brain tissue selectively deficient for the transcription factor Specificity Protein 2 (Sp2). Conditional deletion of Sp2 in the mouse cerebral cortex results in developmental deficiencies including disruption of lipid metabolism. Silver (Ag) cationization was implemented for infrared matrix-assisted laser desorption electrospray ionization (IR-MALDESI) to enhance the ion abundances for olefinic lipids, as these have been linked to regulation by Sp2. Combining Ag-doped and conventional IR-MALDESI imaging, this approach was extended to IR-MALDESI imaging of embryonic mouse brains. Further, our imaging technique was combined with bottom-up shotgun proteomic LC-MS/MS analysis and western blot for comparing Sp2 conditional knockout (Sp2-cKO) and wild-type (WT) cortices of tissue sections. This provided an integrated omics dataset which revealed many specific changes to fundamental cellular processes and biosynthetic pathways. In particular, step-specific altered abundances of nucleotides, lipids, and associated proteins were observed in the cerebral cortices of Sp2-cKO embryos. PMID:26942738
DOE Office of Scientific and Technical Information (OSTI.GOV)
McDermott, Jason E.; Wang, Jing; Mitchell, Hugh D.
2013-01-01
The advent of high throughput technologies capable of comprehensive analysis of genes, transcripts, proteins and other significant biological molecules has provided an unprecedented opportunity for the identification of molecular markers of disease processes. However, it has simultaneously complicated the problem of extracting meaningful signatures of biological processes from these complex datasets. The process of biomarker discovery and characterization provides opportunities both for purely statistical and expert knowledge-based approaches and would benefit from improved integration of the two. Areas covered In this review we will present examples of current practices for biomarker discovery from complex omic datasets and the challenges thatmore » have been encountered. We will then present a high-level review of data-driven (statistical) and knowledge-based methods applied to biomarker discovery, highlighting some current efforts to combine the two distinct approaches. Expert opinion Effective, reproducible and objective tools for combining data-driven and knowledge-based approaches to biomarker discovery and characterization are key to future success in the biomarker field. We will describe our recommendations of possible approaches to this problem including metrics for the evaluation of biomarkers.« less
Omics/systems biology and cancer cachexia.
Gallagher, Iain J; Jacobi, Carsten; Tardif, Nicolas; Rooyackers, Olav; Fearon, Kenneth
2016-06-01
Cancer cachexia is a complex syndrome generated by interaction between the host and tumour cells with a background of treatment effects and toxicity. The complexity of the physiological pathways likely involved in cancer cachexia necessitates a holistic view of the relevant biology. Emergent properties are characteristic of complex systems with the result that the end result is more than the sum of its parts. Recognition of the importance of emergent properties in biology led to the concept of systems biology wherein a holistic approach is taken to the biology at hand. Systems biology approaches will therefore play an important role in work to uncover key mechanisms with therapeutic potential in cancer cachexia. The 'omics' technologies provide a global view of biological systems. Genomics, transcriptomics, proteomics, lipidomics and metabolomics approaches all have application in the study of cancer cachexia to generate systems level models of the behaviour of this syndrome. The current work reviews recent applications of these technologies to muscle atrophy in general and cancer cachexia in particular with a view to progress towards integration of these approaches to better understand the pathology and potential treatment pathways in cancer cachexia. Copyright © 2016. Published by Elsevier Ltd.
CAS-viewer: web-based tool for splicing-guided integrative analysis of multi-omics cancer data.
Han, Seonggyun; Kim, Dongwook; Kim, Youngjun; Choi, Kanghoon; Miller, Jason E; Kim, Dokyoon; Lee, Younghee
2018-04-20
The Cancer Genome Atlas (TCGA) project is a public resource that provides transcriptomic, DNA sequence, methylation, and clinical data for 33 cancer types. Transforming the large size and high complexity of TCGA cancer genome data into integrated knowledge can be useful to promote cancer research. Alternative splicing (AS) is a key regulatory mechanism of genes in human cancer development and in the interaction with epigenetic factors. Therefore, AS-guided integration of existing TCGA data sets will make it easier to gain insight into the genetic architecture of cancer risk and related outcomes. There are already existing tools analyzing and visualizing alternative mRNA splicing patterns for large-scale RNA-seq experiments. However, these existing web-based tools are limited to the analysis of individual TCGA data sets at a time, such as only transcriptomic information. We implemented CAS-viewer (integrative analysis of Cancer genome data based on Alternative Splicing), a web-based tool leveraging multi-cancer omics data from TCGA. It illustrates alternative mRNA splicing patterns along with methylation, miRNAs, and SNPs, and then provides an analysis tool to link differential transcript expression ratio to methylation, miRNA, and splicing regulatory elements for 33 cancer types. Moreover, one can analyze AS patterns with clinical data to identify potential transcripts associated with different survival outcome for each cancer. CAS-viewer is a web-based application for transcript isoform-driven integration of multi-omics data in multiple cancer types and will aid in the visualization and possible discovery of biomarkers for cancer by integrating multi-omics data from TCGA.
Integrated network analysis and effective tools in plant systems biology
Fukushima, Atsushi; Kanaya, Shigehiko; Nishida, Kozo
2014-01-01
One of the ultimate goals in plant systems biology is to elucidate the genotype-phenotype relationship in plant cellular systems. Integrated network analysis that combines omics data with mathematical models has received particular attention. Here we focus on the latest cutting-edge computational advances that facilitate their combination. We highlight (1) network visualization tools, (2) pathway analyses, (3) genome-scale metabolic reconstruction, and (4) the integration of high-throughput experimental data and mathematical models. Multi-omics data that contain the genome, transcriptome, proteome, and metabolome and mathematical models are expected to integrate and expand our knowledge of complex plant metabolisms. PMID:25408696
An inference method from multi-layered structure of biomedical data.
Kim, Myungjun; Nam, Yonghyun; Shin, Hyunjung
2017-05-18
Biological system is a multi-layered structure of omics with genome, epigenome, transcriptome, metabolome, proteome, etc., and can be further stretched to clinical/medical layers such as diseasome, drugs, and symptoms. One advantage of omics is that we can figure out an unknown component or its trait by inferring from known omics components. The component can be inferred by the ones in the same level of omics or the ones in different levels. To implement the inference process, an algorithm that can be applied to the multi-layered complex system is required. In this study, we develop a semi-supervised learning algorithm that can be applied to the multi-layered complex system. In order to verify the validity of the inference, it was applied to the prediction problem of disease co-occurrence with a two-layered network composed of symptom-layer and disease-layer. The symptom-disease layered network obtained a fairly high value of AUC, 0.74, which is regarded as noticeable improvement when comparing 0.59 AUC of single-layered disease network. If further stretched to whole layered structure of omics, the proposed method is expected to produce more promising results. This research has novelty in that it is a new integrative algorithm that incorporates the vertical structure of omics data, on contrary to other existing methods that integrate the data in parallel fashion. The results can provide enhanced guideline for disease co-occurrence prediction, thereby serve as a valuable tool for inference process of multi-layered biological system.
West, Allyson A; Caudill, Marie A
2014-08-01
Nutritional genomics, defined as the study of reciprocal interactions among nutrients, metabolic intermediates, and the genome, along with other closely related nutritional -omic fields (eg, epigenomics, transcriptomics, and metabolomics) have become vital areas of nutrition study and knowledge. Utilizing results from human metabolic research on the essential nutrient choline, this article illustrates how nutrigenetic, nutrigenomic, and inter-related -omic research has provided new insights into choline metabolism and its effect on physiologic processes. Findings from highlighted choline research are also discussed in the context of translation to clinical and public health nutrition applications. Overall, this article underscores the utility of -omic research methods in elucidating nutrient metabolism as well as the potential for nutritional -omic concepts and discoveries to be broadly applied in nutritional practice. Copyright © 2014 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
Accessing and integrating data and knowledge for biomedical research.
Burgun, A; Bodenreider, O
2008-01-01
To review the issues that have arisen with the advent of translational research in terms of integration of data and knowledge, and survey current efforts to address these issues. Using examples form the biomedical literature, we identified new trends in biomedical research and their impact on bioinformatics. We analyzed the requirements for effective knowledge repositories and studied issues in the integration of biomedical knowledge. New diagnostic and therapeutic approaches based on gene expression patterns have brought about new issues in the statistical analysis of data, and new workflows are needed are needed to support translational research. Interoperable data repositories based on standard annotations, infrastructures and services are needed to support the pooling and meta-analysis of data, as well as their comparison to earlier experiments. High-quality, integrated ontologies and knowledge bases serve as a source of prior knowledge used in combination with traditional data mining techniques and contribute to the development of more effective data analysis strategies. As biomedical research evolves from traditional clinical and biological investigations towards omics sciences and translational research, specific needs have emerged, including integrating data collected in research studies with patient clinical data, linking omics knowledge with medical knowledge, modeling the molecular basis of diseases, and developing tools that support in-depth analysis of research data. As such, translational research illustrates the need to bridge the gap between bioinformatics and medical informatics, and opens new avenues for biomedical informatics research.
Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome.
Mahesh, Hirehally Basavarajegowda; Subba, Pratigya; Advani, Jayshree; Shirke, Meghana Deepak; Loganathan, Ramya Malarini; Chandana, Shankara Lingu; Shilpa, Siddappa; Chatterjee, Oishi; Pinto, Sneha Maria; Prasad, Thottethodi Subrahmanya Keshava; Gowda, Malali
2018-04-01
Indian sandalwood ( Santalum album ) is an important tropical evergreen tree known for its fragrant heartwood-derived essential oil and its valuable carving wood. Here, we applied an integrated genomic, transcriptomic, and proteomic approach to assemble and annotate the Indian sandalwood genome. Our genome sequencing resulted in the establishment of a draft map of the smallest genome for any woody tree species to date (221 Mb). The genome annotation predicted 38,119 protein-coding genes and 27.42% repetitive DNA elements. In-depth proteome analysis revealed the identities of 72,325 unique peptides, which confirmed 10,076 of the predicted genes. The addition of transcriptomic and proteogenomic approaches resulted in the identification of 53 novel proteins and 34 gene-correction events that were missed by genomic approaches. Proteogenomic analysis also helped in reassigning 1,348 potential noncoding RNAs as bona fide protein-coding messenger RNAs. Gene expression patterns at the RNA and protein levels indicated that peptide sequencing was useful in capturing proteins encoded by nuclear and organellar genomes alike. Mass spectrometry-based proteomic evidence provided an unbiased approach toward the identification of proteins encoded by organellar genomes. Such proteins are often missed in transcriptome data sets due to the enrichment of only messenger RNAs that contain poly(A) tails. Overall, the use of integrated omic approaches enhanced the quality of the assembly and annotation of this nonmodel plant genome. The availability of genomic, transcriptomic, and proteomic data will enhance genomics-assisted breeding, germplasm characterization, and conservation of sandalwood trees. © 2018 American Society of Plant Biologists. All Rights Reserved.
Taylor, Jacquelyn Y; Barcelona de Mendoza, Veronica
2018-01-01
The purpose of this article is to provide an overview of the role of nurse scientists in -omics-based research and to promote discussion around the conduct of -omics-based nursing research in minority communities. Nurses are advocates, educators, practitioners, scientists, and researchers, and are crucial to the design and successful implementation of -omics studies, particularly including minority communities. The contribution of nursing in this area of research is crucial to reducing health disparities. In this article, challenges in the conduct of -omics-based research in minority communities are discussed, and recommendations for improving diversity among nurse scientists, study participants, and utilization of training and continuing education programs in -omics are provided. Many opportunities exist for nurses to increase their knowledge in -omics and to continue to build the ranks of nurse scientists as leaders in -omics-based research. In order to work successfully with communities of color, nurse scientists must advocate for participation in the Precision Medicine Initiative, improve representation of nurse faculty of color, and increase utilization of training programs in -omics and lead such initiatives. All nursing care has the potential to be affected by the era of -omics and precision health. By taking an inclusive approach to diversity in nursing and -omics research, nurses will be well placed to be leaders in reducing health disparities through research, practice, and education. © 2017 Sigma Theta Tau International.
Fitó, Montserrat; Melander, Olle; Martínez, José Alfredo; Toledo, Estefanía; Carpéné, Christian; Corella, Dolores
2016-01-01
Intervention with Mediterranean diet (MedDiet) has provided a high level of evidence in primary prevention of cardiovascular events. Besides enhancing protection from classical risk factors, an improvement has also been described in a number of non-classical ones. Benefits have been reported on biomarkers of oxidation, inflammation, cellular adhesion, adipokine production, and pro-thrombotic state. Although the benefits of the MedDiet have been attributed to its richness in antioxidants, the mechanisms by which it exercises its beneficial effects are not well known. It is thought that the integration of omics including genomics, transcriptomics, epigenomics, and metabolomics, into studies analyzing nutrition and cardiovascular diseases will provide new clues regarding these mechanisms. However, omics integration is still in its infancy. Currently, some single-omics analyses have provided valuable data, mostly in the field of genomics. Thus, several gene-diet interactions in determining both intermediate (plasma lipids, etc.) and final cardiovascular phenotypes (stroke, myocardial infarction, etc.) have been reported. However, few studies have analyzed changes in gene expression and, moreover very few have focused on epigenomic or metabolomic biomarkers related to the MedDiet. Nevertheless, these preliminary results can help to better understand the inter-individual differences in cardiovascular risk and dietary response for further applications in personalized nutrition. PMID:27598147
From Sample to Multi-Omics Conclusions in under 48 Hours
Navas-Molina, Jose A.; Hyde, Embriette R.; Vázquez-Baeza, Yoshiki; Humphrey, Greg; Gaffney, James; Minich, Jeremiah J.; Melnik, Alexey V.; Herschend, Jakob; DeReus, Jeff; Durant, Austin; Dutton, Rachel J.; Khosroheidari, Mahdieh; Green, Clifford; da Silva, Ricardo; Dorrestein, Pieter C.; Knight, Rob
2016-01-01
ABSTRACT Multi-omics methods have greatly advanced our understanding of the biological organism and its microbial associates. However, they are not routinely used in clinical or industrial applications, due to the length of time required to generate and analyze omics data. Here, we applied a novel integrated omics pipeline for the analysis of human and environmental samples in under 48 h. Human subjects that ferment their own foods provided swab samples from skin, feces, oral cavity, fermented foods, and household surfaces to assess the impact of home food fermentation on their microbial and chemical ecology. These samples were analyzed with 16S rRNA gene sequencing, inferred gene function profiles, and liquid chromatography-tandem mass spectrometry (LC-MS/MS) metabolomics through the Qiita, PICRUSt, and GNPS pipelines, respectively. The human sample microbiomes clustered with the corresponding sample types in the American Gut Project (http://www.americangut.org), and the fermented food samples produced a separate cluster. The microbial communities of the household surfaces were primarily sourced from the fermented foods, and their consumption was associated with increased gut microbial diversity. Untargeted metabolomics revealed that human skin and fermented food samples had separate chemical ecologies and that stool was more similar to fermented foods than to other sample types. Metabolites from the fermented foods, including plant products such as procyanidin and pheophytin, were present in the skin and stool samples of the individuals consuming the foods. Some food metabolites were modified during digestion, and others were detected in stool intact. This study represents a first-of-its-kind analysis of multi-omics data that achieved time intervals matching those of classic microbiological culturing. IMPORTANCE Polymicrobial infections are difficult to diagnose due to the challenge in comprehensively cultivating the microbes present. Omics methods, such as 16S rRNA sequencing, metagenomics, and metabolomics, can provide a more complete picture of a microbial community and its metabolite production, without the biases and selectivity of microbial culture. However, these advanced methods have not been applied to clinical or industrial microbiology or other areas where complex microbial dysbioses require immediate intervention. The reason for this is the length of time required to generate and analyze omics data. Here, we describe the development and application of a pipeline for multi-omics data analysis in time frames matching those of the culture-based approaches often used for these applications. This study applied multi-omics methods effectively in clinically relevant time frames and sets a precedent toward their implementation in clinical medicine and industrial microbiology. PMID:27822524
Progress in oral personalized medicine: contribution of 'omics'.
Glurich, Ingrid; Acharya, Amit; Brilliant, Murray H; Shukla, Sanjay K
2015-01-01
Precision medicine (PM), representing clinically applicable personalized medicine, proactively integrates and interprets multidimensional personal health data, including clinical, 'omics', and environmental profiles, into clinical practice. Realization of PM remains in progress. The focus of this review is to provide a descriptive narrative overview of: 1) the current status of oral personalized medicine; and 2) recent advances in genomics and related 'omic' and emerging research domains contributing to advancing oral-systemic PM, with special emphasis on current understanding of oral microbiomes. A scan of peer-reviewed literature describing oral PM or 'omic'-based research conducted on humans/data published in English within the last 5 years in journals indexed in the PubMed database was conducted using mesh search terms. An evidence-based approach was used to report on recent advances with potential to advance PM in the context of historical critical and systematic reviews to delineate current state-of-the-art technologies. Special focus was placed on oral microbiome research associated with health and disease states, emerging research domains, and technological advances, which are positioning realization of PM. This review summarizes: 1) evolving conceptualization of personalized medicine; 2) emerging insight into roles of oral infectious and inflammatory processes as contributors to both oral and systemic diseases; 3) community shifts in microbiota that may contribute to disease; 4) evidence pointing to new uncharacterized potential oral pathogens; 5) advances in technological approaches to 'omics' research that will accelerate PM; 6) emerging research domains that expand insights into host-microbe interaction including inter-kingdom communication, systems and network analysis, and salivaomics; and 7) advances in informatics and big data analysis capabilities to facilitate interpretation of host and microbiome-associated datasets. Furthermore, progress in clinically applicable screening assays and biomarker definition to inform clinical care are briefly explored. Advancement of oral PM currently remains in research and discovery phases. Although substantive progress has been made in advancing the understanding of the role of microbiome dynamics in health and disease and is being leveraged to advance early efforts at clinical translation, further research is required to discern interpretable constituency patterns in the complex interactions of these microbial communities in health and disease. Advances in biotechnology and bioinformatics facilitating novel approaches to rapid analysis and interpretation of large datasets are providing new insights into oral health and disease, potentiating clinical application and advancing realization of PM within the next decade.
Perspectives of Physiology as a Discipline from Senior-Level Millennial-Generation Students
ERIC Educational Resources Information Center
Steury, Michael D.; Poteracki, James M.; Kelly, Kevin L.; Wehrwein, Erica A.
2015-01-01
In the last several decades, there has been a shift in the mindset of research structure from classical "systems or integrative biology" to more molecular focused "-omics" study. A recent topic of debate in physiological societies has been whether or not the "-omic" revolution has delivered in its promises in both…
A novel approach for data integration and disease subtyping
Tagett, Rebecca; Diaz, Diana
2017-01-01
Advances in high-throughput technologies allow for measurements of many types of omics data, yet the meaningful integration of several different data types remains a significant challenge. Another important and difficult problem is the discovery of molecular disease subtypes characterized by relevant clinical differences, such as survival. Here we present a novel approach, called perturbation clustering for data integration and disease subtyping (PINS), which is able to address both challenges. The framework has been validated on thousands of cancer samples, using gene expression, DNA methylation, noncoding microRNA, and copy number variation data available from the Gene Expression Omnibus, the Broad Institute, The Cancer Genome Atlas (TCGA), and the European Genome-Phenome Archive. This simultaneous subtyping approach accurately identifies known cancer subtypes and novel subgroups of patients with significantly different survival profiles. The results were obtained from genome-scale molecular data without any other type of prior knowledge. The approach is sufficiently general to replace existing unsupervised clustering approaches outside the scope of bio-medical research, with the additional ability to integrate multiple types of data. PMID:29066617
Novel insights, challenges and practical implications of DOHaD-omics research.
Hodyl, Nicolette A; Muhlhausler, Beverly
2016-02-15
Research investigating the developmental origins of health and disease (DOHaD) has never had the technology to investigate physiology in such a data-rich capacity and at such a microlevel as it does now. A symposium at the inaugural meeting of the DOHaD Society of Australia and New Zealand outlined the advantages and challenges of using "-omics" technologies in DOHaD research. DOHaD studies with -omics approaches to generate large, rich datasets were discussed. We discuss implications for policy and practice and make recommendations to facilitate successful translation of results of future DOHaD-omics studies.
Ozdemir, Vural; Motulsky, Arno G; Kolker, Eugene; Godard, Béatrice
2009-02-01
The relationships between food, nutrition science, and health outcomes have been mapped over the past century. Genomic variation among individuals and populations is a new factor that enriches and challenges our understanding of these complex relationships. Hence, the confluence of nutritional science and genomics-nutrigenomics--was the focus of the OMICS: A Journal of Integrative Biology in December 2008 (Part 1). The 2009 Special Issue (Part 2) concludes the analysis of nutrigenomics research and innovations. Together, these two issues expand the scope and depth of critical scholarship in nutrigenomics, in keeping with an integrated multidisciplinary analysis across the bioscience, omics technology, social, ethical, intellectual property and policy dimensions. Historically, the field of pharmacogenetics provided the first examples of specifically identifiable gene variants predisposing to unexpected responses to drugs since the 1950s. Brewer coined the term ecogenetics in 1971 to broaden the concept of gene-environment interactions from drugs and nutrition to include environmental agents in general. In the mid-1990s, introduction of high-throughput technologies led to the terms pharmacogenomics, nutrigenomics and ecogenomics to describe, respectively, the contribution of genomic variability to differential responses to drugs, food, and environment defined in the broadest sense. The distinctions, if any, between these newer fields (e.g., nutrigenomics) and their predecessors (e.g., nutrigenetics) remain to be delineated. For nutrigenomics, its reliance on genome-wide analyses may lead to detection of new biological mechanisms governing host response to food. Recognizing "genome-environment interactions" as the conceptual thread that connects and runs through pharmacogenomics, nutrigenomics, and ecogenomics may contribute toward anticipatory governance and prospective real-time analysis of these omics fields. Such real-time analysis of omics technologies and innovations is crucial, because it can influence and positively shape them as these approaches develop, and help avoid predictable pitfalls, and thus ensure their effective and ethical application in the laboratory, clinic, and society.
Holtfreter, Silva; Kolata, Julia; Stentzel, Sebastian; Bauerfeind, Stephanie; Schmidt, Frank; Sundaramoorthy, Nandakumar; Bröker, Barbara M.
2016-01-01
Staphylococcus aureus is a dangerous pathogen both in hospitals and in the community. Due to the crisis of antibiotic resistance, there is an urgent need for new strategies to combat S. aureus infections, such as vaccination. Increasing our knowledge about the mechanisms of protection will be key for the successful prevention or treatment of S. aureus invasion. Omics technologies generate a comprehensive picture of the physiological and pathophysiological processes within cells, tissues, organs, organisms and even populations. This review provides an overview of the contribution of genomics, transcriptomics, proteomics, metabolomics and immunoproteomics to the current understanding of S. aureus‑host interaction, with a focus on the adaptive immune response to the microorganism. While antibody responses during colonization and infection have been analyzed in detail using immunoproteomics, the full potential of omics technologies has not been tapped yet in terms of T-cells. Omics technologies promise to speed up vaccine development by enabling reverse vaccinology approaches. In consequence, omics technologies are powerful tools for deepening our understanding of the “superbug” S. aureus and for improving its control. PMID:28248221
Ferguson, Jane F; Allayee, Hooman; Gerszten, Robert E; Ideraabdullah, Folami; Kris-Etherton, Penny M; Ordovás, José M; Rimm, Eric B; Wang, Thomas J; Bennett, Brian J
2016-06-01
Cardiometabolic diseases are the leading cause of death worldwide and are strongly linked to both genetic and nutritional factors. The field of nutrigenomics encompasses multiple approaches aimed at understanding the effects of diet on health or disease development, including nutrigenetic studies investigating the relationship between genetic variants and diet in modulating cardiometabolic risk, as well as the effects of dietary components on multiple "omic" measures, including transcriptomics, metabolomics, proteomics, lipidomics, epigenetic modifications, and the microbiome. Here, we describe the current state of the field of nutrigenomics with respect to cardiometabolic disease research and outline a direction for the integration of multiple omics techniques in future nutrigenomic studies aimed at understanding mechanisms and developing new therapeutic options for cardiometabolic disease treatment and prevention. © 2016 American Heart Association, Inc.
Caie, Peter D; Harrison, David J
2016-01-01
The field of pathology is rapidly transforming from a semiquantitative and empirical science toward a big data discipline. Large data sets from across multiple omics fields may now be extracted from a patient's tissue sample. Tissue is, however, complex, heterogeneous, and prone to artifact. A reductionist view of tissue and disease progression, which does not take this complexity into account, may lead to single biomarkers failing in clinical trials. The integration of standardized multi-omics big data and the retention of valuable information on spatial heterogeneity are imperative to model complex disease mechanisms. Mathematical modeling through systems pathology approaches is the ideal medium to distill the significant information from these large, multi-parametric, and hierarchical data sets. Systems pathology may also predict the dynamical response of disease progression or response to therapy regimens from a static tissue sample. Next-generation pathology will incorporate big data with systems medicine in order to personalize clinical practice for both prognostic and predictive patient care.
Multi-platform 'Omics Analysis of Human Ebola Virus Disease Pathogenesis.
Eisfeld, Amie J; Halfmann, Peter J; Wendler, Jason P; Kyle, Jennifer E; Burnum-Johnson, Kristin E; Peralta, Zuleyma; Maemura, Tadashi; Walters, Kevin B; Watanabe, Tokiko; Fukuyama, Satoshi; Yamashita, Makoto; Jacobs, Jon M; Kim, Young-Mo; Casey, Cameron P; Stratton, Kelly G; Webb-Robertson, Bobbie-Jo M; Gritsenko, Marina A; Monroe, Matthew E; Weitz, Karl K; Shukla, Anil K; Tian, Mingyuan; Neumann, Gabriele; Reed, Jennifer L; van Bakel, Harm; Metz, Thomas O; Smith, Richard D; Waters, Katrina M; N'jai, Alhaji; Sahr, Foday; Kawaoka, Yoshihiro
2017-12-13
The pathogenesis of human Ebola virus disease (EVD) is complex. EVD is characterized by high levels of virus replication and dissemination, dysregulated immune responses, extensive virus- and host-mediated tissue damage, and disordered coagulation. To clarify how host responses contribute to EVD pathophysiology, we performed multi-platform 'omics analysis of peripheral blood mononuclear cells and plasma from EVD patients. Our results indicate that EVD molecular signatures overlap with those of sepsis, imply that pancreatic enzymes contribute to tissue damage in fatal EVD, and suggest that Ebola virus infection may induce aberrant neutrophils whose activity could explain hallmarks of fatal EVD. Moreover, integrated biomarker prediction identified putative biomarkers from different data platforms that differentiated survivors and fatalities early after infection. This work reveals insight into EVD pathogenesis, suggests an effective approach for biomarker identification, and provides an important community resource for further analysis of human EVD severity. Copyright © 2017 Elsevier Inc. All rights reserved.
Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar
2013-11-01
Microalgal biofuels offer great promise in contributing to the growing global demand for alternative sources of renewable energy. However, to make algae-based fuels cost competitive with petroleum, lipid production capabilities of microalgae need to improve substantially. Recent progress in algal genomics, in conjunction with other "omic" approaches, has accelerated the ability to identify metabolic pathways and genes that are potential targets in the development of genetically engineered microalgal strains with optimum lipid content. In this review, we summarize the current bioeconomic status of global biofuel feedstocks with particular reference to the role of "omics" in optimizing sustainable biofuel production. We also provide an overview of the various databases and bioinformatics resources available to gain a more complete understanding of lipid metabolism across algal species, along with the recent contributions of "omic" approaches in the metabolic pathway studies for microalgal biofuel production.
The condition-dependent transcriptional network in Escherichia coli.
Lemmens, Karen; De Bie, Tijl; Dhollander, Thomas; Monsieurs, Pieter; De Moor, Bart; Collado-Vides, Julio; Engelen, Kristof; Marchal, Kathleen
2009-03-01
Thanks to the availability of high-throughput omics data, bioinformatics approaches are able to hypothesize thus-far undocumented genetic interactions. However, due to the amount of noise in these data, inferences based on a single data source are often unreliable. A popular approach to overcome this problem is to integrate different data sources. In this study, we describe DISTILLER, a novel framework for data integration that simultaneously analyzes microarray and motif information to find modules that consist of genes that are co-expressed in a subset of conditions, and their corresponding regulators. By applying our method on publicly available data, we evaluated the condition-specific transcriptional network of Escherichia coli. DISTILLER confirmed 62% of 736 interactions described in RegulonDB, and 278 novel interactions were predicted.
Systems Toxicology: Real World Applications and Opportunities.
Hartung, Thomas; FitzGerald, Rex E; Jennings, Paul; Mirams, Gary R; Peitsch, Manuel C; Rostami-Hodjegan, Amin; Shah, Imran; Wilks, Martin F; Sturla, Shana J
2017-04-17
Systems Toxicology aims to change the basis of how adverse biological effects of xenobiotics are characterized from empirical end points to describing modes of action as adverse outcome pathways and perturbed networks. Toward this aim, Systems Toxicology entails the integration of in vitro and in vivo toxicity data with computational modeling. This evolving approach depends critically on data reliability and relevance, which in turn depends on the quality of experimental models and bioanalysis techniques used to generate toxicological data. Systems Toxicology involves the use of large-scale data streams ("big data"), such as those derived from omics measurements that require computational means for obtaining informative results. Thus, integrative analysis of multiple molecular measurements, particularly acquired by omics strategies, is a key approach in Systems Toxicology. In recent years, there have been significant advances centered on in vitro test systems and bioanalytical strategies, yet a frontier challenge concerns linking observed network perturbations to phenotypes, which will require understanding pathways and networks that give rise to adverse responses. This summary perspective from a 2016 Systems Toxicology meeting, an international conference held in the Alps of Switzerland, describes the limitations and opportunities of selected emerging applications in this rapidly advancing field. Systems Toxicology aims to change the basis of how adverse biological effects of xenobiotics are characterized, from empirical end points to pathways of toxicity. This requires the integration of in vitro and in vivo data with computational modeling. Test systems and bioanalytical technologies have made significant advances, but ensuring data reliability and relevance is an ongoing concern. The major challenge facing the new pathway approach is determining how to link observed network perturbations to phenotypic toxicity.
Systems Toxicology: Real World Applications and Opportunities
2017-01-01
Systems Toxicology aims to change the basis of how adverse biological effects of xenobiotics are characterized from empirical end points to describing modes of action as adverse outcome pathways and perturbed networks. Toward this aim, Systems Toxicology entails the integration of in vitro and in vivo toxicity data with computational modeling. This evolving approach depends critically on data reliability and relevance, which in turn depends on the quality of experimental models and bioanalysis techniques used to generate toxicological data. Systems Toxicology involves the use of large-scale data streams (“big data”), such as those derived from omics measurements that require computational means for obtaining informative results. Thus, integrative analysis of multiple molecular measurements, particularly acquired by omics strategies, is a key approach in Systems Toxicology. In recent years, there have been significant advances centered on in vitro test systems and bioanalytical strategies, yet a frontier challenge concerns linking observed network perturbations to phenotypes, which will require understanding pathways and networks that give rise to adverse responses. This summary perspective from a 2016 Systems Toxicology meeting, an international conference held in the Alps of Switzerland, describes the limitations and opportunities of selected emerging applications in this rapidly advancing field. Systems Toxicology aims to change the basis of how adverse biological effects of xenobiotics are characterized, from empirical end points to pathways of toxicity. This requires the integration of in vitro and in vivo data with computational modeling. Test systems and bioanalytical technologies have made significant advances, but ensuring data reliability and relevance is an ongoing concern. The major challenge facing the new pathway approach is determining how to link observed network perturbations to phenotypic toxicity. PMID:28362102
DOE Office of Scientific and Technical Information (OSTI.GOV)
Boaro, Amy A.; Kim, Young-Mo; Konopka, Allan
2014-12-01
Integrated ‘omics have been used on pure cultures and co-cultures, yet they have not been applied to complex microbial communities to examine questions of perturbation response. In this study, we used integrated ‘omics to measure the perturbation response of a cellulose-degrading bioreactor community fed with microcrystalline cellulose (Avicel). We predicted that a pH decrease by addition of a pulse of acid would reduce microbial community diversity and temporarily reduce reactor function such as cellulose degradation. However, 16S rDNA pyrosequencing results revealed increased alpha diversity in the microbial community after the perturbation, and a persistence of the dominant community members overmore » the duration of the experiment. Proteomics results showed a decrease in activity of proteins associated with Fibrobacter succinogenes two days after the perturbation followed by increased protein abundances six days after the perturbation. The decrease in cellulolytic activity suggested by the proteomics was confirmed by the accumulation of Avicel in the reactor. Metabolomics showed a pattern similar to that of the proteome, with amino acid production decreasing two days after the perturbation and increasing after six days. This study demonstrated that community ‘omics data provides valuable information about the interactions and function of anaerobic cellulolytic community members after a perturbation.« less
Kambouris, Manousos E; Manoussopoulos, Yiannis; Kantzanou, Maria; Velegraki, Aristea; Gaitanis, Georgios; Arabatzis, Michalis; Patrinos, George P
2018-01-01
Global Catastrophic Biological Risks (GCBRs) refer to biological events-natural, deliberate, and accidental-of a global and lasting impact. This challenges the life scientists to raise their game on two hitherto neglected innovation frontiers: a veritable "futures" thinking to "think the unthinkable," and "systems thinking" so as to see both the trees and the forest when it comes to GCBRs. This innovation analysis article outlines the promise of Omics systems science biotechnologies, for example, to deploy rapid fire diagnostics for health security crises at GCBR level, possibly involving neopathogens and/or incurring epidemics (e.g., severe acute respiratory syndrome [SARS] and Ebola) that collectively threaten the lives of global society and interdependent biological ecosystems. Moreover, Omics encourages thinking beyond immediacy and in long-term strategies for biopreparedness and response innovation when the timelines are aggressive and compressed in response to crises such as GCBRs, but also to non-global but surging, multiple threats occurring as successive, overlapping, or distinct events, rather than as distinct entities-a prospect enforcing a reboot in Bioresilience. We define Next-Generation Bioresilience as "a systems approach against natural, accidental and perpetrated GCBRs using Omics technologies, and a shift in mentality, whereby the systems approach is expanded to include multiple plausible futures and expose unchecked assumptions attendant to risks, beyond technological determinism." In sum, it is time to think about the realistic potential of Omics biotechnologies beyond clinical practice and precision medicine so as to harness the opportunities and address the uncertainties associated not only with GCBRs but also with other emerging Omics applications in health and society.
Dwivedi, Bhakti; Kowalski, Jeanne
2018-01-01
While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/.
Dwivedi, Bhakti
2018-01-01
While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/. PMID:29415010
Perrino, Cinzia; Barabási, Albert-Laszló; Condorelli, Gianluigi; Davidson, Sean Michael; De Windt, Leon; Dimmeler, Stefanie; Engel, Felix Benedikt; Hausenloy, Derek John; Hill, Joseph Addison; Van Laake, Linda Wilhelmina; Lecour, Sandrine; Leor, Jonathan; Madonna, Rosalinda; Mayr, Manuel; Prunier, Fabrice; Sluijter, Joost Petrus Geradus; Schulz, Rainer; Thum, Thomas; Ytrehus, Kirsti
2017-01-01
Despite advances in myocardial reperfusion therapies, acute myocardial ischaemia/reperfusion injury and consequent ischaemic heart failure represent the number one cause of morbidity and mortality in industrialized societies. Although different therapeutic interventions have been shown beneficial in preclinical settings, an effective cardioprotective or regenerative therapy has yet to be successfully introduced in the clinical arena. Given the complex pathophysiology of the ischaemic heart, large scale, unbiased, global approaches capable of identifying multiple branches of the signalling networks activated in the ischaemic/reperfused heart might be more successful in the search for novel diagnostic or therapeutic targets. High-throughput techniques allow high-resolution, genome-wide investigation of genetic variants, epigenetic modifications, and associated gene expression profiles. Platforms such as proteomics and metabolomics (not described here in detail) also offer simultaneous readouts of hundreds of proteins and metabolites. Isolated omics analyses usually provide Big Data requiring large data storage, advanced computational resources and complex bioinformatics tools. The possibility of integrating different omics approaches gives new hope to better understand the molecular circuitry activated by myocardial ischaemia, putting it in the context of the human ‘diseasome’. Since modifications of cardiac gene expression have been consistently linked to pathophysiology of the ischaemic heart, the integration of epigenomic and transcriptomic data seems a promising approach to identify crucial disease networks. Thus, the scope of this Position Paper will be to highlight potentials and limitations of these approaches, and to provide recommendations to optimize the search for novel diagnostic or therapeutic targets for acute ischaemia/reperfusion injury and ischaemic heart failure in the post-genomic era. PMID:28460026
Uncovering Hidden Layers of Cell Cycle Regulation through Integrative Multi-omic Analysis
Aviner, Ranen; Shenoy, Anjana; Elroy-Stein, Orna; Geiger, Tamar
2015-01-01
Studying the complex relationship between transcription, translation and protein degradation is essential to our understanding of biological processes in health and disease. The limited correlations observed between mRNA and protein abundance suggest pervasive regulation of post-transcriptional steps and support the importance of profiling mRNA levels in parallel to protein synthesis and degradation rates. In this work, we applied an integrative multi-omic approach to study gene expression along the mammalian cell cycle through side-by-side analysis of mRNA, translation and protein levels. Our analysis sheds new light on the significant contribution of both protein synthesis and degradation to the variance in protein expression. Furthermore, we find that translation regulation plays an important role at S-phase, while progression through mitosis is predominantly controlled by changes in either mRNA levels or protein stability. Specific molecular functions are found to be co-regulated and share similar patterns of mRNA, translation and protein expression along the cell cycle. Notably, these include genes and entire pathways not previously implicated in cell cycle progression, demonstrating the potential of this approach to identify novel regulatory mechanisms beyond those revealed by traditional expression profiling. Through this three-level analysis, we characterize different mechanisms of gene expression, discover new cycling gene products and highlight the importance and utility of combining datasets generated using different techniques that monitor distinct steps of gene expression. PMID:26439921
Accessing and Integrating Data and Knowledge for Biomedical Research
Burgun, A.; Bodenreider, O.
2008-01-01
Summary Objectives To review the issues that have arisen with the advent of translational research in terms of integration of data and knowledge, and survey current efforts to address these issues. Methods Using examples form the biomedical literature, we identified new trends in biomedical research and their impact on bioinformatics. We analyzed the requirements for effective knowledge repositories and studied issues in the integration of biomedical knowledge. Results New diagnostic and therapeutic approaches based on gene expression patterns have brought about new issues in the statistical analysis of data, and new workflows are needed are needed to support translational research. Interoperable data repositories based on standard annotations, infrastructures and services are needed to support the pooling and meta-analysis of data, as well as their comparison to earlier experiments. High-quality, integrated ontologies and knowledge bases serve as a source of prior knowledge used in combination with traditional data mining techniques and contribute to the development of more effective data analysis strategies. Conclusion As biomedical research evolves from traditional clinical and biological investigations towards omics sciences and translational research, specific needs have emerged, including integrating data collected in research studies with patient clinical data, linking omics knowledge with medical knowledge, modeling the molecular basis of diseases, and developing tools that support in-depth analysis of research data. As such, translational research illustrates the need to bridge the gap between bioinformatics and medical informatics, and opens new avenues for biomedical informatics research. PMID:18660883
Ogawa, Diogo M. O.; Moriya, Shigeharu; Tsuboi, Yuuri; Date, Yasuhiro; Prieto-da-Silva, Álvaro R. B.; Rádis-Baptista, Gandhi; Yamane, Tetsuo; Kikuchi, Jun
2014-01-01
We propose the technique of biogeochemical typing (BGC typing) as a novel methodology to set forth the sub-systems of organismal communities associated to the correlated chemical profiles working within a larger complex environment. Given the intricate characteristic of both organismal and chemical consortia inherent to the nature, many environmental studies employ the holistic approach of multi-omics analyses undermining as much information as possible. Due to the massive amount of data produced applying multi-omics analyses, the results are hard to visualize and to process. The BGC typing analysis is a pipeline built using integrative statistical analysis that can treat such huge datasets filtering, organizing and framing the information based on the strength of the various mutual trends of the organismal and chemical fluctuations occurring simultaneously in the environment. To test our technique of BGC typing, we choose a rich environment abounding in chemical nutrients and organismal diversity: the surficial freshwater from Japanese paddy fields and surrounding waters. To identify the community consortia profile we employed metagenomics as high throughput sequencing (HTS) for the fragments amplified from Archaea rRNA, universal 16S rRNA and 18S rRNA; to assess the elemental content we employed ionomics by inductively coupled plasma optical emission spectroscopy (ICP-OES); and for the organic chemical profile, metabolomics employing both Fourier transformed infrared (FT-IR) spectroscopy and proton nuclear magnetic resonance (1H-NMR) all these analyses comprised our multi-omics dataset. The similar trends between the community consortia against the chemical profiles were connected through correlation. The result was then filtered, organized and framed according to correlation strengths and peculiarities. The output gave us four BGC types displaying uniqueness in community and chemical distribution, diversity and richness. We conclude therefore that the BGC typing is a successful technique for elucidating the sub-systems of organismal communities with associated chemical profiles in complex ecosystems. PMID:25330259
Ogawa, Diogo M O; Moriya, Shigeharu; Tsuboi, Yuuri; Date, Yasuhiro; Prieto-da-Silva, Álvaro R B; Rádis-Baptista, Gandhi; Yamane, Tetsuo; Kikuchi, Jun
2014-01-01
We propose the technique of biogeochemical typing (BGC typing) as a novel methodology to set forth the sub-systems of organismal communities associated to the correlated chemical profiles working within a larger complex environment. Given the intricate characteristic of both organismal and chemical consortia inherent to the nature, many environmental studies employ the holistic approach of multi-omics analyses undermining as much information as possible. Due to the massive amount of data produced applying multi-omics analyses, the results are hard to visualize and to process. The BGC typing analysis is a pipeline built using integrative statistical analysis that can treat such huge datasets filtering, organizing and framing the information based on the strength of the various mutual trends of the organismal and chemical fluctuations occurring simultaneously in the environment. To test our technique of BGC typing, we choose a rich environment abounding in chemical nutrients and organismal diversity: the surficial freshwater from Japanese paddy fields and surrounding waters. To identify the community consortia profile we employed metagenomics as high throughput sequencing (HTS) for the fragments amplified from Archaea rRNA, universal 16S rRNA and 18S rRNA; to assess the elemental content we employed ionomics by inductively coupled plasma optical emission spectroscopy (ICP-OES); and for the organic chemical profile, metabolomics employing both Fourier transformed infrared (FT-IR) spectroscopy and proton nuclear magnetic resonance (1H-NMR) all these analyses comprised our multi-omics dataset. The similar trends between the community consortia against the chemical profiles were connected through correlation. The result was then filtered, organized and framed according to correlation strengths and peculiarities. The output gave us four BGC types displaying uniqueness in community and chemical distribution, diversity and richness. We conclude therefore that the BGC typing is a successful technique for elucidating the sub-systems of organismal communities with associated chemical profiles in complex ecosystems.
Kolker, Eugene; Sansone, Susanna
2018-01-15
The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year funding "Research Coordination Network" from the National Science Foundation and was organized held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Eugene Kolker from Seattle Children's Hospital briefly discusses "The Encyclopedia of Systems Biology and OMICS," followed by Susanna Sansone from the EBI on "The ISA Infrastructure for multi-omics data" at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, CA. on Sept. 11, 2009.
Yu, Kun-Hsing; Fitzpatrick, Michael R; Pappas, Luke; Chan, Warren; Kung, Jessica; Snyder, Michael
2017-09-12
Precision oncology is an approach that accounts for individual differences to guide cancer management. Omics signatures have been shown to predict clinical traits for cancer patients. However, the vast amount of omics information poses an informatics challenge in systematically identifying patterns associated with health outcomes, and no general-purpose data-mining tool exists for physicians, medical researchers, and citizen scientists without significant training in programming and bioinformatics. To bridge this gap, we built the Omics AnalySIs System for PRecision Oncology (OASISPRO), a web-based system to mine the quantitative omics information from The Cancer Genome Atlas (TCGA). This system effectively visualizes patients' clinical profiles, executes machine-learning algorithms of choice on the omics data, and evaluates the prediction performance using held-out test sets. With this tool, we successfully identified genes strongly associated with tumor stage, and accurately predicted patients' survival outcomes in many cancer types, including mesothelioma and adrenocortical carcinoma. By identifying the links between omics and clinical phenotypes, this system will facilitate omics studies on precision cancer medicine and contribute to establishing personalized cancer treatment plans. This web-based tool is available at http://tinyurl.com/oasispro ;source codes are available at http://tinyurl.com/oasisproSourceCode . © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Deep-Sea Microbes: Linking Biogeochemical Rates to -Omics Approaches
NASA Astrophysics Data System (ADS)
Herndl, G. J.; Sintes, E.; Bayer, B.; Bergauer, K.; Amano, C.; Hansman, R.; Garcia, J.; Reinthaler, T.
2016-02-01
Over the past decade substantial progress has been made in determining deep ocean microbial activity and resolving some of the enigmas in understanding the deep ocean carbon flux. Also, metagenomics approaches have shed light onto the dark ocean's microbes but linking -omics approaches to biogeochemical rate measurements are generally rare in microbial oceanography and even more so for the deep ocean. In this presentation, we will show by combining metagenomics, -proteomics and biogeochemical rate measurements on the bulk and single-cell level that deep-sea microbes exhibit characteristics of generalists with a large genome repertoire, versatile in utilizing substrate as revealed by metaproteomics. This is in striking contrast with the apparently rather uniform dissolved organic matter pool in the deep ocean. Combining the different -omics approaches with metabolic rate measurements, we will highlight some major inconsistencies and enigmas in our understanding of the carbon cycling and microbial food web structure in the dark ocean.
Rajamani, Deepa; Bhasin, Manoj K
2016-05-03
Pancreatic cancer is an aggressive cancer with dismal prognosis, urgently necessitating better biomarkers to improve therapeutic options and early diagnosis. Traditional approaches of biomarker detection that consider only one aspect of the biological continuum like gene expression alone are limited in their scope and lack robustness in identifying the key regulators of the disease. We have adopted a multidimensional approach involving the cross-talk between the omics spaces to identify key regulators of disease progression. Multidimensional domain-specific disease signatures were obtained using rank-based meta-analysis of individual omics profiles (mRNA, miRNA, DNA methylation) related to pancreatic ductal adenocarcinoma (PDAC). These domain-specific PDAC signatures were integrated to identify genes that were affected across multiple dimensions of omics space in PDAC (genes under multiple regulatory controls, GMCs). To further pin down the regulators of PDAC pathophysiology, a systems-level network was generated from knowledge-based interaction information applied to the above identified GMCs. Key regulators were identified from the GMC network based on network statistics and their functional importance was validated using gene set enrichment analysis and survival analysis. Rank-based meta-analysis identified 5391 genes, 109 miRNAs and 2081 methylation-sites significantly differentially expressed in PDAC (false discovery rate ≤ 0.05). Bimodal integration of meta-analysis signatures revealed 1150 and 715 genes regulated by miRNAs and methylation, respectively. Further analysis identified 189 altered genes that are commonly regulated by miRNA and methylation, hence considered GMCs. Systems-level analysis of the scale-free GMCs network identified eight potential key regulator hubs, namely E2F3, HMGA2, RASA1, IRS1, NUAK1, ACTN1, SKI and DLL1, associated with important pathways driving cancer progression. Survival analysis on individual key regulators revealed that higher expression of IRS1 and DLL1 and lower expression of HMGA2, ACTN1 and SKI were associated with better survival probabilities. It is evident from the results that our hierarchical systems-level multidimensional analysis approach has been successful in isolating the converging regulatory modules and associated key regulatory molecules that are potential biomarkers for pancreatic cancer progression.
Computational dynamic approaches for temporal omics data with applications to systems medicine.
Liang, Yulan; Kelemen, Arpad
2017-01-01
Modeling and predicting biological dynamic systems and simultaneously estimating the kinetic structural and functional parameters are extremely important in systems and computational biology. This is key for understanding the complexity of the human health, drug response, disease susceptibility and pathogenesis for systems medicine. Temporal omics data used to measure the dynamic biological systems are essentials to discover complex biological interactions and clinical mechanism and causations. However, the delineation of the possible associations and causalities of genes, proteins, metabolites, cells and other biological entities from high throughput time course omics data is challenging for which conventional experimental techniques are not suited in the big omics era. In this paper, we present various recently developed dynamic trajectory and causal network approaches for temporal omics data, which are extremely useful for those researchers who want to start working in this challenging research area. Moreover, applications to various biological systems, health conditions and disease status, and examples that summarize the state-of-the art performances depending on different specific mining tasks are presented. We critically discuss the merits, drawbacks and limitations of the approaches, and the associated main challenges for the years ahead. The most recent computing tools and software to analyze specific problem type, associated platform resources, and other potentials for the dynamic trajectory and interaction methods are also presented and discussed in detail.
Furusawa, Chikara; Horinouchi, Takaaki; Hirasawa, Takashi; Shimizu, Hiroshi
2013-01-01
It is widely acknowledged that in order to establish sustainable societies, production processes should shift from petrochemical-based processes to bioprocesses. Because bioconversion technologies, in which biomass resources are converted to valuable materials, are preferable to processes dependent on fossil resources, the former should be further developed. The following two approaches can be adopted to improve cellular properties and obtain high productivity and production yield of target products: (1) optimization of cellular metabolic pathways involved in various bioprocesses and (2) creation of stress-tolerant cells that can be active even under severe stress conditions in the bioprocesses. Recent progress in omics analyses has facilitated the analysis of microorganisms based on bioinformatics data for molecular breeding and bioprocess development. Systems metabolic engineering is a new area of study, and it has been defined as a methodology in which metabolic engineering and systems biology are integrated to upgrade the designability of industrially useful microorganisms. This chapter discusses multi-omics analyses and rational design methods for molecular breeding. The first is an example of the rational design of metabolic networks for target production by flux balance analysis using genome-scale metabolic models. Recent progress in the development of genome-scale metabolic models and the application of these models to the design of desirable metabolic networks is also described in this example. The second is an example of evolution engineering with omics analyses for the creation of stress-tolerant microorganisms. Long-term culture experiments to obtain the desired phenotypes and omics analyses to identify the phenotypic changes are described here.
Cytoplasmic male sterility (CMS) in hybrid breeding in field crops.
Bohra, Abhishek; Jha, Uday C; Adhimoolam, Premkumar; Bisht, Deepak; Singh, Narendra P
2016-05-01
A comprehensive understanding of CMS/Rf system enabled by modern omics tools and technologies considerably improves our ability to harness hybrid technology for enhancing the productivity of field crops. Harnessing hybrid vigor or heterosis is a promising approach to tackle the current challenge of sustaining enhanced yield gains of field crops. In the context, cytoplasmic male sterility (CMS) owing to its heritable nature to manifest non-functional male gametophyte remains a cost-effective system to promote efficient hybrid seed production. The phenomenon of CMS stems from a complex interplay between maternally-inherited (mitochondrion) and bi-parental (nucleus) genomic elements. In recent years, attempts aimed to comprehend the sterility-inducing factors (orfs) and corresponding fertility determinants (Rf) in plants have greatly increased our access to candidate genomic segments and the cloned genes. To this end, novel insights obtained by applying state-of-the-art omics platforms have substantially enriched our understanding of cytoplasmic-nuclear communication. Concomitantly, molecular tools including DNA markers have been implicated in crop hybrid breeding in order to greatly expedite the progress. Here, we review the status of diverse sterility-inducing cytoplasms and associated Rf factors reported across different field crops along with exploring opportunities for integrating modern omics tools with CMS-based hybrid breeding.
Inflammaging and human longevity in the omics era.
Monti, Daniela; Ostan, Rita; Borelli, Vincenzo; Castellani, Gastone; Franceschi, Claudio
2017-07-01
Inflammaging is a recent theory of aging originally proposed in 2000 where data and conceptualizations regarding the aging of the immune system (immunosenescence) and the evolution of immune responses from invertebrates to mammals converged. This theory has received an increasing number of citations and experimental confirmations. Here we present an updated version of inflammaging focused on omics data - particularly on glycomics - collected on centenarians, semi-supercentenarians and their offspring. Accordingly, we arrived to the following conclusions: i) inflammaging has a structure where specific combinations of pro- and anti-inflammatory mediators are involved; ii) inflammaging is systemic and more complex than we previously thought, as many organs, tissues and cell types participate in producing pro- and anti-inflammatory stimuli defined "molecular garbage"; iii) inflammaging is dynamic, can be propagated locally to neighboring cells and systemically from organ to organ by circulating products and microvesicles, and amplified by chronic age-related diseases constituting a "local fire", which in turn produces additional inflammatory stimuli and molecular garbage; iv) an integrated Systems Medicine approach is urgently needed to let emerge a robust and highly informative set/combination of omics markers able to better grasp the complex molecular core of inflammaging in elderly and centenarians. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
A statistical framework for applying RNA profiling to chemical hazard detection.
Kostich, Mitchell S
2017-12-01
Use of 'omics technologies in environmental science is expanding. However, application is mostly restricted to characterizing molecular steps leading from toxicant interaction with molecular receptors to apical endpoints in laboratory species. Use in environmental decision-making is limited, due to difficulty in elucidating mechanisms in sufficient detail to make quantitative outcome predictions in any single species or in extending predictions to aquatic communities. Here we introduce a mechanism-agnostic statistical approach, supplementing mechanistic investigation by allowing probabilistic outcome prediction even when understanding of molecular pathways is limited, and facilitating extrapolation from results in laboratory test species to predictions about aquatic communities. We use concepts familiar to environmental managers, supplemented with techniques employed for clinical interpretation of 'omics-based biomedical tests. We describe the framework in step-wise fashion, beginning with single test replicates of a single RNA variant, then extending to multi-gene RNA profiling, collections of test replicates, and integration of complementary data. In order to simplify the presentation, we focus on using RNA profiling for distinguishing presence versus absence of chemical hazards, but the principles discussed can be extended to other types of 'omics measurements, multi-class problems, and regression. We include a supplemental file demonstrating many of the concepts using the open source R statistical package. Published by Elsevier Ltd.
Taking Bioinformatics to Systems Medicine.
van Kampen, Antoine H C; Moerland, Perry D
2016-01-01
Systems medicine promotes a range of approaches and strategies to study human health and disease at a systems level with the aim of improving the overall well-being of (healthy) individuals, and preventing, diagnosing, or curing disease. In this chapter we discuss how bioinformatics critically contributes to systems medicine. First, we explain the role of bioinformatics in the management and analysis of data. In particular we show the importance of publicly available biological and clinical repositories to support systems medicine studies. Second, we discuss how the integration and analysis of multiple types of omics data through integrative bioinformatics may facilitate the determination of more predictive and robust disease signatures, lead to a better understanding of (patho)physiological molecular mechanisms, and facilitate personalized medicine. Third, we focus on network analysis and discuss how gene networks can be constructed from omics data and how these networks can be decomposed into smaller modules. We discuss how the resulting modules can be used to generate experimentally testable hypotheses, provide insight into disease mechanisms, and lead to predictive models. Throughout, we provide several examples demonstrating how bioinformatics contributes to systems medicine and discuss future challenges in bioinformatics that need to be addressed to enable the advancement of systems medicine.
Integrated, multi-scale, spatial-temporal cell biology--A next step in the post genomic era.
Horwitz, Rick
2016-03-01
New microscopic approaches, high-throughput imaging, and gene editing promise major new insights into cellular behaviors. When coupled with genomic and other 'omic information and "mined" for correlations and associations, a new breed of powerful and useful cellular models should emerge. These top down, coarse-grained, and statistical models, in turn, can be used to form hypotheses merging with fine-grained, bottom up mechanistic studies and models that are the back bone of cell biology. The goal of the Allen Institute for Cell Science is to develop the top down approach by developing a high throughput microscopy pipeline that is integrated with modeling, using gene edited hiPS cell lines in various physiological and pathological contexts. The output of these experiments and models will be an "animated" cell, capable of integrating and analyzing image data generated from experiments and models. Copyright © 2015 Elsevier Inc. All rights reserved.
McIntyre, Roger S; Cha, Danielle S; Jerrell, Jeanette M; Swardfager, Walter; Kim, Rachael D; Costa, Leonardo G; Baskaran, Anusha; Soczynska, Joanna K; Woldeyohannes, Hanna O; Mansur, Rodrigo B; Brietzke, Elisa; Powell, Alissa M; Gallaugher, Ashley; Kudlow, Paul; Kaidanovich-Beilin, Oksana; Alsuwaidan, Mohammad
2014-08-01
To provide a strategic framework for the prevention of bipolar disorder (BD) that incorporates a 'Big Data' approach to risk assessment for BD. Computerized databases (e.g., Pubmed, PsychInfo, and MedlinePlus) were used to access English-language articles published between 1966 and 2012 with the search terms bipolar disorder, prodrome, 'Big Data', and biomarkers cross-referenced with genomics/genetics, transcriptomics, proteomics, metabolomics, inflammation, oxidative stress, neurotrophic factors, cytokines, cognition, neurocognition, and neuroimaging. Papers were selected from the initial search if the primary outcome(s) of interest was (were) categorized in any of the following domains: (i) 'omics' (e.g., genomics), (ii) molecular, (iii) neuroimaging, and (iv) neurocognitive. The current strategic approach to identifying individuals at risk for BD, with an emphasis on phenotypic information and family history, has insufficient predictive validity and is clinically inadequate. The heterogeneous clinical presentation of BD, as well as its pathoetiological complexity, suggests that it is unlikely that a single biomarker (or an exclusive biomarker approach) will sufficiently augment currently inadequate phenotypic-centric prediction models. We propose a 'Big Data'- bioinformatics approach that integrates vast and complex phenotypic, anamnestic, behavioral, family, and personal 'omics' profiling. Bioinformatic processing approaches, utilizing cloud- and grid-enabled computing, are now capable of analyzing data on the order of tera-, peta-, and exabytes, providing hitherto unheard of opportunities to fundamentally revolutionize how psychiatric disorders are predicted, prevented, and treated. High-throughput networks dedicated to research on, and the treatment of, BD, integrating both adult and younger populations, will be essential to sufficiently enroll adequate samples of individuals across the neurodevelopmental trajectory in studies to enable the characterization and prevention of this heterogeneous disorder. Advances in bioinformatics using a 'Big Data' approach provide an opportunity for novel insights regarding the pathoetiology of BD. The coordinated integration of research centers, inclusive of mixed-age populations, is a promising strategic direction for advancing this line of neuropsychiatric research. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data.
Nguyen, Tin; Diaz, Diana; Tagett, Rebecca; Draghici, Sorin
2016-07-12
MicroRNAs (miRNAs) are small non-coding RNA molecules whose primary function is to regulate the expression of gene products via hybridization to mRNA transcripts, resulting in suppression of translation or mRNA degradation. Although miRNAs have been implicated in complex diseases, including cancer, their impact on distinct biological pathways and phenotypes is largely unknown. Current integration approaches require sample-matched miRNA/mRNA datasets, resulting in limited applicability in practice. Since these approaches cannot integrate heterogeneous information available across independent experiments, they neither account for bias inherent in individual studies, nor do they benefit from increased sample size. Here we present a novel framework able to integrate miRNA and mRNA data (vertical data integration) available in independent studies (horizontal meta-analysis) allowing for a comprehensive analysis of the given phenotypes. To demonstrate the utility of our method, we conducted a meta-analysis of pancreatic and colorectal cancer, using 1,471 samples from 15 mRNA and 14 miRNA expression datasets. Our two-dimensional data integration approach greatly increases the power of statistical analysis and correctly identifies pathways known to be implicated in the phenotypes. The proposed framework is sufficiently general to integrate other types of data obtained from high-throughput assays.
Hernández-Jarguín, Angélica; Díaz-Sánchez, Sandra; Villar, Margarita; de la Fuente, José
2018-05-05
An innovative metaomics approach integrating metatranscriptomics and metaproteomics was used to characterize bacterial communities in the microbiota of the Lyme borreliosis spirochete vector, Ixodes ricinus (Acari: Ixodidae). Whole internal tissues and salivary glands from unfed larvae and female ticks, respectively were used. Reused I. ricinus RNA-sequencing data for metranscriptomics analysis together with metaproteomics provided a better characterization of tick bacterial microbiota by increasing bacteria identification and support for identified bacteria with putative functional implications. The results showed the presence of symbiotic, commensal, soil, environmental, and pathogenic bacteria in the I. ricinus microbiota, including previously unrecognized commensal and soil microorganisms. The results of the metaomics approach may have implications in the characterization of putative mechanisms by which pathogen infection manipulates tick microbiota to facilitate infection. Metaomics approaches integrating different omics datasets would provide a better description of tick microbiota compositions, and insights into tick interactions with microbiota, pathogens and hosts. Copyright © 2018 Elsevier GmbH. All rights reserved.
Kambouris, Manousos E; Manoussopoulos, Yiannis; Kritikou, Stavroula; Milioni, Aphroditi; Mantzoukas, Spyridon; Velegraki, Aristea
2018-04-01
Agrigenomics is one of the emerging focus areas for omics sciences. Yet, agrigenomics differs from medical omics applications such as pharmacogenomics and precision medicine, by virtue of vastly distributed geography of applications at the intersection of agriculture, nutrition, and genomics research streams. Crucially, agrigenomics can address diagnostics and safety surveillance needs in remote and rural farming communities or decentralized food, crop, and environmental monitoring programs for prompt, selective, and differential identification of pathogens. A case in point is the potato crop that serves as a fundamental nutritional source worldwide. Decentralized potato crop and plant protection facilities are pivotal to minimize unnecessary, preemptive use of broad-spectrum fungicides, thus helping to curtail the costs, environmental burden, and the development of resistance in opportunistic human pathogenic fungi. We report here a polymerase chain reaction-restriction fragment length polymorphism approach that is sensitive and adaptable in detection and broad identification of fungal pathogens in potato crops, with a view to future decentralized agrigenomic surveillance programs. Notably, the fingerprinting patterns obtained by the method fully differentiated 12 fungal species examined in silico, with 10 of them also tested in vitro. The method can be scaled up through improvements in electrophoresis and enzyme panel for adaption to other crops and/or pathogens. We suggest that decentralized and integrated agrosurveillance programs and translational agrigenomic programs can inform future innovations in multidomain biosecurity, particularly across omics applications from agriculture and nutrition to clinical medicine and environmental biosafety.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kolker, Eugene; Sansone, Susanna
2011-09-11
The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year funding "Research Coordination Network" from the National Science Foundation and was organized held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Eugene Kolker from Seattle Children's Hospital briefly discusses "The Encyclopedia of Systems Biology and OMICS," followed by Susanna Sansone from the EBI on "Themore » ISA Infrastructure for multi-omics data" at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, CA. on Sept. 11, 2009.« less
Educating future nursing scientists: Recommendations for integrating omics content in PhD programs.
Conley, Yvette P; Heitkemper, Margaret; McCarthy, Donna; Anderson, Cindy M; Corwin, Elizabeth J; Daack-Hirsch, Sandra; Dorsey, Susan G; Gregory, Katherine E; Groer, Maureen W; Henly, Susan J; Landers, Timothy; Lyon, Debra E; Taylor, Jacquelyn Y; Voss, Joachim
2015-01-01
Preparing the next generation of nursing scientists to conduct high-impact, competitive, sustainable, innovative, and interdisciplinary programs of research requires that the curricula for PhD programs keep pace with emerging areas of knowledge and health care/biomedical science. A field of inquiry that holds great potential to influence our understanding of the underlying biology and mechanisms of health and disease is omics. For the purpose of this article, omics refers to genomics, transcriptomics, proteomics, epigenomics, exposomics, microbiomics, and metabolomics. Traditionally, most PhD programs in schools of nursing do not incorporate this content into their core curricula. As part of the Council for the Advancement of Nursing Science's Idea Festival for Nursing Science Education, a work group charged with addressing omics preparation for the next generation of nursing scientists was convened. The purpose of this article is to describe key findings and recommendations from the work group that unanimously and enthusiastically support the incorporation of omics content into the curricula of PhD programs in nursing. The work group also calls to action faculty in schools of nursing to develop strategies to enable students needing immersion in omics science and methods to execute their research goals. Copyright © 2015 Elsevier Inc. All rights reserved.
Integrated Metagenomics/Metaproteomics Reveals Human Host-Microbiota Signatures of Crohn's Disease
Darzi, Youssef; Mongodin, Emmanuel F.; Pan, Chongle; Shah, Manesh; Halfvarson, Jonas; Tysk, Curt; Henrissat, Bernard; Raes, Jeroen; Verberkmoes, Nathan C.; Jansson, Janet K.
2012-01-01
Crohn's disease (CD) is an inflammatory bowel disease of complex etiology, although dysbiosis of the gut microbiota has been implicated in chronic immune-mediated inflammation associated with CD. Here we combined shotgun metagenomic and metaproteomic approaches to identify potential functional signatures of CD in stool samples from six twin pairs that were either healthy, or that had CD in the ileum (ICD) or colon (CCD). Integration of these omics approaches revealed several genes, proteins, and pathways that primarily differentiated ICD from healthy subjects, including depletion of many proteins in ICD. In addition, the ICD phenotype was associated with alterations in bacterial carbohydrate metabolism, bacterial-host interactions, as well as human host-secreted enzymes. This eco-systems biology approach underscores the link between the gut microbiota and functional alterations in the pathophysiology of Crohn's disease and aids in identification of novel diagnostic targets and disease specific biomarkers. PMID:23209564
Application of omics data in regulatory toxicology: report of an international BfR expert workshop.
Marx-Stoelting, P; Braeuning, A; Buhrke, T; Lampen, A; Niemann, L; Oelgeschlaeger, M; Rieke, S; Schmidt, F; Heise, T; Pfeil, R; Solecki, R
2015-11-01
Advances in omics techniques and molecular toxicology are necessary to provide new perspectives for regulatory toxicology. By the application of modern molecular techniques, more mechanistic information should be gained to support standard toxicity studies and to contribute to a reduction and refinement of animal experiments required for certain regulatory purposes. The relevance and applicability of data obtained by omics methods to regulatory purposes such as grouping of chemicals, mode of action analysis or classification and labelling needs further improvement, defined validation and cautious expert judgment. Based on the results of an international expert workshop organized 2014 by the Federal Institute for Risk Assessment in Berlin, this paper is aimed to provide a critical overview of the regulatory relevance and reliability of omics methods, basic requirements on data quality and validation, as well as regulatory criteria to decide which effects observed by omics methods should be considered adverse or non-adverse. As a way forward, it was concluded that the inclusion of omics data can facilitate a more flexible approach for regulatory risk assessment and may help to reduce or refine animal testing.
An overview on forensic analysis devoted to analytical chemists.
Castillo-Peinado, L S; Luque de Castro, M D
2017-05-15
The present article has as main aim to show analytical chemists interested in forensic analysis the world they will face if decision in favor of being a forensic analytical chemist is adopted. With this purpose, the most outstanding aspects of forensic analysis in dealing with sampling (involving both bodily and no bodily samples), sample preparation, and analytical equipment used in detection, identification and quantitation of key sample components are critically discussed. The role of the great omics in forensic analysis, and the growing role of the youngest of the great omics -metabolomics- are also discussed. The foreseeable role of integrative omics is also outlined. Copyright © 2017 Elsevier B.V. All rights reserved.
Lindemann, Stephen R.; Mobberley, Jennifer M.; Cole, Jessica K.; Markillie, L. M.; Taylor, Ronald C.; Huang, Eric; Chrisler, William B.; Wiley, H. S.; Lipton, Mary S.; Nelson, William C.; Fredrickson, James K.; Romine, Margaret F.
2017-01-01
The principles governing acquisition and interspecies exchange of nutrients in microbial communities and how those exchanges impact community productivity are poorly understood. Here, we examine energy and macronutrient acquisition in unicyanobacterial consortia for which species-resolved genome information exists for all members, allowing us to use multi-omic approaches to predict species’ abilities to acquire resources and examine expression of resource-acquisition genes during succession. Metabolic reconstruction indicated that a majority of heterotrophic community members lacked the genes required to directly acquire the inorganic nutrients provided in culture medium, suggesting high metabolic interdependency. The sole primary producer in consortium UCC-O, cyanobacterium Phormidium sp. OSCR, displayed declining expression of energy harvest, carbon fixation, and nitrate and sulfate reduction proteins but sharply increasing phosphate transporter expression over 28 days. Most heterotrophic members likewise exhibited signs of phosphorus starvation during succession. Though similar in their responses to phosphorus limitation, heterotrophs displayed species-specific expression of nitrogen acquisition genes. These results suggest niche partitioning around nitrogen sources may structure the community when organisms directly compete for limited phosphate. Such niche complementarity around nitrogen sources may increase community diversity and productivity in phosphate-limited phototrophic communities. PMID:28659875
Deep sexual dimorphism in adult medaka fish liver highlighted by multi-omic approach
Qiao, Qin; Le Manach, Séverine; Sotton, Benoit; Huet, Hélène; Duvernois-Berthet, Evelyne; Paris, Alain; Duval, Charlotte; Ponger, Loïc; Marie, Arul; Blond, Alain; Mathéron, Lucrèce; Vinh, Joelle; Bolbach, Gérard; Djediat, Chakib; Bernard, Cécile; Edery, Marc; Marie, Benjamin
2016-01-01
Sexual dimorphism describes the features that discriminate between the two sexes at various biological levels. Especially, during the reproductive phase, the liver is one of the most sexually dimorphic organs, because of different metabolic demands between the two sexes. The liver is a key organ that plays fundamental roles in various physiological processes, including digestion, energetic metabolism, xenobiotic detoxification, biosynthesis of serum proteins, and also in endocrine or immune response. The sex-dimorphism of the liver is particularly obvious in oviparous animals, as the female liver is the main organ for the synthesis of oocyte constituents. In this work, we are interested in identifying molecular sexual dimorphism in the liver of adult medaka fish and their sex-variation in response to hepatotoxic exposures. By developing an integrative approach combining histology and different high-throughput omic investigations (metabolomics, proteomics and transcriptomics), we were able to globally depict the strong sexual dimorphism that concerns various cellular and molecular processes of hepatocytes comprising protein synthesis, amino acid, lipid and polysaccharide metabolism, along with steroidogenesis and detoxification. The results of this work imply noticeable repercussions on the biology of oviparous organisms environmentally exposed to chemical or toxin issues. PMID:27561897
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yoon, Hyunjin; Ansong, Charles; McDermott, Jason E.
Background: Systemic bacterial infections are highly regulated and complex processes that are orchestrated by numerous virulence factors. Genes that are coordinately controlled by the set of regulators required for systemic infection are potentially required for pathogenicity. Results: In this study we present a systems biology approach in which sample-matched multi-omic measurements of fourteen virulence-essential regulator mutants were coupled with computational network analysis to efficiently identify Salmonella virulence factors. Immunoblot experiments verified network-predicted virulence factors and a subset was determined to be secreted into the host cytoplasm, suggesting that they are virulence factors directly interacting with host cellular components. Two ofmore » these, SrfN and PagK2, were required for full mouse virulence and were shown to be translocated independent of either of the type III secretion systems in Salmonella or the type III injectisome-related flagellar mechanism. Conclusions: Integrating multi-omic datasets from Salmonella mutants lacking virulence regulators not only identified novel virulence factors but also defined a new class of translocated effectors involved in pathogenesis. The success of this strategy at discovery of known and novel virulence factors suggests that the approach may have applicability for other bacterial pathogens.« less
Application of proteomics in research on traditional Chinese medicine.
Suo, Tongchuan; Wang, Haixia; Li, Zheng
2016-09-01
Traditional Chinese medicine (TCM) is a widely used complementary alternative medicine approach. Although many aspects of its effectiveness have been approved clinically, rigorous scientific techniques are highly required to translate the promises from TCM into powerful modern therapies. In this respect, proteomics is useful because of its ability to unveil the underlying target proteins and/or protein biomarkers. In this review, we summarize the recent interplay between proteomics and research on TCM, ranging from exploration of the medicinal materials to the biological basis of TCM concepts, and from pathological studies to pharmacological investigations. We show that proteomic analyses provide preliminary biological evidence of the promises in TCM, and the integration of proteomics with other omics and bioinformatics offers a comprehensive methodology to address the complications of TCM. Expert commentary: Currently, only limited information can be obtained regarding TCM issues and thus more work is required to resolve the ambiguity. As such, more collaborations between proteomics and other techniques (other omics, network pharmacology, etc.) are essential for deciphering the underlying biological basis in TCM topics.
Imaging and the completion of the omics paradigm in breast cancer.
Leithner, D; Horvat, J V; Ochoa-Albiztegui, R E; Thakur, S; Wengert, G; Morris, E A; Helbich, T H; Pinker, K
2018-06-08
Within the field of oncology, "omics" strategies-genomics, transcriptomics, proteomics, metabolomics-have many potential applications and may significantly improve our understanding of the underlying processes of cancer development and progression. Omics strategies aim to develop meaningful imaging biomarkers for breast cancer (BC) by rapid assessment of large datasets with different biological information. In BC the paradigm of omics technologies has always favored the integration of multiple layers of omics data to achieve a complete portrait of BC. Advances in medical imaging technologies, image analysis, and the development of high-throughput methods that can extract and correlate multiple imaging parameters with "omics" data have ushered in a new direction in medical research. Radiogenomics is a novel omics strategy that aims to correlate imaging characteristics (i. e., the imaging phenotype) with underlying gene expression patterns, gene mutations, and other genome-related characteristics. Radiogenomics not only represents the evolution in the radiology-pathology correlation from the anatomical-histological level to the molecular level, but it is also a pivotal step in the omics paradigm in BC in order to fully characterize BC. Armed with modern analytical software tools, radiogenomics leads to new discoveries of quantitative and qualitative imaging biomarkers that offer hitherto unprecedented insights into the complex tumor biology and facilitate a deeper understanding of cancer development and progression. The field of radiogenomics in breast cancer is rapidly evolving, and results from previous studies are encouraging. It can be expected that radiogenomics will play an important role in the future and has the potential to revolutionize the diagnosis, treatment, and prognosis of BC patients. This article aims to give an overview of breast radiogenomics, its current role, future applications, and challenges.
Measuring soil sustainability via soil resilience.
Ludwig, Marie; Wilmes, Paul; Schrader, Stefan
2018-06-01
Soils are the nexus of water, energy and food, which illustrates the need for a holistic approach in sustainable soil management. The present study therefore aimed at identifying a bioindicator for the evaluation of soil management sustainability in a cross-disciplinary approach between soil science and multi-omics research. For this purpose we first discuss the remaining problems and challenges of evaluating sustainability and consequently suggest one measurable bioindicator for soil management sustainability. In this concept, we define soil sustainability as the maintenance of soil functional integrity. The potential to recover functional and structural integrity after a disturbance is generally defined as resilience. This potential is a product of the past and the present soil management, and at the same time prospect of possible soil responses to future disturbances. Additionally, it is correlated with the multiple soil functions and hence reflecting the multifunctionality of the soil system. Consequently, resilience can serve as a bioindicator for soil sustainability. The measurable part of soil resilience is the response diversity, calculated from the systematic contrasting of multi-omic markers for genetic potential and functional activity, and referred to as potential Maximum Ecological Performance (MEPpot) in this study. Calculating MEPpot will allow to determine the thresholds of resistance and resilience and potential tipping points for a regime shift towards irreversible or permanent unfavorable soil states for each individual soil considered. The calculation of such ecosystem thresholds is to our opinion the current global cross-disciplinary challenge. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Clustering multilayer omics data using MuNCut.
Teran Hidalgo, Sebastian J; Ma, Shuangge
2018-03-14
Omics profiling is now a routine component of biomedical studies. In the analysis of omics data, clustering is an essential step and serves multiple purposes including for example revealing the unknown functionalities of omics units, assisting dimension reduction in outcome model building, and others. In the most recent omics studies, a prominent trend is to conduct multilayer profiling, which collects multiple types of genetic, genomic, epigenetic and other measurements on the same subjects. In the literature, clustering methods tailored to multilayer omics data are still limited. Directly applying the existing clustering methods to multilayer omics data and clustering each layer first and then combing across layers are both "suboptimal" in that they do not accommodate the interconnections within layers and across layers in an informative way. In this study, we develop the MuNCut (Multilayer NCut) clustering approach. It is tailored to multilayer omics data and sufficiently accounts for both across- and within-layer connections. It is based on the novel NCut technique and also takes advantages of regularized sparse estimation. It has an intuitive formulation and is computationally very feasible. To facilitate implementation, we develop the function muncut in the R package NcutYX. Under a wide spectrum of simulation settings, it outperforms competitors. The analysis of TCGA (The Cancer Genome Atlas) data on breast cancer and cervical cancer shows that MuNCut generates biologically meaningful results which differ from those using the alternatives. We propose a more effective clustering analysis of multiple omics data. It provides a new venue for jointly analyzing genetic, genomic, epigenetic and other measurements.
Ghan, Ryan; Van Sluyter, Steven C; Hochberg, Uri; Degu, Asfaw; Hopper, Daniel W; Tillet, Richard L; Schlauch, Karen A; Haynes, Paul A; Fait, Aaron; Cramer, Grant R
2015-11-16
Grape cultivars and wines are distinguishable by their color, flavor and aroma profiles. Omic analyses (transcripts, proteins and metabolites) are powerful tools for assessing biochemical differences in biological systems. Berry skins of red- (Cabernet Sauvignon, Merlot, Pinot Noir) and white-skinned (Chardonnay, Semillon) wine grapes were harvested near optimum maturity (°Brix-to-titratable acidity ratio) from the same experimental vineyard. The cultivars were exposed to a mild, seasonal water-deficit treatment from fruit set until harvest in 2011. Identical sample aliquots were analyzed for transcripts by grapevine whole-genome oligonucleotide microarray and RNAseq technologies, proteins by nano-liquid chromatography-mass spectroscopy, and metabolites by gas chromatography-mass spectroscopy and liquid chromatography-mass spectroscopy. Principal components analysis of each of five Omic technologies showed similar results across cultivars in all Omic datasets. Comparison of the processed data of genes mapped in RNAseq and microarray data revealed a strong Pearson's correlation (0.80). The exclusion of probesets associated with genes with potential for cross-hybridization on the microarray improved the correlation to 0.93. The overall concordance of protein with transcript data was low with a Pearson's correlation of 0.27 and 0.24 for the RNAseq and microarray data, respectively. Integration of metabolite with protein and transcript data produced an expected model of phenylpropanoid biosynthesis, which distinguished red from white grapes, yet provided detail of individual cultivar differences. The mild water deficit treatment did not significantly alter the abundance of proteins or metabolites measured in the five cultivars, but did have a small effect on gene expression. The five Omic technologies were consistent in distinguishing cultivar variation. There was high concordance between transcriptomic technologies, but generally protein abundance did not correlate well with transcript abundance. The integration of multiple high-throughput Omic datasets revealed complex biochemical variation amongst five cultivars of an ancient and economically important crop species.
Serrano, Alejandra; Espinoza, Carmen; Armijo, Grace; Inostroza-Blancheteau, Claudio; Poblete, Evelyn; Meyer-Regueiro, Carlos; Arce, Anibal; Parada, Francisca; Santibáñez, Claudia; Arce-Johnson, Patricio
2017-01-01
Grapevine fruit development is a dynamic process that can be divided into three stages: formation (I), lag (II), and ripening (III), in which physiological and biochemical changes occur, leading to cell differentiation and accumulation of different solutes. These stages can be positively or negatively affected by multiple environmental factors. During the last decade, efforts have been made to understand berry development from a global perspective. Special attention has been paid to transcriptional and metabolic networks associated with the control of grape berry development, and how external factors affect the ripening process. In this review, we focus on the integration of global approaches, including proteomics, metabolomics, and especially transcriptomics, to understand grape berry development. Several aspects will be considered, including seed development and the production of seedless fruits; veraison, at which anthocyanin accumulation begins in the berry skin of colored varieties; and hormonal regulation of berry development and signaling throughout ripening, focusing on the transcriptional regulation of hormone receptors, protein kinases, and genes related to secondary messenger sensing. Finally, berry responses to different environmental factors, including abiotic (temperature, water-related stress and UV-B radiation) and biotic (fungi and viruses) stresses, and how they can significantly modify both, development and composition of vine fruit, will be discussed. Until now, advances have been made due to the application of Omics tools at different molecular levels. However, the potential of these technologies should not be limited to the study of single-level questions; instead, data obtained by these platforms should be integrated to unravel the molecular aspects of grapevine development. Therefore, the current challenge is the generation of new tools that integrate large-scale data to assess new questions in this field, and to support agronomical practices. PMID:28936215
Dagostino, Concetta; De Gregori, Manuela; Gieger, Christian; Manz, Judith; Gudelj, Ivan; Lauc, Gordan; Divizia, Laura; Wang, Wei; Sim, Moira; Pemberton, Iain K; MacDougall, Jane; Williams, Frances; Van Zundert, Jan; Primorac, Dragan; Aulchenko, Yurii; Kapural, Leonardo; Allegri, Massimo
2017-01-01
Chronic low back pain (CLBP) is one of the most common medical conditions, ranking as the greatest contributor to global disability and accounting for huge societal costs based on the Global Burden of Disease 2010 study. Large genetic and -omics studies provide a promising avenue for the screening, development and validation of biomarkers useful for personalized diagnosis and treatment (precision medicine). Multicentre studies are needed for such an effort, and a standardized and homogeneous approach is vital for recruitment of large numbers of participants among different centres (clinical and laboratories) to obtain robust and reproducible results. To date, no validated standard operating procedures (SOPs) for genetic/-omics studies in chronic pain have been developed. In this study, we validated an SOP model that will be used in the multicentre (5 centres) retrospective "PainOmics" study, funded by the European Community in the 7th Framework Programme, which aims to develop new biomarkers for CLBP through three different -omics approaches: genomics, glycomics and activomics. The SOPs describe the specific procedures for (1) blood collection, (2) sample processing and storage, (3) shipping details and (4) cross-check testing and validation before assays that all the centres involved in the study have to follow. Multivariate analysis revealed the absolute specificity and homogeneity of the samples collected by the five centres for all genetics, glycomics and activomics analyses. The SOPs used in our multicenter study have been validated. Hence, they could represent an innovative tool for the correct management and collection of reliable samples in other large-omics-based multicenter studies.
Rai, Vineeta; Karthikaichamy, Anbarasu; Das, Debasish; Noronha, Santosh; Wangikar, Pramod P; Srivastava, Sanjeeva
2016-07-01
Current momentum of microalgal research rests extensively in tapping the potential of multi-omics methodologies in regard to sustainable biofuels. Microalgal biomass is fermented to bioethanol; while lipids, particularly triacylglycerides (TAGs), are transesterified to biodiesels. Biodiesel has emerged as an ideal biofuel candidate; hence, its commercialization and use are increasingly being emphasized. Abiotic stresses exaggerate TAG accumulation, but the precise mechanisms are yet to be known. More recently, comprehensive multi-omics studies in microalgae have emerged from the biofuel perspective. Genomics and transcriptomics of microalgae have provided crucial leads and basic understanding toward lipid biosynthesis. Proteomics and metabolomics are now complementing "algal omics" and offer precise functional insights into the attendant static and dynamic physiological contexts. Indeed, the field has progressed from shotgun to targeted approaches. Notably, targeted proteomics studies in microalga are not yet reported. Several multi-omics tools and technologies that may be used to dig deeper into the microalgal physiology are examined and highlighted in this review. The article therefore aims to both introduce various available high-throughput biotechnologies and applications of "omics" in microalgae, and enlists a compendium of the emerging cutting edge literature. We suggest that a strategic and thoughtful combination of data streams from different omics platforms can provide a system-wide overview. The algal omics warrants closer attention in the future, with a view to technical, economic, and societal impacts that are anticipated in the current postgenomics era.
Deciphering functional diversification within the lichen microbiota by meta-omics.
Cernava, Tomislav; Erlacher, Armin; Aschenbrenner, Ines Aline; Krug, Lisa; Lassek, Christian; Riedel, Katharina; Grube, Martin; Berg, Gabriele
2017-07-19
Recent evidence of specific bacterial communities extended the traditional concept of fungal-algal lichen symbioses by a further organismal kingdom. Although functional roles were already assigned to dominant members of the highly diversified microbiota, a substantial fraction of the ubiquitous colonizers remained unexplored. We employed a multi-omics approach to further characterize functional guilds in an unconventional model system. The general community structure of the lichen-associated microbiota was shown to be highly similar irrespective of the employed omics approach. Five highly abundant bacterial orders-Sphingomonadales, Rhodospirillales, Myxococcales, Chthoniobacterales, and Sphingobacteriales-harbor functions that are of substantial importance for the holobiome. Identified functions range from the provision of vitamins and cofactors to the degradation of phenolic compounds like phenylpropanoid, xylenols, and cresols. Functions that facilitate the persistence of Lobaria pulmonaria under unfavorable conditions were present in previously overlooked fractions of the microbiota. So far, unrecognized groups like Chthoniobacterales (Verrucomicrobia) emerged as functional protectors in the lichen microbiome. By combining multi-omics and imaging techniques, we highlight previously overlooked participants in the complex microenvironment of the lichens.
Multi-omics of permafrost, active layer and thermokarst bog soil microbiomes
NASA Astrophysics Data System (ADS)
Hultman, Jenni; Waldrop, Mark P.; Mackelprang, Rachel; David, Maude M.; McFarland, Jack; Blazewicz, Steven J.; Harden, Jennifer; Turetsky, Merritt R.; McGuire, A. David; Shah, Manesh B.; Verberkmoes, Nathan C.; Lee, Lang Ho; Mavrommatis, Kostas; Jansson, Janet K.
2015-05-01
Over 20% of Earth's terrestrial surface is underlain by permafrost with vast stores of carbon that, once thawed, may represent the largest future transfer of carbon from the biosphere to the atmosphere. This process is largely dependent on microbial responses, but we know little about microbial activity in intact, let alone in thawing, permafrost. Molecular approaches have recently revealed the identities and functional gene composition of microorganisms in some permafrost soils and a rapid shift in functional gene composition during short-term thaw experiments. However, the fate of permafrost carbon depends on climatic, hydrological and microbial responses to thaw at decadal scales. Here we use the combination of several molecular `omics' approaches to determine the phylogenetic composition of the microbial communities, including several draft genomes of novel species, their functional potential and activity in soils representing different states of thaw: intact permafrost, seasonally thawed active layer and thermokarst bog. The multi-omics strategy reveals a good correlation of process rates to omics data for dominant processes, such as methanogenesis in the bog, as well as novel survival strategies for potentially active microbes in permafrost.
Multi-omics of permafrost, active layer and thermokarst bog soil microbiomes.
Hultman, Jenni; Waldrop, Mark P; Mackelprang, Rachel; David, Maude M; McFarland, Jack; Blazewicz, Steven J; Harden, Jennifer; Turetsky, Merritt R; McGuire, A David; Shah, Manesh B; VerBerkmoes, Nathan C; Lee, Lang Ho; Mavrommatis, Kostas; Jansson, Janet K
2015-05-14
Over 20% of Earth's terrestrial surface is underlain by permafrost with vast stores of carbon that, once thawed, may represent the largest future transfer of carbon from the biosphere to the atmosphere. This process is largely dependent on microbial responses, but we know little about microbial activity in intact, let alone in thawing, permafrost. Molecular approaches have recently revealed the identities and functional gene composition of microorganisms in some permafrost soils and a rapid shift in functional gene composition during short-term thaw experiments. However, the fate of permafrost carbon depends on climatic, hydrological and microbial responses to thaw at decadal scales. Here we use the combination of several molecular 'omics' approaches to determine the phylogenetic composition of the microbial communities, including several draft genomes of novel species, their functional potential and activity in soils representing different states of thaw: intact permafrost, seasonally thawed active layer and thermokarst bog. The multi-omics strategy reveals a good correlation of process rates to omics data for dominant processes, such as methanogenesis in the bog, as well as novel survival strategies for potentially active microbes in permafrost.
Liu, Bao-Cheng; Ji, Guang
2017-07-01
Incorporating "-omics" studies with environmental interactions could help elucidate the biological mechanisms responsible for Traditional Chinese Medicine (TCM) patterns. Based on the authors' own experiences, this review outlines a model of an ideal combination of "-omics" biomarkers, environmental factors, and TCM pattern classifications; provides a narrative review of the relevant genetic and TCM studies; and lists several successful integrative examples. Two integration tools are briefly introduced. The first is the integration of modern devices into objective diagnostic methods of TCM patterning, which would improve current clinical decision-making and practice. The second is the use of biobanks and data platforms, which could broadly support biological and medical research. Such efforts will transform current medical management and accelerate the progression of precision medicine.
A System Biology Perspective on Environment-Host-Microbe Interactions.
Chen, Lianmin; Garmaeva, Sanzhima; Zherankova, Alexandra; Fu, Jingyuan; Wijmenga, Cisca
2018-04-16
A vast, complex and dynamic consortium of microorganisms known as the gut microbiome colonizes the human gut. Over the past few decades we have developed an increased awareness of its important role in human health. In this review we discuss the role of the gut microbiome in complex diseases and the possible causal scenarios behind its interactions with the host genome and environmental factors. We then propose a new analysis framework that combines a systems biology approach, cross-kingdom integration of multiple levels of omics data, and innovative in vitro models to yield an integrated picture of human host-microbe interactions. This new framework will lay the foundation for the development of the next phase in personalized medicine.
van Haaften, Rachel I M; Luceri, Cristina; van Erk, Arie; Evelo, Chris T A
2009-06-01
Omics technology used for large-scale measurements of gene expression is rapidly evolving. This work pointed out the need of an extensive bioinformatics analyses for array quality assessment before and after gene expression clustering and pathway analysis. A study focused on the effect of red wine polyphenols on rat colon mucosa was used to test the impact of quality control and normalisation steps on the biological conclusions. The integration of data visualization, pathway analysis and clustering revealed an artifact problem that was solved with an adapted normalisation. We propose a possible point to point standard analysis procedure, based on a combination of clustering and data visualization for the analysis of microarray data.
Pathview Web: user friendly pathway visualization and data integration
Pant, Gaurav; Bhavnasi, Yeshvant K.; Blanchard, Steven G.; Brouwer, Cory
2017-01-01
Abstract Pathway analysis is widely used in omics studies. Pathway-based data integration and visualization is a critical component of the analysis. To address this need, we recently developed a novel R package called Pathview. Pathview maps, integrates and renders a large variety of biological data onto molecular pathway graphs. Here we developed the Pathview Web server, as to make pathway visualization and data integration accessible to all scientists, including those without the special computing skills or resources. Pathview Web features an intuitive graphical web interface and a user centered design. The server not only expands the core functions of Pathview, but also provides many useful features not available in the offline R package. Importantly, the server presents a comprehensive workflow for both regular and integrated pathway analysis of multiple omics data. In addition, the server also provides a RESTful API for programmatic access and conveniently integration in third-party software or workflows. Pathview Web is openly and freely accessible at https://pathview.uncc.edu/. PMID:28482075
'Omics' techniques for identifying flooding-response mechanisms in soybean.
Komatsu, Setsuko; Shirasaka, Naoki; Sakata, Katsumi
2013-11-20
Plant growth and productivity are adversely influenced by various environmental stresses, which often lead to reduced seedling growth and decreased crop yields. Plants respond to stressful conditions through changes in 'omics' profiles, including transcriptomics, proteomics, and metabolomics. Linking plant phenotype to gene expression patterns, protein abundance, and metabolite accumulation is one of the main challenges for improving agricultural production. 'Omics' approaches may shed insight into the mechanisms that function in soybean in response to environmental stresses, particularly flooding by frequent rain, which occurs worldwide due to changes in global climate. Flooding causes significant reductions in the growth and yield of several crops, especially soybean. The application of 'omics' techniques may facilitate the development of flood-tolerant cultivars of soybean. In this review, the use of 'omics' techniques towards understanding the flooding-responsive mechanisms of soybeans is discussed, as the findings from these studies are expected to have applications in both breeding and agronomy. This article is part of a Special Issue entitled: Translational Plant Proteomics. Copyright © 2012 Elsevier B.V. All rights reserved.
2013-01-01
The Diagnostic and Statistical Manual of Mental Disorders (DSM) is universally acknowledged as the prominent reference textbook for the diagnosis and assessment of psychiatric diseases. However, since the publication of its first version in 1952, controversies have been raised concerning its reliability and validity and the need for other novel clinical tools has emerged. Currently the DSM is in its fourth edition and a new fifth edition is expected for release in 2013, in an intense intellectual debate and in a call for new proposals. Since 1952, psychiatry has undergone many changes and is emerging as unique field in the medical area in which a novel approach is being demanded for properly treating patients: not the classical “one-size-fits-all” approach, but a more targeted and tailored diagnosis and therapeutics, taking into account the complex interactions among genes and their products, environment, culture and the psychological apparatus of the subject. OMICS sciences, being based on high-throughput technologies, are systems biology related fields (like genomics, proteomics, transcriptomics and so on). In the frame of the P5 medicine (personalized, participatory, predictive, preventive, psycho-cognitive), they could establish links between psychiatric diseases, which are disorders with a final common symptomatology with vastly heterogeneous biological, environmental and sociological underpinnings, and by understanding the psychiatric diseases beyond their classic symptomatic or syndromal definitions using OMICS research, one can have a broader picture and unprecedented links and reclassification of psychiatric nosology. Importantly, by understanding the basis of heterogeneity in diseases through OMICS research, one could also personalize treatment of psychiatric illnesses. In this manuscript, we discuss a gap in the current psychiatric research, namely the missing logical link among OMICS, personalized medicine and reclassification of diseases. Moreover, we explore the importance of incorporating OMICS-based quantitative dimensional criteria, besides the classical qualitative and categorical approach. PMID:23849623
Experimental Systems-Biology Approaches for Clostridia-Based Bioenergy Production
DOE Office of Scientific and Technical Information (OSTI.GOV)
Papoutsakis, Elefterios
This is the final project report for project "Experimental Systems-Biology Approaches for Clostridia-Based Bioenergy Production" for the funding period of 9/1/12 to 2/28/2015 (three years with a 6-month no-cost extension) OVERVIEW AND PROJECT GOALS The bottleneck of achieving higher rates and titers of toxic metabolites (such as solvents and carboxylic acids that can used as biofuels or biofuel precursors) can be overcome by engineering the stress response system. Thus, understanding and modeling the response of cells to toxic metabolites is a problem of great fundamental and practical significance. In this project, our goal is to dissect at the molecular systemsmore » level and build models (conceptual and quantitative) for the stress response of C. acetobutylicum (Cac) to its two toxic metabolites: butanol (BuOH) and butyrate (BA). Transcriptional (RNAseq and microarray based), proteomic and fluxomic data and their analysis are key requirements for this goal. Transcriptional data from mid-exponential cultures of Cac under 4 different levels of BuOH and BA stress was obtained using both microarrays (Papoutsakis group) and deep sequencing (RNAseq; Meyers and Papoutsakis groups). These two sets of data do not only serve to validate each other, but are also used for identification of stress-induced changes in transcript levels, small regulatory RNAs, & in transcriptional start sites. Quantitative proteomic data (Lee group), collected using the iTRAQ technology, are essential for understanding of protein levels and turnover under stress and the various protein-protein interactions that orchestrate the stress response. Metabolic flux changes (Antoniewicz group) of core pathways, which provide important information on the re-allocation of energy and carbon resources under metabolite stress, were examined using 13C-labelled chemicals. Omics data are integrated at different levels and scales. At the metabolic-pathway level, omics data are integrated into a 2nd generation genome-scale model (GSM) (Maranas group). Omics data are also integrated using bioinformatics (Wu and Huang group), whereby regulatory details of gene and protein expression, protein-protein interactions and metabolic flux regulation are incorporated. The PI (Papoutsakis) facilitated project integration through monthly meeting and reports, conference calls, and collaborative manuscript preparation. The five groups collaborated extensively and made a large number of presentations in national and international meetings. It has also published several papers, with several more in the preparation stage. Several PhD, MS and postdoctoral students were trained as part of this collaborative and interdisciplinary project.« less
The state of rhizospheric science in the era of multi-omics: A practical guide to omics technologies
White, Richard Allen; Rivas-Ubach, Albert; Borkum, Mark I.; ...
2017-05-06
Over the past century, the significance of the rhizosphere has been increasingly recognized by the scientific community. Furthermore, this complex biological system is comprised of vast interconnected networks of microbial organisms that interact directly with their plant hosts, including archaea, bacteria, fungi, picoeukaryotes, and viruses. The rhizosphere provides a nutritional base to the terrestrial biosphere, and is integral to plant growth, crop production, and ecosystem health. There is little mechanistic understanding of the rhizosphere, but, and that constitutes a critical knowledge gap. It inhibits our ability to predict and control the terrestrial ecosystem to achieve desirable outcomes, such as bioenergymore » production, crop yield maximization, and soil-based carbon sequestration. Multi-omics have the potential to significantly advance our knowledge of rhizospheric science. Our review covers multi-omic techniques and technologies; methods and protocols for specific rhizospheric science questions; and the challenges to be addressed during this century of rhizospheric science.« less
The state of rhizospheric science in the era of multi-omics: A practical guide to omics technologies
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Richard Allen; Rivas-Ubach, Albert; Borkum, Mark I.
Over the past century, the significance of the rhizosphere has been increasingly recognized by the scientific community. Furthermore, this complex biological system is comprised of vast interconnected networks of microbial organisms that interact directly with their plant hosts, including archaea, bacteria, fungi, picoeukaryotes, and viruses. The rhizosphere provides a nutritional base to the terrestrial biosphere, and is integral to plant growth, crop production, and ecosystem health. There is little mechanistic understanding of the rhizosphere, but, and that constitutes a critical knowledge gap. It inhibits our ability to predict and control the terrestrial ecosystem to achieve desirable outcomes, such as bioenergymore » production, crop yield maximization, and soil-based carbon sequestration. Multi-omics have the potential to significantly advance our knowledge of rhizospheric science. Our review covers multi-omic techniques and technologies; methods and protocols for specific rhizospheric science questions; and the challenges to be addressed during this century of rhizospheric science.« less
The state of rhizospheric science in the era of multi-omics: A practical guide to omics technologies
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Richard Allen; Rivas-Ubach, Albert; Borkum, Mark I.
Over the past century, the significance of the rhizosphere as a complex, biological system, comprised of vast, interconnected networks of microbial organisms that interact directly with their plant hosts (e.g., archæa, bacteria, fungi, eukaryotes, and viruses) has been increasingly recognized by the scientific community. Providing a nutritional base to the terrestrial biosphere, the rhizosphere is integral to plant growth, crop production and ecosystem health. Lack of mechanistic understanding of the rhizosphere constitutes a critical knowledge gap, inhibiting our ability to predict and control the terrestrial ecosystem in order to achieve desirable outcomes (e.g., bioenergy production, crop yield maximization, and soilbasedmore » carbon sequestration). Application of multi-omics has the potential to significantly advance our knowledge of rhizospheric science. This review covers: cutting- and bleeding-edge, multi-omic techniques and technologies; methods and protocols for specific rhizospheric science questions; and, challenges to be addressed during this century of rhizospheric science.« less
Yang, Fu-Chun; Chen, Yi-Lung; Tang, Sen-Lin; Yu, Chang-Ping; Wang, Po-Hsiang; Ismail, Wael; Wang, Chia-Hsiang; Ding, Jiun-Yan; Yang, Cheng-Yu; Yang, Chia-Ying; Chiang, Yin-Ru
2016-01-01
Steroid hormones, such as androgens, are common surface-water contaminants. However, literature on the ecophysiological relevance of steroid-degrading organisms in the environment, particularly in anoxic ecosystems, is extremely limited. We previously reported that Steroidobacter denitrificans anaerobically degrades androgens through the 2,3-seco pathway. In this study, the genome of Sdo. denitrificans was completely sequenced. Transcriptomic data revealed gene clusters that were distinctly expressed during anaerobic growth on testosterone. We isolated and characterized the bifunctional 1-testosterone hydratase/dehydrogenase, which is essential for anaerobic degradation of steroid A-ring. Because of apparent substrate preference of this molybdoenzyme, corresponding genes, along with the signature metabolites of the 2,3-seco pathway, were used as biomarkers to investigate androgen biodegradation in the largest sewage treatment plant in Taipei, Taiwan. Androgen metabolite analysis indicated that denitrifying bacteria in anoxic sewage use the 2,3-seco pathway to degrade androgens. Metagenomic analysis and PCR-based functional assays showed androgen degradation in anoxic sewage by Thauera spp. through the action of 1-testosterone hydratase/dehydrogenase. Our integrative ‘omics' approach can be used for culture-independent investigations of the microbial degradation of structurally complex compounds where isotope-labeled substrates are not easily available. PMID:26872041
Shao, Yaping; Ye, Guozhu; Ren, Shancheng; Piao, Hai-Long; Zhao, Xinjie; Lu, Xin; Wang, Fubo; Ma, Wang; Li, Jia; Yin, Peiyuan; Xia, Tian; Xu, Chuanliang; Yu, Jane J; Sun, Yinghao; Xu, Guowang
2018-07-15
Genetic alterations drive metabolic reprograming to meet increased biosynthetic precursor and energy demands for cancer cell proliferation and survival in unfavorable environments. A systematic study of gene-metabolite regulatory networks and metabolic dysregulation should reveal the molecular mechanisms underlying prostate cancer (PCa) pathogenesis. Herein, we performed gas chromatography-mass spectrometry (GC-MS)-based metabolomics and RNA-seq analyses in prostate tumors and matched adjacent normal tissues (ANTs) to elucidate the molecular alterations and potential underlying regulatory mechanisms in PCa. Significant accumulation of metabolic intermediates and enrichment of genes in the tricarboxylic acid (TCA) cycle were observed in tumor tissues, indicating TCA cycle hyperactivation in PCa tissues. In addition, the levels of fumarate and malate were highly correlated with the Gleason score, tumor stage and expression of genes encoding related enzymes and were significantly related to the expression of genes involved in branched chain amino acid degradation. Using an integrated omics approach, we further revealed the potential anaplerotic routes from pyruvate, glutamine catabolism and branched chain amino acid (BCAA) degradation contributing to replenishing metabolites for TCA cycle. Integrated omics techniques enable the performance of network-based analyses to gain a comprehensive and in-depth understanding of PCa pathophysiology and may facilitate the development of new and effective therapeutic strategies. © 2018 UICC.
2015-01-01
Background microRNA (miRNA) expression plays an influential role in cancer classification and malignancy, and miRNAs are feasible as alternative diagnostic markers for pancreatic cancer, a highly aggressive neoplasm with silent early symptoms, high metastatic potential, and resistance to conventional therapies. Methods In this study, we evaluated the benefits of multi-omics data analysis by integrating miRNA and mRNA expression data in pancreatic cancer. Using support vector machine (SVM) modelling and leave-one-out cross validation (LOOCV), we evaluated the diagnostic performance of single- or multi-markers based on miRNA and mRNA expression profiles from 104 PDAC tissues and 17 benign pancreatic tissues. For selecting even more reliable and robust markers, we performed validation by independent datasets from the Gene Expression Omnibus (GEO) and the Cancer Genome Atlas (TCGA) data depositories. For validation, miRNA activity was estimated by miRNA-target gene interaction and mRNA expression datasets in pancreatic cancer. Results Using a comprehensive identification approach, we successfully identified 705 multi-markers having powerful diagnostic performance for PDAC. In addition, these marker candidates annotated with cancer pathways using gene ontology analysis. Conclusions Our prediction models have strong potential for the diagnosis of pancreatic cancer. PMID:26328610
NASA Astrophysics Data System (ADS)
Pagel, Holger; Kandeler, Ellen; Seifert, Jana; Camarinha-Silva, Amélia; Kügler, Philipp; Rennert, Thilo; Poll, Christian; Streck, Thilo
2016-04-01
Matter cycling in soils and associated soil functions are intrinsically controlled by microbial dynamics. It is therefore crucial to consider functional traits of microorganisms in biogeochemical models. Tremendous advances in 'omic' methods provide a plethora of data on physiology, metabolic capabilities and ecological life strategies of microorganisms in soil. Combined with isotopic techniques, biochemical pathways and transformations can be identified and quantified. Such data have been, however, rarely used to improve the mechanistic representation of microbial dynamics in soil organic matter models. It is the goal of the Young Investigator Group SoilReg to address this challenge. Our general approach is to tightly integrate experiments and biochemical modeling. NextGen sequencing will be applied to identify key functional groups. Active microbial groups will be quantified by measurements of functional genes and by stable isotope probing methods of DNA and proteins. Based on this information a biogeochemical model that couples a mechanistic representation of microbial dynamics with physicochemical processes will be set up and calibrated. Sensitivity and stability analyses of the model as well as scenario simulations will reveal the importance of intrinsic and extrinsic controls of organic matter turnover. We will demonstrate our concept and present first results of two case studies on pesticide degradation and methane oxidation.
Safo, Sandra E; Li, Shuzhao; Long, Qi
2018-03-01
Integrative analysis of high dimensional omics data is becoming increasingly popular. At the same time, incorporating known functional relationships among variables in analysis of omics data has been shown to help elucidate underlying mechanisms for complex diseases. In this article, our goal is to assess association between transcriptomic and metabolomic data from a Predictive Health Institute (PHI) study that includes healthy adults at a high risk of developing cardiovascular diseases. Adopting a strategy that is both data-driven and knowledge-based, we develop statistical methods for sparse canonical correlation analysis (CCA) with incorporation of known biological information. Our proposed methods use prior network structural information among genes and among metabolites to guide selection of relevant genes and metabolites in sparse CCA, providing insight on the molecular underpinning of cardiovascular disease. Our simulations demonstrate that the structured sparse CCA methods outperform several existing sparse CCA methods in selecting relevant genes and metabolites when structural information is informative and are robust to mis-specified structural information. Our analysis of the PHI study reveals that a number of gene and metabolic pathways including some known to be associated with cardiovascular diseases are enriched in the set of genes and metabolites selected by our proposed approach. © 2017, The International Biometric Society.
Malentacchi, Francesca; Mancini, Irene; Brandslund, Ivan; Vermeersch, Pieter; Schwab, Matthias; Marc, Janja; van Schaik, Ron H N; Siest, Gerard; Theodorsson, Elvar; Pazzagli, Mario; Di Resta, Chiara
2015-06-01
Developments in "-omics" are creating a paradigm shift in laboratory medicine leading to personalized medicine. This allows the increase in diagnostics and therapeutics focused on individuals rather than populations. In order to investigate whether laboratory medicine is ready to play a key role in the integration of personalized medicine in routine health care and set the state-of-the-art knowledge about personalized medicine and laboratory medicine in Europe, a questionnaire was constructed under the auspices of the European Federation of Clinical Chemistry and Laboratory Medicine (EFLM) and the European Society of Pharmacogenomics and Personalised Therapy (ESPT). The answers of the participating laboratory medicine professionals indicate that they are aware that personalized medicine can represent a new and promising health model, and that laboratory medicine should play a key role in supporting the implementation of personalized medicine in the clinical setting. Participants think that the current organization of laboratory medicine needs additional/relevant implementations such as (i) new technological facilities in -omics; (ii) additional training for the current personnel focused on the new methodologies; (iii) incorporation in the laboratory of new competencies in data interpretation and counseling; and (iv) cooperation and collaboration among professionals of different disciplines to integrate information according to a personalized medicine approach.
A generic Transcriptomics Reporting Framework (TRF) for 'omics data processing and analysis.
Gant, Timothy W; Sauer, Ursula G; Zhang, Shu-Dong; Chorley, Brian N; Hackermüller, Jörg; Perdichizzi, Stefania; Tollefsen, Knut E; van Ravenzwaay, Ben; Yauk, Carole; Tong, Weida; Poole, Alan
2017-12-01
A generic Transcriptomics Reporting Framework (TRF) is presented that lists parameters that should be reported in 'omics studies used in a regulatory context. The TRF encompasses the processes from transcriptome profiling from data generation to a processed list of differentially expressed genes (DEGs) ready for interpretation. Included within the TRF is a reference baseline analysis (RBA) that encompasses raw data selection; data normalisation; recognition of outliers; and statistical analysis. The TRF itself does not dictate the methodology for data processing, but deals with what should be reported. Its principles are also applicable to sequencing data and other 'omics. In contrast, the RBA specifies a simple data processing and analysis methodology that is designed to provide a comparison point for other approaches and is exemplified here by a case study. By providing transparency on the steps applied during 'omics data processing and analysis, the TRF will increase confidence processing of 'omics data, and regulatory use. Applicability of the TRF is ensured by its simplicity and generality. The TRF can be applied to all types of regulatory 'omics studies, and it can be executed using different commonly available software tools. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Climate shapes the protein abundance of dominant soil bacteria.
Bastida, Felipe; Crowther, Tom W; Prieto, Iván; Routh, Devin; García, Carlos; Jehmlich, Nico
2018-05-28
Sensitive models of climate change impacts would require a better integration of multi-omics approaches that connect the abundance and activity of microbial populations. Here, we show that climate is a fundamental driver of the protein abundance of Actinobacteria, Planctomycetes and Proteobacteria, supporting the hypothesis that metabolic activity of some dominant phyla may be closely linked to climate. These results may improve our capacity to construct microbial models that better predict the impact of climate change in ecosystem processes. Copyright © 2018 Elsevier B.V. All rights reserved.
Reconstruction of genome-scale human metabolic models using omics data.
Ryu, Jae Yong; Kim, Hyun Uk; Lee, Sang Yup
2015-08-01
The impact of genome-scale human metabolic models on human systems biology and medical sciences is becoming greater, thanks to increasing volumes of model building platforms and publicly available omics data. The genome-scale human metabolic models started with Recon 1 in 2007, and have since been used to describe metabolic phenotypes of healthy and diseased human tissues and cells, and to predict therapeutic targets. Here we review recent trends in genome-scale human metabolic modeling, including various generic and tissue/cell type-specific human metabolic models developed to date, and methods, databases and platforms used to construct them. For generic human metabolic models, we pay attention to Recon 2 and HMR 2.0 with emphasis on data sources used to construct them. Draft and high-quality tissue/cell type-specific human metabolic models have been generated using these generic human metabolic models. Integration of tissue/cell type-specific omics data with the generic human metabolic models is the key step, and we discuss omics data and their integration methods to achieve this task. The initial version of the tissue/cell type-specific human metabolic models can further be computationally refined through gap filling, reaction directionality assignment and the subcellular localization of metabolic reactions. We review relevant tools for this model refinement procedure as well. Finally, we suggest the direction of further studies on reconstructing an improved human metabolic model.
Ohyanagi, Hajime; Takano, Tomoyuki; Terashima, Shin; Kobayashi, Masaaki; Kanno, Maasa; Morimoto, Kyoko; Kanegae, Hiromi; Sasaki, Yohei; Saito, Misa; Asano, Satomi; Ozaki, Soichi; Kudo, Toru; Yokoyama, Koji; Aya, Koichiro; Suwabe, Keita; Suzuki, Go; Aoki, Koh; Kubo, Yasutaka; Watanabe, Masao; Matsuoka, Makoto; Yano, Kentaro
2015-01-01
Comprehensive integration of large-scale omics resources such as genomes, transcriptomes and metabolomes will provide deeper insights into broader aspects of molecular biology. For better understanding of plant biology, we aim to construct a next-generation sequencing (NGS)-derived gene expression network (GEN) repository for a broad range of plant species. So far we have incorporated information about 745 high-quality mRNA sequencing (mRNA-Seq) samples from eight plant species (Arabidopsis thaliana, Oryza sativa, Solanum lycopersicum, Sorghum bicolor, Vitis vinifera, Solanum tuberosum, Medicago truncatula and Glycine max) from the public short read archive, digitally profiled the entire set of gene expression profiles, and drawn GENs by using correspondence analysis (CA) to take advantage of gene expression similarities. In order to understand the evolutionary significance of the GENs from multiple species, they were linked according to the orthology of each node (gene) among species. In addition to other gene expression information, functional annotation of the genes will facilitate biological comprehension. Currently we are improving the given gene annotations with natural language processing (NLP) techniques and manual curation. Here we introduce the current status of our analyses and the web database, PODC (Plant Omics Data Center; http://bioinf.mind.meiji.ac.jp/podc/), now open to the public, providing GENs, functional annotations and additional comprehensive omics resources. PMID:25505034
Yamamoto, Naoki; Suzuki, Tomohiro; Kobayashi, Masaaki; Dohra, Hideo; Sasaki, Yohei; Hirai, Hirofumi; Yokoyama, Koji; Kawagishi, Hirokazu; Yano, Kentaro
2014-12-03
The angel's wing oyster mushroom (Pleurocybella porrigens, Sugihiratake) is a well-known delicacy. However, its potential risk in acute encephalopathy was recently revealed by a food poisoning incident. To disclose the genes underlying the accident and provide mechanistic insight, we seek to develop an information infrastructure containing omics data. In our previous work, we sequenced the genome and transcriptome using next-generation sequencing techniques. The next step in achieving our goal is to develop a web database to facilitate the efficient mining of large-scale omics data and identification of genes specifically expressed in the mushroom. This paper introduces a web database A-WINGS (http://bioinf.mind.meiji.ac.jp/a-wings/) that provides integrated genomic and transcriptomic information for the angel's wing oyster mushroom. The database contains structure and functional annotations of transcripts and gene expressions. Functional annotations contain information on homologous sequences from NCBI nr and UniProt, Gene Ontology, and KEGG Orthology. Digital gene expression profiles were derived from RNA sequencing (RNA-seq) analysis in the fruiting bodies and mycelia. The omics information stored in the database is freely accessible through interactive and graphical interfaces by search functions that include 'GO TREE VIEW' browsing, keyword searches, and BLAST searches. The A-WINGS database will accelerate omics studies on specific aspects of the angel's wing oyster mushroom and the family Tricholomataceae.
Dimitrova, N; Nagaraj, A B; Razi, A; Singh, S; Kamalakaran, S; Banerjee, N; Joseph, P; Mankovich, A; Mittal, P; DiFeo, A; Varadan, V
2017-04-27
Characterizing the complex interplay of cellular processes in cancer would enable the discovery of key mechanisms underlying its development and progression. Published approaches to decipher driver mechanisms do not explicitly model tissue-specific changes in pathway networks and the regulatory disruptions related to genomic aberrations in cancers. We therefore developed InFlo, a novel systems biology approach for characterizing complex biological processes using a unique multidimensional framework integrating transcriptomic, genomic and/or epigenomic profiles for any given cancer sample. We show that InFlo robustly characterizes tissue-specific differences in activities of signalling networks on a genome scale using unique probabilistic models of molecular interactions on a per-sample basis. Using large-scale multi-omics cancer datasets, we show that InFlo exhibits higher sensitivity and specificity in detecting pathway networks associated with specific disease states when compared to published pathway network modelling approaches. Furthermore, InFlo's ability to infer the activity of unmeasured signalling network components was also validated using orthogonal gene expression signatures. We then evaluated multi-omics profiles of primary high-grade serous ovarian cancer tumours (N=357) to delineate mechanisms underlying resistance to frontline platinum-based chemotherapy. InFlo was the only algorithm to identify hyperactivation of the cAMP-CREB1 axis as a key mechanism associated with resistance to platinum-based therapy, a finding that we subsequently experimentally validated. We confirmed that inhibition of CREB1 phosphorylation potently sensitized resistant cells to platinum therapy and was effective in killing ovarian cancer stem cells that contribute to both platinum-resistance and tumour recurrence. Thus, we propose InFlo to be a scalable and widely applicable and robust integrative network modelling framework for the discovery of evidence-based biomarkers and therapeutic targets.
Conceptual framework for drought phenotyping during molecular breeding.
Salekdeh, Ghasem Hosseini; Reynolds, Matthew; Bennett, John; Boyer, John
2009-09-01
Drought is a major threat to agricultural production and drought tolerance is a prime target for molecular approaches to crop improvement. To achieve meaningful results, these approaches must be linked with suitable phenotyping protocols at all stages, such as the screening of germplasm collections, mutant libraries, mapping populations, transgenic lines and breeding materials and the design of OMICS and quantitative trait loci (QTLs) experiments. Here we present a conceptual framework for molecular breeding for drought tolerance based on the Passioura equation of expressing yield as the product of water use (WU), water use efficiency (WUE) and harvest index (HI). We identify phenotyping protocols that address each of these factors, describe their key features and illustrate their integration with different molecular approaches.
Linking Ecological, Environmental and Biogeochemical Data with Multi'omics Analysis
NASA Astrophysics Data System (ADS)
Hasler-Sheetal, H.; Castorani, M. C.; Fragner, L.; Zeng, Y.; Holmer, M.; Glud, R. N.; Weckwerth, W.; Canfield, D. E.
2016-02-01
The integrated analysis of multi'omics and environmental data provides a holistic understanding of biological processes and has been proven to be challenging. Here we present our research concept for conducting multi-omics experiments and linking them to environmental data. Hypoxia, reduced light availability and species interaction - all amplified by global warming - cause a global decline of seagrasses. Metabolic mechanisms for coping with these global threats are largely unknown and multi'omics approaches can be an important approach for generating this insight. We applied GC, LC-qTOF-MS and bioinformatics to investigate the effects of environmental pressure on metabolites present in seagrasses. In a first experiment we assessed the metabolomics response of the seagrass Zostera marina towards anoxia and showed that photosynthetically derived oxygen could satisfy the oxygen demand in the leaves. But accumulation of fermentation products in the roots showed that the rhizosphere was under anoxic stress. In contrast nocturnal anoxia caused a biphasic shift in the metabolome of roots and leaves. This nocturnal reprogramming of the metabolome under anoxia indicates a mitigation mechanism to avoid the toxic effects. A pathway enrichment analysis proposes the alanine shunt, the GABA shunt and the 2-oxoglutarate shunt as such mitigation mechanisms that alleviate pyruvate levels and lead to carbon and nitrogen storage during anoxia. In a second experiment, varying light exposure and species interaction of Z. marina with the blue mussel Mytilus edulis - a co-occurring species in seagrass systems - resulted in treatment specific metabolic fingerprints in seagrass. Light modified the metabolic fingerprint expressed in Z. marina to the presence of mussels, indicating varying physiological responses to mussels in normal and low light regimes. Multivariate data-analysis indicated light exposure as main driver (45%) and mussel presence as minor driver (13%) for the metabolic responses. Traditional plant performance parameters exhibited light dependent variation but in contrast to the metabolome none of these parameters were dependent on the presence of M. edulis. This demonstrates the applicability of metabolomics to reveal hidden effects of environmental pressure on seagrasses.
Perrino, Cinzia; Barabási, Albert-Laszló; Condorelli, Gianluigi; Davidson, Sean Michael; De Windt, Leon; Dimmeler, Stefanie; Engel, Felix Benedikt; Hausenloy, Derek John; Hill, Joseph Addison; Van Laake, Linda Wilhelmina; Lecour, Sandrine; Leor, Jonathan; Madonna, Rosalinda; Mayr, Manuel; Prunier, Fabrice; Sluijter, Joost Petrus Geradus; Schulz, Rainer; Thum, Thomas; Ytrehus, Kirsti; Ferdinandy, Péter
2017-06-01
Despite advances in myocardial reperfusion therapies, acute myocardial ischaemia/reperfusion injury and consequent ischaemic heart failure represent the number one cause of morbidity and mortality in industrialized societies. Although different therapeutic interventions have been shown beneficial in preclinical settings, an effective cardioprotective or regenerative therapy has yet to be successfully introduced in the clinical arena. Given the complex pathophysiology of the ischaemic heart, large scale, unbiased, global approaches capable of identifying multiple branches of the signalling networks activated in the ischaemic/reperfused heart might be more successful in the search for novel diagnostic or therapeutic targets. High-throughput techniques allow high-resolution, genome-wide investigation of genetic variants, epigenetic modifications, and associated gene expression profiles. Platforms such as proteomics and metabolomics (not described here in detail) also offer simultaneous readouts of hundreds of proteins and metabolites. Isolated omics analyses usually provide Big Data requiring large data storage, advanced computational resources and complex bioinformatics tools. The possibility of integrating different omics approaches gives new hope to better understand the molecular circuitry activated by myocardial ischaemia, putting it in the context of the human 'diseasome'. Since modifications of cardiac gene expression have been consistently linked to pathophysiology of the ischaemic heart, the integration of epigenomic and transcriptomic data seems a promising approach to identify crucial disease networks. Thus, the scope of this Position Paper will be to highlight potentials and limitations of these approaches, and to provide recommendations to optimize the search for novel diagnostic or therapeutic targets for acute ischaemia/reperfusion injury and ischaemic heart failure in the post-genomic era. © The Author 2017. Published by Oxford University Press on behalf of the European Society of Cardiology.
Tools for the functional interpretation of metabolomic experiments.
Chagoyen, Monica; Pazos, Florencio
2013-11-01
The so-called 'omics' approaches used in modern biology aim at massively characterizing the molecular repertories of living systems at different levels. Metabolomics is one of the last additions to the 'omics' family and it deals with the characterization of the set of metabolites in a given biological system. As metabolomic techniques become more massive and allow characterizing larger sets of metabolites, automatic methods for analyzing these sets in order to obtain meaningful biological information are required. Only recently the first tools specifically designed for this task in metabolomics appeared. They are based on approaches previously used in transcriptomics and other 'omics', such as annotation enrichment analysis. These, together with generic tools for metabolic analysis and visualization not specifically designed for metabolomics will for sure be in the toolbox of the researches doing metabolomic experiments in the near future.
Chapter 3: Omics and the Future of Sustainable Biomaterials
Juliet D. Tang; Susan V. Diehl
2014-01-01
With global focus on the conversion of biomass into products, fuels, and energy, there is a strong need for information that will lead to new sustainable products, applications, and biotechnological advances. The omics approach to biology is a discovery-driven method that may deliver solutions to these overarching problems. It gives scientists the ability to obtain a...
Metagenomic insights into the ecology and physiology of microbes in bioelectrochemical systems.
Kouzuma, Atsushi; Ishii, Shun'ichi; Watanabe, Kazuya
2018-05-01
In bioelectrochemical systems (BESs), electrons are transferred between electrochemically active microbes (EAMs) and conductive materials, such as electrodes, via extracellular electron transfer (EET) pathways, and electrons thus transferred stimulate intracellular catabolic reactions. Catabolic and EET pathways have extensively been studied for several model EAMs, such as Shewanella oneidensis MR-1 and Geobacter sulfurreducens PCA, whereas it is also important to understand the ecophysiology of EAMs in naturally occurring microbiomes, such as those in anode biofilms in microbial fuel cells treating wastewater. Recent studies have exploited metagenomics and metatranscriptomics (meta-omics) approaches to characterize EAMs in BES-associated microbiomes. Here we review recent BES studies that used meta-omics approaches and show that these studies have discovered unexpected features of EAMs and deepened our understanding of functions and behaviors of microbes in BESs. It is desired that more studies will employ meta-omics approaches for advancing our knowledge on microbes in BESs. Copyright © 2018 Elsevier Ltd. All rights reserved.
Hsu, Han-Hsiu; Araki, Michihiro; Mochizuki, Masao; Hori, Yoshimi; Murata, Masahiro; Kahar, Prihardi; Yoshida, Takanobu; Hasunuma, Tomohisa; Kondo, Akihiko
2017-03-02
Chinese hamster ovary (CHO) cells are the primary host used for biopharmaceutical protein production. The engineering of CHO cells to produce higher amounts of biopharmaceuticals has been highly dependent on empirical approaches, but recent high-throughput "omics" methods are changing the situation in a rational manner. Omics data analyses using gene expression or metabolite profiling make it possible to identify key genes and metabolites in antibody production. Systematic omics approaches using different types of time-series data are expected to further enhance understanding of cellular behaviours and molecular networks for rational design of CHO cells. This study developed a systematic method for obtaining and analysing time-dependent intracellular and extracellular metabolite profiles, RNA-seq data (enzymatic mRNA levels) and cell counts from CHO cell cultures to capture an overall view of the CHO central metabolic pathway (CMP). We then calculated correlation coefficients among all the profiles and visualised the whole CMP by heatmap analysis and metabolic pathway mapping, to classify genes and metabolites together. This approach provides an efficient platform to identify key genes and metabolites in CHO cell culture.
The Human Blood Metabolome-Transcriptome Interface
Schramm, Katharina; Adamski, Jerzy; Gieger, Christian; Herder, Christian; Carstensen, Maren; Peters, Annette; Rathmann, Wolfgang; Roden, Michael; Strauch, Konstantin; Suhre, Karsten; Kastenmüller, Gabi; Prokisch, Holger; Theis, Fabian J.
2015-01-01
Biological systems consist of multiple organizational levels all densely interacting with each other to ensure function and flexibility of the system. Simultaneous analysis of cross-sectional multi-omics data from large population studies is a powerful tool to comprehensively characterize the underlying molecular mechanisms on a physiological scale. In this study, we systematically analyzed the relationship between fasting serum metabolomics and whole blood transcriptomics data from 712 individuals of the German KORA F4 cohort. Correlation-based analysis identified 1,109 significant associations between 522 transcripts and 114 metabolites summarized in an integrated network, the ‘human blood metabolome-transcriptome interface’ (BMTI). Bidirectional causality analysis using Mendelian randomization did not yield any statistically significant causal associations between transcripts and metabolites. A knowledge-based interpretation and integration with a genome-scale human metabolic reconstruction revealed systematic signatures of signaling, transport and metabolic processes, i.e. metabolic reactions mainly belonging to lipid, energy and amino acid metabolism. Moreover, the construction of a network based on functional categories illustrated the cross-talk between the biological layers at a pathway level. Using a transcription factor binding site enrichment analysis, this pathway cross-talk was further confirmed at a regulatory level. Finally, we demonstrated how the constructed networks can be used to gain novel insights into molecular mechanisms associated to intermediate clinical traits. Overall, our results demonstrate the utility of a multi-omics integrative approach to understand the molecular mechanisms underlying both normal physiology and disease. PMID:26086077
The -omics Era- Toward a Systems-Level Understanding of Streptomyces
Zhou, Zhan; Gu, Jianying; Du, Yi-Ling; Li, Yong-Quan; Wang, Yufeng
2011-01-01
Streptomyces is a group of soil bacteria of medicinal, economic, ecological, and industrial importance. It is renowned for its complex biology in gene regulation, antibiotic production, morphological differentiation, and stress response. In this review, we provide an overview of the recent advances in Streptomyces biology inspired by -omics based high throughput technologies. In this post-genomic era, vast amounts of data have been integrated to provide significant new insights into the fundamental mechanisms of system control and regulation dynamics of Streptomyces. PMID:22379394
Automatic Tool for Local Assembly Structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whole community shotgun sequencing of total DNA (i.e. metagenomics) and total RNA (i.e. metatranscriptomics) has provided a wealth of information in the microbial community structure, predicted functions, metabolic networks, and is even able to reconstruct complete genomes directly. Here we present ATLAS (Automatic Tool for Local Assembly Structures) a comprehensive pipeline for assembly, annotation, genomic binning of metagenomic and metatranscriptomic data with an integrated framework for Multi-Omics. This will provide an open source tool for the Multi-Omic community at large.
G-DOC Plus - an integrative bioinformatics platform for precision medicine.
Bhuvaneshwar, Krithika; Belouali, Anas; Singh, Varun; Johnson, Robert M; Song, Lei; Alaoui, Adil; Harris, Michael A; Clarke, Robert; Weiner, Louis M; Gusev, Yuriy; Madhavan, Subha
2016-04-30
G-DOC Plus is a data integration and bioinformatics platform that uses cloud computing and other advanced computational tools to handle a variety of biomedical BIG DATA including gene expression arrays, NGS and medical images so that they can be analyzed in the full context of other omics and clinical information. G-DOC Plus currently holds data from over 10,000 patients selected from private and public resources including Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and the recently added datasets from REpository for Molecular BRAin Neoplasia DaTa (REMBRANDT), caArray studies of lung and colon cancer, ImmPort and the 1000 genomes data sets. The system allows researchers to explore clinical-omic data one sample at a time, as a cohort of samples; or at the level of population, providing the user with a comprehensive view of the data. G-DOC Plus tools have been leveraged in cancer and non-cancer studies for hypothesis generation and validation; biomarker discovery and multi-omics analysis, to explore somatic mutations and cancer MRI images; as well as for training and graduate education in bioinformatics, data and computational sciences. Several of these use cases are described in this paper to demonstrate its multifaceted usability. G-DOC Plus can be used to support a variety of user groups in multiple domains to enable hypothesis generation for precision medicine research. The long-term vision of G-DOC Plus is to extend this translational bioinformatics platform to stay current with emerging omics technologies and analysis methods to continue supporting novel hypothesis generation, analysis and validation for integrative biomedical research. By integrating several aspects of the disease and exposing various data elements, such as outpatient lab workup, pathology, radiology, current treatments, molecular signatures and expected outcomes over a web interface, G-DOC Plus will continue to strengthen precision medicine research. G-DOC Plus is available at: https://gdoc.georgetown.edu .
Inferring gene ontologies from pairwise similarity data
Kramer, Michael; Dutkowski, Janusz; Yu, Michael; Bafna, Vineet; Ideker, Trey
2014-01-01
Motivation: While the manually curated Gene Ontology (GO) is widely used, inferring a GO directly from -omics data is a compelling new problem. Recognizing that ontologies are a directed acyclic graph (DAG) of terms and hierarchical relations, algorithms are needed that: analyze a full matrix of gene–gene pairwise similarities from -omics data;infer true hierarchical structure in these data rather than enforcing hierarchy as a computational artifact; andrespect biological pleiotropy, by which a term in the hierarchy can relate to multiple higher level terms. Methods addressing these requirements are just beginning to emerge—none has been evaluated for GO inference. Methods: We consider two algorithms [Clique Extracted Ontology (CliXO), LocalFitness] that uniquely satisfy these requirements, compared with methods including standard clustering. CliXO is a new approach that finds maximal cliques in a network induced by progressive thresholding of a similarity matrix. We evaluate each method’s ability to reconstruct the GO biological process ontology from a similarity matrix based on (a) semantic similarities for GO itself or (b) three -omics datasets for yeast. Results: For task (a) using semantic similarity, CliXO accurately reconstructs GO (>99% precision, recall) and outperforms other approaches (<20% precision, <20% recall). For task (b) using -omics data, CliXO outperforms other methods using two -omics datasets and achieves ∼30% precision and recall using YeastNet v3, similar to an earlier approach (Network Extracted Ontology) and better than LocalFitness or standard clustering (20–25% precision, recall). Conclusion: This study provides algorithmic foundation for building gene ontologies by capturing hierarchical and pleiotropic structure embedded in biomolecular data. Contact: tideker@ucsd.edu PMID:24932003
Borisjuk, Ljudmilla; Hajirezaei, Mohammad-Reza; Klukas, Christian; Rolletschek, Hardy; Schreiber, Falk
2005-01-01
Modern 'omics'-technologies result in huge amounts of data about life processes. For analysis and data mining purposes this data has to be considered in the context of the underlying biological networks. This work presents an approach for integrating data from biological experiments into metabolic networks by mapping the data onto network elements and visualising the data enriched networks automatically. This methodology is implemented in DBE, an information system that supports the analysis and visualisation of experimental data in the context of metabolic networks. It consists of five parts: (1) the DBE-Database for consistent data storage, (2) the Excel-Importer application for the data import, (3) the DBE-Website as the interface for the system, (4) the DBE-Pictures application for the up- and download of binary (e. g. image) files, and (5) DBE-Gravisto, a network analysis and graph visualisation system. The usability of this approach is demonstrated in two examples.
RNA Systems Biology for Cancer: From Diagnosis to Therapy.
Amirkhah, Raheleh; Farazmand, Ali; Wolkenhauer, Olaf; Schmitz, Ulf
2016-01-01
It is due to the advances in high-throughput omics data generation that RNA species have re-entered the focus of biomedical research. International collaborate efforts, like the ENCODE and GENCODE projects, have spawned thousands of previously unknown functional non-coding RNAs (ncRNAs) with various but primarily regulatory roles. Many of these are linked to the emergence and progression of human diseases. In particular, interdisciplinary studies integrating bioinformatics, systems biology, and biotechnological approaches have successfully characterized the role of ncRNAs in different human cancers. These efforts led to the identification of a new tool-kit for cancer diagnosis, monitoring, and treatment, which is now starting to enter and impact on clinical practice. This chapter is to elaborate on the state of the art in RNA systems biology, including a review and perspective on clinical applications toward an integrative RNA systems medicine approach. The focus is on the role of ncRNAs in cancer.
Campos, Pollyanna Fernandes; Andrade-Silva, Débora; Zelanis, André; Paes Leme, Adriana Franco; Rocha, Marisa Maria Teixeira; Menezes, Milene Cristina; Serrano, Solange M.T.; Junqueira-de-Azevedo, Inácio de Loiola Meirelles
2016-01-01
Only few studies on snake venoms were dedicated to deeply characterize the toxin secretion of animals from the Colubridae family, despite the fact that they represent the majority of snake diversity. As a consequence, some evolutionary trends observed in venom proteins that underpinned the evolutionary histories of snake toxins were based on data from a minor parcel of the clade. Here, we investigated the proteins of the totally unknown venom from Phalotris mertensi (Dipsadinae subfamily), in order to obtain a detailed profile of its toxins and to appreciate evolutionary tendencies occurring in colubrid venoms. By means of integrated omics and functional approaches, including RNAseq, Sanger sequencing, high-resolution proteomics, recombinant protein production, and enzymatic tests, we verified an active toxic secretion containing up to 21 types of proteins. A high content of Kunitz-type proteins and C-type lectins were observed, although several enzymatic components such as metalloproteinases and an L-amino acid oxidase were also present in the venom. Interestingly, an arguable venom component of other species was demonstrated as a true venom protein and named svLIPA (snake venom acid lipase). This finding indicates the importance of checking the actual protein occurrence across species before rejecting genes suggested to code for toxins, which are relevant for the discussion about the early evolution of reptile venoms. Moreover, trends in the evolution of some toxin classes, such as simplification of metalloproteinases and rearrangements of Kunitz and Wap domains, parallel similar phenomena observed in other venomous snake families and provide a broader picture of toxin evolution. PMID:27412610
OmicsNet: a web-based tool for creation and visual analysis of biological networks in 3D space.
Zhou, Guangyan; Xia, Jianguo
2018-06-07
Biological networks play increasingly important roles in omics data integration and systems biology. Over the past decade, many excellent tools have been developed to support creation, analysis and visualization of biological networks. However, important limitations remain: most tools are standalone programs, the majority of them focus on protein-protein interaction (PPI) or metabolic networks, and visualizations often suffer from 'hairball' effects when networks become large. To help address these limitations, we developed OmicsNet - a novel web-based tool that allows users to easily create different types of molecular interaction networks and visually explore them in a three-dimensional (3D) space. Users can upload one or multiple lists of molecules of interest (genes/proteins, microRNAs, transcription factors or metabolites) to create and merge different types of biological networks. The 3D network visualization system was implemented using the powerful Web Graphics Library (WebGL) technology that works natively in most major browsers. OmicsNet supports force-directed layout, multi-layered perspective layout, as well as spherical layout to help visualize and navigate complex networks. A rich set of functions have been implemented to allow users to perform coloring, shading, topology analysis, and enrichment analysis. OmicsNet is freely available at http://www.omicsnet.ca.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lindemann, Stephen R.; Mobberley, Jennifer M.; Cole, Jessica K.
The principles governing acquisition and interspecies exchange of nutrients in microbial communities and how those exchanges impact community productivity are poorly understood. Here, we examine energy and macronutrient acquisition in unicyanobacterial consortia for which species-resolved genome information exists for all members, allowing us to use multi-omic approaches to predict species’ abilities to acquire resources and examine expression of resource-acquisition genes during succession. Metabolic reconstruction indicated that a majority of heterotrophic community members lacked the genes required to directly acquire the inorganic nutrients provided in culture medium, suggesting high metabolic interdependency. The sole primary producer in consortium UCC-O, cyanobacterium Phormidium sp.more » OSCR, displayed declining expression of energy harvest, carbon fixation, and nitrate and sulfate reduction proteins but sharply increasing phosphate transporter expression over 28 days. Most heterotrophic members likewise exhibited signs of phosphorus starvation during succession. Though similar in their responses to phosphorus limitation, heterotrophs displayed species-specific expression of nitrogen acquisition genes. These results suggest niche partitioning around nitrogen sources may structure the community when organisms directly compete for limited phosphate. Such niche complementarity around nitrogen sources may increase community diversity and productivity in phosphate-limited phototrophic communities.« less
Genome Sequencing Technologies and Nursing: What Are the Roles of Nurses and Nurse Scientists?
Taylor, Jacquelyn Y.; Wright, Michelle L.; Hickey, Kathleen T.; Housman, David
2016-01-01
Background Advances in DNA sequencing technology have resulted in an abundance of personalized data with challenging clinical utility and meaning for clinicians. This wealth of data has potential to dramatically impact the quality of healthcare. Nurses are at the focal point in educating patients regarding relevant healthcare needs; therefore, an understanding of sequencing technology and utilizing these data are critical. Aim The objective of this paper is to explicate the role of nurses and nurse scientists as integral members of healthcare teams in improving understanding of DNA sequencing data and translational genomics for patients. Approach A history of the nurse role in newborn screening is used as an exemplar. Discussion This paper serves as an exemplar on how genome sequencing has been utilized in nursing science and incorporates linkages of other omics approaches used by nurses that are included in this special issue. This special issue showcased nurse scientists conducting multi-omic research from various methods, including targeted candidate genes, pharmacogenomics, proteomics, epigenomics and the microbiome. From this vantage point, we provide an overview of the roles of nurse scientists in genome sequencing research and provide recommendations for the best utilization of nurses and nurse scientists related to genome sequencing. PMID:28252579
Lindemann, Stephen R.; Mobberley, Jennifer M.; Cole, Jessica K.; ...
2017-06-13
The principles governing acquisition and interspecies exchange of nutrients in microbial communities and how those exchanges impact community productivity are poorly understood. Here, we examine energy and macronutrient acquisition in unicyanobacterial consortia for which species-resolved genome information exists for all members, allowing us to use multi-omic approaches to predict species’ abilities to acquire resources and examine expression of resource-acquisition genes during succession. Metabolic reconstruction indicated that a majority of heterotrophic community members lacked the genes required to directly acquire the inorganic nutrients provided in culture medium, suggesting high metabolic interdependency. The sole primary producer in consortium UCC-O, cyanobacterium Phormidium sp.more » OSCR, displayed declining expression of energy harvest, carbon fixation, and nitrate and sulfate reduction proteins but sharply increasing phosphate transporter expression over 28 days. Most heterotrophic members likewise exhibited signs of phosphorus starvation during succession. Though similar in their responses to phosphorus limitation, heterotrophs displayed species-specific expression of nitrogen acquisition genes. These results suggest niche partitioning around nitrogen sources may structure the community when organisms directly compete for limited phosphate. Such niche complementarity around nitrogen sources may increase community diversity and productivity in phosphate-limited phototrophic communities.« less
Liu, Zhenqiu; Sun, Fengzhu; McGovern, Dermot P
2017-01-01
Feature selection and prediction are the most important tasks for big data mining. The common strategies for feature selection in big data mining are L 1 , SCAD and MC+. However, none of the existing algorithms optimizes L 0 , which penalizes the number of nonzero features directly. In this paper, we develop a novel sparse generalized linear model (GLM) with L 0 approximation for feature selection and prediction with big omics data. The proposed approach approximate the L 0 optimization directly. Even though the original L 0 problem is non-convex, the problem is approximated by sequential convex optimizations with the proposed algorithm. The proposed method is easy to implement with only several lines of code. Novel adaptive ridge algorithms ( L 0 ADRIDGE) for L 0 penalized GLM with ultra high dimensional big data are developed. The proposed approach outperforms the other cutting edge regularization methods including SCAD and MC+ in simulations. When it is applied to integrated analysis of mRNA, microRNA, and methylation data from TCGA ovarian cancer, multilevel gene signatures associated with suboptimal debulking are identified simultaneously. The biological significance and potential clinical importance of those genes are further explored. The developed Software L 0 ADRIDGE in MATLAB is available at https://github.com/liuzqx/L0adridge.
Yeoman, Carl J.; Thomas, Susan M.; Miller, Margret E. Berg; Ulanov, Alexander V.; Torralba, Manolito; Lucas, Sarah; Gillis, Marcus; Cregger, Melissa; Gomez, Andres; Ho, Mengfei; Leigh, Steven R.; Stumpf, Rebecca; Creedon, Douglas J.; Smith, Michael A.; Weisbaum, Jon S.; Nelson, Karen E.; Wilson, Brenda A.; White, Bryan A.
2013-01-01
Background Bacterial vaginosis (BV) is the most common vaginal disorder of reproductive-age women. Yet the cause of BV has not been established. To uncover key determinants of BV, we employed a multi-omic, systems-biology approach, including both deep 16S rRNA gene-based sequencing and metabolomics of lavage samples from 36 women. These women varied demographically, behaviorally, and in terms of health status and symptoms. Principal Findings 16S rRNA gene-based community composition profiles reflected Nugent scores, but not Amsel criteria. In contrast, metabolomic profiles were markedly more concordant with Amsel criteria. Metabolomic profiles revealed two distinct symptomatic BV types (SBVI and SBVII) with similar characteristics that indicated disruption of epithelial integrity, but each type was correlated to the presence of different microbial taxa and metabolites, as well as to different host behaviors. The characteristic odor associated with BV was linked to increases in putrescine and cadaverine, which were both linked to Dialister spp. Additional correlations were seen with the presence of discharge, 2-methyl-2-hydroxybutanoic acid, and Mobiluncus spp., and with pain, diethylene glycol and Gardnerella spp. Conclusions The results not only provide useful diagnostic biomarkers, but also may ultimately provide much needed insight into the determinants of BV. PMID:23405259
Yeoman, Carl J; Thomas, Susan M; Miller, Margret E Berg; Ulanov, Alexander V; Torralba, Manolito; Lucas, Sarah; Gillis, Marcus; Cregger, Melissa; Gomez, Andres; Ho, Mengfei; Leigh, Steven R; Stumpf, Rebecca; Creedon, Douglas J; Smith, Michael A; Weisbaum, Jon S; Nelson, Karen E; Wilson, Brenda A; White, Bryan A
2013-01-01
Bacterial vaginosis (BV) is the most common vaginal disorder of reproductive-age women. Yet the cause of BV has not been established. To uncover key determinants of BV, we employed a multi-omic, systems-biology approach, including both deep 16S rRNA gene-based sequencing and metabolomics of lavage samples from 36 women. These women varied demographically, behaviorally, and in terms of health status and symptoms. 16S rRNA gene-based community composition profiles reflected Nugent scores, but not Amsel criteria. In contrast, metabolomic profiles were markedly more concordant with Amsel criteria. Metabolomic profiles revealed two distinct symptomatic BV types (SBVI and SBVII) with similar characteristics that indicated disruption of epithelial integrity, but each type was correlated to the presence of different microbial taxa and metabolites, as well as to different host behaviors. The characteristic odor associated with BV was linked to increases in putrescine and cadaverine, which were both linked to Dialister spp. Additional correlations were seen with the presence of discharge, 2-methyl-2-hydroxybutanoic acid, and Mobiluncus spp., and with pain, diethylene glycol and Gardnerella spp. The results not only provide useful diagnostic biomarkers, but also may ultimately provide much needed insight into the determinants of BV.
Chae, Heejoon; Lee, Sangseon; Seo, Seokjun; Jung, Daekyoung; Chang, Hyeonsook; Nephew, Kenneth P; Kim, Sun
2016-12-01
Measuring gene expression, DNA sequence variation, and DNA methylation status is routinely done using high throughput sequencing technologies. To analyze such multi-omics data and explore relationships, reliable bioinformatics systems are much needed. Existing systems are either for exploring curated data or for processing omics data in the form of a library such as R. Thus scientists have much difficulty in investigating relationships among gene expression, DNA sequence variation, and DNA methylation using multi-omics data. In this study, we report a system called BioVLAB-mCpG-SNP-EXPRESS for the integrated analysis of DNA methylation, sequence variation (SNPs), and gene expression for distinguishing cellular phenotypes at the pairwise and multiple phenotype levels. The system can be deployed on either the Amazon cloud or a publicly available high-performance computing node, and the data analysis and exploration of the analysis result can be conveniently done using a web-based interface. In order to alleviate analysis complexity, all the process are fully automated, and graphical workflow system is integrated to represent real-time analysis progression. The BioVLAB-mCpG-SNP-EXPRESS system works in three stages. First, it processes and analyzes multi-omics data as input in the form of the raw data, i.e., FastQ files. Second, various integrated analyses such as methylation vs. gene expression and mutation vs. methylation are performed. Finally, the analysis result can be explored in a number of ways through a web interface for the multi-level, multi-perspective exploration. Multi-level interpretation can be done by either gene, gene set, pathway or network level and multi-perspective exploration can be explored from either gene expression, DNA methylation, sequence variation, or their relationship perspective. The utility of the system is demonstrated by performing analysis of phenotypically distinct 30 breast cancer cell line data set. BioVLAB-mCpG-SNP-EXPRESS is available at http://biohealth.snu.ac.kr/software/biovlab_mcpg_snp_express/. Copyright © 2016 Elsevier Inc. All rights reserved.
Czaplicki, Lauren M.; Gunsch, Claudia K.
2017-01-01
Bioremediation is generally viewed as a cost effective and sustainable technology because it relies on microbes to transform pollutants into benign compounds. Advances in molecular biological analyses allow unprecedented microbial detection and are increasingly incorporated into bioremediation. Throughout history, state-of-the-art techniques have informed bioremediation strategies. However, the insights those techniques provided were not as in depth as those provided by recently developed omics tools. Advances in next generation sequencing (NGS) have now placed metagenomics and metatranscriptomics within reach of environmental engineers. As NGS costs decrease, metagenomics and metatranscriptomics have become increasingly feasible options to rapidly scan sites for specific degradative functions and identify microorganisms important in pollutant degradation. These omic techniques are capable of revolutionizing biological treatment in environmental engineering by allowing highly sensitive characterization of previously uncultured microorganisms. Omics enables the discovery of novel microorganisms for use in bioaugmentation and supports systematic optimization of biostimulation strategies. This review describes the omics journey from roots in biology and medicine to its current status in environmental engineering including potential future directions in commercial application. PMID:28348455
Ciucci, Sara; Ge, Yan; Durán, Claudio; Palladini, Alessandra; Jiménez-Jiménez, Víctor; Martínez-Sánchez, Luisa María; Wang, Yuting; Sales, Susanne; Shevchenko, Andrej; Poser, Steven W.; Herbig, Maik; Otto, Oliver; Androutsellis-Theotokis, Andreas; Guck, Jochen; Gerl, Mathias J.; Cannistraci, Carlo Vittorio
2017-01-01
Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample segregation. Biologists and medical researchers often prefer effective methods that offer an immediate interpretation to complicated algorithms that in principle promise an improvement but in practice are difficult to be applied and interpreted. Here we present PC-corr: a simple algorithm that associates to any PCA segregation a discriminative network of features. Such network can be inspected in search of functional modules useful in the definition of combinatorial and multiscale biomarkers from multifaceted omic data in systems and precision biomedicine. We offer proofs of PC-corr efficacy on lipidomic, metagenomic, developmental genomic, population genetic, cancer promoteromic and cancer stem-cell mechanomic data. Finally, PC-corr is a general functional network inference approach that can be easily adopted for big data exploration in computer science and analysis of complex systems in physics. PMID:28287094
Karaosmanoglu, Kubra; Sayar, Nihat Alpagu; Kurnaz, Isil Aksan; Akbulut, Berna Sariyar
2014-01-01
Postgenomics drug development is undergoing major transformation in the age of multi-omics studies and drug repositioning. Rather than applications solely in personalized medicine, omics science thus additionally offers a better understanding of a broader range of drug targets and drug repositioning. Berberine is an isoquinoline alkaloid found in many medicinal plants. We report here a whole genome microarray study in tandem with proteomics techniques for mining the plethora of targets that are putatively involved in the antimicrobial activity of berberine against Escherichia coli. We found DNA replication/repair and transcription to be triggered by berberine, indicating that nucleic acids, in general, are among its targets. Our combined transcriptomics and proteomics multi-omics findings underscore that, in the presence of berberine, cell wall or cell membrane transport and motility-related functions are also specifically regulated. We further report a general decline in metabolism, as seen by repression of genes in carbohydrate and amino acid metabolism, energy production, and conversion. An involvement of multidrug efflux pumps, as well as reduced membrane permeability for developing resistance against berberine in E. coli was noted. Collectively, these findings offer original and significant leads for omics-guided drug discovery and future repositioning approaches in the postgenomics era, using berberine as a multi-omics case study.
Mildew-Omics: How Global Analyses Aid the Understanding of Life and Evolution of Powdery Mildews.
Bindschedler, Laurence V; Panstruga, Ralph; Spanu, Pietro D
2016-01-01
The common powdery mildew plant diseases are caused by ascomycete fungi of the order Erysiphales. Their characteristic life style as obligate biotrophs renders functional analyses in these species challenging, mainly because of experimental constraints to genetic manipulation. Global large-scale ("-omics") approaches are thus particularly valuable and insightful for the characterisation of the life and evolution of powdery mildews. Here we review the knowledge obtained so far from genomic, transcriptomic and proteomic studies in these fungi. We consider current limitations and challenges regarding these surveys and provide an outlook on desired future investigations on the basis of the various -omics technologies.
Ramharack, Pritika; Soliman, Mahmoud E S
2018-06-01
Originally developed for the analysis of biological sequences, bioinformatics has advanced into one of the most widely recognized domains in the scientific community. Despite this technological evolution, there is still an urgent need for nontoxic and efficient drugs. The onus now falls on the 'omics domain to meet this need by implementing bioinformatics techniques that will allow for the introduction of pioneering approaches in the rational drug design process. Here, we categorize an updated list of informatics tools and explore the capabilities of integrative bioinformatics in disease control. We believe that our review will serve as a comprehensive guide toward bioinformatics-oriented disease and drug discovery research. Copyright © 2018 Elsevier Ltd. All rights reserved.
BluePen Biomarkers LLC: integrated biomarker solutions
Blair, Ian A; Mesaros, Clementina; Lilley, Patrick; Nunez, Matthew
2016-01-01
BluePen Biomarkers provides a unique comprehensive multi-omics biomarker discovery and validation platform. We can quantify, integrate and analyze genomics, proteomics, metabolomics and lipidomics biomarkers, alongside clinical data, demographics and other phenotypic data. A unique bio-inspired signal processing analytic approach is used that has the proven ability to identify biomarkers in a wide variety of diseases. The resulting biomarkers can be used for diagnosis, prognosis, mechanistic studies and predicting treatment response, in contexts from core research through clinical trials. BluePen Biomarkers provides an additional groundbreaking research goal: identifying surrogate biomarkers from different modalities. This not only provides new biological insights, but enables least invasive, least-cost tests that meet or exceed the predictive quality of current tests. PMID:28031971
Guhlin, Joseph; Silverstein, Kevin A T; Zhou, Peng; Tiffin, Peter; Young, Nevin D
2017-08-10
Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab datasets. With so many research groups generating their own data, the ability to relate it to the larger genomic and comparative genomic context is becoming increasingly crucial to make full use of the data. The Omics Database Generator (ODG) allows users to create customized databases that utilize published genomics data integrated with experimental data which can be queried using a flexible graph database. When provided with omics and experimental data, ODG will create a comparative, multi-dimensional graph database. ODG can import definitions and annotations from other sources such as InterProScan, the Gene Ontology, ENZYME, UniPathway, and others. This annotation data can be especially useful for studying new or understudied species for which transcripts have only been predicted, and rapidly give additional layers of annotation to predicted genes. In better studied species, ODG can perform syntenic annotation translations or rapidly identify characteristics of a set of genes or nucleotide locations, such as hits from an association study. ODG provides a web-based user-interface for configuring the data import and for querying the database. Queries can also be run from the command-line and the database can be queried directly through programming language hooks available for most languages. ODG supports most common genomic formats as well as generic, easy to use tab-separated value format for user-provided annotations. ODG is a user-friendly database generation and query tool that adapts to the supplied data to produce a comparative genomic database or multi-layered annotation database. ODG provides rapid comparative genomic annotation and is therefore particularly useful for non-model or understudied species. For species for which more data are available, ODG can be used to conduct complex multi-omics, pattern-matching queries.
Putative regulatory sites unraveled by network-embedded thermodynamic analysis of metabolome data
Kümmel, Anne; Panke, Sven; Heinemann, Matthias
2006-01-01
As one of the most recent members of the omics family, large-scale quantitative metabolomics data are currently complementing our systems biology data pool and offer the chance to integrate the metabolite level into the functional analysis of cellular networks. Network-embedded thermodynamic analysis (NET analysis) is presented as a framework for mechanistic and model-based analysis of these data. By coupling the data to an operating metabolic network via the second law of thermodynamics and the metabolites' Gibbs energies of formation, NET analysis allows inferring functional principles from quantitative metabolite data; for example it identifies reactions that are subject to active allosteric or genetic regulation as exemplified with quantitative metabolite data from Escherichia coli and Saccharomyces cerevisiae. Moreover, the optimization framework of NET analysis was demonstrated to be a valuable tool to systematically investigate data sets for consistency, for the extension of sub-omic metabolome data sets and for resolving intracompartmental concentrations from cell-averaged metabolome data. Without requiring any kind of kinetic modeling, NET analysis represents a perfectly scalable and unbiased approach to uncover insights from quantitative metabolome data. PMID:16788595
Ohyanagi, Hajime; Takano, Tomoyuki; Terashima, Shin; Kobayashi, Masaaki; Kanno, Maasa; Morimoto, Kyoko; Kanegae, Hiromi; Sasaki, Yohei; Saito, Misa; Asano, Satomi; Ozaki, Soichi; Kudo, Toru; Yokoyama, Koji; Aya, Koichiro; Suwabe, Keita; Suzuki, Go; Aoki, Koh; Kubo, Yasutaka; Watanabe, Masao; Matsuoka, Makoto; Yano, Kentaro
2015-01-01
Comprehensive integration of large-scale omics resources such as genomes, transcriptomes and metabolomes will provide deeper insights into broader aspects of molecular biology. For better understanding of plant biology, we aim to construct a next-generation sequencing (NGS)-derived gene expression network (GEN) repository for a broad range of plant species. So far we have incorporated information about 745 high-quality mRNA sequencing (mRNA-Seq) samples from eight plant species (Arabidopsis thaliana, Oryza sativa, Solanum lycopersicum, Sorghum bicolor, Vitis vinifera, Solanum tuberosum, Medicago truncatula and Glycine max) from the public short read archive, digitally profiled the entire set of gene expression profiles, and drawn GENs by using correspondence analysis (CA) to take advantage of gene expression similarities. In order to understand the evolutionary significance of the GENs from multiple species, they were linked according to the orthology of each node (gene) among species. In addition to other gene expression information, functional annotation of the genes will facilitate biological comprehension. Currently we are improving the given gene annotations with natural language processing (NLP) techniques and manual curation. Here we introduce the current status of our analyses and the web database, PODC (Plant Omics Data Center; http://bioinf.mind.meiji.ac.jp/podc/), now open to the public, providing GENs, functional annotations and additional comprehensive omics resources. © The Author 2014. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.
Systems Medicine: The Future of Medical Genomics, Healthcare, and Wellness.
Saqi, Mansoor; Pellet, Johann; Roznovat, Irina; Mazein, Alexander; Ballereau, Stéphane; De Meulder, Bertrand; Auffray, Charles
2016-01-01
Recent advances in genomics have led to the rapid and relatively inexpensive collection of patient molecular data including multiple types of omics data. The integration of these data with clinical measurements has the potential to impact on our understanding of the molecular basis of disease and on disease management. Systems medicine is an approach to understanding disease through an integration of large patient datasets. It offers the possibility for personalized strategies for healthcare through the development of a new taxonomy of disease. Advanced computing will be an important component in effectively implementing systems medicine. In this chapter we describe three computational challenges associated with systems medicine: disease subtype discovery using integrated datasets, obtaining a mechanistic understanding of disease, and the development of an informatics platform for the mining, analysis, and visualization of data emerging from translational medicine studies.
Pathview Web: user friendly pathway visualization and data integration.
Luo, Weijun; Pant, Gaurav; Bhavnasi, Yeshvant K; Blanchard, Steven G; Brouwer, Cory
2017-07-03
Pathway analysis is widely used in omics studies. Pathway-based data integration and visualization is a critical component of the analysis. To address this need, we recently developed a novel R package called Pathview. Pathview maps, integrates and renders a large variety of biological data onto molecular pathway graphs. Here we developed the Pathview Web server, as to make pathway visualization and data integration accessible to all scientists, including those without the special computing skills or resources. Pathview Web features an intuitive graphical web interface and a user centered design. The server not only expands the core functions of Pathview, but also provides many useful features not available in the offline R package. Importantly, the server presents a comprehensive workflow for both regular and integrated pathway analysis of multiple omics data. In addition, the server also provides a RESTful API for programmatic access and conveniently integration in third-party software or workflows. Pathview Web is openly and freely accessible at https://pathview.uncc.edu/. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Nakajima, Tsubasa; Kajihata, Shuichi; Yoshikawa, Katsunori; Matsuda, Fumio; Furusawa, Chikara; Hirasawa, Takashi; Shimizu, Hiroshi
2014-09-01
Cyanobacteria have flexible metabolic capability that enables them to adapt to various environments. To investigate their underlying metabolic regulation mechanisms, we performed an integrated analysis of metabolic flux using transcriptomic and metabolomic data of a cyanobacterium Synechocystis sp. PCC 6803, under mixotrophic and photoheterotrophic conditions. The integrated analysis indicated drastic metabolic flux changes, with much smaller changes in gene expression levels and metabolite concentrations between the conditions, suggesting that the flux change was not caused mainly by the expression levels of the corresponding genes. Under photoheterotrophic conditions, created by the addition of the photosynthesis inhibitor atrazine in mixotrophic conditions, the result of metabolic flux analysis indicated the significant repression of carbon fixation and the activation of the oxidative pentose phosphate pathway (PPP). Moreover, we observed gluconeogenic activity of upstream of glycolysis, which enhanced the flux of the oxidative PPP to compensate for NADPH depletion due to the inhibition of the light reaction of photosynthesis. 'Omics' data suggested that these changes were probably caused by the repression of the gap1 gene, which functions as a control valve in the metabolic network. Since metabolic flux is the outcome of a complicated interplay of cellular components, integrating metabolic flux with other 'omics' layers can identify metabolic changes and narrow down these regulatory mechanisms more effectively. © The Author 2014. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.
A correlated meta-analysis strategy for data mining "OMIC" scans.
Province, Michael A; Borecki, Ingrid B
2013-01-01
Meta-analysis is becoming an increasingly popular and powerful tool to integrate findings across studies and OMIC dimensions. But there is the danger that hidden dependencies between putatively "independent" studies can cause inflation of type I error, due to reinforcement of the evidence from false-positive findings. We present here a simple method for conducting meta-analyses that automatically estimates the degree of any such non-independence between OMIC scans and corrects the inference for it, retaining the proper type I error structure. The method does not require the original data from the source studies, but operates only on summary analysis results from these in OMIC scans. The method is applicable in a wide variety of situations including combining GWAS and or sequencing scan results across studies with dependencies due to overlapping subjects, as well as to scans of correlated traits, in a meta-analysis scan for pleiotropic genetic effects. The method correctly detects which scans are actually independent in which case it yields the traditional meta-analysis, so it may safely be used in all cases, when there is even a suspicion of correlation amongst scans.
Suárez-Ulloa, Victoria; Fernández-Tajes, Juan; Manfrin, Chiara; Gerdol, Marco; Venier, Paola; Eirín-López, José M.
2013-01-01
The extraordinary progress experienced by sequencing technologies and bioinformatics has made the development of omic studies virtually ubiquitous in all fields of life sciences nowadays. However, scientific attention has been quite unevenly distributed throughout the different branches of the tree of life, leaving molluscs, one of the most diverse animal groups, relatively unexplored and without representation within the narrow collection of well established model organisms. Within this Phylum, bivalve molluscs play a fundamental role in the functioning of the marine ecosystem, constitute very valuable commercial resources in aquaculture, and have been widely used as sentinel organisms in the biomonitoring of marine pollution. Yet, it has only been very recently that this complex group of organisms became a preferential subject for omic studies, posing new challenges for their integrative characterization. The present contribution aims to give a detailed insight into the state of the art of the omic studies and functional information analysis of bivalve molluscs, providing a timely perspective on the available data resources and on the current and prospective applications for the biomonitoring of harmful marine compounds. PMID:24189277
Biosignature Discovery for Substance Use Disorders Using Statistical Learning.
Baurley, James W; McMahan, Christopher S; Ervin, Carolyn M; Pardamean, Bens; Bergen, Andrew W
2018-02-01
There are limited biomarkers for substance use disorders (SUDs). Traditional statistical approaches are identifying simple biomarkers in large samples, but clinical use cases are still being established. High-throughput clinical, imaging, and 'omic' technologies are generating data from SUD studies and may lead to more sophisticated and clinically useful models. However, analytic strategies suited for high-dimensional data are not regularly used. We review strategies for identifying biomarkers and biosignatures from high-dimensional data types. Focusing on penalized regression and Bayesian approaches, we address how to leverage evidence from existing studies and knowledge bases, using nicotine metabolism as an example. We posit that big data and machine learning approaches will considerably advance SUD biomarker discovery. However, translation to clinical practice, will require integrated scientific efforts. Copyright © 2017 Elsevier Ltd. All rights reserved.
Wang, Jack P.; Matthews, Megan L.; Williams, Cranos M.; ...
2018-04-20
A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux,more » metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Jack P.; Matthews, Megan L.; Williams, Cranos M.
A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux,more » metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.« less
Wang, Jack P; Matthews, Megan L; Williams, Cranos M; Shi, Rui; Yang, Chenmin; Tunlaya-Anukit, Sermsawat; Chen, Hsi-Chuan; Li, Quanzi; Liu, Jie; Lin, Chien-Yuan; Naik, Punith; Sun, Ying-Hsuan; Loziuk, Philip L; Yeh, Ting-Feng; Kim, Hoon; Gjersing, Erica; Shollenberger, Todd; Shuford, Christopher M; Song, Jina; Miller, Zachary; Huang, Yung-Yun; Edmunds, Charles W; Liu, Baoguang; Sun, Yi; Lin, Ying-Chung Jimmy; Li, Wei; Chen, Hao; Peszlen, Ilona; Ducoste, Joel J; Ralph, John; Chang, Hou-Min; Muddiman, David C; Davis, Mark F; Smith, Chris; Isik, Fikret; Sederoff, Ronald; Chiang, Vincent L
2018-04-20
A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux, metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.
Urich, Tim; Lanzén, Anders; Stokke, Runar; Pedersen, Rolf B; Bayer, Christoph; Thorseth, Ingunn H; Schleper, Christa; Steen, Ida H; Ovreas, Lise
2014-09-01
Deep-sea hydrothermal vents are unique environments on Earth, as they host chemosynthetic ecosystems fuelled by geochemical energy with chemolithoautotrophic microorganisms at the basis of the food webs. Whereas discrete high-temperature venting systems have been studied extensively, the microbiotas associated with low-temperature diffuse venting are not well understood. We analysed the structure and functioning of microbial communities in two diffuse venting sediments from the Jan Mayen vent fields in the Norwegian-Greenland Sea, applying an integrated 'omics' approach combining metatranscriptomics, metaproteomics and metagenomics. Polymerase chain reaction-independent three-domain community profiling showed that the two sediments hosted highly similar communities dominated by Epsilonproteobacteria, Deltaproteobacteria and Gammaproteobacteria, besides ciliates, nematodes and various archaeal taxa. Active metabolic pathways were identified through transcripts and peptides, with genes of sulphur and methane oxidation, and carbon fixation pathways highly expressed, in addition to genes of aerobic and anaerobic (nitrate and sulphate) respiratory chains. High expression of chemotaxis and flagella genes reflected a lifestyle in a dynamic habitat rich in physico-chemical gradients. The major metabolic pathways could be assigned to distinct taxonomic groups, thus enabling hypotheses about the function of the different prokaryotic and eukaryotic taxa. This study advances our understanding of the functioning of microbial communities in diffuse hydrothermal venting sediments. © 2013 Society for Applied Microbiology and John Wiley & Sons Ltd.
Epigenetics and epidemiology: models of study and examples.
van Veldhoven, Karin; Rahman, Shati; Vineis, Paolo
2014-01-01
Epidemiological studies have successfully identified several environmental causes of disease, but often these studies are limited by methodological problems (e.g. lack of sensitivity and specificity in exposure assessment; confounding). Proposed approaches to improve observational studies of environmental associations are Mendelian randomization and the meet-in-the-middle (MITM) approach. The latter uses signals from the growing field of -omics as putative intermediate biomarkers in the pathogenetic process that links exposure with disease. The first part of this approach consists in the association between exposure and disease. The next step consists in the study of the relationship between (biomarkers of) exposure and intermediate -omic biomarkers of early effect; thirdly, the relation between the disease outcome and intermediate -omic biomarkers is assessed. We propose that when an association is found in all three steps it is possible that there is a casual association. One of the associations that have been investigated extensively in the recent years but is not completely understood is that between environmental endocrine disruptors and breast cancer. Here we present an example of how the "meet-in-the-middle" approach can be used to address the role of endocrine disruptors, by reviewing the relevant literature.
Omics Approaches for the Engineering of Pathogen Resistant Plants.
Gomez-Casati, Diego F; Pagani, María A; Busi, María V; Bhadauria, Vijai
2016-01-01
The attack of different pathogens, such as bacteria, fungi and viruses has a negative impact on crop production. In counter such attacks, plants have developed different strategies involving the modification of gene expression, activation of several metabolic pathways and post-translational modification of proteins, which culminate into the accumulation of primary and secondary metabolites implicated in plant defense responses. The recent advancement in omics techniques allows the increase coverage of plants transcriptomes, proteomes and metabolomes during pathogen attack, and the modulation of the response after the infection. Omics techniques also allow us to learn more about the biological cycle of the pathogens in addition to the identification of novel virulence factors in pathogens and their host targets. Both approaches become important to decipher the mechanism underlying pathogen attacks and to develop strategies for improving disease-resistant plants. In this review, we summarize some of the contribution of genomics, transcriptomics, proteomics, metabolomics and metallomics in devising the strategies to obtain plants with increased resistance to pathogens. These approaches constitute important research tools in the development of new technologies for the protection against diseases and increase plant production.
Martyniuk, Christopher J
2018-04-01
Environmental science has benefited a great deal from omics-based technologies. High-throughput toxicology has defined adverse outcome pathways (AOPs), prioritized chemicals of concern, and identified novel actions of environmental chemicals. While many of these approaches are conducted under rigorous laboratory conditions, a significant challenge has been the interpretation of omics data in "real-world" exposure scenarios. Clarity in the interpretation of these data limits their use in environmental monitoring programs. In recent years, one overarching objective of many has been to address fundamental questions concerning experimental design and the robustness of data collected under the broad umbrella of environmental genomics. These questions include: (1) the likelihood that molecular profiles return to a predefined baseline level following remediation efforts, (2) how reference site selection in an urban environment influences interpretation of omics data and (3) what is the most appropriate species to monitor in the environment from an omics point of view. In addition, inter-genomics studies have been conducted to assess transcriptome reproducibility in toxicology studies. One lesson learned from inter-genomics studies is that there are core molecular networks that can be identified by multiple laboratories using the same platform. This supports the idea that "omics-networks" defined a priori may be a viable approach moving forward for evaluating environmental impacts over time. Both spatial and temporal variability in ecosystem structure is expected to influence molecular responses to environmental stressors, and it is important to recognize how these variables, as well as individual factor (i.e. sex, age, maturation), may confound interpretation of network responses to chemicals. This mini-review synthesizes the progress made towards adopting these tools into environmental monitoring and identifies future challenges to be addressed, as we move into the next era of high throughput sequencing. A conceptual framework for validating and incorporating molecular networks into environmental monitoring programs is proposed. As AOPs become more defined and their potential in environmental monitoring assessments becomes more recognized, the AOP framework may prove to be the conduit between omics and penultimate ecological responses for environmental risk assessments. Copyright © 2018 Elsevier B.V. All rights reserved.
Reggiani, Claudio; Coppens, Sandra; Sekhara, Tayeb; Dimov, Ivan; Pichon, Bruno; Lufin, Nicolas; Addor, Marie-Claude; Belligni, Elga Fabia; Digilio, Maria Cristina; Faletra, Flavio; Ferrero, Giovanni Battista; Gerard, Marion; Isidor, Bertrand; Joss, Shelagh; Niel-Bütschi, Florence; Perrone, Maria Dolores; Petit, Florence; Renieri, Alessandra; Romana, Serge; Topa, Alexandra; Vermeesch, Joris Robert; Lenaerts, Tom; Casimir, Georges; Abramowicz, Marc; Bontempi, Gianluca; Vilain, Catheline; Deconinck, Nicolas; Smits, Guillaume
2017-07-19
Tissue-specific integrative omics has the potential to reveal new genic elements important for developmental disorders. Two pediatric patients with global developmental delay and intellectual disability phenotype underwent array-CGH genetic testing, both showing a partial deletion of the DLG2 gene. From independent human and murine omics datasets, we combined copy number variations, histone modifications, developmental tissue-specific regulation, and protein data to explore the molecular mechanism at play. Integrating genomics, transcriptomics, and epigenomics data, we describe two novel DLG2 promoters and coding first exons expressed in human fetal brain. Their murine conservation and protein-level evidence allowed us to produce new DLG2 gene models for human and mouse. These new genic elements are deleted in 90% of 29 patients (public and in-house) showing partial deletion of the DLG2 gene. The patients' clinical characteristics expand the neurodevelopmental phenotypic spectrum linked to DLG2 gene disruption to cognitive and behavioral categories. While protein-coding genes are regarded as well known, our work shows that integration of multiple omics datasets can unveil novel coding elements. From a clinical perspective, our work demonstrates that two new DLG2 promoters and exons are crucial for the neurodevelopmental phenotypes associated with this gene. In addition, our work brings evidence for the lack of cross-annotation in human versus mouse reference genomes and nucleotide versus protein databases.
DOE Office of Scientific and Technical Information (OSTI.GOV)
BIRKEL, GARRETT; GARCIA MARTIN, HECTOR; MORRELL, WILLIAM
"Arrowland" is a web-based software application primarily for mapping, integrating and visualizing a variety of metabolism data of living organisms, including but not limited to metabolomics, proteomics, transcriptomics and fluxomics. This software application makes multi-omics data analysis intuitive and interactive. It improves data sharing and communication by enabling users to visualize their omics data using a web browser (on a PC or mobile device). It increases user's productivity by simplifying multi-omics data analysis using well developed maps as a guide. Users using this tool can gain insights into their data sets that would be difficult or even impossible to teasemore » out by looking at raw number, or using their currently existing toolchains to generate static single-use maps. Arrowland helps users save time by visualizing relative changes in different conditions or over time, and helps users to produce more significant insights faster. Preexisting maps decrease the learning curve for beginners in the omics field. Sets of multi-omics data are presented in the browser, as a two-dimensional flowchart resembling a map, with varying levels of detail information, based on the scaling of the map. Users can pan and zoom to explore different maps, compare maps, upload their own research data sets onto desired maps, alter map appearance in ways that facilitate interpretation, visualization and analysis of the given data, and export data, reports and actionable items to help the user initiative.« less
Pitsiladis, Yannis P; Durussel, Jérôme; Rabin, Olivier
2014-05-01
Administration of recombinant human erythropoietin (rHumanEPO) improves sporting performance and hence is frequently subject to abuse by athletes, although rHumanEPO is prohibited by the WADA. Approaches to detect rHumanEPO doping have improved significantly in recent years but remain imperfect. A new transcriptomic-based longitudinal screening approach is being developed that has the potential to improve the analytical performance of current detection methods. In particular, studies are being funded by WADA to identify a 'molecular signature' of rHumanEPO doping and preliminary results are promising. In the first systematic study to be conducted, the expression of hundreds of genes were found to be altered by rHumanEPO with numerous gene transcripts being differentially expressed after the first injection and further transcripts profoundly upregulated during and subsequently downregulated up to 4 weeks postadministration of the drug; with the same transcriptomic pattern observed in all participants. The identification of a blood 'molecular signature' of rHumanEPO administration is the strongest evidence to date that gene biomarkers have the potential to substantially improve the analytical performance of current antidoping methods such as the Athlete Biological Passport for rHumanEPO detection. Given the early promise of transcriptomics, research using an 'omics'-based approach involving genomics, transcriptomics, proteomics and metabolomics should be intensified in order to achieve improved detection of rHumanEPO and other doping substances and methods difficult to detect such a recombinant human growth hormone and blood transfusions.
Ecosystems Biology Approaches To Determine Key Fitness Traits of Soil Microorganisms
NASA Astrophysics Data System (ADS)
Brodie, E.; Zhalnina, K.; Karaoz, U.; Cho, H.; Nuccio, E. E.; Shi, S.; Lipton, M. S.; Zhou, J.; Pett-Ridge, J.; Northen, T.; Firestone, M.
2014-12-01
The application of theoretical approaches such as trait-based modeling represent powerful tools to explain and perhaps predict complex patterns in microbial distribution and function across environmental gradients in space and time. These models are mostly deterministic and where available are built upon a detailed understanding of microbial physiology and response to environmental factors. However as most soil microorganisms have not been cultivated, for the majority our understanding is limited to insights from environmental 'omic information. Information gleaned from 'omic studies of complex systems should be regarded as providing hypotheses, and these hypotheses should be tested under controlled laboratory conditions if they are to be propagated into deterministic models. In a semi-arid Mediterranean grassland system we are attempting to dissect microbial communities into functional guilds with defined physiological traits and are using a range of 'omics approaches to characterize their metabolic potential and niche preference. Initially, two physiologically relevant time points (peak plant activity and prior to wet-up) were sampled and metagenomes sequenced deeply (600-900 Gbp). Following assembly, differential coverage and nucleotide frequency binning were carried out to yield draft genomes. In addition, using a range of cultivation media we have isolated a broad range of bacteria representing abundant bacterial genotypes and with genome sequences of almost 40 isolates are testing genomic predictions regarding growth rate, temperature and substrate utilization in vitro. This presentation will discuss the opportunities and challenges in parameterizing microbial functional guilds from environmental 'omic information for use in trait-based models.
Systems Biology Approaches for Host–Fungal Interactions: An Expanding Multi-Omics Frontier
Culibrk, Luka; Croft, Carys A.
2016-01-01
Abstract Opportunistic fungal infections are an increasing threat for global health, and for immunocompromised patients in particular. These infections are characterized by interaction between fungal pathogen and host cells. The exact mechanisms and the attendant variability in host and fungal pathogen interaction remain to be fully elucidated. The field of systems biology aims to characterize a biological system, and utilize this knowledge to predict the system's response to stimuli such as fungal exposures. A multi-omics approach, for example, combining data from genomics, proteomics, metabolomics, would allow a more comprehensive and pan-optic “two systems” biology of both the host and the fungal pathogen. In this review and literature analysis, we present highly specialized and nascent methods for analysis of multiple -omes of biological systems, in addition to emerging single-molecule visualization techniques that may assist in determining biological relevance of multi-omics data. We provide an overview of computational methods for modeling of gene regulatory networks, including some that have been applied towards the study of an interacting host and pathogen. In sum, comprehensive characterizations of host–fungal pathogen systems are now possible, and utilization of these cutting-edge multi-omics strategies may yield advances in better understanding of both host biology and fungal pathogens at a systems scale. PMID:26885725
2010-01-01
Background Genetic diversity among wild accessions and cultivars of common bean (Phaseolus vulgaris L.) has been characterized using plant morphology, seed protein allozymes, random amplified polymorphic DNA, restriction fragment length polymorphisms, DNA sequence analysis, chloroplast DNA, and microsatellite markers. Yet, little is known about whether these traits, which distinguish among genetically distinct types of common bean, can be evaluated using omics technologies. Results Three 'omics' approaches: transcriptomics, proteomics, and metabolomics were used to qualitatively evaluate the diversity of common bean from two Centers of Domestication (COD). All three approaches were able to classify common bean according to their COD using unsupervised analyses; these findings are consistent with the hypothesis that differences exist in gene transcription, protein expression, and synthesis and metabolism of small molecules among common bean cultivars representative of different COD. Metabolomic analyses of multiple cultivars within two common bean gene pools revealed cultivar differences in small molecules that were of sufficient magnitude to allow identification of unique cultivar fingerprints. Conclusions Given the high-throughput and low cost of each of these 'omics' platforms, significant opportunities exist for their use in the rapid identification of traits of agronomic and nutritional importance as well as to characterize genetic diversity. PMID:21126341
Kel, AlexanderE
2017-02-01
Computational analysis of master regulators through the search for transcription factor binding sites followed by analysis of signal transduction networks of a cell is a new approach of causal analysis of multi-omics data. This paper contains results on analysis of multi-omics data that include transcriptomics, proteomics and epigenomics data of methotrexate (MTX) resistant colon cancer cell line. The data were used for analysis of mechanisms of resistance and for prediction of potential drug targets and promising compounds for reverting the MTX resistance of these cancer cells. We present all results of the analysis including the lists of identified transcription factors and their binding sites in genome and the list of predicted master regulators - potential drug targets. This data was generated in the study recently published in the article "Multi-omics "Upstream Analysis" of regulatory genomic regions helps identifying targets against methotrexate resistance of colon cancer" (Kel et al., 2016) [4]. These data are of interest for researchers from the field of multi-omics data analysis and for biologists who are interested in identification of novel drug targets against NTX resistance.
Moroz, Leonid L
2015-12-01
The origins of neural systems and centralized brains are one of the major transitions in evolution. These events might occur more than once over 570-600 million years. The convergent evolution of neural circuits is evident from a diversity of unique adaptive strategies implemented by ctenophores, cnidarians, acoels, molluscs, and basal deuterostomes. But, further integration of biodiversity research and neuroscience is required to decipher critical events leading to development of complex integrative and cognitive functions. Here, we outline reference species and interdisciplinary approaches in reconstructing the evolution of nervous systems. In the "omic" era, it is now possible to establish fully functional genomics laboratories aboard of oceanic ships and perform sequencing and real-time analyses of data at any oceanic location (named here as Ship-Seq). In doing so, fragile, rare, cryptic, and planktonic organisms, or even entire marine ecosystems, are becoming accessible directly to experimental and physiological analyses by modern analytical tools. Thus, we are now in a position to take full advantages from countless "experiments" Nature performed for us in the course of 3.5 billion years of biological evolution. Together with progress in computational and comparative genomics, evolutionary neuroscience, proteomic and developmental biology, a new surprising picture is emerging that reveals many ways of how nervous systems evolved. As a result, this symposium provides a unique opportunity to revisit old questions about the origins of biological complexity. © The Author 2015. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology. All rights reserved. For permissions please email: journals.permissions@oup.com.
Meat science: From proteomics to integrated omics towards system biology.
D'Alessandro, Angelo; Zolla, Lello
2013-01-14
Since the main ultimate goal of farm animal raising is the production of proteins for human consumption, research tools to investigate proteins play a major role in farm animal and meat science. Indeed, proteomics has been applied to the field of farm animal science to monitor in vivo performances of livestock animals (growth performances, fertility, milk quality etc.), but also to further our understanding of the molecular processes at the basis of meat quality, which are largely dependent on the post mortem biochemistry of the muscle, often in a species-specific way. Post mortem alterations to the muscle proteome reflect the biological complexity of the process of "muscle to meat conversion," a process that, despite decades of advancements, is all but fully understood. This is mainly due to the enormous amounts of variables affecting meat tenderness per se, including biological factors, such as animal species, breed specific-characteristic, muscle under investigation. However, it is rapidly emerging that the tender meat phenotype is not only tied to genetics (livestock breeding selection), but also to extrinsic factors, such as the rearing environment, feeding conditions, physical activity, administration of hormonal growth promotants, pre-slaughter handling and stress, post mortem handling. From this intricate scenario, biochemical approaches and systems-wide integrated investigations (metabolomics, transcriptomics, interactomics, phosphoproteomics, mathematical modeling), which have emerged as complementary tools to proteomics, have helped establishing a few milestones in our understanding of the events leading from muscle to meat conversion. The growing integration of omics disciplines in the field of systems biology will soon contribute to take further steps forward. Copyright © 2012 Elsevier B.V. All rights reserved.
Cross-Platform Toxicogenomics for the Prediction of Non-Genotoxic Hepatocarcinogenesis in Rat
Metzger, Ute; Templin, Markus F.; Plummer, Simon; Ellinger-Ziegelbauer, Heidrun; Zell, Andreas
2014-01-01
In the area of omics profiling in toxicology, i.e. toxicogenomics, characteristic molecular profiles have previously been incorporated into prediction models for early assessment of a carcinogenic potential and mechanism-based classification of compounds. Traditionally, the biomarker signatures used for model construction were derived from individual high-throughput techniques, such as microarrays designed for monitoring global mRNA expression. In this study, we built predictive models by integrating omics data across complementary microarray platforms and introduced new concepts for modeling of pathway alterations and molecular interactions between multiple biological layers. We trained and evaluated diverse machine learning-based models, differing in the incorporated features and learning algorithms on a cross-omics dataset encompassing mRNA, miRNA, and protein expression profiles obtained from rat liver samples treated with a heterogeneous set of substances. Most of these compounds could be unambiguously classified as genotoxic carcinogens, non-genotoxic carcinogens, or non-hepatocarcinogens based on evidence from published studies. Since mixed characteristics were reported for the compounds Cyproterone acetate, Thioacetamide, and Wy-14643, we reclassified these compounds as either genotoxic or non-genotoxic carcinogens based on their molecular profiles. Evaluating our toxicogenomics models in a repeated external cross-validation procedure, we demonstrated that the prediction accuracy of our models could be increased by joining the biomarker signatures across multiple biological layers and by adding complex features derived from cross-platform integration of the omics data. Furthermore, we found that adding these features resulted in a better separation of the compound classes and a more confident reclassification of the three undefined compounds as non-genotoxic carcinogens. PMID:24830643
TRIENNIAL LACTATION SYMPOSIUM: Nutrigenomics in livestock: Systems biology meets nutrition.
Loor, J J; Vailati-Riboni, M; McCann, J C; Zhou, Z; Bionaz, M
2015-12-01
The advent of high-throughput technologies to study an animal's genome, proteome, and metabolome (i.e., "omics" tools) constituted a setback to the use of reductionism in livestock research. More recent development of "next-generation sequencing" tools was instrumental in allowing in-depth studies of the microbiome in the rumen and other sections of the gastrointestinal tract. Omics, along with bioinformatics, constitutes the foundation of modern systems biology, a field of study widely used in model organisms (e.g., rodents, yeast, humans) to enhance understanding of the complex biological interactions occurring within cells and tissues at the gene, protein, and metabolite level. Application of systems biology concepts is ideal for the study of interactions between nutrition and physiological state with tissue and cell metabolism and function during key life stages of livestock species, including the transition from pregnancy to lactation, in utero development, or postnatal growth. Modern bioinformatic tools capable of discerning functional outcomes and biologically meaningful networks complement the ever-increasing ability to generate large molecular, microbial, and metabolite data sets. Simultaneous visualization of the complex intertissue adaptations to physiological state and nutrition can now be discerned. Studies to understand the linkages between the microbiome and the absorptive epithelium using the integrative approach are emerging. We present examples of new knowledge generated through the application of functional analyses of transcriptomic, proteomic, and metabolomic data sets encompassing nutritional management of dairy cows, pigs, and poultry. Published work to date underscores that the integrative approach across and within tissues may prove useful for fine-tuning nutritional management of livestock. An important goal during this process is to uncover key molecular players involved in the organismal adaptations to nutrition.
A semantic proteomics dashboard (SemPoD) for data management in translational research.
Jayapandian, Catherine P; Zhao, Meng; Ewing, Rob M; Zhang, Guo-Qiang; Sahoo, Satya S
2012-01-01
One of the primary challenges in translational research data management is breaking down the barriers between the multiple data silos and the integration of 'omics data with clinical information to complete the cycle from the bench to the bedside. The role of contextual metadata, also called provenance information, is a key factor ineffective data integration, reproducibility of results, correct attribution of original source, and answering research queries involving "What", "Where", "When", "Which", "Who", "How", and "Why" (also known as the W7 model). But, at present there is limited or no effective approach to managing and leveraging provenance information for integrating data across studies or projects. Hence, there is an urgent need for a paradigm shift in creating a "provenance-aware" informatics platform to address this challenge. We introduce an ontology-driven, intuitive Semantic Proteomics Dashboard (SemPoD) that uses provenance together with domain information (semantic provenance) to enable researchers to query, compare, and correlate different types of data across multiple projects, and allow integration with legacy data to support their ongoing research. The SemPoD platform, currently in use at the Case Center for Proteomics and Bioinformatics (CPB), consists of three components: (a) Ontology-driven Visual Query Composer, (b) Result Explorer, and (c) Query Manager. Currently, SemPoD allows provenance-aware querying of 1153 mass-spectrometry experiments from 20 different projects. SemPod uses the systems molecular biology provenance ontology (SysPro) to support a dynamic query composition interface, which automatically updates the components of the query interface based on previous user selections and efficiently prunes the result set usinga "smart filtering" approach. The SysPro ontology re-uses terms from the PROV-ontology (PROV-O) being developed by the World Wide Web Consortium (W3C) provenance working group, the minimum information required for reporting a molecular interaction experiment (MIMIx), and the minimum information about a proteomics experiment (MIAPE) guidelines. The SemPoD was evaluated both in terms of user feedback and as scalability of the system. SemPoD is an intuitive and powerful provenance ontology-driven data access and query platform that uses the MIAPE and MIMIx metadata guideline to create an integrated view over large-scale systems molecular biology datasets. SemPoD leverages the SysPro ontology to create an intuitive dashboard for biologists to compose queries, explore the results, and use a query manager for storing queries for later use. SemPoD can be deployed over many existing database applications storing 'omics data, including, as illustrated here, the LabKey data-management system. The initial user feedback evaluating the usability and functionality of SemPoD has been very positive and it is being considered for wider deployment beyond the proteomics domain, and in other 'omics' centers.
Omics Approach to Identify Factors Involved in Brassica Disease Resistance.
Francisco, Marta; Soengas, Pilar; Velasco, Pablo; Bhadauria, Vijai; Cartea, Maria E; Rodríguez, Victor M
2016-01-01
Understanding plant's defense mechanisms and their response to biotic stresses is of fundamental meaning for the development of resistant crop varieties and more productive agriculture. The Brassica genus involves a large variety of economically important species and cultivars used as vegetable source, oilseeds, forage and ornamental. Damage caused by pathogens attack affects negatively various aspects of plant growth, development, and crop productivity. Over the last few decades, advances in plant physiology, genetics, and molecular biology have greatly improved our understanding of plant responses to biotic stress conditions. In this regard, various 'omics' technologies enable qualitative and quantitative monitoring of the abundance of various biological molecules in a high-throughput manner, and thus allow determination of their variation between different biological states on a genomic scale. In this review, we have described advances in 'omic' tools (genomics, transcriptomics, proteomics and metabolomics) in the view of conventional and modern approaches being used to elucidate the molecular mechanisms that underlie Brassica disease resistance.
Systems Biology and Biomarkers of Early Effects for Occupational Exposure Limit Setting
DeBord, D. Gayle; Burgoon, Lyle; Edwards, Stephen W.; Haber, Lynne T.; Kanitz, M. Helen; Kuempel, Eileen; Thomas, Russell S.; Yucesoy, Berran
2015-01-01
In a recent National Research Council document, new strategies for risk assessment were described to enable more accurate and quicker assessments.( 1 ) This report suggested that evaluating individual responses through increased use of bio-monitoring could improve dose-response estimations. Identi-fication of specific biomarkers may be useful for diagnostics or risk prediction as they have the potential to improve exposure assessments. This paper discusses systems biology, biomarkers of effect, and computational toxicology approaches and their relevance to the occupational exposure limit setting process. The systems biology approach evaluates the integration of biological processes and how disruption of these processes by chemicals or other hazards affects disease outcomes. This type of approach could provide information used in delineating the mode of action of the response or toxicity, and may be useful to define the low adverse and no adverse effect levels. Biomarkers of effect are changes measured in biological systems and are considered to be preclinical in nature. Advances in computational methods and experimental -omics methods that allow the simultaneous measurement of families of macromolecules such as DNA, RNA, and proteins in a single analysis have made these systems approaches feasible for broad application. The utility of the information for risk assessments from -omics approaches has shown promise and can provide information on mode of action and dose-response relationships. As these techniques evolve, estimation of internal dose and response biomarkers will be a critical test of these new technologies for application in risk assessment strategies. While proof of concept studies have been conducted that provide evidence of their value, challenges with standardization and harmonization still need to be overcome before these methods are used routinely. PMID:26132979
Systems Biology and Biomarkers of Early Effects for Occupational Exposure Limit Setting.
DeBord, D Gayle; Burgoon, Lyle; Edwards, Stephen W; Haber, Lynne T; Kanitz, M Helen; Kuempel, Eileen; Thomas, Russell S; Yucesoy, Berran
2015-01-01
In a recent National Research Council document, new strategies for risk assessment were described to enable more accurate and quicker assessments. This report suggested that evaluating individual responses through increased use of bio-monitoring could improve dose-response estimations. Identification of specific biomarkers may be useful for diagnostics or risk prediction as they have the potential to improve exposure assessments. This paper discusses systems biology, biomarkers of effect, and computational toxicology approaches and their relevance to the occupational exposure limit setting process. The systems biology approach evaluates the integration of biological processes and how disruption of these processes by chemicals or other hazards affects disease outcomes. This type of approach could provide information used in delineating the mode of action of the response or toxicity, and may be useful to define the low adverse and no adverse effect levels. Biomarkers of effect are changes measured in biological systems and are considered to be preclinical in nature. Advances in computational methods and experimental -omics methods that allow the simultaneous measurement of families of macromolecules such as DNA, RNA, and proteins in a single analysis have made these systems approaches feasible for broad application. The utility of the information for risk assessments from -omics approaches has shown promise and can provide information on mode of action and dose-response relationships. As these techniques evolve, estimation of internal dose and response biomarkers will be a critical test of these new technologies for application in risk assessment strategies. While proof of concept studies have been conducted that provide evidence of their value, challenges with standardization and harmonization still need to be overcome before these methods are used routinely.
The Twins Study: NASA's First Foray into 21st Century Omics Research
NASA Technical Reports Server (NTRS)
Kundrot, C. E.; Shelhamer, M.; Scott, G. B. I.
2015-01-01
The full array of 21st century omics-based research methods should be intelligently employed to reduce the health and performance risks that astronauts will be exposed to during exploration missions beyond low Earth Orbit. In March of 2015, US Astronaut Scott Kelly will launch to the International Space Station for a one year mission while his twin brother, Mark Kelly, a retired US Astronaut, remains on the ground. This situation presents an extremely rare flight opportunity to perform an integrated omics-based demonstration pilot study involving identical twin astronauts. A group of 10 principal investigators has been competitively selected, funded, and teamed together to form the Twins Study. A very broad range of biological function are being examined including the genome, epigenome, transcriptome, proteome, metabolome, gut microbiome, immunological response to vaccinations, indicators of atherosclerosis, physiological fluid shifts, and cognition. The plans for the Twins Study and an overview of initial results will be described as well as the technological and ethical issues raised for such spaceflight studies. An anticipated outcome of the Twins Study is that it will place NASA on a trajectory of using omics-based information to develop precision countermeasures for individual astronauts.
Davey, Peter A; Pernice, Mathieu; Sablok, Gaurav; Larkum, Anthony; Lee, Huey Tyng; Golicz, Agnieszka; Edwards, David; Dolferus, Rudy; Ralph, Peter
2016-09-01
Seagrass meadows are disappearing at alarming rates as a result of increasing coastal development and climate change. The emergence of omics and molecular profiling techniques in seagrass research is timely, providing a new opportunity to address such global issues. Whilst these applications have transformed terrestrial plant research, they have only emerged in seagrass research within the past decade; In this time frame we have observed a significant increase in the number of publications in this nascent field, and as of this year the first genome of a seagrass species has been sequenced. In this review, we focus on the development of omics and molecular profiling and the utilization of molecular markers in the field of seagrass biology. We highlight the advances, merits and pitfalls associated with such technology, and importantly we identify and address the knowledge gaps, which to this day prevent us from understanding seagrasses in a holistic manner. By utilizing the powers of omics and molecular profiling technologies in integrated strategies, we will gain a better understanding of how these unique plants function at the molecular level and how they respond to on-going disturbance and climate change events.
Grady, Sarah L; Malfatti, Stephanie A; Gunasekera, Thusitha S; Dalley, Brian K; Lyman, Matt G; Striebich, Richard C; Mayhew, Michael B; Zhou, Carol L; Ruiz, Oscar N; Dugan, Larry C
2017-04-28
Examination of complex biological systems has long been achieved through methodical investigation of the system's individual components. While informative, this strategy often leads to inappropriate conclusions about the system as a whole. With the advent of high-throughput "omic" technologies, however, researchers can now simultaneously analyze an entire system at the level of molecule (DNA, RNA, protein, metabolite) and process (transcription, translation, enzyme catalysis). This strategy reduces the likelihood of improper conclusions, provides a framework for elucidation of genotype-phenotype relationships, and brings finer resolution to comparative genomic experiments. Here, we apply a multi-omic approach to analyze the gene expression profiles of two closely related Pseudomonas aeruginosa strains grown in n-alkanes or glycerol. The environmental P. aeruginosa isolate ATCC 33988 consumed medium-length (C 10 -C 16 ) n-alkanes more rapidly than the laboratory strain PAO1, despite high genome sequence identity (average nucleotide identity >99%). Our data shows that ATCC 33988 induces a characteristic set of genes at the transcriptional, translational and post-translational levels during growth on alkanes, many of which differ from those expressed by PAO1. Of particular interest was the lack of expression from the rhl operon of the quorum sensing (QS) system, resulting in no measurable rhamnolipid production by ATCC 33988. Further examination showed that ATCC 33988 lacked the entire lasI/lasR arm of the QS response. Instead of promoting expression of QS genes, ATCC 33988 up-regulates a small subset of its genome, including operons responsible for specific alkaline proteases and sphingosine metabolism. This work represents the first time results from RNA-seq, microarray, ribosome footprinting, proteomics, and small molecule LC-MS experiments have been integrated to compare gene expression in bacteria. Together, these data provide insights as to why strain ATCC 33988 is better adapted for growth and survival on n-alkanes.
Use of new scientific developments in regulatory risk assessments: challenges and opportunities.
Tarazona, Jose V
2013-07-01
Since the 1990s, science based ecological risk assessments constitute an essential tool for supporting decision making in the regulatory context. Using the European REACH Regulation as example, this article presents the challenges and opportunities for new scientific developments within the area of chemical control and environmental protection. These challenges can be sorted out in 3 main related topics (sets). In the short term, the challenges are directly associated with the regulatory requirements, required for facilitating a scientifically sound implementation of the different obligations for industry and authorities. It is important to mention that although the actual tools are different due to the regulatory requirements, the basic needs are still the same as those addressed in the early 1990s: understanding the ecological relevance of the predicted effects, including the uncertainty, and facilitating the link with the socio-economic assessment. The second set of challenges covers the opportunities for getting an added value from the regulatory efforts. The information compiled through REACH registration and notification processes is analyzed as source for new integrative developments for assessing the combined chemical risk at the regional level. Finally, the article discusses the challenge of inverting the process and developing risk assessment methods focusing on the receptor, the individual or ecosystem, instead of on the stressor or source. These approaches were limited in the past due to the lack of information, but the identification and dissemination of standard information, including uses, manufacturing sites, physical-chemical, environmental, ecotoxicological, and toxicological properties as well as operational conditions and risk management measures for thousands of chemicals, combined by the knowledge gathered through large scale monitoring programs and spatial information systems is generating new opportunities. The challenge is liking predictions and measured data in an integral "-omic type" approach, considering collectively data from different sources and offering a complete assessment of the chemical risk of individuals and ecosystems, with new conceptual approaches that could be defined as "risk-omics based" paradigms and models. Copyright © 2013 SETAC.
Houshyani, Benyamin; van der Krol, Alexander R; Bino, Raoul J; Bouwmeester, Harro J
2014-06-19
Molecular characterization is an essential step of risk/safety assessment of genetically modified (GM) crops. Holistic approaches for molecular characterization using omics platforms can be used to confirm the intended impact of the genetic engineering, but can also reveal the unintended changes at the omics level as a first assessment of potential risks. The potential of omics platforms for risk assessment of GM crops has rarely been used for this purpose because of the lack of a consensus reference and statistical methods to judge the significance or importance of the pleiotropic changes in GM plants. Here we propose a meta data analysis approach to the analysis of GM plants, by measuring the transcriptome distance to untransformed wild-types. In the statistical analysis of the transcriptome distance between GM and wild-type plants, values are compared with naturally occurring transcriptome distances in non-GM counterparts obtained from a database. Using this approach we show that the pleiotropic effect of genes involved in indirect insect defence traits is substantially equivalent to the variation in gene expression occurring naturally in Arabidopsis. Transcriptome distance is a useful screening method to obtain insight in the pleiotropic effects of genetic modification.
Identifying pathological biomarkers: histochemistry still ranks high in the omics era
Pellicciari, C.; Malatesta, M.
2011-01-01
In recent years, omic analyses have been proposed as possible approaches to diagnosis, in particular for tumours, as they should be able to provide quantitative tools to detect and measure abnormalities in gene and protein expression, through the evaluation of transcription and translation products in the abnormal vs normal tissues. Unfortunately, this approach proved to be much less powerful than expected, due to both intrinsic technical limits and the nature itself of the pathological tissues to be investigated, the heterogeneity deriving from polyclonality and tissue phenotype variability between patients being a major limiting factor in the search for unique omic biomarkers. Especially in the last few years, the application of refined techniques for investigating gene expression in situ has greatly increased the diagnostic/prognostic potential of histochemistry, while the progress in light microscopy technology and in the methods for imaging molecules in vivo have provided valuable tools for elucidating the molecular events and the basic mechanisms leading to a pathological condition. Histochemical techniques thus remain irreplaceable in pathologist's armamentarium, and it may be expected that even in the future histochemistry will keep a leading position among the methodological approaches for clinical pathology. PMID:22297448
Software for the Integration of Multiomics Experiments in Bioconductor.
Ramos, Marcel; Schiffer, Lucas; Re, Angela; Azhar, Rimsha; Basunia, Azfar; Rodriguez, Carmen; Chan, Tiffany; Chapman, Phil; Davis, Sean R; Gomez-Cabrero, David; Culhane, Aedin C; Haibe-Kains, Benjamin; Hansen, Kasper D; Kodali, Hanish; Louis, Marie S; Mer, Arvind S; Riester, Markus; Morgan, Martin; Carey, Vince; Waldron, Levi
2017-11-01
Multiomics experiments are increasingly commonplace in biomedical research and add layers of complexity to experimental design, data integration, and analysis. R and Bioconductor provide a generic framework for statistical analysis and visualization, as well as specialized data classes for a variety of high-throughput data types, but methods are lacking for integrative analysis of multiomics experiments. The MultiAssayExperiment software package, implemented in R and leveraging Bioconductor software and design principles, provides for the coordinated representation of, storage of, and operation on multiple diverse genomics data. We provide the unrestricted multiple 'omics data for each cancer tissue in The Cancer Genome Atlas as ready-to-analyze MultiAssayExperiment objects and demonstrate in these and other datasets how the software simplifies data representation, statistical analysis, and visualization. The MultiAssayExperiment Bioconductor package reduces major obstacles to efficient, scalable, and reproducible statistical analysis of multiomics data and enhances data science applications of multiple omics datasets. Cancer Res; 77(21); e39-42. ©2017 AACR . ©2017 American Association for Cancer Research.
Genome Sequencing Technologies and Nursing: What Are the Roles of Nurses and Nurse Scientists?
Taylor, Jacquelyn Y; Wright, Michelle L; Hickey, Kathleen T; Housman, David E
Advances in DNA sequencing technology have resulted in an abundance of personalized data with challenging clinical utility and meaning for clinicians. This wealth of data has potential to dramatically impact the quality of healthcare. Nurses are at the focal point in educating patients regarding relevant healthcare needs; therefore, an understanding of sequencing technology and utilizing these data are critical. The objective of this study was to explicate the role of nurses and nurse scientists as integral members of healthcare teams in improving understanding of DNA sequencing data and translational genomics for patients. A history of the nurse role in newborn screening is used as an exemplar. This study serves as an exemplar on how genome sequencing has been utilized in nursing science and incorporates linkages of other omics approaches used by nurses that are included in this special issue. This special issue showcased nurse scientists conducting multi-omic research from various methods, including targeted candidate genes, pharmacogenomics, proteomics, epigenomics, and the microbiome. From this vantage point, we provide an overview of the roles of nurse scientists in genome sequencing research and provide recommendations for the best utilization of nurses and nurse scientists related to genome sequencing.
Information Commons for Rice (IC4R)
2016-01-01
Rice is the most important staple food for a large part of the world's human population and also a key model organism for plant research. Here, we present Information Commons for Rice (IC4R; http://ic4r.org), a rice knowledgebase featuring adoption of an extensible and sustainable architecture that integrates multiple omics data through community-contributed modules. Each module is developed and maintained by different committed groups, deals with data collection, processing and visualization, and delivers data on-demand via web services. In the current version, IC4R incorporates a variety of rice data through multiple committed modules, including genome-wide expression profiles derived entirely from RNA-Seq data, resequencing-based genomic variations obtained from re-sequencing data of thousands of rice varieties, plant homologous genes covering multiple diverse plant species, post-translational modifications, rice-related literatures and gene annotations contributed by the rice research community. Unlike extant related databases, IC4R is designed for scalability and sustainability and thus also features collaborative integration of rice data and low costs for database update and maintenance. Future directions of IC4R include incorporation of other omics data and association of multiple omics data with agronomically important traits, dedicating to build IC4R into a valuable knowledgebase for both basic and translational researches in rice. PMID:26519466
Ottensmeier, Christian H; Perry, Kate L; Harden, Elena L; Stasakova, Jana; Jenei, Veronika; Fleming, Jason; Wood, Oliver; Woo, Jeongmin; Woelk, Christopher H; Thomas, Gareth J; Thirdborough, Stephen M
2016-07-15
Antibodies that block T-cell-regulatory checkpoints have recently emerged as a transformative approach to cancer treatment. However, the clinical efficacy of checkpoint blockade depends upon inherent tumor immunogenicity, with variation in infiltrating T cells contributing to differences in objective response rates. Here, we sought to understand the molecular correlates of tumor-infiltrating T lymphocytes (TIL) in squamous cell carcinoma (SCC), using a systems biologic approach to integrate publicly available omics datasets with histopathologic features. We provide evidence that links TIL abundance and therapeutic outcome to the regulation of tumor glycolysis by EGFR and HIF, both of which are attractive molecular targets for use in combination with immunotherapeutics. Cancer Res; 76(14); 4136-48. ©2016 AACR. ©2016 American Association for Cancer Research.
Meta-analysis of pathway enrichment: combining independent and dependent omics data sets.
Kaever, Alexander; Landesfeind, Manuel; Feussner, Kirstin; Morgenstern, Burkhard; Feussner, Ivo; Meinicke, Peter
2014-01-01
A major challenge in current systems biology is the combination and integrative analysis of large data sets obtained from different high-throughput omics platforms, such as mass spectrometry based Metabolomics and Proteomics or DNA microarray or RNA-seq-based Transcriptomics. Especially in the case of non-targeted Metabolomics experiments, where it is often impossible to unambiguously map ion features from mass spectrometry analysis to metabolites, the integration of more reliable omics technologies is highly desirable. A popular method for the knowledge-based interpretation of single data sets is the (Gene) Set Enrichment Analysis. In order to combine the results from different analyses, we introduce a methodical framework for the meta-analysis of p-values obtained from Pathway Enrichment Analysis (Set Enrichment Analysis based on pathways) of multiple dependent or independent data sets from different omics platforms. For dependent data sets, e.g. obtained from the same biological samples, the framework utilizes a covariance estimation procedure based on the nonsignificant pathways in single data set enrichment analysis. The framework is evaluated and applied in the joint analysis of Metabolomics mass spectrometry and Transcriptomics DNA microarray data in the context of plant wounding. In extensive studies of simulated data set dependence, the introduced correlation could be fully reconstructed by means of the covariance estimation based on pathway enrichment. By restricting the range of p-values of pathways considered in the estimation, the overestimation of correlation, which is introduced by the significant pathways, could be reduced. When applying the proposed methods to the real data sets, the meta-analysis was shown not only to be a powerful tool to investigate the correlation between different data sets and summarize the results of multiple analyses but also to distinguish experiment-specific key pathways.
A Syst-OMICS Approach to Ensuring Food Safety and Reducing the Economic Burden of Salmonellosis.
Emond-Rheault, Jean-Guillaume; Jeukens, Julie; Freschi, Luca; Kukavica-Ibrulj, Irena; Boyle, Brian; Dupont, Marie-Josée; Colavecchio, Anna; Barrere, Virginie; Cadieux, Brigitte; Arya, Gitanjali; Bekal, Sadjia; Berry, Chrystal; Burnett, Elton; Cavestri, Camille; Chapin, Travis K; Crouse, Alanna; Daigle, France; Danyluk, Michelle D; Delaquis, Pascal; Dewar, Ken; Doualla-Bell, Florence; Fliss, Ismail; Fong, Karen; Fournier, Eric; Franz, Eelco; Garduno, Rafael; Gill, Alexander; Gruenheid, Samantha; Harris, Linda; Huang, Carol B; Huang, Hongsheng; Johnson, Roger; Joly, Yann; Kerhoas, Maud; Kong, Nguyet; Lapointe, Gisèle; Larivière, Line; Loignon, Stéphanie; Malo, Danielle; Moineau, Sylvain; Mottawea, Walid; Mukhopadhyay, Kakali; Nadon, Céline; Nash, John; Ngueng Feze, Ida; Ogunremi, Dele; Perets, Ann; Pilar, Ana V; Reimer, Aleisha R; Robertson, James; Rohde, John; Sanderson, Kenneth E; Song, Lingqiao; Stephan, Roger; Tamber, Sandeep; Thomassin, Paul; Tremblay, Denise; Usongo, Valentine; Vincent, Caroline; Wang, Siyun; Weadge, Joel T; Wiedmann, Martin; Wijnands, Lucas; Wilson, Emily D; Wittum, Thomas; Yoshida, Catherine; Youfsi, Khadija; Zhu, Lei; Weimer, Bart C; Goodridge, Lawrence; Levesque, Roger C
2017-01-01
The Salmonella Syst-OMICS consortium is sequencing 4,500 Salmonella genomes and building an analysis pipeline for the study of Salmonella genome evolution, antibiotic resistance and virulence genes. Metadata, including phenotypic as well as genomic data, for isolates of the collection are provided through the Salmonella Foodborne Syst-OMICS database (SalFoS), at https://salfos.ibis.ulaval.ca/. Here, we present our strategy and the analysis of the first 3,377 genomes. Our data will be used to draw potential links between strains found in fresh produce, humans, animals and the environment. The ultimate goals are to understand how Salmonella evolves over time, improve the accuracy of diagnostic methods, develop control methods in the field, and identify prognostic markers for evidence-based decisions in epidemiology and surveillance.
Villar, Margarita; Mateos-Hernandez, Lourdes; de la Fuente, Jose
2018-03-14
Why an autoimmune disease that is the main cause of the acute neuromuscular paralysis worldwide has not yet a well-characterized cause or an effective treatment? The existence of different clinical variants for the Guillain-Barré syndrome (GBS) coupled with the fact that a high number of pathogens can cause an infection that sometimes, but not always, precedes the development of the syndrome, confers a high degree of uncertainty for both prognosis and treatment. In the post-genomic era, the development of omics technologies for the high-throughput analysis of biological molecules is allowing the characterization of biological systems in a degree of depth unimaginable before. In this context, this work summarize the application of post-genomics technologies to the study of GBS. We performed a structured search of bibliographic databases for peer-reviewed research literature to outline the state of the art with regard the application of post-genomics technologies to the study of GBS. The quality of retrieved papers was assessed using standard tools and thirty-four were included in the review. To date, transcriptomics and proteomics have been the unique post-genomics approaches applied to GBS study. Most of these studies have been performed on cerebrospinal fluid samples and only few studies have been conducted with other samples such as serum, Schwann cells and human peripheral nerve. In the post-genomics era, transcriptomics and proteomics have shown the possibilities that omics technologies can offer for a better understanding of the immunological and pathological mechanisms involved in GBS and the identification of potential biomarkers, but these results have only shown the tip of the iceberg and there is still a long way to exploit the full potential that post-genomics approaches could offer to the study of the GBS. The integration of different omics datasets through a systems biology approach could allow network-based analyses to describe the complexity and functionality of the molecular mechanisms involved in the course of disease facilitating the discovery of novel biomarkers that could be used to improve the diagnosis, predict the disease progression, improve our understanding of the pathology, and serve as therapeutic targets for GBS. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Expanding Omics Resources for Improvement of Soybean Seed Composition Traits
Chaudhary, Juhi; Patil, Gunvant B.; Sonah, Humira; Deshmukh, Rupesh K.; Vuong, Tri D.; Valliyodan, Babu; Nguyen, Henry T.
2015-01-01
Food resources of the modern world are strained due to the increasing population. There is an urgent need for innovative methods and approaches to augment food production. Legume seeds are major resources of human food and animal feed with their unique nutrient compositions including oil, protein, carbohydrates, and other beneficial nutrients. Recent advances in next-generation sequencing (NGS) together with “omics” technologies have considerably strengthened soybean research. The availability of well annotated soybean genome sequence along with hundreds of identified quantitative trait loci (QTL) associated with different seed traits can be used for gene discovery and molecular marker development for breeding applications. Despite the remarkable progress in these technologies, the analysis and mining of existing seed genomics data are still challenging due to the complexity of genetic inheritance, metabolic partitioning, and developmental regulations. Integration of “omics tools” is an effective strategy to discover key regulators of various seed traits. In this review, recent advances in “omics” approaches and their use in soybean seed trait investigations are presented along with the available databases and technological platforms and their applicability in the improvement of soybean. This article also highlights the use of modern breeding approaches, such as genome-wide association studies (GWAS), genomic selection (GS), and marker-assisted recurrent selection (MARS) for developing superior cultivars. A catalog of available important resources for major seed composition traits, such as seed oil, protein, carbohydrates, and yield traits are provided to improve the knowledge base and future utilization of this information in the soybean crop improvement programs. PMID:26635846
Systems biology of human atherosclerosis.
Shalhoub, Joseph; Sikkel, Markus B; Davies, Kerry J; Vorkas, Panagiotis A; Want, Elizabeth J; Davies, Alun H
2014-01-01
Systems biology describes a holistic and integrative approach to understand physiology and pathology. The "omic" disciplines include genomics, transcriptomics, proteomics, and metabolic profiling (metabonomics and metabolomics). By adopting a stance, which is opposing (yet complimentary) to conventional research techniques, systems biology offers an overview by assessing the "net" biological effect imposed by a disease or nondisease state. There are a number of different organizational levels to be understood, from DNA to protein, metabolites, cells, organs and organisms, even beyond this to an organism's context. Systems biology relies on the existence of "nodes" and "edges." Nodes are the constituent part of the system being studied (eg, proteins in the proteome), while the edges are the way these constituents interact. In future, it will be increasingly important to collaborate, collating data from multiple studies to improve data sets, making them freely available and undertaking integrative analyses.
Unraveling snake venom complexity with 'omics' approaches: challenges and perspectives.
Zelanis, André; Tashima, Alexandre Keiji
2014-09-01
The study of snake venom proteomes (venomics) has been experiencing a burst of reports, however the comprehensive knowledge of the dynamic range of proteins present within a single venom, the set of post-translational modifications (PTMs) as well as the lack of a comprehensive database related to venom proteins are among the main challenges in venomics research. The phenotypic plasticity in snake venom proteomes together with their inherent toxin proteoform diversity, points out to the use of integrative analysis in order to better understand their actual complexity. In this regard, such a systems venomics task should encompass the integration of data from transcriptomic and proteomic studies (specially the venom gland proteome), the identification of biological PTMs, and the estimation of artifactual proteomes and peptidomes generated by sample handling procedures. Copyright © 2014 Elsevier Ltd. All rights reserved.
Theophilou, Georgios; Paraskevaidi, Maria; Lima, Kássio M G; Kyrgiou, Maria; Martin-Hirsch, Pierre L; Martin, Francis L
2015-05-01
The complex processes driving cancer have so far impeded the discovery of dichotomous biomarkers associated with its initiation and progression. Reductionist approaches utilizing 'omics' technologies have met some success in identifying molecular alterations associated with carcinogenesis. Systems biology is an emerging science that combines high-throughput investigation techniques to define the dynamic interplay between regulatory biological systems in response to internal and external cues. Vibrational spectroscopy has the potential to play an integral role within systems biology research approaches. It is capable of examining global models of carcinogenesis by scrutinizing chemical bond alterations within molecules. The application of infrared or Raman spectroscopic approaches coupled with computational analysis under the systems biology umbrella can assist the transition of biomarker research from the molecular level to the system level. The comprehensive representation of carcinogenesis as a multilevel biological process will inevitably revolutionize cancer-related healthcare by personalizing risk prediction and prevention.
Stenzinger, Albrecht; Klauschen, Frederick; Wittschieber, Daniel; Weichert, Wilko; Denkert, Carsten; Dietel, Manfred; Roller, Claudio
2010-06-01
Research in pathology spans from merely descriptive work to functional studies, "-omics" approaches and, more recently, systems biology. The work presented here aims at placing pathological research into an epistemological context. Aided by Rudolf Virchow, we give an overview on the philosophy of science including the Wiener Kreis, Popper, Kuhn, Fleck and Rheinberger and demonstrate their implications for routine diagnostics and science in pathology. A focus is on the fields of "-omics" and systems pathology.
Chadeau-Hyam, Marc; Campanella, Gianluca; Jombart, Thibaut; Bottolo, Leonardo; Portengen, Lutzen; Vineis, Paolo; Liquet, Benoit; Vermeulen, Roel C H
2013-08-01
Recent technological advances in molecular biology have given rise to numerous large-scale datasets whose analysis imposes serious methodological challenges mainly relating to the size and complex structure of the data. Considerable experience in analyzing such data has been gained over the past decade, mainly in genetics, from the Genome-Wide Association Study era, and more recently in transcriptomics and metabolomics. Building upon the corresponding literature, we provide here a nontechnical overview of well-established methods used to analyze OMICS data within three main types of regression-based approaches: univariate models including multiple testing correction strategies, dimension reduction techniques, and variable selection models. Our methodological description focuses on methods for which ready-to-use implementations are available. We describe the main underlying assumptions, the main features, and advantages and limitations of each of the models. This descriptive summary constitutes a useful tool for driving methodological choices while analyzing OMICS data, especially in environmental epidemiology, where the emergence of the exposome concept clearly calls for unified methods to analyze marginally and jointly complex exposure and OMICS datasets. Copyright © 2013 Wiley Periodicals, Inc.
High-Performance Mixed Models Based Genome-Wide Association Analysis with omicABEL software.
Fabregat-Traver, Diego; Sharapov, Sodbo Zh; Hayward, Caroline; Rudan, Igor; Campbell, Harry; Aulchenko, Yurii; Bientinesi, Paolo
2014-01-01
To raise the power of genome-wide association studies (GWAS) and avoid false-positive results in structured populations, one can rely on mixed model based tests. When large samples are used, and when multiple traits are to be studied in the 'omics' context, this approach becomes computationally challenging. Here we consider the problem of mixed-model based GWAS for arbitrary number of traits, and demonstrate that for the analysis of single-trait and multiple-trait scenarios different computational algorithms are optimal. We implement these optimal algorithms in a high-performance computing framework that uses state-of-the-art linear algebra kernels, incorporates optimizations, and avoids redundant computations, increasing throughput while reducing memory usage and energy consumption. We show that, compared to existing libraries, our algorithms and software achieve considerable speed-ups. The OmicABEL software described in this manuscript is available under the GNU GPL v. 3 license as part of the GenABEL project for statistical genomics at http: //www.genabel.org/packages/OmicABEL.
Pan, Xinrong; Liu, Huanhuan; Liu, Jiao; Wang, Cheng; Wen, Jianping
2016-12-01
In order to relieve the toxicity of furfural on Rhizopus oryzae fermentation, the molecular mechanism of R. oryzae responding to furfural stress for fumaric acid-production was investigated by omics-based approaches. In metabolomics analysis, 29 metabolites including amino acid, sugars, polyols and fatty acids showed significant changes for maintaining the basic cell metabolism at the cost of lowering fumaric acid production. To further uncover the survival mechanism, lipidomics was carried out, revealing that phosphatidylcholine, phosphatidylglycerol, phosphatidylinositol and polyunsaturated acyl chains might be closely correlated with R. oryzae's adapting to furfural stress. Based on the above omics analysis, lecithin, inositol and soybean oil were exogenously supplemented separately with an optimized concentration in the presence of furfural, which increased fumaric acid titer from 5.78g/L to 10.03g/L, 10.05g/L and 12.13g/L (increased by 73.5%, 73.8% and 110%, respectively). These findings provide a methodological guidance for hemicellulose-fumaric acid development. Copyright © 2016 Elsevier Ltd. All rights reserved.
GWAS in a Box: Statistical and Visual Analytics of Structured Associations via GenAMap
Xing, Eric P.; Curtis, Ross E.; Schoenherr, Georg; Lee, Seunghak; Yin, Junming; Puniyani, Kriti; Wu, Wei; Kinnaird, Peter
2014-01-01
With the continuous improvement in genotyping and molecular phenotyping technology and the decreasing typing cost, it is expected that in a few years, more and more clinical studies of complex diseases will recruit thousands of individuals for pan-omic genetic association analyses. Hence, there is a great need for algorithms and software tools that could scale up to the whole omic level, integrate different omic data, leverage rich structure information, and be easily accessible to non-technical users. We present GenAMap, an interactive analytics software platform that 1) automates the execution of principled machine learning methods that detect genome- and phenome-wide associations among genotypes, gene expression data, and clinical or other macroscopic traits, and 2) provides new visualization tools specifically designed to aid in the exploration of association mapping results. Algorithmically, GenAMap is based on a new paradigm for GWAS and PheWAS analysis, termed structured association mapping, which leverages various structures in the omic data. We demonstrate the function of GenAMap via a case study of the Brem and Kruglyak yeast dataset, and then apply it on a comprehensive eQTL analysis of the NIH heterogeneous stock mice dataset and report some interesting findings. GenAMap is available from http://sailing.cs.cmu.edu/genamap. PMID:24905018
Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits.
Wu, Yang; Zeng, Jian; Zhang, Futao; Zhu, Zhihong; Qi, Ting; Zheng, Zhili; Lloyd-Jones, Luke R; Marioni, Riccardo E; Martin, Nicholas G; Montgomery, Grant W; Deary, Ian J; Wray, Naomi R; Visscher, Peter M; McRae, Allan F; Yang, Jian
2018-03-02
The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.
Omics integrating physical techniques: aged Piedmontese meat analysis.
Lana, Alessandro; Longo, Valentina; Dalmasso, Alessandra; D'Alessandro, Angelo; Bottero, Maria Teresa; Zolla, Lello
2015-04-01
Piedmontese meat tenderness becomes higher by extending the ageing period after slaughter up to 44 days. Classical physical analysis only partially explain this evidence, so in order to discover the reason of the potential beneficial effects of prolonged ageing, we performed omic analysis in the Longissimus thoracis muscle by examining main biochemical changes through mass spectrometry-based metabolomics and proteomics. We observed a progressive decline in myofibrillar structural integrity (underpinning meat tenderness) and impaired energy metabolism. Markers of autophagic responses (e.g. serine and glutathione metabolism) and nitrogen metabolism (urea cycle intermediates) accumulated until the end of the assayed period. Key metabolites such as glutamate, a mediator of the appreciated umami taste of the meat, were found to constantly accumulate until day 44. Finally, statistical analyses revealed that glutamate, serine and arginine could serve as good predictors of ultimate meat quality parameters, even though further studies are mandatory. Copyright © 2014 Elsevier Ltd. All rights reserved.
Big Data Transforms Discovery-Utilization Therapeutics Continuum.
Waldman, S A; Terzic, A
2016-03-01
Enabling omic technologies adopt a holistic view to produce unprecedented insights into the molecular underpinnings of health and disease, in part, by generating massive high-dimensional biological data. Leveraging these systems-level insights as an engine driving the healthcare evolution is maximized through integration with medical, demographic, and environmental datasets from individuals to populations. Big data analytics has accordingly emerged to add value to the technical aspects of storage, transfer, and analysis required for merging vast arrays of omic-, clinical-, and eco-datasets. In turn, this new field at the interface of biology, medicine, and information science is systematically transforming modern therapeutics across discovery, development, regulation, and utilization. © 2015 ASCPT.
Biological Networks for Cancer Candidate Biomarkers Discovery
Yan, Wenying; Xue, Wenjin; Chen, Jiajia; Hu, Guang
2016-01-01
Due to its extraordinary heterogeneity and complexity, cancer is often proposed as a model case of a systems biology disease or network disease. There is a critical need of effective biomarkers for cancer diagnosis and/or outcome prediction from system level analyses. Methods based on integrating omics data into networks have the potential to revolutionize the identification of cancer biomarkers. Deciphering the biological networks underlying cancer is undoubtedly important for understanding the molecular mechanisms of the disease and identifying effective biomarkers. In this review, the networks constructed for cancer biomarker discovery based on different omics level data are described and illustrated from recent advances in the field. PMID:27625573
Chen, Bailing; Wan, Chun; Mehmood, Muhammad Aamer; Chang, Jo-Shu; Bai, Fengwu; Zhao, Xinqing
2017-11-01
Microalgae have promising potential to produce lipids and a variety of high-value chemicals. Suitable stress conditions such as nitrogen starvation and high salinity could stimulate synthesis and accumulation of lipids and high-value products by microalgae, therefore, various stress-modification strategies were developed to manipulate and optimize cultivation processes to enhance bioproduction efficiency. On the other hand, advancements in omics-based technologies have boosted the research to globally understand microalgal gene regulation under stress conditions, which enable further improvement of production efficiency via genetic engineering. Moreover, integration of multi-omics data, synthetic biology design, and genetic engineering manipulations exhibits a tremendous potential in the betterment of microalgal biorefinery. This review discusses the process manipulation strategies and omics studies on understanding the regulation of metabolite biosynthesis under various stressful conditions, and proposes genetic engineering of microalgae to improve bioproduction via manipulating stress tolerance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Network Analysis of Rodent Transcriptomes in Spaceflight
NASA Technical Reports Server (NTRS)
Ramachandran, Maya; Fogle, Homer; Costes, Sylvain
2017-01-01
Network analysis methods leverage prior knowledge of cellular systems and the statistical and conceptual relationships between analyte measurements to determine gene connectivity. Correlation and conditional metrics are used to infer a network topology and provide a systems-level context for cellular responses. Integration across multiple experimental conditions and omics domains can reveal the regulatory mechanisms that underlie gene expression. GeneLab has assembled rich multi-omic (transcriptomics, proteomics, epigenomics, and epitranscriptomics) datasets for multiple murine tissues from the Rodent Research 1 (RR-1) experiment. RR-1 assesses the impact of 37 days of spaceflight on gene expression across a variety of tissue types, such as adrenal glands, quadriceps, gastrocnemius, tibalius anterior, extensor digitorum longus, soleus, eye, and kidney. Network analysis is particularly useful for RR-1 -omics datasets because it reinforces subtle relationships that may be overlooked in isolated analyses and subdues confounding factors. Our objective is to use network analysis to determine potential target nodes for therapeutic intervention and identify similarities with existing disease models. Multiple network algorithms are used for a higher confidence consensus.
A global "imaging'' view on systems approaches in immunology.
Ludewig, Burkhard; Stein, Jens V; Sharpe, James; Cervantes-Barragan, Luisa; Thiel, Volker; Bocharov, Gennady
2012-12-01
The immune system exhibits an enormous complexity. High throughput methods such as the "-omic'' technologies generate vast amounts of data that facilitate dissection of immunological processes at ever finer resolution. Using high-resolution data-driven systems analysis, causal relationships between complex molecular processes and particular immunological phenotypes can be constructed. However, processes in tissues, organs, and the organism itself (so-called higher level processes) also control and regulate the molecular (lower level) processes. Reverse systems engineering approaches, which focus on the examination of the structure, dynamics and control of the immune system, can help to understand the construction principles of the immune system. Such integrative mechanistic models can properly describe, explain, and predict the behavior of the immune system in health and disease by combining both higher and lower level processes. Moving from molecular and cellular levels to a multiscale systems understanding requires the development of methodologies that integrate data from different biological levels into multiscale mechanistic models. In particular, 3D imaging techniques and 4D modeling of the spatiotemporal dynamics of immune processes within lymphoid tissues are central for such integrative approaches. Both dynamic and global organ imaging technologies will be instrumental in facilitating comprehensive multiscale systems immunology analyses as discussed in this review. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Network-Based Integration of Disparate Omic Data To Identify "Silent Players" in Cancer
Ruffalo, Matthew
2015-01-01
Development of high-throughput monitoring technologies enables interrogation of cancer samples at various levels of cellular activity. Capitalizing on these developments, various public efforts such as The Cancer Genome Atlas (TCGA) generate disparate omic data for large patient cohorts. As demonstrated by recent studies, these heterogeneous data sources provide the opportunity to gain insights into the molecular changes that drive cancer pathogenesis and progression. However, these insights are limited by the vast search space and as a result low statistical power to make new discoveries. In this paper, we propose methods for integrating disparate omic data using molecular interaction networks, with a view to gaining mechanistic insights into the relationship between molecular changes at different levels of cellular activity. Namely, we hypothesize that genes that play a role in cancer development and progression may be implicated by neither frequent mutation nor differential expression, and that network-based integration of mutation and differential expression data can reveal these “silent players”. For this purpose, we utilize network-propagation algorithms to simulate the information flow in the cell at a sample-specific resolution. We then use the propagated mutation and expression signals to identify genes that are not necessarily mutated or differentially expressed genes, but have an essential role in tumor development and patient outcome. We test the proposed method on breast cancer and glioblastoma multiforme data obtained from TCGA. Our results show that the proposed method can identify important proteins that are not readily revealed by molecular data, providing insights beyond what can be gleaned by analyzing different types of molecular data in isolation. PMID:26683094
The emerging CHO systems biology era: harnessing the 'omics revolution for biotechnology.
Kildegaard, Helene Faustrup; Baycin-Hizal, Deniz; Lewis, Nathan E; Betenbaugh, Michael J
2013-12-01
Chinese hamster ovary (CHO) cells are the primary factories for biopharmaceuticals because of their capacity to correctly fold and post-translationally modify recombinant proteins compatible with humans. New opportunities are arising to enhance these cell factories, especially since the CHO-K1 cell line was recently sequenced. Now, the CHO systems biology era is underway. Critical 'omics data sets, including proteomics, transcriptomics, metabolomics, fluxomics, and glycomics, are emerging, allowing the elucidation of the molecular basis of CHO cell physiology. The incorporation of these data sets into mathematical models that describe CHO phenotypes will provide crucial biotechnology insights. As 'omics technologies and computational systems biology mature, genome-scale approaches will lead to major innovations in cell line development and metabolic engineering, thereby improving protein production and bioprocessing. Copyright © 2013 Elsevier Ltd. All rights reserved.
Omics-Based Identification of Biomarkers for Nasopharyngeal Carcinoma
2015-01-01
Nasopharyngeal carcinoma (NPC) is a head and neck cancer that is highly found in distinct geographic areas, such as Southeast Asia. The management of NPC remains burdensome as the prognosis is poor due to the late presentation of the disease and the complex nature of NPC pathogenesis. Therefore, it is necessary to find effective molecular markers for early detection and therapeutic measure of NPC. In this paper, the discovery of molecular biomarker for NPC through the emerging omics technologies including genomics, miRNA-omics, transcriptomics, proteomics, and metabolomics will be extensively reviewed. These markers have been shown to play roles in various cellular pathways in NPC progression. The knowledge on their function will help us understand in more detail the complexity in tumor biology, leading to the better strategies for early detection, outcome prediction, detection of disease recurrence, and therapeutic approach. PMID:25999660
Comparative multi-omics systems analysis of Escherichia coli strains B and K-12.
Yoon, Sung Ho; Han, Mee-Jung; Jeong, Haeyoung; Lee, Choong Hoon; Xia, Xiao-Xia; Lee, Dae-Hee; Shim, Ji Hoon; Lee, Sang Yup; Oh, Tae Kwang; Kim, Jihyun F
2012-05-25
Elucidation of a genotype-phenotype relationship is critical to understand an organism at the whole-system level. Here, we demonstrate that comparative analyses of multi-omics data combined with a computational modeling approach provide a framework for elucidating the phenotypic characteristics of organisms whose genomes are sequenced. We present a comprehensive analysis of genome-wide measurements incorporating multifaceted holistic data - genome, transcriptome, proteome, and phenome - to determine the differences between Escherichia coli B and K-12 strains. A genome-scale metabolic network of E. coli B was reconstructed and used to identify genetic bases of the phenotypes unique to B compared with K-12 through in silico complementation testing. This systems analysis revealed that E. coli B is well-suited for production of recombinant proteins due to a greater capacity for amino acid biosynthesis, fewer proteases, and lack of flagella. Furthermore, E. coli B has an additional type II secretion system and a different cell wall and outer membrane composition predicted to be more favorable for protein secretion. In contrast, E. coli K-12 showed a higher expression of heat shock genes and was less susceptible to certain stress conditions. This integrative systems approach provides a high-resolution system-wide view and insights into why two closely related strains of E. coli, B and K-12, manifest distinct phenotypes. Therefore, systematic understanding of cellular physiology and metabolism of the strains is essential not only to determine culture conditions but also to design recombinant hosts.
Comparative multi-omics systems analysis of Escherichia coli strains B and K-12
2012-01-01
Background Elucidation of a genotype-phenotype relationship is critical to understand an organism at the whole-system level. Here, we demonstrate that comparative analyses of multi-omics data combined with a computational modeling approach provide a framework for elucidating the phenotypic characteristics of organisms whose genomes are sequenced. Results We present a comprehensive analysis of genome-wide measurements incorporating multifaceted holistic data - genome, transcriptome, proteome, and phenome - to determine the differences between Escherichia coli B and K-12 strains. A genome-scale metabolic network of E. coli B was reconstructed and used to identify genetic bases of the phenotypes unique to B compared with K-12 through in silico complementation testing. This systems analysis revealed that E. coli B is well-suited for production of recombinant proteins due to a greater capacity for amino acid biosynthesis, fewer proteases, and lack of flagella. Furthermore, E. coli B has an additional type II secretion system and a different cell wall and outer membrane composition predicted to be more favorable for protein secretion. In contrast, E. coli K-12 showed a higher expression of heat shock genes and was less susceptible to certain stress conditions. Conclusions This integrative systems approach provides a high-resolution system-wide view and insights into why two closely related strains of E. coli, B and K-12, manifest distinct phenotypes. Therefore, systematic understanding of cellular physiology and metabolism of the strains is essential not only to determine culture conditions but also to design recombinant hosts. PMID:22632713
Kraniotou, Christina; Karadima, Vasiliki; Bellos, George; Tsangaris, George Th
2018-03-05
The global incidence of metabolic disorders like type 2 diabetes mellitus (DM2) has assumed epidemic proportions, leading to adverse health and socio-economic impacts. It is therefore of critical importance the early diagnosis of DM2 patients and the detection of those at increased risk of disease. In this respect, Precision Medicine (PM) is an emerging approach that includes practices, tests, decisions and treatments adapted to the characteristics of each patient. With regard to DM2, PM manages a wealth of "omics" data (genomic, metabolic, proteomic, environmental, clinical and paraclinical) to increase the number of clinically validated biomarkers in order to identify patients in early stage even before the prediabetic phase. In this paper, we discuss the epidemic dimension of metabolic disorders like type 2 diabetes mellitus (DM2) and the urgent demand for novel biomarkers to reduce the incidence or even delay the onset of DM2. Recent research data produced by "multi-omics" technologies (genomics/epigenomics, transcriptomics, proteomics and metabolomics), suggest that many potential biomarkers might be helpful in the prediction and early diagnosis of DM2. Predictive, Preventive and Personalized Medicine (PPPM) manages and integrates these data to apply personalized, preventive, and therapeutic approaches. This is significant because there is an emerging need for establishing channels for communication and personalized consultation between systems research and precision medicine, as the medicine of the future. Copyright © 2018 Elsevier B.V. All rights reserved.
Omics Advances in Ecotoxicology.
Zhang, Xiaowei; Xia, Pu; Wang, Pingping; Yang, Jianghu; Baird, Donald J
2018-04-03
Toxic substances in the environment generate adverse effects at all levels of biological organization from the molecular level to community and ecosystem. Given this complexity, it is not surprising that ecotoxicologists have struggled to address the full consequences of toxic substance release at ecosystem level, due to the limits of observational and experimental tools to reveal the changes in deep structure at different levels of organization. -Omics technologies, consisting of genomics and ecogenomics, have the power to reveal, in unprecedented detail, the cellular processes of an individual or biodiversity of a community in response to environmental change with high sample/observation throughput. This represents a historic opportunity to transform the way we study toxic substances in ecosystems, through direct linkage of ecological effects with the systems biology of organisms. Three recent examples of -omics advance in the assessment of toxic substances are explored here: (1) the use of functional genomics in the discovery of novel molecular mechanisms of toxicity of chemicals in the environment; (2) the development of laboratory pipelines of dose-dependent, reduced transcriptomics to support high-throughput chemical testing at the biological pathway level; and (3) the use of eDNA metabarcoding approaches for assessing chemical effects on biological communities in mesocosm experiments and through direct observation in field monitoring. -Omics advances in ecotoxicological studies not only generate new knowledge regarding mechanisms of toxicity and environmental effect, improving the relevance and immediacy of laboratory toxicological assessment, but can provide a wholly new paradigm for ecotoxicology by linking ecological models to mechanism-based, systems biology approaches.
Translating standards into practice - one Semantic Web API for Gene Expression.
Deus, Helena F; Prud'hommeaux, Eric; Miller, Michael; Zhao, Jun; Malone, James; Adamusiak, Tomasz; McCusker, Jim; Das, Sudeshna; Rocca Serra, Philippe; Fox, Ronan; Marshall, M Scott
2012-08-01
Sharing and describing experimental results unambiguously with sufficient detail to enable replication of results is a fundamental tenet of scientific research. In today's cluttered world of "-omics" sciences, data standards and standardized use of terminologies and ontologies for biomedical informatics play an important role in reporting high-throughput experiment results in formats that can be interpreted by both researchers and analytical tools. Increasing adoption of Semantic Web and Linked Data technologies for the integration of heterogeneous and distributed health care and life sciences (HCLSs) datasets has made the reuse of standards even more pressing; dynamic semantic query federation can be used for integrative bioinformatics when ontologies and identifiers are reused across data instances. We present here a methodology to integrate the results and experimental context of three different representations of microarray-based transcriptomic experiments: the Gene Expression Atlas, the W3C BioRDF task force approach to reporting Provenance of Microarray Experiments, and the HSCI blood genomics project. Our approach does not attempt to improve the expressivity of existing standards for genomics but, instead, to enable integration of existing datasets published from microarray-based transcriptomic experiments. SPARQL Construct is used to create a posteriori mappings of concepts and properties and linking rules that match entities based on query constraints. We discuss how our integrative approach can encourage reuse of the Experimental Factor Ontology (EFO) and the Ontology for Biomedical Investigations (OBIs) for the reporting of experimental context and results of gene expression studies. Copyright © 2012 Elsevier Inc. All rights reserved.
Tonon, Thierry; Eveillard, Damien; Prigent, Sylvain; Bourdon, Jérémie; Potin, Philippe; Boyen, Catherine; Siegel, Anne
2011-12-01
Brown algae belong to a phylogenetic lineage distantly related to land plants and animals. They are almost exclusively found in the intertidal zone, a harsh and frequently changing environment where organisms are submitted to marine and terrestrial constraints. In relation with their unique evolutionary history and their habitat, they feature several peculiarities, including at the level of their primary and secondary metabolism. The establishment of Ectocarpus siliculosus as a model organism for brown algae has represented a framework in which several omics techniques have been developed, in particular, to study the response of these organisms to abiotic stresses. With the recent publication of medium to high throughput profiling data, it is now possible to envision integrating observations at the cellular scale to apply systems biology approaches. As a first step, we propose a protocol focusing on integrating heterogeneous knowledge gained on brown algal metabolism. The resulting abstraction of the system will then help understanding how brown algae cope with changes in abiotic parameters within their unique habitat, and to decipher some of the mechanisms underlying their (1) acclimation and (2) adaptation, respectively consequences of (1) the behavior or (2) the topology of the system resulting from the integrative approach.
The Air Force In Silico -- Computational Biology in 2025
2007-11-01
and chromosome) these new fields are commonly referred to as “~omics.” Proteomics , transcriptomics, metabolomics , epigenomics, physiomics... Bioinformatics , 2006, http://journal.imbio.de/ http://www-bm.ipk-gatersleben.de/stable/php/ journal /articles/pdf/jib-22.pdf (accessed 30 September...Chirino, G. Tansley and I. Dryden, “The implications for Bioinformatics of integration across physical scales,” Journal of Integrative Bioinformatics
Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction
Luo, Yuan; Ahmad, Faraz S.; Shah, Sanjiv J.
2017-01-01
Heart failure with preserved ejection fraction (HFpEF) is a heterogeneous clinical syndrome that may benefit from improved subtyping in order to better characterize its pathophysiology and to develop novel targeted therapies. The United States Precision Medicine Initiative comes amid the rapid growth in quantity and modality of clinical data for HFpEF patients ranging from deep phenotypic to trans-omic data. Tensor factorization, a form of machine learning, allows for the integration of multiple data modalities to derive clinically relevant HFpEF subtypes that may have significant differences in underlying pathophysiology and differential response to therapies. Tensor factorization also allows for better interpretability by supporting dimensionality reduction and identifying latent groups of data for meaningful summarization of both features and disease outcomes. In this narrative review, we analyze the modest literature on the application of tensor factorization to related biomedical fields including genotyping and phenotyping. Based on the cited work including work of our own, we suggest multiple tensor factorization formulations capable of integrating the deep phenotypic and trans-omic modalities of data for HFpEF, or accounting for interactions between genetic variants at different -omic hierarchies. We encourage extensive experimental studies to tackle challenges in applying tensor factorization for precision medicine in HFpEF, including effectively incorporating existing medical knowledge, properly accounting for uncertainty, and efficiently enforcing sparsity for better interpretability. PMID:28116551
The Role of Omics in the Application of Adverse Outcome Pathways for Chemical Risk Assessment.
Brockmeier, Erica K; Hodges, Geoff; Hutchinson, Thomas H; Butler, Emma; Hecker, Markus; Tollefsen, Knut Erik; Garcia-Reyero, Natalia; Kille, Peter; Becker, Dörthe; Chipman, Kevin; Colbourne, John; Collette, Timothy W; Cossins, Andrew; Cronin, Mark; Graystock, Peter; Gutsell, Steve; Knapen, Dries; Katsiadaki, Ioanna; Lange, Anke; Marshall, Stuart; Owen, Stewart F; Perkins, Edward J; Plaistow, Stewart; Schroeder, Anthony; Taylor, Daisy; Viant, Mark; Ankley, Gerald; Falciani, Francesco
2017-08-01
In conjunction with the second International Environmental Omics Symposium (iEOS) conference, held at the University of Liverpool (United Kingdom) in September 2014, a workshop was held to bring together experts in toxicology and regulatory science from academia, government and industry. The purpose of the workshop was to review the specific roles that high-content omics datasets (eg, transcriptomics, metabolomics, lipidomics, and proteomics) can hold within the adverse outcome pathway (AOP) framework for supporting ecological and human health risk assessments. In light of the growing number of examples of the application of omics data in the context of ecological risk assessment, we considered how omics datasets might continue to support the AOP framework. In particular, the role of omics in identifying potential AOP molecular initiating events and providing supportive evidence of key events at different levels of biological organization and across taxonomic groups was discussed. Areas with potential for short and medium-term breakthroughs were also discussed, such as providing mechanistic evidence to support chemical read-across, providing weight of evidence information for mode of action assignment, understanding biological networks, and developing robust extrapolations of species-sensitivity. Key challenges that need to be addressed were considered, including the need for a cohesive approach towards experimental design, the lack of a mutually agreed framework to quantitatively link genes and pathways to key events, and the need for better interpretation of chemically induced changes at the molecular level. This article was developed to provide an overview of ecological risk assessment process and a perspective on how high content molecular-level datasets can support the future of assessment procedures through the AOP framework. © The Author 2017. Published by Oxford University Press on behalf of the Society of Toxicology.
The Role of Omics in the Application of Adverse Outcome Pathways for Chemical Risk Assessment
Brockmeier, Erica K.; Hodges, Geoff; Hutchinson, Thomas H.; Butler, Emma; Hecker, Markus; Tollefsen, Knut Erik; Garcia-Reyero, Natalia; Kille, Peter; Becker, Dörthe; Chipman, Kevin; Colbourne, John; Collette, Timothy W.; Cossins, Andrew; Cronin, Mark; Graystock, Peter; Gutsell, Steve; Knapen, Dries; Katsiadaki, Ioanna; Lange, Anke; Marshall, Stuart; Owen, Stewart F.; Perkins, Edward J.; Plaistow, Stewart; Schroeder, Anthony; Taylor, Daisy; Viant, Mark; Ankley, Gerald; Falciani, Francesco
2017-01-01
Abstract In conjunction with the second International Environmental Omics Symposium (iEOS) conference, held at the University of Liverpool (United Kingdom) in September 2014, a workshop was held to bring together experts in toxicology and regulatory science from academia, government and industry. The purpose of the workshop was to review the specific roles that high-content omics datasets (eg, transcriptomics, metabolomics, lipidomics, and proteomics) can hold within the adverse outcome pathway (AOP) framework for supporting ecological and human health risk assessments. In light of the growing number of examples of the application of omics data in the context of ecological risk assessment, we considered how omics datasets might continue to support the AOP framework. In particular, the role of omics in identifying potential AOP molecular initiating events and providing supportive evidence of key events at different levels of biological organization and across taxonomic groups was discussed. Areas with potential for short and medium-term breakthroughs were also discussed, such as providing mechanistic evidence to support chemical read-across, providing weight of evidence information for mode of action assignment, understanding biological networks, and developing robust extrapolations of species-sensitivity. Key challenges that need to be addressed were considered, including the need for a cohesive approach towards experimental design, the lack of a mutually agreed framework to quantitatively link genes and pathways to key events, and the need for better interpretation of chemically induced changes at the molecular level. This article was developed to provide an overview of ecological risk assessment process and a perspective on how high content molecular-level datasets can support the future of assessment procedures through the AOP framework. PMID:28525648
Thomas, Geraldine; Unger, Kristian; Krznaric, Marko; Galpine, Angela; Bethel, Jackie; Tomlinson, Christopher; Woodbridge, Mark; Butcher, Sarah
2012-01-01
The only unequivocal radiological effect of the Chernobyl accident on human health is the increase in thyroid cancer in those exposed in childhood or early adolescence. In response to the scientific interest in studying the molecular biology of thyroid cancer post Chernobyl, the Chernobyl Tissue Bank (CTB: www.chernobyltissuebank.com) was established in 1998. Thus far it is has collected biological samples from 3,861 individuals, and provided 27 research projects with 11,254 samples. The CTB was designed from its outset as a resource to promote the integration of research and clinical data to facilitate a systems biology approach to radiation related thyroid cancer. The project has therefore developed as a multidisciplinary collaboration between clinicians, dosimetrists, molecular biologists and bioinformaticians and serves as a paradigm for tissue banking in the omics era. PMID:24704918
Integrated pest management in western flower thrips: past, present and future.
Mouden, Sanae; Sarmiento, Kryss Facun; Klinkhamer, Peter Gl; Leiss, Kirsten A
2017-05-01
Western flower thrips (WFT) is one of the most economically important pest insects of many crops worldwide. Recent EU legislation has caused a dramatic shift in pest management strategies, pushing for tactics that are less reliable on chemicals. The development of alternative strategies is therefore an issue of increasing urgency. This paper reviews the main control tactics in integrated pest management (IPM) of WFT, with the focus on biological control and host plant resistance as areas of major progress. Knowledge gaps are identified and innovative approaches emphasised, highlighting the advances in 'omics' technologies. Successful programmes are most likely generated when preventive and therapeutic strategies with mutually beneficial, cost-effective and environmentally sound foundations are incorporated. © 2017 The Authors. Pest Management Science published by John Wiley & Sons Ltd on behalf of Society of Chemical Industry. © 2017 The Authors. Pest Management Science published by John Wiley & Sons Ltd on behalf of Society of Chemical Industry.
A knowledge network for a dynamic taxonomy of psychiatric disease.
Krishnan, Ranga R
2015-03-01
Current taxonomic approaches in medicine and psychiatry are limited in validity and utility. They do serve simple communication purposes for medical coding, teaching, and reimbursement, but they are not suited for the modern era with its rapid explosion of knowledge from the "omics" revolution. The National Academy of Sciences published a report entitled Toward Precision Medicine: Building a Knowledge Network for Biomedical Research and a New Taxonomy of Disease. The authors advocate a new taxonomy that would integrate molecular data, clinical data, and health outcomes in a dynamic, iterative fashion, bringing together research, public health, and health-care delivery with the interlinked goals of advancing our understanding of disease pathogenesis and thereby improving health. As the need for an information hub and a knowledge network with a dynamic taxonomy based on integration of clinical and research data is vital, and timely, this proposal merits consideration.
Oita, Azusa; Tsuboi, Yuuri; Date, Yasuhiro; Oshima, Takahiro; Sakata, Kenji; Yokoyama, Akiko; Moriya, Shigeharu; Kikuchi, Jun
2018-04-24
There is an increasing need for assessing aquatic ecosystems that are globally endangered. Since aquatic ecosystems are complex, integrated consideration of multiple factors utilizing omics technologies can help us better understand aquatic ecosystems. An integrated strategy linking three analytical (machine learning, factor mapping, and forecast-error-variance decomposition) approaches for extracting the features of surface water from datasets comprising ions, metabolites, and microorganisms is proposed herein. The three developed approaches can be employed for diverse datasets of sample sizes and experimentally analyzed factors. The three approaches are applied to explore the features of bay water surrounding Odaiba, Tokyo, Japan, as a case study. Firstly, the machine learning approach separated 681 surface water samples within Japan into three clusters, categorizing Odaiba water into seawater with relatively low inorganic ions, including Mg, Ba, and B. Secondly, the factor mapping approach illustrated Odaiba water samples from the summer as rich in multiple amino acids and some other metabolites and poor in inorganic ions relative to other seasons based on their seasonal dynamics. Finally, forecast-error-variance decomposition using vector autoregressive models indicated that a type of microalgae (Raphidophyceae) grows in close correlation with alanine, succinic acid, and valine on filters and with isobutyric acid and 4-hydroxybenzoic acid in filtrate, Ba, and average wind speed. Our integrated strategy can be used to examine many biological, chemical, and environmental physical factors to analyze surface water. Copyright © 2018. Published by Elsevier B.V.
Pey, Jon; Valgepea, Kaspar; Rubio, Angel; Beasley, John E; Planes, Francisco J
2013-12-08
The study of cellular metabolism in the context of high-throughput -omics data has allowed us to decipher novel mechanisms of importance in biotechnology and health. To continue with this progress, it is essential to efficiently integrate experimental data into metabolic modeling. We present here an in-silico framework to infer relevant metabolic pathways for a particular phenotype under study based on its gene/protein expression data. This framework is based on the Carbon Flux Path (CFP) approach, a mixed-integer linear program that expands classical path finding techniques by considering additional biophysical constraints. In particular, the objective function of the CFP approach is amended to account for gene/protein expression data and influence obtained paths. This approach is termed integrative Carbon Flux Path (iCFP). We show that gene/protein expression data also influences the stoichiometric balancing of CFPs, which provides a more accurate picture of active metabolic pathways. This is illustrated in both a theoretical and real scenario. Finally, we apply this approach to find novel pathways relevant in the regulation of acetate overflow metabolism in Escherichia coli. As a result, several targets which could be relevant for better understanding of the phenomenon leading to impaired acetate overflow are proposed. A novel mathematical framework that determines functional pathways based on gene/protein expression data is presented and validated. We show that our approach is able to provide new insights into complex biological scenarios such as acetate overflow in Escherichia coli.
The contribution of genetics and environment to obesity.
Albuquerque, David; Nóbrega, Clévio; Manco, Licínio; Padez, Cristina
2017-09-01
Obesity is a global health problem mainly attributed to lifestyle changes such as diet, low physical activity or socioeconomics factors. However, several evidences consistently showed that genetics contributes significantly to the weight-gain susceptibility. A systematic literature search of most relevant original, review and meta-analysis, restricted to English was conducted in PubMed, Web of Science and Google scholar up to May 2017 concerning the contribution of genetics and environmental factors to obesity. Several evidences suggest that obesogenic environments contribute to the development of an obese phenotype. However, not every individual from the same population, despite sharing the same obesogenic environment, develop obesity. After more than 10 years of investigation on the genetics of obesity, the variants found associated with obesity represent only 3% of the estimated BMI-heritability, which is around 47-80%. Moreover, genetic factors per se were unable to explain the rapid spread of obesity prevalence. The integration of multi-omics data enables scientists having a better picture and to elucidate unknown pathways contributing to obesity. New studies based on case-control or gene candidate approach will be important to identify new variants associated with obesity susceptibility and consequently unveiling its genetic architecture. This will lead to an improvement of our understanding about underlying mechanisms involved in development and origin of the actual obesity epidemic. The integration of several omics will also provide insights about the interplay between genes and environments contributing to the obese phenotype. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
CombiROC: an interactive web tool for selecting accurate marker combinations of omics data.
Mazzara, Saveria; Rossi, Riccardo L; Grifantini, Renata; Donizetti, Simone; Abrignani, Sergio; Bombaci, Mauro
2017-03-30
Diagnostic accuracy can be improved considerably by combining multiple markers, whose performance in identifying diseased subjects is usually assessed via receiver operating characteristic (ROC) curves. The selection of multimarker signatures is a complicated process that requires integration of data signatures with sophisticated statistical methods. We developed a user-friendly tool, called CombiROC, to help researchers accurately determine optimal markers combinations from diverse omics methods. With CombiROC data from different domains, such as proteomics and transcriptomics, can be analyzed using sensitivity/specificity filters: the number of candidate marker panels rising from combinatorial analysis is easily optimized bypassing limitations imposed by the nature of different experimental approaches. Leaving to the user full control on initial selection stringency, CombiROC computes sensitivity and specificity for all markers combinations, performances of best combinations and ROC curves for automatic comparisons, all visualized in a graphic interface. CombiROC was designed without hard-coded thresholds, allowing a custom fit to each specific data: this dramatically reduces the computational burden and lowers the false negative rates given by fixed thresholds. The application was validated with published data, confirming the marker combination already originally described or even finding new ones. CombiROC is a novel tool for the scientific community freely available at http://CombiROC.eu.
The human gutome: nutrigenomics of the host-microbiome interactions.
Dimitrov, Dimiter V
2011-01-01
Demonstrating the importance of the gut microbiota in human health and well-being represents a major transformational task in both medical and nutritional research. Owing to the high-throughput -omics methodologies, the complexity, evolution with age, and individual nature of the gut microflora have been more thoroughly investigated. The balance between this complex community of gut bacteria, food nutrients, and intestinal genomic and physiological milieu is increasingly recognized as a major contributor to human health and disease. This article discusses the "gutome," that is, nutritional systems biology of gut microbiome and host-microbiome interactions. We examine the novel ways in which the study of the human gutome, and nutrigenomics more generally, can have translational and transformational impacts in 21st century practice of biomedicine. We describe the clinical context in which experimental methodologies, as well as data-driven and process-driven approaches are being utilized in nutrigenomics and microbiome research. We underscore the pivotal importance of the gutome as a common platform for sharing data in the emerging field of the integrated metagenomics of gut pathophysiology. This vision needs to be articulated in a manner that recognizes both the omics biotechnology nuances and the ways in which nutrigenomics science can effectively inform population health and public policy, and vice versa.
Molecular signatures from omics data: from chaos to consensus.
Sung, Jaeyun; Wang, Yuliang; Chandrasekaran, Sriram; Witten, Daniela M; Price, Nathan D
2012-08-01
In the past 15 years, new "omics" technologies have made it possible to obtain high-resolution molecular snapshots of organisms, tissues, and even individual cells at various disease states and experimental conditions. It is hoped that these developments will usher in a new era of personalized medicine in which an individual's molecular measurements are used to diagnose disease, guide therapy, and perform other tasks more accurately and effectively than is possible using standard approaches. There now exists a vast literature of reported "molecular signatures". However, despite some notable exceptions, many of these signatures have suffered from limited reproducibility in independent datasets, insufficient sensitivity or specificity to meet clinical needs, or other challenges. In this paper, we discuss the process of molecular signature discovery on the basis of omics data. In particular, we highlight potential pitfalls in the discovery process, as well as strategies that can be used to increase the odds of successful discovery. Despite the difficulties that have plagued the field of molecular signature discovery, we remain optimistic about the potential to harness the vast amounts of available omics data in order to substantially impact clinical practice. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Perspective for Aquaponic Systems: "Omic" Technologies for Microbial Community Analysis.
Munguia-Fragozo, Perla; Alatorre-Jacome, Oscar; Rico-Garcia, Enrique; Torres-Pacheco, Irineo; Cruz-Hernandez, Andres; Ocampo-Velazquez, Rosalia V; Garcia-Trejo, Juan F; Guevara-Gonzalez, Ramon G
2015-01-01
Aquaponics is the combined production of aquaculture and hydroponics, connected by a water recirculation system. In this productive system, the microbial community is responsible for carrying out the nutrient dynamics between the components. The nutrimental transformations mainly consist in the transformation of chemical species from toxic compounds into available nutrients. In this particular field, the microbial research, the "Omic" technologies will allow a broader scope of studies about a current microbial profile inside aquaponics community, even in those species that currently are unculturable. This approach can also be useful to understand complex interactions of living components in the system. Until now, the analog studies were made to set up the microbial characterization on recirculation aquaculture systems (RAS). However, microbial community composition of aquaponics is still unknown. "Omic" technologies like metagenomic can help to reveal taxonomic diversity. The perspectives are also to begin the first attempts to sketch the functional diversity inside aquaponic systems and its ecological relationships. The knowledge of the emergent properties inside the microbial community, as well as the understanding of the biosynthesis pathways, can derive in future biotechnological applications. Thus, the aim of this review is to show potential applications of current "Omic" tools to characterize the microbial community in aquaponic systems.
Tsurumaki, M; Kotake, M; Iwasaki, M; Saito, M; Tanaka, K; Aw, W; Fukuda, S; Tomita, M
2015-01-01
Inulin, a natural renewable polysaccharide resource produced by various plants in nature, has been reported to possess a significant number of diverse pharmaceutical and food applications. Recently, there has been rapid progress in high-throughput technologies and platforms to assay global mRNA, proteins, metabolites and gut microbiota. In this review, we will describe the current status of utilizing omics technologies of elucidating the impact of inulin and inulin-containing prebiotics at the transcriptome, proteome, metabolome and gut microbiome levels. Although many studies in this review have addressed the impact of inulin comprehensively, these omics technologies only enable us to understand physiological information at each different stage of mRNA, protein, metabolite and gut microbe. We believe that a synergistic approach is vital in order to fully illustrate the intricate beauty behind the relatively modest influence of food factors like inulin on host health. PMID:26619369
Omics databases on kidney disease: where they can be found and how to benefit from them.
Papadopoulos, Theofilos; Krochmal, Magdalena; Cisek, Katryna; Fernandes, Marco; Husi, Holger; Stevens, Robert; Bascands, Jean-Loup; Schanstra, Joost P; Klein, Julie
2016-06-01
In the recent decades, the evolution of omics technologies has led to advances in all biological fields, creating a demand for effective storage, management and exchange of rapidly generated data and research discoveries. To address this need, the development of databases of experimental outputs has become a common part of scientific practice in order to serve as knowledge sources and data-sharing platforms, providing information about genes, transcripts, proteins or metabolites. In this review, we present omics databases available currently, with a special focus on their application in kidney research and possibly in clinical practice. Databases are divided into two categories: general databases with a broad information scope and kidney-specific databases distinctively concentrated on kidney pathologies. In research, databases can be used as a rich source of information about pathophysiological mechanisms and molecular targets. In the future, databases will support clinicians with their decisions, providing better and faster diagnoses and setting the direction towards more preventive, personalized medicine. We also provide a test case demonstrating the potential of biological databases in comparing multi-omics datasets and generating new hypotheses to answer a critical and common diagnostic problem in nephrology practice. In the future, employment of databases combined with data integration and data mining should provide powerful insights into unlocking the mysteries of kidney disease, leading to a potential impact on pharmacological intervention and therapeutic disease management.
Shen, Yao; Stanislauskas, Milda; Li, Gen; Zheng, Deyou; Liu, Liang
2017-01-01
To elucidate the complex molecular mechanisms underlying the adverse effects UV radiation (UVR) on skin homeostasis, we performed multi-omics studies to characterize UV-induced genetic and epigenetic changes. Human keratinocytes from a single donor treated with or without UVR were analyzed by RNA-seq, exome-seq, and H3K27ac ChIP-seq at 4 h and 72 h following UVR. Compared to the relatively moderate mutagenic effects of UVR, acute UV exposure induced substantial epigenomic and transcriptomic alterations, illuminating a previously underappreciated role of epigenomic and transcriptomic instability in skin pathogenesis. Integration of the multi-omics data revealed that UVR-induced transcriptional dysregulation of a subset of genes was attributable to either genetic mutations or global redistribution of H3K27ac. H3K27ac redistribution further led to the formation of distinctive super enhancers in UV-irradiated cells. Our analysis also identified several new UV target genes, including CYP24A1, GJA5, SLAMF7 and ETV1, which were frequently dysregulated in human squamous cell carcinomas, highlighting their potential as new molecular targets for prevention or treatment of UVR-induced skin cancers. Taken together, our concurrent multi-omics analyses provide new mechanistic insights into the complex molecular networks underlying UV photobiological effects, which have important implications in understanding its impact on skin homeostasis and pathogenesis. PMID:28211524
Framework for the quantitative weight-of-evidence analysis of 'omics data for regulatory purposes.
Bridges, Jim; Sauer, Ursula G; Buesen, Roland; Deferme, Lize; Tollefsen, Knut E; Tralau, Tewes; van Ravenzwaay, Ben; Poole, Alan; Pemberton, Mark
2017-12-01
A framework for the quantitative weight-of-evidence (QWoE) analysis of 'omics data for regulatory purposes is presented. The QWoE framework encompasses seven steps to evaluate 'omics data (also together with non-'omics data): (1) Hypothesis formulation, identification and weighting of lines of evidence (LoEs). LoEs conjoin different (types of) studies that are used to critically test the hypothesis. As an essential component of the QWoE framework, step 1 includes the development of templates for scoring sheets that predefine scoring criteria with scores of 0-4 to enable a quantitative determination of study quality and data relevance; (2) literature searches and categorisation of studies into the pre-defined LoEs; (3) and (4) quantitative assessment of study quality and data relevance using the respective pre-defined scoring sheets for each study; (5) evaluation of LoE-specific strength of evidence based upon the study quality and study relevance scores of the studies conjoined in the respective LoE; (6) integration of the strength of evidence from the individual LoEs to determine the overall strength of evidence; (7) characterisation of uncertainties and conclusion on the QWoE. To put the QWoE framework in practice, case studies are recommended to confirm the relevance of its different steps, or to adapt them as necessary. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mitchell, Hugh D.; Eisfeld, Amie J.; Sims, Amy
Respiratory infections stemming from influenza viruses and the Severe Acute Respiratory Syndrome corona virus (SARS-CoV) represent a serious public health threat as emerging pandemics. Despite efforts to identify the critical interactions of these viruses with host machinery, the key regulatory events that lead to disease pathology remain poorly targeted with therapeutics. Here we implement an integrated network interrogation approach, in which proteome and transcriptome datasets from infection of both viruses in human lung epithelial cells are utilized to predict regulatory genes involved in the host response. We take advantage of a novel “crowd-based” approach to identify and combine ranking metricsmore » that isolate genes/proteins likely related to the pathogenicity of SARS-CoV and influenza virus. Subsequently, a multivariate regression model is used to compare predicted lung epithelial regulatory influences with data derived from other respiratory virus infection models. We predicted a small set of regulatory factors with conserved behavior for consideration as important components of viral pathogenesis that might also serve as therapeutic targets for intervention. Our results demonstrate the utility of integrating diverse ‘omic datasets to predict and prioritize regulatory features conserved across multiple pathogen infection models.« less
A multi-objective constraint-based approach for modeling genome-scale microbial ecosystems.
Budinich, Marko; Bourdon, Jérémie; Larhlimi, Abdelhalim; Eveillard, Damien
2017-01-01
Interplay within microbial communities impacts ecosystems on several scales, and elucidation of the consequent effects is a difficult task in ecology. In particular, the integration of genome-scale data within quantitative models of microbial ecosystems remains elusive. This study advocates the use of constraint-based modeling to build predictive models from recent high-resolution -omics datasets. Following recent studies that have demonstrated the accuracy of constraint-based models (CBMs) for simulating single-strain metabolic networks, we sought to study microbial ecosystems as a combination of single-strain metabolic networks that exchange nutrients. This study presents two multi-objective extensions of CBMs for modeling communities: multi-objective flux balance analysis (MO-FBA) and multi-objective flux variability analysis (MO-FVA). Both methods were applied to a hot spring mat model ecosystem. As a result, multiple trade-offs between nutrients and growth rates, as well as thermodynamically favorable relative abundances at community level, were emphasized. We expect this approach to be used for integrating genomic information in microbial ecosystems. Following models will provide insights about behaviors (including diversity) that take place at the ecosystem scale.
Growing trend of CE at the omics level: the frontier of systems biology.
Oh, Eulsik; Hasan, Md Nabiul; Jamshed, Muhammad; Park, Soo Hyun; Hong, Hye-Min; Song, Eun Joo; Yoo, Young Sook
2010-01-01
In a novel attempt to comprehend the complexity of life, systems biology has recently emerged as a state-of-the-art approach for biological research in contrast to the reductionist approaches that have been used in molecular cell biology since the 1950s. Because a massive amount of information is required in many systems biology studies of life processes, we have increasingly come to depend on techniques that provide high-throughput omics data. CE and CE coupled to MS have served as powerful analytical tools for providing qualitative and quantitative omics data. Recent systems biology studies have focused strongly on the diagnosis and treatment of diseases. The increasing number of clinical research papers on drug discovery and disease therapies reflects this growing interest among scientists. Since such clinical research reflects one of the ultimate purposes of bioscience, these trends will be sustained for a long time. Thus, this review mainly focuses on the application of CE and CE-MS in diagnosis as well as on the latest CE methods developed. Furthermore, we outline the new challenges that arose in 2008 and later in elucidating the system-level functions of the bioconstituents of living organisms.
ISOL@: an Italian SOLAnaceae genomics resource.
Chiusano, Maria Luisa; D'Agostino, Nunzio; Traini, Alessandra; Licciardello, Concetta; Raimondo, Enrico; Aversano, Mario; Frusciante, Luigi; Monti, Luigi
2008-03-26
Present-day '-omics' technologies produce overwhelming amounts of data which include genome sequences, information on gene expression (transcripts and proteins) and on cell metabolic status. These data represent multiple aspects of a biological system and need to be investigated as a whole to shed light on the mechanisms which underpin the system functionality. The gathering and convergence of data generated by high-throughput technologies, the effective integration of different data-sources and the analysis of the information content based on comparative approaches are key methods for meaningful biological interpretations. In the frame of the International Solanaceae Genome Project, we propose here ISOLA, an Italian SOLAnaceae genomics resource. ISOLA (available at http://biosrv.cab.unina.it/isola) represents a trial platform and it is conceived as a multi-level computational environment.ISOLA currently consists of two main levels: the genome and the expression level. The cornerstone of the genome level is represented by the Solanum lycopersicum genome draft sequences generated by the International Tomato Genome Sequencing Consortium. Instead, the basic element of the expression level is the transcriptome information from different Solanaceae species, mainly in the form of species-specific comprehensive collections of Expressed Sequence Tags (ESTs). The cross-talk between the genome and the expression levels is based on data source sharing and on tools that enhance data quality, that extract information content from the levels' under parts and produce value-added biological knowledge. ISOLA is the result of a bioinformatics effort that addresses the challenges of the post-genomics era. It is designed to exploit '-omics' data based on effective integration to acquire biological knowledge and to approach a systems biology view. Beyond providing experimental biologists with a preliminary annotation of the tomato genome, this effort aims to produce a trial computational environment where different aspects and details are maintained as they are relevant for the analysis of the organization, the functionality and the evolution of the Solanaceae family.
Xia, Pu; Zhang, Xiaowei; Zhang, Hanxin; Wang, Pingping; Tian, Mingming; Yu, Hongxia
2017-08-15
One of the major challenges in environmental science is monitoring and assessing the risk of complex environmental mixtures. In vitro bioassays with limited key toxicological end points have been shown to be suitable to evaluate mixtures of organic pollutants in wastewater and recycled water. Omics approaches such as transcriptomics can monitor biological effects at the genome scale. However, few studies have applied omics approach in the assessment of mixtures of organic micropollutants. Here, an omics approach was developed for profiling bioactivity of 10 water samples ranging from wastewater to drinking water in human cells by a reduced human transcriptome (RHT) approach and dose-response modeling. Transcriptional expression of 1200 selected genes were measured by an Ampliseq technology in two cell lines, HepG2 and MCF7, that were exposed to eight serial dilutions of each sample. Concentration-effect models were used to identify differentially expressed genes (DEGs) and to calculate effect concentrations (ECs) of DEGs, which could be ranked to investigate low dose response. Furthermore, molecular pathways disrupted by different samples were evaluated by Gene Ontology (GO) enrichment analysis. The ability of RHT for representing bioactivity utilizing both HepG2 and MCF7 was shown to be comparable to the results of previous in vitro bioassays. Finally, the relative potencies of the mixtures indicated by RHT analysis were consistent with the chemical profiles of the samples. RHT analysis with human cells provides an efficient and cost-effective approach to benchmarking mixture of micropollutants and may offer novel insight into the assessment of mixture toxicity in water.
Improving the discoverability, accessibility, and citability of omics datasets: a case report.
Darlington, Yolanda F; Naumov, Alexey; McOwiti, Apollo; Kankanamge, Wasula H; Becnel, Lauren B; McKenna, Neil J
2017-03-01
Although omics datasets represent valuable assets for hypothesis generation, model testing, and data validation, the infrastructure supporting their reuse lacks organization and consistency. Using nuclear receptor signaling transcriptomic datasets as proof of principle, we developed a model to improve the discoverability, accessibility, and citability of published omics datasets. Primary datasets were retrieved from archives, processed to extract data points, then subjected to metadata enrichment and gap filling. The resulting secondary datasets were exposed on responsive web pages to support mining of gene lists, discovery of related datasets, and single-click citation integration with popular reference managers. Automated processes were established to embed digital object identifier-driven links to the secondary datasets in associated journal articles, small molecule and gene-centric databases, and a dataset search engine. Our model creates multiple points of access to reprocessed and reannotated derivative datasets across the digital biomedical research ecosystem, promoting their visibility and usability across disparate research communities. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Doostparast Torshizi, Abolfazl; Petzold, Linda R
2018-01-01
Data integration methods that combine data from different molecular levels such as genome, epigenome, transcriptome, etc., have received a great deal of interest in the past few years. It has been demonstrated that the synergistic effects of different biological data types can boost learning capabilities and lead to a better understanding of the underlying interactions among molecular levels. In this paper we present a graph-based semi-supervised classification algorithm that incorporates latent biological knowledge in the form of biological pathways with gene expression and DNA methylation data. The process of graph construction from biological pathways is based on detecting condition-responsive genes, where 3 sets of genes are finally extracted: all condition responsive genes, high-frequency condition-responsive genes, and P-value-filtered genes. The proposed approach is applied to ovarian cancer data downloaded from the Human Genome Atlas. Extensive numerical experiments demonstrate superior performance of the proposed approach compared to other state-of-the-art algorithms, including the latest graph-based classification techniques. Simulation results demonstrate that integrating various data types enhances classification performance and leads to a better understanding of interrelations between diverse omics data types. The proposed approach outperforms many of the state-of-the-art data integration algorithms. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Liu, Lin; Gong, Weili; Sun, Xiaomeng; Chen, Guanjun; Wang, Lushan
2018-02-07
Byproducts of food processing can be utilized for the production of high-value-added enzyme cocktails. In this study, we utilized integrated functional omics technology to analyze composition and functional characteristics of extracellular enzymes produced by Aspergillus niger grown on food processing byproducts. The results showed that oligosaccharides constituted by arabinose, xylose, and glucose in wheat bran were able to efficiently induce the production of extracellular enzymes of A. niger. Compared with other substrates, wheat bran was more effective at inducing the secretion of β-glucosidases from GH1 and GH3 families, as well as >50% of proteases from A1-family aspartic proteases. Compared with proteins induced by single wheat bran or soybean dregs, the protein yield induced by their mixture was doubled, and the time required to reach peak enzyme activity was shortened by 25%. This study provided a technical platform for the complex formulation of various substrates and functional analysis of extracellular enzymes.
Pool, Martin; de Boer, H Rudolf; Hooge, Marjolijn N Lub-de; van Vugt, Marcel A T M; de Vries, Elisabeth G E
2017-01-01
Cancer is a growing problem worldwide. The cause of death in cancer patients is often due to treatment-resistant metastatic disease. Many molecularly targeted anticancer drugs have been developed against 'oncogenic driver' pathways. However, these treatments are usually only effective in properly selected patients. Resistance to molecularly targeted drugs through selective pressure on acquired mutations or molecular rewiring can hinder their effectiveness. This review summarizes how molecular imaging techniques can potentially facilitate the optimal implementation of targeted agents. Using the human epidermal growth factor receptor (HER) family as a model in (pre)clinical studies, we illustrate how molecular imaging may be employed to characterize whole body target expression as well as monitor drug effectiveness and the emergence of tumor resistance. We further discuss how an integrative omics discovery platform could guide the selection of 'effect sensors' - new molecular imaging targets - which are dynamic markers that indicate treatment effectiveness or resistance.
A physical sciences network characterization of non-tumorigenic and metastatic cells
Agus, David B.; Alexander, Jenolyn F.; Arap, Wadih; Ashili, Shashanka; Aslan, Joseph E.; Austin, Robert H.; Backman, Vadim; Bethel, Kelly J.; Bonneau, Richard; Chen, Wei-Chiang; Chen-Tanyolac, Chira; Choi, Nathan C.; Curley, Steven A.; Dallas, Matthew; Damania, Dhwanil; Davies, Paul C. W.; Decuzzi, Paolo; Dickinson, Laura; Estevez-Salmeron, Luis; Estrella, Veronica; Ferrari, Mauro; Fischbach, Claudia; Foo, Jasmine; Fraley, Stephanie I.; Frantz, Christian; Fuhrmann, Alexander; Gascard, Philippe; Gatenby, Robert A.; Geng, Yue; Gerecht, Sharon; Gillies, Robert J.; Godin, Biana; Grady, William M.; Greenfield, Alex; Hemphill, Courtney; Hempstead, Barbara L.; Hielscher, Abigail; Hillis, W. Daniel; Holland, Eric C.; Ibrahim-Hashim, Arig; Jacks, Tyler; Johnson, Roger H.; Joo, Ahyoung; Katz, Jonathan E.; Kelbauskas, Laimonas; Kesselman, Carl; King, Michael R.; Konstantopoulos, Konstantinos; Kraning-Rush, Casey M.; Kuhn, Peter; Kung, Kevin; Kwee, Brian; Lakins, Johnathon N.; Lambert, Guillaume; Liao, David; Licht, Jonathan D.; Liphardt, Jan T.; Liu, Liyu; Lloyd, Mark C.; Lyubimova, Anna; Mallick, Parag; Marko, John; McCarty, Owen J. T.; Meldrum, Deirdre R.; Michor, Franziska; Mumenthaler, Shannon M.; Nandakumar, Vivek; O’Halloran, Thomas V.; Oh, Steve; Pasqualini, Renata; Paszek, Matthew J.; Philips, Kevin G.; Poultney, Christopher S.; Rana, Kuldeepsinh; Reinhart-King, Cynthia A.; Ros, Robert; Semenza, Gregg L.; Senechal, Patti; Shuler, Michael L.; Srinivasan, Srimeenakshi; Staunton, Jack R.; Stypula, Yolanda; Subramanian, Hariharan; Tlsty, Thea D.; Tormoen, Garth W.; Tseng, Yiider; van Oudenaarden, Alexander; Verbridge, Scott S.; Wan, Jenny C.; Weaver, Valerie M.; Widom, Jonathan; Will, Christine; Wirtz, Denis; Wojtkowiak, Jonathan; Wu, Pei-Hsun
2013-01-01
To investigate the transition from non-cancerous to metastatic from a physical sciences perspective, the Physical Sciences–Oncology Centers (PS-OC) Network performed molecular and biophysical comparative studies of the non-tumorigenic MCF-10A and metastatic MDA-MB-231 breast epithelial cell lines, commonly used as models of cancer metastasis. Experiments were performed in 20 laboratories from 12 PS-OCs. Each laboratory was supplied with identical aliquots and common reagents and culture protocols. Analyses of these measurements revealed dramatic differences in their mechanics, migration, adhesion, oxygen response, and proteomic profiles. Model-based multi-omics approaches identified key differences between these cells' regulatory networks involved in morphology and survival. These results provide a multifaceted description of cellular parameters of two widely used cell lines and demonstrate the value of the PS-OC Network approach for integration of diverse experimental observations to elucidate the phenotypes associated with cancer metastasis. PMID:23618955
A physical sciences network characterization of non-tumorigenic and metastatic cells.
Agus, David B; Alexander, Jenolyn F; Arap, Wadih; Ashili, Shashanka; Aslan, Joseph E; Austin, Robert H; Backman, Vadim; Bethel, Kelly J; Bonneau, Richard; Chen, Wei-Chiang; Chen-Tanyolac, Chira; Choi, Nathan C; Curley, Steven A; Dallas, Matthew; Damania, Dhwanil; Davies, Paul C W; Decuzzi, Paolo; Dickinson, Laura; Estevez-Salmeron, Luis; Estrella, Veronica; Ferrari, Mauro; Fischbach, Claudia; Foo, Jasmine; Fraley, Stephanie I; Frantz, Christian; Fuhrmann, Alexander; Gascard, Philippe; Gatenby, Robert A; Geng, Yue; Gerecht, Sharon; Gillies, Robert J; Godin, Biana; Grady, William M; Greenfield, Alex; Hemphill, Courtney; Hempstead, Barbara L; Hielscher, Abigail; Hillis, W Daniel; Holland, Eric C; Ibrahim-Hashim, Arig; Jacks, Tyler; Johnson, Roger H; Joo, Ahyoung; Katz, Jonathan E; Kelbauskas, Laimonas; Kesselman, Carl; King, Michael R; Konstantopoulos, Konstantinos; Kraning-Rush, Casey M; Kuhn, Peter; Kung, Kevin; Kwee, Brian; Lakins, Johnathon N; Lambert, Guillaume; Liao, David; Licht, Jonathan D; Liphardt, Jan T; Liu, Liyu; Lloyd, Mark C; Lyubimova, Anna; Mallick, Parag; Marko, John; McCarty, Owen J T; Meldrum, Deirdre R; Michor, Franziska; Mumenthaler, Shannon M; Nandakumar, Vivek; O'Halloran, Thomas V; Oh, Steve; Pasqualini, Renata; Paszek, Matthew J; Philips, Kevin G; Poultney, Christopher S; Rana, Kuldeepsinh; Reinhart-King, Cynthia A; Ros, Robert; Semenza, Gregg L; Senechal, Patti; Shuler, Michael L; Srinivasan, Srimeenakshi; Staunton, Jack R; Stypula, Yolanda; Subramanian, Hariharan; Tlsty, Thea D; Tormoen, Garth W; Tseng, Yiider; van Oudenaarden, Alexander; Verbridge, Scott S; Wan, Jenny C; Weaver, Valerie M; Widom, Jonathan; Will, Christine; Wirtz, Denis; Wojtkowiak, Jonathan; Wu, Pei-Hsun
2013-01-01
To investigate the transition from non-cancerous to metastatic from a physical sciences perspective, the Physical Sciences-Oncology Centers (PS-OC) Network performed molecular and biophysical comparative studies of the non-tumorigenic MCF-10A and metastatic MDA-MB-231 breast epithelial cell lines, commonly used as models of cancer metastasis. Experiments were performed in 20 laboratories from 12 PS-OCs. Each laboratory was supplied with identical aliquots and common reagents and culture protocols. Analyses of these measurements revealed dramatic differences in their mechanics, migration, adhesion, oxygen response, and proteomic profiles. Model-based multi-omics approaches identified key differences between these cells' regulatory networks involved in morphology and survival. These results provide a multifaceted description of cellular parameters of two widely used cell lines and demonstrate the value of the PS-OC Network approach for integration of diverse experimental observations to elucidate the phenotypes associated with cancer metastasis.
A next generation multiscale view of inborn errors of metabolism
Argmann, Carmen A.; Houten, Sander M.; Zhu, Jun; Schadt, Eric E.
2015-01-01
Inborn errors of metabolism (IEM) are not unlike common diseases. They often present as a spectrum of disease phenotypes that correlates poorly with the severity of the disease-causing mutations. This greatly impacts patient care and reveals fundamental gaps in our knowledge of disease modifying biology. Systems biology approaches that integrate multi-omics data into molecular networks have significantly improved our understanding of complex diseases. Similar approaches to study IEM are rare despite their complex nature. We highlight that existing common disease-derived datasets and networks can be repurposed to generate novel mechanistic insight in IEM and potentially identify candidate modifiers. While understanding disease pathophysiology will advance the IEM field, the ultimate goal should be to understand per individual how their phenotype emerges given their primary mutation on the background of their whole genome, not unlike personalized medicine. We foresee that panomics and network strategies combined with recent experimental innovations will facilitate this. PMID:26712461
Mass Spectrometry-Based Metabolomics to Elucidate Functions in Marine Organisms and Ecosystems
Goulitquer, Sophie; Potin, Philippe; Tonon, Thierry
2012-01-01
Marine systems are very diverse and recognized as being sources of a wide range of biomolecules. This review provides an overview of metabolite profiling based on mass spectrometry (MS) approaches in marine organisms and their environments, focusing on recent advances in the field. We also point out some of the technical challenges that need to be overcome in order to increase applications of metabolomics in marine systems, including extraction of chemical compounds from different matrices and data management. Metabolites being important links between genotype and phenotype, we describe added value provided by integration of data from metabolite profiling with other layers of omics, as well as their importance for the development of systems biology approaches in marine systems to study several biological processes, and to analyze interactions between organisms within communities. The growing importance of MS-based metabolomics in chemical ecology studies in marine ecosystems is also illustrated. PMID:22690147
A physical sciences network characterization of non-tumorigenic and metastatic cells
NASA Astrophysics Data System (ADS)
Physical Sciences-Oncology Centers Network; Agus, David B.; Alexander, Jenolyn F.; Arap, Wadih; Ashili, Shashanka; Aslan, Joseph E.; Austin, Robert H.; Backman, Vadim; Bethel, Kelly J.; Bonneau, Richard; Chen, Wei-Chiang; Chen-Tanyolac, Chira; Choi, Nathan C.; Curley, Steven A.; Dallas, Matthew; Damania, Dhwanil; Davies, Paul C. W.; Decuzzi, Paolo; Dickinson, Laura; Estevez-Salmeron, Luis; Estrella, Veronica; Ferrari, Mauro; Fischbach, Claudia; Foo, Jasmine; Fraley, Stephanie I.; Frantz, Christian; Fuhrmann, Alexander; Gascard, Philippe; Gatenby, Robert A.; Geng, Yue; Gerecht, Sharon; Gillies, Robert J.; Godin, Biana; Grady, William M.; Greenfield, Alex; Hemphill, Courtney; Hempstead, Barbara L.; Hielscher, Abigail; Hillis, W. Daniel; Holland, Eric C.; Ibrahim-Hashim, Arig; Jacks, Tyler; Johnson, Roger H.; Joo, Ahyoung; Katz, Jonathan E.; Kelbauskas, Laimonas; Kesselman, Carl; King, Michael R.; Konstantopoulos, Konstantinos; Kraning-Rush, Casey M.; Kuhn, Peter; Kung, Kevin; Kwee, Brian; Lakins, Johnathon N.; Lambert, Guillaume; Liao, David; Licht, Jonathan D.; Liphardt, Jan T.; Liu, Liyu; Lloyd, Mark C.; Lyubimova, Anna; Mallick, Parag; Marko, John; McCarty, Owen J. T.; Meldrum, Deirdre R.; Michor, Franziska; Mumenthaler, Shannon M.; Nandakumar, Vivek; O'Halloran, Thomas V.; Oh, Steve; Pasqualini, Renata; Paszek, Matthew J.; Philips, Kevin G.; Poultney, Christopher S.; Rana, Kuldeepsinh; Reinhart-King, Cynthia A.; Ros, Robert; Semenza, Gregg L.; Senechal, Patti; Shuler, Michael L.; Srinivasan, Srimeenakshi; Staunton, Jack R.; Stypula, Yolanda; Subramanian, Hariharan; Tlsty, Thea D.; Tormoen, Garth W.; Tseng, Yiider; van Oudenaarden, Alexander; Verbridge, Scott S.; Wan, Jenny C.; Weaver, Valerie M.; Widom, Jonathan; Will, Christine; Wirtz, Denis; Wojtkowiak, Jonathan; Wu, Pei-Hsun
2013-04-01
To investigate the transition from non-cancerous to metastatic from a physical sciences perspective, the Physical Sciences-Oncology Centers (PS-OC) Network performed molecular and biophysical comparative studies of the non-tumorigenic MCF-10A and metastatic MDA-MB-231 breast epithelial cell lines, commonly used as models of cancer metastasis. Experiments were performed in 20 laboratories from 12 PS-OCs. Each laboratory was supplied with identical aliquots and common reagents and culture protocols. Analyses of these measurements revealed dramatic differences in their mechanics, migration, adhesion, oxygen response, and proteomic profiles. Model-based multi-omics approaches identified key differences between these cells' regulatory networks involved in morphology and survival. These results provide a multifaceted description of cellular parameters of two widely used cell lines and demonstrate the value of the PS-OC Network approach for integration of diverse experimental observations to elucidate the phenotypes associated with cancer metastasis.
A survey on evolutionary algorithm based hybrid intelligence in bioinformatics.
Li, Shan; Kang, Liying; Zhao, Xing-Ming
2014-01-01
With the rapid advance in genomics, proteomics, metabolomics, and other types of omics technologies during the past decades, a tremendous amount of data related to molecular biology has been produced. It is becoming a big challenge for the bioinformatists to analyze and interpret these data with conventional intelligent techniques, for example, support vector machines. Recently, the hybrid intelligent methods, which integrate several standard intelligent approaches, are becoming more and more popular due to their robustness and efficiency. Specifically, the hybrid intelligent approaches based on evolutionary algorithms (EAs) are widely used in various fields due to the efficiency and robustness of EAs. In this review, we give an introduction about the applications of hybrid intelligent methods, in particular those based on evolutionary algorithm, in bioinformatics. In particular, we focus on their applications to three common problems that arise in bioinformatics, that is, feature selection, parameter estimation, and reconstruction of biological networks.
Sorting Out the Ocean Crust Deep Biosphere with Single Cell Omics Approaches
NASA Astrophysics Data System (ADS)
Orcutt, B.; D'Angelo, T.; Goordial, J.; Jones, R. M.; Carr, S. A.
2017-12-01
Although oceanic crust comprises a large habitat for subsurface life, the structure, function, and dynamics of microbial communities living on rocks in the subsurface are poorly understood. Single cell level approaches can overcome limitations of low biomass in subsurface systems. Coupled with incubation experiments with amino acid orthologs, single cell level sorting can reveal high resolution information about identity, functional potential, and growth. Leveraging collaboration with the Single Cell Genomics Center and the Facility for Aquatic Cytometry at Bigelow Laboratory, we present recent results from single cell level sorting and -omics sequencing from several crustal environments, including the Atlantis Massif and the Juan de Fuca Ridge flank. We will also highlight new experiments conducted with samples recovered from the flank of the Mid-Atlantic Ridge.
Davidsen, Peter K; Turan, Nil; Egginton, Stuart; Falciani, Francesco
2016-02-01
The overall aim of physiological research is to understand how living systems function in an integrative manner. Consequently, the discipline of physiology has since its infancy attempted to link multiple levels of biological organization. Increasingly this has involved mathematical and computational approaches, typically to model a small number of components spanning several levels of biological organization. With the advent of "omics" technologies, which can characterize the molecular state of a cell or tissue (intended as the level of expression and/or activity of its molecular components), the number of molecular components we can quantify has increased exponentially. Paradoxically, the unprecedented amount of experimental data has made it more difficult to derive conceptual models underlying essential mechanisms regulating mammalian physiology. We present an overview of state-of-the-art methods currently used to identifying biological networks underlying genomewide responses. These are based on a data-driven approach that relies on advanced computational methods designed to "learn" biology from observational data. In this review, we illustrate an application of these computational methodologies using a case study integrating an in vivo model representing the transcriptional state of hypoxic skeletal muscle with a clinical study representing muscle wasting in chronic obstructive pulmonary disease patients. The broader application of these approaches to modeling multiple levels of biological data in the context of modern physiology is discussed. Copyright © 2016 the American Physiological Society.
Metabolomics and Integrative Omics for the Development of Thai Traditional Medicine
Khoomrung, Sakda; Wanichthanarak, Kwanjeera; Nookaew, Intawat; Thamsermsang, Onusa; Seubnooch, Patcharamon; Laohapand, Tawee; Akarasereenont, Pravit
2017-01-01
In recent years, interest in studies of traditional medicine in Asian and African countries has gradually increased due to its potential to complement modern medicine. In this review, we provide an overview of Thai traditional medicine (TTM) current development, and ongoing research activities of TTM related to metabolomics. This review will also focus on three important elements of systems biology analysis of TTM including analytical techniques, statistical approaches and bioinformatics tools for handling and analyzing untargeted metabolomics data. The main objective of this data analysis is to gain a comprehensive understanding of the system wide effects that TTM has on individuals. Furthermore, potential applications of metabolomics and systems medicine in TTM will also be discussed. PMID:28769804
The application of 'omics tools to biologically based monitoring and surveillance of aquatic environments shows considerable promise for complementing chemical monitoring in ecological risk assessments. However, few of the current approaches offer the ability to sample ecological...
USDA-ARS?s Scientific Manuscript database
Multidisciplinary approaches enabled a better understanding of the connection between human gut microbes and health. This knowledge is rapidly changing how we think about probiotics and related –biotics (prebiotics, synbiotics, pharmabiotics and postbiotics). Functional –omics approaches are very im...
Lourenço, Anália; Ferreira, Andreia; Veiga, Nuno; Machado, Idalina; Pereira, Maria Olivia; Azevedo, Nuno F
2012-01-01
Consortia of microorganisms, commonly known as biofilms, are attracting much attention from the scientific community due to their impact in human activity. As biofilm research grows to be a data-intensive discipline, the need for suitable bioinformatics approaches becomes compelling to manage and validate individual experiments, and also execute inter-laboratory large-scale comparisons. However, biofilm data is widespread across ad hoc, non-standardized individual files and, thus, data interchange among researchers, or any attempt of cross-laboratory experimentation or analysis, is hardly possible or even attempted. This paper presents BiofOmics, the first publicly accessible Web platform specialized in the management and analysis of data derived from biofilm high-throughput studies. The aim is to promote data interchange across laboratories, implementing collaborative experiments, and enable the development of bioinformatics tools in support of the processing and analysis of the increasing volumes of experimental biofilm data that are being generated. BiofOmics' data deposition facility enforces data structuring and standardization, supported by controlled vocabulary. Researchers are responsible for the description of the experiments, their results and conclusions. BiofOmics' curators interact with submitters only to enforce data structuring and the use of controlled vocabulary. Then, BiofOmics' search facility makes publicly available the profile and data associated with a submitted study so that any researcher can profit from these standardization efforts to compare similar studies, generate new hypotheses to be tested or even extend the conditions experimented in the study. BiofOmics' novelty lies in its support to standardized data deposition, the availability of computerizable data files and the free-of-charge dissemination of biofilm studies across the community. Hopefully, this will open promising research possibilities, namely the comparison of results between different laboratories, the reproducibility of methods within and between laboratories, and the development of guidelines and standardized protocols for biofilm formation operating procedures and analytical methods.
Pharmacometabolomics Informs Quantitative Radiomics for Glioblastoma Diagnostic Innovation.
Katsila, Theodora; Matsoukas, Minos-Timotheos; Patrinos, George P; Kardamakis, Dimitrios
2017-08-01
Applications of omics systems biology technologies have enormous promise for radiology and diagnostics in surgical fields. In this context, the emerging fields of radiomics (a systems scale approach to radiology using a host of technologies, including omics) and pharmacometabolomics (use of metabolomics for patient and disease stratification and guiding precision medicine) offer much synergy for diagnostic innovation in surgery, particularly in neurosurgery. This synthesis of omics fields and applications is timely because diagnostic accuracy in central nervous system tumors still challenges decision-making. Considering the vast heterogeneity in brain tumors, disease phenotypes, and interindividual variability in surgical and chemotherapy outcomes, we believe that diagnostic accuracy can be markedly improved by quantitative radiomics coupled to pharmacometabolomics and related health information technologies while optimizing economic costs of traditional diagnostics. In this expert review, we present an innovation analysis on a systems-level multi-omics approach toward diagnostic accuracy in central nervous system tumors. For this, we suggest that glioblastomas serve as a useful application paradigm. We performed a literature search on PubMed for articles published in English between 2006 and 2016. We used the search terms "radiomics," "glioblastoma," "biomarkers," "pharmacogenomics," "pharmacometabolomics," "pharmacometabonomics/pharmacometabolomics," "collaborative informatics," and "precision medicine." A list of the top 4 insights we derived from this literature analysis is presented in this study. For example, we found that (i) tumor grading needs to be better refined, (ii) diagnostic precision should be improved, (iii) standardization in radiomics is lacking, and (iv) quantitative radiomics needs to prove clinical implementation. We conclude with an interdisciplinary call to the metabolomics, pharmacy/pharmacology, radiology, and surgery communities that pharmacometabolomics coupled to information technologies (chemoinformatics tools, databases, collaborative systems) can inform quantitative radiomics, thus translating Big Data and information growth to knowledge growth, rational drug development and diagnostics innovation for glioblastomas, and possibly in other brain tumors.
imDEV: a graphical user interface to R multivariate analysis tools in Microsoft Excel
USDA-ARS?s Scientific Manuscript database
Interactive modules for data exploration and visualization (imDEV) is a Microsoft Excel spreadsheet embedded application providing an integrated environment for the analysis of omics data sets with a user-friendly interface. Individual modules were designed to provide toolsets to enable interactive ...
USDA-ARS?s Scientific Manuscript database
Background: Availability of a large number of data sets in public repositories and the advances in integrating multi-omics methods have greatly advanced our understanding of biological organisms and microbial associates, as well as large subcellular organelles, such as mitochondria. Mitochondrial ...
Incorporating deep learning into the analysis of diverse livestock data
USDA-ARS?s Scientific Manuscript database
Technological advances in high-throughput phenotyping and multiple omics fields have led to an explosion in the volume of data across the whole spectrum of biology, allowing researchers to integrate data of different types to inform hypotheses and expand the scope of their research questions. Howeve...
RAMONA: a Web application for gene set analysis on multilevel omics data.
Sass, Steffen; Buettner, Florian; Mueller, Nikola S; Theis, Fabian J
2015-01-01
Decreasing costs of modern high-throughput experiments allow for the simultaneous analysis of altered gene activity on various molecular levels. However, these multi-omics approaches lead to a large amount of data, which is hard to interpret for a non-bioinformatician. Here, we present the remotely accessible multilevel ontology analysis (RAMONA). It offers an easy-to-use interface for the simultaneous gene set analysis of combined omics datasets and is an extension of the previously introduced MONA approach. RAMONA is based on a Bayesian enrichment method for the inference of overrepresented biological processes among given gene sets. Overrepresentation is quantified by interpretable term probabilities. It is able to handle data from various molecular levels, while in parallel coping with redundancies arising from gene set overlaps and related multiple testing problems. The comprehensive output of RAMONA is easy to interpret and thus allows for functional insight into the affected biological processes. With RAMONA, we provide an efficient implementation of the Bayesian inference problem such that ontologies consisting of thousands of terms can be processed in the order of seconds. RAMONA is implemented as ASP.NET Web application and publicly available at http://icb.helmholtz-muenchen.de/ramona. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Nutrigenomics: the cutting edge and Asian perspectives.
Kato, Hisanori
2008-01-01
One of the two major goals of nutrigenomics is to make full use of genomic information to reveal how genetic variations affect nutrients and other food factors and thereby realize tailor-made nutrition (nutrigenetics). The other major goal of nutrigenomics is to comprehensively understand the response of the body to diets and food factors through various 'omics' technologies such as transcriptomics, proteomics, and metabolomics. The most successfully exploited technology to date is transcriptome analysis, due mainly to its efficiency and high-throughput feature. This technology has already provided a substantial amount of data on, for instance, the novel function of food factors, the unknown mechanism of the effect of nutrients, and even safety issues of foods. The nutrigenomics database that we have created now holds the publication data of several hundred of such 'omics' studies. Furthermore, the transcriptomics approach is being applied to food safety issues. For ex-ample, the data we have obtained thus far suggest that this new technology will facilitate the safety evaluation of newly developed foods and will help clarify the mechanism of toxic effects resulting from the excessive intake of a nutrient. The 'omics' data accumulated by our group and others strongly support the promise of the systems biology approach to food and nutrition science.
Zhang, Yan; Deng, Yongfeng; Zhao, Yanping; Ren, Hongqiang
2014-05-15
Mixed chemical wastewaters (MCWW) from industrial park contain complex mixtures of trace contaminants, which cannot be effectively removed by wastewater treatment plants (WWTP) and have become an unignored threat to ambient environment. However, limited information is available to evaluate the complicated toxic effects of MCWW and its effluent from wastewater treatment plant (WTPE) from the perspective of bio-omics. In this study, mice were exposed to the MCWW and WTPE for 90 days and distinct differences in the hepatic transcriptome and serum metabolome were analyzed by digital gene expression (DGE) and proton nuclear magnetic resonance ((1)H-NMR) spectra, respectively. Our results indicated that disruption of lipid metabolism in liver and hepatotoxicity were induced by both MCWW and WTPE exposure. WTPE is still a health risk to the environment, which is in need of more attention. Furthermore, we demonstrated the potential ability of bio-omics approaches for evaluating toxic effects of MCWW and WTPE. Copyright © 2014 Elsevier B.V. All rights reserved.
Integrated cellular network of transcription regulations and protein-protein interactions
2010-01-01
Background With the accumulation of increasing omics data, a key goal of systems biology is to construct networks at different cellular levels to investigate cellular machinery of the cell. However, there is currently no satisfactory method to construct an integrated cellular network that combines the gene regulatory network and the signaling regulatory pathway. Results In this study, we integrated different kinds of omics data and developed a systematic method to construct the integrated cellular network based on coupling dynamic models and statistical assessments. The proposed method was applied to S. cerevisiae stress responses, elucidating the stress response mechanism of the yeast. From the resulting integrated cellular network under hyperosmotic stress, the highly connected hubs which are functionally relevant to the stress response were identified. Beyond hyperosmotic stress, the integrated network under heat shock and oxidative stress were also constructed and the crosstalks of these networks were analyzed, specifying the significance of some transcription factors to serve as the decision-making devices at the center of the bow-tie structure and the crucial role for rapid adaptation scheme to respond to stress. In addition, the predictive power of the proposed method was also demonstrated. Conclusions We successfully construct the integrated cellular network which is validated by literature evidences. The integration of transcription regulations and protein-protein interactions gives more insight into the actual biological network and is more predictive than those without integration. The method is shown to be powerful and flexible and can be used under different conditions and for different species. The coupling dynamic models of the whole integrated cellular network are very useful for theoretical analyses and for further experiments in the fields of network biology and synthetic biology. PMID:20211003
Integrated cellular network of transcription regulations and protein-protein interactions.
Wang, Yu-Chao; Chen, Bor-Sen
2010-03-08
With the accumulation of increasing omics data, a key goal of systems biology is to construct networks at different cellular levels to investigate cellular machinery of the cell. However, there is currently no satisfactory method to construct an integrated cellular network that combines the gene regulatory network and the signaling regulatory pathway. In this study, we integrated different kinds of omics data and developed a systematic method to construct the integrated cellular network based on coupling dynamic models and statistical assessments. The proposed method was applied to S. cerevisiae stress responses, elucidating the stress response mechanism of the yeast. From the resulting integrated cellular network under hyperosmotic stress, the highly connected hubs which are functionally relevant to the stress response were identified. Beyond hyperosmotic stress, the integrated network under heat shock and oxidative stress were also constructed and the crosstalks of these networks were analyzed, specifying the significance of some transcription factors to serve as the decision-making devices at the center of the bow-tie structure and the crucial role for rapid adaptation scheme to respond to stress. In addition, the predictive power of the proposed method was also demonstrated. We successfully construct the integrated cellular network which is validated by literature evidences. The integration of transcription regulations and protein-protein interactions gives more insight into the actual biological network and is more predictive than those without integration. The method is shown to be powerful and flexible and can be used under different conditions and for different species. The coupling dynamic models of the whole integrated cellular network are very useful for theoretical analyses and for further experiments in the fields of network biology and synthetic biology.
Data mining in newt-omics, the repository for omics data from the newt.
Looso, Mario; Braun, Thomas
2015-01-01
Salamanders are an excellent model organism to study regenerative processes due to their unique ability to regenerate lost appendages or organs. Straightforward bioinformatics tools to analyze and take advantage of the growing number of "omics" studies performed in salamanders were lacking so far. To overcome this limitation, we have generated a comprehensive data repository for the red-spotted newt Notophthalmus viridescens, named newt-omics, merging omics style datasets on the transcriptome and proteome level including expression values and annotations. The resource is freely available via a user-friendly Web-based graphical user interface ( http://newt-omics.mpi-bn.mpg.de) that allows access and queries to the database without prior bioinformatical expertise. The repository is updated regularly, incorporating new published datasets from omics technologies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ma, Jian; Casey, Cameron P.; Zheng, Xueyun
Motivation: Drift tube ion mobility spectrometry (DTIMS) is increasingly implemented in high throughput omics workflows, and new informatics approaches are necessary for processing the associated data. To automatically extract arrival times for molecules measured by DTIMS coupled with mass spectrometry and compute their associated collisional cross sections (CCS) we created the PNNL Ion Mobility Cross Section Extractor (PIXiE). The primary application presented for this algorithm is the extraction of information necessary to create a reference library containing accu-rate masses, DTIMS arrival times and CCSs for use in high throughput omics analyses. Results: We demonstrate the utility of this approach bymore » automatically extracting arrival times and calculating the associated CCSs for a set of endogenous metabolites and xenobiotics. The PIXiE-generated CCS values were identical to those calculated by hand and within error of those calcu-lated using commercially available instrument vendor software.« less
Evangelatos, Nikolaos; Satyamourthy, Kapaettu; Levidou, Georgia; Brand, Helmut; Bauer, Pia; Kouskouti, Christina; Brand, Angela
2018-05-01
"-Omics" systems sciences are at the epicenter of personalized medicine and public health, and drivers of knowledge-based biotechnology innovation. Bioinformatics, a core component of omics research, is one of the disciplines that first employed Free/Libre Open Source Software (FLOSS), and thus provided a fertile ground for its further development. Understanding the use and characteristics of FLOSS deployed in the omics field is valuable for future innovation strategies, policy and funding priorities. We conducted a bibliometric, longitudinal study of the use of FLOSS in sepsis omics research from 2011 to 2015 in the United States, EU-28 and China. Because sepsis is an interdisciplinary field at the intersection of multiple omics technologies and medical specialties, it was chosen as a model innovation ecosystem for this empirical analysis, which used publicly available data. Despite development of and competition from proprietary commercial software, scholars in omics continue to employ FLOSS routinely, and independent of the type of omics technology they work with. The number of articles using FLOSS increased significantly over time in the EU-28, as opposed to the United States and China (R = 0.96, p = 0.004). Furthermore, in an era where sharing of knowledge is being strongly advocated and promoted by public agencies and social institutions, we discuss possible correlations between the use of FLOSS and various funding sources in omics research. These observations and analyses provide new insights into the use of FLOSS in sepsis omics research across three (supra)national regions. Further benchmarking studies are warranted for FLOSS trends in other omics fields and geographical settings. These could, in time, lead to the development of new composite innovation and technology use metrics in omics systems sciences and bioinformatics communities.
GeneLab Analysis Working Group Kick-Off Meeting
NASA Technical Reports Server (NTRS)
Costes, Sylvain V.
2018-01-01
Goals to achieve for GeneLab AWG - GL vision - Review of GeneLab AWG charter Timeline and milestones for 2018 Logistics - Monthly Meeting - Workshop - Internship - ASGSR Introduction of team leads and goals of each group Introduction of all members Q/A Three-tier Client Strategy to Democratize Data Physiological changes, pathway enrichment, differential expression, normalization, processing metadata, reproducibility, Data federation/integration with heterogeneous bioinformatics external databases The GLDS currently serves over 100 omics investigations to the biomedical community via open access. In order to expand the scope of metadata record searches via the GLDS, we designed a metadata warehouse that collects and updates metadata records from external systems housing similar data. To demonstrate the capabilities of federated search and retrieval of these data, we imported metadata records from three open-access data systems into the GLDS metadata warehouse: NCBI's Gene Expression Omnibus (GEO), EBI's PRoteomics IDEntifications (PRIDE) repository, and the Metagenomics Analysis server (MG-RAST). Each of these systems defines metadata for omics data sets differently. One solution to bridge such differences is to employ a common object model (COM) to which each systems' representation of metadata can be mapped. Warehoused metadata records are then transformed at ETL to this single, common representation. Queries generated via the GLDS are then executed against the warehouse, and matching records are shown in the COM representation (Fig. 1). While this approach is relatively straightforward to implement, the volume of the data in the omics domain presents challenges in dealing with latency and currency of records. Furthermore, the lack of a coordinated has been federated data search for and retrieval of these kinds of data across other open-access systems, so that users are able to conduct biological meta-investigations using data from a variety of sources. Such meta-investigations are key to corroborating findings from many kinds of assays and translating them into systems biology knowledge and, eventually, therapeutics.
Li, Qinghong; Freeman, Lisa M; Rush, John E; Huggins, Gordon S; Kennedy, Adam D; Labuda, Jeffrey A; Laflamme, Dorothy P; Hannah, Steven S
2015-08-01
Canine degenerative mitral valve disease (DMVD) is the most common form of heart disease in dogs. The objective of this study was to identify cellular and metabolic pathways that play a role in DMVD by performing metabolomics and transcriptomics analyses on serum and tissue (mitral valve and left ventricle) samples previously collected from dogs with DMVD or healthy hearts. Gas or liquid chromatography followed by mass spectrophotometry were used to identify metabolites in serum. Transcriptomics analysis of tissue samples was completed using RNA-seq, and selected targets were confirmed by RT-qPCR. Random Forest analysis was used to classify the metabolites that best predicted the presence of DMVD. Results identified 41 known and 13 unknown serum metabolites that were significantly different between healthy and DMVD dogs, representing alterations in fat and glucose energy metabolism, oxidative stress, and other pathways. The three metabolites with the greatest single effect in the Random Forest analysis were γ-glutamylmethionine, oxidized glutathione, and asymmetric dimethylarginine. Transcriptomics analysis identified 812 differentially expressed transcripts in left ventricle samples and 263 in mitral valve samples, representing changes in energy metabolism, antioxidant function, nitric oxide signaling, and extracellular matrix homeostasis pathways. Many of the identified alterations may benefit from nutritional or medical management. Our study provides evidence of the growing importance of integrative approaches in multi-omics research in veterinary and nutritional sciences.
Ischemic Stroke: From Next Generation Sequencing and GWAS to Community Genomics?
Black, Michael; Wang, Wenzhi; Wang, Wei
2015-08-01
Stroke is a major cause of mortality and morbidity in both the developed and developing world. Next generation sequencing (NGS) and multi-omics integrative biology research offer new opportunities in the way we research and understand stroke. These biotechnologies also signal a shift from genetics to genomics of stroke, which is highlighted in this review. Stroke is a focal neurological deficit resulting from disruption of the cerebral blood supply. There are two main types of common stroke, ischemic stroke (IS), which comprises 80% of cases, and hemorrhagic stroke (HS) that accounts for about 20% of cases. IS is a complex multi-factorial disease with multiple environmental and genomic determinants. We discuss here IS from genomics and bioinformatics perspectives, including the highlights of the genome wide association studies (GWAS), NGS progress to date, and exome studies. While both 'common variant, common disease' and 'rare variant, common disease' approaches need to be assessed in tandem, future studies into IS omics should also consider pedigree and/or community based sampling to take account of the complex diversity of IS genetics. We conclude by presenting an example of such community genomics research from China in an extended pedigree sample, and the ways in which the intersection of genomics and global society can usefully inform our understanding of IS pathophysiology and potential preventive medicine interventions in the future.
Improvements in algal lipid production: a systems biology and gene editing approach.
Banerjee, Avik; Banerjee, Chiranjib; Negi, Sangeeta; Chang, Jo-Shu; Shukla, Pratyoosh
2018-05-01
In the wake of rising energy demands, microalgae have emerged as potential sources of sustainable and renewable carbon-neutral fuels, such as bio-hydrogen and bio-oil. For rational metabolic engineering, the elucidation of metabolic pathways in fine detail and their manipulation according to requirements is the key to exploiting the use of microalgae. Emergence of site-specific nucleases have revolutionized applied research leading to biotechnological gains. Genome engineering as well as modulation of the endogenous genome with high precision using CRISPR systems is being gradually employed in microalgal research. Further, to optimize and produce better algal platforms, use of systems biology network analysis and integration of omics data is required. This review discusses two important approaches: systems biology and gene editing strategies used on microalgal systems with a focus on biofuel production and sustainable solutions. It also emphasizes that the integration of such systems would contribute and compliment applied research on microalgae. Recent advances in microalgae are discussed, including systems biology, gene editing approaches in lipid bio-synthesis, and antenna engineering. Lastly, it has been attempted here to showcase how CRISPR/Cas systems are a better editing tool than existing techniques that can be utilized for gene modulation and engineering during biofuel production.
Deciphering Biochemical Network: from particles to planes then to spaces
NASA Astrophysics Data System (ADS)
Ye, Xinhao; Zhang, Siliang; Engineer Research CenterBiotechnology, National
2004-03-01
Today when we are still infatuated with the booming systematic fashion in life science, we, especially as biologist, ironically have fallen down into a sub-systematic maze. That is, although rapid advances in "omics" sciences ceaselessly provided so-called global or large-scale maps to exhibit the corresponding subnet, seldom paid attention to connecting these distinct but close-knit functional modules. Fortunately, a group of physicists recently cast off this natural moat and integrated multi-scale biological network into a simple life's pyramid. However, if extended this pyramid to a 3D structure in view of XYZ axis constructed by the temporal, spatial and organized characteristics respectively, it should be noted that this from-universal-to-particular pyramid is only a transverse section while the achievements in diverse "omics" sciences consist of relative longitudinal ones. On that footing, if analogizing the development of systems biology in last decades as a huge leap from discrete particles (typically in "a paper = a gene" era) to several planes (that is relative to corresponding OMICS science), we might rationally predict a next "space" era is coming soon to untangle and map the multi-tiered biological network really in a whole.
-Omic and Electronic Health Record Big Data Analytics for Precision Medicine.
Wu, Po-Yen; Cheng, Chih-Wen; Kaddi, Chanchala D; Venugopalan, Janani; Hoffman, Ryan; Wang, May D
2017-02-01
Rapid advances of high-throughput technologies and wide adoption of electronic health records (EHRs) have led to fast accumulation of -omic and EHR data. These voluminous complex data contain abundant information for precision medicine, and big data analytics can extract such knowledge to improve the quality of healthcare. In this paper, we present -omic and EHR data characteristics, associated challenges, and data analytics including data preprocessing, mining, and modeling. To demonstrate how big data analytics enables precision medicine, we provide two case studies, including identifying disease biomarkers from multi-omic data and incorporating -omic information into EHR. Big data analytics is able to address -omic and EHR data challenges for paradigm shift toward precision medicine. Big data analytics makes sense of -omic and EHR data to improve healthcare outcome. It has long lasting societal impact.
Next generation microbiological risk assessment-Potential of omics data for hazard characterisation.
Haddad, Nabila; Johnson, Nick; Kathariou, Sophia; Métris, Aline; Phister, Trevor; Pielaat, Annemarie; Tassou, Chrysoula; Wells-Bennik, Marjon H J; Zwietering, Marcel H
2018-04-12
According to the World Health Organization estimates in 2015, 600 million people fall ill every year from contaminated food and 420,000 die. Microbial risk assessment (MRA) was developed as a tool to reduce and prevent risks presented by pathogens and/or their toxins. MRA is organized in four steps to analyse information and assist in both designing appropriate control options and implementation of regulatory decisions and programs. Among the four steps, hazard characterisation is performed to establish the probability and severity of a disease outcome, which is determined as function of the dose of toxin and/or pathogen ingested. This dose-response relationship is subject to both variability and uncertainty. The purpose of this review/opinion article is to discuss how Next Generation Omics can impact hazard characterisation and, more precisely, how it can improve our understanding of variability and limit the uncertainty in the dose-response relation. The expansion of omics tools (e.g. genomics, transcriptomics, proteomics and metabolomics) allows for a better understanding of pathogenicity mechanisms and virulence levels of bacterial strains. Detection and identification of virulence genes, comparative genomics, analyses of mRNA and protein levels and the development of biomarkers can help in building a mechanistic dose-response model to predict disease severity. In this respect, systems biology can help to identify critical system characteristics that confer virulence and explain variability between strains. Despite challenges in the integration of omics into risk assessment, some omics methods have already been used by regulatory agencies for hazard identification. Standardized methods, reproducibility and datasets obtained from realistic conditions remain a challenge, and are needed to improve accuracy of hazard characterisation. When these improvements are realized, they will allow the health authorities and government policy makers to prioritize hazards more accurately and thus refine surveillance programs with the collaboration of all stakeholders of the food chain. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Campos, Nádia A.; Panis, Bart; Carpentier, Sebastien C.
2017-01-01
One of the most important crops cultivated around the world is coffee. There are two main cultivated species, Coffea arabica and C. canephora. Both species are difficult to improve through conventional breeding, taking at least 20 years to produce a new cultivar. Biotechnological tools such as genetic transformation, micropropagation and somatic embryogenesis (SE) have been extensively studied in order to provide practical results for coffee improvement. While genetic transformation got many attention in the past and is booming with the CRISPR technology, micropropagation and SE are still the major bottle neck and urgently need more attention. The methodologies to induce SE and the further development of the embryos are genotype-dependent, what leads to an almost empirical development of specific protocols for each cultivar or clone. This is a serious limitation and excludes a general comprehensive understanding of the process as a whole. The aim of this review is to provide an overview of which achievements and molecular insights have been gained in (coffee) somatic embryogenesis and encourage researchers to invest further in the in vitro technology and combine it with the latest omics techniques (genomics, transcriptomics, proteomics, metabolomics, and phenomics). We conclude that the evolution of biotechnology and the integration of omics technologies offer great opportunities to (i) optimize the production process of SE and the subsequent conversion into rooted plantlets and (ii) to screen for possible somaclonal variation. However, currently the usage of the latest biotechnology did not pass the stage beyond proof of potential and needs to further improve. PMID:28871271
Metabolomics in transfusion medicine.
Nemkov, Travis; Hansen, Kirk C; Dumont, Larry J; D'Alessandro, Angelo
2016-04-01
Biochemical investigations on the regulatory mechanisms of red blood cell (RBC) and platelet (PLT) metabolism have fostered a century of advances in the field of transfusion medicine. Owing to these advances, storage of RBCs and PLT concentrates has become a lifesaving practice in clinical and military settings. There, however, remains room for improvement, especially with regard to the introduction of novel storage and/or rejuvenation solutions, alternative cell processing strategies (e.g., pathogen inactivation technologies), and quality testing (e.g., evaluation of novel containers with alternative plasticizers). Recent advancements in mass spectrometry-based metabolomics and systems biology, the bioinformatics integration of omics data, promise to speed up the design and testing of innovative storage strategies developed to improve the quality, safety, and effectiveness of blood products. Here we review the currently available metabolomics technologies and briefly describe the routine workflow for transfusion medicine-relevant studies. The goal is to provide transfusion medicine experts with adequate tools to navigate through the otherwise overwhelming amount of metabolomics data burgeoning in the field during the past few years. Descriptive metabolomics data have represented the first step omics researchers have taken into the field of transfusion medicine. However, to up the ante, clinical and omics experts will need to merge their expertise to investigate correlative and mechanistic relationships among metabolic variables and transfusion-relevant variables, such as 24-hour in vivo recovery for transfused RBCs. Integration with systems biology models will potentially allow for in silico prediction of metabolic phenotypes, thus streamlining the design and testing of alternative storage strategies and/or solutions. © 2015 AABB.
Predicting the carcinogenicity of chemicals with alternative approaches: recent advances.
Benigni, Romualdo
2014-09-01
Alternative approaches to the rodent bioassay are necessary for early identification of problematic drugs and biocides during the development process, and are the only practicable tool for assessing environmental chemicals with no or adequate safety documentation. This review informs on: i) the traditional prescreening through genotoxicity testing; ii) an integrative approach that assesses DNA-reactivity and ability to disorganize tissues; iii) new applications of omics technologies (ToxCast/Tox21 project); iv) a pragmatic approach aimed at filling data gaps by intrapolating/extrapolating from similar chemicals (read-across, category formation). The review also approaches the issue of the concerns about false-positive and false-negative results that prevents a wider acceptance and use of alternatives. The review addresses strengths and limitations of various proposals, and concludes on the need of differential approaches to the issue of false negatives and false positives. False negatives can be eliminated or reduced below the variability of the animal assay with conservative quantitative structure-activity relationships or in vitro tests; false positives can be cleared with ad hoc mechanistically based follow-ups. This framework can permit a reduction of animal testing and a better protection of human health.
Ray, Sandipan
2016-01-01
Integration of knowledge concerning circadian rhythms, metabolic networks, and sleep‐wake cycles is imperative for unraveling the mysteries of biological cycles and their underlying mechanisms. During the last decade, enormous progress in circadian biology research has provided a plethora of new insights into the molecular architecture of circadian clocks. However, the recent identification of autonomous redox oscillations in cells has expanded our view of the clockwork beyond conventional transcription/translation feedback loop models, which have been dominant since the first circadian period mutants were identified in fruit fly. Consequently, non‐transcriptional timekeeping mechanisms have been proposed, and the antioxidant peroxiredoxin proteins have been identified as conserved markers for 24‐hour rhythms. Here, we review recent advances in our understanding of interdependencies amongst circadian rhythms, sleep homeostasis, redox cycles, and other cellular metabolic networks. We speculate that systems‐level investigations implementing integrated multi‐omics approaches could provide novel mechanistic insights into the connectivity between daily cycles and metabolic systems. PMID:26866932
Zhang, Chi; Zhang, Ge; Chen, Ke-ji; Lu, Ai-ping
2016-04-01
The development of an effective classification method for human health conditions is essential for precise diagnosis and delivery of tailored therapy to individuals. Contemporary classification of disease systems has properties that limit its information content and usability. Chinese medicine pattern classification has been incorporated with disease classification, and this integrated classification method became more precise because of the increased understanding of the molecular mechanisms. However, we are still facing the complexity of diseases and patterns in the classification of health conditions. With continuing advances in omics methodologies and instrumentation, we are proposing a new classification approach: molecular module classification, which is applying molecular modules to classifying human health status. The initiative would be precisely defining the health status, providing accurate diagnoses, optimizing the therapeutics and improving new drug discovery strategy. Therefore, there would be no current disease diagnosis, no disease pattern classification, and in the future, a new medicine based on this classification, molecular module medicine, could redefine health statuses and reshape the clinical practice.
Mantovani, Alberto; Maranghi, Francesca; La Rocca, Cinzia; Tiboni, Gian Mario; Clementi, Maurizio
2008-09-01
The paper discusses current knowledge and possible research priorities on biomarkers of exposure, effect and susceptibility for potential endocrine activities of agrochemicals (dicarboximides, ethylene bisdithiocarbammates, triazoles, etc.). Possible widespread, multiple-pathway exposure to agrochemicals highlights the need to assess internal exposure of animals or humans, which is the most relevant exposure measure for hazard and risk estimation; however, exposure data should be integrated by early indicators predictive of possible health effects, particularly for vulnerable groups such as mother-child pairs. Research need include: non-invasive biomarkers for children biomonitoring; novel biomarkers of total exposure to measure whole endocrine disrupter-related burden; characterization of biomarkers of susceptibility, including the role of markers of nutritional status; anchoring early molecular markers to established toxicological endpoints to support their predictivity; integrating "omics"-based approaches in a system-toxicology framework. As biomonitoring becomes increasingly important in the environment-and-health scenario, toxicologists can substantially contribute both to the characterization of new biomarkers and to the predictivity assessment and improvement of the existing ones.
Ray, Sandipan; Reddy, Akhilesh B
2016-04-01
Integration of knowledge concerning circadian rhythms, metabolic networks, and sleep-wake cycles is imperative for unraveling the mysteries of biological cycles and their underlying mechanisms. During the last decade, enormous progress in circadian biology research has provided a plethora of new insights into the molecular architecture of circadian clocks. However, the recent identification of autonomous redox oscillations in cells has expanded our view of the clockwork beyond conventional transcription/translation feedback loop models, which have been dominant since the first circadian period mutants were identified in fruit fly. Consequently, non-transcriptional timekeeping mechanisms have been proposed, and the antioxidant peroxiredoxin proteins have been identified as conserved markers for 24-hour rhythms. Here, we review recent advances in our understanding of interdependencies amongst circadian rhythms, sleep homeostasis, redox cycles, and other cellular metabolic networks. We speculate that systems-level investigations implementing integrated multi-omics approaches could provide novel mechanistic insights into the connectivity between daily cycles and metabolic systems. © 2016 The Authors. Bioessays published by WILEY Periodicals, Inc.
OMICS-strategies and methods in the fight against doping.
Reichel, Christian
2011-12-10
During the past decade OMICS-methods not only continued to have their impact on research strategies in life sciences and in particular molecular biology, but also started to be used for anti-doping control purposes. Research activities were mainly reasoned by the fact that several substances and methods, which were prohibited by the World Anti-Doping Agency (WADA), were or still are difficult to detect by direct methods. Transcriptomics, proteomics, and metabolomics in theory offer ideal platforms for the discovery of biomarkers for the indirect detection of the abuse of these substances and methods. Traditionally, the main focus of transcriptomics and proteomics projects has been on the prolonged detection of the misuse of human growth hormone (hGH), recombinant erythropoietin (rhEpo), and autologous blood transfusion. An additional benefit of the indirect or marker approach would also be that similarly acting substances might then be detected by a single method, without being forced to develop new direct detection methods for new but comparable prohibited substances (as has been the case, e.g. for the various forms of Epo analogs and biosimilars). While several non-OMICS-derived parameters for the indirect detection of doping are currently in use, for example the blood parameters of the hematological module of the athlete's biological passport, the outcome of most non-targeted OMICS-projects led to no direct application in routine doping control so far. The main reason is the inherent complexity of human transcriptomes, proteomes, and metabolomes and their inter-individual variability. The article reviews previous and recent research projects and their results and discusses future strategies for a more efficient application of OMICS-methods in doping control. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Zoccali, Carmine; Vanholder, Raymond; Massy, Ziad A; Ortiz, Alberto; Sarafidis, Pantelis; Dekker, Friedo W; Fliser, Danilo; Fouque, Denis; Heine, Gunnar H; Jager, Kitty J; Kanbay, Mehmet; Mallamaci, Francesca; Parati, Gianfranco; Rossignol, Patrick; Wiecek, Andrzej; London, Gerard
2017-06-01
The accurate definition and staging of chronic kidney disease (CKD) is one of the major achievements of modern nephrology. Intensive research is now being undertaken to unravel the risk factors and pathophysiologic underpinnings of this disease. In particular, the relationships between the kidney and other organs have been comprehensively investigated in experimental and clinical studies in the last two decades. Owing to technological and analytical limitations, these links have been studied with a reductionist approach focusing on two organs at a time, such as the heart and the kidney or the bone and the kidney. Here, we discuss studies that highlight the complex and systemic nature of CKD. Energy balance, innate immunity and neuroendocrine signalling are highly integrated biological phenomena. The diseased kidney disrupts such integration and generates a high-risk phenotype with a clinical profile encompassing inflammation, protein-energy wasting, altered function of the autonomic and central nervous systems and cardiopulmonary, vascular and bone diseases. A systems biology approach to CKD using omics techniques will hopefully enable in-depth study of the pathophysiology of this systemic disease, and has the potential to unravel critical pathways that can be targeted for CKD prevention and therapy.
Wine biotechnology in South Africa: towards a systems approach to wine science.
Moore, John P; Divol, Benoit; Young, Philip R; Nieuwoudt, Hélène H; Ramburan, Viresh; du Toit, Maret; Bauer, Florian F; Vivier, Melané A
2008-11-01
The wine industry in South Africa is over three centuries old and over the last decade has reemerged as a significant competitor in world wine markets. The Institute for Wine Biotechnology (IWBT) was established in partnership with the Department of Viticulture and Oenology at Stellenbosch University to foster basic fundamental research in the wine sciences leading to applications in the broader wine and grapevine industries. This review focuses on the different research programmes of the Institute (grapevine, yeast and bacteria biotechnology programmes, and chemical-analytical research), commercialisation activities (SunBio) and new initiatives to integrate the various research disciplines. An important focus of future research is the Wine Science Research Niche Area programme, which connects the different research thrusts of the IWBT and of several research partners in viticulture, oenology, food science and chemistry. This 'Functional Wine-omics' programme uses a systems biology approach to wine-related organisms. The data generated within the programme will be integrated with other data sets from viticulture, oenology, analytical chemistry and the sensory sciences through chemometrics and other statistical tools. The aim of the programme is to model aspects of the wine making process, from the vineyard to the finished product.
Özdemir, Vural; Kolker, Eugene; Hotez, Peter J; Mohin, Sophie; Prainsack, Barbara; Wynne, Brian; Vayena, Effy; Coşkun, Yavuz; Dereli, Türkay; Huzair, Farah; Borda-Rodriguez, Alexander; Bragazzi, Nicola Luigi; Faris, Jack; Ramesar, Raj; Wonkam, Ambroise; Dandara, Collet; Nair, Bipin; Llerena, Adrián; Kılıç, Koray; Jain, Rekha; Reddy, Panga Jaipal; Gollapalli, Kishore; Srivastava, Sanjeeva; Kickbusch, Ilona
2014-01-01
Metadata refer to descriptions about data or as some put it, "data about data." Metadata capture what happens on the backstage of science, on the trajectory from study conception, design, funding, implementation, and analysis to reporting. Definitions of metadata vary, but they can include the context information surrounding the practice of science, or data generated as one uses a technology, including transactional information about the user. As the pursuit of knowledge broadens in the 21(st) century from traditional "science of whats" (data) to include "science of hows" (metadata), we analyze the ways in which metadata serve as a catalyst for responsible and open innovation, and by extension, science diplomacy. In 2015, the United Nations Millennium Development Goals (MDGs) will formally come to an end. Therefore, we propose that metadata, as an ingredient of responsible innovation, can help achieve the Sustainable Development Goals (SDGs) on the post-2015 agenda. Such responsible innovation, as a collective learning process, has become a key component, for example, of the European Union's 80 billion Euro Horizon 2020 R&D Program from 2014-2020. Looking ahead, OMICS: A Journal of Integrative Biology, is launching an initiative for a multi-omics metadata checklist that is flexible yet comprehensive, and will enable more complete utilization of single and multi-omics data sets through data harmonization and greater visibility and accessibility. The generation of metadata that shed light on how omics research is carried out, by whom and under what circumstances, will create an "intervention space" for integration of science with its socio-technical context. This will go a long way to addressing responsible innovation for a fairer and more transparent society. If we believe in science, then such reflexive qualities and commitments attained by availability of omics metadata are preconditions for a robust and socially attuned science, which can then remain broadly respected, independent, and responsibly innovative. "In Sierra Leone, we have not too much electricity. The lights will come on once in a week, and the rest of the month, dark[ness]. So I made my own battery to power light in people's houses." Kelvin Doe (Global Minimum, 2012) MIT Visiting Young Innovator Cambridge, USA, and Sierra Leone "An important function of the (Global) R&D Observatory will be to provide support and training to build capacity in the collection and analysis of R&D flows, and how to link them to the product pipeline." World Health Organization (2013) Draft Working Paper on a Global Health R&D Observatory.
The current approach to assessing adverse effects of chemicals in the environment is largely based on a battery of in-vivo study methods and a limited number of accepted in-silico approaches. For most substances the pool of data from which to predict ecosystem effects is limited ...
de Jong, Bouke; Siewers, Verena; Nielsen, Jens
2012-08-01
Transportation fuels will gradually shift from oil based fuels towards alternative fuel resources like biofuels. Current bioethanol and biodiesel can, however, not cover the increasing demand for biofuels and there is therefore a need for advanced biofuels with superior fuel properties. Novel cell factories will provide a production platform for advanced biofuels. However, deep cellular understanding is required for improvement of current biofuel cell factories. Fast screening and analysis (-omics) methods and metabolome-wide mathematical models are promising techniques. An integrated systems approach of these techniques drives diversity and quantity of several new biofuel compounds. This review will cover the recent technological developments that support improvement of the advanced biofuels 1-butanol, biodiesels and jetfuels. Copyright © 2011 Elsevier Ltd. All rights reserved.
U-BIOPRED: evaluation of the value of a public-private partnership to industry.
Riley, John H; Erpenbeck, Veit J; Matthews, J G; Holweg, C T J; Compton, C; Seibold, W; Higenbottam, T; Wagers, S S; Rowe, A; Myles, D
2018-06-21
Unbiased Biomarkers for the Prediction of Respiratory Disease Outcomes (U-BIOPRED) was initiated in the first year of the Innovative Medicines Initiative (IMI). It was an ambitious plan to tackle the understanding of asthma through an integration of clinical and multi-'omics approaches that necessitated the bringing together of industry, academic, and patient representatives because it was too large to be managed by any one of the partners in isolation. It was a novel experience for all concerned. In this review, we describe the main features of the U-BIOPRED experience from the industry perspective. We list some of the key advantages and learnings from the perspective of the authors, and also improvements that we feel could be made in future projects. Copyright © 2018 Elsevier Ltd. All rights reserved.
Doerrer, Nancy; Ladics, Gregory; McClain, Scott; Herouet-Guicheney, Corinne; Poulsen, Lars K; Privalle, Laura; Stagg, Nicola
2010-12-01
The International Life Sciences Institute Health and Environmental Sciences Institute Protein Allergenicity Technical Committee hosted an international workshop November 16-17, 2009, in Paris, France, with over 60 participants from academia, government, and industry to review and discuss the potential utility of "-omics" technologies for assessing the variability in plant gene, protein, and metabolite expression. The goal of the workshop was to illustrate how a plant's constituent makeup and phenotypic processes can be surveyed analytically. Presentations on the "-omics" techniques (i.e., genomics, proteomics, and metabolomics) highlighted the workshop, and summaries of these presentations are published separately in this supplemental issue. This paper summarizes key messages, as well as the consensus points reached, in a roundtable discussion on eight specific questions posed during the final session of the workshop. The workshop established some common, though not unique, challenges for all "-omics" techniques, and include (a) standardization of separation/extraction and analytical techniques; (b) difficulty in associating environmental impacts (e.g., planting, soil texture, location, climate, stress) with potential alterations in plants at genomic, proteomic, and metabolomic levels; (c) many independent analytical measurements, but few replicates/subjects--poorly defined accuracy and precision; and (d) bias--a lack of hypothesis-driven science. Information on natural plant variation is critical in establishing the utility of new technologies due to the variability in specific analytes that may result from genetic differences (crop genotype), different crop management practices (conventional high input, low input, organic), interaction between genotype and environment, and the use of different breeding methods. For example, variations of several classes of proteins were reported among different soybean, rice, or wheat varieties or varieties grown at different locations. Data on the variability of allergenic proteins are important in defining the risk of potential allergenicity. Once established as a standardized assay, survey approaches such as the "-omics" techniques can be considered in a hypothesis-driven analysis of plants, such as determining unintended effects in genetically modified (GM) crops. However, the analysis should include both the GM and control varieties that have the same breeding history and exposure to the same environmental conditions. Importantly, the biological relevance and safety significance of changes in "-omic" data are still unknown. Furthermore, the current compositional assessment for evaluating the substantial equivalence of GM crops is robust, comprehensive, and a good tool for food safety assessments. The overall consensus of the workshop participants was that many "-omics" techniques are extremely useful in the discovery and research phases of biotechnology, and are valuable for hypothesis generation. However, there are many methodological shortcomings identified with "-omics" approaches, a paucity of reference materials, and a lack of focused strategy for their use that currently make them not conducive for the safety assessment of GM crops. Copyright © 2010 Elsevier Inc. All rights reserved.
Computational challenges in modeling gene regulatory events.
Pataskar, Abhijeet; Tiwari, Vijay K
2016-10-19
Cellular transcriptional programs driven by genetic and epigenetic mechanisms could be better understood by integrating "omics" data and subsequently modeling the gene-regulatory events. Toward this end, computational biology should keep pace with evolving experimental procedures and data availability. This article gives an exemplified account of the current computational challenges in molecular biology.
ICM: a web server for integrated clustering of multi-dimensional biomedical data.
He, Song; He, Haochen; Xu, Wenjian; Huang, Xin; Jiang, Shuai; Li, Fei; He, Fuchu; Bo, Xiaochen
2016-07-08
Large-scale efforts for parallel acquisition of multi-omics profiling continue to generate extensive amounts of multi-dimensional biomedical data. Thus, integrated clustering of multiple types of omics data is essential for developing individual-based treatments and precision medicine. However, while rapid progress has been made, methods for integrated clustering are lacking an intuitive web interface that facilitates the biomedical researchers without sufficient programming skills. Here, we present a web tool, named Integrated Clustering of Multi-dimensional biomedical data (ICM), that provides an interface from which to fuse, cluster and visualize multi-dimensional biomedical data and knowledge. With ICM, users can explore the heterogeneity of a disease or a biological process by identifying subgroups of patients. The results obtained can then be interactively modified by using an intuitive user interface. Researchers can also exchange the results from ICM with collaborators via a web link containing a Project ID number that will directly pull up the analysis results being shared. ICM also support incremental clustering that allows users to add new sample data into the data of a previous study to obtain a clustering result. Currently, the ICM web server is available with no login requirement and at no cost at http://biotech.bmi.ac.cn/icm/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Rinschen, Markus M; Gödel, Markus; Grahammer, Florian; Zschiedrich, Stefan; Helmstädter, Martin; Kretz, Oliver; Zarei, Mostafa; Braun, Daniela A; Dittrich, Sebastian; Pahmeyer, Caroline; Schroder, Patricia; Teetzen, Carolin; Gee, HeonYung; Daouk, Ghaleb; Pohl, Martin; Kuhn, Elisa; Schermer, Bernhard; Küttner, Victoria; Boerries, Melanie; Busch, Hauke; Schiffer, Mario; Bergmann, Carsten; Krüger, Marcus; Hildebrandt, Friedhelm; Dengjel, Joern; Benzing, Thomas; Huber, Tobias B
2018-05-22
Damage to and loss of glomerular podocytes has been identified as the culprit lesion in progressive kidney diseases. Here, we combine mass spectrometry-based proteomics with mRNA sequencing, bioinformatics, and hypothesis-driven studies to provide a comprehensive and quantitative map of mammalian podocytes that identifies unanticipated signaling pathways. Comparison of the in vivo datasets with proteomics data from podocyte cell cultures showed a limited value of available cell culture models. Moreover, in vivo stable isotope labeling by amino acids uncovered surprisingly rapid synthesis of mitochondrial proteins under steady-state conditions that was perturbed under autophagy-deficient, disease-susceptible conditions. Integration of acquired omics dimensions suggested FARP1 as a candidate essential for podocyte function, which could be substantiated by genetic analysis in humans and knockdown experiments in zebrafish. This work exemplifies how the integration of multi-omics datasets can identify a framework of cell-type-specific features relevant for organ health and disease. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Crop improvement using life cycle datasets acquired under field conditions.
Mochida, Keiichi; Saisho, Daisuke; Hirayama, Takashi
2015-01-01
Crops are exposed to various environmental stresses in the field throughout their life cycle. Modern plant science has provided remarkable insights into the molecular networks of plant stress responses in laboratory conditions, but the responses of different crops to environmental stresses in the field need to be elucidated. Recent advances in omics analytical techniques and information technology have enabled us to integrate data from a spectrum of physiological metrics of field crops. The interdisciplinary efforts of plant science and data science enable us to explore factors that affect crop productivity and identify stress tolerance-related genes and alleles. Here, we describe recent advances in technologies that are key components for data driven crop design, such as population genomics, chronological omics analyses, and computer-aided molecular network prediction. Integration of the outcomes from these technologies will accelerate our understanding of crop phenology under practical field situations and identify key characteristics to represent crop stress status. These elements would help us to genetically engineer "designed crops" to prevent yield shortfalls because of environmental fluctuations due to future climate change.
-Omic and Electronic Health Records Big Data Analytics for Precision Medicine
Wu, Po-Yen; Cheng, Chih-Wen; Kaddi, Chanchala D.; Venugopalan, Janani; Hoffman, Ryan; Wang, May D.
2017-01-01
Objective Rapid advances of high-throughput technologies and wide adoption of electronic health records (EHRs) have led to fast accumulation of -omic and EHR data. These voluminous complex data contain abundant information for precision medicine, and big data analytics can extract such knowledge to improve the quality of health care. Methods In this article, we present -omic and EHR data characteristics, associated challenges, and data analytics including data pre-processing, mining, and modeling. Results To demonstrate how big data analytics enables precision medicine, we provide two case studies, including identifying disease biomarkers from multi-omic data and incorporating -omic information into EHR. Conclusion Big data analytics is able to address –omic and EHR data challenges for paradigm shift towards precision medicine. Significance Big data analytics makes sense of –omic and EHR data to improve healthcare outcome. It has long lasting societal impact. PMID:27740470
ICan: an integrated co-alteration network to identify ovarian cancer-related genes.
Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan
2015-01-01
Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes
Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan
2015-01-01
Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614
Chasman, Deborah; Walters, Kevin B.; Lopes, Tiago J. S.; Eisfeld, Amie J.; Kawaoka, Yoshihiro; Roy, Sushmita
2016-01-01
Mammalian host response to pathogenic infections is controlled by a complex regulatory network connecting regulatory proteins such as transcription factors and signaling proteins to target genes. An important challenge in infectious disease research is to understand molecular similarities and differences in mammalian host response to diverse sets of pathogens. Recently, systems biology studies have produced rich collections of omic profiles measuring host response to infectious agents such as influenza viruses at multiple levels. To gain a comprehensive understanding of the regulatory network driving host response to multiple infectious agents, we integrated host transcriptomes and proteomes using a network-based approach. Our approach combines expression-based regulatory network inference, structured-sparsity based regression, and network information flow to infer putative physical regulatory programs for expression modules. We applied our approach to identify regulatory networks, modules and subnetworks that drive host response to multiple influenza infections. The inferred regulatory network and modules are significantly enriched for known pathways of immune response and implicate apoptosis, splicing, and interferon signaling processes in the differential response of viral infections of different pathogenicities. We used the learned network to prioritize regulators and study virus and time-point specific networks. RNAi-based knockdown of predicted regulators had significant impact on viral replication and include several previously unknown regulators. Taken together, our integrated analysis identified novel module level patterns that capture strain and pathogenicity-specific patterns of expression and helped identify important regulators of host response to influenza infection. PMID:27403523
Enabling individualized therapy through nanotechnology.
Sakamoto, Jason H; van de Ven, Anne L; Godin, Biana; Blanco, Elvin; Serda, Rita E; Grattoni, Alessandro; Ziemys, Arturas; Bouamrani, Ali; Hu, Tony; Ranganathan, Shivakumar I; De Rosa, Enrica; Martinez, Jonathan O; Smid, Christine A; Buchanan, Rachel M; Lee, Sei-Young; Srinivasan, Srimeenakshi; Landry, Matthew; Meyn, Anne; Tasciotti, Ennio; Liu, Xuewu; Decuzzi, Paolo; Ferrari, Mauro
2010-08-01
Individualized medicine is the healthcare strategy that rebukes the idiomatic dogma of 'losing sight of the forest for the trees'. We are entering a new era of healthcare where it is no longer acceptable to develop and market a drug that is effective for only 80% of the patient population. The emergence of "-omic" technologies (e.g. genomics, transcriptomics, proteomics, metabolomics) and advances in systems biology are magnifying the deficiencies of standardized therapy, which often provide little treatment latitude for accommodating patient physiologic idiosyncrasies. A personalized approach to medicine is not a novel concept. Ever since the scientific community began unraveling the mysteries of the genome, the promise of discarding generic treatment regimens in favor of patient-specific therapies became more feasible and realistic. One of the major scientific impediments of this movement towards personalized medicine has been the need for technological enablement. Nanotechnology is projected to play a critical role in patient-specific therapy; however, this transition will depend heavily upon the evolutionary development of a systems biology approach to clinical medicine based upon "-omic" technology analysis and integration. This manuscript provides a forward looking assessment of the promise of nanomedicine as it pertains to individualized medicine and establishes a technology "snapshot" of the current state of nano-based products over a vast array of clinical indications and range of patient specificity. Other issues such as market driven hurdles and regulatory compliance reform are anticipated to "self-correct" in accordance to scientific advancement and healthcare demand. These peripheral, non-scientific concerns are not addressed at length in this manuscript; however they do exist, and their impact to the paradigm shifting healthcare transformation towards individualized medicine will be critical for its success. Copyright 2010 Elsevier Ltd. All rights reserved.
Fernando, Ruani N; Chaudhari, Umesh; Escher, Sylvia E; Hengstler, Jan G; Hescheler, Jürgen; Jennings, Paul; Keun, Hector C; Kleinjans, Jos C S; Kolde, Raivo; Kollipara, Laxmikanth; Kopp-Schneider, Annette; Limonciel, Alice; Nemade, Harshal; Nguemo, Filomain; Peterson, Hedi; Prieto, Pilar; Rodrigues, Robim M; Sachinidis, Agapios; Schäfer, Christoph; Sickmann, Albert; Spitkovsky, Dimitry; Stöber, Regina; van Breda, Simone G J; van de Water, Bob; Vivier, Manon; Zahedi, René P; Vinken, Mathieu; Rogiers, Vera
2016-06-01
SEURAT-1 is a joint research initiative between the European Commission and Cosmetics Europe aiming to develop in vitro- and in silico-based methods to replace the in vivo repeated dose systemic toxicity test used for the assessment of human safety. As one of the building blocks of SEURAT-1, the DETECTIVE project focused on a key element on which in vitro toxicity testing relies: the development of robust and reliable, sensitive and specific in vitro biomarkers and surrogate endpoints that can be used for safety assessments of chronically acting toxicants, relevant for humans. The work conducted by the DETECTIVE consortium partners has established a screening pipeline of functional and "-omics" technologies, including high-content and high-throughput screening platforms, to develop and investigate human biomarkers for repeated dose toxicity in cellular in vitro models. Identification and statistical selection of highly predictive biomarkers in a pathway- and evidence-based approach constitute a major step in an integrated approach towards the replacement of animal testing in human safety assessment. To discuss the final outcomes and achievements of the consortium, a meeting was organized in Brussels. This meeting brought together data-producing and supporting consortium partners. The presentations focused on the current state of ongoing and concluding projects and the strategies employed to identify new relevant biomarkers of toxicity. The outcomes and deliverables, including the dissemination of results in data-rich "-omics" databases, were discussed as were the future perspectives of the work completed under the DETECTIVE project. Although some projects were still in progress and required continued data analysis, this report summarizes the presentations, discussions and the outcomes of the project.
Barallobre-Barreiro, Javier; Chung, Yuen-Li; Mayr, Manuel
2013-08-01
In the last decade, proteomics and metabolomics have contributed substantially to our understanding of cardiovascular diseases. The unbiased assessment of pathophysiological processes without a priori assumptions complements other molecular biology techniques that are currently used in a reductionist approach. In this review, we highlight some of the "omics" methods used to assess protein and metabolite changes in cardiovascular disease. A discrete biological function is very rarely attributed to a single molecule; more often it is the combined input of many proteins. In contrast to the reductionist approach, in which molecules are studied individually, "omics" platforms allow the study of more complex interactions in biological systems. Combining proteomics and metabolomics to quantify changes in metabolites and their corresponding enzymes will advance our understanding of pathophysiological mechanisms and aid the identification of novel biomarkers for cardiovascular disease. Copyright © 2013 Sociedad Española de Cardiología. Published by Elsevier Espana. All rights reserved.
Lessons from Digestive-Tract Symbioses Between Bacteria and Invertebrates.
Graf, Joerg
2016-09-08
In most animals, digestive tracts harbor the greatest number of bacteria in the animal that contribute to its health: by aiding in the digestion of nutrients, provisioning essential nutrients and protecting against colonization by pathogens. Invertebrates have been used to enhance our understanding of metabolic processes and microbe-host interactions owing to experimental advantages. This review describes how advances in DNA sequencing technologies have dramatically altered how researchers investigate microbe-host interactions, including 16S rRNA gene surveys, metagenome experiments, and metatranscriptome studies. Advantages and challenges of each of these approaches are described herein. Hypotheses generated through omics studies can be directly tested using site-directed mutagenesis, and findings from transposon studies and site-directed experiments are presented. Finally, unique structural aspects of invertebrate digestive tracts that contribute to symbiont specificity are presented. The combination of omics approaches with genetics and microscopy allows researchers to move beyond correlations to identify conserved mechanisms of microbe-host interactions.
Ping, Yanyan; Deng, Yulan; Wang, Li; Zhang, Hongyi; Zhang, Yong; Xu, Chaohan; Zhao, Hongying; Fan, Huihui; Yu, Fulong; Xiao, Yun; Li, Xia
2015-01-01
The driver genetic aberrations collectively regulate core cellular processes underlying cancer development. However, identifying the modules of driver genetic alterations and characterizing their functional mechanisms are still major challenges for cancer studies. Here, we developed an integrative multi-omics method CMDD to identify the driver modules and their affecting dysregulated genes through characterizing genetic alteration-induced dysregulated networks. Applied to glioblastoma (GBM), the CMDD identified a core gene module of 17 genes, including seven known GBM drivers, and their dysregulated genes. The module showed significant association with shorter survival of GBM. When classifying driver genes in the module into two gene sets according to their genetic alteration patterns, we found that one gene set directly participated in the glioma pathway, while the other indirectly regulated the glioma pathway, mostly, via their dysregulated genes. Both of the two gene sets were significant contributors to survival and helpful for classifying GBM subtypes, suggesting their critical roles in GBM pathogenesis. Also, by applying the CMDD to other six cancers, we identified some novel core modules associated with overall survival of patients. Together, these results demonstrate integrative multi-omics data can identify driver modules and uncover their dysregulated genes, which is useful for interpreting cancer genome. PMID:25653168
Mulrane, Laoighse; Rexhepaj, Elton; Smart, Valerie; Callanan, John J; Orhan, Diclehan; Eldem, Türkan; Mally, Angela; Schroeder, Susanne; Meyer, Kirstin; Wendt, Maria; O'Shea, Donal; Gallagher, William M
2008-08-01
The widespread use of digital slides has only recently come to the fore with the development of high-throughput scanners and high performance viewing software. This development, along with the optimisation of compression standards and image transfer techniques, has allowed the technology to be used in wide reaching applications including integration of images into hospital information systems and histopathological training, as well as the development of automated image analysis algorithms for prediction of histological aberrations and quantification of immunohistochemical stains. Here, the use of this technology in the creation of a comprehensive library of images of preclinical toxicological relevance is demonstrated. The images, acquired using the Aperio ScanScope CS and XT slide acquisition systems, form part of the ongoing EU FP6 Integrated Project, Innovative Medicines for Europe (InnoMed). In more detail, PredTox (abbreviation for Predictive Toxicology) is a subproject of InnoMed and comprises a consortium of 15 industrial (13 large pharma, 1 technology provider and 1 SME) and three academic partners. The primary aim of this consortium is to assess the value of combining data generated from 'omics technologies (proteomics, transcriptomics, metabolomics) with the results from more conventional toxicology methods, to facilitate further informed decision making in preclinical safety evaluation. A library of 1709 scanned images was created of full-face sections of liver and kidney tissue specimens from male Wistar rats treated with 16 proprietary and reference compounds of known toxicity; additional biological materials from these treated animals were separately used to create 'omics data, that will ultimately be used to populate an integrated toxicological database. In respect to assessment of the digital slides, a web-enabled digital slide management system, Digital SlideServer (DSS), was employed to enable integration of the digital slide content into the 'omics database and to facilitate remote viewing by pathologists connected with the project. DSS also facilitated manual annotation of digital slides by the pathologists, specifically in relation to marking particular lesions of interest. Tissue microarrays (TMAs) were constructed from the specimens for the purpose of creating a repository of tissue from animals used in the study with a view to later-stage biomarker assessment. As the PredTox consortium itself aims to identify new biomarkers of toxicity, these TMAs will be a valuable means of validation. In summary, a large repository of histological images was created enabling the subsequent pathological analysis of samples through remote viewing and, along with the utilisation of TMA technology, will allow the validation of biomarkers identified by the PredTox consortium. The population of the PredTox database with these digitised images represents the creation of the first toxicological database integrating 'omics and preclinical data with histological images.
Bayesian hierarchical model for large-scale covariance matrix estimation.
Zhu, Dongxiao; Hero, Alfred O
2007-12-01
Many bioinformatics problems implicitly depend on estimating large-scale covariance matrix. The traditional approaches tend to give rise to high variance and low accuracy due to "overfitting." We cast the large-scale covariance matrix estimation problem into the Bayesian hierarchical model framework, and introduce dependency between covariance parameters. We demonstrate the advantages of our approaches over the traditional approaches using simulations and OMICS data analysis.
The human gut microbiome and its dysfunctions through the meta-omics prism.
Mondot, Stanislas; Lepage, Patricia
2016-05-01
The microorganisms inhabiting the human gut are abundant (10(14) cells) and diverse (approximately 500 species per individual). It is now acknowledged that the microbiota has coevolved with its host to achieve a symbiotic relationship, leading to physiological homeostasis. The gut microbiota ensures vital functions, such as food digestibility, maturation of the host immune system, and protection against pathogens. Over the last few decades, the gut microbiota has also been associated with numerous diseases, such as inflammatory bowel disease, irritable bowel syndrome, obesity, and metabolic diseases. In most of these pathologies, a microbial dysbiosis has been found, indicating shifts in the taxonomic composition of the gut microbiota and changes in its functionality. Our understanding of the influence of the gut microbiota on human health is still growing. Working with microorganisms residing in the gut is challenging since most of them are anaerobic and a vast majority (approximately 75%) are uncultivable to date. Recently, a wide range of new approaches (meta-omics) has been developed to bypass the uncultivability and reveal the intricate mechanisms that sustain gut microbial homeostasis. After a brief description of these approaches (metagenomics, metatranscriptomics, metaproteomics, and metabolomics), this review will discuss the importance of considering the gut microbiome as a structured ecosystem and the use of meta-omics to decipher dysfunctions of the gut microbiome in diseases. © 2016 New York Academy of Sciences.
Comparative Omics-Driven Genome Annotation Refinement: Application across Yersiniae
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rutledge, Alexandra C.; Jones, Marcus B.; Chauhan, Sadhana
2012-03-27
Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. To date, the perceived value of manual curation for genome annotations is not offset by the real cost and time associated with the process. In order to balance the large number of sequences generated, the annotation process is now performed almost exclusively in an automated fashion for most genome sequencing projects. One possible way to reduce errors inherent to automated computational annotations is to apply data from 'omics' measurements (i.e. transcriptional and proteomic) to themore » un-annotated genome with a proteogenomic-based approach. This approach does require additional experimental and bioinformatics methods to include omics technologies; however, the approach is readily automatable and can benefit from rapid developments occurring in those research domains as well. The annotation process can be improved by experimental validation of transcription and translation and aid in the discovery of annotation errors. Here the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species, as is becoming common in sequencing efforts. Transcriptomic and proteomic data derived from three highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 previously incorrect protein-coding sequences (e.g., observed frameshifts, extended start sites, and translated pseudogenes) within the three current Yersinia genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus the discovery of many translated pseudogenes underscores a need for functional analyses to investigate hypotheses related to divergence. Refinements included the discovery of a seemingly essential ribosomal protein, several virulence-associated factors, and a transcriptional regulator, among other proteins, most of which are annotated as hypothetical, that were missed during annotation.« less
Vafaee, Fatemeh; Diakos, Connie; Kirschner, Michaela B; Reid, Glen; Michael, Michael Z; Horvath, Lisa G; Alinejad-Rokny, Hamid; Cheng, Zhangkai Jason; Kuncic, Zdenka; Clarke, Stephen
2018-01-01
Recent advances in high-throughput technologies have provided an unprecedented opportunity to identify molecular markers of disease processes. This plethora of complex-omics data has simultaneously complicated the problem of extracting meaningful molecular signatures and opened up new opportunities for more sophisticated integrative and holistic approaches. In this era, effective integration of data-driven and knowledge-based approaches for biomarker identification has been recognised as key to improving the identification of high-performance biomarkers, and necessary for translational applications. Here, we have evaluated the role of circulating microRNA as a means of predicting the prognosis of patients with colorectal cancer, which is the second leading cause of cancer-related death worldwide. We have developed a multi-objective optimisation method that effectively integrates a data-driven approach with the knowledge obtained from the microRNA-mediated regulatory network to identify robust plasma microRNA signatures which are reliable in terms of predictive power as well as functional relevance. The proposed multi-objective framework has the capacity to adjust for conflicting biomarker objectives and to incorporate heterogeneous information facilitating systems approaches to biomarker discovery. We have found a prognostic signature of colorectal cancer comprising 11 circulating microRNAs. The identified signature predicts the patients' survival outcome and targets pathways underlying colorectal cancer progression. The altered expression of the identified microRNAs was confirmed in an independent public data set of plasma samples of patients in early stage vs advanced colorectal cancer. Furthermore, the generality of the proposed method was demonstrated across three publicly available miRNA data sets associated with biomarker studies in other diseases.
Mildew-Omics: How Global Analyses Aid the Understanding of Life and Evolution of Powdery Mildews
Bindschedler, Laurence V.; Panstruga, Ralph; Spanu, Pietro D.
2016-01-01
The common powdery mildew plant diseases are caused by ascomycete fungi of the order Erysiphales. Their characteristic life style as obligate biotrophs renders functional analyses in these species challenging, mainly because of experimental constraints to genetic manipulation. Global large-scale (“-omics”) approaches are thus particularly valuable and insightful for the characterisation of the life and evolution of powdery mildews. Here we review the knowledge obtained so far from genomic, transcriptomic and proteomic studies in these fungi. We consider current limitations and challenges regarding these surveys and provide an outlook on desired future investigations on the basis of the various –omics technologies. PMID:26913042
The use of 'Omics technology to rationally improve industrial mammalian cell line performance.
Lewis, Amanda M; Abu-Absi, Nicholas R; Borys, Michael C; Li, Zheng Jian
2016-01-01
Biologics represent an increasingly important class of therapeutics, with 7 of the 10 top selling drugs from 2013 being in this class. Furthermore, health authority approval of biologics in the immuno-oncology space is expected to transform treatment of patients with debilitating and deadly diseases. The growing importance of biologics in the healthcare field has also resulted in the recent approvals of several biosimilars. These recent developments, combined with pressure to provide treatments at lower costs to payers, are resulting in increasing need for the industry to quickly and efficiently develop high yielding, robust processes for the manufacture of biologics with the ability to control quality attributes within narrow distributions. Achieving this level of manufacturing efficiency and the ability to design processes capable of regulating growth, death and other cellular pathways through manipulation of media, feeding strategies, and other process parameters will undoubtedly be facilitated through systems biology tools generated in academic and public research communities. Here we discuss the intersection of systems biology, 'Omics technologies, and mammalian bioprocess sciences. Specifically, we address how these methods in conjunction with traditional monitoring techniques represent a unique opportunity to better characterize and understand host cell culture state, shift from an empirical to rational approach to process development and optimization of bioreactor cultivation processes. We summarize the following six key areas: (i) research applied to parental, non-recombinant cell lines; (ii) systems level datasets generated with recombinant cell lines; (iii) datasets linking phenotypic traits to relevant biomarkers; (iv) data depositories and bioinformatics tools; (v) in silico model development, and (vi) examples where these approaches have been used to rationally improve cellular processes. We critically assess relevant and state of the art research being conducted in academic, government and industrial laboratories. Furthermore, we apply our expertise in bioprocess to define a potential model for integration of these systems biology approaches into biologics development. © 2015 Wiley Periodicals, Inc.
NMR-based Metabolomics for Cancer Research
Metabolomics is considered as a complementary tool to other omics platforms to provide a snapshot of the cellular biochemistry and physiology taking place at any instant. Metabolmics approaches have been widely used to provide comprehensive and quantitative analyses of the metabo...
iMETHYL: an integrative database of human DNA methylation, gene expression, and genomic variation.
Komaki, Shohei; Shiwa, Yuh; Furukawa, Ryohei; Hachiya, Tsuyoshi; Ohmomo, Hideki; Otomo, Ryo; Satoh, Mamoru; Hitomi, Jiro; Sobue, Kenji; Sasaki, Makoto; Shimizu, Atsushi
2018-01-01
We launched an integrative multi-omics database, iMETHYL (http://imethyl.iwate-megabank.org). iMETHYL provides whole-DNA methylation (~24 million autosomal CpG sites), whole-genome (~9 million single-nucleotide variants), and whole-transcriptome (>14 000 genes) data for CD4 + T-lymphocytes, monocytes, and neutrophils collected from approximately 100 subjects. These data were obtained from whole-genome bisulfite sequencing, whole-genome sequencing, and whole-transcriptome sequencing, making iMETHYL a comprehensive database.
Bersanelli, Matteo; Mosca, Ettore; Remondini, Daniel; Castellani, Gastone; Milanesi, Luciano
2016-01-01
A relation exists between network proximity of molecular entities in interaction networks, functional similarity and association with diseases. The identification of network regions associated with biological functions and pathologies is a major goal in systems biology. We describe a network diffusion-based pipeline for the interpretation of different types of omics in the context of molecular interaction networks. We introduce the network smoothing index, a network-based quantity that allows to jointly quantify the amount of omics information in genes and in their network neighbourhood, using network diffusion to define network proximity. The approach is applicable to both descriptive and inferential statistics calculated on omics data. We also show that network resampling, applied to gene lists ranked by quantities derived from the network smoothing index, indicates the presence of significantly connected genes. As a proof of principle, we identified gene modules enriched in somatic mutations and transcriptional variations observed in samples of prostate adenocarcinoma (PRAD). In line with the local hypothesis, network smoothing index and network resampling underlined the existence of a connected component of genes harbouring molecular alterations in PRAD. PMID:27731320
High-Performance Mixed Models Based Genome-Wide Association Analysis with omicABEL software
Fabregat-Traver, Diego; Sharapov, Sodbo Zh.; Hayward, Caroline; Rudan, Igor; Campbell, Harry; Aulchenko, Yurii; Bientinesi, Paolo
2014-01-01
To raise the power of genome-wide association studies (GWAS) and avoid false-positive results in structured populations, one can rely on mixed model based tests. When large samples are used, and when multiple traits are to be studied in the ’omics’ context, this approach becomes computationally challenging. Here we consider the problem of mixed-model based GWAS for arbitrary number of traits, and demonstrate that for the analysis of single-trait and multiple-trait scenarios different computational algorithms are optimal. We implement these optimal algorithms in a high-performance computing framework that uses state-of-the-art linear algebra kernels, incorporates optimizations, and avoids redundant computations, increasing throughput while reducing memory usage and energy consumption. We show that, compared to existing libraries, our algorithms and software achieve considerable speed-ups. The OmicABEL software described in this manuscript is available under the GNU GPL v. 3 license as part of the GenABEL project for statistical genomics at http: //www.genabel.org/packages/OmicABEL. PMID:25717363
Pandey, Vishakha; Singh, Manoj; Pandey, Dinesh; Kumar, Anil
2018-05-18
Tilletia indica incites Karnal bunt (KB) disease in wheat. To date, no KB resistant wheat cultivar could be developed due to non-availability of potential biomarkers related to pathogenicity/virulence for screening of resistant wheat genotypes. The present study was carried out to compare the proteomes of T. indica highly (TiK) and low (TiP) virulent isolates. Twenty one protein spots consistently observed as up-regulated/differential in the TiK proteome were selected for identification by MALDI-TOF/TOF. Identified sequences showed homology with fungal proteins playing essential role in plant infection and pathogen survival, including stress response, adhesion, fungal penetration, invasion, colonization, degradation of host cell wall, signal transduction pathway. These results were integrated with T. indica genome sequence for identification of homologs of candidate pathogenicity/virulence related proteins. Protein identified in TiK isolate as malate dehydrogenase that converts malate to oxaloacetate which is precursor of oxalic acid. Oxalic acid is key pathogenicity factor in phytopathogenic fungi. These results were validated by GC-MS based metabolic profiling of T. indica isolates indicating that oxalic acid was exclusively identified in TiK isolate. Thus, integrated omics approaches leads to identification of pathogenicity/virulence factor(s) that would provide insights into pathogenic mechanisms of fungi and aid in devising effective disease management strategies.
Nagel, Zachary D; Engelward, Bevin P; Brenner, David J; Begley, Thomas J; Sobol, Robert W; Bielas, Jason H; Stambrook, Peter J; Wei, Qingyi; Hu, Jennifer J; Terry, Mary Beth; Dilworth, Caroline; McAllister, Kimberly A; Reinlib, Les; Worth, Leroy; Shaughnessy, Daniel T
2017-08-01
The rise of advanced technologies for characterizing human populations at the molecular level, from sequence to function, is shifting disease prevention paradigms toward personalized strategies. Because minimization of adverse outcomes is a key driver for treatment decisions for diseased populations, developing personalized therapy strategies represent an important dimension of both precision medicine and personalized prevention. In this commentary, we highlight recently developed enabling technologies in the field of DNA damage, DNA repair, and mutagenesis. We propose that omics approaches and functional assays can be integrated into population studies that fuse basic, translational and clinical research with commercial expertise in order to accelerate personalized prevention and treatment of cancer and other diseases linked to aberrant responses to DNA damage. This collaborative approach is generally applicable to efforts to develop data-driven, individualized prevention and treatment strategies for other diseases. We also recommend strategies for maximizing the use of biological samples for epidemiological studies, and for applying emerging technologies to clinical applications. Copyright © 2017 Elsevier B.V. All rights reserved.
Reverse Genetics and High Throughput Sequencing Methodologies for Plant Functional Genomics
Ben-Amar, Anis; Daldoul, Samia; Reustle, Götz M.; Krczal, Gabriele; Mliki, Ahmed
2016-01-01
In the post-genomic era, increasingly sophisticated genetic tools are being developed with the long-term goal of understanding how the coordinated activity of genes gives rise to a complex organism. With the advent of the next generation sequencing associated with effective computational approaches, wide variety of plant species have been fully sequenced giving a wealth of data sequence information on structure and organization of plant genomes. Since thousands of gene sequences are already known, recently developed functional genomics approaches provide powerful tools to analyze plant gene functions through various gene manipulation technologies. Integration of different omics platforms along with gene annotation and computational analysis may elucidate a complete view in a system biology level. Extensive investigations on reverse genetics methodologies were deployed for assigning biological function to a specific gene or gene product. We provide here an updated overview of these high throughout strategies highlighting recent advances in the knowledge of functional genomics in plants. PMID:28217003
Nagel, Zachary D.; Engelward, Bevin P.; Brenner, David J.; Begley, Thomas J.; Sobol, Robert W.; Bielas, Jason H.; Stambrook, Peter J.; Wei, Qingyi; Hu, Jennifer J.; Terry, Mary Beth; Dilworth, Caroline; McAllister, Kimberly A.; Reinlib, Les; Worth, Leroy; Shaughnessy, Daniel T.
2018-01-01
The rise of advanced technologies for characterizing human populations at the molecular level, from sequence to function, is shifting disease prevention paradigms toward personalized strategies. Because minimization of adverse outcomes is a key driver for treatment decisions for diseased populations, developing personalized therapy strategies represent an important dimension of both precision medicine and personalized prevention. In this commentary, we highlight recently developed enabling technologies in the field of DNA damage, DNA repair, and mutagenesis. We propose that omics approaches and functional assays can be integrated into population studies that fuse basic, translational and clinical research with commercial expertise in order to accelerate personalized prevention and treatment of cancer and other diseases linked to aberrant responses to DNA damage. This collaborative approach is generally applicable to efforts to develop data-driven, individualized prevention and treatment strategies for other diseases. We also recommend strategies for maximizing the use of biological samples for epidemiological studies, and for applying emerging technologies to clinical applications. PMID:28458064
Systems Biology of Metabolic Regulation by Estrogen Receptor Signaling in Breast Cancer.
Zhao, Yiru Chen; Madak Erdogan, Zeynep
2016-03-17
With the advent of the -omics approaches our understanding of the chronic diseases like cancer and metabolic syndrome has improved. However, effective mining of the information in the large-scale datasets that are obtained from gene expression microarrays, deep sequencing experiments or metabolic profiling is essential to uncover and then effectively target the critical regulators of diseased cell phenotypes. Estrogen Receptor α (ERα) is one of the master transcription factors regulating the gene programs that are important for estrogen responsive breast cancers. In order to understand to role of ERα signaling in breast cancer metabolism we utilized transcriptomic, cistromic and metabolomic data from MCF-7 cells treated with estradiol. In this report we described generation of samples for RNA-Seq, ChIP-Seq and metabolomics experiments and the integrative computational analysis of the obtained data. This approach is useful in delineating novel molecular mechanisms and gene regulatory circuits that are regulated by a particular transcription factor which impacts metabolism of normal or diseased cells.
Yan, Kang K; Zhao, Hongyu; Pang, Herbert
2017-12-06
High-throughput sequencing data are widely collected and analyzed in the study of complex diseases in quest of improving human health. Well-studied algorithms mostly deal with single data source, and cannot fully utilize the potential of these multi-omics data sources. In order to provide a holistic understanding of human health and diseases, it is necessary to integrate multiple data sources. Several algorithms have been proposed so far, however, a comprehensive comparison of data integration algorithms for classification of binary traits is currently lacking. In this paper, we focus on two common classes of integration algorithms, graph-based that depict relationships with subjects denoted by nodes and relationships denoted by edges, and kernel-based that can generate a classifier in feature space. Our paper provides a comprehensive comparison of their performance in terms of various measurements of classification accuracy and computation time. Seven different integration algorithms, including graph-based semi-supervised learning, graph sharpening integration, composite association network, Bayesian network, semi-definite programming-support vector machine (SDP-SVM), relevance vector machine (RVM) and Ada-boost relevance vector machine are compared and evaluated with hypertension and two cancer data sets in our study. In general, kernel-based algorithms create more complex models and require longer computation time, but they tend to perform better than graph-based algorithms. The performance of graph-based algorithms has the advantage of being faster computationally. The empirical results demonstrate that composite association network, relevance vector machine, and Ada-boost RVM are the better performers. We provide recommendations on how to choose an appropriate algorithm for integrating data from multiple sources.
Watson for Genomics: Moving Personalized Medicine Forward.
Rhrissorrakrai, Kahn; Koyama, Takahiko; Parida, Laxmi
2016-08-01
The confluence of genomic technologies and cognitive computing has brought us to the doorstep of widespread usage of personalized medicine. Cognitive systems, such as Watson for Genomics (WG), integrate massive amounts of new omic data with the current body of knowledge to assist physicians in analyzing and acting on patient's genomic profiles. Copyright © 2016 Elsevier Inc. All rights reserved.
Integration of Molecular Pathology, Epidemiology, and Social Science for Global Precision Medicine
Nishi, Akihiro; Milner, Danny A; Giovannucci, Edward L.; Nishihara, Reiko; Tan, Andy S.; Kawachi, Ichiro; Ogino, Shuji
2015-01-01
Summary The precision medicine concept and the unique disease principle imply that each patient has unique pathogenic processes resulting from heterogeneous cellular genetic and epigenetic alterations, and interactions between cells (including immune cells) and exposures, including dietary, environmental, microbial, and lifestyle factors. As a core method field in population health science and medicine, epidemiology is a growing scientific discipline that can analyze disease risk factors, and develop statistical methodologies to maximize utilization of big data on populations and disease pathology. The evolving transdisciplinary field of molecular pathological epidemiology (MPE) can advance biomedical and health research by linking exposures to molecular pathologic signatures, enhancing causal inference, and identifying potential biomarkers for clinical impact. The MPE approach can be applied to any diseases, although it has been most commonly used in neoplastic diseases (including breast, lung and colorectal cancers) because of availability of various molecular diagnostic tests. However, use of state-of-the-art genomic, epigenomic and other omic technologies and expensive drugs in modern healthcare systems increases racial, ethnic and socioeconomic disparities. To address this, we propose to integrate molecular pathology, epidemiology, and social science. Social epidemiology integrates the latter two fields. The integrative social MPE model can embrace sociology, economics and precision medicine, address global health disparities and inequalities, and elucidate biological effects of social environments, behaviors, and networks. We foresee advancements of molecular medicine, including molecular diagnostics, biomedical imaging, and targeted therapeutics, which should benefit individuals in a global population, by means of an interdisciplinary approach of integrative MPE and social health science. PMID:26636627
Integration of molecular pathology, epidemiology and social science for global precision medicine.
Nishi, Akihiro; Milner, Danny A; Giovannucci, Edward L; Nishihara, Reiko; Tan, Andy S; Kawachi, Ichiro; Ogino, Shuji
2016-01-01
The precision medicine concept and the unique disease principle imply that each patient has unique pathogenic processes resulting from heterogeneous cellular genetic and epigenetic alterations and interactions between cells (including immune cells) and exposures, including dietary, environmental, microbial and lifestyle factors. As a core method field in population health science and medicine, epidemiology is a growing scientific discipline that can analyze disease risk factors and develop statistical methodologies to maximize utilization of big data on populations and disease pathology. The evolving transdisciplinary field of molecular pathological epidemiology (MPE) can advance biomedical and health research by linking exposures to molecular pathologic signatures, enhancing causal inference and identifying potential biomarkers for clinical impact. The MPE approach can be applied to any diseases, although it has been most commonly used in neoplastic diseases (including breast, lung and colorectal cancers) because of availability of various molecular diagnostic tests. However, use of state-of-the-art genomic, epigenomic and other omic technologies and expensive drugs in modern healthcare systems increases racial, ethnic and socioeconomic disparities. To address this, we propose to integrate molecular pathology, epidemiology and social science. Social epidemiology integrates the latter two fields. The integrative social MPE model can embrace sociology, economics and precision medicine, address global health disparities and inequalities, and elucidate biological effects of social environments, behaviors and networks. We foresee advancements of molecular medicine, including molecular diagnostics, biomedical imaging and targeted therapeutics, which should benefit individuals in a global population, by means of an interdisciplinary approach of integrative MPE and social health science.
Cotten, Cameron; Reed, Jennifer L
2013-01-30
Constraint-based modeling uses mass balances, flux capacity, and reaction directionality constraints to predict fluxes through metabolism. Although transcriptional regulation and thermodynamic constraints have been integrated into constraint-based modeling, kinetic rate laws have not been extensively used. In this study, an in vivo kinetic parameter estimation problem was formulated and solved using multi-omic data sets for Escherichia coli. To narrow the confidence intervals for kinetic parameters, a series of kinetic model simplifications were made, resulting in fewer kinetic parameters than the full kinetic model. These new parameter values are able to account for flux and concentration data from 20 different experimental conditions used in our training dataset. Concentration estimates from the simplified kinetic model were within one standard deviation for 92.7% of the 790 experimental measurements in the training set. Gibbs free energy changes of reaction were calculated to identify reactions that were often operating close to or far from equilibrium. In addition, enzymes whose activities were positively or negatively influenced by metabolite concentrations were also identified. The kinetic model was then used to calculate the maximum and minimum possible flux values for individual reactions from independent metabolite and enzyme concentration data that were not used to estimate parameter values. Incorporating these kinetically-derived flux limits into the constraint-based metabolic model improved predictions for uptake and secretion rates and intracellular fluxes in constraint-based models of central metabolism. This study has produced a method for in vivo kinetic parameter estimation and identified strategies and outcomes of kinetic model simplification. We also have illustrated how kinetic constraints can be used to improve constraint-based model predictions for intracellular fluxes and biomass yield and identify potential metabolic limitations through the integrated analysis of multi-omics datasets.
Grafström, Roland C; Nymark, Penny; Hongisto, Vesa; Spjuth, Ola; Ceder, Rebecca; Willighagen, Egon; Hardy, Barry; Kaski, Samuel; Kohonen, Pekka
2015-11-01
This paper outlines the work for which Roland Grafström and Pekka Kohonen were awarded the 2014 Lush Science Prize. The research activities of the Grafström laboratory have, for many years, covered cancer biology studies, as well as the development and application of toxicity-predictive in vitro models to determine chemical safety. Through the integration of in silico analyses of diverse types of genomics data (transcriptomic and proteomic), their efforts have proved to fit well into the recently-developed Adverse Outcome Pathway paradigm. Genomics analysis within state-of-the-art cancer biology research and Toxicology in the 21st Century concepts share many technological tools. A key category within the Three Rs paradigm is the Replacement of animals in toxicity testing with alternative methods, such as bioinformatics-driven analyses of data obtained from human cell cultures exposed to diverse toxicants. This work was recently expanded within the pan-European SEURAT-1 project (Safety Evaluation Ultimately Replacing Animal Testing), to replace repeat-dose toxicity testing with data-rich analyses of sophisticated cell culture models. The aims and objectives of the SEURAT project have been to guide the application, analysis, interpretation and storage of 'omics' technology-derived data within the service-oriented sub-project, ToxBank. Particularly addressing the Lush Science Prize focus on the relevance of toxicity pathways, a 'data warehouse' that is under continuous expansion, coupled with the development of novel data storage and management methods for toxicology, serve to address data integration across multiple 'omics' technologies. The prize winners' guiding principles and concepts for modern knowledge management of toxicological data are summarised. The translation of basic discovery results ranged from chemical-testing and material-testing data, to information relevant to human health and environmental safety. 2015 FRAME.
Kang, Tianyu; Ding, Wei; Zhang, Luoyan; Ziemek, Daniel; Zarringhalam, Kourosh
2017-12-19
Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database. In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets.
2013-01-01
Background Constraint-based modeling uses mass balances, flux capacity, and reaction directionality constraints to predict fluxes through metabolism. Although transcriptional regulation and thermodynamic constraints have been integrated into constraint-based modeling, kinetic rate laws have not been extensively used. Results In this study, an in vivo kinetic parameter estimation problem was formulated and solved using multi-omic data sets for Escherichia coli. To narrow the confidence intervals for kinetic parameters, a series of kinetic model simplifications were made, resulting in fewer kinetic parameters than the full kinetic model. These new parameter values are able to account for flux and concentration data from 20 different experimental conditions used in our training dataset. Concentration estimates from the simplified kinetic model were within one standard deviation for 92.7% of the 790 experimental measurements in the training set. Gibbs free energy changes of reaction were calculated to identify reactions that were often operating close to or far from equilibrium. In addition, enzymes whose activities were positively or negatively influenced by metabolite concentrations were also identified. The kinetic model was then used to calculate the maximum and minimum possible flux values for individual reactions from independent metabolite and enzyme concentration data that were not used to estimate parameter values. Incorporating these kinetically-derived flux limits into the constraint-based metabolic model improved predictions for uptake and secretion rates and intracellular fluxes in constraint-based models of central metabolism. Conclusions This study has produced a method for in vivo kinetic parameter estimation and identified strategies and outcomes of kinetic model simplification. We also have illustrated how kinetic constraints can be used to improve constraint-based model predictions for intracellular fluxes and biomass yield and identify potential metabolic limitations through the integrated analysis of multi-omics datasets. PMID:23360254
Moon, Myungjin; Nakai, Kenta
2018-04-01
Currently, cancer biomarker discovery is one of the important research topics worldwide. In particular, detecting significant genes related to cancer is an important task for early diagnosis and treatment of cancer. Conventional studies mostly focus on genes that are differentially expressed in different states of cancer; however, noise in gene expression datasets and insufficient information in limited datasets impede precise analysis of novel candidate biomarkers. In this study, we propose an integrative analysis of gene expression and DNA methylation using normalization and unsupervised feature extractions to identify candidate biomarkers of cancer using renal cell carcinoma RNA-seq datasets. Gene expression and DNA methylation datasets are normalized by Box-Cox transformation and integrated into a one-dimensional dataset that retains the major characteristics of the original datasets by unsupervised feature extraction methods, and differentially expressed genes are selected from the integrated dataset. Use of the integrated dataset demonstrated improved performance as compared with conventional approaches that utilize gene expression or DNA methylation datasets alone. Validation based on the literature showed that a considerable number of top-ranked genes from the integrated dataset have known relationships with cancer, implying that novel candidate biomarkers can also be acquired from the proposed analysis method. Furthermore, we expect that the proposed method can be expanded for applications involving various types of multi-omics datasets.
Framework for the quality assurance of 'omics technologies considering GLP requirements.
Kauffmann, Hans-Martin; Kamp, Hennicke; Fuchs, Regine; Chorley, Brian N; Deferme, Lize; Ebbels, Timothy; Hackermüller, Jörg; Perdichizzi, Stefania; Poole, Alan; Sauer, Ursula G; Tollefsen, Knut E; Tralau, Tewes; Yauk, Carole; van Ravenzwaay, Ben
2017-12-01
'Omics technologies are gaining importance to support regulatory toxicity studies. Prerequisites for performing 'omics studies considering GLP principles were discussed at the European Centre for Ecotoxicology and Toxicology of Chemicals (ECETOC) Workshop Applying 'omics technologies in Chemical Risk Assessment. A GLP environment comprises a standard operating procedure system, proper pre-planning and documentation, and inspections of independent quality assurance staff. To prevent uncontrolled data changes, the raw data obtained in the respective 'omics data recording systems have to be specifically defined. Further requirements include transparent and reproducible data processing steps, and safe data storage and archiving procedures. The software for data recording and processing should be validated, and data changes should be traceable or disabled. GLP-compliant quality assurance of 'omics technologies appears feasible for many GLP requirements. However, challenges include (i) defining, storing, and archiving the raw data; (ii) transparent descriptions of data processing steps; (iii) software validation; and (iv) ensuring complete reproducibility of final results with respect to raw data. Nevertheless, 'omics studies can be supported by quality measures (e.g., GLP principles) to ensure quality control, reproducibility and traceability of experiments. This enables regulators to use 'omics data in a fit-for-purpose context, which enhances their applicability for risk assessment. Copyright © 2017 Elsevier Inc. All rights reserved.
Lu, Yunlong; Wei, Liqin; Wang, Tai
2015-01-01
The development of sperm cells (SCs) from microspores involves a set of finely regulated molecular and cellular events and the coordination of these events. The mechanisms underlying these events and their interconnections remain a major challenge. Systems analysis of genome-wide molecular networks and functional modules with high-throughput "omics" approaches is crucial for understanding the mechanisms; however, this study is hindered because of the difficulty in isolating a large amount of cells of different types, especially generative cells (GCs), from the pollen. Here, we optimized the conditions of tomato pollen germination and pollen tube growth to allow for long-term growth of pollen tubes in vitro with SCs generated in the tube. Using this culture system, we developed methods for isolating GCs, SCs and vegetative cell nuclei (VN) from just-germinated tomato pollen grains and growing pollen tubes and their purification by Percoll density gradient centrifugation. The purity and viability of isolated GCs and SCs were confirmed by microscopy examination and fluorescein diacetate staining, respectively, and the integrity of VN was confirmed by propidium iodide staining. We could obtain about 1.5 million GCs and 2.0 million SCs each from 180 mg initiated pollen grains, and 10 million VN from 270 mg initiated pollen grains germinated in vitro in each experiment. These methods provide the necessary preconditions for systematic biology studies of SC development and differentiation in higher plants.
Schmidt, Michael A; Goodwin, Thomas J; Pelligra, Ralph
The next major steps in human spaceflight include flyby, orbital, and landing missions to the Moon, Mars, and near earth asteroids. The first crewed deep space mission is expected to launch in 2022, which affords less than 7 years to address the complex question of whether and how to apply artificial gravity to counter the effects of prolonged weightlessness. Various phenotypic changes are demonstrated during artificial gravity experiments. However, the molecular dynamics (genotype and molecular phenotypes) that underlie these morphological, physiological, and behavioral phenotypes are far more complex than previously understood. Thus, targeted molecular assessment of subjects under various G conditions can be expected to miss important patterns of molecular variance that inform the more general phenotypes typically being measured. Use of omics methods can help detect changes across broad molecular networks, as various G-loading paradigms are applied. This will be useful in detecting off-target, or unanticipated effects of the different gravity paradigms applied to humans or animals. Insights gained from these approaches may eventually be used to inform countermeasure development or refine the deployment of existing countermeasures. This convergence of the omics and artificial gravity research communities may be critical if we are to develop the proper artificial gravity solutions under the severely compressed timelines currently established. Thus, the omics community may offer a unique ability to accelerate discovery, provide new insights, and benefit deep space missions in ways that have not been previously considered.
Integrating cell biology and proteomic approaches in plants.
Takáč, Tomáš; Šamajová, Olga; Šamaj, Jozef
2017-10-03
Significant improvements of protein extraction, separation, mass spectrometry and bioinformatics nurtured advancements of proteomics during the past years. The usefulness of proteomics in the investigation of biological problems can be enhanced by integration with other experimental methods from cell biology, genetics, biochemistry, pharmacology, molecular biology and other omics approaches including transcriptomics and metabolomics. This review aims to summarize current trends integrating cell biology and proteomics in plant science. Cell biology approaches are most frequently used in proteomic studies investigating subcellular and developmental proteomes, however, they were also employed in proteomic studies exploring abiotic and biotic stress responses, vesicular transport, cytoskeleton and protein posttranslational modifications. They are used either for detailed cellular or ultrastructural characterization of the object subjected to proteomic study, validation of proteomic results or to expand proteomic data. In this respect, a broad spectrum of methods is employed to support proteomic studies including ultrastructural electron microscopy studies, histochemical staining, immunochemical localization, in vivo imaging of fluorescently tagged proteins and visualization of protein-protein interactions. Thus, cell biological observations on fixed or living cell compartments, cells, tissues and organs are feasible, and in some cases fundamental for the validation and complementation of proteomic data. Validation of proteomic data by independent experimental methods requires development of new complementary approaches. Benefits of cell biology methods and techniques are not sufficiently highlighted in current proteomic studies. This encouraged us to review most popular cell biology methods used in proteomic studies and to evaluate their relevance and potential for proteomic data validation and enrichment of purely proteomic analyses. We also provide examples of representative studies combining proteomic and cell biology methods for various purposes. Integrating cell biology approaches with proteomic ones allow validation and better interpretation of proteomic data. Moreover, cell biology methods remarkably extend the knowledge provided by proteomic studies and might be fundamental for the functional complementation of proteomic data. This review article summarizes current literature linking proteomics with cell biology. Copyright © 2017 Elsevier B.V. All rights reserved.
Emerging molecular phenotypes of asthma
Ray, Anuradha; Oriss, Timothy B.
2014-01-01
Although asthma has long been considered a heterogeneous disease, attempts to define subgroups of asthma have been limited. In recent years, both clinical and statistical approaches have been utilized to better merge clinical characteristics, biology, and genetics. These combined characteristics have been used to define phenotypes of asthma, the observable characteristics of a patient determined by the interaction of genes and environment. Identification of consistent clinical phenotypes has now been reported across studies. Now the addition of various 'omics and identification of specific molecular pathways have moved the concept of clinical phenotypes toward the concept of molecular phenotypes. The importance of these molecular phenotypes is being confirmed through the integration of molecularly targeted biological therapies. Thus the global term asthma is poised to become obsolete, being replaced by terms that more specifically identify the pathology associated with the disease. PMID:25326577
The High-Throughput Analyses Era: Are We Ready for the Data Struggle?
D'Argenio, Valeria
2018-03-02
Recent and rapid technological advances in molecular sciences have dramatically increased the ability to carry out high-throughput studies characterized by big data production. This, in turn, led to the consequent negative effect of highlighting the presence of a gap between data yield and their analysis. Indeed, big data management is becoming an increasingly important aspect of many fields of molecular research including the study of human diseases. Now, the challenge is to identify, within the huge amount of data obtained, that which is of clinical relevance. In this context, issues related to data interpretation, sharing and storage need to be assessed and standardized. Once this is achieved, the integration of data from different -omic approaches will improve the diagnosis, monitoring and therapy of diseases by allowing the identification of novel, potentially actionably biomarkers in view of personalized medicine.
Innovative Tools and Technology for Analysis of Single Cells and Cell-Cell Interaction.
Konry, Tania; Sarkar, Saheli; Sabhachandani, Pooja; Cohen, Noa
2016-07-11
Heterogeneity in single-cell responses and intercellular interactions results from complex regulation of cell-intrinsic and environmental factors. Single-cell analysis allows not only detection of individual cellular characteristics but also correlation of genetic content with phenotypic traits in the same cell. Technological advances in micro- and nanofabrication have benefited single-cell analysis by allowing precise control of the localized microenvironment, cell manipulation, and sensitive detection capabilities. Additionally, microscale techniques permit rapid, high-throughput, multiparametric screening that has become essential for -omics research. This review highlights innovative applications of microscale platforms in genetic, proteomic, and metabolic detection in single cells; cell sorting strategies; and heterotypic cell-cell interaction. We discuss key design aspects of single-cell localization and isolation in microfluidic systems, dynamic and endpoint analyses, and approaches that integrate highly multiplexed detection of various intracellular species.
Pathak, Rajesh Kumar; Gupta, Sanjay Mohan; Gaur, Vikram Singh; Pandey, Dinesh
2015-01-01
Abstract In recent years, rapid developments in several omics platforms and next generation sequencing technology have generated a huge amount of biological data about plants. Systems biology aims to develop and use well-organized and efficient algorithms, data structure, visualization, and communication tools for the integration of these biological data with the goal of computational modeling and simulation. It studies crop plant systems by systematically perturbing them, checking the gene, protein, and informational pathway responses; integrating these data; and finally, formulating mathematical models that describe the structure of system and its response to individual perturbations. Consequently, systems biology approaches, such as integrative and predictive ones, hold immense potential in understanding of molecular mechanism of agriculturally important complex traits linked to agricultural productivity. This has led to identification of some key genes and proteins involved in networks of pathways involved in input use efficiency, biotic and abiotic stress resistance, photosynthesis efficiency, root, stem and leaf architecture, and nutrient mobilization. The developments in the above fields have made it possible to design smart crops with superior agronomic traits through genetic manipulation of key candidate genes. PMID:26484978
Vivek-Ananth, R P; Samal, Areejit
2016-09-01
A major goal of systems biology is to build predictive computational models of cellular metabolism. Availability of complete genome sequences and wealth of legacy biochemical information has led to the reconstruction of genome-scale metabolic networks in the last 15 years for several organisms across the three domains of life. Due to paucity of information on kinetic parameters associated with metabolic reactions, the constraint-based modelling approach, flux balance analysis (FBA), has proved to be a vital alternative to investigate the capabilities of reconstructed metabolic networks. In parallel, advent of high-throughput technologies has led to the generation of massive amounts of omics data on transcriptional regulation comprising mRNA transcript levels and genome-wide binding profile of transcriptional regulators. A frontier area in metabolic systems biology has been the development of methods to integrate the available transcriptional regulatory information into constraint-based models of reconstructed metabolic networks in order to increase the predictive capabilities of computational models and understand the regulation of cellular metabolism. Here, we review the existing methods to integrate transcriptional regulatory information into constraint-based models of metabolic networks. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
A community proposal to integrate proteomics activities in ELIXIR.
Vizcaíno, Juan Antonio; Walzer, Mathias; Jiménez, Rafael C; Bittremieux, Wout; Bouyssié, David; Carapito, Christine; Corrales, Fernando; Ferro, Myriam; Heck, Albert J R; Horvatovich, Peter; Hubalek, Martin; Lane, Lydie; Laukens, Kris; Levander, Fredrik; Lisacek, Frederique; Novak, Petr; Palmblad, Magnus; Piovesan, Damiano; Pühler, Alfred; Schwämmle, Veit; Valkenborg, Dirk; van Rijswijk, Merlijn; Vondrasek, Jiri; Eisenacher, Martin; Martens, Lennart; Kohlbacher, Oliver
2017-01-01
Computational approaches have been major drivers behind the progress of proteomics in recent years. The aim of this white paper is to provide a framework for integrating computational proteomics into ELIXIR in the near future, and thus to broaden the portfolio of omics technologies supported by this European distributed infrastructure. This white paper is the direct result of a strategy meeting on 'The Future of Proteomics in ELIXIR' that took place in March 2017 in Tübingen (Germany), and involved representatives of eleven ELIXIR nodes. These discussions led to a list of priority areas in computational proteomics that would complement existing activities and close gaps in the portfolio of tools and services offered by ELIXIR so far. We provide some suggestions on how these activities could be integrated into ELIXIR's existing platforms, and how it could lead to a new ELIXIR use case in proteomics. We also highlight connections to the related field of metabolomics, where similar activities are ongoing. This white paper could thus serve as a starting point for the integration of computational proteomics into ELIXIR. Over the next few months we will be working closely with all stakeholders involved, and in particular with other representatives of the proteomics community, to further refine this paper.
A community proposal to integrate proteomics activities in ELIXIR
Vizcaíno, Juan Antonio; Walzer, Mathias; Jiménez, Rafael C.; Bittremieux, Wout; Bouyssié, David; Carapito, Christine; Corrales, Fernando; Ferro, Myriam; Heck, Albert J.R.; Horvatovich, Peter; Hubalek, Martin; Lane, Lydie; Laukens, Kris; Levander, Fredrik; Lisacek, Frederique; Novak, Petr; Palmblad, Magnus; Piovesan, Damiano; Pühler, Alfred; Schwämmle, Veit; Valkenborg, Dirk; van Rijswijk, Merlijn; Vondrasek, Jiri; Eisenacher, Martin; Martens, Lennart; Kohlbacher, Oliver
2017-01-01
Computational approaches have been major drivers behind the progress of proteomics in recent years. The aim of this white paper is to provide a framework for integrating computational proteomics into ELIXIR in the near future, and thus to broaden the portfolio of omics technologies supported by this European distributed infrastructure. This white paper is the direct result of a strategy meeting on ‘The Future of Proteomics in ELIXIR’ that took place in March 2017 in Tübingen (Germany), and involved representatives of eleven ELIXIR nodes. These discussions led to a list of priority areas in computational proteomics that would complement existing activities and close gaps in the portfolio of tools and services offered by ELIXIR so far. We provide some suggestions on how these activities could be integrated into ELIXIR’s existing platforms, and how it could lead to a new ELIXIR use case in proteomics. We also highlight connections to the related field of metabolomics, where similar activities are ongoing. This white paper could thus serve as a starting point for the integration of computational proteomics into ELIXIR. Over the next few months we will be working closely with all stakeholders involved, and in particular with other representatives of the proteomics community, to further refine this paper. PMID:28713550
The European Bioinformatics Institute in 2017: data coordination and integration
Cochrane, Guy; Apweiler, Rolf; Birney, Ewan
2018-01-01
Abstract The European Bioinformatics Institute (EMBL-EBI) supports life-science research throughout the world by providing open data, open-source software and analytical tools, and technical infrastructure (https://www.ebi.ac.uk). We accommodate an increasingly diverse range of data types and integrate them, so that biologists in all disciplines can explore life in ever-increasing detail. We maintain over 40 data resources, many of which are run collaboratively with partners in 16 countries (https://www.ebi.ac.uk/services). Submissions continue to increase exponentially: our data storage has doubled in less than two years to 120 petabytes. Recent advances in cellular imaging and single-cell sequencing techniques are generating a vast amount of high-dimensional data, bringing to light new cell types and new perspectives on anatomy. Accordingly, one of our main focus areas is integrating high-quality information from bioimaging, biobanking and other types of molecular data. This is reflected in our deep involvement in Open Targets, stewarding of plant phenotyping standards (MIAPPE) and partnership in the Human Cell Atlas data coordination platform, as well as the 2017 launch of the Omics Discovery Index. This update gives a birds-eye view of EMBL-EBI’s approach to data integration and service development as genomics begins to enter the clinic. PMID:29186510
An Integrated Omics Analysis: Impact of Microgravity on Host Response to Lipopolysaccharide in vitro
2014-08-07
on host response to lipopolysaccharide in vitro 5a. CONTRACT NUMBER 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S) Nabarun Chakraborty...49. Mateescu B, Batista L, Cardon M, Gruosso T, de Feraudy Y, Mariani O, Nicolas A, Meyniel JP, Cottu P, Sastre Garau X, Mechta Grigoriou F: miR 141
Big Data Transforms Discovery-Utilization Therapeutics Continuum
Waldman, SA; Terzic, A
2015-01-01
Enabling omic technologies adopt a holistic view to produce unprecedented insights into the molecular underpinnings of health and disease, in part, by generating massive high-dimensional biological data. Leveraging these systems-level insights as an engine driving the healthcare evolution is maximized through integration with medical, demographic, and environmental datasets from individuals to populations. Big data analytics has accordingly emerged to add value to the technical aspects of storage, transfer, and analysis required for merging vast arrays of omic-, clinical- and eco-datasets. In turn, this new field at the interface of biology, medicine, and information science is systematically transforming modern therapeutics across discovery, development, regulation, and utilization. “…a man's discourse was like to a rich Persian carpet, the beautiful figures and patterns of which can be shown only by spreading and extending it out; when it is contracted and folded up, they are obscured and lost” Themistocles quoted by Plutarch AD 46 – AD 120 PMID:26888297
UniVIO: A Multiple Omics Database with Hormonome and Transcriptome Data from Rice
Sakurai, Tetsuya; Sakakibara, Hitoshi
2013-01-01
Plant hormones play important roles as signaling molecules in the regulation of growth and development by controlling the expression of downstream genes. Since the hormone signaling system represents a complex network involving functional cross-talk through the mutual regulation of signaling and metabolism, a comprehensive and integrative analysis of plant hormone concentrations and gene expression is important for a deeper understanding of hormone actions. We have developed a database named Uniformed Viewer for Integrated Omics (UniVIO: http://univio.psc.riken.jp/), which displays hormone-metabolome (hormonome) and transcriptome data in a single formatted (uniformed) heat map. At the present time, hormonome and transcriptome data obtained from 14 organ parts of rice plants at the reproductive stage and seedling shoots of three gibberellin signaling mutants are included in the database. The hormone concentration and gene expression data can be searched by substance name, probe ID, gene locus ID or gene description. A correlation search function has been implemented to enable users to obtain information of correlated substance accumulation and gene expression. In the correlation search, calculation method, range of correlation coefficient and plant samples can be selected freely. PMID:23314752
Omics for Precious Rare Biosamples: Characterization of Ancient Human Hair by a Proteomic Approach.
Fresnais, Margaux; Richardin, Pascale; Sepúlveda, Marcela; Leize-Wagner, Emmanuelle; Charrié-Duhaut, Armelle
2017-07-01
Omics technologies have far-reaching applications beyond clinical medicine. A case in point is the analysis of ancient hair samples. Indeed, hair is an important biological indicator that has become a material of choice in archeometry to study the ancient civilizations and their environment. Current characterization of ancient hair is based on elemental and structural analyses, but only few studies have focused on the molecular aspects of ancient hair proteins-keratins-and their conservation state. In such cases, applied extraction protocols require large amounts of raw hair, from 30 to 100 mg. In the present study, we report an optimized new proteomic approach to accurately identify archeological hair proteins, and assess their preservation state, while using a minimum of raw material. Testing and adaptation of three protocols and of nano liquid chromatography-tandem mass spectrometry (nanoLC-MS/MS) parameters were performed on modern hair. On the basis of mass spectrometry data quality, and of the required initial sample amount, the most promising workflow was selected and applied to an ancient archeological sample, dated to about 3880 years before present. Finally, and importantly, we were able to identify 11 ancient hair proteins and to visualize the preservation state of mummy's hair from only 500 μg of raw material. The results presented here pave the way for new insights into the understanding of hair protein alteration processes such as those due to aging and ecological exposures. This work could enable omics scientists to apply a proteomic approach to precious and rare samples, not only in the context of archeometrical studies but also for future applications that would require the use of very small amounts of sample.
aGEM: an integrative system for analyzing spatial-temporal gene-expression information
Jiménez-Lozano, Natalia; Segura, Joan; Macías, José Ramón; Vega, Juanjo; Carazo, José María
2009-01-01
Motivation: The work presented here describes the ‘anatomical Gene-Expression Mapping (aGEM)’ Platform, a development conceived to integrate phenotypic information with the spatial and temporal distributions of genes expressed in the mouse. The aGEM Platform has been built by extending the Distributed Annotation System (DAS) protocol, which was originally designed to share genome annotations over the WWW. DAS is a client-server system in which a single client integrates information from multiple distributed servers. Results: The aGEM Platform provides information to answer three main questions. (i) Which genes are expressed in a given mouse anatomical component? (ii) In which mouse anatomical structures are a given gene or set of genes expressed? And (iii) is there any correlation among these findings? Currently, this Platform includes several well-known mouse resources (EMAGE, GXD and GENSAT), hosting gene-expression data mostly obtained from in situ techniques together with a broad set of image-derived annotations. Availability: The Platform is optimized for Firefox 3.0 and it is accessed through a friendly and intuitive display: http://agem.cnb.csic.es Contact: natalia@cnb.csic.es Supplementary information: Supplementary data are available at http://bioweb.cnb.csic.es/VisualOmics/aGEM/home.html and http://bioweb.cnb.csic.es/VisualOmics/index_VO.html and Bioinformatics online. PMID:19592395
Environmental OMICS: Current Status and Future Directions.
Objectives: Applications of OMICS to high throughput studies of changes of genes, RNAs, proteins and metabolites, and their associated functions in cells or organisms exposed to environmental chemicals has led to the emergence of a very active research field: environmental OMICS....
NASA Astrophysics Data System (ADS)
Großerueschkamp, Frederik; Bracht, Thilo; Diehl, Hanna C.; Kuepper, Claus; Ahrens, Maike; Kallenbach-Thieltges, Angela; Mosig, Axel; Eisenacher, Martin; Marcus, Katrin; Behrens, Thomas; Brüning, Thomas; Theegarten, Dirk; Sitek, Barbara; Gerwert, Klaus
2017-03-01
Diffuse malignant mesothelioma (DMM) is a heterogeneous malignant neoplasia manifesting with three subtypes: epithelioid, sarcomatoid and biphasic. DMM exhibit a high degree of spatial heterogeneity that complicates a thorough understanding of the underlying different molecular processes in each subtype. We present a novel approach to spatially resolve the heterogeneity of a tumour in a label-free manner by integrating FTIR imaging and laser capture microdissection (LCM). Subsequent proteome analysis of the dissected homogenous samples provides in addition molecular resolution. FTIR imaging resolves tumour subtypes within tissue thin-sections in an automated and label-free manner with accuracy of about 85% for DMM subtypes. Even in highly heterogeneous tissue structures, our label-free approach can identify small regions of interest, which can be dissected as homogeneous samples using LCM. Subsequent proteome analysis provides a location specific molecular characterization. Applied to DMM subtypes, we identify 142 differentially expressed proteins, including five protein biomarkers commonly used in DMM immunohistochemistry panels. Thus, FTIR imaging resolves not only morphological alteration within tissue but it resolves even alterations at the level of single proteins in tumour subtypes. Our fully automated workflow FTIR-guided LCM opens new avenues collecting homogeneous samples for precise and predictive biomarkers from omics studies.
Großerueschkamp, Frederik; Bracht, Thilo; Diehl, Hanna C; Kuepper, Claus; Ahrens, Maike; Kallenbach-Thieltges, Angela; Mosig, Axel; Eisenacher, Martin; Marcus, Katrin; Behrens, Thomas; Brüning, Thomas; Theegarten, Dirk; Sitek, Barbara; Gerwert, Klaus
2017-03-30
Diffuse malignant mesothelioma (DMM) is a heterogeneous malignant neoplasia manifesting with three subtypes: epithelioid, sarcomatoid and biphasic. DMM exhibit a high degree of spatial heterogeneity that complicates a thorough understanding of the underlying different molecular processes in each subtype. We present a novel approach to spatially resolve the heterogeneity of a tumour in a label-free manner by integrating FTIR imaging and laser capture microdissection (LCM). Subsequent proteome analysis of the dissected homogenous samples provides in addition molecular resolution. FTIR imaging resolves tumour subtypes within tissue thin-sections in an automated and label-free manner with accuracy of about 85% for DMM subtypes. Even in highly heterogeneous tissue structures, our label-free approach can identify small regions of interest, which can be dissected as homogeneous samples using LCM. Subsequent proteome analysis provides a location specific molecular characterization. Applied to DMM subtypes, we identify 142 differentially expressed proteins, including five protein biomarkers commonly used in DMM immunohistochemistry panels. Thus, FTIR imaging resolves not only morphological alteration within tissue but it resolves even alterations at the level of single proteins in tumour subtypes. Our fully automated workflow FTIR-guided LCM opens new avenues collecting homogeneous samples for precise and predictive biomarkers from omics studies.
Newt-omics: a comprehensive repository for omics data from the newt Notophthalmus viridescens
Bruckskotten, Marc; Looso, Mario; Reinhardt, Richard; Braun, Thomas; Borchardt, Thilo
2012-01-01
Notophthalmus viridescens, a member of the salamander family is an excellent model organism to study regenerative processes due to its unique ability to replace lost appendages and to repair internal organs. Molecular insights into regenerative events have been severely hampered by the lack of genomic, transcriptomic and proteomic data, as well as an appropriate database to store such novel information. Here, we describe ‘Newt-omics’ (http://newt-omics.mpi-bn.mpg.de), a database, which enables researchers to locate, retrieve and store data sets dedicated to the molecular characterization of newts. Newt-omics is a transcript-centred database, based on an Expressed Sequence Tag (EST) data set from the newt, covering ∼50 000 Sanger sequenced transcripts and a set of high-density microarray data, generated from regenerating hearts. Newt-omics also contains a large set of peptides identified by mass spectrometry, which was used to validate 13 810 ESTs as true protein coding. Newt-omics is open to implement additional high-throughput data sets without changing the database structure. Via a user-friendly interface Newt-omics allows access to a huge set of molecular data without the need for prior bioinformatical expertise. PMID:22039101
NASA Technical Reports Server (NTRS)
Reinsch, S. S.; Galazka, J..; Berrios, D. C; Chakravarty, K.; Fogle, H.; Lai, S.; Bokyo, V.; Timucin, L. R.; Tran, P.; Skidmore, M.
2016-01-01
NASA's mission includes expanding our understanding of biological systems to improve life on Earth and to enable long-duration human exploration of space. The GeneLab Data System (GLDS) is NASA's premier open-access omics data platform for biological experiments. GLDS houses standards-compliant, high-throughput sequencing and other omics data from spaceflight-relevant experiments. The GeneLab project at NASA-Ames Research Center is developing the database, and also partnering with spaceflight projects through sharing or augmentation of experiment samples to expand omics analyses on precious spaceflight samples. The partnerships ensure that the maximum amount of data is garnered from spaceflight experiments and made publically available as rapidly as possible via the GLDS. GLDS Version 1.0, went online in April 2015. Software updates and new data releases occur at least quarterly. As of October 2016, the GLDS contains 80 datasets and has search and download capabilities. Version 2.0 is slated for release in September of 2017 and will have expanded, integrated search capabilities leveraging other public omics databases (NCBI GEO, PRIDE, MG-RAST). Future versions in this multi-phase project will provide a collaborative platform for omics data analysis. Data from experiments that explore the biological effects of the spaceflight environment on a wide variety of model organisms are housed in the GLDS including data from rodents, invertebrates, plants and microbes. Human datasets are currently limited to those with anonymized data (e.g., from cultured cell lines). GeneLab ensures prompt release and open access to high-throughput genomics, transcriptomics, proteomics, and metabolomics data from spaceflight and ground-based simulations of microgravity, radiation or other space environment factors. The data are meticulously curated to assure that accurate experimental and sample processing metadata are included with each data set. GLDS download volumes indicate strong interest of the scientific community in these data. To date GeneLab has partnered with multiple experiments including two plant (Arabidopsis thaliana) experiments, two mice experiments, and several microbe experiments. GeneLab optimized protocols in the rodent partnerships for maximum yield of RNA, DNA and protein from tissues harvested and preserved during the SpaceX-4 mission, as well as from tissues from mice that were frozen intact during spaceflight and later dissected on the ground. Analysis of GeneLab data will contribute fundamental knowledge of how the space environment affects biological systems, and as well as yield terrestrial benefits resulting from mitigation strategies to prevent effects observed during exposure to space environments.
Evaluation of hierarchical models for integrative genomic analyses.
Denis, Marie; Tadesse, Mahlet G
2016-03-01
Advances in high-throughput technologies have led to the acquisition of various types of -omic data on the same biological samples. Each data type gives independent and complementary information that can explain the biological mechanisms of interest. While several studies performing independent analyses of each dataset have led to significant results, a better understanding of complex biological mechanisms requires an integrative analysis of different sources of data. Flexible modeling approaches, based on penalized likelihood methods and expectation-maximization (EM) algorithms, are studied and tested under various biological relationship scenarios between the different molecular features and their effects on a clinical outcome. The models are applied to genomic datasets from two cancer types in the Cancer Genome Atlas project: glioblastoma multiforme and ovarian serous cystadenocarcinoma. The integrative models lead to improved model fit and predictive performance. They also provide a better understanding of the biological mechanisms underlying patients' survival. Source code implementing the integrative models is freely available at https://github.com/mgt000/IntegrativeAnalysis along with example datasets and sample R script applying the models to these data. The TCGA datasets used for analysis are publicly available at https://tcga-data.nci.nih.gov/tcga/tcgaDownload.jsp marie.denis@cirad.fr or mgt26@georgetown.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schaumberg, Andrew
The Omics Tools package provides several small trivial tools for work in genomics. This single portable package, the omics.jar file, is a toolbox that works in any Java-based environment, including PCs, Macs, and supercomputers. The number of tools is expected to grow. One tool (called cmsearch.hadoop or cmsearch.local), calls the external cmsearch program to predict non-coding RNA in a genome. The cmsearch program is part of the third-party Infernal package. Omics Tools does not contain Infernal. Infernal may be installed separately. The cmsearch.hadoop subtool requires Apache Hadoop and runs on a supercomputer, though cmsearch.local does not and runs on amore » server. Omics Tools does not contain Hadoop. Hadoop mat be installed separartely The other tools (cmgbk, cmgff, fastats, pal, randgrp, randgrpr, randsub) do not interface with third-party tools. Omics Tools is written in Java and Scala programming languages. Invoking the help command shows currently available tools, as shown below: schaumbe@gpint06:~/proj/omics$ java -jar omics.jar help Known commands are: cmgbk : compare cmsearch and GenBank Infernal hits cmgff : compare hits among two GFF (version 3) files cmsearch.hadoop : find Infernal hits in a genome, on your supercomputer cmsearch.local : find Infernal hits in a genome, on your workstation fastats : FASTA stats, e.g. # bases, GC content pal : stem-loop motif detection by palindromic sequence search (code stub) randgrp : random subsample without replacement, of groups randgrpr : random subsample with replacement, of groups (fast) randsub : random subsample without replacement, of file lines For more help regarding a particular command, use: java -jar omics.jar command help Usage: java -jar omics.jar command args« less
Semantic Data Integration and Knowledge Management to Represent Biological Network Associations.
Losko, Sascha; Heumann, Klaus
2017-01-01
The vast quantities of information generated by academic and industrial research groups are reflected in a rapidly growing body of scientific literature and exponentially expanding resources of formalized data, including experimental data, originating from a multitude of "-omics" platforms, phenotype information, and clinical data. For bioinformatics, the challenge remains to structure this information so that scientists can identify relevant information, to integrate this information as specific "knowledge bases," and to formalize this knowledge across multiple scientific domains to facilitate hypothesis generation and validation. Here we report on progress made in building a generic knowledge management environment capable of representing and mining both explicit and implicit knowledge and, thus, generating new knowledge. Risk management in drug discovery and clinical research is used as a typical example to illustrate this approach. In this chapter we introduce techniques and concepts (such as ontologies, semantic objects, typed relationships, contexts, graphs, and information layers) that are used to represent complex biomedical networks. The BioXM™ Knowledge Management Environment is used as an example to demonstrate how a domain such as oncology is represented and how this representation is utilized for research.
Target-Pathogen: a structural bioinformatic approach to prioritize drug targets in pathogens
Sosa, Ezequiel J; Burguener, Germán; Lanzarotti, Esteban; Radusky, Leandro; Pardo, Agustín M; Marti, Marcelo
2018-01-01
Abstract Available genomic data for pathogens has created new opportunities for drug discovery and development to fight them, including new resistant and multiresistant strains. In particular structural data must be integrated with both, gene information and experimental results. In this sense, there is a lack of an online resource that allows genome wide-based data consolidation from diverse sources together with thorough bioinformatic analysis that allows easy filtering and scoring for fast target selection for drug discovery. Here, we present Target-Pathogen database (http://target.sbg.qb.fcen.uba.ar/patho), designed and developed as an online resource that allows the integration and weighting of protein information such as: function, metabolic role, off-targeting, structural properties including druggability, essentiality and omic experiments, to facilitate the identification and prioritization of candidate drug targets in pathogens. We include in the database 10 genomes of some of the most relevant microorganisms for human health (Mycobacterium tuberculosis, Mycobacterium leprae, Klebsiella pneumoniae, Plasmodium vivax, Toxoplasma gondii, Leishmania major, Wolbachia bancrofti, Trypanosoma brucei, Shigella dysenteriae and Schistosoma Smanosoni) and show its applicability. New genomes can be uploaded upon request. PMID:29106651
Durmaz, Arda; Henderson, Tim A D; Brubaker, Douglas; Bebek, Gurkan
2017-01-01
Large scale genomics studies have generated comprehensive molecular characterization of numerous cancer types. Subtypes for many tumor types have been established; however, these classifications are based on molecular characteristics of a small gene sets with limited power to detect dysregulation at the patient level. We hypothesize that frequent graph mining of pathways to gather pathways functionally relevant to tumors can characterize tumor types and provide opportunities for personalized therapies. In this study we present an integrative omics approach to group patients based on their altered pathway characteristics and show prognostic differences within breast cancer (p < 9:57E - 10) and glioblastoma multiforme (p < 0:05) patients. We were able validate this approach in secondary RNA-Seq datasets with p < 0:05 and p < 0:01 respectively. We also performed pathway enrichment analysis to further investigate the biological relevance of dysregulated pathways. We compared our approach with network-based classifier algorithms and showed that our unsupervised approach generates more robust and biologically relevant clustering whereas previous approaches failed to report specific functions for similar patient groups or classify patients into prognostic groups. These results could serve as a means to improve prognosis for future cancer patients, and to provide opportunities for improved treatment options and personalized interventions. The proposed novel graph mining approach is able to integrate PPI networks with gene expression in a biologically sound approach and cluster patients in to clinically distinct groups. We have utilized breast cancer and glioblastoma multiforme datasets from microarray and RNA-Seq platforms and identified disease mechanisms differentiating samples. Supplementary methods, figures, tables and code are available at https://github.com/bebeklab/dysprog.
Paganelli, Matteo; De Palma, Giuseppe; Apostoli, Pietro
2017-11-01
As Genomics aims at the collective characterization and quantification of genes, exposomics refers to the totality of lifetime environmental exposures, consisting in a novel approach to studying the role of the environment in human disease. The aim is to assess all human environmental and occupational exposures in order to better understand their contribution to human diseases. The "omics" revolution infact mostly regards the underlying method: scientific knowledge is expected to come from the analysis of increasingly extensive databases. The primary focus is on air pollution and water contaminants, but all the determinants of human exposure are conceptually part of the idea of exposome, including physical and psychological factors. Using 'omic' techniques the collected exposure data can be linked to biochemical and molecular changes in our body. Since the first formulation of the idea itself of Exposome many efforts have been made to translate the concept into research, in particular two important studies have been started in Europe. We herein suggest that Occupational Medicine could be a precious contributor to the growth of exposure science also in its omic side thanks to the methods and to the knowledges part of our background. Copyright© by Aracne Editrice, Roma, Italy.
A Multi-Omic View of Host-Pathogen-Commensal Interplay in Salmonella-Mediated Intestinal Infection
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaiser, Brooke LD; Li, Jie; Sanford, James A.
The potential for commensal microorganisms indigenous to a host (the ‘microbiome’ or ‘microbiota’) to alter infection outcome by influencing host-pathogen interplay is largely unknown. We used a multi-omics “systems” approach, incorporating proteomics, metabolomics, glycomics, and metagenomics, to explore the molecular interplay between the murine host, the pathogen Salmonella enterica serovar Typhimurium (S. Typhimurium), and commensal gut microorganisms during intestinal infection with S. Typhimurium. We find proteomic evidence that S. Typhimurium thrives within the infected 129/SvJ mouse gut without antibiotic pre-treatment, inducing inflammation and disrupting the intestinal microbiome (e.g., suppressing Bacteroidetes and Firmicutes while promoting growth of Salmonella and Enterococcus). Alterationmore » of the host microbiome population structure was highly correlated with gut environmental changes, including the accumulation of metabolites normally consumed by commensal microbiota. Finally, the less characterized phase of S. Typhimurium’s lifecycle was investigated, and both proteomic and glycomic evidence suggests S. Typhimurium may take advantage of increased fucose moieties to metabolize fucose while growing in the gut. The application of multiple omics measurements to Salmonella-induced intestinal inflammation provides insights into complex molecular strategies employed during pathogenesis between host, pathogen, and the microbiome.« less
Cuenca, María del Sol; Roca, Amalia; Molina-Santiago, Carlos; Duque, Estrella; Armengaud, Jean; Gómez-Garcia, María R; Ramos, Juan L
2016-01-01
Pseudomonas putida BIRD-1 has the potential to be used for the industrial production of butanol due to its solvent tolerance and ability to metabolize low-cost compounds. However, the strain has two major limitations: it assimilates butanol as sole carbon source and butanol concentrations above 1% (v/v) are toxic. With the aim of facilitating BIRD-1 strain design for industrial use, a genome-wide mini-Tn5 transposon mutant library was screened for clones exhibiting increased butanol sensitivity or deficiency in butanol assimilation. Twenty-one mutants were selected that were affected in one or both of the processes. These mutants exhibited insertions in various genes, including those involved in the TCA cycle, fatty acid metabolism, transcription, cofactor synthesis and membrane integrity. An omics-based analysis revealed key genes involved in the butanol response. Transcriptomic and proteomic studies were carried out to compare short and long-term tolerance and assimilation traits. Pseudomonas putida initiates various butanol assimilation pathways via alcohol and aldehyde dehydrogenases that channel the compound to central metabolism through the glyoxylate shunt pathway. Accordingly, isocitrate lyase - a key enzyme of the pathway - was the most abundant protein when butanol was used as the sole carbon source. Upregulation of two genes encoding proteins PPUBIRD1_2240 and PPUBIRD1_2241 (acyl-CoA dehydrogenase and acyl-CoA synthetase respectively) linked butanol assimilation with acyl-CoA metabolism. Butanol tolerance was found to be primarily linked to classic solvent defense mechanisms, such as efflux pumps, membrane modifications and control of redox state. Our results also highlight the intensive energy requirements for butanol production and tolerance; thus, enhancing TCA cycle operation may represent a promising strategy for enhanced butanol production. © 2015 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Montastier, Emilie; Villa-Vialaneix, Nathalie; Caspar-Bauguil, Sylvie; Hlavaty, Petr; Tvrzicka, Eva; Gonzalez, Ignacio; Saris, Wim H. M.; Langin, Dominique; Kunesova, Marie; Viguerie, Nathalie
2015-01-01
Nutrigenomics investigates relationships between nutrients and all genome-encoded molecular entities. This holistic approach requires systems biology to scrutinize the effects of diet on tissue biology. To decipher the adipose tissue (AT) response to diet induced weight changes we focused on key molecular (lipids and transcripts) AT species during a longitudinal dietary intervention. To obtain a systems model, a network approach was used to combine all sets of variables (bio-clinical, fatty acids and mRNA levels) and get an overview of their interactions. AT fatty acids and mRNA levels were quantified in 135 obese women at baseline, after an 8-week low calorie diet (LCD) and after 6 months of ad libitum weight maintenance diet (WMD). After LCD, individuals were stratified a posteriori according to weight change during WMD. A 3 steps approach was used to infer a global model involving the 3 sets of variables. It consisted in inferring intra-omic networks with sparse partial correlations and inter-omic networks with regularized canonical correlation analysis and finally combining the obtained omic-specific network in a single global model. The resulting networks were analyzed using node clustering, systematic important node extraction and cluster comparisons. Overall, AT showed both constant and phase-specific biological signatures in response to dietary intervention. AT from women regaining weight displayed growth factors, angiogenesis and proliferation signaling signatures, suggesting unfavorable tissue hyperplasia. By contrast, after LCD a strong positive relationship between AT myristoleic acid (a fatty acid with low AT level) content and de novo lipogenesis mRNAs was found. This relationship was also observed, after WMD, in the group of women that continued to lose weight. This original system biology approach provides novel insight in the AT response to weight control by highlighting the central role of myristoleic acid that may account for the beneficial effects of weight loss. PMID:25590576
Metabolome Integrated Analysis of High-Temperature Response in Pinus radiata.
Escandón, Mónica; Meijón, Mónica; Valledor, Luis; Pascual, Jesús; Pinto, Gloria; Cañal, María Jesús
2018-01-01
The integrative omics approach is crucial to identify the molecular mechanisms underlying high-temperature response in non-model species. Based on future scenarios of heat increase, Pinus radiata plants were exposed to a temperature of 40°C for a period of 5 days, including recovered plants (30 days after last exposure to 40°C) in the analysis. The analysis of the metabolome using complementary mass spectrometry techniques (GC-MS and LC-Orbitrap-MS) allowed the reliable quantification of 2,287 metabolites. The analysis of identified metabolites and highlighter metabolic pathways across heat time exposure reveal the dynamism of the metabolome in relation to high-temperature response in P. radiata , identifying the existence of a turning point (on day 3) at which P. radiata plants changed from an initial stress response program (shorter-term response) to an acclimation one (longer-term response). Furthermore, the integration of metabolome and physiological measurements, which cover from the photosynthetic state to hormonal profile, suggests a complex metabolic pathway interaction network related to heat-stress response. Cytokinins (CKs), fatty acid metabolism and flavonoid and terpenoid biosynthesis were revealed as the most important pathways involved in heat-stress response in P. radiata , with zeatin riboside (ZR) and isopentenyl adenosine (iPA) as the key hormones coordinating these multiple and complex interactions. On the other hand, the integrative approach allowed elucidation of crucial metabolic mechanisms involved in heat response in P. radiata , as well as the identification of thermotolerance metabolic biomarkers (L-phenylalanine, hexadecanoic acid, and dihydromyricetin), crucial metabolites which can reschedule the metabolic strategy to adapt to high temperature.
Metabolome Integrated Analysis of High-Temperature Response in Pinus radiata
Escandón, Mónica; Meijón, Mónica; Valledor, Luis; Pascual, Jesús; Pinto, Gloria; Cañal, María Jesús
2018-01-01
The integrative omics approach is crucial to identify the molecular mechanisms underlying high-temperature response in non-model species. Based on future scenarios of heat increase, Pinus radiata plants were exposed to a temperature of 40°C for a period of 5 days, including recovered plants (30 days after last exposure to 40°C) in the analysis. The analysis of the metabolome using complementary mass spectrometry techniques (GC-MS and LC-Orbitrap-MS) allowed the reliable quantification of 2,287 metabolites. The analysis of identified metabolites and highlighter metabolic pathways across heat time exposure reveal the dynamism of the metabolome in relation to high-temperature response in P. radiata, identifying the existence of a turning point (on day 3) at which P. radiata plants changed from an initial stress response program (shorter-term response) to an acclimation one (longer-term response). Furthermore, the integration of metabolome and physiological measurements, which cover from the photosynthetic state to hormonal profile, suggests a complex metabolic pathway interaction network related to heat-stress response. Cytokinins (CKs), fatty acid metabolism and flavonoid and terpenoid biosynthesis were revealed as the most important pathways involved in heat-stress response in P. radiata, with zeatin riboside (ZR) and isopentenyl adenosine (iPA) as the key hormones coordinating these multiple and complex interactions. On the other hand, the integrative approach allowed elucidation of crucial metabolic mechanisms involved in heat response in P. radiata, as well as the identification of thermotolerance metabolic biomarkers (L-phenylalanine, hexadecanoic acid, and dihydromyricetin), crucial metabolites which can reschedule the metabolic strategy to adapt to high temperature. PMID:29719546