Sample records for genes coexpression modules

  1. The Association of Multiple Interacting Genes with Specific Phenotypes in Rice Using Gene Coexpression Networks1[C][W][OA

    PubMed Central

    Ficklin, Stephen P.; Luo, Feng; Feltus, F. Alex

    2010-01-01

    Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes. PMID:20668062

  2. The association of multiple interacting genes with specific phenotypes in rice using gene coexpression networks.

    PubMed

    Ficklin, Stephen P; Luo, Feng; Feltus, F Alex

    2010-09-01

    Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes.

  3. Discovery and validation of a glioblastoma co-expressed gene module

    PubMed Central

    Dunwoodie, Leland J.; Poehlman, William L.; Ficklin, Stephen P.; Feltus, Frank Alexander

    2018-01-01

    Tumors exhibit complex patterns of aberrant gene expression. Using a knowledge-independent, noise-reducing gene co-expression network construction software called KINC, we created multiple RNAseq-based gene co-expression networks relevant to brain and glioblastoma biology. In this report, we describe the discovery and validation of a glioblastoma-specific gene module that contains 22 co-expressed genes. The genes are upregulated in glioblastoma relative to normal brain and lower grade glioma samples; they are also hypo-methylated in glioblastoma relative to lower grade glioma tumors. Among the proneural, neural, mesenchymal, and classical glioblastoma subtypes, these genes are most-highly expressed in the mesenchymal subtype. Furthermore, high expression of these genes is associated with decreased survival across each glioblastoma subtype. These genes are of interest to glioblastoma biology and our gene interaction discovery and validation workflow can be used to discover and validate co-expressed gene modules derived from any co-expression network. PMID:29541392

  4. Discovery and validation of a glioblastoma co-expressed gene module.

    PubMed

    Dunwoodie, Leland J; Poehlman, William L; Ficklin, Stephen P; Feltus, Frank Alexander

    2018-02-16

    Tumors exhibit complex patterns of aberrant gene expression. Using a knowledge-independent, noise-reducing gene co-expression network construction software called KINC, we created multiple RNAseq-based gene co-expression networks relevant to brain and glioblastoma biology. In this report, we describe the discovery and validation of a glioblastoma-specific gene module that contains 22 co-expressed genes. The genes are upregulated in glioblastoma relative to normal brain and lower grade glioma samples; they are also hypo-methylated in glioblastoma relative to lower grade glioma tumors. Among the proneural, neural, mesenchymal, and classical glioblastoma subtypes, these genes are most-highly expressed in the mesenchymal subtype. Furthermore, high expression of these genes is associated with decreased survival across each glioblastoma subtype. These genes are of interest to glioblastoma biology and our gene interaction discovery and validation workflow can be used to discover and validate co-expressed gene modules derived from any co-expression network.

  5. Identification of Transcriptional Modules and Key Genes in Chickens Infected with Salmonella enterica Serovar Pullorum Using Integrated Coexpression Analyses.

    PubMed

    Liu, Bao-Hong; Cai, Jian-Ping

    2017-01-01

    Salmonella enterica Pullorum is one of the leading causes of mortality in poultry. Understanding the molecular response in chickens in response to the infection by S. enterica is important in revealing the mechanisms of pathogenesis and disease progress. There have been studies on identifying genes associated with Salmonella infection by differential expression analysis, but the relationships among regulated genes have not been investigated. In this study, we employed weighted gene coexpression network analysis (WGCNA) and differential coexpression analysis (DCEA) to identify coexpression modules by exploring microarray data derived from chicken splenic tissues in response to the S. enterica infection. A total of 19 modules from 13,538 genes were associated with the Jak-STAT signaling pathway, the extracellular matrix, cytoskeleton organization, the regulation of the actin cytoskeleton, G-protein coupled receptor activity, Toll-like receptor signaling pathways, and immune system processes; among them, 14 differentially coexpressed modules (DCMs) and 2,856 differentially coexpressed genes (DCGs) were identified. The global expression of module genes between infected and uninfected chickens showed slight differences but considerable changes for global coexpression. Furthermore, DCGs were consistently linked to the hubs of the modules. These results will help prioritize candidate genes for future studies of Salmonella infection.

  6. Identification of Transcriptional Modules and Key Genes in Chickens Infected with Salmonella enterica Serovar Pullorum Using Integrated Coexpression Analyses

    PubMed Central

    2017-01-01

    Salmonella enterica Pullorum is one of the leading causes of mortality in poultry. Understanding the molecular response in chickens in response to the infection by S. enterica is important in revealing the mechanisms of pathogenesis and disease progress. There have been studies on identifying genes associated with Salmonella infection by differential expression analysis, but the relationships among regulated genes have not been investigated. In this study, we employed weighted gene coexpression network analysis (WGCNA) and differential coexpression analysis (DCEA) to identify coexpression modules by exploring microarray data derived from chicken splenic tissues in response to the S. enterica infection. A total of 19 modules from 13,538 genes were associated with the Jak-STAT signaling pathway, the extracellular matrix, cytoskeleton organization, the regulation of the actin cytoskeleton, G-protein coupled receptor activity, Toll-like receptor signaling pathways, and immune system processes; among them, 14 differentially coexpressed modules (DCMs) and 2,856 differentially coexpressed genes (DCGs) were identified. The global expression of module genes between infected and uninfected chickens showed slight differences but considerable changes for global coexpression. Furthermore, DCGs were consistently linked to the hubs of the modules. These results will help prioritize candidate genes for future studies of Salmonella infection. PMID:28529955

  7. Hybrid coexpression link similarity graph clustering for mining biological modules from multiple gene expression datasets.

    PubMed

    Salem, Saeed; Ozcaglar, Cagri

    2014-01-01

    Advances in genomic technologies have enabled the accumulation of vast amount of genomic data, including gene expression data for multiple species under various biological and environmental conditions. Integration of these gene expression datasets is a promising strategy to alleviate the challenges of protein functional annotation and biological module discovery based on a single gene expression data, which suffers from spurious coexpression. We propose a joint mining algorithm that constructs a weighted hybrid similarity graph whose nodes are the coexpression links. The weight of an edge between two coexpression links in this hybrid graph is a linear combination of the topological similarities and co-appearance similarities of the corresponding two coexpression links. Clustering the weighted hybrid similarity graph yields recurrent coexpression link clusters (modules). Experimental results on Human gene expression datasets show that the reported modules are functionally homogeneous as evident by their enrichment with biological process GO terms and KEGG pathways.

  8. Global Landscape of a Co-Expressed Gene Network in Barley and its Application to Gene Discovery in Triticeae Crops

    PubMed Central

    Mochida, Keiichi; Uehara-Yamaguchi, Yukiko; Yoshida, Takuhiro; Sakurai, Tetsuya; Shinozaki, Kazuo

    2011-01-01

    Accumulated transcriptome data can be used to investigate regulatory networks of genes involved in various biological systems. Co-expression analysis data sets generated from comprehensively collected transcriptome data sets now represent efficient resources that are capable of facilitating the discovery of genes with closely correlated expression patterns. In order to construct a co-expression network for barley, we analyzed 45 publicly available experimental series, which are composed of 1,347 sets of GeneChip data for barley. On the basis of a gene-to-gene weighted correlation coefficient, we constructed a global barley co-expression network and classified it into clusters of subnetwork modules. The resulting clusters are candidates for functional regulatory modules in the barley transcriptome. To annotate each of the modules, we performed comparative annotation using genes in Arabidopsis and Brachypodium distachyon. On the basis of a comparative analysis between barley and two model species, we investigated functional properties from the representative distributions of the gene ontology (GO) terms. Modules putatively involved in drought stress response and cellulose biogenesis have been identified. These modules are discussed to demonstrate the effectiveness of the co-expression analysis. Furthermore, we applied the data set of co-expressed genes coupled with comparative analysis in attempts to discover potentially Triticeae-specific network modules. These results demonstrate that analysis of the co-expression network of the barley transcriptome together with comparative analysis should promote the process of gene discovery in barley. Furthermore, the insights obtained should be transferable to investigations of Triticeae plants. The associated data set generated in this analysis is publicly accessible at http://coexpression.psc.riken.jp/barley/. PMID:21441235

  9. A Global Coexpression Network Approach for Connecting Genes to Specialized Metabolic Pathways in Plants

    PubMed Central

    Borowsky, Alexander T.

    2017-01-01

    Plants produce diverse specialized metabolites (SMs), but the genes responsible for their production and regulation remain largely unknown, hindering efforts to tap plant pharmacopeia. Given that genes comprising SM pathways exhibit environmentally dependent coregulation, we hypothesized that genes within a SM pathway would form tight associations (modules) with each other in coexpression networks, facilitating their identification. To evaluate this hypothesis, we used 10 global coexpression data sets, each a meta-analysis of hundreds to thousands of experiments, across eight plant species to identify hundreds of coexpressed gene modules per data set. In support of our hypothesis, 15.3 to 52.6% of modules contained two or more known SM biosynthetic genes, and module genes were enriched in SM functions. Moreover, modules recovered many experimentally validated SM pathways, including all six known to form biosynthetic gene clusters (BGCs). In contrast, bioinformatically predicted BGCs (i.e., those lacking an associated metabolite) were no more coexpressed than the null distribution for neighboring genes. These results suggest that most predicted plant BGCs are not genuine SM pathways and argue that BGCs are not a hallmark of plant specialized metabolism. We submit that global gene coexpression is a rich, largely untapped resource for discovering the genetic basis and architecture of plant natural products. PMID:28408660

  10. Identification of potential transcriptomic markers in developing pediatric sepsis: a weighted gene co-expression network analysis and a case-control validation study.

    PubMed

    Li, Yiping; Li, Yanhong; Bai, Zhenjiang; Pan, Jian; Wang, Jian; Fang, Fang

    2017-12-13

    Sepsis represents a complex disease with the dysregulated inflammatory response and high mortality rate. The goal of this study was to identify potential transcriptomic markers in developing pediatric sepsis by a co-expression module analysis of the transcriptomic dataset. Using the R software and Bioconductor packages, we performed a weighted gene co-expression network analysis to identify co-expression modules significantly associated with pediatric sepsis. Functional interpretation (gene ontology and pathway analysis) and enrichment analysis with known transcription factors and microRNAs of the identified candidate modules were then performed. In modules significantly associated with sepsis, the intramodular analysis was further performed and "hub genes" were identified and validated by quantitative real-time PCR (qPCR) in this study. 15 co-expression modules in total were detected, and four modules ("midnight blue", "cyan", "brown", and "tan") were most significantly associated with pediatric sepsis and suggested as potential sepsis-associated modules. Gene ontology analysis and pathway analysis revealed that these four modules strongly associated with immune response. Three of the four sepsis-associated modules were also enriched with known transcription factors (false discovery rate-adjusted P < 0.05). Hub genes were identified in each of the four modules. Four of the identified hub genes (MYB proto-oncogene like 1, killer cell lectin like receptor G1, stomatin, and membrane spanning 4-domains A4A) were further validated to be differentially expressed between septic children and controls by qPCR. Four pediatric sepsis-associated co-expression modules were identified in this study. qPCR results suggest that hub genes in these modules are potential transcriptomic markers for pediatric sepsis diagnosis. These results provide novel insights into the pathogenesis of pediatric sepsis and promote the generation of diagnostic gene sets.

  11. Analysis of bHLH coding genes using gene co-expression network approach.

    PubMed

    Srivastava, Swati; Sanchita; Singh, Garima; Singh, Noopur; Srivastava, Gaurava; Sharma, Ashok

    2016-07-01

    Network analysis provides a powerful framework for the interpretation of data. It uses novel reference network-based metrices for module evolution. These could be used to identify module of highly connected genes showing variation in co-expression network. In this study, a co-expression network-based approach was used for analyzing the genes from microarray data. Our approach consists of a simple but robust rank-based network construction. The publicly available gene expression data of Solanum tuberosum under cold and heat stresses were considered to create and analyze a gene co-expression network. The analysis provide highly co-expressed module of bHLH coding genes based on correlation values. Our approach was to analyze the variation of genes expression, according to the time period of stress through co-expression network approach. As the result, the seed genes were identified showing multiple connections with other genes in the same cluster. Seed genes were found to be vary in different time periods of stress. These analyzed seed genes may be utilized further as marker genes for developing the stress tolerant plant species.

  12. Hybrid coexpression link similarity graph clustering for mining biological modules from multiple gene expression datasets

    PubMed Central

    2014-01-01

    Background Advances in genomic technologies have enabled the accumulation of vast amount of genomic data, including gene expression data for multiple species under various biological and environmental conditions. Integration of these gene expression datasets is a promising strategy to alleviate the challenges of protein functional annotation and biological module discovery based on a single gene expression data, which suffers from spurious coexpression. Results We propose a joint mining algorithm that constructs a weighted hybrid similarity graph whose nodes are the coexpression links. The weight of an edge between two coexpression links in this hybrid graph is a linear combination of the topological similarities and co-appearance similarities of the corresponding two coexpression links. Clustering the weighted hybrid similarity graph yields recurrent coexpression link clusters (modules). Experimental results on Human gene expression datasets show that the reported modules are functionally homogeneous as evident by their enrichment with biological process GO terms and KEGG pathways. PMID:25221624

  13. From Saccharomyces cerevisiae to human: The important gene co-expression modules.

    PubMed

    Liu, Wei; Li, Li; Ye, Hua; Chen, Haiwei; Shen, Weibiao; Zhong, Yuexian; Tian, Tian; He, Huaqin

    2017-08-01

    Network-based systems biology has become an important method for analyzing high-throughput gene expression data and gene function mining. Yeast has long been a popular model organism for biomedical research. In the current study, a weighted gene co-expression network analysis algorithm was applied to construct a gene co-expression network in Saccharomyces cerevisiae . Seventeen stable gene co-expression modules were detected from 2,814 S. cerevisiae microarray data. Further characterization of these modules with the Database for Annotation, Visualization and Integrated Discovery tool indicated that these modules were associated with certain biological processes, such as heat response, cell cycle, translational regulation, mitochondrion oxidative phosphorylation, amino acid metabolism and autophagy. Hub genes were also screened by intra-modular connectivity. Finally, the module conservation was evaluated in a human disease microarray dataset. Functional modules were identified in budding yeast, some of which are associated with patient survival. The current study provided a paradigm for single cell microorganisms and potentially other organisms.

  14. Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules

    PubMed Central

    Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex

    2012-01-01

    Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789

  15. Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.

    PubMed

    Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex

    2012-01-01

    Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.

  16. The Drosophila transcriptional network is structured by microbiota.

    PubMed

    Dobson, Adam J; Chaston, John M; Douglas, Angela E

    2016-11-25

    Resident microorganisms (microbiota) have far-reaching effects on the biology of their animal hosts, with major consequences for the host's health and fitness. A full understanding of microbiota-dependent gene regulation requires analysis of the overall architecture of the host transcriptome, by identifying suites of genes that are expressed synchronously. In this study, we investigated the impact of the microbiota on gene coexpression in Drosophila. Our transcriptomic analysis, of 17 lines representative of the global genetic diversity of Drosophila, yielded a total of 11 transcriptional modules of co-expressed genes. For seven of these modules, the strength of the transcriptional network (defined as gene-gene coexpression) differed significantly between flies bearing a defined gut microbiota (gnotobiotic flies) and flies reared under microbiologically sterile conditions (axenic flies). Furthermore, gene coexpression was uniformly stronger in these microbiota-dependent modules than in both the microbiota-independent modules in gnotobiotic flies and all modules in axenic flies, indicating that the presence of the microbiota directs gene regulation in a subset of the transcriptome. The genes constituting the microbiota-dependent transcriptional modules include regulators of growth, metabolism and neurophysiology, previously implicated in mediating phenotypic effects of microbiota on Drosophila phenotype. Together these results provide the first evidence that the microbiota enhances the coexpression of specific and functionally-related genes relative to the animal's intrinsic baseline level of coexpression. Our system-wide analysis demonstrates that the presence of microbiota enhances gene coexpression, thereby structuring the transcriptional network in the animal host. This finding has potentially major implications for understanding of the mechanisms by which microbiota affect host health and fitness, and the ways in which hosts and their resident microbiota coevolve.

  17. A Global Coexpression Network Approach for Connecting Genes to Specialized Metabolic Pathways in Plants.

    PubMed

    Wisecaver, Jennifer H; Borowsky, Alexander T; Tzin, Vered; Jander, Georg; Kliebenstein, Daniel J; Rokas, Antonis

    2017-05-01

    Plants produce diverse specialized metabolites (SMs), but the genes responsible for their production and regulation remain largely unknown, hindering efforts to tap plant pharmacopeia. Given that genes comprising SM pathways exhibit environmentally dependent coregulation, we hypothesized that genes within a SM pathway would form tight associations (modules) with each other in coexpression networks, facilitating their identification. To evaluate this hypothesis, we used 10 global coexpression data sets, each a meta-analysis of hundreds to thousands of experiments, across eight plant species to identify hundreds of coexpressed gene modules per data set. In support of our hypothesis, 15.3 to 52.6% of modules contained two or more known SM biosynthetic genes, and module genes were enriched in SM functions. Moreover, modules recovered many experimentally validated SM pathways, including all six known to form biosynthetic gene clusters (BGCs). In contrast, bioinformatically predicted BGCs (i.e., those lacking an associated metabolite) were no more coexpressed than the null distribution for neighboring genes. These results suggest that most predicted plant BGCs are not genuine SM pathways and argue that BGCs are not a hallmark of plant specialized metabolism. We submit that global gene coexpression is a rich, largely untapped resource for discovering the genetic basis and architecture of plant natural products. © 2017 American Society of Plant Biologists. All rights reserved.

  18. Co-expression modules construction by WGCNA and identify potential prognostic markers of uveal melanoma.

    PubMed

    Wan, Qi; Tang, Jing; Han, Yu; Wang, Dan

    2018-01-01

    Uveal melanoma is an aggressive cancer which has a high percentage recurrence and with a worse prognosis. Identify the potential prognostic markers of uveal melanoma may provide information for early detection of recurrence and treatment. RNA sequence data of uveal melanoma and patient clinic traits were obtained from The Cancer Genome Atlas (TCGA) database. Co-expression modules were built by weighted gene co -expression network analysis (WGCNA) and applied to investigate the relationship underlying modules and clinic traits. Besides, functional enrichment analysis was performed on these co-expression genes from interested modules. First, using WGCNA, identified 21 co-expression modules were constructed by the 10975 genes from the 80 human uveal melanoma samples. The number of genes in these modules ranged from 42 to 5091. Found four co -expression modules significantly correlated with three clinic traits (status, recurrence and recurrence Time). Module red, and purple positively correlated with patient's life status and recurrence Time. Module green positively correlates with recurrence. The result of functional enrichment analysis showed that the module magenta was mainly enriched genetic material assemble processes, the purple module was mainly enriched in tissue homeostasis and melanosome membrane and the module red was mainly enriched metastasis of cell, suggesting its critical role in the recurrence and development of the disease. Additionally, identified the hug gene (top connectivity with other genes) in each module. The hub gene SLC17A7, NTRK2, ABTB1 and ADPRHL1 might play a vital role in recurrence of uveal melanoma. Our findings provided the framework of co-expression gene modules of uveal melanoma and identified some prognostic markers might be detection of recurrence and treatment for uveal melanoma. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. No3CoGP: non-conserved and conserved coexpressed gene pairs.

    PubMed

    Mal, Chittabrata; Aftabuddin, Md; Kundu, Sudip

    2014-12-08

    Analyzing the microarray data of different conditions, one can identify the conserved and condition-specific genes and gene modules, and thus can infer the underlying cellular activities. All the available tools based on Bioconductor and R packages differ in how they extract differential coexpression and at what level they study. There is a need for a user-friendly, flexible tool which can start analysis using raw or preprocessed microarray data and can report different levels of useful information. We present a GUI software, No3CoGP: Non-Conserved and Conserved Coexpressed Gene Pairs which takes Affymetrix microarray data (.CEL files or log2 normalized.txt files) along with annotation file (.csv file), Chip Definition File (CDF file) and probe file as inputs, utilizes the concept of network density cut-off and Fisher's z-test to extract biologically relevant information. It can identify four possible types of gene pairs based on their coexpression relationships. These are (i) gene pair showing coexpression in one condition but not in the other, (ii) gene pair which is positively coexpressed in one condition but negatively coexpressed in the other condition, (iii) positively and (iv) negatively coexpressed in both the conditions. Further, it can generate modules of coexpressed genes. Easy-to-use GUI interface enables researchers without knowledge in R language to use No3CoGP. Utilization of one or more CPU cores, depending on the availability, speeds up the program. The output files stored in the respective directories under the user-defined project offer the researchers to unravel condition-specific functionalities of gene, gene sets or modules.

  20. Co-expression network with protein-protein interaction and transcription regulation in malaria parasite Plasmodium falciparum.

    PubMed

    Yu, Fu-Dong; Yang, Shao-You; Li, Yuan-Yuan; Hu, Wei

    2013-04-10

    Malaria continues to be one of the most severe global infectious diseases, as a major threat to human health and economic development. Network-based biological analysis is a promising approach to uncover key genes and biological processes from a network viewpoint, which could not be recognized from individual gene-based signatures. We integrated gene co-expression profile with protein-protein interaction and transcriptional regulation information to construct a comprehensive gene co-expression network of Plasmodium falciparum. Based on this network, we identified 10 core modules by using ICE (Iterative Clique Enumeration) algorithm, which were essential for malaria parasite development in intraerythrocytic developmental cycle (IDC) stages. In each module, all genes were highly correlated probably due to co-regulation or formation of a protein complex. Some of these genes were recognized to be differentially coexpressed among three close-by IDC stages. The gene of prpf8 (PFD0265w) encoding pre-mRNA processing splicing factor 8 product was identified as DCGs (differentially co-expressed genes) among IDC stages, although this gene function was seldom reported in previous researches. Integrating the species-specific gene prediction and differential co-expression gene detection, we found some modules could perform species-specific functions according to some of genes in these modules were species-specific genes, like the module 10. Furthermore, in order to reveal the underlying mechanisms of the erythrocyte invasion by P. falciparum, Steiner Tree algorithm was employed to identify the invasion subnetwork from our gene co-expression network. The subnetwork-based analysis indicated that some important Plasmodium parasite specific genes could corporate with each other and be co-regulated during the parasite invasion process, which including a head-to-head gene pair of PfRH2a (PF13_0198) and PfRH2b (MAL13P1.176). This study based on gene co-expression network could shed new insights on the mechanisms of pathogenesis, even virulence and P. falciparum development. Crown Copyright © 2012. Published by Elsevier B.V. All rights reserved.

  1. Gene Coexpression Network Alignment and Conservation of Gene Modules between Two Grass Species: Maize and Rice[C][W][OA

    PubMed Central

    Ficklin, Stephen P.; Feltus, F. Alex

    2011-01-01

    One major objective for plant biology is the discovery of molecular subsystems underlying complex traits. The use of genetic and genomic resources combined in a systems genetics approach offers a means for approaching this goal. This study describes a maize (Zea mays) gene coexpression network built from publicly available expression arrays. The maize network consisted of 2,071 loci that were divided into 34 distinct modules that contained 1,928 enriched functional annotation terms and 35 cofunctional gene clusters. Of note, 391 maize genes of unknown function were found to be coexpressed within modules along with genes of known function. A global network alignment was made between this maize network and a previously described rice (Oryza sativa) coexpression network. The IsoRankN tool was used, which incorporates both gene homology and network topology for the alignment. A total of 1,173 aligned loci were detected between the two grass networks, which condensed into 154 conserved subgraphs that preserved 4,758 coexpression edges in rice and 6,105 coexpression edges in maize. This study provides an early view into maize coexpression space and provides an initial network-based framework for the translation of functional genomic and genetic information between these two vital agricultural species. PMID:21606319

  2. Gene coexpression network alignment and conservation of gene modules between two grass species: maize and rice.

    PubMed

    Ficklin, Stephen P; Feltus, F Alex

    2011-07-01

    One major objective for plant biology is the discovery of molecular subsystems underlying complex traits. The use of genetic and genomic resources combined in a systems genetics approach offers a means for approaching this goal. This study describes a maize (Zea mays) gene coexpression network built from publicly available expression arrays. The maize network consisted of 2,071 loci that were divided into 34 distinct modules that contained 1,928 enriched functional annotation terms and 35 cofunctional gene clusters. Of note, 391 maize genes of unknown function were found to be coexpressed within modules along with genes of known function. A global network alignment was made between this maize network and a previously described rice (Oryza sativa) coexpression network. The IsoRankN tool was used, which incorporates both gene homology and network topology for the alignment. A total of 1,173 aligned loci were detected between the two grass networks, which condensed into 154 conserved subgraphs that preserved 4,758 coexpression edges in rice and 6,105 coexpression edges in maize. This study provides an early view into maize coexpression space and provides an initial network-based framework for the translation of functional genomic and genetic information between these two vital agricultural species.

  3. A conserved BDNF, glutamate- and GABA-enriched gene module related to human depression identified by coexpression meta-analysis and DNA variant genome-wide association studies.

    PubMed

    Chang, Lun-Ching; Jamain, Stephane; Lin, Chien-Wei; Rujescu, Dan; Tseng, George C; Sibille, Etienne

    2014-01-01

    Large scale gene expression (transcriptome) analysis and genome-wide association studies (GWAS) for single nucleotide polymorphisms have generated a considerable amount of gene- and disease-related information, but heterogeneity and various sources of noise have limited the discovery of disease mechanisms. As systematic dataset integration is becoming essential, we developed methods and performed meta-clustering of gene coexpression links in 11 transcriptome studies from postmortem brains of human subjects with major depressive disorder (MDD) and non-psychiatric control subjects. We next sought enrichment in the top 50 meta-analyzed coexpression modules for genes otherwise identified by GWAS for various sets of disorders. One coexpression module of 88 genes was consistently and significantly associated with GWAS for MDD, other neuropsychiatric disorders and brain functions, and for medical illnesses with elevated clinical risk of depression, but not for other diseases. In support of the superior discriminative power of this novel approach, we observed no significant enrichment for GWAS-related genes in coexpression modules extracted from single studies or in meta-modules using gene expression data from non-psychiatric control subjects. Genes in the identified module encode proteins implicated in neuronal signaling and structure, including glutamate metabotropic receptors (GRM1, GRM7), GABA receptors (GABRA2, GABRA4), and neurotrophic and development-related proteins [BDNF, reelin (RELN), Ephrin receptors (EPHA3, EPHA5)]. These results are consistent with the current understanding of molecular mechanisms of MDD and provide a set of putative interacting molecular partners, potentially reflecting components of a functional module across cells and biological pathways that are synchronously recruited in MDD, other brain disorders and MDD-related illnesses. Collectively, this study demonstrates the importance of integrating transcriptome data, gene coexpression modules and GWAS results for providing novel and complementary approaches to investigate the molecular pathology of MDD and other complex brain disorders.

  4. Differential co-expression analysis reveals a novel prognostic gene module in ovarian cancer.

    PubMed

    Gov, Esra; Arga, Kazim Yalcin

    2017-07-10

    Ovarian cancer is one of the most significant disease among gynecological disorders that women suffered from over the centuries. However, disease-specific and effective biomarkers were still not available, since studies have focused on individual genes associated with ovarian cancer, ignoring the interactions and associations among the gene products. Here, ovarian cancer differential co-expression networks were reconstructed via meta-analysis of gene expression data and co-expressed gene modules were identified in epithelial cells from ovarian tumor and healthy ovarian surface epithelial samples to propose ovarian cancer associated genes and their interactions. We propose a novel, highly interconnected, differentially co-expressed, and co-regulated gene module in ovarian cancer consisting of 84 prognostic genes. Furthermore, the specificity of the module to ovarian cancer was shown through analyses of datasets in nine other cancers. These observations underscore the importance of transcriptome based systems biomarkers research in deciphering the elusive pathophysiology of ovarian cancer, and here, we present reciprocal interplay between candidate ovarian cancer genes and their transcriptional regulatory dynamics. The corresponding gene module might provide new insights on ovarian cancer prognosis and treatment strategies that continue to place a significant burden on global health.

  5. Identification of gene expression profiles and key genes in subchondral bone of osteoarthritis using weighted gene coexpression network analysis.

    PubMed

    Guo, Sheng-Min; Wang, Jian-Xiong; Li, Jin; Xu, Fang-Yuan; Wei, Quan; Wang, Hai-Ming; Huang, Hou-Qiang; Zheng, Si-Lin; Xie, Yu-Jie; Zhang, Chi

    2018-06-15

    Osteoarthritis (OA) significantly influences the quality life of people around the world. It is urgent to find an effective way to understand the genetic etiology of OA. We used weighted gene coexpression network analysis (WGCNA) to explore the key genes involved in the subchondral bone pathological process of OA. Fifty gene expression profiles of GSE51588 were downloaded from the Gene Expression Omnibus database. The OA-associated genes and gene ontologies were acquired from JuniorDoc. Weighted gene coexpression network analysis was used to find disease-related networks based on 21756 gene expression correlation coefficients, hub-genes with the highest connectivity in each module were selected, and the correlation between module eigengene and clinical traits was calculated. The genes in the traits-related gene coexpression modules were subject to functional annotation and pathway enrichment analysis using ClusterProfiler. A total of 73 gene modules were identified, of which, 12 modules were found with high connectivity with clinical traits. Five modules were found with enriched OA-associated genes. Moreover, 310 OA-associated genes were found, and 34 of them were among hub-genes in each module. Consequently, enrichment results indicated some key metabolic pathways, such as extracellular matrix (ECM)-receptor interaction (hsa04512), focal adhesion (hsa04510), the phosphatidylinositol 3'-kinase (PI3K)-Akt signaling pathway (PI3K-AKT) (hsa04151), transforming growth factor beta pathway, and Wnt pathway. We intended to identify some core genes, collagen (COL)6A3, COL6A1, ITGA11, BAMBI, and HCK, which could influence downstream signaling pathways once they were activated. In this study, we identified important genes within key coexpression modules, which associate with a pathological process of subchondral bone in OA. Functional analysis results could provide important information to understand the mechanism of OA. © 2018 Wiley Periodicals, Inc.

  6. Comparison of gene co-networks reveals the molecular mechanisms of the rice (Oryza sativa L.) response to Rhizoctonia solani AG1 IA infection.

    PubMed

    Zhang, Jinfeng; Zhao, Wenjuan; Fu, Rong; Fu, Chenglin; Wang, Lingxia; Liu, Huainian; Li, Shuangcheng; Deng, Qiming; Wang, Shiquan; Zhu, Jun; Liang, Yueyang; Li, Ping; Zheng, Aiping

    2018-05-05

    Rhizoctonia solani causes rice sheath blight, an important disease affecting the growth of rice (Oryza sativa L.). Attempts to control the disease have met with little success. Based on transcriptional profiling, we previously identified more than 11,947 common differentially expressed genes (TPM > 10) between the rice genotypes TeQing and Lemont. In the current study, we extended these findings by focusing on an analysis of gene co-expression in response to R. solani AG1 IA and identified gene modules within the networks through weighted gene co-expression network analysis (WGCNA). We compared the different genes assigned to each module and the biological interpretations of gene co-expression networks at early and later modules in the two rice genotypes to reveal differential responses to AG1 IA. Our results show that different changes occurred in the two rice genotypes and that the modules in the two groups contain a number of candidate genes possibly involved in pathogenesis, such as the VQ protein. Furthermore, these gene co-expression networks provide comprehensive transcriptional information regarding gene expression in rice in response to AG1 IA. The co-expression networks derived from our data offer ideas for follow-up experimentation that will help advance our understanding of the translational regulation of rice gene expression changes in response to AG1 IA.

  7. Gene co-expression network analysis in Rhodobacter capsulatus and application to comparative expression analysis of Rhodobacter sphaeroides

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pena-Castillo, Lourdes; Mercer, Ryan; Gurinovich, Anastasia

    2014-08-28

    The genus Rhodobacter contains purple nonsulfur bacteria found mostly in freshwater environments. Representative strains of two Rhodobacter species, R. capsulatus and R. sphaeroides, have had their genomes fully sequenced and both have been the subject of transcriptional profiling studies. Gene co-expression networks can be used to identify modules of genes with similar expression profiles. Functional analysis of gene modules can then associate co-expressed genes with biological pathways, and network statistics can determine the degree of module preservation in related networks. In this paper, we constructed an R. capsulatus gene co-expression network, performed functional analysis of identified gene modules, and investigatedmore » preservation of these modules in R. capsulatus proteomics data and in R. sphaeroides transcriptomics data. Results: The analysis identified 40 gene co-expression modules in R. capsulatus. Investigation of the module gene contents and expression profiles revealed patterns that were validated based on previous studies supporting the biological relevance of these modules. We identified two R. capsulatus gene modules preserved in the protein abundance data. We also identified several gene modules preserved between both Rhodobacter species, which indicate that these cellular processes are conserved between the species and are candidates for functional information transfer between species. Many gene modules were non-preserved, providing insight into processes that differentiate the two species. In addition, using Local Network Similarity (LNS), a recently proposed metric for expression divergence, we assessed the expression conservation of between-species pairs of orthologs, and within-species gene-protein expression profiles. Conclusions: Our analyses provide new sources of information for functional annotation in R. capsulatus because uncharacterized genes in modules are now connected with groups of genes that constitute a joint functional annotation. We identified R. capsulatus modules enriched with genes for ribosomal proteins, porphyrin and bacteriochlorophyll anabolism, and biosynthesis of secondary metabolites to be preserved in R. sphaeroides whereas modules related to RcGTA production and signalling showed lack of preservation in R. sphaeroides. In addition, we demonstrated that network statistics may also be applied within-species to identify congruence between mRNA expression and protein abundance data for which simple correlation measurements have previously had mixed results.« less

  8. Reconstruction of an Integrated Genome-Scale Co-Expression Network Reveals Key Modules Involved in Lung Adenocarcinoma

    PubMed Central

    Hosseini Ashtiani, Saman; Moeini, Ali; Nowzari-Dalini, Abbas; Masoudi-Nejad, Ali

    2013-01-01

    Our goal of this study was to reconstruct a “genome-scale co-expression network” and find important modules in lung adenocarcinoma so that we could identify the genes involved in lung adenocarcinoma. We integrated gene mutation, GWAS, CGH, array-CGH and SNP array data in order to identify important genes and loci in genome-scale. Afterwards, on the basis of the identified genes a co-expression network was reconstructed from the co-expression data. The reconstructed network was named “genome-scale co-expression network”. As the next step, 23 key modules were disclosed through clustering. In this study a number of genes have been identified for the first time to be implicated in lung adenocarcinoma by analyzing the modules. The genes EGFR, PIK3CA, TAF15, XIAP, VAPB, Appl1, Rab5a, ARF4, CLPTM1L, SP4, ZNF124, LPP, FOXP1, SOX18, MSX2, NFE2L2, SMARCC1, TRA2B, CBX3, PRPF6, ATP6V1C1, MYBBP1A, MACF1, GRM2, TBXA2R, PRKAR2A, PTK2, PGF and MYO10 are among the genes that belong to modules 1 and 22. All these genes, being implicated in at least one of the phenomena, namely cell survival, proliferation and metastasis, have an over-expression pattern similar to that of EGFR. In few modules, the genes such as CCNA2 (Cyclin A2), CCNB2 (Cyclin B2), CDK1, CDK5, CDC27, CDCA5, CDCA8, ASPM, BUB1, KIF15, KIF2C, NEK2, NUSAP1, PRC1, SMC4, SYCE2, TFDP1, CDC42 and ARHGEF9 are present that play a crucial role in cell cycle progression. In addition to the mentioned genes, there are some other genes (i.e. DLGAP5, BIRC5, PSMD2, Src, TTK, SENP2, PSMD2, DOK2, FUS and etc.) in the modules. PMID:23874428

  9. Reconstruction of an integrated genome-scale co-expression network reveals key modules involved in lung adenocarcinoma.

    PubMed

    Bidkhori, Gholamreza; Narimani, Zahra; Hosseini Ashtiani, Saman; Moeini, Ali; Nowzari-Dalini, Abbas; Masoudi-Nejad, Ali

    2013-01-01

    Our goal of this study was to reconstruct a "genome-scale co-expression network" and find important modules in lung adenocarcinoma so that we could identify the genes involved in lung adenocarcinoma. We integrated gene mutation, GWAS, CGH, array-CGH and SNP array data in order to identify important genes and loci in genome-scale. Afterwards, on the basis of the identified genes a co-expression network was reconstructed from the co-expression data. The reconstructed network was named "genome-scale co-expression network". As the next step, 23 key modules were disclosed through clustering. In this study a number of genes have been identified for the first time to be implicated in lung adenocarcinoma by analyzing the modules. The genes EGFR, PIK3CA, TAF15, XIAP, VAPB, Appl1, Rab5a, ARF4, CLPTM1L, SP4, ZNF124, LPP, FOXP1, SOX18, MSX2, NFE2L2, SMARCC1, TRA2B, CBX3, PRPF6, ATP6V1C1, MYBBP1A, MACF1, GRM2, TBXA2R, PRKAR2A, PTK2, PGF and MYO10 are among the genes that belong to modules 1 and 22. All these genes, being implicated in at least one of the phenomena, namely cell survival, proliferation and metastasis, have an over-expression pattern similar to that of EGFR. In few modules, the genes such as CCNA2 (Cyclin A2), CCNB2 (Cyclin B2), CDK1, CDK5, CDC27, CDCA5, CDCA8, ASPM, BUB1, KIF15, KIF2C, NEK2, NUSAP1, PRC1, SMC4, SYCE2, TFDP1, CDC42 and ARHGEF9 are present that play a crucial role in cell cycle progression. In addition to the mentioned genes, there are some other genes (i.e. DLGAP5, BIRC5, PSMD2, Src, TTK, SENP2, PSMD2, DOK2, FUS and etc.) in the modules.

  10. Identification of a gene module associated with BMD through the integration of network analysis and genome-wide association data.

    PubMed

    Farber, Charles R

    2010-11-01

    Bone mineral density (BMD) is influenced by a complex network of gene interactions; therefore, elucidating the relationships between genes and how those genes, in turn, influence BMD is critical for developing a comprehensive understanding of osteoporosis. To investigate the role of transcriptional networks in the regulation of BMD, we performed a weighted gene coexpression network analysis (WGCNA) using microarray expression data on monocytes from young individuals with low or high BMD. WGCNA groups genes into modules based on patterns of gene coexpression. and our analysis identified 11 gene modules. We observed that the overall expression of one module (referred to as module 9) was significantly higher in the low-BMD group (p = .03). Module 9 was highly enriched for genes belonging to the immune system-related gene ontology (GO) category "response to virus" (p = 7.6 × 10(-11)). Using publically available genome-wide association study data, we independently validated the importance of module 9 by demonstrating that highly connected module 9 hubs were more likely, relative to less highly connected genes, to be genetically associated with BMD. This study highlights the advantages of systems-level analyses to uncover coexpression modules associated with bone mass and suggests that particular monocyte expression patterns may mediate differences in BMD. © 2010 American Society for Bone and Mineral Research.

  11. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Jing; Ma, Zihao; Carr, Steven A.

    Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC).more » Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies. Molecular & Cellular Proteomics 16: 10.1074/mcp.M116.060301, 121–134, 2017.« less

  12. Identification of common coexpression modules based on quantitative network comparison.

    PubMed

    Jo, Yousang; Kim, Sanghyeon; Lee, Doheon

    2018-06-13

    Finding common molecular interactions from different samples is essential work to understanding diseases and other biological processes. Coexpression networks and their modules directly reflect sample-specific interactions among genes. Therefore, identification of common coexpression network or modules may reveal the molecular mechanism of complex disease or the relationship between biological processes. However, there has been no quantitative network comparison method for coexpression networks and we examined previous methods for other networks that cannot be applied to coexpression network. Therefore, we aimed to propose quantitative comparison methods for coexpression networks and to find common biological mechanisms between Huntington's disease and brain aging by the new method. We proposed two similarity measures for quantitative comparison of coexpression networks. Then, we performed experiments using known coexpression networks. We showed the validity of two measures and evaluated threshold values for similar coexpression network pairs from experiments. Using these similarity measures and thresholds, we quantitatively measured the similarity between disease-specific and aging-related coexpression modules and found similar Huntington's disease-aging coexpression module pairs. We identified similar Huntington's disease-aging coexpression module pairs and found that these modules are related to brain development, cell death, and immune response. It suggests that up-regulated cell signalling related cell death and immune/ inflammation response may be the common molecular mechanisms in the pathophysiology of HD and normal brain aging in the frontal cortex.

  13. Co-expression analysis reveals key gene modules and pathway of human coronary heart disease.

    PubMed

    Tang, Yu; Ke, Zun-Ping; Peng, Yi-Gen; Cai, Ping-Tai

    2018-02-01

    Coronary heart disease is a kind of disease which causes great injury to people world-widely. Although gene expression analyses had been performed previously, to our best knowledge, systemic co-expression analysis for this disease is still lacking to date. Microarray data of coronary heart disease was downloaded from NCBI with the accession number of GSE20681. Co-expression modules were constructed by WGCNA. Besides, the connectivity degree of eigengenes was analyzed. Furthermore, GO and KEGG enrichment analysis was performed on these eigengenes in these constructed modules. A total of 11 co-expression modules were constructed by the 3000 up-regulated genes from the 99 samples with coronary heart disease. The average number of genes in these modules was 270. The interaction analysis indicated the relative independence of gene expression in these modules. The functional enrichment analysis showed that there was a significant difference in the enriched terms and degree among these 11 modules. The results showed that modules 9 and 10 played critical roles in the occurrence of coronary disease. Pathways of hsa00190 (oxidative phosphorylation) and (hsa01130: biosynthesis of antibiotics) were thought to be closely related to the occurrence and development of coronary heart disease. Our result demonstrated that modules 9 and 10 were the most critical modules in the occurrence of coronary heart disease. Pathways as hsa00190 (oxidative phosphorylation) and (hsa01130: biosynthesis of antibiotics) had the potential to serve as the prognostic and predictive marker of coronary heart disease. © 2017 Wiley Periodicals, Inc.

  14. Using scale and feather traits for module construction provides a functional approach to chicken epidermal development.

    PubMed

    Bao, Weier; Greenwold, Matthew J; Sawyer, Roger H

    2017-11-01

    Gene co-expression network analysis has been a research method widely used in systematically exploring gene function and interaction. Using the Weighted Gene Co-expression Network Analysis (WGCNA) approach to construct a gene co-expression network using data from a customized 44K microarray transcriptome of chicken epidermal embryogenesis, we have identified two distinct modules that are highly correlated with scale or feather development traits. Signaling pathways related to feather development were enriched in the traditional KEGG pathway analysis and functional terms relating specifically to embryonic epidermal development were also enriched in the Gene Ontology analysis. Significant enrichment annotations were discovered from customized enrichment tools such as Modular Single-Set Enrichment Test (MSET) and Medical Subject Headings (MeSH). Hub genes in both trait-correlated modules showed strong specific functional enrichment toward epidermal development. Also, regulatory elements, such as transcription factors and miRNAs, were targeted in the significant enrichment result. This work highlights the advantage of this methodology for functional prediction of genes not previously associated with scale- and feather trait-related modules.

  15. Comparison of co-expression measures: mutual information, correlation, and model based indices.

    PubMed

    Song, Lin; Langfelder, Peter; Horvath, Steve

    2012-12-09

    Co-expression measures are often used to define networks among genes. Mutual information (MI) is often used as a generalized correlation measure. It is not clear how much MI adds beyond standard (robust) correlation measures or regression model based association measures. Further, it is important to assess what transformations of these and other co-expression measures lead to biologically meaningful modules (clusters of genes). We provide a comprehensive comparison between mutual information and several correlation measures in 8 empirical data sets and in simulations. We also study different approaches for transforming an adjacency matrix, e.g. using the topological overlap measure. Overall, we confirm close relationships between MI and correlation in all data sets which reflects the fact that most gene pairs satisfy linear or monotonic relationships. We discuss rare situations when the two measures disagree. We also compare correlation and MI based approaches when it comes to defining co-expression network modules. We show that a robust measure of correlation (the biweight midcorrelation transformed via the topological overlap transformation) leads to modules that are superior to MI based modules and maximal information coefficient (MIC) based modules in terms of gene ontology enrichment. We present a function that relates correlation to mutual information which can be used to approximate the mutual information from the corresponding correlation coefficient. We propose the use of polynomial or spline regression models as an alternative to MI for capturing non-linear relationships between quantitative variables. The biweight midcorrelation outperforms MI in terms of elucidating gene pairwise relationships. Coupled with the topological overlap matrix transformation, it often leads to more significantly enriched co-expression modules. Spline and polynomial networks form attractive alternatives to MI in case of non-linear relationships. Our results indicate that MI networks can safely be replaced by correlation networks when it comes to measuring co-expression relationships in stationary data.

  16. Identification of the transcriptional regulators by expression profiling infected with hepatitis B virus.

    PubMed

    Chai, Xiaoqiang; Han, Yanan; Yang, Jian; Zhao, Xianxian; Liu, Yewang; Hou, Xugang; Tang, Yiheng; Zhao, Shirong; Li, Xiao

    2016-02-01

    The molecular pathogenesis of infection by hepatitis B virus with human is extremely complex and heterogeneous. To date the molecular information is not clearly defined despite intensive research efforts. Thus, studies aimed at transcription and regulation during virus infection or combined researches of those already known to be beneficial are needed. With the purpose of identifying the transcriptional regulators related to infection of hepatitis B virus in gene level, the gene expression profiles from some normal individuals and hepatitis B patients were analyzed in our study. In this work, the differential expressed genes were selected primarily. The several genes among those were validated in an independent set by qRT-PCR. Then the differentially co-expression analysis was conducted to identify differentially co-expressed links and differential co-expressed genes. Next, the analysis of the regulatory impact factors was performed through mapping the links and regulatory data. In order to give a further insight to these regulators, the co-expression gene modules were identified using a threshold-based hierarchical clustering method. Incidentally, the construction of the regulatory network was generated using the computer software. A total of 137,284 differentially co-expressed links and 780 differential co-expressed genes were identified. These co-expressed genes were significantly enriched inflammatory response. The results of regulatory impact factors revealed several crucial regulators related to hepatocellular carcinoma and other high-rank regulators. Meanwhile, more than one hundred co-expression gene modules were identified using clustering method. In our study, some important transcriptional regulators were identified using a computational method, which may enhance the understanding of disease mechanisms and lead to an improved treatment of hepatitis B. However, further experimental studies are required to confirm these findings. Copyright © 2015 Elsevier Masson SAS. All rights reserved.

  17. Cooperation and coexpression: How coexpression networks shift in response to multiple mutualists.

    PubMed

    Palakurty, Sathvik X; Stinchcombe, John R; Afkhami, Michelle E

    2018-04-01

    A mechanistic understanding of community ecology requires tackling the nonadditive effects of multispecies interactions, a challenge that necessitates integration of ecological and molecular complexity-namely moving beyond pairwise ecological interaction studies and the "gene at a time" approach to mechanism. Here, we investigate the consequences of multispecies mutualisms for the structure and function of genomewide differential coexpression networks for the first time, using the tractable and ecologically important interaction between legume Medicago truncatula, rhizobia and mycorrhizal fungi. First, we found that genes whose expression is affected nonadditively by multiple mutualists are more highly connected in gene networks than expected by chance and had 94% greater network centrality than genes showing additive effects, suggesting that nonadditive genes may be key players in the widespread transcriptomic responses to multispecies symbioses. Second, multispecies mutualisms substantially changed coexpression network structure of 18 modules of host plant genes and 22 modules of the fungal symbionts' genes, indicating that third-party mutualists can cause significant rewiring of plant and fungal molecular networks. Third, we found that 60% of the coexpressed gene sets that explained variation in plant performance had coexpression structures that were altered by interactive effects of rhizobia and fungi. Finally, an "across-symbiosis" approach identified sets of plant and mycorrhizal genes whose coexpression structure was unique to the multiple mutualist context and suggested coupled responses across the plant-mycorrhizal interaction to rhizobial mutualists. Taken together, these results show multispecies mutualisms have substantial effects on the molecular interactions in host plants, microbes and across symbiotic boundaries. © 2018 John Wiley & Sons Ltd.

  18. Dissecting nutrient-related co-expression networks in phosphate starved poplars.

    PubMed

    Kavka, Mareike; Polle, Andrea

    2017-01-01

    Phosphorus (P) is an essential plant nutrient, but its availability is often limited in soil. Here, we studied changes in the transcriptome and in nutrient element concentrations in leaves and roots of poplars (Populus × canescens) in response to P deficiency. P starvation resulted in decreased concentrations of S and major cations (K, Mg, Ca), in increased concentrations of N, Zn and Al, while C, Fe and Mn were only little affected. In roots and leaves >4,000 and >9,000 genes were differently expressed upon P starvation. These genes clustered in eleven co-expression modules of which seven were correlated with distinct elements in the plant tissues. One module (4.7% of all differentially expressed genes) was strongly correlated with changes in the P concentration in the plant. In this module the GO term "response to P starvation" was enriched with phosphoenolpyruvate carboxylase kinases, phosphatases and pyrophosphatases as well as regulatory domains such as SPX, but no phosphate transporters. The P-related module was also enriched in genes of the functional category "galactolipid synthesis". Galactolipids substitute phospholipids in membranes under P limitation. Two modules, one correlated with C and N and the other with biomass, S and Mg, were connected with the P-related module by co-expression. In these modules GO terms indicating "DNA modification" and "cell division" as well as "defense" and "RNA modification" and "signaling" were enriched; they contained phosphate transporters. Bark storage proteins were among the most strongly upregulated genes in the growth-related module suggesting that N, which could not be used for growth, accumulated in typical storage compounds. In conclusion, weighted gene coexpression network analysis revealed a hierarchical structure of gene clusters, which separated phosphate starvation responses correlated with P tissue concentrations from other gene modules, which most likely represented transcriptional adjustments related to down-stream nutritional changes and stress.

  19. Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.

    PubMed

    Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J

    2016-11-04

    Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred-B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks - maize1 and maize2 exhibit similar levels of intra subgenome correlations. Intriguingly, the level of inter subgenome co-expression was similar to the level of intra subgenome correlations, and genes from specific subgenomes were not likely to be the enriched in co-expression network modules and the hub genes were not predominantly from any specific subgenomes in maize. Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types, duplication ages and co-expression consequences.

  20. Identifying gene coexpression networks underlying the dynamic regulation of wood-forming tissues in Populus under diverse environmental conditions.

    PubMed

    Zinkgraf, Matthew; Liu, Lijun; Groover, Andrew; Filkov, Vladimir

    2017-06-01

    Trees modify wood formation through integration of environmental and developmental signals in complex but poorly defined transcriptional networks, allowing trees to produce woody tissues appropriate to diverse environmental conditions. In order to identify relationships among genes expressed during wood formation, we integrated data from new and publically available datasets in Populus. These datasets were generated from woody tissue and include transcriptome profiling, transcription factor binding, DNA accessibility and genome-wide association mapping experiments. Coexpression modules were calculated, each of which contains genes showing similar expression patterns across experimental conditions, genotypes and treatments. Conserved gene coexpression modules (four modules totaling 8398 genes) were identified that were highly preserved across diverse environmental conditions and genetic backgrounds. Functional annotations as well as correlations with specific experimental treatments associated individual conserved modules with distinct biological processes underlying wood formation, such as cell-wall biosynthesis, meristem development and epigenetic pathways. Module genes were also enriched for DNase I hypersensitivity footprints and binding from four transcription factors associated with wood formation. The conserved modules are excellent candidates for modeling core developmental pathways common to wood formation in diverse environments and genotypes, and serve as testbeds for hypothesis generation and testing for future studies. No claim to original US government works. New Phytologist © 2017 New Phytologist Trust.

  1. Matrix factorization reveals aging-specific co-expression gene modules in the fat and muscle tissues in nonhuman primates

    NASA Astrophysics Data System (ADS)

    Wang, Yongcui; Zhao, Weiling; Zhou, Xiaobo

    2016-10-01

    Accurate identification of coherent transcriptional modules (subnetworks) in adipose and muscle tissues is important for revealing the related mechanisms and co-regulated pathways involved in the development of aging-related diseases. Here, we proposed a systematically computational approach, called ICEGM, to Identify the Co-Expression Gene Modules through a novel mathematical framework of Higher-Order Generalized Singular Value Decomposition (HO-GSVD). ICEGM was applied on the adipose, and heart and skeletal muscle tissues in old and young female African green vervet monkeys. The genes associated with the development of inflammation, cardiovascular and skeletal disorder diseases, and cancer were revealed by the ICEGM. Meanwhile, genes in the ICEGM modules were also enriched in the adipocytes, smooth muscle cells, cardiac myocytes, and immune cells. Comprehensive disease annotation and canonical pathway analysis indicated that immune cells, adipocytes, cardiomyocytes, and smooth muscle cells played a synergistic role in cardiac and physical functions in the aged monkeys by regulation of the biological processes associated with metabolism, inflammation, and atherosclerosis. In conclusion, the ICEGM provides an efficiently systematic framework for decoding the co-expression gene modules in multiple tissues. Analysis of genes in the ICEGM module yielded important insights on the cooperative role of multiple tissues in the development of diseases.

  2. Resolving stem and progenitor cells in the adult mouse incisor through gene co-expression analysis

    PubMed Central

    Seidel, Kerstin; Marangoni, Pauline; Tang, Cynthia; Houshmand, Bahar; Du, Wen; Maas, Richard L; Murray, Steven; Oldham, Michael C; Klein, Ophir D

    2017-01-01

    Investigations into stem cell-fueled renewal of an organ benefit from an inventory of cell type-specific markers and a deep understanding of the cellular diversity within stem cell niches. Using the adult mouse incisor as a model for a continuously renewing organ, we performed an unbiased analysis of gene co-expression relationships to identify modules of co-expressed genes that represent differentiated cells, transit-amplifying cells, and residents of stem cell niches. Through in vivo lineage tracing, we demonstrated the power of this approach by showing that co-expression module members Lrig1 and Igfbp5 define populations of incisor epithelial and mesenchymal stem cells. We further discovered that two adjacent mesenchymal tissues, the periodontium and dental pulp, are maintained by distinct pools of stem cells. These findings reveal novel mechanisms of incisor renewal and illustrate how gene co-expression analysis of intact biological systems can provide insights into the transcriptional basis of cellular identity. DOI: http://dx.doi.org/10.7554/eLife.24712.001 PMID:28475038

  3. THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer's Disease.

    PubMed

    Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K

    2016-11-30

    There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer's disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer's disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer's disease brains. The biological pathways associated with Alzheimer's disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature.

  4. Enhancing biological relevance of a weighted gene co-expression network for functional module identification.

    PubMed

    Prom-On, Santitham; Chanthaphan, Atthawut; Chan, Jonathan Hoyin; Meechai, Asawin

    2011-02-01

    Relationships among gene expression levels may be associated with the mechanisms of the disease. While identifying a direct association such as a difference in expression levels between case and control groups links genes to disease mechanisms, uncovering an indirect association in the form of a network structure may help reveal the underlying functional module associated with the disease under scrutiny. This paper presents a method to improve the biological relevance in functional module identification from the gene expression microarray data by enhancing the structure of a weighted gene co-expression network using minimum spanning tree. The enhanced network, which is called a backbone network, contains only the essential structural information to represent the gene co-expression network. The entire backbone network is decoupled into a number of coherent sub-networks, and then the functional modules are reconstructed from these sub-networks to ensure minimum redundancy. The method was tested with a simulated gene expression dataset and case-control expression datasets of autism spectrum disorder and colorectal cancer studies. The results indicate that the proposed method can accurately identify clusters in the simulated dataset, and the functional modules of the backbone network are more biologically relevant than those obtained from the original approach.

  5. THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer’s Disease

    PubMed Central

    Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K.

    2016-01-01

    There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer’s disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer’s disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer’s disease brains. The biological pathways associated with Alzheimer’s disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature. PMID:27901073

  6. An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

    PubMed

    Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

    2017-04-12

    Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs) (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.

  7. Frontotemporal dementia: insights into the biological underpinnings of disease through gene co-expression network analysis.

    PubMed

    Ferrari, Raffaele; Forabosco, Paola; Vandrovcova, Jana; Botía, Juan A; Guelfi, Sebastian; Warren, Jason D; Momeni, Parastoo; Weale, Michael E; Ryten, Mina; Hardy, John

    2016-02-24

    In frontotemporal dementia (FTD) there is a critical lack in the understanding of biological and molecular mechanisms involved in disease pathogenesis. The heterogeneous genetic features associated with FTD suggest that multiple disease-mechanisms are likely to contribute to the development of this neurodegenerative condition. We here present a systems biology approach with the scope of i) shedding light on the biological processes potentially implicated in the pathogenesis of FTD and ii) identifying novel potential risk factors for FTD. We performed a gene co-expression network analysis of microarray expression data from 101 individuals without neurodegenerative diseases to explore regional-specific co-expression patterns in the frontal and temporal cortices for 12 genes (MAPT, GRN, CHMP2B, CTSC, HLA-DRA, TMEM106B, C9orf72, VCP, UBQLN2, OPTN, TARDBP and FUS) associated with FTD and we then carried out gene set enrichment and pathway analyses, and investigated known protein-protein interactors (PPIs) of FTD-genes products. Gene co-expression networks revealed that several FTD-genes (such as MAPT and GRN, CTSC and HLA-DRA, TMEM106B, and C9orf72, VCP, UBQLN2 and OPTN) were clustering in modules of relevance in the frontal and temporal cortices. Functional annotation and pathway analyses of such modules indicated enrichment for: i) DNA metabolism, i.e. transcription regulation, DNA protection and chromatin remodelling (MAPT and GRN modules); ii) immune and lysosomal processes (CTSC and HLA-DRA modules), and; iii) protein meta/catabolism (C9orf72, VCP, UBQLN2 and OPTN, and TMEM106B modules). PPI analysis supported the results of the functional annotation and pathway analyses. This work further characterizes known FTD-genes and elaborates on their biological relevance to disease: not only do we indicate likely impacted regional-specific biological processes driven by FTD-genes containing modules, but also do we suggest novel potential risk factors among the FTD-genes interactors as targets for further mechanistic characterization in hypothesis driven cell biology work.

  8. Network Analysis Implicates Alpha-Synuclein (Snca) in the Regulation of Ovariectomy-Induced Bone Loss

    PubMed Central

    Calabrese, Gina; Mesner, Larry D.; Foley, Patricia L.; Rosen, Clifford J.; Farber, Charles R.

    2016-01-01

    The postmenopausal period in women is associated with decreased circulating estrogen levels, which accelerate bone loss and increase the risk of fracture. Here, we gained novel insight into the molecular mechanisms mediating bone loss in ovariectomized (OVX) mice, a model of human menopause, using co-expression network analysis. Specifically, we generated a co-expression network consisting of 53 gene modules using expression profiles from intact and OVX mice from a panel of inbred strains. The expression of four modules was altered by OVX, including module 23 whose expression was decreased by OVX across all strains. Module 23 was enriched for genes involved in the response to oxidative stress, a process known to be involved in OVX-induced bone loss. Additionally, module 23 homologs were co-expressed in human bone marrow. Alpha synuclein (Snca) was one of the most highly connected “hub” genes in module 23. We characterized mice deficient in Snca and observed a 40% reduction in OVX-induced bone loss. Furthermore, protection was associated with the altered expression of specific network modules, including module 23. In summary, the results of this study suggest that Snca regulates bone network homeostasis and ovariectomy-induced bone loss. PMID:27378017

  9. Correlated mRNAs and miRNAs from co-expression and regulatory networks affect porcine muscle and finally meat properties.

    PubMed

    Ponsuksili, Siriluck; Du, Yang; Hadlich, Frieder; Siengdee, Puntita; Murani, Eduard; Schwerin, Manfred; Wimmers, Klaus

    2013-08-05

    Physiological processes aiding the conversion of muscle to meat involve many genes associated with muscle structure and metabolic processes. MicroRNAs regulate networks of genes to orchestrate cellular functions, in turn regulating phenotypes. We applied weighted gene co-expression network analysis to identify co-expression modules that correlated to meat quality phenotypes and were highly enriched for genes involved in glucose metabolism, response to wounding, mitochondrial ribosome, mitochondrion, and extracellular matrix. Negative correlation of miRNA with mRNA and target prediction were used to select transcripts out of the modules of trait-associated mRNAs to further identify those genes that are correlated with post mortem traits. Porcine muscle co-expression transcript networks that correlated to post mortem traits were identified. The integration of miRNA and mRNA expression analyses, as well as network analysis, enabled us to interpret the differentially-regulated genes from a systems perspective. Linking co-expression networks of transcripts and hierarchically organized pairs of miRNAs and mRNAs to meat properties yields new insight into several biological pathways underlying phenotype differences. These pathways may also be diagnostic for many myopathies, which are accompanied by deficient nutrient and oxygen supply of muscle fibers.

  10. Biological mechanism analysis of acute renal allograft rejection: integrated of mRNA and microRNA expression profiles.

    PubMed

    Huang, Shi-Ming; Zhao, Xia; Zhao, Xue-Mei; Wang, Xiao-Ying; Li, Shan-Shan; Zhu, Yu-Hui

    2014-01-01

    Renal transplantation is the preferred method for most patients with end-stage renal disease, however, acute renal allograft rejection is still a major risk factor for recipients leading to renal injury. To improve the early diagnosis and treatment of acute rejection, study on the molecular mechanism of it is urgent. MicroRNA (miRNA) expression profile and mRNA expression profile of acute renal allograft rejection and well-functioning allograft downloaded from ArrayExpress database were applied to identify differentially expressed (DE) miRNAs and DE mRNAs. DE miRNAs targets were predicted by combining five algorithm. By overlapping the DE mRNAs and DE miRNAs targets, common genes were obtained. Differentially co-expressed genes (DCGs) were identified by differential co-expression profile (DCp) and differential co-expression enrichment (DCe) methods in Differentially Co-expressed Genes and Links (DCGL) package. Then, co-expression network of DCGs and the cluster analysis were performed. Functional enrichment analysis for DCGs was undergone. A total of 1270 miRNA targets were predicted and 698 DE mRNAs were obtained. While overlapping miRNA targets and DE mRNAs, 59 common genes were gained. We obtained 103 DCGs and 5 transcription factors (TFs) based on regulatory impact factors (RIF), then built the regulation network of miRNA targets and DE mRNAs. By clustering the co-expression network, 5 modules were obtained. Thereinto, module 1 had the highest degree and module 2 showed the most number of DCGs and common genes. TF CEBPB and several common genes, such as RXRA, BASP1 and AKAP10, were mapped on the co-expression network. C1R showed the highest degree in the network. These genes might be associated with human acute renal allograft rejection. We conducted biological analysis on integration of DE mRNA and DE miRNA in acute renal allograft rejection, displayed gene expression patterns and screened out genes and TFs that may be related to acute renal allograft rejection.

  11. Biological mechanism analysis of acute renal allograft rejection: integrated of mRNA and microRNA expression profiles

    PubMed Central

    Huang, Shi-Ming; Zhao, Xia; Zhao, Xue-Mei; Wang, Xiao-Ying; Li, Shan-Shan; Zhu, Yu-Hui

    2014-01-01

    Objectives: Renal transplantation is the preferred method for most patients with end-stage renal disease, however, acute renal allograft rejection is still a major risk factor for recipients leading to renal injury. To improve the early diagnosis and treatment of acute rejection, study on the molecular mechanism of it is urgent. Methods: MicroRNA (miRNA) expression profile and mRNA expression profile of acute renal allograft rejection and well-functioning allograft downloaded from ArrayExpress database were applied to identify differentially expressed (DE) miRNAs and DE mRNAs. DE miRNAs targets were predicted by combining five algorithm. By overlapping the DE mRNAs and DE miRNAs targets, common genes were obtained. Differentially co-expressed genes (DCGs) were identified by differential co-expression profile (DCp) and differential co-expression enrichment (DCe) methods in Differentially Co-expressed Genes and Links (DCGL) package. Then, co-expression network of DCGs and the cluster analysis were performed. Functional enrichment analysis for DCGs was undergone. Results: A total of 1270 miRNA targets were predicted and 698 DE mRNAs were obtained. While overlapping miRNA targets and DE mRNAs, 59 common genes were gained. We obtained 103 DCGs and 5 transcription factors (TFs) based on regulatory impact factors (RIF), then built the regulation network of miRNA targets and DE mRNAs. By clustering the co-expression network, 5 modules were obtained. Thereinto, module 1 had the highest degree and module 2 showed the most number of DCGs and common genes. TF CEBPB and several common genes, such as RXRA, BASP1 and AKAP10, were mapped on the co-expression network. C1R showed the highest degree in the network. These genes might be associated with human acute renal allograft rejection. Conclusions: We conducted biological analysis on integration of DE mRNA and DE miRNA in acute renal allograft rejection, displayed gene expression patterns and screened out genes and TFs that may be related to acute renal allograft rejection. PMID:25664019

  12. VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

    PubMed

    Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

    2013-12-16

    Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis) whereby the recovered sub-networks reconfirm established plant gene functions and also identify novel associations. Together, we present valuable insights into grapevine transcriptional regulation by developing network models applicable to researchers in their prioritisation of gene candidates, for on-going study of biological processes related to grapevine development, metabolism and stress responses.

  13. Exploring the Transcriptome of Ciliated Cells Using In Silico Dissection of Human Tissues

    PubMed Central

    Ivliev, Alexander E.; 't Hoen, Peter A. C.; van Roon-Mom, Willeke M. C.; Peters, Dorien J. M.; Sergeeva, Marina G.

    2012-01-01

    Cilia are cell organelles that play important roles in cell motility, sensory and developmental functions and are involved in a range of human diseases, known as ciliopathies. Here, we search for novel human genes related to cilia using a strategy that exploits the previously reported tendency of cell type-specific genes to be coexpressed in the transcriptome of complex tissues. Gene coexpression networks were constructed using the noise-resistant WGCNA algorithm in 12 publicly available microarray datasets from human tissues rich in motile cilia: airways, fallopian tubes and brain. A cilia-related coexpression module was detected in 10 out of the 12 datasets. A consensus analysis of this module's gene composition recapitulated 297 known and predicted 74 novel cilia-related genes. 82% of the novel candidates were supported by tissue-specificity expression data from GEO and/or proteomic data from the Human Protein Atlas. The novel findings included a set of genes (DCDC2, DYX1C1, KIAA0319) related to a neurological disease dyslexia suggesting their potential involvement in ciliary functions. Furthermore, we searched for differences in gene composition of the ciliary module between the tissues. A multidrug-and-toxin extrusion transporter MATE2 (SLC47A2) was found as a brain-specific central gene in the ciliary module. We confirm the localization of MATE2 in cilia by immunofluorescence staining using MDCK cells as a model. While MATE2 has previously gained attention as a pharmacologically relevant transporter, its potential relation to cilia is suggested for the first time. Taken together, our large-scale analysis of gene coexpression networks identifies novel genes related to human cell cilia. PMID:22558177

  14. Co-expression analysis and identification of fecundity-related long non-coding RNAs in sheep ovaries

    PubMed Central

    Miao, Xiangyang; Luo, Qingmiao; Zhao, Huijing; Qin, Xiaoyu

    2016-01-01

    Small Tail Han sheep, including the FecBBFecBB (Han BB) and FecB+ FecB+ (Han++) genotypes, and Dorset sheep exhibit different fecundities. To identify novel long non-coding RNAs (lncRNAs) associated with sheep fecundity to better understand their molecular mechanisms, a genome-wide analysis of mRNAs and lncRNAs from Han BB, Han++ and Dorset sheep was performed. After the identification of differentially expressed mRNAs and lncRNAs, 16 significant modules were explored by using weighted gene coexpression network analysis (WGCNA) followed by functional enrichment analysis of the genes and lncRNAs in significant modules. Among these selected modules, the yellow and brown modules were significantly related to sheep fecundity. lncRNAs (e.g., NR0B1, XLOC_041882, and MYH15) in the yellow module were mainly involved in the TGF-β signalling pathway, and NYAP1 and BCORL1 were significantly associated with the oxytocin signalling pathway, which regulates several genes in the coexpression network of the brown module. Overall, we identified several gene modules associated with sheep fecundity, as well as networks consisting of hub genes and lncRNAs that may contribute to sheep prolificacy by regulating the target mRNAs related to the TGF-β and oxytocin signalling pathways. This study provides an alternative strategy for the identification of potential candidate regulatory lncRNAs. PMID:27982099

  15. Co-expression analysis and identification of fecundity-related long non-coding RNAs in sheep ovaries.

    PubMed

    Miao, Xiangyang; Luo, Qingmiao; Zhao, Huijing; Qin, Xiaoyu

    2016-12-16

    Small Tail Han sheep, including the FecB B FecB B (Han BB) and FecB + FecB + (Han++) genotypes, and Dorset sheep exhibit different fecundities. To identify novel long non-coding RNAs (lncRNAs) associated with sheep fecundity to better understand their molecular mechanisms, a genome-wide analysis of mRNAs and lncRNAs from Han BB, Han++ and Dorset sheep was performed. After the identification of differentially expressed mRNAs and lncRNAs, 16 significant modules were explored by using weighted gene coexpression network analysis (WGCNA) followed by functional enrichment analysis of the genes and lncRNAs in significant modules. Among these selected modules, the yellow and brown modules were significantly related to sheep fecundity. lncRNAs (e.g., NR0B1, XLOC_041882, and MYH15) in the yellow module were mainly involved in the TGF-β signalling pathway, and NYAP1 and BCORL1 were significantly associated with the oxytocin signalling pathway, which regulates several genes in the coexpression network of the brown module. Overall, we identified several gene modules associated with sheep fecundity, as well as networks consisting of hub genes and lncRNAs that may contribute to sheep prolificacy by regulating the target mRNAs related to the TGF-β and oxytocin signalling pathways. This study provides an alternative strategy for the identification of potential candidate regulatory lncRNAs.

  16. Hierarchical cortical transcriptome disorganization in autism.

    PubMed

    Lombardo, Michael V; Courchesne, Eric; Lewis, Nathan E; Pramparo, Tiziano

    2017-01-01

    Autism spectrum disorders (ASD) are etiologically heterogeneous and complex. Functional genomics work has begun to identify a diverse array of dysregulated transcriptomic programs (e.g., synaptic, immune, cell cycle, DNA damage, WNT signaling, cortical patterning and differentiation) potentially involved in ASD brain abnormalities during childhood and adulthood. However, it remains unclear whether such diverse dysregulated pathways are independent of each other or instead reflect coordinated hierarchical systems-level pathology. Two ASD cortical transcriptome datasets were re-analyzed using consensus weighted gene co-expression network analysis (WGCNA) to identify common co-expression modules across datasets. Linear mixed-effect models and Bayesian replication statistics were used to identify replicable differentially expressed modules. Eigengene network analysis was then utilized to identify between-group differences in how co-expression modules interact and cluster into hierarchical meta-modular organization. Protein-protein interaction analyses were also used to determine whether dysregulated co-expression modules show enhanced interactions. We find replicable evidence for 10 gene co-expression modules that are differentially expressed in ASD cortex. Rather than being independent non-interacting sources of pathology, these dysregulated co-expression modules work in synergy and physically interact at the protein level. These systems-level transcriptional signals are characterized by downregulation of synaptic processes coordinated with upregulation of immune/inflammation, response to other organism, catabolism, viral processes, translation, protein targeting and localization, cell proliferation, and vasculature development. Hierarchical organization of meta-modules (clusters of highly correlated modules) is also highly affected in ASD. These findings highlight that dysregulation of the ASD cortical transcriptome is characterized by the dysregulation of multiple coordinated transcriptional programs producing synergistic systems-level effects that cannot be fully appreciated by studying the individual component biological processes in isolation.

  17. Co-expression network analysis identified six hub genes in association with metastasis risk and prognosis in hepatocellular carcinoma

    PubMed Central

    Feng, Juerong; Zhou, Rui; Chang, Ying; Liu, Jing; Zhao, Qiu

    2017-01-01

    Hepatocellular carcinoma (HCC) has a high incidence and mortality worldwide, and its carcinogenesis and progression are influenced by a complex network of gene interactions. A weighted gene co-expression network was constructed to identify gene modules associated with the clinical traits in HCC (n = 214). Among the 13 modules, high correlation was only found between the red module and metastasis risk (classified by the HCC metastasis gene signature) (R2 = −0.74). Moreover, in the red module, 34 network hub genes for metastasis risk were identified, six of which (ABAT, AGXT, ALDH6A1, CYP4A11, DAO and EHHADH) were also hub nodes in the protein-protein interaction network of the module genes. Thus, a total of six hub genes were identified. In validation, all hub genes showed a negative correlation with the four-stage HCC progression (P for trend < 0.05) in the test set. Furthermore, in the training set, HCC samples with any hub gene lowly expressed demonstrated a higher recurrence rate and poorer survival rate (hazard ratios with 95% confidence intervals > 1). RNA-sequencing data of 142 HCC samples showed consistent results in the prognosis. Gene set enrichment analysis (GSEA) demonstrated that in the samples with any hub gene highly expressed, a total of 24 functional gene sets were enriched, most of which focused on amino acid metabolism and oxidation. In conclusion, co-expression network analysis identified six hub genes in association with HCC metastasis risk and prognosis, which might improve the prognosis by influencing amino acid metabolism and oxidation. PMID:28430663

  18. Preservation affinity in consensus modules among stages of HIV-1 progression.

    PubMed

    Mosaddek Hossain, Sk Md; Ray, Sumanta; Mukhopadhyay, Anirban

    2017-03-20

    Analysis of gene expression data provides valuable insights into disease mechanism. Investigating relationship among co-expression modules of different stages is a meaningful tool to understand the way in which a disease progresses. Identifying topological preservation of modular structure also contributes to that understanding. HIV-1 disease provides a well-documented progression pattern through three stages of infection: acute, chronic and non-progressor. In this article, we have developed a novel framework to describe the relationship among the consensus (or shared) co-expression modules for each pair of HIV-1 infection stages. The consensus modules are identified to assess the preservation of network properties. We have investigated the preservation patterns of co-expression networks during HIV-1 disease progression through an eigengene-based approach. We discovered that the expression patterns of consensus modules have a strong preservation during the transitions of three infection stages. In particular, it is noticed that between acute and non-progressor stages the preservation is slightly more than the other pair of stages. Moreover, we have constructed eigengene networks for the identified consensus modules and observed the preservation structure among them. Some consensus modules are marked as preserved in two pairs of stages and are analyzed further to form a higher order meta-network consisting of a group of preserved modules. Additionally, we observed that module membership (MM) values of genes within a module are consistent with the preservation characteristics. The MM values of genes within a pair of preserved modules show strong correlation patterns across two infection stages. We have performed an extensive analysis to discover preservation pattern of co-expression network constructed from microarray gene expression data of three different HIV-1 progression stages. The preservation pattern is investigated through identification of consensus modules in each pair of infection stages. It is observed that the preservation of the expression pattern of consensus modules remains more prominent during the transition of infection from acute stage to non-progressor stage. Additionally, we observed that the module membership values of genes are coherent with preserved modules across the HIV-1 progression stages.

  19. ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis.

    PubMed

    Mallik, Saurav; Zhao, Zhongming

    2017-12-28

    For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages to find causal effect relationships between the transcripts. In this work, we introduce two new rule-based similarity measures-weighted rank-based Jaccard and Cosine measures-and then propose a novel computational framework to detect condensed gene co-expression modules ( C o n G E M s) through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm-RANWAR-was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.

  20. CLIC, a tool for expanding biological pathways based on co-expression across thousands of datasets

    PubMed Central

    Li, Yang; Liu, Jun S.; Mootha, Vamsi K.

    2017-01-01

    In recent years, there has been a huge rise in the number of publicly available transcriptional profiling datasets. These massive compendia comprise billions of measurements and provide a special opportunity to predict the function of unstudied genes based on co-expression to well-studied pathways. Such analyses can be very challenging, however, since biological pathways are modular and may exhibit co-expression only in specific contexts. To overcome these challenges we introduce CLIC, CLustering by Inferred Co-expression. CLIC accepts as input a pathway consisting of two or more genes. It then uses a Bayesian partition model to simultaneously partition the input gene set into coherent co-expressed modules (CEMs), while assigning the posterior probability for each dataset in support of each CEM. CLIC then expands each CEM by scanning the transcriptome for additional co-expressed genes, quantified by an integrated log-likelihood ratio (LLR) score weighted for each dataset. As a byproduct, CLIC automatically learns the conditions (datasets) within which a CEM is operative. We implemented CLIC using a compendium of 1774 mouse microarray datasets (28628 microarrays) or 1887 human microarray datasets (45158 microarrays). CLIC analysis reveals that of 910 canonical biological pathways, 30% consist of strongly co-expressed gene modules for which new members are predicted. For example, CLIC predicts a functional connection between protein C7orf55 (FMC1) and the mitochondrial ATP synthase complex that we have experimentally validated. CLIC is freely available at www.gene-clic.org. We anticipate that CLIC will be valuable both for revealing new components of biological pathways as well as the conditions in which they are active. PMID:28719601

  1. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment

    PubMed Central

    Uddin, Raihan; Singh, Shiva M.

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in “learning and memory” related functions and pathways. Subsequent differential network analysis of this “learning and memory” module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning. PMID:29066959

  2. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

    PubMed

    Uddin, Raihan; Singh, Shiva M

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning.

  3. Unsupervised, statistically-based systems biology approach for unraveling the genetics of complex traits: A demonstration with ethanol metabolism.

    PubMed

    Lusk, Ryan; Saba, Laura M; Vanderlinden, Lauren A; Zidek, Vaclav; Silhavy, Jan; Pravenec, Michal; Hoffman, Paula L; Tabakoff, Boris

    2018-04-24

    A statistical pipeline was developed and used for determining candidate genes and candidate gene co-expression networks involved in two alcohol (i.e., ethanol) metabolism phenotypes, namely alcohol clearance and acetate area under the curve (AUC) in a recombinant inbred (HXB/BXH) rat panel. The approach was also used to provide an indication of how ethanol metabolism can impact the normal function of the identified networks. RNA was extracted from alcohol-naïve liver tissue of 30 strains of HXB/BXH recombinant inbred rats. The reconstructed transcripts were quantitated and data was used to construct gene co-expression modules and networks. A separate group of rats, comprising the same 30 strains, were injected with ethanol (2 gm/kg) for measurement of blood ethanol and acetate levels. These data were used for QTL analysis of the rate of ethanol disappearance and circulating acetate levels. The analysis pipeline required calculation of the module eigengene values, the correction of these values with ethanol metabolism rates and acetate levels across the rat strains and the determination of the eigengene QTLs. For a module to be considered a candidate for determining phenotype, the module eigengene values had to have significant correlation with the strain phenotypic values and the module eigengene QTLs had to overlap the phenotypic QTLs. Of the 658 transcript co-expression modules generated from liver RNA sequencing data, a single module satisfied all criteria for being a candidate for determining the alcohol clearance trait. This module contained two alcohol dehydrogenase genes, including the gene whose product was previously shown to be responsible for the majority of alcohol elimination in the rat. This module was also the only module identified as a candidate for influencing circulating acetate levels. This module was also linked to the process of generation and utilization of retinoic acid as related to the autonomous immune response. We propose that our analytical pipeline can successfully identify genetic regions and transcripts which predispose a particular phenotype and our analysis provides functional context for co-expression module components. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  4. Functional organization of the transcriptome in human brain

    PubMed Central

    Oldham, Michael C; Konopka, Genevieve; Iwamoto, Kazuya; Langfelder, Peter; Kato, Tadafumi; Horvath, Steve; Geschwind, Daniel H

    2009-01-01

    The enormous complexity of the human brain ultimately derives from a finite set of molecular instructions encoded in the human genome. These instructions can be directly studied by exploring the organization of the brain’s transcriptome through systematic analysis of gene coexpression relationships. We analyzed gene coexpression relationships in microarray data generated from specific human brain regions and identified modules of coexpressed genes that correspond to neurons, oligodendrocytes, astrocytes and microglia. These modules provide an initial description of the transcriptional programs that distinguish the major cell classes of the human brain and indicate that cell type–specific information can be obtained from whole brain tissue without isolating homogeneous populations of cells. Other modules corresponded to additional cell types, organelles, synaptic function, gender differences and the subventricular neurogenic niche. We found that subventricular zone astrocytes, which are thought to function as neural stem cells in adults, have a distinct gene expression pattern relative to protoplasmic astrocytes. Our findings provide a new foundation for neurogenetic inquiries by revealing a robust and previously unrecognized organization to the human brain transcriptome. PMID:18849986

  5. Gene co-expression analysis identifies gene clusters associated with isotropic and polarized growth in Aspergillus fumigatus conidia.

    PubMed

    Baltussen, Tim J H; Coolen, Jordy P M; Zoll, Jan; Verweij, Paul E; Melchers, Willem J G

    2018-04-26

    Aspergillus fumigatus is a saprophytic fungus that extensively produces conidia. These microscopic asexually reproductive structures are small enough to reach the lungs. Germination of conidia followed by hyphal growth inside human lungs is a key step in the establishment of infection in immunocompromised patients. RNA-Seq was used to analyze the transcriptome of dormant and germinating A. fumigatus conidia. Construction of a gene co-expression network revealed four gene clusters (modules) correlated with a growth phase (dormant, isotropic growth, polarized growth). Transcripts levels of genes encoding for secondary metabolites were high in dormant conidia. During isotropic growth, transcript levels of genes involved in cell wall modifications increased. Two modules encoding for growth and cell cycle/DNA processing were associated with polarized growth. In addition, the co-expression network was used to identify highly connected intermodular hub genes. These genes may have a pivotal role in the respective module and could therefore be compelling therapeutic targets. Generally, cell wall remodeling is an important process during isotropic and polarized growth, characterized by an increase of transcripts coding for hyphal growth and cell cycle/DNA processing when polarized growth is initiated. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  6. Module discovery by exhaustive search for densely connected, co-expressed regions in biomolecular interaction networks.

    PubMed

    Colak, Recep; Moser, Flavia; Chu, Jeffrey Shih-Chieh; Schönhuth, Alexander; Chen, Nansheng; Ester, Martin

    2010-10-25

    Computational prediction of functionally related groups of genes (functional modules) from large-scale data is an important issue in computational biology. Gene expression experiments and interaction networks are well studied large-scale data sources, available for many not yet exhaustively annotated organisms. It has been well established, when analyzing these two data sources jointly, modules are often reflected by highly interconnected (dense) regions in the interaction networks whose participating genes are co-expressed. However, the tractability of the problem had remained unclear and methods by which to exhaustively search for such constellations had not been presented. We provide an algorithmic framework, referred to as Densely Connected Biclustering (DECOB), by which the aforementioned search problem becomes tractable. To benchmark the predictive power inherent to the approach, we computed all co-expressed, dense regions in physical protein and genetic interaction networks from human and yeast. An automatized filtering procedure reduces our output which results in smaller collections of modules, comparable to state-of-the-art approaches. Our results performed favorably in a fair benchmarking competition which adheres to standard criteria. We demonstrate the usefulness of an exhaustive module search, by using the unreduced output to more quickly perform GO term related function prediction tasks. We point out the advantages of our exhaustive output by predicting functional relationships using two examples. We demonstrate that the computation of all densely connected and co-expressed regions in interaction networks is an approach to module discovery of considerable value. Beyond confirming the well settled hypothesis that such co-expressed, densely connected interaction network regions reflect functional modules, we open up novel computational ways to comprehensively analyze the modular organization of an organism based on prevalent and largely available large-scale datasets. Software and data sets are available at http://www.sfu.ca/~ester/software/DECOB.zip.

  7. Pan- and core- network analysis of co-expression genes in a model plant

    DOE PAGES

    He, Fei; Maslov, Sergei

    2016-12-16

    Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less

  8. Pan- and core- network analysis of co-expression genes in a model plant

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    He, Fei; Maslov, Sergei

    Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less

  9. Identifying biomarkers of papillary renal cell carcinoma associated with pathological stage by weighted gene co-expression network analysis.

    PubMed

    He, Zhongshi; Sun, Min; Ke, Yuan; Lin, Rongjie; Xiao, Youde; Zhou, Shuliang; Zhao, Hong; Wang, Yan; Zhou, Fuxiang; Zhou, Yunfeng

    2017-04-25

    Although papillary renal cell carcinoma (PRCC) accounts for 10%-15% of renal cell carcinoma (RCC), no predictive molecular biomarker is currently applicable to guiding disease stage of PRCC patients. The mRNASeq data of PRCC and adjacent normal tissue in The Cancer Genome Atlas was analyzed to identify 1148 differentially expressed genes, on which weighted gene co-expression network analysis was performed. Then 11 co-expressed gene modules were identified. The highest association was found between blue module and pathological stage (r = 0.45) by Pearson's correlation analysis. Functional enrichment analysis revealed that biological processes of blue module focused on nuclear division, cell cycle phase, and spindle (all P < 1e-10). All 40 hub genes in blue module can distinguish localized (pathological stage I, II) from non-localized (pathological stage III, IV) PRCC (P < 0.01). A good molecular biomarker for pathological stage of RCC must be a prognostic gene in clinical practice. Survival analysis was performed to reversely validate if hub genes were associated with pathological stage. Survival analysis unveiled that all hub genes were associated with patient prognosis (P < 0.01).The validation cohort GSE2748 verified that 30 hub genes can differentiate localized from non-localized PRCC (P < 0.01), and 18 hub genes are prognosis-associated (P < 0.01).ROC curve indicated that the 17 hub genes exhibited excellent diagnostic efficiency for localized and non-localized PRCC (AUC > 0.7). These hub genes may serve as a biomarker and help to distinguish different pathological stages for PRCC patients.

  10. 8D.07: GENE EXPRESSION ANALYSIS AND BIOINFORMATICS REVEALED POTENTIAL TRANSCRIPTION FACTORS ASSOCIATED WITH RENIN-ANGIOTENSIN-ALDOSTERONE SYSTEM IN ATHEROMA.

    PubMed

    Nehme, A; Zibara, K; Cerutti, C; Bricca, G

    2015-06-01

    The implication of the renin-angiotensin-aldosterone system (RAAS) in atheroma development is well described. However, a complete view of the local RAAS in atheroma is still missing. In this study we aimed to reveal the organization of RAAS in atheroma at the transcriptomic level and identify the transcriptional regulators behind it. Extended RAAS (extRAAS) was defined as the set of 37 genes coding for classical and novel RAAS participants (Figure 1). Five microarray datasets containing overall 590 samples representing carotid and peripheral atheroma were downloaded from the GEO database. Correlation-based hierarchical clustering (R software) of extRAAS genes within each dataset allowed the identification of modules of co-expressed genes. Reproducible co-expression modules across datasets were then extracted. Transcription factors (TFs) having common binding sites (TFBSs) in the promoters of coordinated genes were identified using the Genomatix database tools and analyzed for their correlation with extRAAS genes in the microarray datasets. Expression data revealed the expressed extRAAS components and their relative abundance displaying the favored pathways in atheroma. Three co-expression modules with more than 80% reproducibility across datasets were extracted. Two of them (M1 and M2) contained genes coding for angiotensin metabolizing enzymes involved in different pathways: M1 included ACE, MME, RNPEP, and DPP3, in addition to 7 other genes; and M2 included CMA1, CTSG, and CPA3. The third module (M3) contained genes coding for receptors known to be implicated in atheroma (AGTR1, MR, GR, LNPEP, EGFR and GPER). M1 and M3 were negatively correlated in 3 of 5 datasets. We identified 19 TFs that have enriched TFBSs in the promoters of genes of M1, and two for M3, but none was found for M2. Among the extracted TFs, ELF1, MAX, and IRF5 showed significant positive correlations with peptidase-coding genes from M1 and negative correlations with receptors-coding genes from M3 (p < 0.05). The identified co-expression modules display the transcriptional organization of local extRAAS in human carotid atheroma. The identification of several TFs potentially associated to extRAAS genes may provide a frame for the discovery of atheroma-specific modulators of extRAAS activity.(Figure is included in full-text article.).

  11. Multiscale Embedded Gene Co-expression Network Analysis

    PubMed Central

    Song, Won-Min; Zhang, Bin

    2015-01-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma. PMID:26618778

  12. Multiscale Embedded Gene Co-expression Network Analysis.

    PubMed

    Song, Won-Min; Zhang, Bin

    2015-11-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  13. CoNekT: an open-source framework for comparative genomic and transcriptomic network analyses.

    PubMed

    Proost, Sebastian; Mutwil, Marek

    2018-05-01

    The recent accumulation of gene expression data in the form of RNA sequencing creates unprecedented opportunities to study gene regulation and function. Furthermore, comparative analysis of the expression data from multiple species can elucidate which functional gene modules are conserved across species, allowing the study of the evolution of these modules. However, performing such comparative analyses on raw data is not feasible for many biologists. Here, we present CoNekT (Co-expression Network Toolkit), an open source web server, that contains user-friendly tools and interactive visualizations for comparative analyses of gene expression data and co-expression networks. These tools allow analysis and cross-species comparison of (i) gene expression profiles; (ii) co-expression networks; (iii) co-expressed clusters involved in specific biological processes; (iv) tissue-specific gene expression; and (v) expression profiles of gene families. To demonstrate these features, we constructed CoNekT-Plants for green alga, seed plants and flowering plants (Picea abies, Chlamydomonas reinhardtii, Vitis vinifera, Arabidopsis thaliana, Oryza sativa, Zea mays and Solanum lycopersicum) and thus provide a web-tool with the broadest available collection of plant phyla. CoNekT-Plants is freely available from http://conekt.plant.tools, while the CoNekT source code and documentation can be found at https://github.molgen.mpg.de/proost/CoNekT/.

  14. A systems-genetics approach and data mining tool to assist in the discovery of genes underlying complex traits in Oryza sativa.

    PubMed

    Ficklin, Stephen P; Feltus, Frank Alex

    2013-01-01

    Many traits of biological and agronomic significance in plants are controlled in a complex manner where multiple genes and environmental signals affect the expression of the phenotype. In Oryza sativa (rice), thousands of quantitative genetic signals have been mapped to the rice genome. In parallel, thousands of gene expression profiles have been generated across many experimental conditions. Through the discovery of networks with real gene co-expression relationships, it is possible to identify co-localized genetic and gene expression signals that implicate complex genotype-phenotype relationships. In this work, we used a knowledge-independent, systems genetics approach, to discover a high-quality set of co-expression networks, termed Gene Interaction Layers (GILs). Twenty-two GILs were constructed from 1,306 Affymetrix microarray rice expression profiles that were pre-clustered to allow for improved capture of gene co-expression relationships. Functional genomic and genetic data, including over 8,000 QTLs and 766 phenotype-tagged SNPs (p-value < = 0.001) from genome-wide association studies, both covering over 230 different rice traits were integrated with the GILs. An online systems genetics data-mining resource, the GeneNet Engine, was constructed to enable dynamic discovery of gene sets (i.e. network modules) that overlap with genetic traits. GeneNet Engine does not provide the exact set of genes underlying a given complex trait, but through the evidence of gene-marker correspondence, co-expression, and functional enrichment, site visitors can identify genes with potential shared causality for a trait which could then be used for experimental validation. A set of 2 million SNPs was incorporated into the database and serve as a potential set of testable biomarkers for genes in modules that overlap with genetic traits. Herein, we describe two modules found using GeneNet Engine, one with significant overlap with the trait amylose content and another with significant overlap with blast disease resistance.

  15. A Systems-Genetics Approach and Data Mining Tool to Assist in the Discovery of Genes Underlying Complex Traits in Oryza sativa

    PubMed Central

    Ficklin, Stephen P.; Feltus, Frank Alex

    2013-01-01

    Many traits of biological and agronomic significance in plants are controlled in a complex manner where multiple genes and environmental signals affect the expression of the phenotype. In Oryza sativa (rice), thousands of quantitative genetic signals have been mapped to the rice genome. In parallel, thousands of gene expression profiles have been generated across many experimental conditions. Through the discovery of networks with real gene co-expression relationships, it is possible to identify co-localized genetic and gene expression signals that implicate complex genotype-phenotype relationships. In this work, we used a knowledge-independent, systems genetics approach, to discover a high-quality set of co-expression networks, termed Gene Interaction Layers (GILs). Twenty-two GILs were constructed from 1,306 Affymetrix microarray rice expression profiles that were pre-clustered to allow for improved capture of gene co-expression relationships. Functional genomic and genetic data, including over 8,000 QTLs and 766 phenotype-tagged SNPs (p-value < = 0.001) from genome-wide association studies, both covering over 230 different rice traits were integrated with the GILs. An online systems genetics data-mining resource, the GeneNet Engine, was constructed to enable dynamic discovery of gene sets (i.e. network modules) that overlap with genetic traits. GeneNet Engine does not provide the exact set of genes underlying a given complex trait, but through the evidence of gene-marker correspondence, co-expression, and functional enrichment, site visitors can identify genes with potential shared causality for a trait which could then be used for experimental validation. A set of 2 million SNPs was incorporated into the database and serve as a potential set of testable biomarkers for genes in modules that overlap with genetic traits. Herein, we describe two modules found using GeneNet Engine, one with significant overlap with the trait amylose content and another with significant overlap with blast disease resistance. PMID:23874666

  16. Transcription Factor Binding Site Enrichment Analysis in Co-Expression Modules in Celiac Disease

    PubMed Central

    Romero-Garmendia, Irati; Jauregi-Miguel, Amaia; Plaza-Izurieta, Leticia; Cros, Marie-Pierre; Legarda, Maria; Irastorza, Iñaki; Herceg, Zdenko; Fernandez-Jimenez, Nora

    2018-01-01

    The aim of this study was to construct celiac co-expression patterns at a whole genome level and to identify transcription factors (TFs) that could drive the gliadin-related changes in coordination of gene expression observed in celiac disease (CD). Differential co-expression modules were identified in the acute and chronic responses to gliadin using expression data from a previous microarray study in duodenal biopsies. Transcription factor binding site (TFBS) and Gene Ontology (GO) annotation enrichment analyses were performed in differentially co-expressed genes (DCGs) and selection of candidate regulators was performed. Expression of candidates was measured in clinical samples and the activation of the TFs was further characterized in C2BBe1 cells upon gliadin challenge. Enrichment analyses of the DCGs identified 10 TFs and five were selected for further investigation. Expression changes related to active CD were detected in four TFs, as well as in several of their in silico predicted targets. The activation of TFs was further characterized in C2BBe1 cells upon gliadin challenge, and an increase in nuclear translocation of CAMP Responsive Element Binding Protein 1 (CREB1) and IFN regulatory factor-1 (IRF1) in response to gliadin was observed. Using transcriptome-wide co-expression analyses we are able to propose novel genes involved in CD pathogenesis that respond upon gliadin stimulation, also in non-celiac models. PMID:29748492

  17. Transcription Factor Binding Site Enrichment Analysis in Co-Expression Modules in Celiac Disease.

    PubMed

    Romero-Garmendia, Irati; Garcia-Etxebarria, Koldo; Hernandez-Vargas, Hector; Santin, Izortze; Jauregi-Miguel, Amaia; Plaza-Izurieta, Leticia; Cros, Marie-Pierre; Legarda, Maria; Irastorza, Iñaki; Herceg, Zdenko; Fernandez-Jimenez, Nora; Bilbao, Jose Ramon

    2018-05-10

    The aim of this study was to construct celiac co-expression patterns at a whole genome level and to identify transcription factors (TFs) that could drive the gliadin-related changes in coordination of gene expression observed in celiac disease (CD). Differential co-expression modules were identified in the acute and chronic responses to gliadin using expression data from a previous microarray study in duodenal biopsies. Transcription factor binding site (TFBS) and Gene Ontology (GO) annotation enrichment analyses were performed in differentially co-expressed genes (DCGs) and selection of candidate regulators was performed. Expression of candidates was measured in clinical samples and the activation of the TFs was further characterized in C2BBe1 cells upon gliadin challenge. Enrichment analyses of the DCGs identified 10 TFs and five were selected for further investigation. Expression changes related to active CD were detected in four TFs, as well as in several of their in silico predicted targets. The activation of TFs was further characterized in C2BBe1 cells upon gliadin challenge, and an increase in nuclear translocation of CAMP Responsive Element Binding Protein 1 (CREB1) and IFN regulatory factor-1 (IRF1) in response to gliadin was observed. Using transcriptome-wide co-expression analyses we are able to propose novel genes involved in CD pathogenesis that respond upon gliadin stimulation, also in non-celiac models.

  18. FamNet: A Framework to Identify Multiplied Modules Driving Pathway Expansion in Plants1

    PubMed Central

    Tohge, Takayuki; Klie, Sebastian; Fernie, Alisdair R.

    2016-01-01

    Gene duplications generate new genes that can acquire similar but often diversified functions. Recent studies of gene coexpression networks have indicated that, not only genes, but also pathways can be multiplied and diversified to perform related functions in different parts of an organism. Identification of such diversified pathways, or modules, is needed to expand our knowledge of biological processes in plants and to understand how biological functions evolve. However, systematic explorations of modules remain scarce, and no user-friendly platform to identify them exists. We have established a statistical framework to identify modules and show that approximately one-third of the genes of a plant’s genome participate in hundreds of multiplied modules. Using this framework as a basis, we implemented a platform that can explore and visualize multiplied modules in coexpression networks of eight plant species. To validate the usefulness of the platform, we identified and functionally characterized pollen- and root-specific cell wall modules that multiplied to confer tip growth in pollen tubes and root hairs, respectively. Furthermore, we identified multiplied modules involved in secondary metabolite synthesis and corroborated them by metabolite profiling of tobacco (Nicotiana tabacum) tissues. The interactive platform, referred to as FamNet, is available at http://www.gene2function.de/famnet.html. PMID:26754669

  19. Weighted gene co-expression network analysis of gene modules for the prognosis of esophageal cancer.

    PubMed

    Zhang, Cong; Sun, Qian

    2017-06-01

    Esophageal cancer is a common malignant tumor, whose pathogenesis and prognosis factors are not fully understood. This study aimed to discover the gene clusters that have similar functions and can be used to predict the prognosis of esophageal cancer. The matched microarray and RNA sequencing data of 185 patients with esophageal cancer were downloaded from The Cancer Genome Atlas (TCGA), and gene co-expression networks were built without distinguishing between squamous carcinoma and adenocarcinoma. The result showed that 12 modules were associated with one or more survival data such as recurrence status, recurrence time, vital status or vital time. Furthermore, survival analysis showed that 5 out of the 12 modules were related to progression-free survival (PFS) or overall survival (OS). As the most important module, the midnight blue module with 82 genes was related to PFS, apart from the patient age, tumor grade, primary treatment success, and duration of smoking and tumor histological type. Gene ontology enrichment analysis revealed that "glycoprotein binding" was the top enriched function of midnight blue module genes. Additionally, the blue module was the exclusive gene clusters related to OS. Platelet activating factor receptor (PTAFR) and feline Gardner-Rasheed (FGR) were the top hub genes in both modeling datasets and the STRING protein interaction database. In conclusion, our study provides novel insights into the prognosis-associated genes and screens out candidate biomarkers for esophageal cancer.

  20. Expression atlas and comparative coexpression network analyses reveal important genes involved in the formation of lignified cell wall in Brachypodium distachyon.

    PubMed

    Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek

    2017-08-01

    While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database ( www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  1. WGCNA: an R package for weighted correlation network analysis.

    PubMed

    Langfelder, Peter; Horvath, Steve

    2008-12-29

    Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e.g. cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial. The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings. The WGCNA package provides R functions for weighted correlation network analysis, e.g. co-expression network analysis of gene expression data. The R package along with its source code and additional material are freely available at http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/Rpackages/WGCNA.

  2. Identification and functional analysis of a core gene module associated with hepatitis C virus-induced human hepatocellular carcinoma progression.

    PubMed

    Bai, Gaobo; Zheng, Wenling; Ma, Wenli

    2018-05-01

    Hepatitis C virus (HCV)-induced human hepatocellular carcinoma (HCC) progression may be due to a complex multi-step processes. The developmental mechanism of these processes is worth investigating for the prevention, diagnosis and therapy of HCC. The aim of the present study was to investigate the molecular mechanism underlying the progression of HCV-induced hepatocarcinogenesis. First, the dynamic gene module, consisting of key genes associated with progression between the normal stage and HCC, was identified using the Weighted Gene Co-expression Network Analysis tool from R language. By defining those genes in the module as seeds, the change of co-expression in differentially expressed gene sets in two consecutive stages of pathological progression was examined. Finally, interaction pairs of HCV viral proteins and their directly targeted proteins in the identified module were extracted from the literature and a comprehensive interaction dataset from yeast two-hybrid experiments. By combining the interactions between HCV and their targets, and protein-protein interactions in the Search Tool for the Retrieval of Interacting Genes database (STRING), the HCV-key genes interaction network was constructed and visualized using Cytoscape software 3.2. As a result, a module containing 44 key genes was identified to be associated with HCC progression, due to the dynamic features and functions of those genes in the module. Several important differentially co-expressed gene pairs were identified between non-HCC and HCC stages. In the key genes, cyclin dependent kinase 1 (CDK1), NDC80, cyclin A2 (CCNA2) and rac GTPase activating protein 1 (RACGAP1) were shown to be targeted by the HCV nonstructural proteins NS5A, NS3 and NS5B, respectively. The four genes perform an intermediary role between the HCV viral proteins and the dysfunctional module in the HCV key genes interaction network. These findings provided valuable information for understanding the mechanism of HCV-induced HCC progression and for seeking drug targets for the therapy and prevention of HCC.

  3. A comprehensive analysis on preservation patterns of gene co-expression networks during Alzheimer's disease progression.

    PubMed

    Ray, Sumanta; Hossain, Sk Md Mosaddek; Khatun, Lutfunnesa; Mukhopadhyay, Anirban

    2017-12-20

    Alzheimer's disease (AD) is a chronic neuro-degenerative disruption of the brain which involves in large scale transcriptomic variation. The disease does not impact every regions of the brain at the same time, instead it progresses slowly involving somewhat sequential interaction with different regions. Analysis of the expression patterns of the genes in different regions of the brain influenced in AD surely contribute for a enhanced comprehension of AD pathogenesis and shed light on the early characterization of the disease. Here, we have proposed a framework to identify perturbation and preservation characteristics of gene expression patterns across six distinct regions of the brain ("EC", "HIP", "PC", "MTG", "SFG", and "VCX") affected in AD. Co-expression modules were discovered considering a couple of regions at once. These are then analyzed to know the preservation and perturbation characteristics. Different module preservation statistics and a rank aggregation mechanism have been adopted to detect the changes of expression patterns across brain regions. Gene ontology (GO) and pathway based analysis were also carried out to know the biological meaning of preserved and perturbed modules. In this article, we have extensively studied the preservation patterns of co-expressed modules in six distinct brain regions affected in AD. Some modules are emerged as the most preserved while some others are detected as perturbed between a pair of brain regions. Further investigation on the topological properties of preserved and non-preserved modules reveals a substantial association amongst "betweenness centrality" and "degree" of the involved genes. Our findings may render a deeper realization of the preservation characteristics of gene expression patterns in discrete brain regions affected by AD.

  4. Chronic Ethanol Exposure Produces Time- and Brain Region-Dependent Changes in Gene Coexpression Networks

    PubMed Central

    Osterndorff-Kahanek, Elizabeth A.; Becker, Howard C.; Lopez, Marcelo F.; Farris, Sean P.; Tiwari, Gayatri R.; Nunez, Yury O.; Harris, R. Adron; Mayfield, R. Dayne

    2015-01-01

    Repeated ethanol exposure and withdrawal in mice increases voluntary drinking and represents an animal model of physical dependence. We examined time- and brain region-dependent changes in gene coexpression networks in amygdala (AMY), nucleus accumbens (NAC), prefrontal cortex (PFC), and liver after four weekly cycles of chronic intermittent ethanol (CIE) vapor exposure in C57BL/6J mice. Microarrays were used to compare gene expression profiles at 0-, 8-, and 120-hours following the last ethanol exposure. Each brain region exhibited a large number of differentially expressed genes (2,000-3,000) at the 0- and 8-hour time points, but fewer changes were detected at the 120-hour time point (400-600). Within each region, there was little gene overlap across time (~20%). All brain regions were significantly enriched with differentially expressed immune-related genes at the 8-hour time point. Weighted gene correlation network analysis identified modules that were highly enriched with differentially expressed genes at the 0- and 8-hour time points with virtually no enrichment at 120 hours. Modules enriched for both ethanol-responsive and cell-specific genes were identified in each brain region. These results indicate that chronic alcohol exposure causes global ‘rewiring‘ of coexpression systems involving glial and immune signaling as well as neuronal genes. PMID:25803291

  5. Identification of PEG-induced water stress responsive transcripts using co-expression network in Eucalyptus grandis.

    PubMed

    Ghosh Dasgupta, Modhumita; Dharanishanthi, Veeramuthu

    2017-09-05

    Ecophysiological studies in Eucalyptus have shown that water is the principal factor limiting stem growth. Effect of water deficit conditions on physiological and biochemical parameters has been extensively reported in Eucalyptus. The present study was conducted to identify major polyethylene glycol induced water stress responsive transcripts in Eucalyptus grandis using gene co-expression network. A customized array representing 3359 water stress responsive genes was designed to document their expression in leaves of E. grandis cuttings subjected to -0.225MPa of PEG treatment. The differentially expressed transcripts were documented and significantly co-expressed transcripts were used for construction of network. The co-expression network was constructed with 915 nodes and 3454 edges with degree ranging from 2 to 45. Ninety four GO categories and 117 functional pathways were identified in the network. MCODE analysis generated 27 modules and module 6 with 479 nodes and 1005 edges was identified as the biologically relevant network. The major water responsive transcripts represented in the module included dehydrin, osmotin, LEA protein, expansin, arabinogalactans, heat shock proteins, major facilitator proteins, ARM repeat proteins, raffinose synthase, tonoplast intrinsic protein and transcription factors like DREB2A, ARF9, AGL24, UNE12, WLIM1 and MYB66, MYB70, MYB 55, MYB 16 and MYB 103. The coordinated analysis of gene expression patterns and coexpression networks developed in this study identified an array of transcripts that may regulate PEG induced water stress responses in E. grandis. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    PubMed

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  7. Identification of key candidate genes and pathways in hepatitis B virus-associated acute liver failure by bioinformatical analysis

    PubMed Central

    Lin, Huapeng; Zhang, Qian; Li, Xiaocheng; Wu, Yushen; Liu, Ye; Hu, Yingchun

    2018-01-01

    Abstract Hepatitis B virus-associated acute liver failure (HBV-ALF) is a rare but life-threatening syndrome that carried a high morbidity and mortality. Our study aimed to explore the possible molecular mechanisms of HBV-ALF by means of bioinformatics analysis. In this study, genes expression microarray datasets of HBV-ALF from Gene Expression Omnibus were collected, and then we identified differentially expressed genes (DEGs) by the limma package in R. After functional enrichment analysis, we constructed the protein–protein interaction (PPI) network by the Search Tool for the Retrieval of Interacting Genes online database and weighted genes coexpression network by the WGCNA package in R. Subsequently, we picked out the hub genes among the DEGs. A total of 423 DEGs with 198 upregulated genes and 225 downregulated genes were identified between HBV-ALF and normal samples. The upregulated genes were mainly enriched in immune response, and the downregulated genes were mainly enriched in complement and coagulation cascades. Orosomucoid 1 (ORM1), orosomucoid 2 (ORM2), plasminogen (PLG), and aldehyde oxidase 1 (AOX1) were picked out as the hub genes that with a high degree in both PPI network and weighted genes coexpression network. The weighted genes coexpression network analysis found out 3 of the 5 modules that upregulated genes enriched in were closely related to immune system. The downregulated genes enriched in only one module, and the genes in this module majorly enriched in the complement and coagulation cascades pathway. In conclusion, 4 genes (ORM1, ORM2, PLG, and AOX1) with immune response and the complement and coagulation cascades pathway may take part in the pathogenesis of HBV-ALF, and these candidate genes and pathways could be therapeutic targets for HBV-ALF. PMID:29384847

  8. ModuleMiner - improved computational detection of cis-regulatory modules: are there different modes of gene regulation in embryonic development and adult tissues?

    PubMed Central

    Van Loo, Peter; Aerts, Stein; Thienpont, Bernard; De Moor, Bart; Moreau, Yves; Marynen, Peter

    2008-01-01

    We present ModuleMiner, a novel algorithm for computationally detecting cis-regulatory modules (CRMs) in a set of co-expressed genes. ModuleMiner outperforms other methods for CRM detection on benchmark data, and successfully detects CRMs in tissue-specific microarray clusters and in embryonic development gene sets. Interestingly, CRM predictions for differentiated tissues exhibit strong enrichment close to the transcription start site, whereas CRM predictions for embryonic development gene sets are depleted in this region. PMID:18394174

  9. Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

    PubMed

    Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

    2017-11-13

    The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly correlated with BMI (r = 0.56, P = 0.04), and hub genes of KCNN1 and AQP10 were differentially expressed. We identified significant genes and specific modules potentially related to BMI based on the gene expression profile data of monozygotic twins. The findings may help further elucidate the underlying mechanisms of obesity development and provide novel insights to research potential gene biomarkers and signaling pathways for obesity treatment. Further analysis and validation of the findings reported here are important and necessary when more sample size is acquired.

  10. Screening of biomarkers for prediction of response to and prognosis after chemotherapy for breast cancers.

    PubMed

    Bing, Feng; Zhao, Yu

    2016-01-01

    To screen the biomarkers having the ability to predict prognosis after chemotherapy for breast cancers. Three microarray data of breast cancer patients undergoing chemotherapy were collected from Gene Expression Omnibus database. After preprocessing, data in GSE41112 were analyzed using significance analysis of microarrays to screen the differentially expressed genes (DEGs). The DEGs were further analyzed by Differentially Coexpressed Genes and Links to construct a function module, the prognosis efficacy of which was verified by the other two datasets (GSE22226 and GSE58644) using Kaplan-Meier plots. The involved genes in function module were subjected to a univariate Cox regression analysis to confirm whether the expression of each prognostic gene was associated with survival. A total of 511 DEGs between breast cancer patients who received chemotherapy or not were obtained, consisting of 421 upregulated and 90 downregulated genes. Using the Differentially Coexpressed Genes and Links package, 1,244 differentially coexpressed genes (DCGs) were identified, among which 36 DCGs were regulated by the transcription factor complex NFY (NFYA, NFYB, NFYC). These 39 genes constructed a gene module to classify the samples in GSE22226 and GSE58644 into three subtypes and these subtypes exhibited significantly different survival rates. Furthermore, several genes of the 39 DCGs were shown to be significantly associated with good (such as CDC20) and poor (such as ARID4A) prognoses following chemotherapy. Our present study provided a serial of biomarkers for predicting the prognosis of chemotherapy or targets for development of alternative treatment (ie, CDC20 and ARID4A) in breast cancer patients.

  11. Integrating Genetic and Gene Co-expression Analysis Identifies Gene Networks Involved in Alcohol and Stress Responses

    PubMed Central

    Luo, Jie; Xu, Pei; Cao, Peijian; Wan, Hongjian; Lv, Xiaonan; Xu, Shengchun; Wang, Gangjun; Cook, Melloni N.; Jones, Byron C.; Lu, Lu; Wang, Xusheng

    2018-01-01

    Although the link between stress and alcohol is well recognized, the underlying mechanisms of how they interplay at the molecular level remain unclear. The purpose of this study is to identify molecular networks underlying the effects of alcohol and stress responses, as well as their interaction on anxiety behaviors in the hippocampus of mice using a systems genetics approach. Here, we applied a gene co-expression network approach to transcriptomes of 41 BXD mouse strains under four conditions: stress, alcohol, stress-induced alcohol and control. The co-expression analysis identified 14 modules and characterized four expression patterns across the four conditions. The four expression patterns include up-regulation in no restraint stress and given an ethanol injection (NOE) but restoration in restraint stress followed by an ethanol injection (RSE; pattern 1), down-regulation in NOE but rescue in RSE (pattern 2), up-regulation in both restraint stress followed by a saline injection (RSS) and NOE, and further amplification in RSE (pattern 3), and up-regulation in RSS but reduction in both NOE and RSE (pattern 4). We further identified four functional subnetworks by superimposing protein-protein interactions (PPIs) to the 14 co-expression modules, including γ-aminobutyric acid receptor (GABA) signaling, glutamate signaling, neuropeptide signaling, cAMP-dependent signaling. We further performed module specificity analysis to identify modules that are specific to stress, alcohol, or stress-induced alcohol responses. Finally, we conducted causality analysis to link genetic variation to these identified modules, and anxiety behaviors after stress and alcohol treatments. This study underscores the importance of integrative analysis and offers new insights into the molecular networks underlying stress and alcohol responses. PMID:29674951

  12. A regulon conserved in monocot and dicot plants defines a functional module in antifungal plant immunity

    PubMed Central

    Humphry, Matt; Bednarek, Paweł; Kemmerling, Birgit; Koh, Serry; Stein, Mónica; Göbel, Ulrike; Stüber, Kurt; Piślewska-Bednarek, Mariola; Loraine, Ann; Schulze-Lefert, Paul; Somerville, Shauna; Panstruga, Ralph

    2010-01-01

    At least two components that modulate plant resistance against the fungal powdery mildew disease are ancient and have been conserved since the time of the monocot–dicot split (≈200 Mya). These components are the seven transmembrane domain containing MLO/MLO2 protein and the syntaxin ROR2/PEN1, which act antagonistically and have been identified in the monocot barley (Hordeum vulgare) and the dicot Arabidopsis thaliana, respectively. Additionally, syntaxin-interacting N-ethylmaleimide sensitive factor adaptor protein receptor proteins (VAMP721/722 and SNAP33/34) as well as a myrosinase (PEN2) and an ABC transporter (PEN3) contribute to antifungal resistance in both barley and/or Arabidopsis. Here, we show that these genetically defined defense components share a similar set of coexpressed genes in the two plant species, comprising a statistically significant overrepresentation of gene products involved in regulation of transcription, posttranslational modification, and signaling. Most of the coexpressed Arabidopsis genes possess a common cis-regulatory element that may dictate their coordinated expression. We exploited gene coexpression to uncover numerous components in Arabidopsis involved in antifungal defense. Together, our data provide evidence for an evolutionarily conserved regulon composed of core components and clade/species-specific innovations that functions as a module in plant innate immunity. PMID:21098265

  13. The WRKY transcription factor family and senescence in switchgrass.

    PubMed

    Rinerson, Charles I; Scully, Erin D; Palmer, Nathan A; Donze-Reiner, Teresa; Rabara, Roel C; Tripathi, Prateek; Shen, Qingxi J; Sattler, Scott E; Rohila, Jai S; Sarath, Gautam; Rushton, Paul J

    2015-11-09

    Early aerial senescence in switchgrass (Panicum virgatum) can significantly limit biomass yields. WRKY transcription factors that can regulate senescence could be used to reprogram senescence and enhance biomass yields. All potential WRKY genes present in the version 1.0 of the switchgrass genome were identified and curated using manual and bioinformatic methods. Expression profiles of WRKY genes in switchgrass flag leaf RNA-Seq datasets were analyzed using clustering and network analyses tools to identify both WRKY and WRKY-associated gene co-expression networks during leaf development and senescence onset. We identified 240 switchgrass WRKY genes including members of the RW5 and RW6 families of resistance proteins. Weighted gene co-expression network analysis of the flag leaf transcriptomes across development readily separated clusters of co-expressed genes into thirteen modules. A visualization highlighted separation of modules associated with the early and senescence-onset phases of flag leaf growth. The senescence-associated module contained 3000 genes including 23 WRKYs. Putative promoter regions of senescence-associated WRKY genes contained several cis-element-like sequences suggestive of responsiveness to both senescence and stress signaling pathways. A phylogenetic comparison of senescence-associated WRKY genes from switchgrass flag leaf with senescence-associated WRKY genes from other plants revealed notable hotspots in Group I, IIb, and IIe of the phylogenetic tree. We have identified and named 240 WRKY genes in the switchgrass genome. Twenty three of these genes show elevated mRNA levels during the onset of flag leaf senescence. Eleven of the WRKY genes were found in hotspots of related senescence-associated genes from multiple species and thus represent promising targets for future switchgrass genetic improvement. Overall, individual WRKY gene expression profiles could be readily linked to developmental stages of flag leaves.

  14. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model.

    PubMed

    Kogelman, Lisette J A; Cirera, Susanna; Zhernakova, Daria V; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N

    2014-09-30

    Obesity is a complex metabolic condition in strong association with various diseases, like type 2 diabetes, resulting in major public health and economic implications. Obesity is the result of environmental and genetic factors and their interactions, including genome-wide genetic interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model for human obesity, offering the possibility to study in-depth organ-level transcriptomic regulations of obesity, unfeasible in humans. Our aim was to reveal adipose tissue co-expression networks, pathways and transcriptional regulations of obesity using RNA Sequencing based systems biology approaches in a porcine model. We selected 36 animals for RNA Sequencing from a previously created F2 pig population representing three extreme groups based on their predicted genetic risks for obesity. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to detect clusters of highly co-expressed genes (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P < 0.001). Functional annotation identified pathways enlightening the association between obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using confident scores, for the WGCNA module which was associated with osteoclast differentiation: CCR1, MSR1 and SI1 (probability scores respectively 95.30, 62.28, and 34.58). Moreover, detection of differentially connected genes identified various genes previously identified to be associated with obesity in humans and rodents, e.g. CSF1R and MARC2. To our knowledge, this is the first study to apply systems biology approaches using porcine adipose tissue RNA-Sequencing data in a genetically characterized porcine model for obesity. We revealed complex networks, pathways, candidate and regulatory genes related to obesity, confirming the complexity of obesity and its association with immune-related disorders and osteoporosis.

  15. [Exploration of common biological pathways for attention deficit hyperactivity disorder and low birth weight].

    PubMed

    Xiang, Bo; Yu, Minglan; Liang, Xuemei; Lei, Wei; Huang, Chaohua; Chen, Jing; He, Wenying; Zhang, Tao; Li, Tao; Liu, Kezhi

    2017-12-10

    To explore common biological pathways for attention deficit hyperactivity disorder (ADHD) and low birth weight (LBW). Thei-Gsea4GwasV2 software was used to analyze the result of genome-wide association analysis (GWAS) for LBW (pathways were derived from Reactome), and nominally significant (P< 0.05, FDR< 0.25) pathways were tested for replication in ADHD.Significant pathways were analyzed with DAPPLE and Reatome FI software to identify genes involved in such pathways, with each cluster enriched with the gene ontology (GO). The Centiscape2.0 software was used to calculate the degree of genetic networks and the betweenness value to explore the core node (gene). Weighed gene co-expression network analysis (WGCNA) was then used to explore the co-expression of genes in these pathways.With gene expression data derived from BrainSpan, GO enrichment was carried out for each gene module. Eleven significant biological pathways was identified in association with LBW, among which two (Selenoamino acid metabolism and Diseases associated with glycosaminoglycan metabolism) were replicated during subsequent ADHD analysis. Network analysis of 130 genes in these pathways revealed that some of the sub-networksare related with morphology of cerebellum, development of hippocampus, and plasticity of synaptic structure. Upon co-expression network analysis, 120 genes passed the quality control and were found to express in 3 gene modules. These modules are mainly related to the regulation of synaptic structure and activity regulation. ADHD and LBW share some biological regulation processes. Anomalies of such proces sesmay predispose to ADHD.

  16. Differentially co-expressed interacting protein pairs discriminate samples under distinct stages of HIV type 1 infection.

    PubMed

    Yoon, Dukyong; Kim, Hyosil; Suh-Kim, Haeyoung; Park, Rae Woong; Lee, KiYoung

    2011-01-01

    Microarray analyses based on differentially expressed genes (DEGs) have been widely used to distinguish samples across different cellular conditions. However, studies based on DEGs have not been able to clearly determine significant differences between samples of pathophysiologically similar HIV-1 stages, e.g., between acute and chronic progressive (or AIDS) or between uninfected and clinically latent stages. We here suggest a novel approach to allow such discrimination based on stage-specific genetic features of HIV-1 infection. Our approach is based on co-expression changes of genes known to interact. The method can identify a genetic signature for a single sample as contrasted with existing protein-protein-based analyses with correlational designs. Our approach distinguishes each sample using differentially co-expressed interacting protein pairs (DEPs) based on co-expression scores of individual interacting pairs within a sample. The co-expression score has positive value if two genes in a sample are simultaneously up-regulated or down-regulated. And the score has higher absolute value if expression-changing ratios are similar between the two genes. We compared characteristics of DEPs with that of DEGs by evaluating their usefulness in separation of HIV-1 stage. And we identified DEP-based network-modules and their gene-ontology enrichment to find out the HIV-1 stage-specific gene signature. Based on the DEP approach, we observed clear separation among samples from distinct HIV-1 stages using clustering and principal component analyses. Moreover, the discrimination power of DEPs on the samples (70-100% accuracy) was much higher than that of DEGs (35-45%) using several well-known classifiers. DEP-based network analysis also revealed the HIV-1 stage-specific network modules; the main biological processes were related to "translation," "RNA splicing," "mRNA, RNA, and nucleic acid transport," and "DNA metabolism." Through the HIV-1 stage-related modules, changing stage-specific patterns of protein interactions could be observed. DEP-based method discriminated the HIV-1 infection stages clearly, and revealed a HIV-1 stage-specific gene signature. The proposed DEP-based method might complement existing DEG-based approaches in various microarray expression analyses.

  17. Integrating mRNA and miRNA Weighted Gene Co-Expression Networks with eQTLs in the Nucleus Accumbens of Subjects with Alcohol Dependence

    PubMed Central

    Blevins, Tana; Aliev, Fazil; Adkins, Amy; Hack, Laura; Bigdeli, Tim; D. van der Vaart, Andrew; Web, Bradley Todd; Bacanu, Silviu-Alin; Kalsi, Gursharan; Kendler, Kenneth S.; Miles, Michael F.; Dick, Danielle; Riley, Brien P.; Dumur, Catherine; Vladimirov, Vladimir I.

    2015-01-01

    Alcohol consumption is known to lead to gene expression changes in the brain. After performing weighted gene co-expression network analyses (WGCNA) on genome-wide mRNA and microRNA (miRNA) expression in Nucleus Accumbens (NAc) of subjects with alcohol dependence (AD; N = 18) and of matched controls (N = 18), six mRNA and three miRNA modules significantly correlated with AD were identified (Bonferoni-adj. p≤ 0.05). Cell-type-specific transcriptome analyses revealed two of the mRNA modules to be enriched for neuronal specific marker genes and downregulated in AD, whereas the remaining four mRNA modules were enriched for astrocyte and microglial specific marker genes and upregulated in AD. Gene set enrichment analysis demonstrated that neuronal specific modules were enriched for genes involved in oxidative phosphorylation, mitochondrial dysfunction and MAPK signaling. Glial-specific modules were predominantly enriched for genes involved in processes related to immune functions, i.e. cytokine signaling (all adj. p≤ 0.05). In mRNA and miRNA modules, 461 and 25 candidate hub genes were identified, respectively. In contrast to the expected biological functions of miRNAs, correlation analyses between mRNA and miRNA hub genes revealed a higher number of positive than negative correlations (χ2 test p≤ 0.0001). Integration of hub gene expression with genome-wide genotypic data resulted in 591 mRNA cis-eQTLs and 62 miRNA cis-eQTLs. mRNA cis-eQTLs were significantly enriched for AD diagnosis and AD symptom counts (adj. p = 0.014 and p = 0.024, respectively) in AD GWAS signals in a large, independent genetic sample from the Collaborative Study on Genetics of Alcohol (COGA). In conclusion, our study identified putative gene network hubs coordinating mRNA and miRNA co-expression changes in the NAc of AD subjects, and our genetic (cis-eQTL) analysis provides novel insights into the etiological mechanisms of AD. PMID:26381263

  18. BFDCA: A Comprehensive Tool of Using Bayes Factor for Differential Co-Expression Analysis.

    PubMed

    Wang, Duolin; Wang, Juexin; Jiang, Yuexu; Liang, Yanchun; Xu, Dong

    2017-02-03

    Comparing the gene-expression profiles between biological conditions is useful for understanding gene regulation underlying complex phenotypes. Along this line, analysis of differential co-expression (DC) has gained attention in the recent years, where genes under one condition have different co-expression patterns compared with another. We developed an R package Bayes Factor approach for Differential Co-expression Analysis (BFDCA) for DC analysis. BFDCA is unique in integrating various aspects of DC patterns (including Shift, Cross, and Re-wiring) into one uniform Bayes factor. We tested BFDCA using simulation data and experimental data. Simulation results indicate that BFDCA outperforms existing methods in accuracy and robustness of detecting DC pairs and DC modules. Results of using experimental data suggest that BFDCA can cluster disease-related genes into functional DC subunits and estimate the regulatory impact of disease-related genes well. BFDCA also achieves high accuracy in predicting case-control phenotypes by using significant DC gene pairs as markers. BFDCA is publicly available at http://dx.doi.org/10.17632/jdz4vtvnm3.1. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.

    PubMed

    Xiao, Xiaolin; Moreno-Moral, Aida; Rotival, Maxime; Bottolo, Leonardo; Petretto, Enrico

    2014-01-01

    Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states). Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks) that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted) networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based) and humans (mRNA-sequencing-based) and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi-tissue subnetwork of co-expressed heat shock protein (Hsp) and cardiomyopathy genes (Bag3, Cryab, Kras, Emd, Plec), which was significantly replicated using separate failing heart and liver gene expression datasets in humans, thus revealing a conserved functional role for Hsp genes in cardiovascular disease.

  20. Functional modules by relating protein interaction networks and gene expression.

    PubMed

    Tornow, Sabine; Mewes, H W

    2003-11-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.

  1. Functional modules by relating protein interaction networks and gene expression

    PubMed Central

    Tornow, Sabine; Mewes, H. W.

    2003-01-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships. PMID:14576317

  2. Integrative Transcriptomic Analysis Uncovers Novel Gene Modules That Underlie the Sulfate Response in Arabidopsis thaliana

    PubMed Central

    Henríquez-Valencia, Carlos; Arenas-M, Anita; Medina, Joaquín; Canales, Javier

    2018-01-01

    Sulfur is an essential nutrient for plant growth and development. Sulfur is a constituent of proteins, the plasma membrane and cell walls, among other important cellular components. To obtain new insights into the gene regulatory networks underlying the sulfate response, we performed an integrative meta-analysis of transcriptomic data from five different sulfate experiments available in public databases. This bioinformatic approach allowed us to identify a robust set of genes whose expression depends only on sulfate availability, indicating that those genes play an important role in the sulfate response. In relation to sulfate metabolism, the biological function of approximately 45% of these genes is currently unknown. Moreover, we found several consistent Gene Ontology terms related to biological processes that have not been extensively studied in the context of the sulfate response; these processes include cell wall organization, carbohydrate metabolism, nitrogen compound transport, and the regulation of proteolysis. Gene co-expression network analyses revealed relationships between the sulfate-responsive genes that were distributed among seven function-specific co-expression modules. The most connected genes in the sulfate co-expression network belong to a module related to the carbon response, suggesting that this biological function plays an important role in the control of the sulfate response. Temporal analyses of the network suggest that sulfate starvation generates a biphasic response, which involves that major changes in gene expression occur during both the early and late responses. Network analyses predicted that the sulfate response is regulated by a limited number of transcription factors, including MYBs, bZIPs, and NF-YAs. In conclusion, our analysis identified new candidate genes and provided new hypotheses to advance our understanding of the transcriptional regulation of sulfate metabolism in plants. PMID:29692794

  3. Integrative Transcriptomic Analysis Uncovers Novel Gene Modules That Underlie the Sulfate Response in Arabidopsis thaliana.

    PubMed

    Henríquez-Valencia, Carlos; Arenas-M, Anita; Medina, Joaquín; Canales, Javier

    2018-01-01

    Sulfur is an essential nutrient for plant growth and development. Sulfur is a constituent of proteins, the plasma membrane and cell walls, among other important cellular components. To obtain new insights into the gene regulatory networks underlying the sulfate response, we performed an integrative meta-analysis of transcriptomic data from five different sulfate experiments available in public databases. This bioinformatic approach allowed us to identify a robust set of genes whose expression depends only on sulfate availability, indicating that those genes play an important role in the sulfate response. In relation to sulfate metabolism, the biological function of approximately 45% of these genes is currently unknown. Moreover, we found several consistent Gene Ontology terms related to biological processes that have not been extensively studied in the context of the sulfate response; these processes include cell wall organization, carbohydrate metabolism, nitrogen compound transport, and the regulation of proteolysis. Gene co-expression network analyses revealed relationships between the sulfate-responsive genes that were distributed among seven function-specific co-expression modules. The most connected genes in the sulfate co-expression network belong to a module related to the carbon response, suggesting that this biological function plays an important role in the control of the sulfate response. Temporal analyses of the network suggest that sulfate starvation generates a biphasic response, which involves that major changes in gene expression occur during both the early and late responses. Network analyses predicted that the sulfate response is regulated by a limited number of transcription factors, including MYBs, bZIPs, and NF-YAs. In conclusion, our analysis identified new candidate genes and provided new hypotheses to advance our understanding of the transcriptional regulation of sulfate metabolism in plants.

  4. Network-based co-expression analysis for exploring the potential diagnostic biomarkers of metastatic melanoma.

    PubMed

    Wang, Li-Xin; Li, Yang; Chen, Guan-Zhi

    2018-01-01

    Metastatic melanoma is an aggressive skin cancer and is one of the global malignancies with high mortality and morbidity. It is essential to identify and verify diagnostic biomarkers of early metastatic melanoma. Previous studies have systematically assessed protein biomarkers and mRNA-based expression characteristics. However, molecular markers for the early diagnosis of metastatic melanoma have not been identified. To explore potential regulatory targets, we have analyzed the gene microarray expression profiles of malignant melanoma samples by co-expression analysis based on the network approach. The differentially expressed genes (DEGs) were screened by the EdgeR package of R software. A weighted gene co-expression network analysis (WGCNA) was used for the identification of DEGs in the special gene modules and hub genes. Subsequently, a protein-protein interaction network was constructed to extract hub genes associated with gene modules. Finally, twenty-four important hub genes (RASGRP2, IKZF1, CXCR5, LTB, BLK, LINGO3, CCR6, P2RY10, RHOH, JUP, KRT14, PLA2G3, SPRR1A, KRT78, SFN, CLDN4, IL1RN, PKP3, CBLC, KRT16, TMEM79, KLK8, LYPD3 and LYPD5) were treated as valuable factors involved in the immune response and tumor cell development in tumorigenesis. In addition, a transcriptional regulatory network was constructed for these specific modules or hub genes, and a few core transcriptional regulators were found to be mostly associated with our hub genes, including GATA1, STAT1, SP1, and PSG1. In summary, our findings enhance our understanding of the biological process of malignant melanoma metastasis, enabling us to identify specific genes to use for diagnostic and prognostic markers and possibly for targeted therapy.

  5. Screening of biomarkers for prediction of response to and prognosis after chemotherapy for breast cancers

    PubMed Central

    Bing, Feng; Zhao, Yu

    2016-01-01

    Objective To screen the biomarkers having the ability to predict prognosis after chemotherapy for breast cancers. Methods Three microarray data of breast cancer patients undergoing chemotherapy were collected from Gene Expression Omnibus database. After preprocessing, data in GSE41112 were analyzed using significance analysis of microarrays to screen the differentially expressed genes (DEGs). The DEGs were further analyzed by Differentially Coexpressed Genes and Links to construct a function module, the prognosis efficacy of which was verified by the other two datasets (GSE22226 and GSE58644) using Kaplan–Meier plots. The involved genes in function module were subjected to a univariate Cox regression analysis to confirm whether the expression of each prognostic gene was associated with survival. Results A total of 511 DEGs between breast cancer patients who received chemotherapy or not were obtained, consisting of 421 upregulated and 90 downregulated genes. Using the Differentially Coexpressed Genes and Links package, 1,244 differentially coexpressed genes (DCGs) were identified, among which 36 DCGs were regulated by the transcription factor complex NFY (NFYA, NFYB, NFYC). These 39 genes constructed a gene module to classify the samples in GSE22226 and GSE58644 into three subtypes and these subtypes exhibited significantly different survival rates. Furthermore, several genes of the 39 DCGs were shown to be significantly associated with good (such as CDC20) and poor (such as ARID4A) prognoses following chemotherapy. Conclusion Our present study provided a serial of biomarkers for predicting the prognosis of chemotherapy or targets for development of alternative treatment (ie, CDC20 and ARID4A) in breast cancer patients. PMID:27217777

  6. Directed module detection in a large-scale expression compendium.

    PubMed

    Fu, Qiang; Lemmens, Karen; Sanchez-Rodriguez, Aminael; Thijs, Inge M; Meysman, Pieter; Sun, Hong; Fierro, Ana Carolina; Engelen, Kristof; Marchal, Kathleen

    2012-01-01

    Public online microarray databases contain tremendous amounts of expression data. Mining these data sources can provide a wealth of information on the underlying transcriptional networks. In this chapter, we illustrate how the web services COLOMBOS and DISTILLER can be used to identify condition-dependent coexpression modules by exploring compendia of public expression data. COLOMBOS is designed for user-specified query-driven analysis, whereas DISTILLER generates a global regulatory network overview. The user is guided through both web services by means of a case study in which condition-dependent coexpression modules comprising a gene of interest (i.e., "directed") are identified.

  7. S187. SEARCHING FOR BRAIN CO-EXPRESSION MODULES THAT CONTRIBUTE DISPROPORTIONATELY TO THE COMMON POLYGENIC RISK FOR SCHIZOPHRENIA

    PubMed Central

    Costas, Javier; Paramo, Mario; Arrojo, Manuel

    2018-01-01

    Abstract Background Genomic research has revealed that schizophrenia is a highly polygenic disease. Recent estimates indicate that at least 71% of genomic segments of 1 Mb include one or more risk loci for schizophrenia (Loh et al., Nature Genet 2015). This extremely high polygenicity represents a challenge to decipher the biological basis of schizophrenia, as it is expected that any set of SNPs with enough size will be associated with the disorder. Among the different gene sets available for study (such as those from Gene Ontology, KEGG pathway, Reactome pathways or protein protein interaction datasets), those based on brain co-expression networks represent putative functional relationships in the relevant tissue. The aim of this work was to identify brain co-expression networks that contribute disproportionately to the common polygenic risk for schizophrenia to get more insight on schizophrenia etiopathology. Methods We analyzed a case -control dataset consisting of 582 schizophrenia patients from Galicia, NW Spain, and 591 ancestrally matched controls, genotyped with the Illumina PsychArray. Using as discovery sample the summary results from the largest GWAS of schizophrenia to date (Psychiatric Genomics Consortium, SCZ2), we generated polygenic risk scores (PRS) in our sample based on SNPs located at genes belonging to brain co-expression modules determined by the CommonMind Consortium (Fromer et al., Nature Neurosci 2016). PRS were generated using the clumping procedure of PLINK, considering several different thresholds to select SNPs from the discovery sample. In order to test if any specific module increased risk to schizophrenia more than expected by their size, we generated up to 10,000 random permutations of the same number of SNPs, matched by frequency, distance to nearest gene, number of SNPs in LD and gene density, using SNPsnap. Results As expected, most modules with enough number of independent SNPs belonging to them showed a significant increase in Nagelkerke’s R2 in our case-control sample after the addition of the module-specific PRS in a logistic regression model. Our permutation strategy revealed that most modules did not show an excess of risk, measured by increase in Nagelkerke’s R2, in comparison to equal number of SNPs with similar characteristics. But one module, M2c from Fromer et al., remained highly significant after multiple tests’ correction. Reactome pathways analysis revealed an over-representation of genes involved in “Neuronal System” and “Axon guidance” among genes from this module. Using the same protocol, we detected that the 84 genes from the neuronal system pathway at this module, representing less than 6% of the genes from the module, explained a higher level of risk than expected. “Voltage-gated Potassium channels” and “Neurexins and neuroligins” are overrepresented among the Neuronal System genes from module M2c. Discussion Here, we show that, in spite of the high polygenicity of schizophrenia, it is possible to identify gene sets contributing disproportionately to total risk, as it was the case for the M2c module from Fromer et al. These authors have previously reported that the M2c module was enriched in GWAS signals, as well as CNVs and rare variants associated with schizophrenia. Therefore, this module shows a disproportionately contribution to schizophrenia risk. Study supported by Grant PI14/01020 from Instituto de Salud Carlos III, Ministry of Health, Spanish Government.

  8. Systems Toxicology of Chemically Induced Liver and Kidney Injuries: Histopathology-Associated Gene Co-Expression Modules

    DTIC Science & Technology

    2016-01-04

    2016 (wileyonlinelibrary.com) DOI 10.1002/jat.3278Systems toxicology of chemically induced liver and kidney injuries: histopathology-associated gene...injuries that classify 11 liver and eight kidney histopathology endpoints based on dose-dependent activation of the identified modules. We showed that...well as determine whether the injury module activation was specific to the tissue of origin (liver and kidney ). The generated modules provide a link

  9. Uncovering co-expression gene network modules regulating fruit acidity in diverse apples.

    PubMed

    Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Zhong, Gan-Yuan; Xu, Kenong

    2015-08-16

    Acidity is a major contributor to fruit quality. Several organic acids are present in apple fruit, but malic acid is predominant and determines fruit acidity. The trait is largely controlled by the Malic acid (Ma) locus, underpinning which Ma1 that putatively encodes a vacuolar aluminum-activated malate transporter1 (ALMT1)-like protein is a strong candidate gene. We hypothesize that fruit acidity is governed by a gene network in which Ma1 is key member. The goal of this study is to identify the gene network and the potential mechanisms through which the network operates. Guided by Ma1, we analyzed the transcriptomes of mature fruit of contrasting acidity from six apple accessions of genotype Ma_ (MaMa or Mama) and four of mama using RNA-seq and identified 1301 fruit acidity associated genes, among which 18 were most significant acidity genes (MSAGs). Network inferring using weighted gene co-expression network analysis (WGCNA) revealed five co-expression gene network modules of significant (P < 0.001) correlation with malate. Of these, the Ma1 containing module (Turquoise) of 336 genes showed the highest correlation (0.79). We also identified 12 intramodular hub genes from each of the five modules and 18 enriched gene ontology (GO) terms and MapMan sub-bines, including two GO terms (GO:0015979 and GO:0009765) and two MapMap sub-bins (1.3.4 and 1.1.1.1) related to photosynthesis in module Turquoise. Using Lemon-Tree algorithms, we identified 12 regulator genes of probabilistic scores 35.5-81.0, including MDP0000525602 (a LLR receptor kinase), MDP0000319170 (an IQD2-like CaM binding protein) and MDP0000190273 (an EIN3-like transcription factor) of greater interest for being one of the 18 MSAGs or one of the 12 intramodular hub genes in Turquoise, and/or a regulator to the cluster containing Ma1. The most relevant finding of this study is the identification of the MSAGs, intramodular hub genes, enriched photosynthesis related processes, and regulator genes in a WGCNA module Turquoise that not only encompasses Ma1 but also shows the highest modular correlation with acidity. Overall, this study provides important insight into the Ma1-mediated gene network controlling acidity in mature apple fruit of diverse genetic background.

  10. GEM2Net: from gene expression modeling to -omics networks, a new CATdb module to investigate Arabidopsis thaliana genes involved in stress response.

    PubMed

    Zaag, Rim; Tamby, Jean Philippe; Guichard, Cécile; Tariq, Zakia; Rigaill, Guillem; Delannoy, Etienne; Renou, Jean-Pierre; Balzergue, Sandrine; Mary-Huard, Tristan; Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Brunaud, Véronique

    2015-01-01

    CATdb (http://urgv.evry.inra.fr/CATdb) is a database providing a public access to a large collection of transcriptomic data, mainly for Arabidopsis but also for other plants. This resource has the rare advantage to contain several thousands of microarray experiments obtained with the same technical protocol and analyzed by the same statistical pipelines. In this paper, we present GEM2Net, a new module of CATdb that takes advantage of this homogeneous dataset to mine co-expression units and decipher Arabidopsis gene functions. GEM2Net explores 387 stress conditions organized into 18 biotic and abiotic stress categories. For each one, a model-based clustering is applied on expression differences to identify clusters of co-expressed genes. To characterize functions associated with these clusters, various resources are analyzed and integrated: Gene Ontology, subcellular localization of proteins, Hormone Families, Transcription Factor Families and a refined stress-related gene list associated to publications. Exploiting protein-protein interactions and transcription factors-targets interactions enables to display gene networks. GEM2Net presents the analysis of the 18 stress categories, in which 17,264 genes are involved and organized within 681 co-expression clusters. The meta-data analyses were stored and organized to compose a dynamic Web resource. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Weighted gene co-expression network analysis reveals potential genes involved in early metamorphosis process in sea cucumber Apostichopus japonicus.

    PubMed

    Li, Yongxin; Kikuchi, Mani; Li, Xueyan; Gao, Qionghua; Xiong, Zijun; Ren, Yandong; Zhao, Ruoping; Mao, Bingyu; Kondo, Mariko; Irie, Naoki; Wang, Wen

    2018-01-01

    Sea cucumbers, one main class of Echinoderms, have a very fast and drastic metamorphosis process during their development. However, the molecular basis under this process remains largely unknown. Here we systematically examined the gene expression profiles of Japanese common sea cucumber (Apostichopus japonicus) for the first time by RNA sequencing across 16 developmental time points from fertilized egg to juvenile stage. Based on the weighted gene co-expression network analysis (WGCNA), we identified 21 modules. Among them, MEdarkmagenta was highly expressed and correlated with the early metamorphosis process from late auricularia to doliolaria larva. Furthermore, gene enrichment and differentially expressed gene analysis identified several genes in the module that may play key roles in the metamorphosis process. Our results not only provide a molecular basis for experimentally studying the development and morphological complexity of sea cucumber, but also lay a foundation for improving its emergence rate. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. A Gene Co-Expression Network in Whole Blood of Schizophrenia Patients Is Independent of Antipsychotic-Use and Enriched for Brain-Expressed Genes

    PubMed Central

    de Jong, Simone; Boks, Marco P. M.; Fuller, Tova F.; Strengman, Eric; Janson, Esther; de Kovel, Carolien G. F.; Ori, Anil P. S.; Vi, Nancy; Mulder, Flip; Blom, Jan Dirk; Glenthøj, Birte; Schubart, Chris D.; Cahn, Wiepke; Kahn, René S.; Horvath, Steve; Ophoff, Roel A.

    2012-01-01

    Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1), is located in, and regulated by the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network. PMID:22761806

  13. Transcriptomic context of DRD1 is associated with prefrontal activity and behavior during working memory.

    PubMed

    Fazio, Leonardo; Pergola, Giulio; Papalino, Marco; Di Carlo, Pasquale; Monda, Anna; Gelao, Barbara; Amoroso, Nicola; Tangaro, Sabina; Rampino, Antonio; Popolizio, Teresa; Bertolino, Alessandro; Blasi, Giuseppe

    2018-05-22

    Dopamine D 1 receptor (D 1 R) signaling shapes prefrontal cortex (PFC) activity during working memory (WM). Previous reports found higher WM performance associated with alleles linked to greater expression of the gene coding for D 1 Rs ( DRD1 ). However, there is no evidence on the relationship between genetic modulation of DRD1 expression in PFC and patterns of prefrontal activity during WM. Furthermore, previous studies have not considered that D 1 Rs are part of a coregulated molecular environment, which may contribute to D 1 R-related prefrontal WM processing. Thus, we hypothesized a reciprocal link between a coregulated (i.e., coexpressed) molecular network including DRD1 and PFC activity. To explore this relationship, we used three independent postmortem prefrontal mRNA datasets (total n = 404) to characterize a coexpression network including DRD1 Then, we indexed network coexpression using a measure (polygenic coexpression index- DRD1 -PCI) combining the effect of single nucleotide polymorphisms (SNPs) on coexpression. Finally, we associated the DRD1 -PCI with WM performance and related brain activity in independent samples of healthy participants (total n = 371). We identified and replicated a coexpression network including DRD1 , whose coexpression was correlated with DRD1 -PCI. We also found that DRD1 -PCI was associated with lower PFC activity and higher WM performance. Behavioral and imaging results were replicated in independent samples. These findings suggest that genetically predicted expression of DRD1 and of its coexpression partners stratifies healthy individuals in terms of WM performance and related prefrontal activity. They also highlight genes and SNPs potentially relevant to pharmacological trials aimed to test cognitive enhancers modulating DRD1 signaling.

  14. Transcriptome Analysis of Gelatin Seed Treatment as a Biostimulant of Cucumber Plant Growth

    PubMed Central

    Wilson, H. T.; Xu, K.; Taylor, A. G.

    2015-01-01

    The beneficial effects of gelatin capsule seed treatment on enhanced plant growth and tolerance to abiotic stress have been reported in a number of crops, but the molecular mechanisms underlying such effects are poorly understood. Using mRNA sequencing based approach, transcriptomes of one- and two-week-old cucumber plants from gelatin capsule treated and nontreated seeds were characterized. The gelatin treated plants had greater total leaf area, fresh weight, frozen weight, and nitrogen content. Pairwise comparisons of the RNA-seq data identified 620 differentially expressed genes between treated and control two-week-old plants, consistent with the timing when the growth related measurements also showed the largest differences. Using weighted gene coexpression network analysis, significant coexpression gene network module of 208 of the 620 differentially expressed genes was identified, which included 16 hub genes in the blue module, a NAC transcription factor, a MYB transcription factor, an amino acid transporter, an ammonium transporter, a xenobiotic detoxifier-glutathione S-transferase, and others. Based on the putative functions of these genes, the identification of the significant WGCNA module and the hub genes provided important insights into the molecular mechanisms of gelatin seed treatment as a biostimulant to enhance plant growth. PMID:26558288

  15. The Role of Vitamin D in the Transcriptional Program of Human Pregnancy

    PubMed Central

    Al-Garawi, Amal; Carey, Vincent J.; Chhabra, Divya; Morrow, Jarrett; Lasky-Su, Jessica; Qiu, Weiliang; Laranjo, Nancy; Litonjua, Augusto A.; Weiss, Scott T.

    2016-01-01

    Background Patterns of gene expression of human pregnancy are poorly understood. In a trial of vitamin D supplementation in pregnant women, peripheral blood transcriptomes were measured longitudinally on 30 women and used to characterize gene co-expression networks. Objective Studies suggest that increased maternal Vitamin D levels may reduce the risk of asthma in early life, yet the underlying mechanisms have not been examined. In this study, we used a network-based approach to examine changes in gene expression profiles during the course of normal pregnancy and evaluated their association with maternal Vitamin D levels. Design The VDAART study is a randomized clinical trial of vitamin D supplementation in pregnancy for reduction of pediatric asthma risk. The trial enrolled 881 women at 10–18 weeks of gestation. Longitudinal gene expression measures were obtained on thirty pregnant women, using RNA isolated from peripheral blood samples obtained in the first and third trimesters. Differentially expressed genes were identified using significance of analysis of microarrays (SAM), and clustered using a weighted gene co-expression network analysis (WGCNA). Gene-set enrichment was performed to identify major biological pathways. Results Comparison of transcriptional profiles between first and third trimesters of pregnancy identified 5839 significantly differentially expressed genes (FDR<0.05). Weighted gene co-expression network analysis clustered these transcripts into 14 co-expression modules of which two showed significant correlation with maternal vitamin D levels. Pathway analysis of these two modules revealed genes enriched in immune defense pathways and extracellular matrix reorganization as well as genes enriched in notch signaling and transcription factor networks. Conclusion Our data show that gene expression profiles of healthy pregnant women change during the course of pregnancy and suggest that maternal Vitamin D levels influence transcriptional profiles. These alterations of the maternal transcriptome may contribute to fetal immune imprinting and reduce allergic sensitization in early life. Trial Registration clinicaltrials.gov NCT00920621 PMID:27711190

  16. DCGL v2.0: an R package for unveiling differential regulation from differential co-expression.

    PubMed

    Yang, Jing; Yu, Hui; Liu, Bao-Hong; Zhao, Zhongming; Liu, Lei; Ma, Liang-Xiao; Li, Yi-Xue; Li, Yuan-Yuan

    2013-01-01

    Differential co-expression analysis (DCEA) has emerged in recent years as a novel, systematic investigation into gene expression data. While most DCEA studies or tools focus on the co-expression relationships among genes, some are developing a potentially more promising research domain, differential regulation analysis (DRA). In our previously proposed R package DCGL v1.0, we provided functions to facilitate basic differential co-expression analyses; however, the output from DCGL v1.0 could not be translated into differential regulation mechanisms in a straightforward manner. To advance from DCEA to DRA, we upgraded the DCGL package from v1.0 to v2.0. A new module named "Differential Regulation Analysis" (DRA) was designed, which consists of three major functions: DRsort, DRplot, and DRrank. DRsort selects differentially regulated genes (DRGs) and differentially regulated links (DRLs) according to the transcription factor (TF)-to-target information. DRrank prioritizes the TFs in terms of their potential relevance to the phenotype of interest. DRplot graphically visualizes differentially co-expressed links (DCLs) and/or TF-to-target links in a network context. In addition to these new modules, we streamlined the codes from v1.0. The evaluation results proved that our differential regulation analysis is able to capture the regulators relevant to the biological subject. With ample functions to facilitate differential regulation analysis, DCGL v2.0 was upgraded from a DCEA tool to a DRA tool, which may unveil the underlying differential regulation from the observed differential co-expression. DCGL v2.0 can be applied to a wide range of gene expression data in order to systematically identify novel regulators that have not yet been documented as critical. DCGL v2.0 package is available at http://cran.r-project.org/web/packages/DCGL/index.html or at our project home page http://lifecenter.sgst.cn/main/en/dcgl.jsp.

  17. Massive-scale gene co-expression network construction and robustness testing using random matrix theory.

    PubMed

    Gibson, Scott M; Ficklin, Stephen P; Isaacson, Sven; Luo, Feng; Feltus, Frank A; Smith, Melissa C

    2013-01-01

    The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust.

  18. Effects of selection for ethanol preference on gene expression in the nucleus accumbens of HS-CC mice

    PubMed Central

    Colville, A. M.; Iancu, O. D.; Oberbeck, D. L.; Darakjian, P.; Zheng, C. L.; Walter, N. A. R.; Harrington, C. A.; Searles, R. P.; McWeeney, S.; Hitzemann, R. J.

    2017-01-01

    Previous studies on changes in murine brain gene expression associated with the selection for ethanol preference have used F2 intercross or heterogeneous stock (HS) founders, derived from standard laboratory strains. However, these populations represent only a small proportion of the genetic variance available in Mus musculus. To investigate a wider range of genetic diversity, we selected mice for ethanol preference using an HS derived from the eight strains of the collaborative cross. These HS mice were selectively bred (four generations) for high and low ethanol preference. The nucleus accumbens shell of naive S4 mice was interrogated using RNA sequencing (RNA-Seq). Gene networks were constructed using the weighted gene coexpression network analysis assessing both coexpression and cosplicing. Selection targeted one of the network coexpression modules (greenyellow) that was significantly enriched in genes associated with receptor signaling activity including Chrna7, Grin2a, Htr2a and Oprd1. Connectivity in the module as measured by changes in the hub nodes was significantly reduced in the low preference line. Of particular interest was the observation that selection had marked effects on a large number of cell adhesion molecules, including cadherins and protocadherins. In addition, the coexpression data showed that selection had marked effects on long non-coding RNA hub nodes. Analysis of the cosplicing network data showed a significant effect of selection on a large cluster of Ras GTPase-binding genes including Cdkl5, Cyfip1, Ndrg1, Sod1 and Stxbp5. These data in part support the earlier observation that preference is linked to Ras/Mapk pathways. PMID:28058793

  19. An iterative network partition algorithm for accurate identification of dense network modules

    PubMed Central

    Sun, Siqi; Dong, Xinran; Fu, Yao; Tian, Weidong

    2012-01-01

    A key step in network analysis is to partition a complex network into dense modules. Currently, modularity is one of the most popular benefit functions used to partition network modules. However, recent studies suggested that it has an inherent limitation in detecting dense network modules. In this study, we observed that despite the limitation, modularity has the advantage of preserving the primary network structure of the undetected modules. Thus, we have developed a simple iterative Network Partition (iNP) algorithm to partition a network. The iNP algorithm provides a general framework in which any modularity-based algorithm can be implemented in the network partition step. Here, we tested iNP with three modularity-based algorithms: multi-step greedy (MSG), spectral clustering and Qcut. Compared with the original three methods, iNP achieved a significant improvement in the quality of network partition in a benchmark study with simulated networks, identified more modules with significantly better enrichment of functionally related genes in both yeast protein complex network and breast cancer gene co-expression network, and discovered more cancer-specific modules in the cancer gene co-expression network. As such, iNP should have a broad application as a general method to assist in the analysis of biological networks. PMID:22121225

  20. FastGCN: A GPU Accelerated Tool for Fast Gene Co-Expression Networks

    PubMed Central

    Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun

    2015-01-01

    Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out. PMID:25602758

  1. FastGCN: a GPU accelerated tool for fast gene co-expression networks.

    PubMed

    Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun

    2015-01-01

    Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.

  2. Analysis of Bos taurus and Sus scrofa X and Y chromosome transcriptome highlights reproductive driver genes

    PubMed Central

    Khan, Faheem Ahmed; Liu, Hui; Zhou, Hao; Wang, Kai; Qamar, Muhammad Tahir Ul; Pandupuspitasari, Nuruliarizki Shinta; Shujun, Zhang

    2017-01-01

    The biology of sperm, its capability of fertilizing an egg and its role in sex ratio are the major biological questions in reproductive biology. To answer these question we integrated X and Y chromosome transcriptome across different species: Bos taurus and Sus scrofa and identified reproductive driver genes based on Weighted Gene Co-Expression Network Analysis (WGCNA) algorithm. Our strategy resulted in 11007 and 10445 unique genes consisting of 9 and 11 reproductive modules in Bos taurus and Sus scrofa, respectively. The consensus module calculation yields an overall 167 overlapped genes which were mapped to 846 DEGs in Bos taurus to finally get a list of 67 dual feature genes. We develop gene co-expression network of selected 67 genes that consists of 58 nodes (27 down-regulated and 31 up-regulated genes) enriched to 66 GO biological process (BP) including 6 GO annotations related to reproduction and two KEGG pathways. Moreover, we searched significantly related TF (ISRE, AP1FJ, RP58, CREL) and miRNAs (bta-miR-181a, bta-miR-17-5p, bta-miR-146b, bta-miR-146a) which targeted the genes in co-expression network. In addition we performed genetic analysis including phylogenetic, functional domain identification, epigenetic modifications, mutation analysis of the most important reproductive driver genes PRM1, PPP2R2B and PAFAH1B1 and finally performed a protein docking analysis to visualize their therapeutic and gene expression regulation ability. PMID:28903352

  3. Co-expression Network Approach to Studying the Effects of Botulinum Neurotoxin-A.

    PubMed

    Mukund, Kavitha; Ward, Samuel R; Lieber, Richard L; Subramaniam, Shankar

    2017-10-16

    Botulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules, further hierarchically clustered into 5 groups. Quantifying average expression and co-expression patterns across these groups revealed temporal aspects of muscle's response to BoNT-A. Functional analysis revealed enrichment of group 1 with metabolism; group 5 with contradictory functions of atrophy and cellular recovery; and groups 2 and 3 with extracellular matrix (ECM) and non-fast fiber isoforms. Topological positioning of two highly ranked, significantly expressed genes- Dclk1 and Ostalpha within group 5 suggested possible mechanistic roles in recovery from BoNT-A induced atrophy. Phenotypic correlations of groups with titin and myosin protein content further emphasized the effect of BoNT-A on the sarcomeric contraction machinery in early phase of chemodenervation. In summary, our approach revealed a hierarchical functional response to BoNT-A induced paralysis with early metabolic and later ECM responses and identified putative biomarkers associated with chemodenervation. Additionally, our results provide an unbiased validation of the response documented in our previous workBotulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules, further hierarchically clustered into 5 groups. Quantifying average expression and co-expression patterns across these groups revealed temporal aspects of muscle's response to BoNT-A. Functional analysis revealed enrichment of group 1 with metabolism; group 5 with contradictory functions of atrophy and cellular recovery; and groups 2 and 3 with extracellular matrix (ECM) and non-fast fiber isoforms. Topological positioning of two highly ranked, significantly expressed genes- Dclk1 and Ostalpha within group 5 suggested possible mechanistic roles in recovery from BoNT-A induced atrophy. Phenotypic correlations of groups with titin and myosin protein content further emphasized the effect of BoNT-A on the sarcomeric contraction machinery in early phase of chemodenervation. In summary, our approach revealed a hierarchical functional response to BoNT-A induced paralysis with early metabolic and later ECM responses and identified putative biomarkers associated with chemodenervation. Additionally, our results provide an unbiased validation of the response documented in our previous work.

  4. Signed weighted gene co-expression network analysis of transcriptional regulation in murine embryonic stem cells

    PubMed Central

    Mason, Mike J; Fan, Guoping; Plath, Kathrin; Zhou, Qing; Horvath, Steve

    2009-01-01

    Background Recent work has revealed that a core group of transcription factors (TFs) regulates the key characteristics of embryonic stem (ES) cells: pluripotency and self-renewal. Current efforts focus on identifying genes that play important roles in maintaining pluripotency and self-renewal in ES cells and aim to understand the interactions among these genes. To that end, we investigated the use of unsigned and signed network analysis to identify pluripotency and differentiation related genes. Results We show that signed networks provide a better systems level understanding of the regulatory mechanisms of ES cells than unsigned networks, using two independent murine ES cell expression data sets. Specifically, using signed weighted gene co-expression network analysis (WGCNA), we found a pluripotency module and a differentiation module, which are not identified in unsigned networks. We confirmed the importance of these modules by incorporating genome-wide TF binding data for key ES cell regulators. Interestingly, we find that the pluripotency module is enriched with genes related to DNA damage repair and mitochondrial function in addition to transcriptional regulation. Using a connectivity measure of module membership, we not only identify known regulators of ES cells but also show that Mrpl15, Msh6, Nrf1, Nup133, Ppif, Rbpj, Sh3gl2, and Zfp39, among other genes, have important roles in maintaining ES cell pluripotency and self-renewal. We also report highly significant relationships between module membership and epigenetic modifications (histone modifications and promoter CpG methylation status), which are known to play a role in controlling gene expression during ES cell self-renewal and differentiation. Conclusion Our systems biologic re-analysis of gene expression, transcription factor binding, epigenetic and gene ontology data provides a novel integrative view of ES cell biology. PMID:19619308

  5. Identifying osteosarcoma metastasis associated genes by weighted gene co-expression network analysis (WGCNA).

    PubMed

    Tian, Honglai; Guan, Donghui; Li, Jianmin

    2018-06-01

    Osteosarcoma (OS), the most common malignant bone tumor, accounts for the heavy healthy threat in the period of children and adolescents. OS occurrence usually correlates with early metastasis and high death rate. This study aimed to better understand the mechanism of OS metastasis.Based on Gene Expression Omnibus (GEO) database, we downloaded 4 expression profile data sets associated with OS metastasis, and selected differential expressed genes. Weighted gene co-expression network analysis (WGCNA) approach allowed us to investigate the most OS metastasis-correlated module. Gene Ontology functional and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were used to give annotation of selected OS metastasis-associated genes.We select 897 differential expressed genes from OS metastasis and OS non-metastasis groups. Based on these selected genes, WGCNA further explored 142 genes included in the most OS metastasis-correlated module. Gene Ontology functional and KEGG pathway enrichment analyses showed that significantly OS metastasis-associated genes were involved in pathway correlated with insulin-like growth factor binding.Our research figured out several potential molecules participating in metastasis process and factors acting as biomarker. With this study, we could better explore the mechanism of OS metastasis and further discover more therapy targets.

  6. Coexpression network analysis identifies transcriptional modules associated with genomic alterations in neuroblastoma.

    PubMed

    Yang, Liulin; Li, Yun; Wei, Zhi; Chang, Xiao

    2018-06-01

    Neuroblastoma is a highly complex and heterogeneous cancer in children. Acquired genomic alterations including MYCN amplification, 1p deletion and 11q deletion are important risk factors and biomarkers in neuroblastoma. Here, we performed a co-expression-based gene network analysis to study the intrinsic association between specific genomic changes and transcriptome organization. We identified multiple gene coexpression modules which are recurrent in two independent datasets and associated with functional pathways including nervous system development, cell cycle, immune system process and extracellular matrix/space. Our results also indicated that modules involved in nervous system development and cell cycle are highly associated with MYCN amplification and 1p deletion, while modules responding to immune system process are associated with MYCN amplification only. In summary, this integrated analysis provides novel insights into molecular heterogeneity and pathogenesis of neuroblastoma. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang. Copyright © 2017. Published by Elsevier B.V.

  7. Is My Network Module Preserved and Reproducible?

    PubMed Central

    Langfelder, Peter; Luo, Rui; Oldham, Michael C.; Horvath, Steve

    2011-01-01

    In many applications, one is interested in determining which of the properties of a network module change across conditions. For example, to validate the existence of a module, it is desirable to show that it is reproducible (or preserved) in an independent test network. Here we study several types of network preservation statistics that do not require a module assignment in the test network. We distinguish network preservation statistics by the type of the underlying network. Some preservation statistics are defined for a general network (defined by an adjacency matrix) while others are only defined for a correlation network (constructed on the basis of pairwise correlations between numeric variables). Our applications show that the correlation structure facilitates the definition of particularly powerful module preservation statistics. We illustrate that evaluating module preservation is in general different from evaluating cluster preservation. We find that it is advantageous to aggregate multiple preservation statistics into summary preservation statistics. We illustrate the use of these methods in six gene co-expression network applications including 1) preservation of cholesterol biosynthesis pathway in mouse tissues, 2) comparison of human and chimpanzee brain networks, 3) preservation of selected KEGG pathways between human and chimpanzee brain networks, 4) sex differences in human cortical networks, 5) sex differences in mouse liver networks. While we find no evidence for sex specific modules in human cortical networks, we find that several human cortical modules are less preserved in chimpanzees. In particular, apoptosis genes are differentially co-expressed between humans and chimpanzees. Our simulation studies and applications show that module preservation statistics are useful for studying differences between the modular structure of networks. Data, R software and accompanying tutorials can be downloaded from the following webpage: http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/ModulePreservation. PMID:21283776

  8. svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification.

    PubMed

    Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Zhu, Dongxiao; Zhang, Kun

    2010-06-22

    Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biological themes. Grouping genes with a similar expression pattern or exhibiting co-expression together is a starting point in understanding and analyzing gene expression data. In recent literature, gene module level analysis is advocated in order to understand biological network design and system behaviors in disease and life processes; however, practical difficulties often lie in the implementation of existing methods. Using the singular value decomposition (SVD) technique, we developed a new computational tool, named svdPPCS (SVD-based Pattern Pairing and Chart Splitting), to identify conserved and divergent co-expression modules of two sets of microarray experiments. In the proposed methods, gene modules are identified by splitting the two-way chart coordinated with a pair of left singular vectors factorized from the gene expression matrices of the two biological categories. Importantly, the cutoffs are determined by a data-driven algorithm using the well-defined statistic, SVD-p. The implementation was illustrated on two time series microarray data sets generated from the samples of accessory gland (ACG) and malpighian tubule (MT) tissues of the line W118 of M. drosophila. Two conserved modules and six divergent modules, each of which has a unique characteristic profile across tissue kinds and aging processes, were identified. The number of genes contained in these models ranged from five to a few hundred. Three to over a hundred GO terms were over-represented in individual modules with FDR < 0.1. One divergent module suggested the tissue-specific relationship between the expressions of mitochondrion-related genes and the aging process. This finding, together with others, may be of biological significance. The validity of the proposed SVD-based method was further verified by a simulation study, as well as the comparisons with regression analysis and cubic spline regression analysis plus PAM based clustering. svdPPCS is a novel computational tool for the comparative analysis of transcriptional profiling. It especially fits the comparison of time series data of related organisms or different tissues of the same organism under equivalent or similar experimental conditions. The general scheme can be directly extended to the comparisons of multiple data sets. It also can be applied to the integration of data sets from different platforms and of different sources.

  9. Functional Module Analysis for Gene Coexpression Networks with Network Integration.

    PubMed

    Zhang, Shuqin; Zhao, Hongyu; Ng, Michael K

    2015-01-01

    Network has been a general tool for studying the complex interactions between different genes, proteins, and other small molecules. Module as a fundamental property of many biological networks has been widely studied and many computational methods have been proposed to identify the modules in an individual network. However, in many cases, a single network is insufficient for module analysis due to the noise in the data or the tuning of parameters when building the biological network. The availability of a large amount of biological networks makes network integration study possible. By integrating such networks, more informative modules for some specific disease can be derived from the networks constructed from different tissues, and consistent factors for different diseases can be inferred. In this paper, we have developed an effective method for module identification from multiple networks under different conditions. The problem is formulated as an optimization model, which combines the module identification in each individual network and alignment of the modules from different networks together. An approximation algorithm based on eigenvector computation is proposed. Our method outperforms the existing methods, especially when the underlying modules in multiple networks are different in simulation studies. We also applied our method to two groups of gene coexpression networks for humans, which include one for three different cancers, and one for three tissues from the morbidly obese patients. We identified 13 modules with three complete subgraphs, and 11 modules with two complete subgraphs, respectively. The modules were validated through Gene Ontology enrichment and KEGG pathway enrichment analysis. We also showed that the main functions of most modules for the corresponding disease have been addressed by other researchers, which may provide the theoretical basis for further studying the modules experimentally.

  10. Identifying a gene expression signature of cluster headache in blood

    PubMed Central

    Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

    2017-01-01

    Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859

  11. Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model

    PubMed Central

    Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O.; Eijlander, Robyn T.; Kuipers, Oscar P.

    2018-01-01

    Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes. PMID:29424683

  12. Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model.

    PubMed

    Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O; Eijlander, Robyn T; Kuipers, Oscar P

    2018-02-09

    Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes.

  13. Massive-Scale Gene Co-Expression Network Construction and Robustness Testing Using Random Matrix Theory

    PubMed Central

    Isaacson, Sven; Luo, Feng; Feltus, Frank A.; Smith, Melissa C.

    2013-01-01

    The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust. PMID:23409071

  14. Sequencing-based gene network analysis provides a core set of gene resource for understanding thermal adaptation in Zhikong scallop Chlamys farreri.

    PubMed

    Fu, X; Sun, Y; Wang, J; Xing, Q; Zou, J; Li, R; Wang, Z; Wang, S; Hu, X; Zhang, L; Bao, Z

    2014-01-01

    Marine organisms are commonly exposed to variable environmental conditions, and many of them are under threat from increased sea temperatures caused by global climate change. Generating transcriptomic resources under different stress conditions are crucial for understanding molecular mechanisms underlying thermal adaptation. In this study, we conducted transcriptome-wide gene expression profiling of the scallop Chlamys farreri challenged by acute and chronic heat stress. Of the 13 953 unique tags, more than 850 were significantly differentially expressed at each time point after acute heat stress, which was more than the number of tags differentially expressed (320-350) under chronic heat stress. To obtain a systemic view of gene expression alterations during thermal stress, a weighted gene coexpression network was constructed. Six modules were identified as acute heat stress-responsive modules. Among them, four modules involved in apoptosis regulation, mRNA binding, mitochondrial envelope formation and oxidation reduction were downregulated. The remaining two modules were upregulated. One was enriched with chaperone and the other with microsatellite sequences, whose coexpression may originate from a transcription factor binding site. These results indicated that C. farreri triggered several cellular processes to acclimate to elevated temperature. No modules responded to chronic heat stress, suggesting that the scallops might have acclimated to elevated temperature within 3 days. This study represents the first sequencing-based gene network analysis in a nonmodel aquatic species and provides valuable gene resources for the study of thermal adaptation, which should assist in the development of heat-tolerant scallop lines for aquaculture. © 2013 John Wiley & Sons Ltd.

  15. Identification of rice genes associated with cosmic-ray response via co-expression gene network analysis.

    PubMed

    Hwang, Sun-Goo; Kim, Dong Sub; Hwang, Jung Eun; Han, A-Reum; Jang, Cheol Seong

    2014-05-15

    In order to better understand the biological systems that are affected in response to cosmic ray (CR), we conducted weighted gene co-expression network analysis using the module detection method. By using the Pearson's correlation coefficient (PCC) value, we evaluated complex gene-gene functional interactions between 680 CR-responsive probes from integrated microarray data sets, which included large-scale transcriptional profiling of 1000 microarray samples. These probes were divided into 6 distinct modules that contained 20 enriched gene ontology (GO) functions, such as oxidoreductase activity, hydrolase activity, and response to stimulus and stress. In particular, modules 1 and 2 commonly showed enriched annotation categories such as oxidoreductase activity, including enriched cis-regulatory elements known as ROS-specific regulators. These results suggest that the ROS-mediated irradiation response pathway is affected by CR in modules 1 and 2. We found 243 ionizing radiation (IR)-responsive probes that exhibited similarities in expression patterns in various irradiation microarray data sets. The expression patterns of 6 randomly selected IR-responsive genes were evaluated by quantitative reverse transcription polymerase chain reaction following treatment with CR, gamma rays (GR), and ion beam (IB); similar patterns were observed among these genes under these 3 treatments. Moreover, we constructed subnetworks of IR-responsive genes and evaluated the expression levels of their neighboring genes following GR treatment; similar patterns were observed among them. These results of network-based analyses might provide a clue to understanding the complex biological system related to the CR response in plants. Copyright © 2014 Elsevier B.V. All rights reserved.

  16. Evidence of inflammatory immune signaling in chronic fatigue syndrome: A pilot study of gene expression in peripheral blood.

    PubMed

    Aspler, Anne L; Bolshin, Carly; Vernon, Suzanne D; Broderick, Gordon

    2008-09-26

    Genomic profiling of peripheral blood reveals altered immunity in chronic fatigue syndrome (CFS) however interpretation remains challenging without immune demographic context. The object of this work is to identify modulation of specific immune functional components and restructuring of co-expression networks characteristic of CFS using the quantitative genomics of peripheral blood. Gene sets were constructed a priori for CD4+ T cells, CD8+ T cells, CD19+ B cells, CD14+ monocytes and CD16+ neutrophils from published data. A group of 111 women were classified using empiric case definition (U.S. Centers for Disease Control and Prevention) and unsupervised latent cluster analysis (LCA). Microarray profiles of peripheral blood were analyzed for expression of leukocyte-specific gene sets and characteristic changes in co-expression identified from topological evaluation of linear correlation networks. Median expression for a set of 6 genes preferentially up-regulated in CD19+ B cells was significantly lower in CFS (p = 0.01) due mainly to PTPRK and TSPAN3 expression. Although no other gene set was differentially expressed at p < 0.05, patterns of co-expression in each group differed markedly. Significant co-expression of CD14+ monocyte with CD16+ neutrophil (p = 0.01) and CD19+ B cell sets (p = 0.00) characterized CFS and fatigue phenotype groups. Also in CFS was a significant negative correlation between CD8+ and both CD19+ up-regulated (p = 0.02) and NK gene sets (p = 0.08). These patterns were absent in controls. Dissection of blood microarray profiles points to B cell dysfunction with coordinated immune activation supporting persistent inflammation and antibody-mediated NK cell modulation of T cell activity. This has clinical implications as the CD19+ genes identified could provide robust and biologically meaningful basis for the early detection and unambiguous phenotyping of CFS.

  17. FoxP2 isoforms delineate spatiotemporal transcriptional networks for vocal learning in the zebra finch

    PubMed Central

    Day, Nancy F; Kimball, Todd Haswell; Aamodt, Caitlin M; Heston, Jonathan B; Hilliard, Austin T; Xiao, Xinshu; White, Stephanie A

    2018-01-01

    Human speech is one of the few examples of vocal learning among mammals yet ~half of avian species exhibit this ability. Its neurogenetic basis is largely unknown beyond a shared requirement for FoxP2 in both humans and zebra finches. We manipulated FoxP2 isoforms in Area X, a song-specific region of the avian striatopallidum analogous to human anterior striatum, during a critical period for song development. We delineate, for the first time, unique contributions of each isoform to vocal learning. Weighted gene coexpression network analysis of RNA-seq data revealed gene modules correlated to singing, learning, or vocal variability. Coexpression related to singing was found in juvenile and adult Area X whereas coexpression correlated to learning was unique to juveniles. The confluence of learning and singing coexpression in juvenile Area X may underscore molecular processes that drive vocal learning in young zebra finches and, by analogy, humans. PMID:29360038

  18. Gene Expression Correlated with Severe Asthma Characteristics Reveals Heterogeneous Mechanisms of Severe Disease.

    PubMed

    Modena, Brian D; Bleecker, Eugene R; Busse, William W; Erzurum, Serpil C; Gaston, Benjamin M; Jarjour, Nizar N; Meyers, Deborah A; Milosevic, Jadranka; Tedrow, John R; Wu, Wei; Kaminski, Naftali; Wenzel, Sally E

    2017-06-01

    Severe asthma (SA) is a heterogeneous disease with multiple molecular mechanisms. Gene expression studies of bronchial epithelial cells in individuals with asthma have provided biological insight and underscored possible mechanistic differences between individuals. Identify networks of genes reflective of underlying biological processes that define SA. Airway epithelial cell gene expression from 155 subjects with asthma and healthy control subjects in the Severe Asthma Research Program was analyzed by weighted gene coexpression network analysis to identify gene networks and profiles associated with SA and its specific characteristics (i.e., pulmonary function tests, quality of life scores, urgent healthcare use, and steroid use), which potentially identified underlying biological processes. A linear model analysis confirmed these findings while adjusting for potential confounders. Weighted gene coexpression network analysis constructed 64 gene network modules, including modules corresponding to T1 and T2 inflammation, neuronal function, cilia, epithelial growth, and repair mechanisms. Although no network selectively identified SA, genes in modules linked to epithelial growth and repair and neuronal function were markedly decreased in SA. Several hub genes of the epithelial growth and repair module were found located at the 17q12-21 locus, near a well-known asthma susceptibility locus. T2 genes increased with severity in those treated with corticosteroids but were also elevated in untreated, mild-to-moderate disease compared with healthy control subjects. T1 inflammation, especially when associated with increased T2 gene expression, was elevated in a subgroup of younger patients with SA. In this hypothesis-generating analysis, gene expression networks in relation to asthma severity provided potentially new insight into biological mechanisms associated with the development of SA and its phenotypes.

  19. Gene Expression Correlated with Severe Asthma Characteristics Reveals Heterogeneous Mechanisms of Severe Disease

    PubMed Central

    Modena, Brian D.; Bleecker, Eugene R.; Busse, William W.; Erzurum, Serpil C.; Gaston, Benjamin M.; Jarjour, Nizar N.; Meyers, Deborah A.; Milosevic, Jadranka; Tedrow, John R.; Wu, Wei; Kaminski, Naftali

    2017-01-01

    Rationale: Severe asthma (SA) is a heterogeneous disease with multiple molecular mechanisms. Gene expression studies of bronchial epithelial cells in individuals with asthma have provided biological insight and underscored possible mechanistic differences between individuals. Objectives: Identify networks of genes reflective of underlying biological processes that define SA. Methods: Airway epithelial cell gene expression from 155 subjects with asthma and healthy control subjects in the Severe Asthma Research Program was analyzed by weighted gene coexpression network analysis to identify gene networks and profiles associated with SA and its specific characteristics (i.e., pulmonary function tests, quality of life scores, urgent healthcare use, and steroid use), which potentially identified underlying biological processes. A linear model analysis confirmed these findings while adjusting for potential confounders. Measurements and Main Results: Weighted gene coexpression network analysis constructed 64 gene network modules, including modules corresponding to T1 and T2 inflammation, neuronal function, cilia, epithelial growth, and repair mechanisms. Although no network selectively identified SA, genes in modules linked to epithelial growth and repair and neuronal function were markedly decreased in SA. Several hub genes of the epithelial growth and repair module were found located at the 17q12–21 locus, near a well-known asthma susceptibility locus. T2 genes increased with severity in those treated with corticosteroids but were also elevated in untreated, mild-to-moderate disease compared with healthy control subjects. T1 inflammation, especially when associated with increased T2 gene expression, was elevated in a subgroup of younger patients with SA. Conclusions: In this hypothesis-generating analysis, gene expression networks in relation to asthma severity provided potentially new insight into biological mechanisms associated with the development of SA and its phenotypes. PMID:27984699

  20. Mimosa: Mixture Model of Co-expression to Detect Modulators of Regulatory Interaction

    NASA Astrophysics Data System (ADS)

    Hansen, Matthew; Everett, Logan; Singh, Larry; Hannenhalli, Sridhar

    Functionally related genes tend to be correlated in their expression patterns across multiple conditions and/or tissue-types. Thus co-expression networks are often used to investigate functional groups of genes. In particular, when one of the genes is a transcription factor (TF), the co-expression-based interaction is interpreted, with caution, as a direct regulatory interaction. However, any particular TF, and more importantly, any particular regulatory interaction, is likely to be active only in a subset of experimental conditions. Moreover, the subset of expression samples where the regulatory interaction holds may be marked by presence or absence of a modifier gene, such as an enzyme that post-translationally modifies the TF. Such subtlety of regulatory interactions is overlooked when one computes an overall expression correlation. Here we present a novel mixture modeling approach where a TF-Gene pair is presumed to be significantly correlated (with unknown coefficient) in a (unknown) subset of expression samples. The parameters of the model are estimated using a Maximum Likelihood approach. The estimated mixture of expression samples is then mined to identify genes potentially modulating the TF-Gene interaction. We have validated our approach using synthetic data and on three biological cases in cow and in yeast. While limited in some ways, as discussed, the work represents a novel approach to mine expression data and detect potential modulators of regulatory interactions.

  1. SLC9A9 Co-expression modules in autism-associated brain regions.

    PubMed

    Patak, Jameson; Hess, Jonathan L; Zhang-James, Yanli; Glatt, Stephen J; Faraone, Stephen V

    2017-03-01

    SLC9A9 is a sodium hydrogen exchanger present in the recycling endosome and highly expressed in the brain. It is implicated in neuropsychiatric disorders, including autism spectrum disorders (ASDs). Little research concerning its gene expression patterns and biological pathways has been conducted. We sought to investigate its possible biological roles in autism-associated brain regions throughout development. We conducted a weighted gene co-expression network analysis on RNA-seq data downloaded from Brainspan. We compared prenatal and postnatal gene expression networks for three ASD-associated brain regions known to have high SLC9A9 gene expression. We also performed an ASD-associated single nucleotide polymorphism enrichment analysis and a cell signature enrichment analysis. The modules showed differences in gene constituents (membership), gene number, and connectivity throughout time. SLC9A9 was highly associated with immune system functions, metabolism, apoptosis, endocytosis, and signaling cascades. Gene list comparison with co-immunoprecipitation data was significant for multiple modules. We found a disproportionately high autism risk signal among genes constituting the prenatal hippocampal module. The modules were enriched with astrocyte and oligodendrocyte markers. SLC9A9 is potentially involved in the pathophysiology of ASDs. Our investigation confirmed proposed functions for SLC9A9, such as endocytosis and immune regulation, while also revealing potential roles in mTOR signaling and cell survival.. By providing a concise molecular map and interactions, evidence of cell type and implicated brain regions we hope this will guide future research on SLC9A9. Autism Res 2017, 10: 414-429. © 2016 International Society for Autism Research, Wiley Periodicals, Inc. © 2016 International Society for Autism Research, Wiley Periodicals, Inc.

  2. WGCNA: an R package for weighted correlation network analysis

    PubMed Central

    Langfelder, Peter; Horvath, Steve

    2008-01-01

    Background Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e.g. cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial. Results The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings. Conclusion The WGCNA package provides R functions for weighted correlation network analysis, e.g. co-expression network analysis of gene expression data. The R package along with its source code and additional material are freely available at . PMID:19114008

  3. Full-length Transcriptome Sequencing and Modular Organization Analysis of Naringin/Neoeriocitrin Related Gene Expression Pattern in Drynaria roosii.

    PubMed

    Sun, Mei-Yu; Li, Jing-Yi; Li, Dong; Huang, Feng-Jie; Wang, Di; Li, Hui; Xing, Quan; Zhu, Hui-Bin; Shi, Lei

    2018-04-12

    Drynaria roosii (Nakaike) is a traditional Chinese medicinal fern, known as 'GuSuiBu'. The corresponding effective components of naringin/neoeriocitrin share highly similar chemical structure and medicinal function. Our HPLC-MS/MS results showed that the accumulation of naringin/neoeriocitrin depended on specific tissues or ages. However, little was known about the expression patterns of naringin/neoeriocitrin related genes involved in their regulatory pathways. For lack of the basic genetic information, we applied a combination of SMRT sequencing and SGS to generate the complete and full-length transcriptome of D. roosii. According to the SGS data, the DEG-based heat map analysis revealed the naringin/neoeriocitrin related gene expression exhibited obvious tissue- and time-specific transcriptomic differences. Using the systems biology method of modular organization analysis, we clustered 16,472 DEGs into 17 gene modules and studied the relationships between modules and tissue/time point samples, as well as modules and naringin/neoeriocitrin contents. Hereinto, naringin/neoeriocitrin related DEGs distributed in nine distinct modules, and DEGs in these modules showed significant different patterns of transcript abundance to be linked with specific tissues or ages. Moreover, WGCNA results further identified that PAL, 4CL, C4H and C3H, HCT acted as the major hub genes involved in naringin and neoeriocitrin synthesis respectively and exhibited high co-expression with MYB- and bHLH-regulated genes. In this work, modular organization and co-expression networks elucidated the tissue- and time-specificity of gene expression pattern, as well as hub genes associated with naringin/neoeriocitrin synthesis in D. roosii. Simultaneously, the comprehensive transcriptome dataset provided the important genetic information for further research on D. roosii.

  4. Partial Roc Reveals Superiority of Mutual Rank of Pearson's Correlation Coefficient as a Coexpression Measure to Elucidate Functional Association of Genes

    NASA Astrophysics Data System (ADS)

    Obayashi, Takeshi; Kinoshita, Kengo

    2013-01-01

    Gene coexpression analysis is a powerful approach to elucidate gene function. We have established and developed this approach using vast amount of publicly available gene expression data measured by microarray techniques. The coexpressed genes are used to estimate gene function of the guide gene or to construct gene coexpression networks. In the case to construct gene networks, researchers should introduce an arbitrary threshold of gene coexpression, because gene coexpression value is continuous value. In the viewpoint to introduce common threshold of gene coexpression, we previously reported rank of Pearson's correlation coefficient (PCC) is more useful than the original PCC value. In this manuscript, we re-assessed the measure of gene coexpression to construct gene coexpression network, and found that mutual rank (MR) of PCC showed better performance than rank of PCC and the original PCC in low false positive rate.

  5. Transcriptomic changes during maize roots development responsive to Cadmium (Cd) pollution using comparative RNAseq-based approach

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peng, Hua; Sichuan Tourism College, Chengdu, 610000, Sichuan; He, Xiujing

    The heavy metal cadmium (Cd), acts as a widespread environmental contaminant, which has shown to adversely affect human health, food safety and ecosystem safety in recent years. However, research on how plant respond to various kinds of heavy metal stress is scarcely reported, especially for understanding of complex molecular regulatory mechanisms and elucidating the gene networks of plant respond to Cd stress. Here, transcriptomic changes during Mo17 and B73 seedlings development responsive to Cd pollution were investigated and comparative RNAseq-based approach in both genotypes were performed. 115 differential expression genes (DEGs) with significant alteration in expression were found co-modulated inmore » both genotypes during the maize seedling development; of those, most of DGEs were found comprised of stress and defense responses proteins, transporters, as well as transcription factors, such as thaumatin-like protein, ZmOPR2 and ZmOPR5. More interestingly, genotype-specific transcriptional factors changes induced by Cd stress were found contributed to the regulatory mechanism of Cd sensitivity in both different genotypes. Moreover, 12 co-expression modules associated with specific biological processes or pathways (M1 to M12) were identified by consensus co-expression network. These results will expand our understanding of complex molecular mechanism of response and defense to Cd exposure in maize seedling roots. - Highlights: • Transcriptomic changes responsive to Cd pollution using comparative RNAseq-based approach. • 115 differential expression genes (DEGs) were found co-modulated in both genotypes. • Most of DGEs belong to stress and defense responses proteins, transporters, transcription factors. • 12 co-expression modules associated with specific biological processes or pathways. • Genotype-specific transcriptional factors changes induced by Cd stress were found.« less

  6. Gene networks specific for innate immunity define post-traumatic stress disorder.

    PubMed

    Breen, M S; Maihofer, A X; Glatt, S J; Tylee, D S; Chandler, S D; Tsuang, M T; Risbrough, V B; Baker, D G; O'Connor, D T; Nievergelt, C M; Woelk, C H

    2015-12-01

    The molecular factors involved in the development of Post-Traumatic Stress Disorder (PTSD) remain poorly understood. Previous transcriptomic studies investigating the mechanisms of PTSD apply targeted approaches to identify individual genes under a cross-sectional framework lack a holistic view of the behaviours and properties of these genes at the system-level. Here we sought to apply an unsupervised gene-network based approach to a prospective experimental design using whole-transcriptome RNA-Seq gene expression from peripheral blood leukocytes of U.S. Marines (N=188), obtained both pre- and post-deployment to conflict zones. We identified discrete groups of co-regulated genes (i.e., co-expression modules) and tested them for association to PTSD. We identified one module at both pre- and post-deployment containing putative causal signatures for PTSD development displaying an over-expression of genes enriched for functions of innate-immune response and interferon signalling (Type-I and Type-II). Importantly, these results were replicated in a second non-overlapping independent dataset of U.S. Marines (N=96), further outlining the role of innate immune and interferon signalling genes within co-expression modules to explain at least part of the causal pathophysiology for PTSD development. A second module, consequential of trauma exposure, contained PTSD resiliency signatures and an over-expression of genes involved in hemostasis and wound responsiveness suggesting that chronic levels of stress impair proper wound healing during/after exposure to the battlefield while highlighting the role of the hemostatic system as a clinical indicator of chronic-based stress. These findings provide novel insights for early preventative measures and advanced PTSD detection, which may lead to interventions that delay or perhaps abrogate the development of PTSD.

  7. Identification of conserved drought stress responsive gene-network across tissues and developmental stages in rice.

    PubMed

    Smita, Shuchi; Katiyar, Amit; Pandey, Dev Mani; Chinnusamy, Viswanathan; Archak, Sunil; Bansal, Kailash Chander

    2013-01-01

    Identification of genes that are coexpressed across various tissues and environmental stresses is biologically interesting, since they may play coordinated role in similar biological processes. Genes with correlated expression patterns can be best identified by using coexpression network analysis of transcriptome data. In the present study, we analyzed the temporal-spatial coordination of gene expression in root, leaf and panicle of rice under drought stress and constructed network using WGCNA and Cytoscape. Total of 2199 differentially expressed genes (DEGs) were identified in at least three or more tissues, wherein 88 genes have coordinated expression profile among all the six tissues under drought stress. These 88 highly coordinated genes were further subjected to module identification in the coexpression network. Based on chief topological properties we identified 18 hub genes such as ABC transporter, ATP-binding protein, dehydrin, protein phosphatase 2C, LTPL153 - Protease inhibitor, phosphatidylethanolaminebinding protein, lactose permease-related, NADP-dependent malic enzyme, etc. Motif enrichment analysis showed the presence of ABRE cis-elements in the promoters of > 62% of the coordinately expressed genes. Our results suggest that drought stress mediated upregulated gene expression was coordinated through an ABA-dependent signaling pathway across tissues, at least for the subset of genes identified in this study, while down regulation appears to be regulated by tissue specific pathways in rice.

  8. Differentially Coexpressed Disease Gene Identification Based on Gene Coexpression Network.

    PubMed

    Jiang, Xue; Zhang, Han; Quan, Xiongwen

    2016-01-01

    Screening disease-related genes by analyzing gene expression data has become a popular theme. Traditional disease-related gene selection methods always focus on identifying differentially expressed gene between case samples and a control group. These traditional methods may not fully consider the changes of interactions between genes at different cell states and the dynamic processes of gene expression levels during the disease progression. However, in order to understand the mechanism of disease, it is important to explore the dynamic changes of interactions between genes in biological networks at different cell states. In this study, we designed a novel framework to identify disease-related genes and developed a differentially coexpressed disease-related gene identification method based on gene coexpression network (DCGN) to screen differentially coexpressed genes. We firstly constructed phase-specific gene coexpression network using time-series gene expression data and defined the conception of differential coexpression of genes in coexpression network. Then, we designed two metrics to measure the value of gene differential coexpression according to the change of local topological structures between different phase-specific networks. Finally, we conducted meta-analysis of gene differential coexpression based on the rank-product method. Experimental results demonstrated the feasibility and effectiveness of DCGN and the superior performance of DCGN over other popular disease-related gene selection methods through real-world gene expression data sets.

  9. Discovering Condition-Specific Gene Co-Expression Patterns Using Gaussian Mixture Models: A Cancer Case Study.

    PubMed

    Ficklin, Stephen P; Dunwoodie, Leland J; Poehlman, William L; Watson, Christopher; Roche, Kimberly E; Feltus, F Alex

    2017-08-17

    A gene co-expression network (GCN) describes associations between genes and points to genetic coordination of biochemical pathways. However, genetic correlations in a GCN are only detectable if they are present in the sampled conditions. With the increasing quantity of gene expression samples available in public repositories, there is greater potential for discovery of genetic correlations from a variety of biologically interesting conditions. However, even if gene correlations are present, their discovery can be masked by noise. Noise is introduced from natural variation (intrinsic and extrinsic), systematic variation (caused by sample measurement protocols and instruments), and algorithmic and statistical variation created by selection of data processing tools. A variety of published studies, approaches and methods attempt to address each of these contributions of variation to reduce noise. Here we describe an approach using Gaussian Mixture Models (GMMs) to address natural extrinsic (condition-specific) variation during network construction from mixed input conditions. To demonstrate utility, we build and analyze a condition-annotated GCN from a compendium of 2,016 mixed gene expression data sets from five tumor subtypes obtained from The Cancer Genome Atlas. Our results show that GMMs help discover tumor subtype specific gene co-expression patterns (modules) that are significantly enriched for clinical attributes.

  10. Detection of Significant Pneumococcal Meningitis Biomarkers by Ego Network.

    PubMed

    Wang, Qian; Lou, Zhifeng; Zhai, Liansuo; Zhao, Haibin

    2017-06-01

    To identify significant biomarkers for detection of pneumococcal meningitis based on ego network. Based on the gene expression data of pneumococcal meningitis and global protein-protein interactions (PPIs) data recruited from open access databases, the authors constructed a differential co-expression network (DCN) to identify pneumococcal meningitis biomarkers in a network view. Here EgoNet algorithm was employed to screen the significant ego networks that could accurately distinguish pneumococcal meningitis from healthy controls, by sequentially seeking ego genes, searching candidate ego networks, refinement of candidate ego networks and significance analysis to identify ego networks. Finally, the functional inference of the ego networks was performed to identify significant pathways for pneumococcal meningitis. By differential co-expression analysis, the authors constructed the DCN that covered 1809 genes and 3689 interactions. From the DCN, a total of 90 ego genes were identified. Starting from these ego genes, three significant ego networks (Module 19, Module 70 and Module 71) that could predict clinical outcomes for pneumococcal meningitis were identified by EgoNet algorithm, and the corresponding ego genes were GMNN, MAD2L1 and TPX2, respectively. Pathway analysis showed that these three ego networks were related to CDT1 association with the CDC6:ORC:origin complex, inactivation of APC/C via direct inhibition of the APC/C complex pathway, and DNA strand elongation, respectively. The authors successfully screened three significant ego modules which could accurately predict the clinical outcomes for pneumococcal meningitis and might play important roles in host response to pathogen infection in pneumococcal meningitis.

  11. Network-based differential gene expression analysis suggests cell cycle related genes regulated by E2F1 underlie the molecular difference between smoker and non-smoker lung adenocarcinoma

    PubMed Central

    2013-01-01

    Background Differential gene expression (DGE) analysis is commonly used to reveal the deregulated molecular mechanisms of complex diseases. However, traditional DGE analysis (e.g., the t test or the rank sum test) tests each gene independently without considering interactions between them. Top-ranked differentially regulated genes prioritized by the analysis may not directly relate to the coherent molecular changes underlying complex diseases. Joint analyses of co-expression and DGE have been applied to reveal the deregulated molecular modules underlying complex diseases. Most of these methods consist of separate steps: first to identify gene-gene relationships under the studied phenotype then to integrate them with gene expression changes for prioritizing signature genes, or vice versa. It is warrant a method that can simultaneously consider gene-gene co-expression strength and corresponding expression level changes so that both types of information can be leveraged optimally. Results In this paper, we develop a gene module based method for differential gene expression analysis, named network-based differential gene expression (nDGE) analysis, a one-step integrative process for prioritizing deregulated genes and grouping them into gene modules. We demonstrate that nDGE outperforms existing methods in prioritizing deregulated genes and discovering deregulated gene modules using simulated data sets. When tested on a series of smoker and non-smoker lung adenocarcinoma data sets, we show that top differentially regulated genes identified by the rank sum test in different sets are not consistent while top ranked genes defined by nDGE in different data sets significantly overlap. nDGE results suggest that a differentially regulated gene module, which is enriched for cell cycle related genes and E2F1 targeted genes, plays a role in the molecular differences between smoker and non-smoker lung adenocarcinoma. Conclusions In this paper, we develop nDGE to prioritize deregulated genes and group them into gene modules by simultaneously considering gene expression level changes and gene-gene co-regulations. When applied to both simulated and empirical data, nDGE outperforms the traditional DGE method. More specifically, when applied to smoker and non-smoker lung cancer sets, nDGE results illustrate the molecular differences between smoker and non-smoker lung cancer. PMID:24341432

  12. Transcriptomic profiling in muscle and adipose tissue identifies genes related to growth and lipid deposition

    PubMed Central

    Pang, Jianhui; Zhong, Zhijun; Chen, Xiaohui; Yang, Yuekui; Zeng, Kai; Kang, Runming; Lei, Yunfeng; Ying, Sancheng; Gong, Jianjun; Gu, Yiren

    2017-01-01

    Growth performance and meat quality are important traits for the pig industry and consumers. Adipose tissue is the main site at which fat storage and fatty acid synthesis occur. Therefore, we combined high-throughput transcriptomic sequencing in adipose and muscle tissues with the quantification of corresponding phenotypic features using seven Chinese indigenous pig breeds and one Western commercial breed (Yorkshire). We obtained data on 101 phenotypic traits, from which principal component analysis distinguished two groups: one associated with the Chinese breeds and one with Yorkshire. The numbers of differentially expressed genes between all Chinese breeds and Yorkshire were shown to be 673 and 1056 in adipose and muscle tissues, respectively. Functional enrichment analysis revealed that these genes are associated with biological functions and canonical pathways related to oxidoreductase activity, immune response, and metabolic process. Weighted gene coexpression network analysis found more coexpression modules significantly correlated with the measured phenotypic traits in adipose than in muscle, indicating that adipose regulates meat and carcass quality. Using the combination of differential expression, QTL information, gene significance, and module hub genes, we identified a large number of candidate genes potentially related to economically important traits in pig, which should help us improve meat production and quality. PMID:28877211

  13. Transcriptomic profiling in muscle and adipose tissue identifies genes related to growth and lipid deposition.

    PubMed

    Tao, Xuan; Liang, Yan; Yang, Xuemei; Pang, Jianhui; Zhong, Zhijun; Chen, Xiaohui; Yang, Yuekui; Zeng, Kai; Kang, Runming; Lei, Yunfeng; Ying, Sancheng; Gong, Jianjun; Gu, Yiren; Lv, Xuebin

    2017-01-01

    Growth performance and meat quality are important traits for the pig industry and consumers. Adipose tissue is the main site at which fat storage and fatty acid synthesis occur. Therefore, we combined high-throughput transcriptomic sequencing in adipose and muscle tissues with the quantification of corresponding phenotypic features using seven Chinese indigenous pig breeds and one Western commercial breed (Yorkshire). We obtained data on 101 phenotypic traits, from which principal component analysis distinguished two groups: one associated with the Chinese breeds and one with Yorkshire. The numbers of differentially expressed genes between all Chinese breeds and Yorkshire were shown to be 673 and 1056 in adipose and muscle tissues, respectively. Functional enrichment analysis revealed that these genes are associated with biological functions and canonical pathways related to oxidoreductase activity, immune response, and metabolic process. Weighted gene coexpression network analysis found more coexpression modules significantly correlated with the measured phenotypic traits in adipose than in muscle, indicating that adipose regulates meat and carcass quality. Using the combination of differential expression, QTL information, gene significance, and module hub genes, we identified a large number of candidate genes potentially related to economically important traits in pig, which should help us improve meat production and quality.

  14. Identifying modules of coexpressed transcript units and their organization of Saccharopolyspora erythraea from time series gene expression profiles.

    PubMed

    Chang, Xiao; Liu, Shuai; Yu, Yong-Tao; Li, Yi-Xue; Li, Yuan-Yuan

    2010-08-12

    The Saccharopolyspora erythraea genome sequence was released in 2007. In order to look at the gene regulations at whole transcriptome level, an expression microarray was specifically designed on the S. erythraea strain NRRL 2338 genome sequence. Based on these data, we set out to investigate the potential transcriptional regulatory networks and their organization. In view of the hierarchical structure of bacterial transcriptional regulation, we constructed a hierarchical coexpression network at whole transcriptome level. A total of 27 modules were identified from 1255 differentially expressed transcript units (TUs) across time course, which were further classified in to four groups. Functional enrichment analysis indicated the biological significance of our hierarchical network. It was indicated that primary metabolism is activated in the first rapid growth phase (phase A), and secondary metabolism is induced when the growth is slowed down (phase B). Among the 27 modules, two are highly correlated to erythromycin production. One contains all genes in the erythromycin-biosynthetic (ery) gene cluster and the other seems to be associated with erythromycin production by sharing common intermediate metabolites. Non-concomitant correlation between production and expression regulation was observed. Especially, by calculating the partial correlation coefficients and building the network based on Gaussian graphical model, intrinsic associations between modules were found, and the association between those two erythromycin production-correlated modules was included as expected. This work created a hierarchical model clustering transcriptome data into coordinated modules, and modules into groups across the time course, giving insight into the concerted transcriptional regulations especially the regulation corresponding to erythromycin production of S. erythraea. This strategy may be extendable to studies on other prokaryotic microorganisms.

  15. Genome-wide patterns of promoter sharing and co-expression in bovine skeletal muscle.

    PubMed

    Gu, Quan; Nagaraj, Shivashankar H; Hudson, Nicholas J; Dalrymple, Brian P; Reverter, Antonio

    2011-01-12

    Gene regulation by transcription factors (TF) is species, tissue and time specific. To better understand how the genetic code controls gene expression in bovine muscle we associated gene expression data from developing Longissimus thoracis et lumborum skeletal muscle with bovine promoter sequence information. We created a highly conserved genome-wide promoter landscape comprising 87,408 interactions relating 333 TFs with their 9,242 predicted target genes (TGs). We discovered that the complete set of predicted TGs share an average of 2.75 predicted TF binding sites (TFBSs) and that the average co-expression between a TF and its predicted TGs is higher than the average co-expression between the same TF and all genes. Conversely, pairs of TFs sharing predicted TGs showed a co-expression correlation higher that pairs of TFs not sharing TGs. Finally, we exploited the co-occurrence of predicted TFBS in the context of muscle-derived functionally-coherent modules including cell cycle, mitochondria, immune system, fat metabolism, muscle/glycolysis, and ribosome. Our findings enabled us to reverse engineer a regulatory network of core processes, and correctly identified the involvement of E2F1, GATA2 and NFKB1 in the regulation of cell cycle, fat, and muscle/glycolysis, respectively. The pivotal implication of our research is two-fold: (1) there exists a robust genome-wide expression signal between TFs and their predicted TGs in cattle muscle consistent with the extent of promoter sharing; and (2) this signal can be exploited to recover the cellular mechanisms underpinning transcription regulation of muscle structure and development in bovine. Our study represents the first genome-wide report linking tissue specific co-expression to co-regulation in a non-model vertebrate.

  16. An integrative systems genetics approach reveals potential causal genes and pathways related to obesity.

    PubMed

    Kogelman, Lisette J A; Zhernakova, Daria V; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N

    2015-10-20

    Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach. Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data. Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. Furthermore, we detected several obesity candidate genes, for example, ENPP1, CTSL, and ABHD12B. To our knowledge, this is the first study to perform an integrated genomics and transcriptomics (eQTL) study using, and modeling, genomic and subcutaneous adipose tissue RNA sequencing data on obesity in a porcine model. We detected several pathways and potential causal genes for obesity. Further validation and investigation may reveal their exact function and association with obesity.

  17. Two microRNA signatures for malignancy and immune infiltration predict overall survival in advanced epithelial ovarian cancer.

    PubMed

    Korsunsky, Ilya; Parameswaran, Janaki; Shapira, Iuliana; Lovecchio, John; Menzin, Andrew; Whyte, Jill; Dos Santos, Lisa; Liang, Sharon; Bhuiya, Tawfiqul; Keogh, Mary; Khalili, Houman; Pond, Cassandra; Liew, Anthony; Shih, Andrew; Gregersen, Peter K; Lee, Annette T

    2017-10-01

    MicroRNAs have been established as key regulators of tumor gene expression and as prime biomarker candidates for clinical phenotypes in epithelial ovarian cancer (EOC). We analyzed the coexpression and regulatory structure of microRNAs and their co-localized gene targets in primary tumor tissue of 20 patients with advanced EOC in order to construct a regulatory signature for clinical prognosis. We performed an integrative analysis to identify two prognostic microRNA/mRNA coexpression modules, each enriched for consistent biological functions. One module, enriched for malignancy-related functions, was found to be upregulated in malignant versus benign samples. The second module, enriched for immune-related functions, was strongly correlated with imputed intratumoral immune infiltrates of T cells, natural killer cells, cytotoxic lymphocytes, and macrophages. We validated the prognostic relevance of the immunological module microRNAs in the publicly available The Cancer Genome Atlas data set. These findings provide novel functional roles for microRNAs in the progression of advanced EOC and possible prognostic signatures for survival. © American Federation for Medical Research (unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  18. The dynamic landscape of gene regulation during Bombyx mori oogenesis.

    PubMed

    Zhang, Qiang; Sun, Wei; Sun, Bang-Yong; Xiao, Yang; Zhang, Ze

    2017-09-11

    Oogenesis in the domestic silkworm (Bombyx mori) is a complex process involving previtellogenesis, vitellogenesis and choriogenesis. During this process, follicles show drastic morphological and physiological changes. However, the genome-wide regulatory profiles of gene expression during oogenesis remain to be determined. In this study, we obtained time-series transcriptome data and used these data to reveal the dynamic landscape of gene regulation during oogenesis. A total of 1932 genes were identified to be differentially expressed among different stages, most of which occurred during the transition from late vitellogenesis to early choriogenesis. Using weighted gene co-expression network analysis, we identified six stage-specific gene modules that correspond to multiple regulatory pathways. Strikingly, the biosynthesis pathway of the molting hormone 20-hydroxyecdysone (20E) was enriched in one of the modules. Further analysis showed that the ecdysteroid 20-hydroxylase gene (CYP314A1) of steroidgenesis genes was mainly expressed in previtellogenesis and early vitellogenesis. However, the 20E-inactivated genes, particularly the ecdysteroid 26-hydroxylase encoding gene (Cyp18a1), were highly expressed in late vitellogenesis. These distinct expression patterns between 20E synthesis and catabolism-related genes might ensure the rapid decline of the hormone titer at the transition point from vitellogenesis to choriogenesis. In addition, we compared landscapes of gene regulation between silkworm (Lepidoptera) and fruit fly (Diptera) oogeneses. Our results show that there is some consensus in the modules of gene co-expression during oogenesis in these insects. The data presented in this study provide new insights into the regulatory mechanisms underlying oogenesis in insects with polytrophic meroistic ovaries. The results also provide clues for further investigating the roles of epigenetic reconfiguration and circadian rhythm in insect oogenesis.

  19. A Systems Level, Functional Genomics Analysis of Chronic Epilepsy

    PubMed Central

    Bragin, Anatol; Kudo, Lili C.; Gehman, Lauren; Ruidera, Josephine; Geschwind, Daniel H.; Engel, Jerome

    2011-01-01

    Neither the molecular basis of the pathologic tendency of neuronal circuits to generate spontaneous seizures (epileptogenicity) nor anti-epileptogenic mechanisms that maintain a seizure-free state are well understood. Here, we performed transcriptomic analysis in the intrahippocampal kainate model of temporal lobe epilepsy in rats using both Agilent and Codelink microarray platforms to characterize the epileptic processes. The experimental design allowed subtraction of the confounding effects of the lesion, identification of expression changes associated with epileptogenicity, and genes upregulated by seizures with potential homeostatic anti-epileptogenic effects. Using differential expression analysis, we identified several hundred expression changes in chronic epilepsy, including candidate genes associated with epileptogenicity such as Bdnf and Kcnj13. To analyze these data from a systems perspective, we applied weighted gene co-expression network analysis (WGCNA) to identify groups of co-expressed genes (modules) and their central (hub) genes. One such module contained genes upregulated in the epileptogenic region, including multiple epileptogenicity candidate genes, and was found to be involved the protection of glial cells against oxidative stress, implicating glial oxidative stress in epileptogenicity. Another distinct module corresponded to the effects of chronic seizures and represented changes in neuronal synaptic vesicle trafficking. We found that the network structure and connectivity of one hub gene, Sv2a, showed significant changes between normal and epileptogenic tissue, becoming more highly connected in epileptic brain. Since Sv2a is a target of the antiepileptic levetiracetam, this module may be important in controlling seizure activity. Bioinformatic analysis of this module also revealed a potential mechanism for the observed transcriptional changes via generation of longer alternatively polyadenlyated transcripts through the upregulation of the RNA binding protein HuD. In summary, combining conventional statistical methods and network analysis allowed us to interpret the differentially regulated genes from a systems perspective, yielding new insight into several biological pathways underlying homeostatic anti-epileptogenic effects and epileptogenicity. PMID:21695113

  20. Gene networks associated with conditional fear in mice identified using a systems genetics approach

    PubMed Central

    2011-01-01

    Background Our understanding of the genetic basis of learning and memory remains shrouded in mystery. To explore the genetic networks governing the biology of conditional fear, we used a systems genetics approach to analyze a hybrid mouse diversity panel (HMDP) with high mapping resolution. Results A total of 27 behavioral quantitative trait loci were mapped with a false discovery rate of 5%. By integrating fear phenotypes, transcript profiling data from hippocampus and striatum and also genotype information, two gene co-expression networks correlated with context-dependent immobility were identified. We prioritized the key markers and genes in these pathways using intramodular connectivity measures and structural equation modeling. Highly connected genes in the context fear modules included Psmd6, Ube2a and Usp33, suggesting an important role for ubiquitination in learning and memory. In addition, we surveyed the architecture of brain transcript regulation and demonstrated preservation of gene co-expression modules in hippocampus and striatum, while also highlighting important differences. Rps15a, Kif3a, Stard7, 6330503K22RIK, and Plvap were among the individual genes whose transcript abundance were strongly associated with fear phenotypes. Conclusion Application of our multi-faceted mapping strategy permits an increasingly detailed characterization of the genetic networks underlying behavior. PMID:21410935

  1. Breast Tumors with Elevated Expression of 1q Candidate Genes Confer Poor Clinical Outcome and Sensitivity to Ras/PI3K Inhibition

    PubMed Central

    Viveka Thangaraj, Soundara; Periasamy, Jayaprakash; Bhaskar Rao, Divya; Barnabas, Georgina D.; Raghavan, Swetha; Ganesan, Kumaresan

    2013-01-01

    Genomic aberrations are common in cancers and the long arm of chromosome 1 is known for its frequent amplifications in breast cancer. However, the key candidate genes of 1q, and their contribution in breast cancer pathogenesis remain unexplored. We have analyzed the gene expression profiles of 1635 breast tumor samples using meta-analysis based approach and identified clinically significant candidates from chromosome 1q. Seven candidate genes including exonuclease 1 (EXO1) are consistently over expressed in breast tumors, specifically in high grade and aggressive breast tumors with poor clinical outcome. We derived a EXO1 co-expression module from the mRNA profiles of breast tumors which comprises 1q candidate genes and their co-expressed genes. By integrative functional genomics investigation, we identified the involvement of EGFR, RAS, PI3K / AKT, MYC, E2F signaling in the regulation of these selected 1q genes in breast tumors and breast cancer cell lines. Expression of EXO1 module was found as indicative of elevated cell proliferation, genomic instability, activated RAS/AKT/MYC/E2F1 signaling pathways and loss of p53 activity in breast tumors. mRNA–drug connectivity analysis indicates inhibition of RAS/PI3K as a possible targeted therapeutic approach for the patients with activated EXO1 module in breast tumors. Thus, we identified seven 1q candidate genes strongly associated with the poor survival of breast cancer patients and identified the possibility of targeting them with EGFR/RAS/PI3K inhibitors. PMID:24147022

  2. Analysis of global gene expression in Brachypodium distachyon reveals extensive network plasticity in response to abiotic stress.

    PubMed

    Priest, Henry D; Fox, Samuel E; Rowley, Erik R; Murray, Jessica R; Michael, Todd P; Mockler, Todd C

    2014-01-01

    Brachypodium distachyon is a close relative of many important cereal crops. Abiotic stress tolerance has a significant impact on productivity of agriculturally important food and feedstock crops. Analysis of the transcriptome of Brachypodium after chilling, high-salinity, drought, and heat stresses revealed diverse differential expression of many transcripts. Weighted Gene Co-Expression Network Analysis revealed 22 distinct gene modules with specific profiles of expression under each stress. Promoter analysis implicated short DNA sequences directly upstream of module members in the regulation of 21 of 22 modules. Functional analysis of module members revealed enrichment in functional terms for 10 of 22 network modules. Analysis of condition-specific correlations between differentially expressed gene pairs revealed extensive plasticity in the expression relationships of gene pairs. Photosynthesis, cell cycle, and cell wall expression modules were down-regulated by all abiotic stresses. Modules which were up-regulated by each abiotic stress fell into diverse and unique gene ontology GO categories. This study provides genomics resources and improves our understanding of abiotic stress responses of Brachypodium.

  3. A Systems Biology Framework Identifies Molecular Underpinnings of Coronary Heart Disease

    PubMed Central

    Huan, Tianxiao; Zhang, Bin; Wang, Zhi; Joehanes, Roby; Zhu, Jun; Johnson, Andrew D.; Ying, Saixia; Munson, Peter J.; Raghavachari, Nalini; Wang, Richard; Liu, Poching; Courchesne, Paul; Hwang, Shih-Jen; Assimes, Themistocles L.; McPherson, Ruth; Samani, Nilesh J.; Schunkert, Heribert; Meng, Qingying; Suver, Christine; O'Donnell, Christopher J.; Derry, Jonathan; Yang, Xia; Levy, Daniel

    2013-01-01

    Objective Genetic approaches have identified numerous loci associated with coronary heart disease (CHD). The molecular mechanisms underlying CHD gene-disease associations, however, remain unclear. We hypothesized that genetic variants with both strong and subtle effects drive gene subnetworks that in turn affect CHD. Approach and Results We surveyed CHD-associated molecular interactions by constructing coexpression networks using whole blood gene expression profiles from 188 CHD cases and 188 age- and sex-matched controls. 24 coexpression modules were identified including one case-specific and one control-specific differential module (DM). The DMs were enriched for genes involved in B-cell activation, immune response, and ion transport. By integrating the DMs with altered gene expression associated SNPs (eSNPs) and with results of GWAS of CHD and its risk factors, the control-specific DM was implicated as CHD-causal based on its significant enrichment for both CHD and lipid eSNPs. This causal DM was further integrated with tissue-specific Bayesian networks and protein-protein interaction networks to identify regulatory key driver (KD) genes. Multi-tissue KDs (SPIB and TNFRSF13C) and tissue-specific KDs (e.g. EBF1) were identified. Conclusions Our network-driven integrative analysis not only identified CHD-related genes, but also defined network structure that sheds light on the molecular interactions of genes associated with CHD risk. PMID:23539213

  4. Evolutionary Conservation and Divergence of Gene Coexpression Networks in Gossypium (Cotton) Seeds.

    PubMed

    Hu, Guanjing; Hovav, Ran; Grover, Corrinne E; Faigenboim-Doron, Adi; Kadmon, Noa; Page, Justin T; Udall, Joshua A; Wendel, Jonathan F

    2016-12-01

    The cotton genus (Gossypium) provides a superior system for the study of diversification, genome evolution, polyploidization, and human-mediated selection. To gain insight into phenotypic diversification in cotton seeds, we conducted coexpression network analysis of developing seeds from diploid and allopolyploid cotton species and explored network properties. Key network modules and functional associations were identified related to seed oil content and seed weight. We compared species-specific networks to reveal topological changes, including rewired edges and differentially coexpressed genes, associated with speciation, polyploidy, and cotton domestication. Network comparisons among species indicate that topologies are altered in addition to gene expression profiles, indicating that changes in transcriptomic coexpression relationships play a role in the developmental architecture of cotton seed development. The global network topology of allopolyploids, especially for domesticated G. hirsutum, resembles the network of the A-genome diploid more than that of the D-genome parent, despite its D-like phenotype in oil content. Expression modifications associated with allopolyploidy include coexpression level dominance and transgressive expression, suggesting that the transcriptomic architecture in polyploids is to some extent a modular combination of that of its progenitor genomes. Among allopolyploids, intermodular relationships are more preserved between two different wild allopolyploid species than they are between wild and domesticated forms of a cultivated cotton, and regulatory connections of oil synthesis-related pathways are denser and more closely clustered in domesticated vs. wild G. hirsutum. These results demonstrate substantial modification of genic coexpression under domestication. Our work demonstrates how network inference informs our understanding of the transcriptomic architecture of phenotypic variation associated with temporal scales ranging from thousands (domestication) to millions (speciation) of years, and by polyploidy. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. Reconstruction of the genome-scale co-expression network for the Hippo signaling pathway in colorectal cancer.

    PubMed

    Dehghanian, Fariba; Hojati, Zohreh; Hosseinkhan, Nazanin; Mousavian, Zaynab; Masoudi-Nejad, Ali

    2018-05-26

    The Hippo signaling pathway (HSP) has been identified as an essential and complex signaling pathway for tumor suppression that coordinates proliferation, differentiation, cell death, cell growth and stemness. In the present study, we conducted a genome-scale co-expression analysis to reconstruct the HSP in colorectal cancer (CRC). Five key modules were detected through network clustering, and a detailed discussion of two modules containing respectively 18 and 13 over and down-regulated members of HSP was provided. Our results suggest new potential regulatory factors in the HSP. The detected modules also suggest novel genes contributing to CRC. Moreover, differential expression analysis confirmed the differential expression pattern of HSP members and new suggested regulatory factors between tumor and normal samples. These findings can further reveal the importance of HSP in CRC. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. Identification of Key Pathways and Genes in the Dynamic Progression of HCC Based on WGCNA.

    PubMed

    Yin, Li; Cai, Zhihui; Zhu, Baoan; Xu, Cunshuan

    2018-02-14

    Hepatocellular carcinoma (HCC) is a devastating disease worldwide. Though many efforts have been made to elucidate the process of HCC, its molecular mechanisms of development remain elusive due to its complexity. To explore the stepwise carcinogenic process from pre-neoplastic lesions to the end stage of HCC, we employed weighted gene co-expression network analysis (WGCNA) which has been proved to be an effective method in many diseases to detect co-expressed modules and hub genes using eight pathological stages including normal, cirrhosis without HCC, cirrhosis, low-grade dysplastic, high-grade dysplastic, very early and early, advanced HCC and very advanced HCC. Among the eight consecutive pathological stages, five representative modules are selected to perform canonical pathway enrichment and upstream regulator analysis by using ingenuity pathway analysis (IPA) software. We found that cell cycle related biological processes were activated at four neoplastic stages, and the degree of activation of the cell cycle corresponded to the deterioration degree of HCC. The orange and yellow modules enriched in energy metabolism, especially oxidative metabolism, and the expression value of the genes decreased only at four neoplastic stages. The brown module, enriched in protein ubiquitination and ephrin receptor signaling pathways, correlated mainly with the very early stage of HCC. The darkred module, enriched in hepatic fibrosis/hepatic stellate cell activation, correlated with the cirrhotic stage only. The high degree hub genes were identified based on the protein-protein interaction (PPI) network and were verified by Kaplan-Meier survival analysis. The novel five high degree hub genes signature that was identified in our study may shed light on future prognostic and therapeutic approaches. Our study brings a new perspective to the understanding of the key pathways and genes in the dynamic changes of HCC progression. These findings shed light on further investigations.

  7. A novel method to identify pathways associated with renal cell carcinoma based on a gene co-expression network

    PubMed Central

    RUAN, XIYUN; LI, HONGYUN; LIU, BO; CHEN, JIE; ZHANG, SHIBAO; SUN, ZEQIANG; LIU, SHUANGQING; SUN, FAHAI; LIU, QINGYONG

    2015-01-01

    The aim of the present study was to develop a novel method for identifying pathways associated with renal cell carcinoma (RCC) based on a gene co-expression network. A framework was established where a co-expression network was derived from the database as well as various co-expression approaches. First, the backbone of the network based on differentially expressed (DE) genes between RCC patients and normal controls was constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database. The differentially co-expressed links were detected by Pearson’s correlation, the empirical Bayesian (EB) approach and Weighted Gene Co-expression Network Analysis (WGCNA). The co-expressed gene pairs were merged by a rank-based algorithm. We obtained 842; 371; 2,883 and 1,595 co-expressed gene pairs from the co-expression networks of the STRING database, Pearson’s correlation EB method and WGCNA, respectively. Two hundred and eighty-one differentially co-expressed (DC) gene pairs were obtained from the merged network using this novel method. Pathway enrichment analysis based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and the network enrichment analysis (NEA) method were performed to verify feasibility of the merged method. Results of the KEGG and NEA pathway analyses showed that the network was associated with RCC. The suggested method was computationally efficient to identify pathways associated with RCC and has been identified as a useful complement to traditional co-expression analysis. PMID:26058425

  8. The common transcriptional subnetworks of the grape berry skin in the late stages of ripening.

    PubMed

    Ghan, Ryan; Petereit, Juli; Tillett, Richard L; Schlauch, Karen A; Toubiana, David; Fait, Aaron; Cramer, Grant R

    2017-05-30

    Wine grapes are important economically in many countries around the world. Defining the optimum time for grape harvest is a major challenge to the grower and winemaker. Berry skins are an important source of flavor, color and other quality traits in the ripening stage. Senescent-like processes such as chloroplast disorganization and cell death characterize the late ripening stage. To better understand the molecular and physiological processes involved in the late stages of berry ripening, RNA-seq analysis of the skins of seven wine grape cultivars (Cabernet Franc, Cabernet Sauvignon, Merlot, Pinot Noir, Chardonnay, Sauvignon Blanc and Semillon) was performed. RNA-seq analysis identified approximately 2000 common differentially expressed genes for all seven cultivars across four different berry sugar levels (20 to 26 °Brix). Network analyses, both a posteriori (standard) and a priori (gene co-expression network analysis), were used to elucidate transcriptional subnetworks and hub genes associated with traits in the berry skins of the late stages of berry ripening. These independent approaches revealed genes involved in photosynthesis, catabolism, and nucleotide metabolism. The transcript abundance of most photosynthetic genes declined with increasing sugar levels in the berries. The transcript abundance of other processes increased such as nucleic acid metabolism, chromosome organization and lipid catabolism. Weighted gene co-expression network analysis (WGCNA) identified 64 gene modules that were organized into 12 subnetworks of three modules or more and six higher order gene subnetworks. Some gene subnetworks were highly correlated with sugar levels and some subnetworks were highly enriched in the chloroplast and nucleus. The petal R package was utilized independently to construct a true small-world and scale-free complex gene co-expression network model. A subnetwork of 216 genes with the highest connectivity was elucidated, consistent with the module results from WGCNA. Hub genes in these subnetworks were identified including numerous members of the core circadian clock, RNA splicing, proteolysis and chromosome organization. An integrated model was constructed linking light sensing with alternative splicing, chromosome remodeling and the circadian clock. A common set of differentially expressed genes and gene subnetworks from seven different cultivars were examined in the skin of the late stages of grapevine berry ripening. A densely connected gene subnetwork was elucidated involving a complex interaction of berry senescent processes (autophagy), catabolism, the circadian clock, RNA splicing, proteolysis and epigenetic regulation. Hypotheses were induced from these data sets involving sugar accumulation, light, autophagy, epigenetic regulation, and fruit development. This work provides a better understanding of berry development and the transcriptional processes involved in the late stages of ripening.

  9. Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

    PubMed Central

    Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

    2016-01-01

    The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration. PMID:27241320

  10. Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

    NASA Astrophysics Data System (ADS)

    Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

    2016-05-01

    The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration.

  11. A methodology for the analysis of differential coexpression across the human lifespan.

    PubMed

    Gillis, Jesse; Pavlidis, Paul

    2009-09-22

    Differential coexpression is a change in coexpression between genes that may reflect 'rewiring' of transcriptional networks. It has previously been hypothesized that such changes might be occurring over time in the lifespan of an organism. While both coexpression and differential expression of genes have been previously studied in life stage change or aging, differential coexpression has not. Generalizing differential coexpression analysis to many time points presents a methodological challenge. Here we introduce a method for analyzing changes in coexpression across multiple ordered groups (e.g., over time) and extensively test its validity and usefulness. Our method is based on the use of the Haar basis set to efficiently represent changes in coexpression at multiple time scales, and thus represents a principled and generalizable extension of the idea of differential coexpression to life stage data. We used published microarray studies categorized by age to test the methodology. We validated the methodology by testing our ability to reconstruct Gene Ontology (GO) categories using our measure of differential coexpression and compared this result to using coexpression alone. Our method allows significant improvement in characterizing these groups of genes. Further, we examine the statistical properties of our measure of differential coexpression and establish that the results are significant both statistically and by an improvement in semantic similarity. In addition, we found that our method finds more significant changes in gene relationships compared to several other methods of expressing temporal relationships between genes, such as coexpression over time. Differential coexpression over age generates significant and biologically relevant information about the genes producing it. Our Haar basis methodology for determining age-related differential coexpression performs better than other tested methods. The Haar basis set also lends itself to ready interpretation in terms of both evolutionary and physiological mechanisms of aging and can be seen as a natural generalization of two-category differential coexpression. paul@bioinformatics.ubc.ca.

  12. The DinJ/RelE toxin-antitoxin system suppresses virulence in Xylella fastidiosa

    USDA-ARS?s Scientific Manuscript database

    Xylella fastidiosa, the causal agent of a number agriculturally important plant diseases, encodes multiple toxin-antitoxin (TA) systems. TA modules consist of a toxin protein co-expressed with a specific antitoxin, and are often acquired through horizontal gene transfer. Antitoxin molecules (RNA or ...

  13. Identifying candidate driver genes by integrative ovarian cancer genomics data

    NASA Astrophysics Data System (ADS)

    Lu, Xinguo; Lu, Jibo

    2017-08-01

    Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.

  14. VisANT 3.0: new modules for pathway visualization, editing, prediction and construction.

    PubMed

    Hu, Zhenjun; Ng, David M; Yamada, Takuji; Chen, Chunnuan; Kawashima, Shuichi; Mellor, Joe; Linghu, Bolan; Kanehisa, Minoru; Stuart, Joshua M; DeLisi, Charles

    2007-07-01

    With the integration of the KEGG and Predictome databases as well as two search engines for coexpressed genes/proteins using data sets obtained from the Stanford Microarray Database (SMD) and Gene Expression Omnibus (GEO) database, VisANT 3.0 supports exploratory pathway analysis, which includes multi-scale visualization of multiple pathways, editing and annotating pathways using a KEGG compatible visual notation and visualization of expression data in the context of pathways. Expression levels are represented either by color intensity or by nodes with an embedded expression profile. Multiple experiments can be navigated or animated. Known KEGG pathways can be enriched by querying either coexpressed components of known pathway members or proteins with known physical interactions. Predicted pathways for genes/proteins with unknown functions can be inferred from coexpression or physical interaction data. Pathways produced in VisANT can be saved as computer-readable XML format (VisML), graphic images or high-resolution Scalable Vector Graphics (SVG). Pathways in the format of VisML can be securely shared within an interested group or published online using a simple Web link. VisANT is freely available at http://visant.bu.edu.

  15. Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways.

    PubMed

    Obayashi, Takeshi; Kinoshita, Kengo

    2010-05-01

    Gene coexpression analyses are a powerful method to predict the function of genes and/or to identify genes that are functionally related to query genes. The basic idea of gene coexpression analyses is that genes with similar functions should have similar expression patterns under many different conditions. This approach is now widely used by many experimental researchers, especially in the field of plant biology. In this review, we will summarize recent successful examples obtained by using our gene coexpression database, ATTED-II. Specifically, the examples will describe the identification of new genes, such as the subunits of a complex protein, the enzymes in a metabolic pathway and transporters. In addition, we will discuss the discovery of a new intercellular signaling factor and new regulatory relationships between transcription factors and their target genes. In ATTED-II, we provide two basic views of gene coexpression, a gene list view and a gene network view, which can be used as guide gene approach and narrow-down approach, respectively. In addition, we will discuss the coexpression effectiveness for various types of gene sets.

  16. Effect of genomic distance on coexpression of coregulated genes in E. coli

    PubMed Central

    Merino, Enrique; Marchal, Kathleen; Collado-Vides, Julio

    2017-01-01

    In prokaryotes, genomic distance is a feature that in addition to coregulation affects coexpression. Several observations, such as genomic clustering of highly coexpressed small regulons, support the idea that coexpression behavior of coregulated genes is affected by the distance between the coregulated genes. However, the specific contribution of distance in addition to coregulation in determining the degree of coexpression has not yet been studied systematically. In this work, we exploit the rich information in RegulonDB to study how the genomic distance between coregulated genes affects their degree of coexpression, measured by pairwise similarity of expression profiles obtained under a large number of conditions. We observed that, in general, coregulated genes display higher degrees of coexpression as they are more closely located on the genome. This contribution of genomic distance in determining the degree of coexpression was relatively small compared to the degree of coexpression that was determined by the tightness of the coregulation (degree of overlap of regulatory programs) but was shown to be evolutionary constrained. In addition, the distance effect was sufficient to guarantee coexpression of coregulated genes that are located at very short distances, irrespective of their tightness of coregulation. This is partly but definitely not always because the close distance is also the cause of the coregulation. In cases where it is not, we hypothesize that the effect of the distance on coexpression could be caused by the fact that coregulated genes closely located to each other are also relatively more equidistantly located from their common TF and therefore subject to more similar levels of TF molecules. The absolute genomic distance of the coregulated genes to their common TF-coding gene tends to be less important in determining the degree of coexpression. Our results pinpoint the importance of taking into account the combined effect of distance and coregulation when studying prokaryotic coexpression and transcriptional regulation. PMID:28419102

  17. Discover mouse gene coexpression landscapes using dictionary learning and sparse coding.

    PubMed

    Li, Yujie; Chen, Hanbo; Jiang, Xi; Li, Xiang; Lv, Jinglei; Peng, Hanchuan; Tsien, Joe Z; Liu, Tianming

    2017-12-01

    Gene coexpression patterns carry rich information regarding enormously complex brain structures and functions. Characterization of these patterns in an unbiased, integrated, and anatomically comprehensive manner will illuminate the higher-order transcriptome organization and offer genetic foundations of functional circuitry. Here using dictionary learning and sparse coding, we derived coexpression networks from the space-resolved anatomical comprehensive in situ hybridization data from Allen Mouse Brain Atlas dataset. The key idea is that if two genes use the same dictionary to represent their original signals, then their gene expressions must share similar patterns, thereby considering them as "coexpressed." For each network, we have simultaneous knowledge of spatial distributions, the genes in the network and the extent a particular gene conforms to the coexpression pattern. Gene ontologies and the comparisons with published gene lists reveal biologically identified coexpression networks, some of which correspond to major cell types, biological pathways, and/or anatomical regions.

  18. Co-LncRNA: investigating the lncRNA combinatorial effects in GO annotations and KEGG pathways based on human RNA-Seq data

    PubMed Central

    Zhao, Zheng; Bai, Jing; Wu, Aiwei; Wang, Yuan; Zhang, Jinwen; Wang, Zishan; Li, Yongsheng; Xu, Juan; Li, Xia

    2015-01-01

    Long non-coding RNAs (lncRNAs) are emerging as key regulators of diverse biological processes and diseases. However, the combinatorial effects of these molecules in a specific biological function are poorly understood. Identifying co-expressed protein-coding genes of lncRNAs would provide ample insight into lncRNA functions. To facilitate such an effort, we have developed Co-LncRNA, which is a web-based computational tool that allows users to identify GO annotations and KEGG pathways that may be affected by co-expressed protein-coding genes of a single or multiple lncRNAs. LncRNA co-expressed protein-coding genes were first identified in publicly available human RNA-Seq datasets, including 241 datasets across 6560 total individuals representing 28 tissue types/cell lines. Then, the lncRNA combinatorial effects in a given GO annotations or KEGG pathways are taken into account by the simultaneous analysis of multiple lncRNAs in user-selected individual or multiple datasets, which is realized by enrichment analysis. In addition, this software provides a graphical overview of pathways that are modulated by lncRNAs, as well as a specific tool to display the relevant networks between lncRNAs and their co-expressed protein-coding genes. Co-LncRNA also supports users in uploading their own lncRNA and protein-coding gene expression profiles to investigate the lncRNA combinatorial effects. It will be continuously updated with more human RNA-Seq datasets on an annual basis. Taken together, Co-LncRNA provides a web-based application for investigating lncRNA combinatorial effects, which could shed light on their biological roles and could be a valuable resource for this community. Database URL: http://www.bio-bigdata.com/Co-LncRNA/ PMID:26363020

  19. Tissue and cell-type co-expression networks of transcription factors and wood component genes in Populus trichocarpa.

    PubMed

    Shi, Rui; Wang, Jack P; Lin, Ying-Chung; Li, Quanzi; Sun, Ying-Hsuan; Chen, Hao; Sederoff, Ronald R; Chiang, Vincent L

    2017-05-01

    Co-expression networks based on transcriptomes of Populus trichocarpa major tissues and specific cell types suggest redundant control of cell wall component biosynthetic genes by transcription factors in wood formation. We analyzed the transcriptomes of five tissues (xylem, phloem, shoot, leaf, and root) and two wood forming cell types (fiber and vessel) of Populus trichocarpa to assemble gene co-expression subnetworks associated with wood formation. We identified 165 transcription factors (TFs) that showed xylem-, fiber-, and vessel-specific expression. Of these 165 TFs, 101 co-expressed (correlation coefficient, r > 0.7) with the 45 secondary cell wall cellulose, hemicellulose, and lignin biosynthetic genes. Each cell wall component gene co-expressed on average with 34 TFs, suggesting redundant control of the cell wall component gene expression. Co-expression analysis showed that the 101 TFs and the 45 cell wall component genes each has two distinct groups (groups 1 and 2), based on their co-expression patterns. The group 1 TFs (44 members) are predominantly xylem and fiber specific, and are all highly positively co-expressed with the group 1 cell wall component genes (30 members), suggesting their roles as major wood formation regulators. Group 1 TFs include a lateral organ boundary domain gene (LBD) that has the highest number of positively correlated cell wall component genes (36) and TFs (47). The group 2 TFs have 57 members, including 14 vessel-specific TFs, and are generally less correlated with the cell wall component genes. An exception is a vessel-specific basic helix-loop-helix (bHLH) gene that negatively correlates with 20 cell wall component genes, and may function as a key transcriptional suppressor. The co-expression networks revealed here suggest a well-structured transcriptional homeostasis for cell wall component biosynthesis during wood formation.

  20. Gene co-expression networks shed light into diseases of brain iron accumulation

    PubMed Central

    Bettencourt, Conceição; Forabosco, Paola; Wiethoff, Sarah; Heidari, Moones; Johnstone, Daniel M.; Botía, Juan A.; Collingwood, Joanna F.; Hardy, John; Milward, Elizabeth A.; Ryten, Mina; Houlden, Henry

    2016-01-01

    Aberrant brain iron deposition is observed in both common and rare neurodegenerative disorders, including those categorized as Neurodegeneration with Brain Iron Accumulation (NBIA), which are characterized by focal iron accumulation in the basal ganglia. Two NBIA genes are directly involved in iron metabolism, but whether other NBIA-related genes also regulate iron homeostasis in the human brain, and whether aberrant iron deposition contributes to neurodegenerative processes remains largely unknown. This study aims to expand our understanding of these iron overload diseases and identify relationships between known NBIA genes and their main interacting partners by using a systems biology approach. We used whole-transcriptome gene expression data from human brain samples originating from 101 neuropathologically normal individuals (10 brain regions) to generate weighted gene co-expression networks and cluster the 10 known NBIA genes in an unsupervised manner. We investigated NBIA-enriched networks for relevant cell types and pathways, and whether they are disrupted by iron loading in NBIA diseased tissue and in an in vivo mouse model. We identified two basal ganglia gene co-expression modules significantly enriched for NBIA genes, which resemble neuronal and oligodendrocytic signatures. These NBIA gene networks are enriched for iron-related genes, and implicate synapse and lipid metabolism related pathways. Our data also indicates that these networks are disrupted by excessive brain iron loading. We identified multiple cell types in the origin of NBIA disorders. We also found unforeseen links between NBIA networks and iron-related processes, and demonstrate convergent pathways connecting NBIAs and phenotypically overlapping diseases. Our results are of further relevance for these diseases by providing candidates for new causative genes and possible points for therapeutic intervention. PMID:26707700

  1. Gene co-expression networks shed light into diseases of brain iron accumulation.

    PubMed

    Bettencourt, Conceição; Forabosco, Paola; Wiethoff, Sarah; Heidari, Moones; Johnstone, Daniel M; Botía, Juan A; Collingwood, Joanna F; Hardy, John; Milward, Elizabeth A; Ryten, Mina; Houlden, Henry

    2016-03-01

    Aberrant brain iron deposition is observed in both common and rare neurodegenerative disorders, including those categorized as Neurodegeneration with Brain Iron Accumulation (NBIA), which are characterized by focal iron accumulation in the basal ganglia. Two NBIA genes are directly involved in iron metabolism, but whether other NBIA-related genes also regulate iron homeostasis in the human brain, and whether aberrant iron deposition contributes to neurodegenerative processes remains largely unknown. This study aims to expand our understanding of these iron overload diseases and identify relationships between known NBIA genes and their main interacting partners by using a systems biology approach. We used whole-transcriptome gene expression data from human brain samples originating from 101 neuropathologically normal individuals (10 brain regions) to generate weighted gene co-expression networks and cluster the 10 known NBIA genes in an unsupervised manner. We investigated NBIA-enriched networks for relevant cell types and pathways, and whether they are disrupted by iron loading in NBIA diseased tissue and in an in vivo mouse model. We identified two basal ganglia gene co-expression modules significantly enriched for NBIA genes, which resemble neuronal and oligodendrocytic signatures. These NBIA gene networks are enriched for iron-related genes, and implicate synapse and lipid metabolism related pathways. Our data also indicates that these networks are disrupted by excessive brain iron loading. We identified multiple cell types in the origin of NBIA disorders. We also found unforeseen links between NBIA networks and iron-related processes, and demonstrate convergent pathways connecting NBIAs and phenotypically overlapping diseases. Our results are of further relevance for these diseases by providing candidates for new causative genes and possible points for therapeutic intervention. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  2. High motivation for exercise is associated with altered chromatin regulators of monoamine receptor gene expression in the striatum of selectively bred mice.

    PubMed

    Saul, M C; Majdak, P; Perez, S; Reilly, M; Garland, T; Rhodes, J S

    2017-03-01

    Although exercise is critical for health, many lack the motivation to exercise, and it is unclear how motivation might be increased. To uncover the molecular underpinnings of increased motivation for exercise, we analyzed the transcriptome of the striatum in four mouse lines selectively bred for high voluntary wheel running and four non-selected control lines. The striatum was dissected and RNA was extracted and sequenced from four individuals of each line. We found multiple genes and gene systems with strong relationships to both selection and running history over the previous 6 days. Among these genes were Htr1b, a serotonin receptor subunit and Slc38a2, a marker for both glutamatergic and γ-aminobutyric acid (GABA)-ergic signaling. System analysis of the raw results found enrichment of transcriptional regulation and kinase genes. Further, we identified a splice variant affecting the Wnt-related Golgi signaling gene Tmed5. Using coexpression network analysis, we found a cluster of interrelated coexpression modules with relationships to running behavior. From these modules, we built a network correlated with running that predicts a mechanistic relationship between transcriptional regulation by nucleosome structure and Htr1b expression. The Library of Integrated Network-Based Cellular Signatures identified the protein kinase C δ inhibitor, rottlerin, the tyrosine kinase inhibitor, Linifanib and the delta-opioid receptor antagonist 7-benzylidenenaltrexone as potential compounds for increasing the motivation to run. Taken together, our findings support a neurobiological framework of exercise motivation where chromatin state leads to differences in dopamine signaling through modulation of both the primary neurotransmitters glutamate and GABA, and by neuromodulators such as serotonin. © 2016 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.

  3. Weighted gene co-expression network analysis of colorectal cancer liver metastasis genome sequencing data and screening of anti-metastasis drugs.

    PubMed

    Gao, Bo; Shao, Qin; Choudhry, Hani; Marcus, Victoria; Dong, Kung; Ragoussis, Jiannis; Gao, Zu-Hua

    2016-09-01

    Approximately 9% of cancer-related deaths are caused by colorectal cancer (CRC). CRC patients are prone to liver metastasis, which is the most important cause for the high CRC mortality rate. Understanding the molecular mechanism of CRC liver metastasis could help us to find novel targets for the effective treatment of this deadly disease. Using weighted gene co-expression network analysis on the sequencing data of CRC with and with metastasis, we identified 5 colorectal cancer liver metastasis related modules which were labeled as brown, blue, grey, yellow and turquoise. In the brown module, which represents the metastatic tumor in the liver, gene ontology (GO) analysis revealed functions including the G-protein coupled receptor protein signaling pathway, epithelial cell differentiation and cell surface receptor linked signal transduction. In the blue module, which represents the primary CRC that has metastasized, GO analysis showed that the genes were mainly enriched in GO terms including G-protein coupled receptor protein signaling pathway, cell surface receptor linked signal transduction, and negative regulation of cell differentiation. In the yellow and turquoise modules, which represent the primary non-metastatic CRC, 13 downregulated CRC liver metastasis-related candidate miRNAs were identified (e.g. hsa-miR-204, hsa-miR-455, etc.). Furthermore, analyzing the DrugBank database and mining the literature identified 25 and 12 candidate drugs that could potentially block the metastatic processes of the primary tumor and inhibit the progression of metastatic tumors in the liver, respectively. Data generated from this study not only furthers our understanding of the genetic alterations that drive the metastatic process, but also guides the development of molecular-targeted therapy of colorectal cancer liver metastasis.

  4. From SNP co-association to RNA co-expression: novel insights into gene networks for intramuscular fatty acid composition in porcine.

    PubMed

    Ramayo-Caldas, Yuliaxis; Ballester, Maria; Fortes, Marina R S; Esteve-Codina, Anna; Castelló, Anna; Noguera, Jose L; Fernández, Ana I; Pérez-Enciso, Miguel; Reverter, Antonio; Folch, Josep M

    2014-03-26

    Fatty acids (FA) play a critical role in energy homeostasis and metabolic diseases; in the context of livestock species, their profile also impacts on meat quality for healthy human consumption. Molecular pathways controlling lipid metabolism are highly interconnected and are not fully understood. Elucidating these molecular processes will aid technological development towards improvement of pork meat quality and increased knowledge of FA metabolism, underpinning metabolic diseases in humans. The results from genome-wide association studies (GWAS) across 15 phenotypes were subjected to an Association Weight Matrix (AWM) approach to predict a network of 1,096 genes related to intramuscular FA composition in pigs. To identify the key regulators of FA metabolism, we focused on the minimal set of transcription factors (TF) that the explored the majority of the network topology. Pathway and network analyses pointed towards a trio of TF as key regulators of FA metabolism: NCOA2, FHL2 and EP300. Promoter sequence analyses confirmed that these TF have binding sites for some well-know regulators of lipid and carbohydrate metabolism. For the first time in a non-model species, some of the co-associations observed at the genetic level were validated through co-expression at the transcriptomic level based on real-time PCR of 40 genes in adipose tissue, and a further 55 genes in liver. In particular, liver expression of NCOA2 and EP300 differed between pig breeds (Iberian and Landrace) extreme in terms of fat deposition. Highly clustered co-expression networks in both liver and adipose tissues were observed. EP300 and NCOA2 showed centrality parameters above average in the both networks. Over all genes, co-expression analyses confirmed 28.9% of the AWM predicted gene-gene interactions in liver and 33.0% in adipose tissue. The magnitude of this validation varied across genes, with up to 60.8% of the connections of NCOA2 in adipose tissue being validated via co-expression. Our results recapitulate the known transcriptional regulation of FA metabolism, predict gene interactions that can be experimentally validated, and suggest that genetic variants mapped to EP300, FHL2, and NCOA2 modulate lipid metabolism and control energy homeostasis in pigs.

  5. Interactions between the colonic transcriptome, metabolome, and microbiome in mouse models of obesity-induced intestinal cancer.

    PubMed

    Pfalzer, Anna C; Kamanu, Frederick K; Parnell, Laurence D; Tai, Albert K; Liu, Zhenhua; Mason, Joel B; Crott, Jimmy W

    2016-08-01

    Obesity is a significant risk factor for colorectal cancer (CRC); however, the relative contribution of high-fat (HF) consumption and excess adiposity remains unclear. It is becoming apparent that obesity perturbs both the intestinal microbiome and metabolome, and each has the potential to induce protumorigenic changes in the epithelial transcriptome. The physiological consequences and the degree to which these different biologic systems interact remain poorly defined. To understand the mechanisms by which obesity drives colonic tumorigenesis, we profiled the colonic epithelial transcriptome of HF-fed and genetically obese (DbDb) mice with a genetic predisposition to intestinal tumorigenesis (Apc(1638N)); 266 and 584 genes were differentially expressed in the colonic mucosa of HF and DbDb mice, respectively. These genes mapped to pathways involved in immune function, and cellular proliferation and cancer. Furthermore, Akt was central within the networks of interacting genes identified in both gene sets. Regression analyses of coexpressed genes with the abundance of bacterial taxa identified three taxa, previously correlated with tumor burden, to be significantly correlated with a gene module enriched for Akt-related genes. Similarly, regression of coexpressed genes with metabolites found that adenosine, which was negatively associated with inflammatory markers and tumor burden, was also correlated with a gene module enriched with Akt regulators. Our findings provide evidence that HF consumption and excess adiposity result in changes in the colonic transcriptome that, although distinct, both appear to converge on Akt signaling. Such changes could be mediated by alterations in the colonic microbiome and metabolome.

  6. A Systems Approach Identifies Networks and Genes Linking Sleep and Stress: Implications for Neuropsychiatric Disorders

    PubMed Central

    Jiang, Peng; Scarpa, Joseph R.; Fitzpatrick, Karrie; Losic, Bojan; Gao, Vance D.; Hao, Ke; Summa, Keith C.; Yang, He S.; Zhang, Bin; Allada, Ravi; Vitaterna, Martha H.; Turek, Fred W.; Kasarskis, Andrew

    2016-01-01

    SUMMARY Sleep dysfunction and stress susceptibility are co-morbid complex traits, which often precede and predispose patients to a variety of neuropsychiatric diseases. Here, we demonstrate multi-level organizations of genetic landscape, candidate genes, and molecular networks associated with 328 stress and sleep traits in a chronically stressed population of 338 (C57BL/6J×A/J) F2 mice. We constructed striatal gene co-expression networks, revealing functionally and cell-type specific gene co-regulations important for stress and sleep. Using a composite ranking system, we identified network modules most relevant for 15 independent phenotypic categories, highlighting a mitochondria/synaptic module that links sleep and stress. The key network regulators of this module are overrepresented with genes implicated in neuropsychiatric diseases. Our work suggests the interplay between sleep, stress, and neuropathology emerge from genetic influences on gene expression and their collective organization through complex molecular networks, providing a framework to interrogate the mechanisms underlying sleep, stress susceptibility, and related neuropsychiatric disorders. PMID:25921536

  7. Construction of an integrated gene regulatory network link to stress-related immune system in cattle.

    PubMed

    Behdani, Elham; Bakhtiarizadeh, Mohammad Reza

    2017-10-01

    The immune system is an important biological system that is negatively impacted by stress. This study constructed an integrated regulatory network to enhance our understanding of the regulatory gene network used in the stress-related immune system. Module inference was used to construct modules of co-expressed genes with bovine leukocyte RNA-Seq data. Transcription factors (TFs) were then assigned to these modules using Lemon-Tree algorithms. In addition, the TFs assigned to each module were confirmed using the promoter analysis and protein-protein interactions data. Therefore, our integrated method identified three TFs which include one TF that is previously known to be involved in immune response (MYBL2) and two TFs (E2F8 and FOXS1) that had not been recognized previously and were identified for the first time in this study as novel regulatory candidates in immune response. This study provides valuable insights on the regulatory programs of genes involved in the stress-related immune system.

  8. Differential Coexpression Analysis Reveals Extensive Rewiring of Arabidopsis Gene Coexpression in Response to Pseudomonas syringae Infection

    PubMed Central

    Jiang, Zhenhong; Dong, Xiaobao; Li, Zhi-Gang; He, Fei; Zhang, Ziding

    2016-01-01

    Plant defense responses to pathogens involve massive transcriptional reprogramming. Recently, differential coexpression analysis has been developed to study the rewiring of gene networks through microarray data, which is becoming an important complement to traditional differential expression analysis. Using time-series microarray data of Arabidopsis thaliana infected with Pseudomonas syringae, we analyzed Arabidopsis defense responses to P. syringae through differential coexpression analysis. Overall, we found that differential coexpression was a common phenomenon of plant immunity. Genes that were frequently involved in differential coexpression tend to be related to plant immune responses. Importantly, many of those genes have similar average expression levels between normal plant growth and pathogen infection but have different coexpression partners. By integrating the Arabidopsis regulatory network into our analysis, we identified several transcription factors that may be regulators of differential coexpression during plant immune responses. We also observed extensive differential coexpression between genes within the same metabolic pathways. Several metabolic pathways, such as photosynthesis light reactions, exhibited significant changes in expression correlation between normal growth and pathogen infection. Taken together, differential coexpression analysis provides a new strategy for analyzing transcriptional data related to plant defense responses and new insights into the understanding of plant-pathogen interactions. PMID:27721457

  9. Systems Genetic Analysis of Osteoblast-Lineage Cells

    PubMed Central

    Calabrese, Gina; Bennett, Brian J.; Orozco, Luz; Kang, Hyun M.; Eskin, Eleazar; Dombret, Carlos; De Backer, Olivier; Lusis, Aldons J.; Farber, Charles R.

    2012-01-01

    The osteoblast-lineage consists of cells at various stages of maturation that are essential for skeletal development, growth, and maintenance. Over the past decade, many of the signaling cascades that regulate this lineage have been elucidated; however, little is known of the networks that coordinate, modulate, and transmit these signals. Here, we identify a gene network specific to the osteoblast-lineage through the reconstruction of a bone co-expression network using microarray profiles collected on 96 Hybrid Mouse Diversity Panel (HMDP) inbred strains. Of the 21 modules that comprised the bone network, module 9 (M9) contained genes that were highly correlated with prototypical osteoblast maker genes and were more highly expressed in osteoblasts relative to other bone cells. In addition, the M9 contained many of the key genes that define the osteoblast-lineage, which together suggested that it was specific to this lineage. To use the M9 to identify novel osteoblast genes and highlight its biological relevance, we knocked-down the expression of its two most connected “hub” genes, Maged1 and Pard6g. Their perturbation altered both osteoblast proliferation and differentiation. Furthermore, we demonstrated the mice deficient in Maged1 had decreased bone mineral density (BMD). It was also discovered that a local expression quantitative trait locus (eQTL) regulating the Wnt signaling antagonist Sfrp1 was a key driver of the M9. We also show that the M9 is associated with BMD in the HMDP and is enriched for genes implicated in the regulation of human BMD through genome-wide association studies. In conclusion, we have identified a physiologically relevant gene network and used it to discover novel genes and regulatory mechanisms involved in the function of osteoblast-lineage cells. Our results highlight the power of harnessing natural genetic variation to generate co-expression networks that can be used to gain insight into the function of specific cell-types. PMID:23300464

  10. Assessing the utility of gene co-expression stability in combination with correlation in the analysis of protein-protein interaction networks

    PubMed Central

    2011-01-01

    Background Gene co-expression, in the form of a correlation coefficient, has been valuable in the analysis, classification and prediction of protein-protein interactions. However, it is susceptible to bias from a few samples having a large effect on the correlation coefficient. Gene co-expression stability is a means of quantifying this bias, with high stability indicating robust, unbiased co-expression correlation coefficients. We assess the utility of gene co-expression stability as an additional measure to support the co-expression correlation in the analysis of protein-protein interaction networks. Results We studied the patterns of co-expression correlation and stability in interacting proteins with respect to their interaction promiscuity, levels of intrinsic disorder, and essentiality or disease-relatedness. Co-expression stability, along with co-expression correlation, acts as a better classifier of hub proteins in interaction networks, than co-expression correlation alone, enabling the identification of a class of hubs that are functionally distinct from the widely accepted transient (date) and obligate (party) hubs. Proteins with high levels of intrinsic disorder have low co-expression correlation and high stability with their interaction partners suggesting their involvement in transient interactions, except for a small group that have high co-expression correlation and are typically subunits of stable complexes. Similar behavior was seen for disease-related and essential genes. Interacting proteins that are both disordered have higher co-expression stability than ordered protein pairs. Using co-expression correlation and stability, we found that transient interactions are more likely to occur between an ordered and a disordered protein while obligate interactions primarily occur between proteins that are either both ordered, or disordered. Conclusions We observe that co-expression stability shows distinct patterns in structurally and functionally different groups of proteins and interactions. We conclude that it is a useful and important measure to be used in concert with gene co-expression correlation for further insights into the characteristics of proteins in the context of their interaction network. PMID:22369639

  11. Chilling Affects Phytohormone and Post-Embryonic Development Pathways during Bud Break and Fruit Set in Apple (Malus domestica Borkh.)

    PubMed Central

    Kumar, Gulshan; Gupta, Khushboo; Pathania, Shivalika; Swarnkar, Mohit Kumar; Rattan, Usha Kumari; Singh, Gagandeep; Sharma, Ram Kumar; Singh, Anil Kumar

    2017-01-01

    The availability of sufficient chilling during bud dormancy plays an important role in the subsequent yield and quality of apple fruit, whereas, insufficient chilling availability negatively impacts the apple production. The transcriptome profiling during bud dormancy release and initial fruit set under low and high chill conditions was performed using RNA-seq. The comparative high number of differentially expressed genes during bud break and fruit set under high chill condition indicates that chilling availability was associated with transcriptional reorganization. The comparative analysis reveals the differential expression of genes involved in phytohormone metabolism, particularly for Abscisic acid, gibberellic acid, ethylene, auxin and cytokinin. The expression of Dormancy Associated MADS-box, Flowering Locus C-like, Flowering Locus T-like and Terminal Flower 1-like genes was found to be modulated under differential chilling. The co-expression network analysis indentified two high chill specific modules that were found to be enriched for “post-embryonic development” GO terms. The network analysis also identified hub genes including Early flowering 7, RAF10, ZEP4 and F-box, which may be involved in regulating chilling-mediated dormancy release and fruit set. The results of transcriptome and co-expression network analysis indicate that chilling availability majorly regulates phytohormone-related pathways and post-embryonic development during bud break. PMID:28198417

  12. Identification of a negative regulator of gibberellin action, HvSPY, in barley.

    PubMed Central

    Robertson, M; Swain, S M; Chandler, P M; Olszewski, N E

    1998-01-01

    To broaden our understanding of the molecular mechanisms of gibberellin (GA) action, we isolated a spindly clone (HvSPY) from barley cultivar Himalaya and tested whether the HvSPY protein would modulate GA action in barley aleurone. The HvSPY cDNA showed high sequence identity to Arabidopsis SPY along its entire length, and the barley protein functionally complemented the spy-3 mutation. HvSPY and SPY proteins showed sequence relatedness with animal O-linked N-acetylglucosamine transferases (OGTs), suggesting that they may also have OGT activity. HvSPY has a locus distinct from that of Sln, a mutation that causes the constitutive GA responses of slender barley, which phenotypically resembles Arabidopsis spy mutants. The possibility that the HvSPY gene encodes a negative regulator of GA action was tested by expressing HvSPY in a barley aleurone transient assay system. HvSPY coexpression largely abolished GA3-induced activity of an alpha-amylase promoter. Surprisingly, HvSPY coexpression increased reporter gene activity from an abscisic acid (ABA)-inducible gene promoter (dehydrin), even in the absence of exogenous ABA. These results show that HvSPY modulates the transcriptional activities of two hormonally regulated promoters: negatively for a GA-induced promoter and positively for an ABA-induced promoter. PMID:9634587

  13. Transcriptomics analysis of hulless barley during grain development with a focus on starch biosynthesis.

    PubMed

    Tang, Yawei; Zeng, Xingquan; Wang, Yulin; Bai, Lijun; Xu, Qijun; Wei, Zexiu; Yuan, Hongjun; Nyima, Tashi

    2017-01-01

    Hulless barley, with its unique nutritional value and potential health benefits, has increasingly attracted attentions in recent years. However, the transcription dynamics during hulless barley grain development is not well understood. In the present study, we investigated the transcriptome changes during barley grain development using Illumina paired-end RNA-sequencing. Two datasets of the developing grain transcriptomes from two barley landraces with the differential seed starch synthesis traits were generated, and comparative transcriptome approach in both genotypes was performed. The results showed that 38 differentially expressed genes (DEGs) were found co-modulated in both genotypes during the barley grain development. Of those, the proteins encoded by most of those DGEs were found, such as alpha-amylase-related proteins, lipid-transfer protein, homeodomain leucine zipper (HD-Zip), NUCLEAR FACTOR-Y, subunit B (NF-YBs), as well as MYB transcription factors. More interestingly, two genes Hvulgare_GLEAN_10012370 and Hvulgare_GLEAN_10021199 encoding SuSy, AGPase (Hvulgare_GLEAN_10033640 and Hvulgare_GLEAN_10056301), as well as SBE2b (Hvulgare_GLEAN_10018352) were found to significantly contribute to the regulatory mechanism during grain development in both genotypes. Moreover, six co-expression modules associated with specific biological processes or pathways (M1 to M6) were identified by consensus co-expression network. Significantly enriched pathways of those module genes showed difference in both genotypes. These results will expand our understanding of the complex molecular mechanism of starch synthesis during barley grain development.

  14. Use of transcriptome sequencing to understand the pistillate flowering in hickory (Carya cathayensis Sarg.).

    PubMed

    Huang, You-Jun; Liu, Li-Li; Huang, Jian-Qin; Wang, Zheng-Jia; Chen, Fang-Fang; Zhang, Qi-Xiang; Zheng, Bing-Song; Chen, Ming

    2013-10-10

    Different from herbaceous plants, the woody plants undergo a long-period vegetative stage to achieve floral transition. They then turn into seasonal plants, flowering annually. In this study, a preliminary model of gene regulations for seasonal pistillate flowering in hickory (Carya cathayensis) was proposed. The genome-wide dynamic transcriptome was characterized via the joint-approach of RNA sequencing and microarray analysis. Differential transcript abundance analysis uncovered the dynamic transcript abundance patterns of flowering correlated genes and their major functions based on Gene Ontology (GO) analysis. To explore pistillate flowering mechanism in hickory, a comprehensive flowering gene regulatory network based on Arabidopsis thaliana was constructed by additional literature mining. A total of 114 putative flowering or floral genes including 31 with differential transcript abundance were identified in hickory. The locations, functions and dynamic transcript abundances were analyzed in the gene regulatory networks. A genome-wide co-expression network for the putative flowering or floral genes shows three flowering regulatory modules corresponding to response to light abiotic stimulus, cold stress, and reproductive development process, respectively. Totally 27 potential flowering or floral genes were recruited which are meaningful to understand the hickory specific seasonal flowering mechanism better. Flowering event of pistillate flower bud in hickory is triggered by several pathways synchronously including the photoperiod, autonomous, vernalization, gibberellin, and sucrose pathway. Totally 27 potential flowering or floral genes were recruited from the genome-wide co-expression network function module analysis. Moreover, the analysis provides a potential FLC-like gene based vernalization pathway and an 'AC' model for pistillate flower development in hickory. This work provides an available framework for pistillate flower development in hickory, which is significant for insight into regulation of flowering and floral development of woody plants.

  15. Use of transcriptome sequencing to understand the pistillate flowering in hickory (Carya cathayensis Sarg.)

    PubMed Central

    2013-01-01

    Background Different from herbaceous plants, the woody plants undergo a long-period vegetative stage to achieve floral transition. They then turn into seasonal plants, flowering annually. In this study, a preliminary model of gene regulations for seasonal pistillate flowering in hickory (Carya cathayensis) was proposed. The genome-wide dynamic transcriptome was characterized via the joint-approach of RNA sequencing and microarray analysis. Results Differential transcript abundance analysis uncovered the dynamic transcript abundance patterns of flowering correlated genes and their major functions based on Gene Ontology (GO) analysis. To explore pistillate flowering mechanism in hickory, a comprehensive flowering gene regulatory network based on Arabidopsis thaliana was constructed by additional literature mining. A total of 114 putative flowering or floral genes including 31 with differential transcript abundance were identified in hickory. The locations, functions and dynamic transcript abundances were analyzed in the gene regulatory networks. A genome-wide co-expression network for the putative flowering or floral genes shows three flowering regulatory modules corresponding to response to light abiotic stimulus, cold stress, and reproductive development process, respectively. Totally 27 potential flowering or floral genes were recruited which are meaningful to understand the hickory specific seasonal flowering mechanism better. Conclusions Flowering event of pistillate flower bud in hickory is triggered by several pathways synchronously including the photoperiod, autonomous, vernalization, gibberellin, and sucrose pathway. Totally 27 potential flowering or floral genes were recruited from the genome-wide co-expression network function module analysis. Moreover, the analysis provides a potential FLC-like gene based vernalization pathway and an 'AC’ model for pistillate flower development in hickory. This work provides an available framework for pistillate flower development in hickory, which is significant for insight into regulation of flowering and floral development of woody plants. PMID:24106755

  16. Gene coexpression network analysis of fruit transcriptomes uncovers a possible mechanistically distinct class of sugar/acid ratio-associated genes in sweet orange.

    PubMed

    Qiao, Liang; Cao, Minghao; Zheng, Jian; Zhao, Yihong; Zheng, Zhi-Liang

    2017-10-30

    The ratio of sugars to organic acids, two of the major metabolites in fleshy fruits, has been considered the most important contributor to fruit sweetness. Although accumulation of sugars and acids have been extensively studied, whether plants evolve a mechanism to maintain, sense or respond to the fruit sugar/acid ratio remains a mystery. In a prior study, we used an integrated systems biology tool to identify a group of 39 acid-associated genes from the fruit transcriptomes in four sweet orange varieties (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). We reanalyzed the prior sweet orange fruit transcriptome data, leading to the identification of 72 genes highly correlated with the fruit sugar/acid ratio. The majority of these sugar/acid ratio-related genes are predicted to be involved in regulatory functions such as transport, signaling and transcription or encode enzymes involved in metabolism. Surprisingly, only three of these sugar/acid ratio-correlated genes are weakly correlated with sugar level and none of them overlaps with the acid-associated genes. Weighted Gene Coexpression Network Analysis (WGCNA) has revealed that these genes belong to four modules, Blue, Grey, Brown and Turquoise, with the former two modules being unique to the sugar/acid ratio control. Our results indicate that orange fruits contain a possible mechanistically distinct class of genes that may potentially be involved in maintaining fruit sugar/acid ratios and/or responding to the cellular sugar/acid ratio status. Therefore, our analysis of orange transcriptomes provides an intriguing insight into the potentially novel genetic or molecular mechanisms controlling the sugar/acid ratio in fruits.

  17. Disease networks. Uncovering disease-disease relationships through the incomplete interactome.

    PubMed

    Menche, Jörg; Sharma, Amitabh; Kitsak, Maksim; Ghiassian, Susan Dina; Vidal, Marc; Loscalzo, Joseph; Barabási, Albert-László

    2015-02-20

    According to the disease module hypothesis, the cellular components associated with a disease segregate in the same neighborhood of the human interactome, the map of biologically relevant molecular interactions. Yet, given the incompleteness of the interactome and the limited knowledge of disease-associated genes, it is not obvious if the available data have sufficient coverage to map out modules associated with each disease. Here we derive mathematical conditions for the identifiability of disease modules and show that the network-based location of each disease module determines its pathobiological relationship to other diseases. For example, diseases with overlapping network modules show significant coexpression patterns, symptom similarity, and comorbidity, whereas diseases residing in separated network neighborhoods are phenotypically distinct. These tools represent an interactome-based platform to predict molecular commonalities between phenotypically related diseases, even if they do not share primary disease genes. Copyright © 2015, American Association for the Advancement of Science.

  18. Connecting genes, coexpression modules, and molecular signatures to environmental stress phenotypes in plants

    PubMed Central

    Weston, David J; Gunter, Lee E; Rogers, Alistair; Wullschleger, Stan D

    2008-01-01

    Background One of the eminent opportunities afforded by modern genomic technologies is the potential to provide a mechanistic understanding of the processes by which genetic change translates to phenotypic variation and the resultant appearance of distinct physiological traits. Indeed much progress has been made in this area, particularly in biomedicine where functional genomic information can be used to determine the physiological state (e.g., diagnosis) and predict phenotypic outcome (e.g., patient survival). Ecology currently lacks an analogous approach where genomic information can be used to diagnose the presence of a given physiological state (e.g., stress response) and then predict likely phenotypic outcomes (e.g., stress duration and tolerance, fitness). Results Here, we demonstrate that a compendium of genomic signatures can be used to classify the plant abiotic stress phenotype in Arabidopsis according to the architecture of the transcriptome, and then be linked with gene coexpression network analysis to determine the underlying genes governing the phenotypic response. Using this approach, we confirm the existence of known stress responsive pathways and marker genes, report a common abiotic stress responsive transcriptome and relate phenotypic classification to stress duration. Conclusion Linking genomic signatures to gene coexpression analysis provides a unique method of relating an observed plant phenotype to changes in gene expression that underlie that phenotype. Such information is critical to current and future investigations in plant biology and, in particular, to evolutionary ecology, where a mechanistic understanding of adaptive physiological responses to abiotic stress can provide researchers with a tool of great predictive value in understanding species and population level adaptation to climate change. PMID:18248680

  19. oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes

    PubMed Central

    Ho Sui, Shannan J.; Mortimer, James R.; Arenillas, David J.; Brumm, Jochen; Walsh, Christopher J.; Kennedy, Brian P.; Wasserman, Wyeth W.

    2005-01-01

    Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes. PMID:15933209

  20. Differentiated transcriptional signatures in the maize landraces of Chiapas, Mexico.

    PubMed

    Kost, Matthew A; Perales, Hugo R; Wijeratne, Saranga; Wijeratne, Asela J; Stockinger, Eric; Mercer, Kristin L

    2017-09-08

    Landrace farmers are the keepers of crops locally adapted to the environments where they are cultivated. Patterns of diversity across the genome can provide signals of past evolution in the face of abiotic and biotic change. Understanding this rich genetic resource is imperative especially since diversity can provide agricultural security as climate continues to shift. Here we employ RNA sequencing (RNA-seq) to understand the role that conditions that vary across a landscape may have played in shaping genetic diversity in the maize landraces of Chiapas, Mexico. We collected landraces from three distinct elevational zones and planted them in a midland common garden. Early season leaf tissue was collected for RNA-seq and we performed weighted gene co-expression network analysis (WGCNA). We then used association analysis between landrace co-expression module expression values and environmental parameters of landrace origin to elucidate genes and gene networks potentially shaped by environmental factors along our study gradient. Elevation of landrace origin affected the transcriptome profiles. Two co-expression modules were highly correlated with temperature parameters of landrace origin and queries into their 'hub' genes suggested that temperature may have led to differentiation among landraces in hormone biosynthesis/signaling and abiotic and biotic stress responses. We identified several 'hub' transcription factors and kinases as candidates for the regulation of these responses. These findings indicate that natural selection may influence the transcriptomes of crop landraces along an elevational gradient in a major diversity center, and provide a foundation for exploring the genetic basis of local adaptation. While we cannot rule out the role of neutral evolutionary forces in the patterns we have identified, combining whole transcriptome sequencing technologies, established bioinformatics techniques, and common garden experimentation can powerfully elucidate structure of adaptive diversity across a varied landscape. Ultimately, gaining such understanding can facilitate the conservation and strategic utilization of crop genetic diversity in a time of climate change.

  1. Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex

    PubMed Central

    Hulsman, Marc; Lelieveldt, Boudewijn P. F.; de Ridder, Jeroen; Reinders, Marcel

    2015-01-01

    The three dimensional conformation of the genome in the cell nucleus influences important biological processes such as gene expression regulation. Recent studies have shown a strong correlation between chromatin interactions and gene co-expression. However, predicting gene co-expression from frequent long-range chromatin interactions remains challenging. We address this by characterizing the topology of the cortical chromatin interaction network using scale-aware topological measures. We demonstrate that based on these characterizations it is possible to accurately predict spatial co-expression between genes in the mouse cortex. Consistent with previous findings, we find that the chromatin interaction profile of a gene-pair is a good predictor of their spatial co-expression. However, the accuracy of the prediction can be substantially improved when chromatin interactions are described using scale-aware topological measures of the multi-resolution chromatin interaction network. We conclude that, for co-expression prediction, it is necessary to take into account different levels of chromatin interactions ranging from direct interaction between genes (i.e. small-scale) to chromatin compartment interactions (i.e. large-scale). PMID:25965262

  2. Co-LncRNA: investigating the lncRNA combinatorial effects in GO annotations and KEGG pathways based on human RNA-Seq data.

    PubMed

    Zhao, Zheng; Bai, Jing; Wu, Aiwei; Wang, Yuan; Zhang, Jinwen; Wang, Zishan; Li, Yongsheng; Xu, Juan; Li, Xia

    2015-01-01

    Long non-coding RNAs (lncRNAs) are emerging as key regulators of diverse biological processes and diseases. However, the combinatorial effects of these molecules in a specific biological function are poorly understood. Identifying co-expressed protein-coding genes of lncRNAs would provide ample insight into lncRNA functions. To facilitate such an effort, we have developed Co-LncRNA, which is a web-based computational tool that allows users to identify GO annotations and KEGG pathways that may be affected by co-expressed protein-coding genes of a single or multiple lncRNAs. LncRNA co-expressed protein-coding genes were first identified in publicly available human RNA-Seq datasets, including 241 datasets across 6560 total individuals representing 28 tissue types/cell lines. Then, the lncRNA combinatorial effects in a given GO annotations or KEGG pathways are taken into account by the simultaneous analysis of multiple lncRNAs in user-selected individual or multiple datasets, which is realized by enrichment analysis. In addition, this software provides a graphical overview of pathways that are modulated by lncRNAs, as well as a specific tool to display the relevant networks between lncRNAs and their co-expressed protein-coding genes. Co-LncRNA also supports users in uploading their own lncRNA and protein-coding gene expression profiles to investigate the lncRNA combinatorial effects. It will be continuously updated with more human RNA-Seq datasets on an annual basis. Taken together, Co-LncRNA provides a web-based application for investigating lncRNA combinatorial effects, which could shed light on their biological roles and could be a valuable resource for this community. Database URL: http://www.bio-bigdata.com/Co-LncRNA/. © The Author(s) 2015. Published by Oxford University Press.

  3. A proteomic network approach across the ALS-FTD disease spectrum resolves clinical phenotypes and genetic vulnerability in human brain.

    PubMed

    Umoh, Mfon E; Dammer, Eric B; Dai, Jingting; Duong, Duc M; Lah, James J; Levey, Allan I; Gearing, Marla; Glass, Jonathan D; Seyfried, Nicholas T

    2018-01-01

    Amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) are neurodegenerative diseases with overlap in clinical presentation, neuropathology, and genetic underpinnings. The molecular basis for the overlap of these disorders is not well established. We performed a comparative unbiased mass spectrometry-based proteomic analysis of frontal cortical tissues from postmortem cases clinically defined as ALS, FTD, ALS and FTD (ALS/FTD), and controls. We also included a subset of patients with the C9orf72 expansion mutation, the most common genetic cause of both ALS and FTD Our systems-level analysis of the brain proteome integrated both differential expression and co-expression approaches to assess the relationship of these differences to clinical and pathological phenotypes. Weighted co-expression network analysis revealed 15 modules of co-expressed proteins, eight of which were significantly different across the ALS-FTD disease spectrum. These included modules associated with RNA binding proteins, synaptic transmission, and inflammation with cell-type specificity that showed correlation with TDP-43 pathology and cognitive dysfunction. Modules were also examined for their overlap with TDP-43 protein-protein interactions, revealing one module enriched with RNA-binding proteins and other causal ALS genes that increased in FTD/ALS and FTD cases. A module enriched with astrocyte and microglia proteins was significantly increased in ALS cases carrying the C9orf72 mutation compared to sporadic ALS cases, suggesting that the genetic expansion is associated with inflammation in the brain even without clinical evidence of dementia. Together, these findings highlight the utility of integrative systems-level proteomic approaches to resolve clinical phenotypes and genetic mechanisms underlying the ALS-FTD disease spectrum in human brain. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.

  4. A systems approach identifies networks and genes linking sleep and stress: implications for neuropsychiatric disorders.

    PubMed

    Jiang, Peng; Scarpa, Joseph R; Fitzpatrick, Karrie; Losic, Bojan; Gao, Vance D; Hao, Ke; Summa, Keith C; Yang, He S; Zhang, Bin; Allada, Ravi; Vitaterna, Martha H; Turek, Fred W; Kasarskis, Andrew

    2015-05-05

    Sleep dysfunction and stress susceptibility are comorbid complex traits that often precede and predispose patients to a variety of neuropsychiatric diseases. Here, we demonstrate multilevel organizations of genetic landscape, candidate genes, and molecular networks associated with 328 stress and sleep traits in a chronically stressed population of 338 (C57BL/6J × A/J) F2 mice. We constructed striatal gene co-expression networks, revealing functionally and cell-type-specific gene co-regulations important for stress and sleep. Using a composite ranking system, we identified network modules most relevant for 15 independent phenotypic categories, highlighting a mitochondria/synaptic module that links sleep and stress. The key network regulators of this module are overrepresented with genes implicated in neuropsychiatric diseases. Our work suggests that the interplay among sleep, stress, and neuropathology emerges from genetic influences on gene expression and their collective organization through complex molecular networks, providing a framework for interrogating the mechanisms underlying sleep, stress susceptibility, and related neuropsychiatric disorders. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  5. Construct and Compare Gene Coexpression Networks with DAPfinder and DAPview.

    PubMed

    Skinner, Jeff; Kotliarov, Yuri; Varma, Sudhir; Mine, Karina L; Yambartsev, Anatoly; Simon, Richard; Huyen, Yentram; Morgun, Andrey

    2011-07-14

    DAPfinder and DAPview are novel BRB-ArrayTools plug-ins to construct gene coexpression networks and identify significant differences in pairwise gene-gene coexpression between two phenotypes. Each significant difference in gene-gene association represents a Differentially Associated Pair (DAP). Our tools include several choices of filtering methods, gene-gene association metrics, statistical testing methods and multiple comparison adjustments. Network results are easily displayed in Cytoscape. Analyses of glioma experiments and microarray simulations demonstrate the utility of these tools. DAPfinder is a new friendly-user tool for reconstruction and comparison of biological networks.

  6. Employing conservation of co-expression to improve functional inference

    PubMed Central

    Daub, Carsten O; Sonnhammer, Erik LL

    2008-01-01

    Background Observing co-expression between genes suggests that they are functionally coupled. Co-expression of orthologous gene pairs across species may improve function prediction beyond the level achieved in a single species. Results We used orthology between genes of the three different species S. cerevisiae, D. melanogaster, and C. elegans to combine co-expression across two species at a time. This led to increased function prediction accuracy when we incorporated expression data from either of the other two species and even further increased when conservation across both of the two other species was considered at the same time. Employing the conservation across species to incorporate abundant model organism data for the prediction of protein interactions in poorly characterized species constitutes a very powerful annotation method. Conclusion To be able to employ the most suitable co-expression distance measure for our analysis, we evaluated the ability of four popular gene co-expression distance measures to detect biologically relevant interactions between pairs of genes. For the expression datasets employed in our co-expression conservation analysis above, we used the GO and the KEGG PATHWAY databases as gold standards. While the differences between distance measures were small, Spearman correlation showed to give most robust results. PMID:18808668

  7. A powerful nonparametric method for detecting differentially co-expressed genes: distance correlation screening and edge-count test.

    PubMed

    Zhang, Qingyang

    2018-05-16

    Differential co-expression analysis, as a complement of differential expression analysis, offers significant insights into the changes in molecular mechanism of different phenotypes. A prevailing approach to detecting differentially co-expressed genes is to compare Pearson's correlation coefficients in two phenotypes. However, due to the limitations of Pearson's correlation measure, this approach lacks the power to detect nonlinear changes in gene co-expression which is common in gene regulatory networks. In this work, a new nonparametric procedure is proposed to search differentially co-expressed gene pairs in different phenotypes from large-scale data. Our computational pipeline consisted of two main steps, a screening step and a testing step. The screening step is to reduce the search space by filtering out all the independent gene pairs using distance correlation measure. In the testing step, we compare the gene co-expression patterns in different phenotypes by a recently developed edge-count test. Both steps are distribution-free and targeting nonlinear relations. We illustrate the promise of the new approach by analyzing the Cancer Genome Atlas data and the METABRIC data for breast cancer subtypes. Compared with some existing methods, the new method is more powerful in detecting nonlinear type of differential co-expressions. The distance correlation screening can greatly improve computational efficiency, facilitating its application to large data sets.

  8. Coexpression network analysis of the genes regulated by two types of resistance responses to powdery mildew in wheat

    PubMed Central

    Zhang, Juncheng; Zheng, Hongyuan; Li, Yiwen; Li, Hongjie; Liu, Xin; Qin, Huanju; Dong, Lingli; Wang, Daowen

    2016-01-01

    Powdery mildew disease caused by Blumeria graminis f. sp. tritici (Bgt) inflicts severe economic losses in wheat crops. A systematic understanding of the molecular mechanisms involved in wheat resistance to Bgt is essential for effectively controlling the disease. Here, using the diploid wheat Triticum urartu as a host, the genes regulated by immune (IM) and hypersensitive reaction (HR) resistance responses to Bgt were investigated through transcriptome sequencing. Four gene coexpression networks (GCNs) were developed using transcriptomic data generated for 20 T. urartu accessions showing IM, HR or susceptible responses. The powdery mildew resistance regulated (PMRR) genes whose expression was significantly correlated with Bgt resistance were identified, and they tended to be hubs and enriched in six major modules. A wide occurrence of negative regulation of PMRR genes was observed. Three new candidate immune receptor genes (TRIUR3_13045, TRIUR3_01037 and TRIUR3_06195) positively associated with Bgt resistance were discovered. Finally, the involvement of TRIUR3_01037 in Bgt resistance was tentatively verified through cosegregation analysis in a F2 population and functional expression assay in Bgt susceptible leaf cells. This research provides insights into the global network properties of PMRR genes. Potential molecular differences between IM and HR resistance responses to Bgt are discussed. PMID:27033636

  9. Functional signaling and gene regulatory networks between the oocyte and the surrounding cumulus cells.

    PubMed

    Biase, Fernando H; Kimble, Katelyn M

    2018-05-10

    The maturation and successful acquisition of developmental competence by an oocyte, the female gamete, during folliculogenesis is highly dependent on molecular interactions with somatic cells. Most of the cellular interactions identified, thus far, are modulated by growth factors, ions or metabolites. We hypothesized that this interaction is also modulated at the transcriptional level, which leads to the formation of gene regulatory networks between the oocyte and cumulus cells. We tested this hypothesis by analyzing transcriptome data from single oocytes and the surrounding cumulus cells collected from antral follicles employing an analytical framework to determine interdependencies at the transcript level. We overlapped our transcriptome data with putative protein-protein interactions and identified hundreds of ligand-receptor pairs that can transduce paracrine signaling between an oocyte and cumulus cells. We determined that 499 ligand-encoding genes expressed in oocytes and cumulus cells are functionally associated with transcription regulation (FDR < 0.05). Ligand-encoding genes with specific expression in oocytes or cumulus cells were enriched for biological functions that are likely associated with the coordinated formation of transzonal projections from cumulus cells that reach the oocyte's membrane. Thousands of gene pairs exhibit significant linear co-expression (absolute correlation > 0.85, FDR < 1.8 × 10 - 5 ) patterns between oocytes and cumulus cells. Hundreds of co-expressing genes showed clustering patterns associated with biological functions (FDR < 0.5) necessary for a coordinated function between the oocyte and cumulus cells during folliculogenesis (i.e. regulation of transcription, translation, apoptosis, cell differentiation and transport). Our analyses revealed a complex and functional gene regulatory circuit between the oocyte and surrounding cumulus cells. The regulatory profile of each cumulus-oocyte complex is likely associated with the oocytes' developmental potential to derive an embryo.

  10. Rapid and tunable method to temporally control gene editing based on conditional Cas9 stabilization. | Office of Cancer Genomics

    Cancer.gov

    The CRISPR/Cas9 system is a powerful tool for studying gene function. Here, we describe a method that allows temporal control of CRISPR/Cas9 activity based on conditional Cas9 destabilization. We demonstrate that fusing an FKBP12-derived destabilizing domain to Cas9 (DD-Cas9) enables conditional Cas9 expression and temporal control of gene editing in the presence of an FKBP12 synthetic ligand. This system can be easily adapted to co-express, from the same promoter, DD-Cas9 with any other gene of interest without co-modulation of the latter.

  11. Interhemispheric gene expression differences in the cerebral cortex of humans and macaque monkeys.

    PubMed

    Muntané, Gerard; Santpere, Gabriel; Verendeev, Andrey; Seeley, William W; Jacobs, Bob; Hopkins, William D; Navarro, Arcadi; Sherwood, Chet C

    2017-09-01

    Handedness and language are two well-studied examples of asymmetrical brain function in humans. Approximately 90% of humans exhibit a right-hand preference, and the vast majority shows left-hemisphere dominance for language function. Although genetic models of human handedness and language have been proposed, the actual gene expression differences between cerebral hemispheres in humans remain to be fully defined. In the present study, gene expression profiles were examined in both hemispheres of three cortical regions involved in handedness and language in humans and their homologues in rhesus macaques: ventrolateral prefrontal cortex, posterior superior temporal cortex (STC), and primary motor cortex. Although the overall pattern of gene expression was very similar between hemispheres in both humans and macaques, weighted gene correlation network analysis revealed gene co-expression modules associated with hemisphere, which are different among the three cortical regions examined. Notably, a receptor-enriched gene module in STC was particularly associated with hemisphere and showed different expression levels between hemispheres only in humans.

  12. Whole-genome transcriptomic insights into protective molecular mechanisms in metabolically healthy obese African Americans.

    PubMed

    Gaye, Amadou; Doumatey, Ayo P; Davis, Sharon K; Rotimi, Charles N; Gibbons, Gary H

    2018-01-01

    Several clinical guidelines have been proposed to distinguish metabolically healthy obesity (MHO) from other subgroups of obesity but the molecular mechanisms by which MHO individuals remain metabolically healthy despite having a high fat mass are yet to be elucidated. We conducted the first whole blood transcriptomic study designed to identify specific sets of genes that might shed novel insights into the molecular mechanisms that protect or delay the occurrence of obesity-related co-morbidities in MHO. The study included 29 African-American obese individuals, 8 MHO and 21 metabolically abnormal obese (MAO). Unbiased transcriptome-wide network analysis was carried out to identify molecular modules of co-expressed genes that are collectively associated with MHO. Network analysis identified a group of 23 co-expressed genes, including ribosomal protein genes (RPs), which were significantly downregulated in MHO subjects. The three pathways enriched in the group of co-expressed genes are EIF2 signaling, regulation of eIF4 and p70S6K signaling, and mTOR signaling. The expression of ten of the RPs collectively predicted MHO status with an area under the curve of 0.81. Triglycerides/HDL (TG/HDL) ratio, an index of insulin resistance, was the best predictor of the expression of genes in the MHO group. The higher TG/HDL values observed in the MAO subjects may underlie the activation of endoplasmic reticulum (ER) and related-stress pathways that lead to a chronic inflammatory state. In summary, these findings suggest that controlling ER stress and/or ribosomal stress by downregulating RPs or controlling TG/HDL ratio may represent effective strategies to prevent or delay the occurrence of metabolic disorders in obese individuals.

  13. Single-cell characterization of metabolic switching in the sugar phosphotransferase system of Escherichia coli.

    PubMed

    Westermayer, Sonja A; Fritz, Georg; Gutiérrez, Joaquín; Megerle, Judith A; Weißl, Mira P S; Schnetz, Karin; Gerland, Ulrich; Rädler, Joachim O

    2016-05-01

    The utilization of several sugars in Escherichia coli is regulated by the Phosphotransferase System (PTS), in which diverse sugar utilization modules compete for phosphoryl flux from the general PTS proteins. Existing theoretical work predicts a winner-take-all outcome when this flux limits carbon uptake. To date, no experimental work has interrogated competing PTS uptake modules with single-cell resolution. Using time-lapse microscopy in perfused microchannels, we analyzed the competition between N-acetyl-glucosamine and sorbitol, as representative PTS sugars, by measuring both the expression of their utilization systems and the concomitant impact of sugar utilization on growth rates. We find two distinct regimes: hierarchical usage of the carbohydrates, and co-expression of the genes for both systems. Simulations of a mathematical model incorporating asymmetric sugar quality reproduce our metabolic phase diagram, indicating that under conditions of nonlimiting phosphate flux, co-expression is due to uncoupling of both sugar utilization systems. Our model reproduces hierarchical winner-take-all behaviour and stochastic co-expression, and predicts the switching between both strategies as a function of available phosphate flux. Hence, experiments and theory both suggest that PTS sugar utilization involves not only switching between the sugars utilized but also switching of utilization strategies to accommodate prevailing environmental conditions. © 2016 John Wiley & Sons Ltd.

  14. Environment-dependent striatal gene expression in the BACHD rat model for Huntington disease.

    PubMed

    Novati, Arianna; Hentrich, Thomas; Wassouf, Zinah; Weber, Jonasz J; Yu-Taeger, Libo; Déglon, Nicole; Nguyen, Huu Phuc; Schulze-Hentrich, Julia M

    2018-04-11

    Huntington disease (HD) is an autosomal dominant neurodegenerative disorder caused by a mutation in the huntingtin (HTT) gene which results in progressive neurodegeneration in the striatum, cortex, and eventually most brain areas. Despite being a monogenic disorder, environmental factors influence HD characteristics. Both human and mouse studies suggest that mutant HTT (mHTT) leads to gene expression changes that harbor potential to be modulated by the environment. Yet, the underlying mechanisms integrating environmental cues into the gene regulatory program have remained largely unclear. To better understand gene-environment interactions in the context of mHTT, we employed RNA-seq to examine effects of maternal separation (MS) and environmental enrichment (EE) on striatal gene expression during development of BACHD rats. We integrated our results with striatal consensus modules defined on HTT-CAG length and age-dependent co-expression gene networks to relate the environmental factors with disease progression. While mHTT was the main determinant of expression changes, both MS and EE were capable of modulating these disturbances, resulting in distinctive and in several cases opposing effects of MS and EE on consensus modules. This bivalent response to maternal separation and environmental enrichment may aid in explaining their distinct effects observed on disease phenotypes in animal models of HD and related neurodegenerative disorders.

  15. MIrExpress: A Database for Gene Coexpression Correlation in Immune Cells Based on Mutual Information and Pearson Correlation

    PubMed Central

    Wang, Luman; Mo, Qiaochu; Wang, Jianxin

    2015-01-01

    Most current gene coexpression databases support the analysis for linear correlation of gene pairs, but not nonlinear correlation of them, which hinders precisely evaluating the gene-gene coexpression strengths. Here, we report a new database, MIrExpress, which takes advantage of the information theory, as well as the Pearson linear correlation method, to measure the linear correlation, nonlinear correlation, and their hybrid of cell-specific gene coexpressions in immune cells. For a given gene pair or probe set pair input by web users, both mutual information (MI) and Pearson correlation coefficient (r) are calculated, and several corresponding values are reported to reflect their coexpression correlation nature, including MI and r values, their respective rank orderings, their rank comparison, and their hybrid correlation value. Furthermore, for a given gene, the top 10 most relevant genes to it are displayed with the MI, r, or their hybrid perspective, respectively. Currently, the database totally includes 16 human cell groups, involving 20,283 human genes. The expression data and the calculated correlation results from the database are interactively accessible on the web page and can be implemented for other related applications and researches. PMID:26881263

  16. MIrExpress: A Database for Gene Coexpression Correlation in Immune Cells Based on Mutual Information and Pearson Correlation.

    PubMed

    Wang, Luman; Mo, Qiaochu; Wang, Jianxin

    2015-01-01

    Most current gene coexpression databases support the analysis for linear correlation of gene pairs, but not nonlinear correlation of them, which hinders precisely evaluating the gene-gene coexpression strengths. Here, we report a new database, MIrExpress, which takes advantage of the information theory, as well as the Pearson linear correlation method, to measure the linear correlation, nonlinear correlation, and their hybrid of cell-specific gene coexpressions in immune cells. For a given gene pair or probe set pair input by web users, both mutual information (MI) and Pearson correlation coefficient (r) are calculated, and several corresponding values are reported to reflect their coexpression correlation nature, including MI and r values, their respective rank orderings, their rank comparison, and their hybrid correlation value. Furthermore, for a given gene, the top 10 most relevant genes to it are displayed with the MI, r, or their hybrid perspective, respectively. Currently, the database totally includes 16 human cell groups, involving 20,283 human genes. The expression data and the calculated correlation results from the database are interactively accessible on the web page and can be implemented for other related applications and researches.

  17. Characterization of Chemically Induced Liver Injuries Using Gene Co-Expression Modules

    DTIC Science & Technology

    2014-09-16

    liver Sod2 Fibrosis/cirrhosis Sod2, Hao2, Sult1e1, Got1, Gulo, Obp3, Bdh1 Necrosis Sod2 Liver neoplasms Sod2, Rgn, Anxa2, Car3, Gstp1 Carcinoma Sod2... Gstp1 doi:10.1371/journal.pone.0107230.t006 Figure 10. Validation of external datasets. Scatter plots show the correlation of the log-ratios between

  18. Screening of Critical Genes and MicroRNAs in Blood Samples of Patients with Ruptured Intracranial Aneurysms by Bioinformatic Analysis of Gene Expression Data.

    PubMed

    Bo, Lijuan; Wei, Bo; Wang, Zhanfeng; Kong, Daliang; Gao, Zheng; Miao, Zhuang

    2017-09-20

    BACKGROUND This study aimed to identify more potential genes and miRNAs associated with the pathogenesis of intracranial aneurysms (IAs). MATERIAL AND METHODS The dataset of GSE36791 (accession number) was downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were screened for in the blood samples from patients with ruptured IAs and controls, followed by functional and pathway enrichment analyses. In addition, gene co-expression network was constructed and significant modules were extracted from the network by WGCNA R package. Screening for miRNAs that could regulate DEGs in the modules was performed and an analysis of regulatory relationships was conducted. RESULTS A total of 304 DEGs (167 up-regulated and 137 down-regulated genes) were screened for in blood samples from patients with ruptured IAs compared with those from controls. Functional enrichment analysis showed that the up-regulated genes were mainly associated with immune response and the down-regulated DEGs were mainly concerned with the structure of ribosome and translation. Besides, six functional modules were significantly identified, including four modules enriched by up-regulated genes and two modules enriched by down-regulated genes. Thereinto, the blue, yellow, and turquoise modules of up-regulated genes were all linked with immune response. Additionally, 16 miRNAs were predicted to regulate DEGs in the three modules associated with immune response, such as hsa-miR-1304, hsa-miR-33b, hsa-miR-125b, and hsa-miR-125a-5p. CONCLUSIONS Several genes and miRNAs (such as miR-1304, miR-33b, IRS2 and KCNJ2) may take part in the pathogenesis of IAs.

  19. Large clusters of co-expressed genes in the Drosophila genome.

    PubMed

    Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I

    2002-12-12

    Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.

  20. Using the 2A Protein Coexpression System: Multicistronic 2A Vectors Expressing Gene(s) of Interest and Reporter Proteins.

    PubMed

    Luke, Garry A; Ryan, Martin D

    2018-01-01

    To date, a huge range of different proteins-many with cotranslational and posttranslational subcellular localization signals-have been coexpressed together with various reporter proteins in vitro and in vivo using 2A peptides. The pros and cons of 2A co-expression technology are considered below, followed by a simple example of a "how to" protocol to concatenate multiple genes of interest, together with a reporter gene, into a single gene linked via 2As for easy identification or selection of transduced cells.

  1. TRACING CO-REGULATORY NETWORK DYNAMICS IN NOISY, SINGLE-CELL TRANSCRIPTOME TRAJECTORIES.

    PubMed

    Cordero, Pablo; Stuart, Joshua M

    2017-01-01

    The availability of gene expression data at the single cell level makes it possible to probe the molecular underpinnings of complex biological processes such as differentiation and oncogenesis. Promising new methods have emerged for reconstructing a progression 'trajectory' from static single-cell transcriptome measurements. However, it remains unclear how to adequately model the appreciable level of noise in these data to elucidate gene regulatory network rewiring. Here, we present a framework called Single Cell Inference of MorphIng Trajectories and their Associated Regulation (SCIMITAR) that infers progressions from static single-cell transcriptomes by employing a continuous parametrization of Gaussian mixtures in high-dimensional curves. SCIMITAR yields rich models from the data that highlight genes with expression and co-expression patterns that are associated with the inferred progression. Further, SCIMITAR extracts regulatory states from the implicated trajectory-evolvingco-expression networks. We benchmark the method on simulated data to show that it yields accurate cell ordering and gene network inferences. Applied to the interpretation of a single-cell human fetal neuron dataset, SCIMITAR finds progression-associated genes in cornerstone neural differentiation pathways missed by standard differential expression tests. Finally, by leveraging the rewiring of gene-gene co-expression relations across the progression, the method reveals the rise and fall of co-regulatory states and trajectory-dependent gene modules. These analyses implicate new transcription factors in neural differentiation including putative co-factors for the multi-functional NFAT pathway.

  2. Modeling genome-wide dynamic regulatory network in mouse lungs with influenza infection using high-dimensional ordinary differential equations.

    PubMed

    Wu, Shuang; Liu, Zhi-Ping; Qiu, Xing; Wu, Hulin

    2014-01-01

    The immune response to viral infection is regulated by an intricate network of many genes and their products. The reverse engineering of gene regulatory networks (GRNs) using mathematical models from time course gene expression data collected after influenza infection is key to our understanding of the mechanisms involved in controlling influenza infection within a host. A five-step pipeline: detection of temporally differentially expressed genes, clustering genes into co-expressed modules, identification of network structure, parameter estimate refinement, and functional enrichment analysis, is developed for reconstructing high-dimensional dynamic GRNs from genome-wide time course gene expression data. Applying the pipeline to the time course gene expression data from influenza-infected mouse lungs, we have identified 20 distinct temporal expression patterns in the differentially expressed genes and constructed a module-based dynamic network using a linear ODE model. Both intra-module and inter-module annotations and regulatory relationships of our inferred network show some interesting findings and are highly consistent with existing knowledge about the immune response in mice after influenza infection. The proposed method is a computationally efficient, data-driven pipeline bridging experimental data, mathematical modeling, and statistical analysis. The application to the influenza infection data elucidates the potentials of our pipeline in providing valuable insights into systematic modeling of complicated biological processes.

  3. G-NEST: A gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    USDA-ARS?s Scientific Manuscript database

    In previous studies, gene neighborhoods--spatial clusters of co-expressed genes in the genome--have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Sc...

  4. Gene coexpression measures in large heterogeneous samples using count statistics.

    PubMed

    Wang, Y X Rachel; Waterman, Michael S; Huang, Haiyan

    2014-11-18

    With the advent of high-throughput technologies making large-scale gene expression data readily available, developing appropriate computational tools to process these data and distill insights into systems biology has been an important part of the "big data" challenge. Gene coexpression is one of the earliest techniques developed that is still widely in use for functional annotation, pathway analysis, and, most importantly, the reconstruction of gene regulatory networks, based on gene expression data. However, most coexpression measures do not specifically account for local features in expression profiles. For example, it is very likely that the patterns of gene association may change or only exist in a subset of the samples, especially when the samples are pooled from a range of experiments. We propose two new gene coexpression statistics based on counting local patterns of gene expression ranks to take into account the potentially diverse nature of gene interactions. In particular, one of our statistics is designed for time-course data with local dependence structures, such as time series coupled over a subregion of the time domain. We provide asymptotic analysis of their distributions and power, and evaluate their performance against a wide range of existing coexpression measures on simulated and real data. Our new statistics are fast to compute, robust against outliers, and show comparable and often better general performance.

  5. The chimeric gene CHRFAM7A, a partial duplication of the CHRNA7 gene, is a dominant negative regulator of α7*nAChR function

    PubMed Central

    Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estelle; Bertrand, Daniel; Leonard, Sherry

    2011-01-01

    The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca++, that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [125I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. PMID:21718690

  6. The chimeric gene CHRFAM7A, a partial duplication of the CHRNA7 gene, is a dominant negative regulator of α7*nAChR function.

    PubMed

    Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estele; Bertrand, Daniel; Leonard, Sherry

    2011-10-15

    The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca(2+), that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [(125)I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. Copyright © 2011 Elsevier Inc. All rights reserved.

  7. A molecular network of the aging human brain provides insights into the pathology and cognitive decline of Alzheimer's disease.

    PubMed

    Mostafavi, Sara; Gaiteri, Chris; Sullivan, Sarah E; White, Charles C; Tasaki, Shinya; Xu, Jishu; Taga, Mariko; Klein, Hans-Ulrich; Patrick, Ellis; Komashko, Vitalina; McCabe, Cristin; Smith, Robert; Bradshaw, Elizabeth M; Root, David E; Regev, Aviv; Yu, Lei; Chibnik, Lori B; Schneider, Julie A; Young-Pearse, Tracy L; Bennett, David A; De Jager, Philip L

    2018-06-01

    There is a need for new therapeutic targets with which to prevent Alzheimer's disease (AD), a major contributor to aging-related cognitive decline. Here we report the construction and validation of a molecular network of the aging human frontal cortex. Using RNA sequence data from 478 individuals, we first build a molecular network using modules of coexpressed genes and then relate these modules to AD and its neuropathologic and cognitive endophenotypes. We confirm these associations in two independent AD datasets. We also illustrate the use of the network in prioritizing amyloid- and cognition-associated genes for in vitro validation in human neurons and astrocytes. These analyses based on unique cohorts enable us to resolve the role of distinct cortical modules that have a direct effect on the accumulation of AD pathology from those that have a direct effect on cognitive decline, exemplifying a network approach to complex diseases.

  8. Genome-wide identification and functional prediction of nitrogen-responsive intergenic and intronic long non-coding RNAs in maize (Zea mays L.).

    PubMed

    Lv, Yuanda; Liang, Zhikai; Ge, Min; Qi, Weicong; Zhang, Tifu; Lin, Feng; Peng, Zhaohua; Zhao, Han

    2016-05-11

    Nitrogen (N) is an essential and often limiting nutrient to plant growth and development. Previous studies have shown that the mRNA expressions of numerous genes are regulated by nitrogen supplies; however, little is known about the expressed non-coding elements, for example long non-coding RNAs (lncRNAs) that control the response of maize (Zea mays L.) to nitrogen. LncRNAs are a class of non-coding RNAs larger than 200 bp, which have emerged as key regulators in gene expression. In this study, we surveyed the intergenic/intronic lncRNAs in maize B73 leaves at the V7 stage under conditions of N-deficiency and N-sufficiency using ribosomal RNA depletion and ultra-deep total RNA sequencing approaches. By integration with mRNA expression profiles and physiological evaluations, 7245 lncRNAs and 637 nitrogen-responsive lncRNAs were identified that exhibited unique expression patterns. Co-expression network analysis showed that the nitrogen-responsive lncRNAs were enriched mainly in one of the three co-expressed modules. The genes in the enriched module are mainly involved in NADH dehydrogenase activity, oxidative phosphorylation and the nitrogen compounds metabolic process. We identified a large number of lncRNAs in maize and illustrated their potential regulatory roles in response to N stress. The results lay the foundation for further in-depth understanding of the molecular mechanisms of lncRNAs' role in response to nitrogen stresses.

  9. An Integrated Cell Purification and Genomics Strategy Reveals Multiple Regulators of Pancreas Development

    PubMed Central

    Benitez, Cecil M.; Qu, Kun; Sugiyama, Takuya; Pauerstein, Philip T.; Liu, Yinghua; Tsai, Jennifer; Gu, Xueying; Ghodasara, Amar; Arda, H. Efsun; Zhang, Jiajing; Dekker, Joseph D.; Tucker, Haley O.; Chang, Howard Y.; Kim, Seung K.

    2014-01-01

    The regulatory logic underlying global transcriptional programs controlling development of visceral organs like the pancreas remains undiscovered. Here, we profiled gene expression in 12 purified populations of fetal and adult pancreatic epithelial cells representing crucial progenitor cell subsets, and their endocrine or exocrine progeny. Using probabilistic models to decode the general programs organizing gene expression, we identified co-expressed gene sets in cell subsets that revealed patterns and processes governing progenitor cell development, lineage specification, and endocrine cell maturation. Purification of Neurog3 mutant cells and module network analysis linked established regulators such as Neurog3 to unrecognized gene targets and roles in pancreas development. Iterative module network analysis nominated and prioritized transcriptional regulators, including diabetes risk genes. Functional validation of a subset of candidate regulators with corresponding mutant mice revealed that the transcription factors Etv1, Prdm16, Runx1t1 and Bcl11a are essential for pancreas development. Our integrated approach provides a unique framework for identifying regulatory genes and functional gene sets underlying pancreas development and associated diseases such as diabetes mellitus. PMID:25330008

  10. Skin Transcriptomes of common bottlenose dolphins (Tursiops truncatus) from the northern Gulf of Mexico and southeastern U.S. Atlantic coasts.

    PubMed

    Neely, Marion G; Morey, Jeanine S; Anderson, Paul; Balmer, Brian C; Ylitalo, Gina M; Zolman, Eric S; Speakman, Todd R; Sinclair, Carrie; Bachman, Melannie J; Huncik, Kevin; Kucklick, John; Rosel, Patricia E; Mullin, Keith D; Rowles, Teri K; Schwacke, Lori H; Van Dolah, Frances M

    2018-04-01

    Common bottlenose dolphins serve as sentinels for the health of their coastal environments as they are susceptible to health impacts from anthropogenic inputs through both direct exposure and food web magnification. Remote biopsy samples have been widely used to reveal contaminant burdens in free-ranging bottlenose dolphins, but do not address the health consequences of this exposure. To gain insight into whether remote biopsies can also identify health impacts associated with contaminant burdens, we employed RNA sequencing (RNA-seq) to interrogate the transcriptomes of remote skin biopsies from 116 bottlenose dolphins from the northern Gulf of Mexico and southeastern U.S. Atlantic coasts. Gene expression was analyzed using principal component analysis, differential expression testing, and gene co-expression networks, and the results correlated to season, location, and contaminant burden. Season had a significant impact, with over 60% of genes differentially expressed between spring/summer and winter months. Geographic location exhibited lesser effects on the transcriptome, with 23.5% of genes differentially expressed between the northern Gulf of Mexico and the southeastern U.S. Atlantic locations. Despite a large overlap between the seasonal and geographical gene sets, the pathways altered in the observed gene expression profiles were somewhat distinct. Co-regulated gene modules and differential expression analysis both identified epidermal development and cellular architecture pathways to be expressed at lower levels in animals from the northern Gulf of Mexico. Although contaminant burdens measured were not significantly different between regions, some correlation with contaminant loads in individuals was observed among co-expressed gene modules, but these did not include classical detoxification pathways. Instead, this study identified other, possibly downstream pathways, including those involved in cellular architecture, immune response, and oxidative stress, that may prove to be contaminant responsive markers in bottlenose dolphin skin. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  11. Differential Network Analysis Reveals Evolutionary Complexity in Secondary Metabolism of Rauvolfia serpentina over Catharanthus roseus

    PubMed Central

    Pathania, Shivalika; Bagler, Ganesh; Ahuja, Paramvir S.

    2016-01-01

    Comparative co-expression analysis of multiple species using high-throughput data is an integrative approach to determine the uniformity as well as diversification in biological processes. Rauvolfia serpentina and Catharanthus roseus, both members of Apocyanacae family, are reported to have remedial properties against multiple diseases. Despite of sharing upstream of terpenoid indole alkaloid pathway, there is significant diversity in tissue-specific synthesis and accumulation of specialized metabolites in these plants. This led us to implement comparative co-expression network analysis to investigate the modules and genes responsible for differential tissue-specific expression as well as species-specific synthesis of metabolites. Toward these goals differential network analysis was implemented to identify candidate genes responsible for diversification of metabolites profile. Three genes were identified with significant difference in connectivity leading to differential regulatory behavior between these plants. These genes may be responsible for diversification of secondary metabolism, and thereby for species-specific metabolite synthesis. The network robustness of R. serpentina, determined based on topological properties, was also complemented by comparison of gene-metabolite networks of both plants, and may have evolved to have complex metabolic mechanisms as compared to C. roseus under the influence of various stimuli. This study reveals evolution of complexity in secondary metabolism of R. serpentina, and key genes that contribute toward diversification of specific metabolites. PMID:27588023

  12. Differential Network Analysis Reveals Evolutionary Complexity in Secondary Metabolism of Rauvolfia serpentina over Catharanthus roseus.

    PubMed

    Pathania, Shivalika; Bagler, Ganesh; Ahuja, Paramvir S

    2016-01-01

    Comparative co-expression analysis of multiple species using high-throughput data is an integrative approach to determine the uniformity as well as diversification in biological processes. Rauvolfia serpentina and Catharanthus roseus, both members of Apocyanacae family, are reported to have remedial properties against multiple diseases. Despite of sharing upstream of terpenoid indole alkaloid pathway, there is significant diversity in tissue-specific synthesis and accumulation of specialized metabolites in these plants. This led us to implement comparative co-expression network analysis to investigate the modules and genes responsible for differential tissue-specific expression as well as species-specific synthesis of metabolites. Toward these goals differential network analysis was implemented to identify candidate genes responsible for diversification of metabolites profile. Three genes were identified with significant difference in connectivity leading to differential regulatory behavior between these plants. These genes may be responsible for diversification of secondary metabolism, and thereby for species-specific metabolite synthesis. The network robustness of R. serpentina, determined based on topological properties, was also complemented by comparison of gene-metabolite networks of both plants, and may have evolved to have complex metabolic mechanisms as compared to C. roseus under the influence of various stimuli. This study reveals evolution of complexity in secondary metabolism of R. serpentina, and key genes that contribute toward diversification of specific metabolites.

  13. LEAP: constructing gene co-expression networks for single-cell RNA-sequencing data using pseudotime ordering.

    PubMed

    Specht, Alicia T; Li, Jun

    2017-03-01

    To construct gene co-expression networks based on single-cell RNA-Sequencing data, we present an algorithm called LEAP, which utilizes the estimated pseudotime of the cells to find gene co-expression that involves time delay. R package LEAP available on CRAN. jun.li@nd.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  14. Systematic analysis of microarray datasets to identify Parkinson's disease‑associated pathways and genes.

    PubMed

    Feng, Yinling; Wang, Xuefeng

    2017-03-01

    In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.

  15. Annotation of gene function in citrus using gene expression information and co-expression networks

    PubMed Central

    2014-01-01

    Background The genus Citrus encompasses major cultivated plants such as sweet orange, mandarin, lemon and grapefruit, among the world’s most economically important fruit crops. With increasing volumes of transcriptomics data available for these species, Gene Co-expression Network (GCN) analysis is a viable option for predicting gene function at a genome-wide scale. GCN analysis is based on a “guilt-by-association” principle whereby genes encoding proteins involved in similar and/or related biological processes may exhibit similar expression patterns across diverse sets of experimental conditions. While bioinformatics resources such as GCN analysis are widely available for efficient gene function prediction in model plant species including Arabidopsis, soybean and rice, in citrus these tools are not yet developed. Results We have constructed a comprehensive GCN for citrus inferred from 297 publicly available Affymetrix Genechip Citrus Genome microarray datasets, providing gene co-expression relationships at a genome-wide scale (33,000 transcripts). The comprehensive citrus GCN consists of a global GCN (condition-independent) and four condition-dependent GCNs that survey the sweet orange species only, all citrus fruit tissues, all citrus leaf tissues, or stress-exposed plants. All of these GCNs are clustered using genome-wide, gene-centric (guide) and graph clustering algorithms for flexibility of gene function prediction. For each putative cluster, gene ontology (GO) enrichment and gene expression specificity analyses were performed to enhance gene function, expression and regulation pattern prediction. The guide-gene approach was used to infer novel roles of genes involved in disease susceptibility and vitamin C metabolism, and graph-clustering approaches were used to investigate isoprenoid/phenylpropanoid metabolism in citrus peel, and citric acid catabolism via the GABA shunt in citrus fruit. Conclusions Integration of citrus gene co-expression networks, functional enrichment analysis and gene expression information provide opportunities to infer gene function in citrus. We present a publicly accessible tool, Network Inference for Citrus Co-Expression (NICCE, http://citrus.adelaide.edu.au/nicce/home.aspx), for the gene co-expression analysis in citrus. PMID:25023870

  16. Co-expression of five genes in E coli for L-phenylalanine in Brevibacterium flavum

    PubMed Central

    Wu, Yong-Qing; Jiang, Pei-Hong; Fan, Chang-Sheng; Wang, Jian-Gang; Shang, Liang; Huang, Wei-Da

    2003-01-01

    AIM: To study the effect of co-expression of ppsA, pckA, aroG, pheA and tyrB genes on the production of L-phenylalanine, and to construct a genetic engineering strain for L-phenylalanine. METHODS: ppsA and pckA genes were amplified from genomic DNA of E. coli by polymerase chain reaction, and then introduced into shuttle vectors between E coli and Brevibacterium flavum to generate constructs pJN2 and pJN5. pJN2 was generated by inserting ppsA and pckA genes into vector pCZ; whereas pJN5 was obtained by introducing ppsA and pckA genes into pCZ-GAB, which was originally constructed for co-expression of aroG, pheA and tyrB genes. The recombinant plasmids were then introduced into B. flavum by electroporation and the transformants were used for L-phenylalanine fermentation. RESULTS: Compared with the original B. flavum cells, all the transformants were showed to have increased five enzyme activities specifically, and have enhanced L-phenylalanine biosynthesis ability variably. pJN5 transformant was observed to have the highest elevation of L-phenylalanine production by a 3.4-fold. Co-expression of ppsA and pckA increased activity of DAHP synthetase significantly. CONCLUSION: Co-expression of ppsA and pckA genes in B. flavum could remarkably increase the expression of DAHP synthetase; Co-expression of ppsA, pckA, aroG, pheA and tyrB of E. coli in B. flavum was a feasible approach to construct a strain for phenylalanine production. PMID:12532463

  17. Systems analysis of transcriptome data provides new hypotheses about Arabidopsis root response to nitrate treatments

    PubMed Central

    Canales, Javier; Moyano, Tomás C.; Villarroel, Eva; Gutiérrez, Rodrigo A.

    2014-01-01

    Nitrogen (N) is an essential macronutrient for plant growth and development. Plants adapt to changes in N availability partly by changes in global gene expression. We integrated publicly available root microarray data under contrasting nitrate conditions to identify new genes and functions important for adaptive nitrate responses in Arabidopsis thaliana roots. Overall, more than 2000 genes exhibited changes in expression in response to nitrate treatments in Arabidopsis thaliana root organs. Global regulation of gene expression by nitrate depends largely on the experimental context. However, despite significant differences from experiment to experiment in the identity of regulated genes, there is a robust nitrate response of specific biological functions. Integrative gene network analysis uncovered relationships between nitrate-responsive genes and 11 highly co-expressed gene clusters (modules). Four of these gene network modules have robust nitrate responsive functions such as transport, signaling, and metabolism. Network analysis hypothesized G2-like transcription factors are key regulatory factors controlling transport and signaling functions. Our meta-analysis highlights the role of biological processes not studied before in the context of the nitrate response such as root hair development and provides testable hypothesis to advance our understanding of nitrate responses in plants. PMID:24570678

  18. Gene Co-Expression Analysis Inferring the Crosstalk of Ethylene and Gibberellin in Modulating the Transcriptional Acclimation of Cassava Root Growth in Different Seasons

    PubMed Central

    Saithong, Treenut; Saerue, Samorn; Kalapanulak, Saowalak; Sojikul, Punchapat; Narangajavana, Jarunya; Bhumiratana, Sakarindr

    2015-01-01

    Cassava is a crop of hope for the 21st century. Great advantages of cassava over other crops are not only the capacity of carbohydrates, but it is also an easily grown crop with fast development. As a plant which is highly tolerant to a poor environment, cassava has been believed to own an effective acclimation process, an intelligent mechanism behind its survival and sustainability in a wide range of climates. Herein, we aimed to investigate the transcriptional regulation underlying the adaptive development of a cassava root to different seasonal cultivation climates. Gene co-expression analysis suggests that AP2-EREBP transcription factor (ERF1) orthologue (D142) played a pivotal role in regulating the cellular response to exposing to wet and dry seasons. The ERF shows crosstalk with gibberellin, via ent-Kaurene synthase (D106), in the transcriptional regulatory network that was proposed to modulate the downstream regulatory system through a distinct signaling mechanism. While sulfur assimilation is likely to be a signaling regulation for dry crop growth response, calmodulin-binding protein is responsible for regulation in the wet crop. With our initiative study, we hope that our findings will pave the way towards sustainability of cassava production under various kinds of stress considering the future global climate change. PMID:26366737

  19. Gene Co-Expression Analysis Inferring the Crosstalk of Ethylene and Gibberellin in Modulating the Transcriptional Acclimation of Cassava Root Growth in Different Seasons.

    PubMed

    Saithong, Treenut; Saerue, Samorn; Kalapanulak, Saowalak; Sojikul, Punchapat; Narangajavana, Jarunya; Bhumiratana, Sakarindr

    2015-01-01

    Cassava is a crop of hope for the 21st century. Great advantages of cassava over other crops are not only the capacity of carbohydrates, but it is also an easily grown crop with fast development. As a plant which is highly tolerant to a poor environment, cassava has been believed to own an effective acclimation process, an intelligent mechanism behind its survival and sustainability in a wide range of climates. Herein, we aimed to investigate the transcriptional regulation underlying the adaptive development of a cassava root to different seasonal cultivation climates. Gene co-expression analysis suggests that AP2-EREBP transcription factor (ERF1) orthologue (D142) played a pivotal role in regulating the cellular response to exposing to wet and dry seasons. The ERF shows crosstalk with gibberellin, via ent-Kaurene synthase (D106), in the transcriptional regulatory network that was proposed to modulate the downstream regulatory system through a distinct signaling mechanism. While sulfur assimilation is likely to be a signaling regulation for dry crop growth response, calmodulin-binding protein is responsible for regulation in the wet crop. With our initiative study, we hope that our findings will pave the way towards sustainability of cassava production under various kinds of stress considering the future global climate change.

  20. Evolution of herbivore-induced early defense signaling was shaped by genome-wide duplications in Nicotiana

    PubMed Central

    Zhou, Wenwu; Brockmöller, Thomas; Ling, Zhihao; Omdahl, Ashton; Baldwin, Ian T; Xu, Shuqing

    2016-01-01

    Herbivore-induced defenses are widespread, rapidly evolving and relevant for plant fitness. Such induced defenses are often mediated by early defense signaling (EDS) rapidly activated by the perception of herbivore associated elicitors (HAE) that includes transient accumulations of jasmonic acid (JA). Analyzing 60 HAE-induced leaf transcriptomes from closely-related Nicotiana species revealed a key gene co-expression network (M4 module) which is co-activated with the HAE-induced JA accumulations but is elicited independently of JA, as revealed in plants silenced in JA signaling. Functional annotations of the M4 module were consistent with roles in EDS and a newly identified hub gene of the M4 module (NaLRRK1) mediates a negative feedback loop with JA signaling. Phylogenomic analysis revealed preferential gene retention after genome-wide duplications shaped the evolution of HAE-induced EDS in Nicotiana. These results highlight the importance of genome-wide duplications in the evolution of adaptive traits in plants. DOI: http://dx.doi.org/10.7554/eLife.19531.001 PMID:27813478

  1. Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis

    PubMed Central

    Grassi, Elena; Damasco, Christian; Silengo, Lorenzo; Oti, Martin; Provero, Paolo; Di Cunto, Ferdinando

    2008-01-01

    Background Even in the post-genomic era, the identification of candidate genes within loci associated with human genetic diseases is a very demanding task, because the critical region may typically contain hundreds of positional candidates. Since genes implicated in similar phenotypes tend to share very similar expression profiles, high throughput gene expression data may represent a very important resource to identify the best candidates for sequencing. However, so far, gene coexpression has not been used very successfully to prioritize positional candidates. Methodology/Principal Findings We show that it is possible to reliably identify disease-relevant relationships among genes from massive microarray datasets by concentrating only on genes sharing similar expression profiles in both human and mouse. Moreover, we show systematically that the integration of human-mouse conserved coexpression with a phenotype similarity map allows the efficient identification of disease genes in large genomic regions. Finally, using this approach on 850 OMIM loci characterized by an unknown molecular basis, we propose high-probability candidates for 81 genetic diseases. Conclusion Our results demonstrate that conserved coexpression, even at the human-mouse phylogenetic distance, represents a very strong criterion to predict disease-relevant relationships among human genes. PMID:18369433

  2. Maximizing capture of gene co-expression relationships through pre-clustering of input expression samples: an Arabidopsis case study.

    PubMed

    Feltus, F Alex; Ficklin, Stephen P; Gibson, Scott M; Smith, Melissa C

    2013-06-05

    In genomics, highly relevant gene interaction (co-expression) networks have been constructed by finding significant pair-wise correlations between genes in expression datasets. These networks are then mined to elucidate biological function at the polygenic level. In some cases networks may be constructed from input samples that measure gene expression under a variety of different conditions, such as for different genotypes, environments, disease states and tissues. When large sets of samples are obtained from public repositories it is often unmanageable to associate samples into condition-specific groups, and combining samples from various conditions has a negative effect on network size. A fixed significance threshold is often applied also limiting the size of the final network. Therefore, we propose pre-clustering of input expression samples to approximate condition-specific grouping of samples and individual network construction of each group as a means for dynamic significance thresholding. The net effect is increase sensitivity thus maximizing the total co-expression relationships in the final co-expression network compendium. A total of 86 Arabidopsis thaliana co-expression networks were constructed after k-means partitioning of 7,105 publicly available ATH1 Affymetrix microarray samples. We term each pre-sorted network a Gene Interaction Layer (GIL). Random Matrix Theory (RMT), an un-supervised thresholding method, was used to threshold each of the 86 networks independently, effectively providing a dynamic (non-global) threshold for the network. The overall gene count across all GILs reached 19,588 genes (94.7% measured gene coverage) and 558,022 unique co-expression relationships. In comparison, network construction without pre-sorting of input samples yielded only 3,297 genes (15.9%) and 129,134 relationships. in the global network. Here we show that pre-clustering of microarray samples helps approximate condition-specific networks and allows for dynamic thresholding using un-supervised methods. Because RMT ensures only highly significant interactions are kept, the GIL compendium consists of 558,022 unique high quality A. thaliana co-expression relationships across almost all of the measurable genes on the ATH1 array. For A. thaliana, these networks represent the largest compendium to date of significant gene co-expression relationships, and are a means to explore complex pathway, polygenic, and pleiotropic relationships for this focal model plant. The networks can be explored at sysbio.genome.clemson.edu. Finally, this method is applicable to any large expression profile collection for any organism and is best suited where a knowledge-independent network construction method is desired.

  3. Maximizing capture of gene co-expression relationships through pre-clustering of input expression samples: an Arabidopsis case study

    PubMed Central

    2013-01-01

    Background In genomics, highly relevant gene interaction (co-expression) networks have been constructed by finding significant pair-wise correlations between genes in expression datasets. These networks are then mined to elucidate biological function at the polygenic level. In some cases networks may be constructed from input samples that measure gene expression under a variety of different conditions, such as for different genotypes, environments, disease states and tissues. When large sets of samples are obtained from public repositories it is often unmanageable to associate samples into condition-specific groups, and combining samples from various conditions has a negative effect on network size. A fixed significance threshold is often applied also limiting the size of the final network. Therefore, we propose pre-clustering of input expression samples to approximate condition-specific grouping of samples and individual network construction of each group as a means for dynamic significance thresholding. The net effect is increase sensitivity thus maximizing the total co-expression relationships in the final co-expression network compendium. Results A total of 86 Arabidopsis thaliana co-expression networks were constructed after k-means partitioning of 7,105 publicly available ATH1 Affymetrix microarray samples. We term each pre-sorted network a Gene Interaction Layer (GIL). Random Matrix Theory (RMT), an un-supervised thresholding method, was used to threshold each of the 86 networks independently, effectively providing a dynamic (non-global) threshold for the network. The overall gene count across all GILs reached 19,588 genes (94.7% measured gene coverage) and 558,022 unique co-expression relationships. In comparison, network construction without pre-sorting of input samples yielded only 3,297 genes (15.9%) and 129,134 relationships. in the global network. Conclusions Here we show that pre-clustering of microarray samples helps approximate condition-specific networks and allows for dynamic thresholding using un-supervised methods. Because RMT ensures only highly significant interactions are kept, the GIL compendium consists of 558,022 unique high quality A. thaliana co-expression relationships across almost all of the measurable genes on the ATH1 array. For A. thaliana, these networks represent the largest compendium to date of significant gene co-expression relationships, and are a means to explore complex pathway, polygenic, and pleiotropic relationships for this focal model plant. The networks can be explored at sysbio.genome.clemson.edu. Finally, this method is applicable to any large expression profile collection for any organism and is best suited where a knowledge-independent network construction method is desired. PMID:23738693

  4. ARNetMiT R Package: association rules based gene co-expression networks of miRNA targets.

    PubMed

    Özgür Cingiz, M; Biricik, G; Diri, B

    2017-03-31

    miRNAs are key regulators that bind to target genes to suppress their gene expression level. The relations between miRNA-target genes enable users to derive co-expressed genes that may be involved in similar biological processes and functions in cells. We hypothesize that target genes of miRNAs are co-expressed, when they are regulated by multiple miRNAs. With the usage of these co-expressed genes, we can theoretically construct co-expression networks (GCNs) related to 152 diseases. In this study, we introduce ARNetMiT that utilize a hash based association rule algorithm in a novel way to infer the GCNs on miRNA-target genes data. We also present R package of ARNetMiT, which infers and visualizes GCNs of diseases that are selected by users. Our approach assumes miRNAs as transactions and target genes as their items. Support and confidence values are used to prune association rules on miRNA-target genes data to construct support based GCNs (sGCNs) along with support and confidence based GCNs (scGCNs). We use overlap analysis and the topological features for the performance analysis of GCNs. We also infer GCNs with popular GNI algorithms for comparison with the GCNs of ARNetMiT. Overlap analysis results show that ARNetMiT outperforms the compared GNI algorithms. We see that using high confidence values in scGCNs increase the ratio of the overlapped gene-gene interactions between the compared methods. According to the evaluation of the topological features of ARNetMiT based GCNs, the degrees of nodes have power-law distribution. The hub genes discovered by ARNetMiT based GCNs are consistent with the literature.

  5. Stem cell-associated genes are extremely poor prognostic factors for soft-tissue sarcoma patients.

    PubMed

    Taubert, H; Würl, P; Greither, T; Kappler, M; Bache, M; Bartel, F; Kehlen, A; Lautenschläger, C; Harris, L C; Kaushal, D; Füssel, S; Meye, A; Böhnke, A; Schmidt, H; Holzhausen, H-J; Hauptmann, S

    2007-11-01

    Cancer stem cells can play an important role in tumorigenesis and tumor progression. However, it is still difficult to detect and isolate cancer stem cells. An alternative approach is to analyse stem cell-associated gene expression. We investigated the coexpression of three stem cell-associated genes, Hiwi, hTERT and survivin, by quantitative real-time-PCR in 104 primary soft-tissue sarcomas (STS). Multivariate Cox's proportional hazards regression analyses allowed correlating gene expression with overall survival for STS patients. Coexpression of all three stem cell-associated genes resulted in a significantly increased risk of tumor-related death. Importantly, tumors of patients with the poorest prognosis were of all four tumor stages, suggesting that their risk is based upon coexpression of stem cell-associated genes rather than on tumor stage.

  6. The Use of RNA Sequencing and Correlation Network Analysis to Study Potential Regulators of Crabapple Leaf Color Transformation.

    PubMed

    Yang, Tuo; Li, Keting; Hao, Suxiao; Zhang, Jie; Song, Tingting; Tian, Ji; Yao, Yuncong

    2018-05-01

    Anthocyanins are plant pigments that contribute to the color of leaves, flowers and fruits, and that are beneficial to human health in the form of dietary antioxidants. The study of a transformable crabapple cultivar, 'India magic', which has red buds and green mature leaves, using mRNA profiling of four leaf developmental stages, allowed us to characterize molecular mechanisms regulating red color formation in early leaf development and the subsequent rapid down-regulation of anthocyanin biosynthesis. This analysis of differential gene expression during leaf development revealed that ethylene signaling-responsive genes are up-regulated during leaf pigmentation. Genes in the ethylene response factor (ERF), SPL, NAC, WRKY and MADS-box transcription factor (TF) families were identified in two weighted gene co-expression network analysis (WGCNA) modules as having a close relationship to anthocyanin accumulation. Analyses of network hub genes indicated that SPL TFs are located in central positions within anthocyanin-related modules. Furthermore, cis-motif and yeast one-hybrid assays suggested that several anthocyanin biosynthetic or regulatory genes are potential targets of SPL8 and SPL13B. Transient silencing of these two genes confirmed that they play a role in co-ordinating anthocyanin biosynthesis and crabapple leaf development. We present a high-resolution method for identifying regulatory modules associated with leaf pigmentation, which provides a platform for functional genomic studies of anthocyanin biosynthesis.

  7. Identification of crucial genes related to postmenopausal osteoporosis using gene expression profiling.

    PubMed

    Ma, Min; Chen, Xiaofei; Lu, Liangyu; Yuan, Feng; Zeng, Wen; Luo, Shulin; Yin, Feng; Cai, Junfeng

    2016-12-01

    Postmenopausal osteoporosis is a common bone disease and characterized by low bone mineral density. This study aimed to reveal key genes associated with postmenopausal osteoporosis (PMO), and provide a theoretical basis for subsequent experiments. The dataset GSE7429 was obtained from Gene Expression Omnibus. A total of 20 B cell samples (ten ones, respectively from postmenopausal women with low or high bone mineral density (BMD) were included in this dataset. Following screening of differentially expressed genes (DEGs), coexpression analysis of all genes was performed, and key genes in the coexpression network were screened using the random walk algorithm. Afterwards, functional and pathway analyses were conducted. Additionally, protein-protein interactions (PPIs) between DEGs and key genes were analyzed. A set of 308 DEGs (170 up-regulated ones and 138 down-regulated ones) between low BMD and high BMD samples were identified, and 101 key genes in the coexpression network were screened out. In the coexpression network, some genes had a higher score and degree, such as CSTA. The key genes in the coexpression network were mainly enriched in GO terms of the defense response (e.g., SERPINA1 and CST3), immune response (e.g., IL32 and CLEC7A); while, the DEGs were mainly enriched in structural constituent of cytoskeleton (e.g., CYLC2 and TUBA1B) and membrane-enclosed lumen (e.g., CCNE1 and INTS5). In the PPI network, CCNE1 interacted with REL; and TUBA1B interacted with ESR1. A series of interactions, such as CSTA/TYROBP, CCNE1/REL and TUBA1B/ESR1 might play pivotal roles in the occurrence and development of PMO.

  8. Genetic architecture of wood properties based on association analysis and co-expression networks in white spruce.

    PubMed

    Lamara, Mebarek; Raherison, Elie; Lenz, Patrick; Beaulieu, Jean; Bousquet, Jean; MacKay, John

    2016-04-01

    Association studies are widely utilized to analyze complex traits but their ability to disclose genetic architectures is often limited by statistical constraints, and functional insights are usually minimal in nonmodel organisms like forest trees. We developed an approach to integrate association mapping results with co-expression networks. We tested single nucleotide polymorphisms (SNPs) in 2652 candidate genes for statistical associations with wood density, stiffness, microfibril angle and ring width in a population of 1694 white spruce trees (Picea glauca). Associations mapping identified 229-292 genes per wood trait using a statistical significance level of P < 0.05 to maximize discovery. Over-representation of genes associated for nearly all traits was found in a xylem preferential co-expression group developed in independent experiments. A xylem co-expression network was reconstructed with 180 wood associated genes and several known MYB and NAC regulators were identified as network hubs. The network revealed a link between the gene PgNAC8, wood stiffness and microfibril angle, as well as considerable within-season variation for both genetic control of wood traits and gene expression. Trait associations were distributed throughout the network suggesting complex interactions and pleiotropic effects. Our findings indicate that integration of association mapping and co-expression networks enhances our understanding of complex wood traits. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  9. Combined SOM-portrayal of gene expression and DNA methylation landscapes disentangles modes of epigenetic regulation in glioblastoma.

    PubMed

    Hopp, Lydia; Löffler-Wirth, Henry; Galle, Jörg; Binder, Hans

    2018-06-11

    We present here a novel method that enables unraveling the interplay between gene expression and DNA methylation in complex diseases such as cancer. The method is based on self-organizing maps and allows for analysis of data landscapes from 'governed by methylation' to 'governed by expression'. We identified regulatory modules of coexpressed and comethylated genes in high-grade gliomas: two modes are governed by genes hypermethylated and underexpressed in IDH-mutated cases, while two other modes reflect immune and stromal signatures in the classical and mesenchymal subtypes. A fifth mode with proneural characteristics comprises genes of repressed and poised chromatin states active in healthy brain. Two additional modes enrich genes either in active or repressed chromatin states. The method disentangles the interplay between gene expression and methylation. It has the potential to integrate also mutation and copy number data and to apply to large sample cohorts.

  10. 151. Bromocriptine Challenge Affects Working Memory Processing in Humans Depending on DRD2-Related Genes

    PubMed Central

    Pergola, Giulio; Selvaggi, Pierluigi; Gelao, Barbara; Di Carlo, Pasquale; Nettis, Maria Antonietta; Amico, Graziella; Felici, Valentina; Fazio, Leonardo; Rampino, Antonio; Sambataro, Fabio; Blasi, Giuseppe; Bertolino, Alessandro

    2017-01-01

    Abstract Background: Dopamine D2 receptors (D2R) contribute to the inverted-U shaped relationship between dopamine dorsolateral prefrontal cortex (DLPFC) and working memory (WM). Genetic variation in DRD2 coding for D2Rs modulates D2 signaling, but other genes in its pathway may be involved. In a previous work, using gene co-expression networks we identified 84 partner genes coregulated with DRD2 and eight single nucleotide polymorphisms (SNPs) predicting coexpression of the whole gene set in the human DLPFC [1]. These SNPs combined into a polygenic coexpression index (PCI) predicted WM performance and DLPFC activity in two independent samples of living healthy humans [1]. Here, we asked whether response to D2R targeting drugs is associated with this PCI. Thus, we investigated the interaction between WM behavioral/brain response to the D2R agonist Bromocriptine (BRO) and the PCI. [1] Pergola G, Di Carlo P, et al. (In press). Translational Psychiatry. Methods: Fifty healthy volunteers entered a double-blind, crossover, randomized, placebo-controlled fMRI study with BRO 1.25 mg and performed the N-Back WM task during the fMRI scanning session. We computed the PCI for all participants and investigated its association with WM-related behavior and brain activity using general linear models. Results: A PCI by drug interaction was significant on both DLPFC signal (right BA46, 242 voxels, F(1, 48) = 24; right BA9, 177 voxels, F(1, 48) = 19; P < .05 cluster-level FWE corrected) and behavioral scores, F(1, 46) = 4.6, P = .045, using a U-shaped quadratic model. The U-shaped relationship between the PCI and WM processing found on placebo was reversed on BRO. Furthermore, the increase in behavioral performance on BRO correlated with a decrease in BA46 activity, t(48) = −2.0, P = .049). Conclusion: The combined effect of multiple alleles on DRD2 coexpression covaried with drug response such that different allelic patterns were associated with similar responses, as in the inverted U-shaped model of WM. Thus, multiple genes and multiple allelic patterns are implicated in the inverted U-shaped dopamine/WM relationship. This relationship is reversed when individuals are administered BRO, suggesting that brain and behavioral response to this pharmacological challenge depends on a pleiotropic individual genetic background. Hence, pharmacogenomics in schizophrenia should take into account allelic patterns associated with molecular phenomena such as gene expression to predict drug response.

  11. A Risk Stratification Model for Lung Cancer Based on Gene Coexpression Network and Deep Learning

    PubMed Central

    2018-01-01

    Risk stratification model for lung cancer with gene expression profile is of great interest. Instead of previous models based on individual prognostic genes, we aimed to develop a novel system-level risk stratification model for lung adenocarcinoma based on gene coexpression network. Using multiple microarray, gene coexpression network analysis was performed to identify survival-related networks. A deep learning based risk stratification model was constructed with representative genes of these networks. The model was validated in two test sets. Survival analysis was performed using the output of the model to evaluate whether it could predict patients' survival independent of clinicopathological variables. Five networks were significantly associated with patients' survival. Considering prognostic significance and representativeness, genes of the two survival-related networks were selected for input of the model. The output of the model was significantly associated with patients' survival in two test sets and training set (p < 0.00001, p < 0.0001 and p = 0.02 for training and test sets 1 and 2, resp.). In multivariate analyses, the model was associated with patients' prognosis independent of other clinicopathological features. Our study presents a new perspective on incorporating gene coexpression networks into the gene expression signature and clinical application of deep learning in genomic data science for prognosis prediction. PMID:29581968

  12. Utility and Limitations of Using Gene Expression Data to Identify Functional Associations

    PubMed Central

    Peng, Cheng; Shiu, Shin-Han

    2016-01-01

    Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets. PMID:27935950

  13. Genes and gene networks implicated in aggression related behaviour.

    PubMed

    Malki, Karim; Pain, Oliver; Du Rietz, Ebba; Tosto, Maria Grazia; Paya-Cano, Jose; Sandnabba, Kenneth N; de Boer, Sietse; Schalkwyk, Leonard C; Sluyter, Frans

    2014-10-01

    Aggressive behaviour is a major cause of mortality and morbidity. Despite of moderate heritability estimates, progress in identifying the genetic factors underlying aggressive behaviour has been limited. There are currently three genetic mouse models of high and low aggression created using selective breeding. This is the first study to offer a global transcriptomic characterization of the prefrontal cortex across all three genetic mouse models of aggression. A systems biology approach has been applied to transcriptomic data across the three pairs of selected inbred mouse strains (Turku Aggressive (TA) and Turku Non-Aggressive (TNA), Short Attack Latency (SAL) and Long Attack Latency (LAL) mice and North Carolina Aggressive (NC900) and North Carolina Non-Aggressive (NC100)), providing novel insight into the neurobiological mechanisms and genetics underlying aggression. First, weighted gene co-expression network analysis (WGCNA) was performed to identify modules of highly correlated genes associated with aggression. Probe sets belonging to gene modules uncovered by WGCNA were carried forward for network analysis using ingenuity pathway analysis (IPA). The RankProd non-parametric algorithm was then used to statistically evaluate expression differences across the genes belonging to modules significantly associated with aggression. IPA uncovered two pathways, involving NF-kB and MAPKs. The secondary RankProd analysis yielded 14 differentially expressed genes, some of which have previously been implicated in pathways associated with aggressive behaviour, such as Adrbk2. The results highlighted plausible candidate genes and gene networks implicated in aggression-related behaviour.

  14. Structural covariance networks are coupled to expression of genes enriched in supragranular layers of the human cortex.

    PubMed

    Romero-Garcia, Rafael; Whitaker, Kirstie J; Váša, František; Seidlitz, Jakob; Shinn, Maxwell; Fonagy, Peter; Dolan, Raymond J; Jones, Peter B; Goodyer, Ian M; Bullmore, Edward T; Vértes, Petra E

    2018-05-01

    Complex network topology is characteristic of many biological systems, including anatomical and functional brain networks (connectomes). Here, we first constructed a structural covariance network from MRI measures of cortical thickness on 296 healthy volunteers, aged 14-24 years. Next, we designed a new algorithm for matching sample locations from the Allen Brain Atlas to the nodes of the SCN. Subsequently we used this to define, transcriptomic brain networks by estimating gene co-expression between pairs of cortical regions. Finally, we explored the hypothesis that transcriptional networks and structural MRI connectomes are coupled. A transcriptional brain network (TBN) and a structural covariance network (SCN) were correlated across connection weights and showed qualitatively similar complex topological properties: assortativity, small-worldness, modularity, and a rich-club. In both networks, the weight of an edge was inversely related to the anatomical (Euclidean) distance between regions. There were differences between networks in degree and distance distributions: the transcriptional network had a less fat-tailed degree distribution and a less positively skewed distance distribution than the SCN. However, cortical areas connected to each other within modules of the SCN had significantly higher levels of whole genome co-expression than expected by chance. Nodes connected in the SCN had especially high levels of expression and co-expression of a human supragranular enriched (HSE) gene set that has been specifically located to supragranular layers of human cerebral cortex and is known to be important for large-scale, long-distance cortico-cortical connectivity. This coupling of brain transcriptome and connectome topologies was largely but not entirely accounted for by the common constraint of physical distance on both networks. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Physiological Responses and Gene Co-Expression Network of Mycorrhizal Roots under K+ Deprivation1[OPEN

    PubMed Central

    Roy, Sushmita

    2017-01-01

    Arbuscular mycorrhizal (AM) associations enhance the phosphorous and nitrogen nutrition of host plants, but little is known about their role in potassium (K+) nutrition. Medicago truncatula plants were cocultured with the AM fungus Rhizophagus irregularis under high and low K+ regimes for 6 weeks. We determined how K+ deprivation affects plant development and mineral acquisition and how these negative effects are tempered by the AM colonization. The transcriptional response of AM roots under K+ deficiency was analyzed by whole-genome RNA sequencing. K+ deprivation decreased root biomass and external K+ uptake and modulated oxidative stress gene expression in M. truncatula roots. AM colonization induced specific transcriptional responses to K+ deprivation that seem to temper these negative effects. A gene network analysis revealed putative key regulators of these responses. This study confirmed that AM associations provide some tolerance to K+ deprivation to host plants, revealed that AM symbiosis modulates the expression of specific root genes to cope with this nutrient stress, and identified putative regulators participating in these tolerance mechanisms. PMID:28159827

  16. Network-based approach to identify prognostic biomarkers for estrogen receptor-positive breast cancer treatment with tamoxifen.

    PubMed

    Liu, Rong; Guo, Cheng-Xian; Zhou, Hong-Hao

    2015-01-01

    This study aims to identify effective gene networks and prognostic biomarkers associated with estrogen receptor positive (ER+) breast cancer using human mRNA studies. Weighted gene coexpression network analysis was performed with a complex ER+ breast cancer transcriptome to investigate the function of networks and key genes in the prognosis of breast cancer. We found a significant correlation of an expression module with distant metastasis-free survival (HR = 2.25; 95% CI .21.03-4.88 in discovery set; HR = 1.78; 95% CI = 1.07-2.93 in validation set). This module contained genes enriched in the biological process of the M phase. From this module, we further identified and validated 5 hub genes (CDK1, DLGAP5, MELK, NUSAP1, and RRM2), the expression levels of which were strongly associated with poor survival. Highly expressed MELK indicated poor survival in luminal A and luminal B breast cancer molecular subtypes. This gene was also found to be associated with tamoxifen resistance. Results indicated that a network-based approach may facilitate the discovery of biomarkers for the prognosis of ER+ breast cancer and may also be used as a basis for establishing personalized therapies. Nevertheless, before the application of this approach in clinical settings, in vivo and in vitro experiments and multi-center randomized controlled clinical trials are still needed.

  17. An intersection network based on combining SNP co-association and RNA co-expression networks for feed utilization traits in Japanese Black cattle.

    PubMed

    Okada, D; Endo, S; Matsuda, H; Ogawa, S; Taniguchi, Y; Katsuta, T; Watanabe, T; Iwaisaki, H

    2018-05-12

    Genome-wide association studies (GWAS) of quantitative traits have detected numerous genetic associations, but they encounter difficulties in pinpointing prominent candidate genes and inferring gene networks. The present study used a systems genetics approach integrating GWAS results with external RNA-expression data to detect candidate gene networks in feed utilization and growth traits of Japanese Black cattle, which are matters of concern. A SNP co-association network was derived from significant correlations between SNPs with effects estimated by GWAS across seven phenotypic traits. The resulting network genes contained significant numbers of annotations related to the traits. Using bovine transcriptome data from a public database, an RNA co-expression network was inferred based on the similarity of expression patterns across different tissues. An intersection network was then generated by superimposing the SNP and RNA networks and extracting shared interactions. This intersection network contained four tissue-specific modules: nervous system, reproductive system, muscular system, and glands. To characterize the structure (topographical properties) of the three networks, their scale-free properties were evaluated, which revealed that the intersection network was the most scale-free. In the sub-network containing the most connected transcription factors (URI1, ROCK2 and ETV6), most genes were widely expressed across tissues, and genes previously shown to be involved in the traits were found. Results indicated that the current approach might be used to construct a gene network that better reflects biological information, providing encouragement for the genetic dissection of economically important quantitative traits.

  18. A network approach of gene co-expression in the zea mays/Aspergillus flavus pathosystem to map host/pathogen interaction pathways

    USDA-ARS?s Scientific Manuscript database

    A gene co-expression network was generated using a dual RNA-seq study with the fungal pathogen A. flavus and its plant host Z. mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network reveal...

  19. Pathway-related modules involved in the application of sevoflurane or propofol in off-pump coronary artery bypass graft surgery.

    PubMed

    Bu, Xiangmei; Wang, Bo; Wang, Yaoqi; Wang, Zhigang; Gong, Chunzhi; Qi, Feng; Zhang, Caixia

    2017-07-01

    Off-pump coronary artery bypass graft (CABG) surgery has recently emerged as a means to avoid the sequelae of extracorporeal circulation, including the whole-body inflammatory response, coagulation disorders and multiple organ dysfunction. At present, gas anesthesia, sevoflurane and intravenous anesthesia and propofol have been widely used during the CABG. To further understand the underlying mechanisms of these anesthetics on the gene level, the present study conducted pathway-related module analysis based on a co-expression network. This was performed in order to identify significant pathways in coronary artery disease patients who had undergone off-pump CABG surgery before and after applying sevoflurane or propofol. A total of 269 and 129 differentially expressed genes were obtained in the sevoflurane and propofol groups, respectively. In total, eight and seven pathways (P<0.05) in the sevoflurane and propofol groups were separately obtained via Kyoto Encyclopedia of Genes and Genome pathway analysis. Finally, eight and seven pathway-related modules in the sevoflurane and propofol groups were obtained, respectively. Furthermore, the mean degree of complement and coagulation cascades pathway-related module in both of the groups was the highest. It was predicted that during the CABG, the anesthetics might activate the complement and coagulation systems in order to possess some cardioprotective properties.

  20. Cis-Natural Antisense Transcripts Are Mainly Co-expressed with Their Sense Transcripts and Primarily Related to Energy Metabolic Pathways during Muscle Development.

    PubMed

    Zhao, Yunxia; Hou, Ye; Zhao, Changzhi; Liu, Fei; Luan, Yu; Jing, Lu; Li, Xinyun; Zhu, Mengjin; Zhao, Shuhong

    2016-01-01

    Cis-natural antisense transcripts (cis-NATs) are a new class of RNAs identified in various species. However, the biological functions of cis-NATs are largely unknown. In this study, we investigated the transcriptional characteristics and functions of cis-NATs in the muscle tissue of lean Landrace and indigenous fatty Lantang pigs. In total, 3,306 cis-NATs of 2,469 annotated genes were identified in the muscle tissue of pigs. More than 1,300 cis-NATs correlated with their sense genes at the transcriptional level, and approximately 80% of them were co-expressed in the two breeds. Furthermore, over 1,200 differentially expressed cis-NATs were identified during muscle development. Function annotation showed that the cis-NATs participated in muscle development mainly by co-expressing with genes involved in energy metabolic pathways, including citrate cycle (TCA cycle), glycolysis or gluconeogenesis, mitochondrial activation and so on. Moreover, these cis-NATs and their sense genes abruptly increased at the transition from the late fetal stages to the early postnatal stages and then decreased along with muscle development. In conclusion, the cis-NATs in the muscle tissue of pigs were identified and determined to be mainly co-expressed with their sense genes. The co-expressed cis-NATs and their sense gene were primarily related to energy metabolic pathways during muscle development in pigs. Our results offered novel evidence on the roles of cis-NATs during the muscle development of pigs.

  1. Modulation of gene expression and cell-cycle signaling pathways by the EGFR inhibitor gefitinib (Iressa) in rat urinary bladder cancer.

    PubMed

    Lu, Yan; Liu, Pengyuan; Van den Bergh, Francoise; Zellmer, Victoria; James, Michael; Wen, Weidong; Grubbs, Clinton J; Lubet, Ronald A; You, Ming

    2012-02-01

    The epidermal growth factor receptor inhibitor Iressa has shown strong preventive efficacy in the N-butyl-N-(4-hydroxybutyl)-nitrosamine (OH-BBN) model of bladder cancer in the rat. To explore its antitumor mechanism, we implemented a systems biology approach to characterize gene expression and signaling pathways in rat urinary bladder cancers treated with Iressa. Eleven bladder tumors from control rats, seven tumors from rats treated with Iressa, and seven normal bladder epithelia were profiled by the Affymetrix Rat Exon 1.0 ST Arrays. We identified 713 downregulated and 641 upregulated genes in comparing bladder tumors versus normal bladder epithelia. In addition, 178 genes were downregulated and 96 genes were upregulated when comparing control tumors versus Iressa-treated tumors. Two coexpression modules that were significantly correlated with tumor status and treatment status were identified [r = 0.70, P = 2.80 × 10(-15) (bladder tumor vs. normal bladder epithelium) and r = 0.63, P = 2.00 × 10(-42) (Iressa-treated tumor vs. control tumor), respectively]. Both tumor module and treatment module were enriched for genes involved in cell-cycle processes. Twenty-four and twenty-one highly connected hub genes likely to be key drivers in cell cycle were identified in the tumor module and treatment module, respectively. Analysis of microRNA genes on the array chips showed that tumor module and treatment module were significantly associated with expression levels of let-7c (r = 0.54, P = 3.70 × 10(-8) and r = 0.73, P = 1.50 × 10(-65), respectively). These results suggest that let-7c downregulation and its regulated cell-cycle pathway may play an integral role in governing bladder tumor suppression or collaborative oncogenesis and that Iressa exhibits its preventive efficacy on bladder tumorigenesis by upregulating let-7 and inhibiting the cell cycle. Cell culture study confirmed that the increased expression of let-7c decreases Iressa-treated bladder tumor cell growth. The identified hub genes may also serve as pharmacodynamic or efficacy biomarkers in clinical trials of chemoprevention in human bladder cancer. ©2011 AACR.

  2. The Functional Network of the Arabidopsis Plastoglobule Proteome Based on Quantitative Proteomics and Genome-Wide Coexpression Analysis1[C][W][OA

    PubMed Central

    Lundquist, Peter K.; Poliakov, Anton; Bhuiyan, Nazmul H.; Zybailov, Boris; Sun, Qi; van Wijk, Klaas J.

    2012-01-01

    Plastoglobules (PGs) in chloroplasts are thylakoid-associated monolayer lipoprotein particles containing prenyl and neutral lipids and several dozen proteins mostly with unknown functions. An integrated view of the role of the PG is lacking. Here, we better define the PG proteome and provide a conceptual framework for further studies. The PG proteome from Arabidopsis (Arabidopsis thaliana) leaf chloroplasts was determined by mass spectrometry of isolated PGs and quantitative comparison with the proteomes of unfractionated leaves, thylakoids, and stroma. Scanning electron microscopy showed the purity and size distribution of the isolated PGs. Compared with previous PG proteome analyses, we excluded several proteins and identified six new PG proteins, including an M48 metallopeptidase and two Absence of bc1 complex (ABC1) atypical kinases, confirmed by immunoblotting. This refined PG proteome consisted of 30 proteins, including six ABC1 kinases and seven fibrillins together comprising more than 70% of the PG protein mass. Other fibrillins were located predominantly in the stroma or thylakoid and not in PGs; we discovered that this partitioning can be predicted by their isoelectric point and hydrophobicity. A genome-wide coexpression network for the PG genes was then constructed from mRNA expression data. This revealed a modular network with four distinct modules that each contained at least one ABC1K and/or fibrillin gene. Each module showed clear enrichment in specific functions, including chlorophyll degradation/senescence, isoprenoid biosynthesis, plastid proteolysis, and redox regulators and phosphoregulators of electron flow. We propose a new testable model for the PGs, in which sets of genes are associated with specific PG functions. PMID:22274653

  3. Dynamic in vivo binding of transcription factors to cis-regulatory modules of cer and gsc in the stepwise formation of the Spemann–Mangold organizer

    PubMed Central

    Sudou, Norihiro; Yamamoto, Shinji; Ogino, Hajime; Taira, Masanori

    2012-01-01

    How multiple developmental cues are integrated on cis-regulatory modules (CRMs) for cell fate decisions remains uncertain. The Spemann–Mangold organizer in Xenopus embryos expresses the transcription factors Lim1/Lhx1, Otx2, Mix1, Siamois (Sia) and VegT. Reporter analyses using sperm nuclear transplantation and DNA injection showed that cerberus (cer) and goosecoid (gsc) are activated by the aforementioned transcription factors through CRMs conserved between X. laevis and X. tropicalis. ChIP-qPCR analysis for the five transcription factors revealed that cer and gsc CRMs are initially bound by both Sia and VegT at the late blastula stage, and subsequently bound by all five factors at the gastrula stage. At the neurula stage, only binding of Lim1 and Otx2 to the gsc CRM, among others, persists, which corresponds to their co-expression in the prechordal plate. Based on these data, together with detailed expression pattern analysis, we propose a new model of stepwise formation of the organizer, in which (1) maternal VegT and Wnt-induced Sia first bind to CRMs at the blastula stage; then (2) Nodal-inducible Lim1, Otx2, Mix1 and zygotic VegT are bound to CRMs in the dorsal endodermal and mesodermal regions where all these genes are co-expressed; and (3) these two regions are combined at the gastrula stage to form the organizer. Thus, the in vivo dynamics of multiple transcription factors highlight their roles in the initiation and maintenance of gene expression, and also reveal the stepwise integration of maternal, Nodal and Wnt signaling on CRMs of organizer genes to generate the organizer. PMID:22492356

  4. Dynamic in vivo binding of transcription factors to cis-regulatory modules of cer and gsc in the stepwise formation of the Spemann-Mangold organizer.

    PubMed

    Sudou, Norihiro; Yamamoto, Shinji; Ogino, Hajime; Taira, Masanori

    2012-05-01

    How multiple developmental cues are integrated on cis-regulatory modules (CRMs) for cell fate decisions remains uncertain. The Spemann-Mangold organizer in Xenopus embryos expresses the transcription factors Lim1/Lhx1, Otx2, Mix1, Siamois (Sia) and VegT. Reporter analyses using sperm nuclear transplantation and DNA injection showed that cerberus (cer) and goosecoid (gsc) are activated by the aforementioned transcription factors through CRMs conserved between X. laevis and X. tropicalis. ChIP-qPCR analysis for the five transcription factors revealed that cer and gsc CRMs are initially bound by both Sia and VegT at the late blastula stage, and subsequently bound by all five factors at the gastrula stage. At the neurula stage, only binding of Lim1 and Otx2 to the gsc CRM, among others, persists, which corresponds to their co-expression in the prechordal plate. Based on these data, together with detailed expression pattern analysis, we propose a new model of stepwise formation of the organizer, in which (1) maternal VegT and Wnt-induced Sia first bind to CRMs at the blastula stage; then (2) Nodal-inducible Lim1, Otx2, Mix1 and zygotic VegT are bound to CRMs in the dorsal endodermal and mesodermal regions where all these genes are co-expressed; and (3) these two regions are combined at the gastrula stage to form the organizer. Thus, the in vivo dynamics of multiple transcription factors highlight their roles in the initiation and maintenance of gene expression, and also reveal the stepwise integration of maternal, Nodal and Wnt signaling on CRMs of organizer genes to generate the organizer.

  5. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery.

    PubMed

    Jia, Zhilong; Liu, Ying; Guan, Naiyang; Bo, Xiaochen; Luo, Zhigang; Barnes, Michael R

    2016-05-27

    Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and repositioning, allowing the grouping and prioritisation of drug repositioning candidates on the basis of putative mode of action.

  6. Differences in transcriptional effects of 1α,25 dihydroxyvitamin D3 on fibroblasts associated to breast carcinomas and from paired normal breast tissues.

    PubMed

    Campos, Laura Tojeiro; Brentani, Helena; Roela, Rosimeire Aparecida; Katayama, Maria Lucia Hirata; Lima, Leandro; Rolim, Cíntia Flores; Milani, Cíntia; Folgueira, Maria Aparecida Azevedo Koike; Brentani, Maria Mitzi

    2013-01-01

    The effects of 1α,25 dihydroxyvitamin D3 (1,25D) on breast carcinoma associated fibroblasts (CAFs) are still unknown. This study aimed to identify genes whose expression was altered after 1,25D treatment in CAFs and matched adjacent normal mammary associated fibroblasts (NAFs). CAFs and NAFs (from 5 patients) were cultured with or without (control) 1,25D 100 nM. Both CAF and NAF expressed vitamin D receptor (VDR) and 1,25D induction of the genomic pathway was detected through up-regulation of the target gene CYP24A1. Microarray analysis showed that despite presenting 50% of overlapping genes, CAFs and NAFs exhibited distinct transcriptional profiles after 1,25D treatment (FDR<0.05). Functional analysis revealed that in CAFs, genes associated with proliferation (NRG1, WNT5A, PDGFC) were down regulated and those involved in immune modulation (NFKBIA, TREM-1) were up regulated, consistent with anti tumor activities of 1,25D in breast cancer. In NAFs, a distinct subset of genes was induced by 1,25D, involved in anti apoptosis, detoxification, antibacterial defense system and protection against oxidative stress, which may limit carcinogenesis. Co-expression network and interactome analysis of genes commonly regulated by 1,25D in NAFs and CAFs revealed differences in their co-expression values, suggesting that 1,25D effects in NAFs are distinct from those triggered in CAFs. Copyright © 2012 Elsevier Ltd. All rights reserved.

  7. ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.

    PubMed

    Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2017-01-04

    The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) is contributed to various biological processes and linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represent about 20 times expansion when comparing to the previous released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulations of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. Network-based Analysis of Genome Wide Association Data Provides Novel Candidate Genes for Lipid and Lipoprotein Traits*

    PubMed Central

    Sharma, Amitabh; Gulbahce, Natali; Pevzner, Samuel J.; Menche, Jörg; Ladenvall, Claes; Folkersen, Lasse; Eriksson, Per; Orho-Melander, Marju; Barabási, Albert-László

    2013-01-01

    Genome wide association studies (GWAS) identify susceptibility loci for complex traits, but do not identify particular genes of interest. Integration of functional and network information may help in overcoming this limitation and identifying new susceptibility loci. Using GWAS and comorbidity data, we present a network-based approach to predict candidate genes for lipid and lipoprotein traits. We apply a prediction pipeline incorporating interactome, co-expression, and comorbidity data to Global Lipids Genetics Consortium (GLGC) GWAS for four traits of interest, identifying phenotypically coherent modules. These modules provide insights regarding gene involvement in complex phenotypes with multiple susceptibility alleles and low effect sizes. To experimentally test our predictions, we selected four candidate genes and genotyped representative SNPs in the Malmö Diet and Cancer Cardiovascular Cohort. We found significant associations with LDL-C and total-cholesterol levels for a synonymous SNP (rs234706) in the cystathionine beta-synthase (CBS) gene (p = 1 × 10−5 and adjusted-p = 0.013, respectively). Further, liver samples taken from 206 patients revealed that patients with the minor allele of rs234706 had significant dysregulation of CBS (p = 0.04). Despite the known biological role of CBS in lipid metabolism, SNPs within the locus have not yet been identified in GWAS of lipoprotein traits. Thus, the GWAS-based Comorbidity Module (GCM) approach identifies candidate genes missed by GWAS studies, serving as a broadly applicable tool for the investigation of other complex disease phenotypes. PMID:23882023

  9. A Consensus Network of Gene Regulatory Factors in the Human Frontal Lobe

    PubMed Central

    Berto, Stefano; Perdomo-Sabogal, Alvaro; Gerighausen, Daniel; Qin, Jing; Nowick, Katja

    2016-01-01

    Cognitive abilities, such as memory, learning, language, problem solving, and planning, involve the frontal lobe and other brain areas. Not much is known yet about the molecular basis of cognitive abilities, but it seems clear that cognitive abilities are determined by the interplay of many genes. One approach for analyzing the genetic networks involved in cognitive functions is to study the coexpression networks of genes with known importance for proper cognitive functions, such as genes that have been associated with cognitive disorders like intellectual disability (ID) or autism spectrum disorders (ASD). Because many of these genes are gene regulatory factors (GRFs) we aimed to provide insights into the gene regulatory networks active in the human frontal lobe. Using genome wide human frontal lobe expression data from 10 independent data sets, we first derived 10 individual coexpression networks for all GRFs including their potential target genes. We observed a high level of variability among these 10 independently derived networks, pointing out that relying on results from a single study can only provide limited biological insights. To instead focus on the most confident information from these 10 networks we developed a method for integrating such independently derived networks into a consensus network. This consensus network revealed robust GRF interactions that are conserved across the frontal lobes of different healthy human individuals. Within this network, we detected a strong central module that is enriched for 166 GRFs known to be involved in brain development and/or cognitive disorders. Interestingly, several hubs of the consensus network encode for GRFs that have not yet been associated with brain functions. Their central role in the network suggests them as excellent new candidates for playing an essential role in the regulatory network of the human frontal lobe, which should be investigated in future studies. PMID:27014338

  10. Persistent activation of interlinked type 2 airway epithelial gene networks in sputum-derived cells from aeroallergen-sensitized symptomatic asthmatics.

    PubMed

    Jones, Anya C; Troy, Niamh M; White, Elisha; Hollams, Elysia M; Gout, Alexander M; Ling, Kak-Ming; Kicic, Anthony; Stick, Stephen M; Sly, Peter D; Holt, Patrick G; Hall, Graham L; Bosco, Anthony

    2018-01-24

    Atopic asthma is a persistent disease characterized by intermittent wheeze and progressive loss of lung function. The disease is thought to be driven primarily by chronic aeroallergen-induced type 2-associated inflammation. However, the vast majority of atopics do not develop asthma despite ongoing aeroallergen exposure, suggesting additional mechanisms operate in conjunction with type 2 immunity to drive asthma pathogenesis. We employed RNA-Seq profiling of sputum-derived cells to identify gene networks operative at baseline in house dust mite-sensitized (HDM S ) subjects with/without wheezing history that are characteristic of the ongoing asthmatic state. The expression of type 2 effectors (IL-5, IL-13) was equivalent in both cohorts of subjects. However, in HDM S -wheezers they were associated with upregulation of two coexpression modules comprising multiple type 2- and epithelial-associated genes. The first module was interlinked by the hubs EGFR, ERBB2, CDH1 and IL-13. The second module was associated with CDHR3 and mucociliary clearance genes. Our findings provide new insight into the molecular mechanisms operative at baseline in the airway mucosa in atopic asthmatics undergoing natural aeroallergen exposure, and suggest that susceptibility to asthma amongst these subjects involves complex interactions between type 2- and epithelial-associated gene networks, which are not operative in equivalently sensitized/exposed atopic non-asthmatics.

  11. Co-expression analysis identifies CRC and AP1 the regulator of Arabidopsis fatty acid biosynthesis.

    PubMed

    Han, Xinxin; Yin, Linlin; Xue, Hongwei

    2012-07-01

    Fatty acids (FAs) play crucial rules in signal transduction and plant development, however, the regulation of FA metabolism is still poorly understood. To study the relevant regulatory network, fifty-eight FA biosynthesis genes including de novo synthases, desaturases and elongases were selected as "guide genes" to construct the co-expression network. Calculation of the correlation between all Arabidopsis thaliana (L.) genes with each guide gene by Arabidopsis co-expression dating mining tools (ACT) identifies 797 candidate FA-correlated genes. Gene ontology (GO) analysis of these co-expressed genes showed they are tightly correlated to photosynthesis and carbohydrate metabolism, and function in many processes. Interestingly, 63 transcription factors (TFs) were identified as candidate FA biosynthesis regulators and 8 TF families are enriched. Two TF genes, CRC and AP1, both correlating with 8 FA guide genes, were further characterized. Analyses of the ap1 and crc mutant showed the altered total FA composition of mature seeds. The contents of palmitoleic acid, stearic acid, arachidic acid and eicosadienoic acid are decreased, whereas that of oleic acid is increased in ap1 and crc seeds, which is consistent with the qRT-PCR analysis revealing the suppressed expression of the corresponding guide genes. In addition, yeast one-hybrid analysis and electrophoretic mobility shift assay (EMSA) revealed that CRC can bind to the promoter regions of KCS7 and KCS15, indicating that CRC may directly regulate FA biosynthesis. © 2012 Institute of Botany, Chinese Academy of Sciences.

  12. Broad Integration of Expression Maps and Co-Expression Networks Compassing Novel Gene Functions in the Brain

    PubMed Central

    Okamura-Oho, Yuko; Shimokawa, Kazuro; Nishimura, Masaomi; Takemoto, Satoko; Sato, Akira; Furuichi, Teiichi; Yokota, Hideo

    2014-01-01

    Using a recently invented technique for gene expression mapping in the whole-anatomy context, termed transcriptome tomography, we have generated a dataset of 36,000 maps of overall gene expression in the adult-mouse brain. Here, using an informatics approach, we identified a broad co-expression network that follows an inverse power law and is rich in functional interaction and gene-ontology terms. Our framework for the integrated analysis of expression maps and graphs of co-expression networks revealed that groups of combinatorially expressed genes, which regulate cell differentiation during development, were present in the adult brain and each of these groups was associated with a discrete cell types. These groups included non-coding genes of unknown function. We found that these genes specifically linked developmentally conserved groups in the network. A previously unrecognized robust expression pattern covering the whole brain was related to the molecular anatomy of key biological processes occurring in particular areas. PMID:25382412

  13. Modular organization of the white spruce (Picea glauca) transcriptome reveals functional organization and evolutionary signatures.

    PubMed

    Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J

    2015-07-01

    Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristerm (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  14. Transcriptional profiles of bovine in vivo pre-implantation development.

    PubMed

    Jiang, Zongliang; Sun, Jiangwen; Dong, Hong; Luo, Oscar; Zheng, Xinbao; Obergfell, Craig; Tang, Yong; Bi, Jinbo; O'Neill, Rachel; Ruan, Yijun; Chen, Jingbo; Tian, Xiuchun Cindy

    2014-09-04

    During mammalian pre-implantation embryonic development dramatic and orchestrated changes occur in gene transcription. The identification of the complete changes has not been possible until the development of the Next Generation Sequencing Technology. Here we report comprehensive transcriptome dynamics of single matured bovine oocytes and pre-implantation embryos developed in vivo. Surprisingly, more than half of the estimated 22,000 bovine genes, 11,488 to 12,729 involved in more than 100 pathways, is expressed in oocytes and early embryos. Despite the similarity in the total numbers of genes expressed across stages, the nature of the expressed genes is dramatically different. A total of 2,845 genes were differentially expressed among different stages, of which the largest change was observed between the 4- and 8-cell stages, demonstrating that the bovine embryonic genome is activated at this transition. Additionally, 774 genes were identified as only expressed/highly enriched in particular stages of development, suggesting their stage-specific roles in embryogenesis. Using weighted gene co-expression network analysis, we found 12 stage-specific modules of co-expressed genes that can be used to represent the corresponding stage of development. Furthermore, we identified conserved key members (or hub genes) of the bovine expressed gene networks. Their vast association with other embryonic genes suggests that they may have important regulatory roles in embryo development; yet, the majority of the hub genes are relatively unknown/under-studied in embryos. We also conducted the first comparison of embryonic expression profiles across three mammalian species, human, mouse and bovine, for which RNA-seq data are available. We found that the three species share more maternally deposited genes than embryonic genome activated genes. More importantly, there are more similarities in embryonic transcriptomes between bovine and humans than between humans and mice, demonstrating that bovine embryos are better models for human embryonic development. This study provides a comprehensive examination of gene activities in bovine embryos and identified little-known potential master regulators of pre-implantation development.

  15. Co-expression of NCED and ALO improves vitamin C level and tolerance to drought and chilling in transgenic tobacco and stylo plants.

    PubMed

    Bao, Gegen; Zhuo, Chunliu; Qian, Chunmei; Xiao, Ting; Guo, Zhenfei; Lu, Shaoyun

    2016-01-01

    Abscisic acid (ABA) regulates plant adaptive responses to various environmental stresses, while L-ascorbic acid (AsA) that is also named vitamin C is an important antioxidant and involves in plant stress tolerance and the immune system in domestic animals. Transgenic tobacco (Nicotiana tabacum L.) and stylo [Stylosanthes guianensis (Aublet) Swartz], a forage legume, plants co-expressing stylo 9-cis-epoxycarotenoid dioxygenase (SgNCED1) and yeast D-arabinono-1,4-lactone oxidase (ALO) genes were generated in this study, and tolerance to drought and chilling was analysed in comparison with transgenic tobacco overexpressing SgNCED1 or ALO and the wild-type plants. Compared to the SgNCED1 or ALO transgenic plants, in which only ABA or AsA levels were increased, both ABA and AsA levels were increased in transgenic tobacco and stylo plants co-expressing SgNCED1 and ALO genes. Compared to the wild type, an enhanced drought tolerance was observed in SgNCED1 transgenic tobacco plants with induced expression of drought-responsive genes, but not in ALO plants, while an enhanced chilling tolerance was observed in ALO transgenic tobaccos with induced expression of cold-responsive genes, but not in SgNCED1 plants. Co-expression of SgNCED1 and ALO genes resulted in elevated tolerance to both drought and chilling in transgenic tobacco and stylo plants with induced expression of both drought and cold-responsive genes. Our result suggests that co-expression of SgNCED1 and ALO genes is an effective way for use in forage plant improvement for increased tolerance to drought and chilling and nutrition quality. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  16. Coexpression network based on natural variation in human gene expression reveals gene interactions and functions

    PubMed Central

    Nayak, Renuka R.; Kearns, Michael; Spielman, Richard S.; Cheung, Vivian G.

    2009-01-01

    Genes interact in networks to orchestrate cellular processes. Analysis of these networks provides insights into gene interactions and functions. Here, we took advantage of normal variation in human gene expression to infer gene networks, which we constructed using correlations in expression levels of more than 8.5 million gene pairs in immortalized B cells from three independent samples. The resulting networks allowed us to identify biological processes and gene functions. Among the biological pathways, we found processes such as translation and glycolysis that co-occur in the same subnetworks. We predicted the functions of poorly characterized genes, including CHCHD2 and TMEM111, and provided experimental evidence that TMEM111 is part of the endoplasmic reticulum-associated secretory pathway. We also found that IFIH1, a susceptibility gene of type 1 diabetes, interacts with YES1, which plays a role in glucose transport. Furthermore, genes that predispose to the same diseases are clustered nonrandomly in the coexpression network, suggesting that networks can provide candidate genes that influence disease susceptibility. Therefore, our analysis of gene coexpression networks offers information on the role of human genes in normal and disease processes. PMID:19797678

  17. One-plasmid tunable coexpression for mycobacterial protein–protein interaction studies

    PubMed Central

    Chang, Yong; Mead, David; Dhodda, Vinay; Brumm, Phil; Fox, Brian G

    2009-01-01

    A single plasmid that allows controlled coexpression has been developed for use in mycobacteria. The tetracycline inducible promoter, PtetO, was used to provide tetracycline-dependent induction of one gene, while the Psmyc, Pimyc, or Phsp promoters were used to provide three different levels of constitutive expression of a second gene. The functions of these four individual promoters were established using green fluorescent protein (GFP) and a newly identified red fluorescence inducible protein from Geobacillus sterothermophilus strain G1.13 (RFIP) as reporters. The tandem use of GFP and RFIP as reporter genes allowed optimization of the tunable coexpression in Mycobacterium smegmatis; either time at a fixed inducer concentration or changes in inducer concentration could be used to control the protein:protein ratio. This single vector system was used to coexpress the two-protein Mycobacterium tuberculosis stearoyl-CoA Δ9 desaturase complex (integral membrane desaturase Rv3229c and NADPH oxidoreductase Rv3230c) in M. smegmatis. The catalytic activity was found to increase in a manner corresponding to increasing the level of Rv3230c relative to a fixed level of Rv3229c. This system, which can yield finely tuned coexpression of the fatty acid desaturase complex in mycobacteria, may be useful for study of other multicomponent complexes. Furthermore, the tunable coexpression strategy used herein should also be applicable in other species with minor modifications. PMID:19760663

  18. NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples.

    PubMed

    Yu, Hua; Jiao, Bingke; Lu, Lu; Wang, Pengfei; Chen, Shuangcheng; Liang, Chengzhi; Liu, Wei

    2018-01-01

    Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression network have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Network (GCN) by integrating three frequently used inference algorithms. We constructed a RNA-seq-based GCN in one species of monocot rice. The quality of network obtained by our method was verified and evaluated by the curated gene functional association data sets, which obviously outperformed each single method. In addition, the powerful capability of network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method to predict the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provided a valuable and highly reliable data source to select key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner.

  19. Identification of Crowding Stress Tolerance Co-Expression Networks Involved in Sweet Corn Yield

    PubMed Central

    Choe, Eunsoo; Drnevich, Jenny; Williams, Martin M.

    2016-01-01

    Tolerance to crowding stress has played a crucial role in improving agronomic productivity in field corn; however, commercial sweet corn hybrids vary greatly in crowding stress tolerance. The objectives were to 1) explore transcriptional changes among sweet corn hybrids with differential yield under crowding stress, 2) identify relationships between phenotypic responses and gene expression patterns, and 3) identify groups of genes associated with yield and crowding stress tolerance. Under conditions of crowding stress, three high-yielding and three low-yielding sweet corn hybrids were grouped for transcriptional and phenotypic analyses. Transcriptional analyses identified from 372 to 859 common differentially expressed genes (DEGs) for each hybrid. Large gene expression pattern variation among hybrids and only 26 common DEGs across all hybrid comparisons were identified, suggesting each hybrid has a unique response to crowding stress. Over-represented biological functions of DEGs also differed among hybrids. Strong correlation was observed between: 1) modules with up-regulation in high-yielding hybrids and yield traits, and 2) modules with up-regulation in low-yielding hybrids and plant/ear traits. Modules linked with yield traits may be important crowding stress response mechanisms influencing crop yield. Functional analysis of the modules and common DEGs identified candidate crowding stress tolerant processes in photosynthesis, glycolysis, cell wall, carbohydrate/nitrogen metabolic process, chromatin, and transcription regulation. Moreover, these biological functions were greatly inter-connected, indicating the importance of improving the mechanisms as a network. PMID:26796516

  20. Understanding developmental and adaptive cues in pine through metabolite profiling and co-expression network analysis

    PubMed Central

    Cañas, Rafael A.; Canales, Javier; Muñoz-Hernández, Carmen; Granados, Jose M.; Ávila, Concepción; García-Martín, María L.; Cánovas, Francisco M.

    2015-01-01

    Conifers include long-lived evergreen trees of great economic and ecological importance, including pines and spruces. During their long lives conifers must respond to seasonal environmental changes, adapt to unpredictable environmental stresses, and co-ordinate their adaptive adjustments with internal developmental programmes. To gain insights into these responses, we examined metabolite and transcriptomic profiles of needles from naturally growing 25-year-old maritime pine (Pinus pinaster L. Aiton) trees over a year. The effect of environmental parameters such as temperature and rain on needle development were studied. Our results show that seasonal changes in the metabolite profiles were mainly affected by the needles’ age and acclimation for winter, but changes in transcript profiles were mainly dependent on climatic factors. The relative abundance of most transcripts correlated well with temperature, particularly for genes involved in photosynthesis or winter acclimation. Gene network analysis revealed relationships between 14 co-expressed gene modules and development and adaptation to environmental stimuli. Novel Myb transcription factors were identified as candidate regulators during needle development. Our systems-based analysis provides integrated data of the seasonal regulation of maritime pine growth, opening new perspectives for understanding the complex regulatory mechanisms underlying conifers’ adaptive responses. Taken together, our results suggest that the environment regulates the transcriptome for fine tuning of the metabolome during development. PMID:25873654

  1. Modular transcriptional repertoire and MicroRNA target analyses characterize genomic dysregulation in the thymus of Down syndrome infants

    PubMed Central

    Moreira-Filho, Carlos Alberto; Bando, Silvia Yumi; Bertonha, Fernanda Bernardi; Silva, Filipi Nascimento; da Fontoura Costa, Luciano; Ferreira, Leandro Rodrigues; Furlanetto, Glaucio; Chacur, Paulo; Zerbini, Maria Claudia Nogueira; Carneiro-Sampaio, Magda

    2016-01-01

    Trisomy 21-driven transcriptional alterations in human thymus were characterized through gene coexpression network (GCN) and miRNA-target analyses. We used whole thymic tissue - obtained at heart surgery from Down syndrome (DS) and karyotipically normal subjects (CT) - and a network-based approach for GCN analysis that allows the identification of modular transcriptional repertoires (communities) and the interactions between all the system's constituents through community detection. Changes in the degree of connections observed for hierarchically important hubs/genes in CT and DS networks corresponded to community changes. Distinct communities of highly interconnected genes were topologically identified in these networks. The role of miRNAs in modulating the expression of highly connected genes in CT and DS was revealed through miRNA-target analysis. Trisomy 21 gene dysregulation in thymus may be depicted as the breakdown and altered reorganization of transcriptional modules. Leading networks acting in normal or disease states were identified. CT networks would depict the “canonical” way of thymus functioning. Conversely, DS networks represent a “non-canonical” way, i.e., thymic tissue adaptation under trisomy 21 genomic dysregulation. This adaptation is probably driven by epigenetic mechanisms acting at chromatin level and through the miRNA control of transcriptional programs involving the networks' high-hierarchy genes. PMID:26848775

  2. Analysis of genetic association using hierarchical clustering and cluster validation indices.

    PubMed

    Pagnuco, Inti A; Pastore, Juan I; Abras, Guillermo; Brun, Marcel; Ballarin, Virginia L

    2017-10-01

    It is usually assumed that co-expressed genes suggest co-regulation in the underlying regulatory network. Determining sets of co-expressed genes is an important task, based on some criteria of similarity. This task is usually performed by clustering algorithms, where the genes are clustered into meaningful groups based on their expression values in a set of experiment. In this work, we propose a method to find sets of co-expressed genes, based on cluster validation indices as a measure of similarity for individual gene groups, and a combination of variants of hierarchical clustering to generate the candidate groups. We evaluated its ability to retrieve significant sets on simulated correlated and real genomics data, where the performance is measured based on its detection ability of co-regulated sets against a full search. Additionally, we analyzed the quality of the best ranked groups using an online bioinformatics tool that provides network information for the selected genes. Copyright © 2017 Elsevier Inc. All rights reserved.

  3. Transcriptome analysis of an apple (Malus × domestica) yellow fruit somatic mutation identifies a gene network module highly associated with anthocyanin and epigenetic regulation.

    PubMed

    El-Sharkawy, Islam; Liang, Dong; Xu, Kenong

    2015-12-01

    Using RNA-seq, this study analysed an apple (Malus×domestica) anthocyanin-deficient yellow-skin somatic mutant 'Blondee' (BLO) and its red-skin parent 'Kidd's D-8' (KID), the original name of 'Gala', to understand the molecular mechanisms underlying the mutation. A total of 3299 differentially expressed genes (DEGs) were identified between BLO and KID at four developmental stages and/or between two adjacent stages within BLO and/or KID. A weighted gene co-expression network analysis (WGCNA) of the DEGs uncovered a network module of 34 genes highly correlated (r=0.95, P=9.0×10(-13)) with anthocyanin contents. Although 12 of the 34 genes in the WGCNA module were characterized and known of roles in anthocyanin, the remainder 22 appear to be novel. Examining the expression of ten representative genes in the module in 14 diverse apples revealed that at least eight were significantly correlated with anthocyanin variation. MdMYB10 (MDP0000259614) and MdGST (MDP0000252292) were among the most suppressed module member genes in BLO despite being undistinguishable in their corresponding sequences between BLO and KID. Methylation assay of MdMYB10 and MdGST in fruit skin revealed that two regions (MR3 and MR7) in the MdMYB10 promoter exhibited remarkable differences between BLO and KID. In particular, methylation was high and progressively increased alongside fruit development in BLO while was correspondingly low and constant in KID. The methylation levels in both MR3 and MR7 were negatively correlated with anthocyanin content as well as the expression of MdMYB10 and MdGST. Clearly, the collective repression of the 34 genes explains the loss-of-colour in BLO while the methylation in MdMYB10 promoter is likely causal for the mutation. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  4. Transcriptome analysis of an apple (Malus × domestica) yellow fruit somatic mutation identifies a gene network module highly associated with anthocyanin and epigenetic regulation

    PubMed Central

    El-Sharkawy, Islam; Liang, Dong; Xu, Kenong

    2015-01-01

    Using RNA-seq, this study analysed an apple (Malus×domestica) anthocyanin-deficient yellow-skin somatic mutant ‘Blondee’ (BLO) and its red-skin parent ‘Kidd’s D-8’ (KID), the original name of ‘Gala’, to understand the molecular mechanisms underlying the mutation. A total of 3299 differentially expressed genes (DEGs) were identified between BLO and KID at four developmental stages and/or between two adjacent stages within BLO and/or KID. A weighted gene co-expression network analysis (WGCNA) of the DEGs uncovered a network module of 34 genes highly correlated (r=0.95, P=9.0×10–13) with anthocyanin contents. Although 12 of the 34 genes in the WGCNA module were characterized and known of roles in anthocyanin, the remainder 22 appear to be novel. Examining the expression of ten representative genes in the module in 14 diverse apples revealed that at least eight were significantly correlated with anthocyanin variation. MdMYB10 (MDP0000259614) and MdGST (MDP0000252292) were among the most suppressed module member genes in BLO despite being undistinguishable in their corresponding sequences between BLO and KID. Methylation assay of MdMYB10 and MdGST in fruit skin revealed that two regions (MR3 and MR7) in the MdMYB10 promoter exhibited remarkable differences between BLO and KID. In particular, methylation was high and progressively increased alongside fruit development in BLO while was correspondingly low and constant in KID. The methylation levels in both MR3 and MR7 were negatively correlated with anthocyanin content as well as the expression of MdMYB10 and MdGST. Clearly, the collective repression of the 34 genes explains the loss-of-colour in BLO while the methylation in MdMYB10 promoter is likely causal for the mutation. PMID:26417021

  5. Age gene expression and coexpression progressive signatures in peripheral blood leukocytes.

    PubMed

    Irizar, Haritz; Goñi, Joaquín; Alzualde, Ainhoa; Castillo-Triviño, Tamara; Olascoaga, Javier; Lopez de Munain, Adolfo; Otaegui, David

    2015-12-01

    Both cellular senescence and organismic aging are known to be dynamic processes that start early in life and progress constantly during the whole life of the individual. In this work, with the objective of identifying signatures of age-related progressive change at the transcriptomic level, we have performed a whole-genome gene expression analysis of peripheral blood leukocytes in a group of healthy individuals with ages ranging from 14 to 93 years. A set of genes with progressively changing gene expression (either increase or decrease with age) has been identified and contextualized in a coexpression network. A modularity analysis has been performed on this network and biological-term and pathway enrichment analyses have been used for biological interpretation of each module. In summary, the results of the present work reveal the existence of a transcriptomic component that shows progressive expression changes associated to age in peripheral blood leukocytes, highlighting both the dynamic nature of the process and the need to complement young vs. elder studies with longitudinal studies that include middle aged individuals. From the transcriptional point of view, immunosenescence seems to be occurring from a relatively early age, at least from the late 20s/early 30s, and the 49-56 year old age-range appears to be critical. In general, the genes that, according to our results, show progressive expression changes with aging are involved in pathogenic/cellular processes that have classically been linked to aging in humans: cancer, immune processes and cellular growth vs. maintenance. Copyright © 2015 Elsevier Inc. All rights reserved.

  6. Microglia recapitulate a hematopoietic master regulator network in the aging human frontal cortex

    PubMed Central

    Wehrspaun, Claudia C.; Haerty, Wilfried; Ponting, Chris P.

    2015-01-01

    Microglia form the immune system of the brain. Previous studies in cell cultures and animal models suggest altered activation states and cellular senescence in the aged brain. Instead, we analyzed 3 transcriptome data sets from the postmortem frontal cortex of 381 control individuals to show that microglia gene markers assemble into a transcriptional module in a gene coexpression network. These markers predominantly represented M1 and M1/M2b activation phenotypes. Expression of genes in this module generally declines over the adult life span. This decrease was more pronounced in microglia surface receptors for microglia and/or neuron crosstalk than in markers for activation state phenotypes. In addition to these receptors for exogenous signals, microglia are controlled by brain-expressed regulatory factors. We identified a subnetwork of transcription factors, including RUNX1, IRF8, PU.1, and TAL1, which are master regulators (MRs) for the age-dependent microglia module. The causal contributions of these MRs on the microglia module were verified using publicly available ChIP-Seq data. Interactions of these key MRs were preserved in a protein-protein interaction network. Importantly, these MRs appear to be essential for regulating microglia homeostasis in the adult human frontal cortex in addition to their crucial roles in hematopoiesis and myeloid cell-fate decisions during embryogenesis. PMID:26002684

  7. Whole genome co-expression analysis of soybean cytochrome P450 genes identifies nodulation-specific P450 monooxygenases

    PubMed Central

    2010-01-01

    Background Cytochrome P450 monooxygenases (P450s) catalyze oxidation of various substrates using oxygen and NAD(P)H. Plant P450s are involved in the biosynthesis of primary and secondary metabolites performing diverse biological functions. The recent availability of the soybean genome sequence allows us to identify and analyze soybean putative P450s at a genome scale. Co-expression analysis using an available soybean microarray and Illumina sequencing data provides clues for functional annotation of these enzymes. This approach is based on the assumption that genes that have similar expression patterns across a set of conditions may have a functional relationship. Results We have identified a total number of 332 full-length P450 genes and 378 pseudogenes from the soybean genome. From the full-length sequences, 195 genes belong to A-type, which could be further divided into 20 families. The remaining 137 genes belong to non-A type P450s and are classified into 28 families. A total of 178 probe sets were found to correspond to P450 genes on the Affymetrix soybean array. Out of these probe sets, 108 represented single genes. Using the 28 publicly available microarray libraries that contain organ-specific information, some tissue-specific P450s were identified. Similarly, stress responsive soybean P450s were retrieved from 99 microarray soybean libraries. We also utilized Illumina transcriptome sequencing technology to analyze the expressions of all 332 soybean P450 genes. This dataset contains total RNAs isolated from nodules, roots, root tips, leaves, flowers, green pods, apical meristem, mock-inoculated and Bradyrhizobium japonicum-infected root hair cells. The tissue-specific expression patterns of these P450 genes were analyzed and the expression of a representative set of genes were confirmed by qRT-PCR. We performed the co-expression analysis on many of the 108 P450 genes on the Affymetrix arrays. First we confirmed that CYP93C5 (an isoflavone synthase gene) is co-expressed with several genes encoding isoflavonoid-related metabolic enzymes. We then focused on nodulation-induced P450s and found that CYP728H1 was co-expressed with the genes involved in phenylpropanoid metabolism. Similarly, CYP736A34 was highly co-expressed with lipoxygenase, lectin and CYP83D1, all of which are involved in root and nodule development. Conclusions The genome scale analysis of P450s in soybean reveals many unique features of these important enzymes in this crop although the functions of most of them are largely unknown. Gene co-expression analysis proves to be a useful tool to infer the function of uncharacterized genes. Our work presented here could provide important leads toward functional genomics studies of soybean P450s and their regulatory network through the integration of reverse genetics, biochemistry, and metabolic profiling tools. The identification of nodule-specific P450s and their further exploitation may help us to better understand the intriguing process of soybean and rhizobium interaction. PMID:21062474

  8. Identification of candidate genes in Populus cell wall biosynthesis using text-mining, co-expression network and comparative genomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Xiaohan; Ye, Chuyu; Bisaria, Anjali

    2011-01-01

    Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical in reducing biomass recalcitrance, a major hindrance in efficient generation of ethanol from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes that had the experimental evidences supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database and additionalmore » genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and re-construct a network that captured additional network neighbors, expanding to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high likelihood candidates for functional genomics in relation to cell wall biosynthesis.« less

  9. Predicting gene regulatory networks of soybean nodulation from RNA-Seq transcriptome data.

    PubMed

    Zhu, Mingzhu; Dahmen, Jeremy L; Stacey, Gary; Cheng, Jianlin

    2013-09-22

    High-throughput RNA sequencing (RNA-Seq) is a revolutionary technique to study the transcriptome of a cell under various conditions at a systems level. Despite the wide application of RNA-Seq techniques to generate experimental data in the last few years, few computational methods are available to analyze this huge amount of transcription data. The computational methods for constructing gene regulatory networks from RNA-Seq expression data of hundreds or even thousands of genes are particularly lacking and urgently needed. We developed an automated bioinformatics method to predict gene regulatory networks from the quantitative expression values of differentially expressed genes based on RNA-Seq transcriptome data of a cell in different stages and conditions, integrating transcriptional, genomic and gene function data. We applied the method to the RNA-Seq transcriptome data generated for soybean root hair cells in three different development stages of nodulation after rhizobium infection. The method predicted a soybean nodulation-related gene regulatory network consisting of 10 regulatory modules common for all three stages, and 24, 49 and 70 modules separately for the first, second and third stage, each containing both a group of co-expressed genes and several transcription factors collaboratively controlling their expression under different conditions. 8 of 10 common regulatory modules were validated by at least two kinds of validations, such as independent DNA binding motif analysis, gene function enrichment test, and previous experimental data in the literature. We developed a computational method to reliably reconstruct gene regulatory networks from RNA-Seq transcriptome data. The method can generate valuable hypotheses for interpreting biological data and designing biological experiments such as ChIP-Seq, RNA interference, and yeast two hybrid experiments.

  10. CD8 single-cell gene coexpression reveals three different effector types present at distinct phases of the immune response

    PubMed Central

    Peixoto, António; Evaristo, César; Munitic, Ivana; Monteiro, Marta; Charbit, Alain; Rocha, Benedita; Veiga-Fernandes, Henrique

    2007-01-01

    To study in vivo CD8 T cell differentiation, we quantified the coexpression of multiple genes in single cells throughout immune responses. After in vitro activation, CD8 T cells rapidly express effector molecules and cease their expression when the antigen is removed. Gene behavior after in vivo activation, in contrast, was quite heterogeneous. Different mRNAs were induced at very different time points of the response, were transcribed during different time periods, and could decline or persist independently of the antigen load. Consequently, distinct gene coexpression patterns/different cell types were generated at the various phases of the immune responses. During primary stimulation, inflammatory molecules were induced and down-regulated shortly after activation, generating early cells that only mediated inflammation. Cytotoxic T cells were generated at the peak of the primary response, when individual cells simultaneously expressed multiple killer molecules, whereas memory cells lost killer capacity because they no longer coexpressed killer genes. Surprisingly, during secondary responses gene transcription became permanent. Secondary cells recovered after antigen elimination were more efficient killers than cytotoxic T cells present at the peak of the primary response. Thus, primary responses produced two transient effector types. However, after boosting, CD8 T cells differentiate into long-lived killer cells that persist in vivo in the absence of antigen. PMID:17485515

  11. Transcriptional and Hormonal Regulation of Gravitropism of Woody Stems in Populus[OPEN

    PubMed Central

    Gerttula, Suzanne; Zinkgraf, Matthew; Lewis, Daniel R.; Brumer, Harry; Hart, Foster; Filkov, Vladimir

    2015-01-01

    Angiosperm trees reorient their woody stems by asymmetrically producing a specialized xylem tissue, tension wood, which exerts a strong contractile force resulting in negative gravitropism of the stem. Here, we show, in Populus trees, that initial gravity perception and response occurs in specialized cells through sedimentation of starch-filled amyloplasts and relocalization of the auxin transport protein, PIN3. Gibberellic acid treatment stimulates the rate of tension wood formation and gravibending and enhances tissue-specific expression of an auxin-responsive reporter. Gravibending, maturation of contractile fibers, and gibberellic acid (GA) stimulation of tension wood formation are all sensitive to transcript levels of the Class I KNOX homeodomain transcription factor-encoding gene ARBORKNOX2 (ARK2). We generated genome-wide transcriptomes for trees in which gene expression was perturbed by gravistimulation, GA treatment, and modulation of ARK2 expression. These data were employed in computational analyses to model the transcriptional networks underlying wood formation, including identification and dissection of gene coexpression modules associated with wood phenotypes, GA response, and ARK2 binding to genes within modules. We propose a model for gravitropism in the woody stem in which the peripheral location of PIN3-expressing cells relative to the cambium results in auxin transport toward the cambium in the top of the stem, triggering tension wood formation, while transport away from the cambium in the bottom of the stem triggers opposite wood formation. PMID:26410302

  12. A regulatory network-based approach dissects late maturation processes related to the acquisition of desiccation tolerance and longevity of Medicago truncatula seeds.

    PubMed

    Verdier, Jerome; Lalanne, David; Pelletier, Sandra; Torres-Jerez, Ivone; Righetti, Karima; Bandyopadhyay, Kaustav; Leprince, Olivier; Chatelain, Emilie; Vu, Benoit Ly; Gouzy, Jerome; Gamas, Pascal; Udvardi, Michael K; Buitink, Julia

    2013-10-01

    In seeds, desiccation tolerance (DT) and the ability to survive the dry state for prolonged periods of time (longevity) are two essential traits for seed quality that are consecutively acquired during maturation. Using transcriptomic and metabolomic profiling together with a conditional-dependent network of global transcription interactions, we dissected the maturation events from the end of seed filling to final maturation drying during the last 3 weeks of seed development in Medicago truncatula. The network revealed distinct coexpression modules related to the acquisition of DT, longevity, and pod abscission. The acquisition of DT and dormancy module was associated with abiotic stress response genes, including late embryogenesis abundant (LEA) genes. The longevity module was enriched in genes involved in RNA processing and translation. Concomitantly, LEA polypeptides accumulated, displaying an 18-d delayed accumulation compared with transcripts. During maturation, gulose and stachyose levels increased and correlated with longevity. A seed-specific network identified known and putative transcriptional regulators of DT, including ABSCISIC ACID-INSENSITIVE3 (MtABI3), MtABI4, MtABI5, and APETALA2/ ETHYLENE RESPONSE ELEMENT BINDING PROTEIN (AtAP2/EREBP) transcription factor as major hubs. These transcriptional activators were highly connected to LEA genes. Longevity genes were highly connected to two MtAP2/EREBP and two basic leucine zipper transcription factors. A heat shock factor was found at the transition of DT and longevity modules, connecting to both gene sets. Gain- and loss-of-function approaches of MtABI3 confirmed 80% of its predicted targets, thereby experimentally validating the network. This study captures the coordinated regulation of seed maturation and identifies distinct regulatory networks underlying the preparation for the dry and quiescent states.

  13. A Regulatory Network-Based Approach Dissects Late Maturation Processes Related to the Acquisition of Desiccation Tolerance and Longevity of Medicago truncatula Seeds1[C][W][OPEN

    PubMed Central

    Verdier, Jerome; Lalanne, David; Pelletier, Sandra; Torres-Jerez, Ivone; Righetti, Karima; Bandyopadhyay, Kaustav; Leprince, Olivier; Chatelain, Emilie; Vu, Benoit Ly; Gouzy, Jerome; Gamas, Pascal; Udvardi, Michael K.; Buitink, Julia

    2013-01-01

    In seeds, desiccation tolerance (DT) and the ability to survive the dry state for prolonged periods of time (longevity) are two essential traits for seed quality that are consecutively acquired during maturation. Using transcriptomic and metabolomic profiling together with a conditional-dependent network of global transcription interactions, we dissected the maturation events from the end of seed filling to final maturation drying during the last 3 weeks of seed development in Medicago truncatula. The network revealed distinct coexpression modules related to the acquisition of DT, longevity, and pod abscission. The acquisition of DT and dormancy module was associated with abiotic stress response genes, including late embryogenesis abundant (LEA) genes. The longevity module was enriched in genes involved in RNA processing and translation. Concomitantly, LEA polypeptides accumulated, displaying an 18-d delayed accumulation compared with transcripts. During maturation, gulose and stachyose levels increased and correlated with longevity. A seed-specific network identified known and putative transcriptional regulators of DT, including ABSCISIC ACID-INSENSITIVE3 (MtABI3), MtABI4, MtABI5, and APETALA2/ ETHYLENE RESPONSE ELEMENT BINDING PROTEIN (AtAP2/EREBP) transcription factor as major hubs. These transcriptional activators were highly connected to LEA genes. Longevity genes were highly connected to two MtAP2/EREBP and two basic leucine zipper transcription factors. A heat shock factor was found at the transition of DT and longevity modules, connecting to both gene sets. Gain- and loss-of-function approaches of MtABI3 confirmed 80% of its predicted targets, thereby experimentally validating the network. This study captures the coordinated regulation of seed maturation and identifies distinct regulatory networks underlying the preparation for the dry and quiescent states. PMID:23929721

  14. Expression of lycopene biosynthesis genes fused in line with Shine-Dalgarno sequences improves the stress-tolerance of Lactococcus lactis.

    PubMed

    Dong, Xiangrong; Wang, Yanping; Yang, Fengyuan; Zhao, Shanshan; Tian, Bing; Li, Tao

    2017-01-01

    Lycopene biosynthetic genes from Deinococcus radiodurans were co-expressed in Lactococcus lactis to produce lycopene and improve its tolerance to stress. Lycopene-related genes from D. radiodurans, DR1395 (crtE), DR0862 (crtB), and DR0861 (crtI), were fused in line with S hine-Dalgarno (SD) sequences and co-expressed in L. lactis. The recombinant strain produced 0.36 mg lycopene g -1  dry cell wt after 48 h fermentation. The survival rate to UV irradiation of the recombinant strain was higher than that of the non-transformed strain. The L. lactis with co-expressed genes responsible for lycopene biosynthesis from D. radiodurans produced lycopene and exhibited increased resistance to UV stress, suggesting that the recombinant strain has important application potential in food industry.

  15. Analysis of genetic association in Listeria and Diabetes using Hierarchical Clustering and Silhouette Index

    NASA Astrophysics Data System (ADS)

    Pagnuco, Inti A.; Pastore, Juan I.; Abras, Guillermo; Brun, Marcel; Ballarin, Virginia L.

    2016-04-01

    It is usually assumed that co-expressed genes suggest co-regulation in the underlying regulatory network. Determining sets of co-expressed genes is an important task, where significative groups of genes are defined based on some criteria. This task is usually performed by clustering algorithms, where the whole family of genes, or a subset of them, are clustered into meaningful groups based on their expression values in a set of experiment. In this work we used a methodology based on the Silhouette index as a measure of cluster quality for individual gene groups, and a combination of several variants of hierarchical clustering to generate the candidate groups, to obtain sets of co-expressed genes for two real data examples. We analyzed the quality of the best ranked groups, obtained by the algorithm, using an online bioinformatics tool that provides network information for the selected genes. Moreover, to verify the performance of the algorithm, considering the fact that it doesn’t find all possible subsets, we compared its results against a full search, to determine the amount of good co-regulated sets not detected.

  16. Comparative Systems Analyses Reveal Molecular Signatures of Clinically tested Vaccine Adjuvants

    NASA Astrophysics Data System (ADS)

    Olafsdottir, Thorunn A.; Lindqvist, Madelene; Nookaew, Intawat; Andersen, Peter; Maertzdorf, Jeroen; Persson, Josefine; Christensen, Dennis; Zhang, Yuan; Anderson, Jenna; Khoomrung, Sakda; Sen, Partho; Agger, Else Marie; Coler, Rhea; Carter, Darrick; Meinke, Andreas; Rappuoli, Rino; Kaufmann, Stefan H. E.; Reed, Steven G.; Harandi, Ali M.

    2016-12-01

    A better understanding of the mechanisms of action of human adjuvants could inform a rational development of next generation vaccines for human use. Here, we exploited a genome wide transcriptomics analysis combined with a systems biology approach to determine the molecular signatures induced by four clinically tested vaccine adjuvants, namely CAF01, IC31, GLA-SE and Alum in mice. We report signature molecules, pathways, gene modules and networks, which are shared by or otherwise exclusive to these clinical-grade adjuvants in whole blood and draining lymph nodes of mice. Intriguingly, co-expression analysis revealed blood gene modules highly enriched for molecules with documented roles in T follicular helper (TFH) and germinal center (GC) responses. We could show that all adjuvants enhanced, although with different magnitude and kinetics, TFH and GC B cell responses in draining lymph nodes. These results represent, to our knowledge, the first comparative systems analysis of clinically tested vaccine adjuvants that may provide new insights into the mechanisms of action of human adjuvants.

  17. Long non-coding RNAs and mRNAs profiling during spleen development in pig.

    PubMed

    Che, Tiandong; Li, Diyan; Jin, Long; Fu, Yuhua; Liu, Yingkai; Liu, Pengliang; Wang, Yixin; Tang, Qianzi; Ma, Jideng; Wang, Xun; Jiang, Anan; Li, Xuewei; Li, Mingzhou

    2018-01-01

    Genome-wide transcriptomic studies in humans and mice have become extensive and mature. However, a comprehensive and systematic understanding of protein-coding genes and long non-coding RNAs (lncRNAs) expressed during pig spleen development has not been achieved. LncRNAs are known to participate in regulatory networks for an array of biological processes. Here, we constructed 18 RNA libraries from developing fetal pig spleen (55 days before birth), postnatal pig spleens (0, 30, 180 days and 2 years after birth), and the samples from the 2-year-old Wild Boar. A total of 15,040 lncRNA transcripts were identified among these samples. We found that the temporal expression pattern of lncRNAs was more restricted than observed for protein-coding genes. Time-series analysis showed two large modules for protein-coding genes and lncRNAs. The up-regulated module was enriched for genes related to immune and inflammatory function, while the down-regulated module was enriched for cell proliferation processes such as cell division and DNA replication. Co-expression networks indicated the functional relatedness between protein-coding genes and lncRNAs, which were enriched for similar functions over the series of time points examined. We identified numerous differentially expressed protein-coding genes and lncRNAs in all five developmental stages. Notably, ceruloplasmin precursor (CP), a protein-coding gene participating in antioxidant and iron transport processes, was differentially expressed in all stages. This study provides the first catalog of the developing pig spleen, and contributes to a fuller understanding of the molecular mechanisms underpinning mammalian spleen development.

  18. Intra- and interregional coregulation of opioid genes: broken symmetry in spinal circuits

    PubMed Central

    Kononenko, Olga; Galatenko, Vladimir; Andersson, Malin; Bazov, Igor; Watanabe, Hiroyuki; Zhou, Xing Wu; Iatsyshyna, Anna; Mityakina, Irina; Yakovleva, Tatiana; Sarkisyan, Daniil; Ponomarev, Igor; Krishtal, Oleg; Marklund, Niklas; Tonevitsky, Alex; Adkins, DeAnna L.; Bakalkin, Georgy

    2017-01-01

    Regulation of the formation and rewiring of neural circuits by neuropeptides may require coordinated production of these signaling molecules and their receptors that may be established at the transcriptional level. Here, we address this hypothesis by comparing absolute expression levels of opioid peptides with their receptors, the largest neuropeptide family, and by characterizing coexpression (transcriptionally coordinated) patterns of these genes. We demonstrated that expression patterns of opioid genes highly correlate within and across functionally and anatomically different areas. Opioid peptide genes, compared with their receptor genes, are transcribed at much greater absolute levels, which suggests formation of a neuropeptide cloud that covers the receptor-expressed circuits. Surprisingly, we found that both expression levels and the proportion of opioid receptors are strongly lateralized in the spinal cord, interregional coexpression patterns are side specific, and intraregional coexpression profiles are affected differently by left- and right-side unilateral body injury. We propose that opioid genes are regulated as interconnected components of the same molecular system distributed between distinct anatomic regions. The striking feature of this system is its asymmetric coexpression patterns, which suggest side-specific regulation of selective neural circuits by opioid neurohormones.—Kononenko, O., Galatenko, V., Andersson, M., Bazov, I., Watanabe, H., Zhou, X. W., Iatsyshyna, A., Mityakina, I., Yakovleva, T., Sarkisyan, D., Ponomarev, I., Krishtal, O., Marklund, N., Tonevitsky, A., Adkins, D. L., Bakalkin, G. Intra- and interregional coregulation of opioid genes: broken symmetry in spinal circuits. PMID:28122917

  19. Systematic comparison of co-expression of multiple recombinant thermophilic enzymes in Escherichia coli BL21(DE3).

    PubMed

    Chen, Hui; Huang, Rui; Zhang, Y-H Percival

    2017-06-01

    The precise control of multiple heterologous enzyme expression levels in one Escherichia coli strain is important for cascade biocatalysis, metabolic engineering, synthetic biology, natural product synthesis, and studies of complexed proteins. We systematically investigated the co-expression of up to four thermophilic enzymes (i.e., α-glucan phosphorylase (αGP), phosphoglucomutase (PGM), glucose 6-phosphate dehydrogenase (G6PDH), and 6-phosphogluconate dehydrogenase (6PGDH)) in E. coli BL21(DE3) by adding T7 promoter or T7 terminator of each gene for multiple genes in tandem, changing gene alignment, and comparing one or two plasmid systems. It was found that the addition of T7 terminator after each gene was useful to decrease the influence of the upstream gene. The co-expression of the four enzymes in E. coli BL21(DE3) was demonstrated to generate two NADPH molecules from one glucose unit of maltodextrin, where NADPH was oxidized to convert xylose to xylitol. The best four-gene co-expression system was based on two plasmids (pET and pACYC) which harbored two genes. As a result, apparent enzymatic activities of the four enzymes were regulated to be at similar levels and the overall four-enzyme activity was the highest based on the formation of xylitol. This study provides useful information for the precise control of multi-enzyme-coordinated expression in E. coli BL21(DE3).

  20. Increased co-expression of genes harboring the damaging de novo mutations in Chinese schizophrenic patients during prenatal development.

    PubMed

    Wang, Qiang; Li, Miaoxin; Yang, Zhenxing; Hu, Xun; Wu, Hei-Man; Ni, Peiyan; Ren, Hongyan; Deng, Wei; Li, Mingli; Ma, Xiaohong; Guo, Wanjun; Zhao, Liansheng; Wang, Yingcheng; Xiang, Bo; Lei, Wei; Sham, Pak C; Li, Tao

    2015-12-15

    Schizophrenia is a heritable, heterogeneous common psychiatric disorder. In this study, we evaluated the hypothesis that de novo variants (DNVs) contribute to the pathogenesis of schizophrenia. We performed exome sequencing in Chinese patients (N = 45) with schizophrenia and their unaffected parents (N = 90). Forty genes were found to contain DNVs. These genes had enriched transcriptional co-expression profile in prenatal frontal cortex (Bonferroni corrected p < 9.1 × 10(-3)), and in prenatal temporal and parietal regions (Bonferroni corrected p < 0.03). Also, four prenatal anatomical subregions (VCF, MFC, OFC and ITC) have shown significant enrichment of connectedness in co-expression networks. Moreover, four genes (LRP1, MACF1, DICER1 and ABCA2) harboring the damaging de novo mutations are strongly prioritized as susceptibility genes by multiple evidences. Our findings in Chinese schizophrenic patients indicate the pathogenic role of DNVs, supporting the hypothesis that schizophrenia is a neurodevelopmental disease.

  1. Increased co-expression of genes harboring the damaging de novo mutations in Chinese schizophrenic patients during prenatal development

    PubMed Central

    Wang, Qiang; Li, Miaoxin; Yang, Zhenxing; Hu, Xun; Wu, Hei-Man; Ni, Peiyan; Ren, Hongyan; Deng, Wei; Li, Mingli; Ma, Xiaohong; Guo, Wanjun; Zhao, Liansheng; Wang, Yingcheng; Xiang, Bo; Lei, Wei; Sham, Pak C; Li, Tao

    2015-01-01

    Schizophrenia is a heritable, heterogeneous common psychiatric disorder. In this study, we evaluated the hypothesis that de novo variants (DNVs) contribute to the pathogenesis of schizophrenia. We performed exome sequencing in Chinese patients (N = 45) with schizophrenia and their unaffected parents (N = 90). Forty genes were found to contain DNVs. These genes had enriched transcriptional co-expression profile in prenatal frontal cortex (Bonferroni corrected p < 9.1 × 10−3), and in prenatal temporal and parietal regions (Bonferroni corrected p < 0.03). Also, four prenatal anatomical subregions (VCF, MFC, OFC and ITC) have shown significant enrichment of connectedness in co-expression networks. Moreover, four genes (LRP1, MACF1, DICER1 and ABCA2) harboring the damaging de novo mutations are strongly prioritized as susceptibility genes by multiple evidences. Our findings in Chinese schizophrenic patients indicate the pathogenic role of DNVs, supporting the hypothesis that schizophrenia is a neurodevelopmental disease. PMID:26666178

  2. Using bioinformatics and systems genetics to dissect HDL-cholesterol genetics in an MRL/MpJ x SM/J intercross.

    PubMed

    Leduc, Magalie S; Blair, Rachael Hageman; Verdugo, Ricardo A; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A; Paigen, Beverly

    2012-06-01

    A higher incidence of coronary artery disease is associated with a lower level of HDL-cholesterol. We searched for genetic loci influencing HDL-cholesterol in F2 mice from a cross between MRL/MpJ and SM/J mice. Quantitative trait loci (QTL) mapping revealed one significant HDL QTL (Apoa2 locus), four suggestive QTL on chromosomes 10, 11, 13, and 18 and four additional QTL on chromosomes 1 proximal, 3, 4, and 7 after adjusting HDL for the strong Apoa2 locus. A novel nonsynonymous polymorphism supports Lipg as the QTL gene for the chromosome 18 QTL, and a difference in Abca1 expression in liver tissue supports it as the QTL gene for the chromosome 4 QTL. Using weighted gene co-expression network analysis, we identified a module that after adjustment for Apoa2, correlated with HDL, was genetically determined by a QTL on chromosome 11, and overlapped with the HDL QTL. A combination of bioinformatics tools and systems genetics helped identify several candidate genes for both the chromosome 11 HDL and module QTL based on differential expression between the parental strains, cis regulation of expression, and causality modeling. We conclude that integrating systems genetics to a more-traditional genetics approach improves the power of complex trait gene identification.

  3. Co-expression of antioxidant enzymes with expression of p53, DNA repair, and heat shock protein genes in the gamma ray-irradiated hermaphroditic fish Kryptolebias marmoratus larvae.

    PubMed

    Rhee, Jae-Sung; Kim, Bo-Mi; Kim, Ryeo-Ok; Seo, Jung Soo; Kim, Il-Chan; Lee, Young-Mi; Lee, Jae-Seong

    2013-09-15

    To investigate effects of gamma ray irradiation in the hermaphroditic fish, Kryptolebias marmoratus larvae, we checked expression of p53, DNA repair, and heat shock protein genes with several antioxidant enzyme activities by quantitative real-time RT-PCR and biochemical methods in response to different doses of gamma radiation. As a result, the level of gamma radiation-induced DNA damage was initiated after 4Gy of radiation, and biochemical and molecular damage became substantial from 8Gy. In particular, several DNA repair mechanism-related genes were significantly modulated in the 6Gy gamma radiation-exposed fish larvae, suggesting that upregulation of such DNA repair genes was closely associated with cell survival after gamma irradiation. The mRNA expression of p53 and most hsps was also significantly upregulated at high doses of gamma radiation related to cellular damage. This finding indicates that gamma radiation can induce oxidative stress with associated antioxidant enzyme activities, and linked to modulation of the expression of DNA repair-related genes as one of the defense mechanisms against radiation damage. This study provides a better understanding of the molecular mode of action of defense mechanisms upon gamma radiation in fish larvae. Copyright © 2013 Elsevier B.V. All rights reserved.

  4. Systemic bioinformatics analysis of skeletal muscle gene expression profiles of sepsis

    PubMed Central

    Yang, Fang; Wang, Yumei

    2018-01-01

    Sepsis is a type of systemic inflammatory response syndrome with high morbidity and mortality. Skeletal muscle dysfunction is one of the major complications of sepsis that may also influence the outcome of sepsis. The aim of the present study was to explore and identify potential mechanisms and therapeutic targets of sepsis. Systemic bioinformatics analysis of skeletal muscle gene expression profiles from the Gene Expression Omnibus was performed. Differentially expressed genes (DEGs) in samples from patients with sepsis and control samples were screened out using the limma package. Differential co-expression and coregulation (DCE and DCR, respectively) analysis was performed based on the Differential Co-expression Analysis package to identify differences in gene co-expression and coregulation patterns between the control and sepsis groups. Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways of DEGs were identified using the Database for Annotation, Visualization and Integrated Discovery, and inflammatory, cancer and skeletal muscle development-associated biological processes and pathways were identified. DCE and DCR analysis revealed several potential therapeutic targets for sepsis, including genes and transcription factors. The results of the present study may provide a basis for the development of novel therapeutic targets and treatment methods for sepsis. PMID:29805480

  5. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    PubMed

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

  6. One-step affinity tag purification of full-length recombinant human AP-1 complexes from bacterial inclusion bodies using a polycistronic expression system

    PubMed Central

    Wang, Wei-Ming; Lee, A-Young; Chiang, Cheng-Ming

    2008-01-01

    The AP-1 transcription factor is a dimeric protein complex formed primarily between Jun (c-Jun, JunB, JunD) and Fos (c-Fos, FosB, Fra-1, Fra-2) family members. These distinct AP-1 complexes are expressed in many cell types and modulate target gene expression implicated in cell proliferation, differentiation, and stress responses. Although the importance of AP-1 has long been recognized, the biochemical characterization of AP-1 remains limited in part due to the difficulty in purifying full-length, reconstituted dimers with active DNA-binding and transcriptional activity. Using a combination of bacterial coexpression and epitope-tagging methods, we successfully purified all 12 heterodimers (3 Jun × 4 Fos) of full-length human AP-1 complexes as well as c-Jun/c-Jun, JunD/JunD, and c-Jun/JunD dimers from bacterial inclusion bodies using one-step nickel-NTA affinity tag purification following denaturation and renaturation of coexpressed AP-1 subunits. Coexpression of two constitutive components in a dimeric AP-1 complex helps stabilize the proteins when compared with individual protein expression in bacteria. Purified dimeric AP-1 complexes are functional in sequence-specific DNA binding, as illustrated by electrophoretic mobility shift assays and DNase I footprinting, and are also active in transcription with in vitro-reconstituted human papillomavirus (HPV) chromatin containing AP-1-binding sites in the native configuration of HPV nucleosomes. The availability of these recombinant full-length human AP-1 complexes has greatly facilitated mechanistic studies of AP-1-regulated gene transcription in many biological systems. PMID:18329890

  7. Genetic divergence in the transcriptional engram of chronic alcohol abuse: A laser-capture RNA-seq study of the mouse mesocorticolimbic system.

    PubMed

    Mulligan, Megan K; Mozhui, Khyobeni; Pandey, Ashutosh K; Smith, Maren L; Gong, Suzhen; Ingels, Jesse; Miles, Michael F; Lopez, Marcelo F; Lu, Lu; Williams, Robert W

    2017-02-01

    Genetic factors that influence the transition from initial drinking to dependence remain enigmatic. Recent studies have leveraged chronic intermittent ethanol (CIE) paradigms to measure changes in brain gene expression in a single strain at 0, 8, 72 h, and even 7 days following CIE. We extend these findings using LCM RNA-seq to profile expression in 11 brain regions in two inbred strains - C57BL/6J (B6) and DBA/2J (D2) - 72 h following multiple cycles of ethanol self-administration and CIE. Linear models identified differential expression based on treatment, region, strain, or interactions with treatment. Nearly 40% of genes showed a robust effect (FDR < 0.01) of region, and hippocampus CA1, cortex, bed nucleus stria terminalis, and nucleus accumbens core had the highest number of differentially expressed genes after treatment. Another 8% of differentially expressed genes demonstrated a robust effect of strain. As expected, based on similar studies in B6, treatment had a much smaller impact on expression; only 72 genes (p < 0.01) are modulated by treatment (independent of region or strain). Strikingly, many more genes (415) show a strain-specific and largely opposite response to treatment and are enriched in processes related to RNA metabolism, transcription factor activity, and mitochondrial function. Over 3 times as many changes in gene expression were detected in D2 compared to B6, and weighted gene co-expression network analysis (WGCNA) module comparison identified more modules enriched for treatment effects in D2. Substantial strain differences exist in the temporal pattern of transcriptional neuroadaptation to CIE, and these may drive individual differences in risk of addiction following excessive alcohol consumption. Copyright © 2016 Elsevier Inc. All rights reserved.

  8. Integrative Analysis of Many Weighted Co-Expression Networks Using Tensor Computation

    PubMed Central

    Li, Wenyuan; Liu, Chun-Chi; Zhang, Tong; Li, Haifeng; Waterman, Michael S.; Zhou, Xianghong Jasmine

    2011-01-01

    The rapid accumulation of biological networks poses new challenges and calls for powerful integrative analysis tools. Most existing methods capable of simultaneously analyzing a large number of networks were primarily designed for unweighted networks, and cannot easily be extended to weighted networks. However, it is known that transforming weighted into unweighted networks by dichotomizing the edges of weighted networks with a threshold generally leads to information loss. We have developed a novel, tensor-based computational framework for mining recurrent heavy subgraphs in a large set of massive weighted networks. Specifically, we formulate the recurrent heavy subgraph identification problem as a heavy 3D subtensor discovery problem with sparse constraints. We describe an effective approach to solving this problem by designing a multi-stage, convex relaxation protocol, and a non-uniform edge sampling technique. We applied our method to 130 co-expression networks, and identified 11,394 recurrent heavy subgraphs, grouped into 2,810 families. We demonstrated that the identified subgraphs represent meaningful biological modules by validating against a large set of compiled biological knowledge bases. We also showed that the likelihood for a heavy subgraph to be meaningful increases significantly with its recurrence in multiple networks, highlighting the importance of the integrative approach to biological network analysis. Moreover, our approach based on weighted graphs detects many patterns that would be overlooked using unweighted graphs. In addition, we identified a large number of modules that occur predominately under specific phenotypes. This analysis resulted in a genome-wide mapping of gene network modules onto the phenome. Finally, by comparing module activities across many datasets, we discovered high-order dynamic cooperativeness in protein complex networks and transcriptional regulatory networks. PMID:21698123

  9. Co-expression of mitosis-regulating genes contributes to malignant progression and prognosis in oligodendrogliomas

    PubMed Central

    Liu, Yanwei; Hu, Huimin; Zhang, Chuanbao; Wang, Haoyuan; Zhang, Wenlong; Wang, Zheng; Li, Mingyang; Zhang, Wei; Zhou, Dabiao; Jiang, Tao

    2015-01-01

    The clinical prognosis of patients with glioma is determined by tumor grades, but tumors of different subtypes with equal malignancy grade usually have different prognosis that is largely determined by genetic abnormalities. Oligodendrogliomas (ODs) are the second most common type of gliomas. In this study, integrative analyses found that distribution of TCGA transcriptomic subtypes was associated with grade progression in ODs. To identify critical gene(s) associated with tumor grades and TCGA subtypes, we analyzed 34 normal brain tissue (NBT), 146 WHO grade II and 130 grade III ODs by microarray and RNA sequencing, and identified a co-expression network of six genes (AURKA, NDC80,CENPK, KIAA0101, TIMELESS and MELK) that was associated with tumor grades and TCGA subtypes as well as Ki-67 expression. Validation of the six genes was performed by qPCR in additional 28 ODs. Importantly, these genes also were validated in four high-grade recurrent gliomas and the initial lower-grade gliomas resected from the same patients. Finally, the RNA data on two genes with the highest discrimination potential (AURKA and NDC80) and Ki-67 were validated on an independent cohort (5 NBTs and 86 ODs) by immunohistochemistry. Knockdown of AURKA and NDC80 by siRNAs suppressed Ki-67 expression and proliferation of gliomas cells. Survival analysis showed that high expression of the six genes corporately indicated a poor survival outcome. Correlation and protein interaction analysis provided further evidence for this co-expression network. These data suggest that the co-expression of the six mitosis-regulating genes was associated with malignant progression and prognosis in ODs. PMID:26468983

  10. Exploring of the molecular mechanism of rhinitis via bioinformatics methods

    PubMed Central

    Song, Yufen; Yan, Zhaohui

    2018-01-01

    The aim of this study was to analyze gene expression profiles for exploring the function and regulatory network of differentially expressed genes (DEGs) in pathogenesis of rhinitis by a bioinformatics method. The gene expression profile of GSE43523 was downloaded from the Gene Expression Omnibus database. The dataset contained 7 seasonal allergic rhinitis samples and 5 non-allergic normal samples. DEGs between rhinitis samples and normal samples were identified via the limma package of R. The webGestal database was used to identify enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of the DEGs. The differentially co-expressed pairs of the DEGs were identified via the DCGL package in R, and the differential co-expression network was constructed based on these pairs. A protein-protein interaction (PPI) network of the DEGs was constructed based on the Search Tool for the Retrieval of Interacting Genes database. A total of 263 DEGs were identified in rhinitis samples compared with normal samples, including 125 downregulated ones and 138 upregulated ones. The DEGs were enriched in 7 KEGG pathways. 308 differential co-expression gene pairs were obtained. A differential co-expression network was constructed, containing 212 nodes. In total, 148 PPI pairs of the DEGs were identified, and a PPI network was constructed based on these pairs. Bioinformatics methods could help us identify significant genes and pathways related to the pathogenesis of rhinitis. Steroid biosynthesis pathway and metabolic pathways might play important roles in the development of allergic rhinitis (AR). Genes such as CDC42 effector protein 5, solute carrier family 39 member A11 and PR/SET domain 10 might be also associated with the pathogenesis of AR, which provided references for the molecular mechanisms of AR. PMID:29257233

  11. Implications of miR166 and miR159 induction to the basal response mechanisms of an andigena potato (Solanum tuberosum subsp. andigena) to salinity stress, predicted from network models in Arabidopsis.

    PubMed

    Kitazumi, Ai; Kawahara, Yoshihiro; Onda, Ty S; De Koeyer, David; de los Reyes, Benildo G

    2015-01-01

    MicroRNA (miRNA) mediated changes in gene expression by post-transcriptional modulation of major regulatory transcription factors is a potent mechanism for integrating growth and stress-related responses. Exotic plants including many traditional varieties of Andean potatoes (Solanum tuberosum subsp. andigena) are known for better adaptation to marginal environments. Stress physiological studies confirmed earlier reports on the salinity tolerance potentials of certain andigena cultivars. Guided by the hypothesis that certain miRNAs play important roles in growth modulation under suboptimal conditions, we identified and characterized salinity stress-responsive miRNA-target gene pairs in the andigena cultivar Sullu by parallel analysis of noncoding and coding RNA transcriptomes. Inverse relationships were established by the reverse co-expression between two salinity stress-regulated miRNAs (miR166, miR159) and their target transcriptional regulators HD-ZIP-Phabulosa/Phavulota and Myb101, respectively. Based on heterologous models in Arabidopsis, the miR166-HD-ZIP-Phabulosa/Phavulota network appears to be involved in modulating growth perhaps by mediating vegetative dormancy, with linkages to defense-related pathways. The miR159-Myb101 network may be important for the modulation of vegetative growth while also controlling stress-induced premature transition to reproductive phase. We postulate that the induction of miR166 and miR159 under salinity stress represents important network hubs for balancing gene expression required for basal growth adjustments.

  12. Simultaneous coexpression of memory-related and effector-related genes by individual human CD8 T cells depends on antigen specificity and differentiation.

    PubMed

    Gupta, Bhawna; Iancu, Emanuela M; Gannon, Philippe O; Wieckowski, Sébastien; Baitsch, Lukas; Speiser, Daniel E; Rufer, Nathalie

    2012-07-01

    Phenotypic and functional cell properties are usually analyzed at the level of defined cell populations but not single cells. Yet, large differences between individual cells may have important functional consequences. It is likely that T-cell-mediated immunity depends on the polyfunctionality of individual T cells, rather than the sum of functions of responding T-cell subpopulations. We performed highly sensitive single-cell gene expression profiling, allowing the direct ex vivo characterization of individual virus-specific and tumor-specific T cells from healthy donors and melanoma patients. We have previously shown that vaccination with the natural tumor peptide Melan-A-induced T cells with superior effector functions as compared with vaccination with the analog peptide optimized for enhanced HLA-A*0201 binding. Here we found that natural peptide vaccination induced tumor-reactive CD8 T cells with frequent coexpression of both memory/homing-associated genes (CD27, IL7R, EOMES, CXCR3, and CCR5) and effector-related genes (IFNG, KLRD1, PRF1, and GZMB), comparable with protective Epstein-Barr virus-specific and cytomegalovirus-specific T cells. In contrast, memory/homing-associated and effector-associated genes were less frequently coexpressed after vaccination with the analog peptide. Remarkably, these findings reveal a previously unknown level of gene expression diversity among vaccine-specific and virus-specific T cells with the simultaneous coexpression of multiple memory/homing-related and effector-related genes by the same cell. Such broad functional gene expression signatures within antigen-specific T cells may be critical for mounting efficient responses to pathogens or tumors. In summary, direct ex vivo high-resolution molecular characterization of individual T cells provides key insights into the processes shaping the functional properties of tumor-specific and virus-specific T cells.

  13. A translational systems biology approach in both animals and humans identifies a functionally related module of accumbal genes involved in the regulation of reward processing and binge drinking in males.

    PubMed

    Stacey, David; Lourdusamy, Anbarasu; Ruggeri, Barbara; Maroteaux, Matthieu; Jia, Tianye; Cattrell, Anna; Nymberg, Charlotte; Banaschewski, Tobias; Bhattacharyya, Sohinee; Band, Hamid; Barker, Gareth; Bokde, Arun; Buchel, Christian; Carvalho, Fabiana; Conrod, Patricia; Desrivieres, Sylvane; Easton, Alanna; Fauth-Buehler, Mira; Fernandez-Medarde, Alberto; Flor, Herta; Frouin, Vincent; Gallinat, Jurgen; Garavanh, Hugh; Heinz, Andreas; Ittermann, Bernd; Lathrop, Mark; Lawrence, Claire; Loth, Eva; Mann, Karl; Martinot, Jean-Luc; Nees, Frauke; Paus, Tomas; Pausova, Zdenka; Rietschel, Marcella; Rotter, Andrea; Santos, Eugenio; Smolka, Michael; Sommer, Wolfgang; Mameli, Manuel; Spanagel, Rainer; Girault, Jean-Antoine; Mueller, Christian; Schumann, Gunter

    2016-04-01

    The mesolimbic dopamine system, composed primarily of dopaminergic neurons in the ventral tegmental area that project to striatal structures, is considered to be the key mediator of reinforcement-related mechanisms in the brain. Prompted by a genome-wide association meta-analysis implicating the Ras-specific guanine nucleotide-releasing factor 2 (RASGRF2) gene in the regulation of alcohol intake in men, we have recently shown that male Rasgrf2(-/-) mice exhibit reduced ethanol intake and preference accompanied by a perturbed mesolimbic dopamine system. We therefore propose that these mice represent a valid model to further elucidate the precise genes and mechanisms regulating mesolimbic dopamine functioning. Transcriptomic data from the nucleus accumbens (NAcc) of male Rasgrf2(-/-) mice and wild-type controls were analyzed by weighted gene coexpression network analysis (WGCNA). We performed follow-up genetic association tests in humans using a sample of male adolescents from the IMAGEN study characterized for binge drinking (n = 905) and ventral striatal activation during an fMRI reward task (n = 608). The WGCNA analyses using accumbal transcriptomic data revealed 37 distinct "modules," or functionally related groups of genes. Two of these modules were significantly associated with Rasgrf2 knockout status: M5 (p < 0.001) and M6 (p < 0.001). In follow-up translational analyses we found that human orthologues for the M5 module were significantly (p < 0.01) enriched with genetic association signals for binge drinking in male adolescents. Furthermore, the most significant locus, originating from the EH-domain containing 4 (EHD4) gene (p < 0.001), was also significantly associated with altered ventral striatal activity in male adolescents performing an fMRI reward task (pempirical < 0.001). It was not possible to determine the extent to which the M5 module was dysregulated in Rasgrf2(-/-) mice by perturbed mesolimbic dopamine signalling or by the loss of Rasgrf2 function in the NAcc. Taken together, our findings indicate that the accumbal M5 module, initially identified as being dysregulated in male Rasgrf2(-/-) mice, is also relevant for human alcohol-related phenotypes potentially through the modulation of reinforcement mechanisms in the NAcc. We therefore propose that the genes comprising this module represent important candidates for further elucidation within the context of alcohol-related phenotypes.

  14. DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.

    PubMed

    Mistry, Divya; Wise, Roger P; Dickerson, Julie A

    2017-01-01

    Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. The python package can be obtained from git.io/diffslcpy. The R implementation and code to reproduce the analysis is available via git.io/diffslc.

  15. Graphical modeling of gene expression in monocytes suggests molecular mechanisms explaining increased atherosclerosis in smokers.

    PubMed

    Verdugo, Ricardo A; Zeller, Tanja; Rotival, Maxime; Wild, Philipp S; Münzel, Thomas; Lackner, Karl J; Weidmann, Henri; Ninio, Ewa; Trégouët, David-Alexandre; Cambien, François; Blankenberg, Stefan; Tiret, Laurence

    2013-01-01

    Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path "smoking→gene expression→plaques". Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the "smoking→gene expression→plaques" causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts. Inspection of correlation structure revealed candidates that would be missed by expression-phenotype association analysis alone.

  16. Graphical Modeling of Gene Expression in Monocytes Suggests Molecular Mechanisms Explaining Increased Atherosclerosis in Smokers

    PubMed Central

    Verdugo, Ricardo A.; Zeller, Tanja; Rotival, Maxime; Wild, Philipp S.; Münzel, Thomas; Lackner, Karl J.; Weidmann, Henri; Ninio, Ewa; Trégouët, David-Alexandre; Cambien, François; Blankenberg, Stefan; Tiret, Laurence

    2013-01-01

    Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path “smoking→gene expression→plaques”. Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the “smoking→gene expression→plaques” causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts. Inspection of correlation structure revealed candidates that would be missed by expression-phenotype association analysis alone. PMID:23372645

  17. The metastasis suppressor RARRES3 as an endogenous inhibitor of the immunoproteasome expression in breast cancer cells

    NASA Astrophysics Data System (ADS)

    Anderson, Alison M.; Kalimutho, Murugan; Harten, Sarah; Nanayakkara, Devathri M.; Khanna, Kum Kum; Ragan, Mark A.

    2017-01-01

    In breast cancer metastasis, the dynamic continuum involving pro- and anti-inflammatory regulators can become compromised. Over 600 genes have been implicated in metastasis to bone, lung or brain but how these genes might contribute to perturbation of immune function is poorly understood. To gain insight, we adopted a gene co-expression network approach that draws on the functional parallels between naturally occurring bone marrow-derived mesenchymal stem cells (BM-MSCs) and cancer stem cells (CSCs). Our network analyses indicate a key role for metastasis suppressor RARRES3, including potential to regulate the immunoproteasome (IP), a specialized proteasome induced under inflammatory conditions. Knockdown of RARRES3 in near-normal mammary epithelial and breast cancer cell lines increases overall transcript and protein levels of the IP subunits, but not of their constitutively expressed counterparts. RARRES3 mRNA expression is controlled by interferon regulatory factor IRF1, an inducer of the IP, and is sensitive to depletion of the retinoid-related receptor RORA that regulates various physiological processes including immunity through modulation of gene expression. Collectively, these findings identify a novel regulatory role for RARRES3 as an endogenous inhibitor of IP expression, and contribute to our evolving understanding of potential pathways underlying breast cancer driven immune modulation.

  18. Protein Network Signatures Associated with Exogenous Biofuels Treatments in Cyanobacterium Synechocystis sp. PCC 6803

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pei, Guangsheng; Chen, Lei; Wang, Jiangxin

    2014-11-03

    Although recognized as a promising microbial cell factory for producing biofuels, current productivity in cyanobacterial systems is low. To make the processes economically feasible, one of the hurdles, which need to be overcome is the low tolerance of hosts to toxic biofuels. Meanwhile, little information is available regarding the cellular responses to biofuels stress in cyanobacteria, which makes it challenging for tolerance engineering. Using large proteomic datasets of Synechocystis under various biofuels stress and environmental perturbation, a protein co-expression network was first constructed and then combined with the experimentally determined protein–protein interaction network. Proteins with statistically higher topological overlap inmore » the integrated network were identified as common responsive proteins to both biofuels stress and environmental perturbations. In addition, a weighted gene co-expression network analysis was performed to distinguish unique responses to biofuels from those to environmental perturbations and to uncover metabolic modules and proteins uniquely associated with biofuels stress. The results showed that biofuel-specific proteins and modules were enriched in several functional categories, including photosynthesis, carbon fixation, and amino acid metabolism, which may represent potential key signatures for biofuels stress responses in Synechocystis. Network-based analysis allowed determination of the responses specifically related to biofuels stress, and the results constituted an important knowledge foundation for tolerance engineering against biofuels in Synechocystis.« less

  19. Transcriptional and Hormonal Regulation of Gravitropism of Woody Stems in Populus.

    PubMed

    Gerttula, Suzanne; Zinkgraf, Matthew; Muday, Gloria K; Lewis, Daniel R; Ibatullin, Farid M; Brumer, Harry; Hart, Foster; Mansfield, Shawn D; Filkov, Vladimir; Groover, Andrew

    2015-10-01

    Angiosperm trees reorient their woody stems by asymmetrically producing a specialized xylem tissue, tension wood, which exerts a strong contractile force resulting in negative gravitropism of the stem. Here, we show, in Populus trees, that initial gravity perception and response occurs in specialized cells through sedimentation of starch-filled amyloplasts and relocalization of the auxin transport protein, PIN3. Gibberellic acid treatment stimulates the rate of tension wood formation and gravibending and enhances tissue-specific expression of an auxin-responsive reporter. Gravibending, maturation of contractile fibers, and gibberellic acid (GA) stimulation of tension wood formation are all sensitive to transcript levels of the Class I KNOX homeodomain transcription factor-encoding gene ARBORKNOX2 (ARK2). We generated genome-wide transcriptomes for trees in which gene expression was perturbed by gravistimulation, GA treatment, and modulation of ARK2 expression. These data were employed in computational analyses to model the transcriptional networks underlying wood formation, including identification and dissection of gene coexpression modules associated with wood phenotypes, GA response, and ARK2 binding to genes within modules. We propose a model for gravitropism in the woody stem in which the peripheral location of PIN3-expressing cells relative to the cambium results in auxin transport toward the cambium in the top of the stem, triggering tension wood formation, while transport away from the cambium in the bottom of the stem triggers opposite wood formation. © 2015 American Society of Plant Biologists. All rights reserved.

  20. Network module detection: Affinity search technique with the multi-node topological overlap measure

    PubMed Central

    Li, Ai; Horvath, Steve

    2009-01-01

    Background Many clustering procedures only allow the user to input a pairwise dissimilarity or distance measure between objects. We propose a clustering method that can input a multi-point dissimilarity measure d(i1, i2, ..., iP) where the number of points P can be larger than 2. The work is motivated by gene network analysis where clusters correspond to modules of highly interconnected nodes. Here, we define modules as clusters of network nodes with high multi-node topological overlap. The topological overlap measure is a robust measure of interconnectedness which is based on shared network neighbors. In previous work, we have shown that the multi-node topological overlap measure yields biologically meaningful results when used as input of network neighborhood analysis. Findings We adapt network neighborhood analysis for the use of module detection. We propose the Module Affinity Search Technique (MAST), which is a generalized version of the Cluster Affinity Search Technique (CAST). MAST can accommodate a multi-node dissimilarity measure. Clusters grow around user-defined or automatically chosen seeds (e.g. hub nodes). We propose both local and global cluster growth stopping rules. We use several simulations and a gene co-expression network application to argue that the MAST approach leads to biologically meaningful results. We compare MAST with hierarchical clustering and partitioning around medoid clustering. Conclusion Our flexible module detection method is implemented in the MTOM software which can be downloaded from the following webpage: PMID:19619323

  1. Network module detection: Affinity search technique with the multi-node topological overlap measure.

    PubMed

    Li, Ai; Horvath, Steve

    2009-07-20

    Many clustering procedures only allow the user to input a pairwise dissimilarity or distance measure between objects. We propose a clustering method that can input a multi-point dissimilarity measure d(i1, i2, ..., iP) where the number of points P can be larger than 2. The work is motivated by gene network analysis where clusters correspond to modules of highly interconnected nodes. Here, we define modules as clusters of network nodes with high multi-node topological overlap. The topological overlap measure is a robust measure of interconnectedness which is based on shared network neighbors. In previous work, we have shown that the multi-node topological overlap measure yields biologically meaningful results when used as input of network neighborhood analysis. We adapt network neighborhood analysis for the use of module detection. We propose the Module Affinity Search Technique (MAST), which is a generalized version of the Cluster Affinity Search Technique (CAST). MAST can accommodate a multi-node dissimilarity measure. Clusters grow around user-defined or automatically chosen seeds (e.g. hub nodes). We propose both local and global cluster growth stopping rules. We use several simulations and a gene co-expression network application to argue that the MAST approach leads to biologically meaningful results. We compare MAST with hierarchical clustering and partitioning around medoid clustering. Our flexible module detection method is implemented in the MTOM software which can be downloaded from the following webpage: http://www.genetics.ucla.edu/labs/horvath/MTOM/

  2. Decoding genes with coexpression networks and metabolomics - 'majority report by precogs'.

    PubMed

    Saito, Kazuki; Hirai, Masami Y; Yonekura-Sakakibara, Keiko

    2008-01-01

    Following the sequencing of whole genomes of model plants, high-throughput decoding of gene function is a major challenge in modern plant biology. In view of remarkable technical advances in transcriptomics and metabolomics, integrated analysis of these 'omics' by data-mining informatics is an excellent tool for prediction and identification of gene function, particularly for genes involved in complicated metabolic pathways. The availability of Arabidopsis public transcriptome datasets containing data of >1000 microarrays reinforces the potential for prediction of gene function by transcriptome coexpression analysis. Here, we review the strategy of combining transcriptome and metabolome as a powerful technology for studying the functional genomics of model plants and also crop and medicinal plants.

  3. A gene co-expression network model identifies yield-related vicinity networks in Jatropha curcas shoot system.

    PubMed

    Govender, Nisha; Senan, Siju; Mohamed-Hussein, Zeti-Azura; Wickneswari, Ratnam

    2018-06-15

    The plant shoot system consists of reproductive organs such as inflorescences, buds and fruits, and the vegetative leaves and stems. In this study, the reproductive part of the Jatropha curcas shoot system, which includes the aerial shoots, shoots bearing the inflorescence and inflorescence were investigated in regard to gene-to-gene interactions underpinning yield-related biological processes. An RNA-seq based sequencing of shoot tissues performed on an Illumina HiSeq. 2500 platform generated 18 transcriptomes. Using the reference genome-based mapping approach, a total of 64 361 genes was identified in all samples and the data was annotated against the non-redundant database by the BLAST2GO Pro. Suite. After removing the outlier genes and samples, a total of 12 734 genes across 17 samples were subjected to gene co-expression network construction using petal, an R library. A gene co-expression network model built with scale-free and small-world properties extracted four vicinity networks (VNs) with putative involvement in yield-related biological processes as follow; heat stress tolerance, floral and shoot meristem differentiation, biosynthesis of chlorophyll molecules and laticifers, cell wall metabolism and epigenetic regulations. Our VNs revealed putative key players that could be adapted in breeding strategies for J. curcas shoot system improvements.

  4. Repeated Gestational Exposure of Mice to Chlorpyrifos Oxon Is Associated with Paraoxonase 1 (PON1) Modulated Effects in Maternal and Fetal Tissues

    PubMed Central

    Co, Aila L.; Hay, Ariel M.; MacDonald, James W.; Bammler, Theo K.; Farin, Federico M.; Costa, Lucio G.; Furlong, Clement E.

    2014-01-01

    Chlorpyrifos oxon (CPO), the toxic metabolite of the organophosphorus (OP) insecticide chlorpyrifos, causes developmental neurotoxicity in humans and rodents. CPO is hydrolyzed by paraoxonase-1 (PON1), with protection determined by PON1 levels and the human Q192R polymorphism. To examine how the Q192R polymorphism influences fetal toxicity associated with gestational CPO exposure, we measured enzyme inhibition and fetal-brain gene expression in wild-type (PON1+/+), PON1-knockout (PON1−/−), and tgHuPON1R192 and tgHuPON1Q192 transgenic mice. Pregnant mice exposed dermally to 0, 0.50, 0.75, or 0.85 mg/kg/d CPO from gestational day (GD) 6 through 17 were sacrificed on GD18. Biomarkers of CPO exposure inhibited in maternal tissues included brain acetylcholinesterase (AChE), red blood cell acylpeptide hydrolase (APH), and plasma butyrylcholinesterase (BChE) and carboxylesterase (CES). Fetal plasma BChE was inhibited in PON1−/− and tgHuPON1Q192, but not PON1+/+ or tgHuPON1R192 mice. Fetal brain AChE and plasma CES were inhibited in PON1−/− mice, but not in other genotypes. Weighted gene co-expression network analysis identified five gene modules based on clustering of the correlations among their fetal-brain expression values, allowing for correlation of module membership with the phenotypic data on enzyme inhibition. One module that correlated highly with maternal brain AChE activity had a large representation of homeobox genes. Gene set enrichment analysis revealed multiple gene sets affected by gestational CPO exposure in tgHuPON1Q192 but not tgHuPON1R192 mice, including gene sets involved in protein export, lipid metabolism, and neurotransmission. These data indicate that maternal PON1 status modulates the effects of repeated gestational CPO exposure on fetal-brain gene expression and on inhibition of both maternal and fetal biomarker enzymes. PMID:25070982

  5. STARNET 2: a web-based tool for accelerating discovery of gene regulatory networks using microarray co-expression data

    PubMed Central

    Jupiter, Daniel; Chen, Hailin; VanBuren, Vincent

    2009-01-01

    Background Although expression microarrays have become a standard tool used by biologists, analysis of data produced by microarray experiments may still present challenges. Comparison of data from different platforms, organisms, and labs may involve complicated data processing, and inferring relationships between genes remains difficult. Results STARNET 2 is a new web-based tool that allows post hoc visual analysis of correlations that are derived from expression microarray data. STARNET 2 facilitates user discovery of putative gene regulatory networks in a variety of species (human, rat, mouse, chicken, zebrafish, Drosophila, C. elegans, S. cerevisiae, Arabidopsis and rice) by graphing networks of genes that are closely co-expressed across a large heterogeneous set of preselected microarray experiments. For each of the represented organisms, raw microarray data were retrieved from NCBI's Gene Expression Omnibus for a selected Affymetrix platform. All pairwise Pearson correlation coefficients were computed for expression profiles measured on each platform, respectively. These precompiled results were stored in a MySQL database, and supplemented by additional data retrieved from NCBI. A web-based tool allows user-specified queries of the database, centered at a gene of interest. The result of a query includes graphs of correlation networks, graphs of known interactions involving genes and gene products that are present in the correlation networks, and initial statistical analyses. Two analyses may be performed in parallel to compare networks, which is facilitated by the new HEATSEEKER module. Conclusion STARNET 2 is a useful tool for developing new hypotheses about regulatory relationships between genes and gene products, and has coverage for 10 species. Interpretation of the correlation networks is supported with a database of previously documented interactions, a test for enrichment of Gene Ontology terms, and heat maps of correlation distances that may be used to compare two networks. The list of genes in a STARNET network may be useful in developing a list of candidate genes to use for the inference of causal networks. The tool is freely available at , and does not require user registration. PMID:19828039

  6. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gompers, Andrea L.; Su-Feher, Linda; Ellegood, Jacob

    The chromatin remodeling gene CHD8 represents a central node in neurodevelopmental gene networks implicated in autism. In this paper, we examined the impact of germline heterozygous frameshift Chd8 mutation on neurodevelopment in mice. Chd8 +/ del5 mice displayed normal social interactions with no repetitive behaviors but exhibited cognitive impairment correlated with increased regional brain volume, validating that phenotypes of Chd8 +/ del5 mice overlap pathology reported in humans with CHD8 mutations. We applied network analysis to characterize neurodevelopmental gene expression, revealing widespread transcriptional changes in Chd8 +/ del5 mice across pathways disrupted in neurodevelopmental disorders, including neurogenesis, synaptic processes andmore » neuroimmune signaling. We identified a co-expression module with peak expression in early brain development featuring dysregulation of RNA processing, chromatin remodeling and cell-cycle genes enriched for promoter binding by Chd8, and we validated increased neuronal proliferation and developmental splicing perturbation in Chd8 +/ del5 mice. Finally, this integrative analysis offers an initial picture of the consequences of Chd8 haploinsufficiency for brain development.« less

  7. Gene expression analysis of colorectal cancer by bioinformatics strategy.

    PubMed

    Cui, Meng; Yuan, Junhua; Li, Jun; Sun, Bing; Li, Tao; Li, Yuantao; Wu, Guoliang

    2014-10-01

    We used bioinformatics technology to analyze gene expression profiles involved in colorectal cancer tissue samples and healthy controls. In this paper, we downloaded the gene expression profile GSE4107 from Gene Expression Omnibus (GEO) database, in which a total of 22 chips were available, including normal colonic mucosa tissue from normal healthy donors (n=10), colorectal cancer tissue samples from colorectal patients (n=33). To further understand the biological functions of the screened DGEs, the KEGG pathway enrichment analysis were conducted. Then we built a transcriptome network to study differentially co-expressed links. A total of 3151 DEGs of CRC were selected. Besides, total 164 DCGs (Differentially Coexpressed Gene, DCG) and 29279 DCLs (Differentially Co-expressed Link, DCL) were obtained. Furthermore, the significantly enriched KEGG pathways were Endocytosis, Calcium signaling pathway, Vascular smooth muscle contraction, Linoleic acid metabolism, Arginine and proline metabolism, Inositol phosphate metabolism and MAPK signaling pathway. Our results show that the generation of CRC involves multiple genes, TFs and pathways. Several signal and immune pathways are linked to CRC and give us more clues in the process of CRC. Hence, our work would pave ways for novel diagnosis of CRC, and provided theoretical guidance into cancer therapy.

  8. Genome-wide coexpression dynamics: Theory and application

    PubMed Central

    Li, Ker-Chau

    2002-01-01

    High-throughput expression profiling enables the global study of gene activities. Genes with positively correlated expression profiles are likely to encode functionally related proteins. However, all biological processes are interlocked, and each protein may play multiple cellular roles. Thus the coexpression of any two functionally related genes may depend on the constantly varying, yet often-unknown cellular state. To initiate a systematic study on this issue, a theory of coexpression dynamics is presented. This theory is used to rationalize a strategy of conducting a genome-wide search for the most critical cellular players that may affect the coexpression pattern of any two genes. In one example, using a yeast data set, our method reveals how the enzymes associated with the urea cycle are expressed to ensure proper mass flow of the involved metabolites. The correlation between ARG2 and CAR2 is found to change from positive to negative as the expression level of CPA2 increases. This delicate interplay in correlation signifies a remarkable control on the influx and efflux of ornithine and reflects well the intrinsic cellular demand for arginine. In addition to the urea cycle, our examples include SCH9 and CYR1 (both implicated in a recent longevity study), cytochrome c1 (mitochondrial electron transport), calmodulin (main calcium-binding protein), PFK1 and PFK2 (glycolysis), and two genes, ECM1 and YNL101W, the functions of which are newly revealed. The complexity in computation is eased by a new result from mathematical statistics. PMID:12486219

  9. Muscle-specific gene expression is underscored by differential stressor responses and coexpression changes.

    PubMed

    Moreno-Sánchez, Natalia; Rueda, Julia; Reverter, Antonio; Carabaño, María Jesús; Díaz, Clara

    2012-03-01

    Variations on the transcriptome from one skeletal muscle type to another still remain unknown. The reliable identification of stable gene coexpression networks is essential to unravel gene functions and define biological processes. The differential expression of two distinct muscles, M. flexor digitorum (FD) and M. psoas major (PM), was studied using microarrays in cattle to illustrate muscle-specific transcription patterns and to quantify changes in connectivity regarding the expected gene coexpression pattern. A total of 206 genes were differentially expressed (DE), 94 upregulated in PM and 112 in FD. The distribution of DE genes in pathways and biological functions was explored in the context of system biology. Global interactomes for genes of interest were predicted. Fast/slow twitch genes, genes coding for extracellular matrix, ribosomal and heat shock proteins, and fatty acid uptake centred the specific gene expression patterns per muscle. Genes involved in repairing mechanisms, such as ribosomal and heat shock proteins, suggested a differential ability of muscles to react to similar stressing factors, acting preferentially in slow twitch muscles. Muscle attributes do not seem to be completely explained by the muscle fibre composition. Changes in connectivity accounted for 24% of significant correlations between DE genes. Genes changing their connectivity mostly seem to contribute to the main differential attributes that characterize each specific muscle type. These results underscore the unique flexibility of skeletal muscle where a substantial set of genes are able to change their behavior depending on the circumstances.

  10. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

    PubMed

    Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

    2009-02-01

    Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

  11. Differential DNA methylation marks and gene comethylation of COPD in African-Americans with COPD exacerbations.

    PubMed

    Busch, Robert; Qiu, Weiliang; Lasky-Su, Jessica; Morrow, Jarrett; Criner, Gerard; DeMeo, Dawn

    2016-11-05

    Chronic obstructive pulmonary disease (COPD) is the third-leading cause of death worldwide. Identifying COPD-associated DNA methylation marks in African-Americans may contribute to our understanding of racial disparities in COPD susceptibility. We determined differentially methylated genes and co-methylation network modules associated with COPD in African-Americans recruited during exacerbations of COPD and smoking controls from the Pennsylvania Study of Chronic Obstructive Pulmonary Exacerbations (PA-SCOPE) cohort. We assessed DNA methylation from whole blood samples in 362 African-American smokers in the PA-SCOPE cohort using the Illumina Infinium HumanMethylation27 BeadChip Array. Final analysis included 19302 CpG probes annotated to the nearest gene transcript after quality control. We tested methylation associations with COPD case-control status using mixed linear models. Weighted gene comethylation networks were constructed using weighted gene coexpression network analysis (WGCNA) and network modules were analyzed for association with COPD. There were five differentially methylated CpG probes significantly associated with COPD among African-Americans at an FDR less than 5 %, and seven additional probes that approached significance at an FDR less than 10 %. The top ranked gene association was MAML1, which has been shown to affect NOTCH-dependent angiogenesis in murine lung. Network modeling yielded the "yellow" and "blue" comethylation modules which were significantly associated with COPD (p-value 4 × 10 -10 and 4 × 10 -9 , respectively). The yellow module was enriched for gene sets related to inflammatory pathways known to be relevant to COPD. The blue module contained the top ranked genes in the concurrent differential methylation analysis (FXYD1/LGI4, gene significance p-value 1.2 × 10 -26 ; MAML1, p-value 2.0 × 10 -26 ; CD72, p-value 2.1 × 10 -25 ; and LPO, p-value 7.2 × 10 -25 ), and was significantly associated with lung development processes in Gene Ontology gene-set enrichment analysis. We identified 12 differentially methylated CpG sites associated with COPD that mapped to biologically plausible genes. Network module comethylation patterns have identified candidate genes that may be contributing to racial differences in COPD susceptibility and severity. COPD-associated comethylation modules contained genes previously associated with lung disease and inflammation and recapitulated known COPD-associated genes. The genes implicated by differential methylation and WGCNA analysis may provide mechanistic targets contributing to COPD susceptibility, exacerbations, and outcomes among African-Americans. Trial Registration: NCT00774176 , Registry: ClinicalTrials.gov, URL: www.clinicaltrials.gov , Date of Enrollment of First Participant: June 2004, Date Registered: 04 January 2008 (retrospectively registered).

  12. From single-cell to cell-pool transcriptomes: stochasticity in gene expression and RNA splicing.

    PubMed

    Marinov, Georgi K; Williams, Brian A; McCue, Ken; Schroth, Gary P; Gertz, Jason; Myers, Richard M; Wold, Barbara J

    2014-03-01

    Single-cell RNA-seq mammalian transcriptome studies are at an early stage in uncovering cell-to-cell variation in gene expression, transcript processing and editing, and regulatory module activity. Despite great progress recently, substantial challenges remain, including discriminating biological variation from technical noise. Here we apply the SMART-seq single-cell RNA-seq protocol to study the reference lymphoblastoid cell line GM12878. By using spike-in quantification standards, we estimate the absolute number of RNA molecules per cell for each gene and find significant variation in total mRNA content: between 50,000 and 300,000 transcripts per cell. We directly measure technical stochasticity by a pool/split design and find that there are significant differences in expression between individual cells, over and above technical variation. Specific gene coexpression modules were preferentially expressed in subsets of individual cells, including one enriched for mRNA processing and splicing factors. We assess cell-to-cell variation in alternative splicing and allelic bias and report evidence of significant differences in splice site usage that exceed splice variation in the pool/split comparison. Finally, we show that transcriptomes from small pools of 30-100 cells approach the information content and reproducibility of contemporary RNA-seq from large amounts of input material. Together, our results define an experimental and computational path forward for analyzing gene expression in rare cell types and cell states.

  13. Using bioinformatics and systems genetics to dissect HDL-cholesterol genetics in an MRL/MpJ × SM/J intercross[S

    PubMed Central

    Leduc, Magalie S.; Blair, Rachael Hageman; Verdugo, Ricardo A.; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A.; Paigen, Beverly

    2012-01-01

    A higher incidence of coronary artery disease is associated with a lower level of HDL-cholesterol. We searched for genetic loci influencing HDL-cholesterol in F2 mice from a cross between MRL/MpJ and SM/J mice. Quantitative trait loci (QTL) mapping revealed one significant HDL QTL (Apoa2 locus), four suggestive QTL on chromosomes 10, 11, 13, and 18 and four additional QTL on chromosomes 1 proximal, 3, 4, and 7 after adjusting HDL for the strong Apoa2 locus. A novel nonsynonymous polymorphism supports Lipg as the QTL gene for the chromosome 18 QTL, and a difference in Abca1 expression in liver tissue supports it as the QTL gene for the chromosome 4 QTL. Using weighted gene co-expression network analysis, we identified a module that after adjustment for Apoa2, correlated with HDL, was genetically determined by a QTL on chromosome 11, and overlapped with the HDL QTL. A combination of bioinformatics tools and systems genetics helped identify several candidate genes for both the chromosome 11 HDL and module QTL based on differential expression between the parental strains, cis regulation of expression, and causality modeling. We conclude that integrating systems genetics to a more-traditional genetics approach improves the power of complex trait gene identification. PMID:22498810

  14. Genome-wide screen identifies a novel prognostic signature for breast cancer survival

    DOE PAGES

    Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey; ...

    2017-01-21

    Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less

  15. Genome-wide screen identifies a novel prognostic signature for breast cancer survival

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey

    Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less

  16. A novel strategy of integrated microarray analysis identifies CENPA, CDK1 and CDC20 as a cluster of diagnostic biomarkers in lung adenocarcinoma.

    PubMed

    Liu, Wan-Ting; Wang, Yang; Zhang, Jing; Ye, Fei; Huang, Xiao-Hui; Li, Bin; He, Qing-Yu

    2018-07-01

    Lung adenocarcinoma (LAC) is the most lethal cancer and the leading cause of cancer-related death worldwide. The identification of meaningful clusters of co-expressed genes or representative biomarkers may help improve the accuracy of LAC diagnoses. Public databases, such as the Gene Expression Omnibus (GEO), provide rich resources of valuable information for clinics, however, the integration of multiple microarray datasets from various platforms and institutes remained a challenge. To determine potential indicators of LAC, we performed genome-wide relative significance (GWRS), genome-wide global significance (GWGS) and support vector machine (SVM) analyses progressively to identify robust gene biomarker signatures from 5 different microarray datasets that included 330 samples. The top 200 genes with robust signatures were selected for integrative analysis according to "guilt-by-association" methods, including protein-protein interaction (PPI) analysis and gene co-expression analysis. Of these 200 genes, only 10 genes showed both intensive PPI network and high gene co-expression correlation (r > 0.8). IPA analysis of this regulatory networks suggested that the cell cycle process is a crucial determinant of LAC. CENPA, as well as two linked hub genes CDK1 and CDC20, are determined to be potential indicators of LAC. Immunohistochemical staining showed that CENPA, CDK1 and CDC20 were highly expressed in LAC cancer tissue with co-expression patterns. A Cox regression model indicated that LAC patients with CENPA + /CDK1 + and CENPA + /CDC20 + were high-risk groups in terms of overall survival. In conclusion, our integrated microarray analysis demonstrated that CENPA, CDK1 and CDC20 might serve as novel cluster of prognostic biomarkers for LAC, and the cooperative unit of three genes provides a technically simple approach for identification of LAC patients. Copyright © 2018 Elsevier B.V. All rights reserved.

  17. UNCLES: method for the identification of genes differentially consistently co-expressed in a specific subset of datasets.

    PubMed

    Abu-Jamous, Basel; Fa, Rui; Roberts, David J; Nandi, Asoke K

    2015-06-04

    Collective analysis of the increasingly emerging gene expression datasets are required. The recently proposed binarisation of consensus partition matrices (Bi-CoPaM) method can combine clustering results from multiple datasets to identify the subsets of genes which are consistently co-expressed in all of the provided datasets in a tuneable manner. However, results validation and parameter setting are issues that complicate the design of such methods. Moreover, although it is a common practice to test methods by application to synthetic datasets, the mathematical models used to synthesise such datasets are usually based on approximations which may not always be sufficiently representative of real datasets. Here, we propose an unsupervised method for the unification of clustering results from multiple datasets using external specifications (UNCLES). This method has the ability to identify the subsets of genes consistently co-expressed in a subset of datasets while being poorly co-expressed in another subset of datasets, and to identify the subsets of genes consistently co-expressed in all given datasets. We also propose the M-N scatter plots validation technique and adopt it to set the parameters of UNCLES, such as the number of clusters, automatically. Additionally, we propose an approach for the synthesis of gene expression datasets using real data profiles in a way which combines the ground-truth-knowledge of synthetic data and the realistic expression values of real data, and therefore overcomes the problem of faithfulness of synthetic expression data modelling. By application to those datasets, we validate UNCLES while comparing it with other conventional clustering methods, and of particular relevance, biclustering methods. We further validate UNCLES by application to a set of 14 real genome-wide yeast datasets as it produces focused clusters that conform well to known biological facts. Furthermore, in-silico-based hypotheses regarding the function of a few previously unknown genes in those focused clusters are drawn. The UNCLES method, the M-N scatter plots technique, and the expression data synthesis approach will have wide application for the comprehensive analysis of genomic and other sources of multiple complex biological datasets. Moreover, the derived in-silico-based biological hypotheses represent subjects for future functional studies.

  18. SiBIC: a web server for generating gene set networks based on biclusters obtained by maximal frequent itemset mining.

    PubMed

    Takahashi, Kei-ichiro; Takigawa, Ichigaku; Mamitsuka, Hiroshi

    2013-01-01

    Detecting biclusters from expression data is useful, since biclusters are coexpressed genes under only part of all given experimental conditions. We present a software called SiBIC, which from a given expression dataset, first exhaustively enumerates biclusters, which are then merged into rather independent biclusters, which finally are used to generate gene set networks, in which a gene set assigned to one node has coexpressed genes. We evaluated each step of this procedure: 1) significance of the generated biclusters biologically and statistically, 2) biological quality of merged biclusters, and 3) biological significance of gene set networks. We emphasize that gene set networks, in which nodes are not genes but gene sets, can be more compact than usual gene networks, meaning that gene set networks are more comprehensible. SiBIC is available at http://utrecht.kuicr.kyoto-u.ac.jp:8080/miami/faces/index.jsp.

  19. Definition of a core module for the nuclear retrograde response to altered organellar gene expression identifies GLK overexpressors as gun mutants.

    PubMed

    Leister, Dario; Kleine, Tatjana

    2016-07-01

    Retrograde signaling can be triggered by changes in organellar gene expression (OGE) induced by inhibitors such as lincomycin (LIN) or mutations that perturb OGE. Thus, an insufficiency of the organelle-targeted prolyl-tRNA synthetase PRORS1 in Arabidopsis thaliana activates retrograde signaling and reduces the expression of nuclear genes for photosynthetic proteins. Recently, we showed that mTERF6, a member of the so-called mitochondrial transcription termination factor (mTERF) family, is involved in the formation of chloroplast (cp) isoleucine-tRNA. To obtain further insights into its functions, co-expression analysis of MTERF6, PRORS1 and two other genes for organellar aminoacyl-tRNA synthetases was conducted. The results suggest a prominent role of mTERF6 in aminoacylation activity, light signaling and seed storage. Analysis of changes in whole-genome transcriptomes in the mterf6-1 mutant showed that levels of nuclear transcripts for cp OGE proteins were particularly affected. Comparison of the mterf6-1 transcriptome with that of prors1-2 showed that reduced aminoacylation of proline (prors1-2) and isoleucine (mterf6-1) tRNAs alters retrograde signaling in similar ways. Database analyses indicate that comparable gene expression changes are provoked by treatment with LIN, norflurazon or high light. A core OGE response module was defined by identifying genes that were differentially expressed under at least four of six conditions relevant to OGE signaling. Based on this module, overexpressors of the Golden2-like transcription factors GLK1 and GLK2 were identified as genomes uncoupled mutants. © 2016 Scandinavian Plant Physiology Society.

  20. A human functional protein interaction network and its application to cancer data analysis

    PubMed Central

    2010-01-01

    Background One challenge facing biologists is to tease out useful information from massive data sets for further analysis. A pathway-based analysis may shed light by projecting candidate genes onto protein functional relationship networks. We are building such a pathway-based analysis system. Results We have constructed a protein functional interaction network by extending curated pathways with non-curated sources of information, including protein-protein interactions, gene coexpression, protein domain interaction, Gene Ontology (GO) annotations and text-mined protein interactions, which cover close to 50% of the human proteome. By applying this network to two glioblastoma multiforme (GBM) data sets and projecting cancer candidate genes onto the network, we found that the majority of GBM candidate genes form a cluster and are closer than expected by chance, and the majority of GBM samples have sequence-altered genes in two network modules, one mainly comprising genes whose products are localized in the cytoplasm and plasma membrane, and another comprising gene products in the nucleus. Both modules are highly enriched in known oncogenes, tumor suppressors and genes involved in signal transduction. Similar network patterns were also found in breast, colorectal and pancreatic cancers. Conclusions We have built a highly reliable functional interaction network upon expert-curated pathways and applied this network to the analysis of two genome-wide GBM and several other cancer data sets. The network patterns revealed from our results suggest common mechanisms in the cancer biology. Our system should provide a foundation for a network or pathway-based analysis platform for cancer and other diseases. PMID:20482850

  1. Genes located in a chromosomal inversion are correlated with territorial song in white-throated sparrows.

    PubMed

    Zinzow-Kramer, W M; Horton, B M; McKee, C D; Michaud, J M; Tharp, G K; Thomas, J W; Tuttle, E M; Yi, S; Maney, D L

    2015-11-01

    The genome of the white-throated sparrow (Zonotrichia albicollis) contains an inversion polymorphism on chromosome 2 that is linked to predictable variation in a suite of phenotypic traits including plumage color, aggression and parental behavior. Differences in gene expression between the two color morphs, which represent the two common inversion genotypes (ZAL2/ZAL2 and ZAL2/ZAL2(m) ), may therefore advance our understanding of the molecular underpinnings of these phenotypes. To identify genes that are differentially expressed between the two morphs and correlated with behavior, we quantified gene expression and terrirorial aggression, including song, in a population of free-living white-throated sparrows. We analyzed gene expression in two brain regions, the medial amygdala (MeA) and hypothalamus. Both regions are part of a 'social behavior network', which is rich in steroid hormone receptors and previously linked with territorial behavior. Using weighted gene co-expression network analyses, we identified modules of genes that were correlated with both morph and singing behavior. The majority of these genes were located within the inversion, showing the profound effect of the inversion on the expression of genes captured by the rearrangement. These modules were enriched with genes related to retinoic acid signaling and basic cellular functioning. In the MeA, the most prominent pathways were those related to steroid hormone receptor activity. Within these pathways, the only gene encoding such a receptor was ESR1 (estrogen receptor 1), a gene previously shown to predict song rate in this species. The set of candidate genes we identified may mediate the effects of a chromosomal inversion on territorial behavior. © 2015 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.

  2. Elementary screening of lymph node metastatic-related genes in gastric cancer based on the co-expression network of messenger RNA, microRNA and long non-coding RNA.

    PubMed

    Song, Zhonghua; Zhao, Wenhua; Cao, Danfeng; Zhang, Jinqing; Chen, Shouhua

    2018-01-01

    Gastric cancer (GC) is the fifth most common cancer and the third leading cause of cancer-related deaths worldwide. The high mortality might be attributed to delay in detection and is closely related to lymph node metastasis. Therefore, it is of great importance to explore the mechanism of lymph node metastasis and find strategies to block GC metastasis. Messenger RNA (mRNA), microRNA (miRNA) and long non-coding RNA (lncRNA) expression data and clinical data were downloaded from The Cancer Genome Atlas (TCGA) database. A total of 908 differentially expressed factors with variance >0.5 including 542 genes, 42 miRNA, and 324 lncRNA were screened using significant analysis microarray algorithm, and interaction networks were constructed using these differentially expressed factors. Furthermore, we conducted functional modules analysis in the network, and found that yellow and turquoise modules could separate samples efficiently. The groups classified in the yellow and turquoise modules had a significant difference in survival time, which was verified in another independent GC mRNA dataset (GSE62254). The results suggested that differentially expressed factors in the yellow and turquoise modules may participate in lymph node metastasis of GC and could be applied as potential biomarkers or therapeutic targets for GC.

  3. Elementary screening of lymph node metastatic-related genes in gastric cancer based on the co-expression network of messenger RNA, microRNA and long non-coding RNA

    PubMed Central

    Song, Zhonghua; Zhao, Wenhua; Cao, Danfeng; Zhang, Jinqing; Chen, Shouhua

    2018-01-01

    Gastric cancer (GC) is the fifth most common cancer and the third leading cause of cancer-related deaths worldwide. The high mortality might be attributed to delay in detection and is closely related to lymph node metastasis. Therefore, it is of great importance to explore the mechanism of lymph node metastasis and find strategies to block GC metastasis. Messenger RNA (mRNA), microRNA (miRNA) and long non-coding RNA (lncRNA) expression data and clinical data were downloaded from The Cancer Genome Atlas (TCGA) database. A total of 908 differentially expressed factors with variance >0.5 including 542 genes, 42 miRNA, and 324 lncRNA were screened using significant analysis microarray algorithm, and interaction networks were constructed using these differentially expressed factors. Furthermore, we conducted functional modules analysis in the network, and found that yellow and turquoise modules could separate samples efficiently. The groups classified in the yellow and turquoise modules had a significant difference in survival time, which was verified in another independent GC mRNA dataset (GSE62254). The results suggested that differentially expressed factors in the yellow and turquoise modules may participate in lymph node metastasis of GC and could be applied as potential biomarkers or therapeutic targets for GC. PMID:29489999

  4. Sheep skeletal muscle transcriptome analysis reveals muscle growth regulatory lncRNAs.

    PubMed

    Chao, Tianle; Ji, Zhibin; Hou, Lei; Wang, Jin; Zhang, Chunlan; Wang, Guizhi; Wang, Jianmin

    2018-01-01

    As widely distributed domestic animals, sheep are an important species and the source of mutton. In this study, we aimed to evaluate the regulatory lncRNAs associated with muscle growth and development between high production mutton sheep (Dorper sheep and Qianhua Mutton Merino sheep) and low production mutton sheep (Small-tailed Han sheep). In total, 39 lncRNAs were found to be differentially expressed. Using co-expression analysis and functional annotation, 1,206 co-expression interactions were found between 32 lncRNAs and 369 genes, and 29 of these lncRNAs were found to be associated with muscle development, metabolism, cell proliferation and apoptosis. lncRNA-mRNA interactions revealed 6 lncRNAs as hub lncRNAs. Moreover, three lncRNAs and their associated co-expressed genes were demonstrated by cis-regulatory gene analyses, and we also found a potential regulatory relationship between the pseudogene lncRNA LOC101121401 and its parent gene FTH1. This study provides a genome-wide resolution of lncRNA and mRNA regulation in muscles from mutton sheep.

  5. Sheep skeletal muscle transcriptome analysis reveals muscle growth regulatory lncRNAs

    PubMed Central

    Chao, Tianle; Ji, Zhibin; Hou, Lei; Wang, Jin; Zhang, Chunlan

    2018-01-01

    As widely distributed domestic animals, sheep are an important species and the source of mutton. In this study, we aimed to evaluate the regulatory lncRNAs associated with muscle growth and development between high production mutton sheep (Dorper sheep and Qianhua Mutton Merino sheep) and low production mutton sheep (Small-tailed Han sheep). In total, 39 lncRNAs were found to be differentially expressed. Using co-expression analysis and functional annotation, 1,206 co-expression interactions were found between 32 lncRNAs and 369 genes, and 29 of these lncRNAs were found to be associated with muscle development, metabolism, cell proliferation and apoptosis. lncRNA–mRNA interactions revealed 6 lncRNAs as hub lncRNAs. Moreover, three lncRNAs and their associated co-expressed genes were demonstrated by cis-regulatory gene analyses, and we also found a potential regulatory relationship between the pseudogene lncRNA LOC101121401 and its parent gene FTH1. This study provides a genome-wide resolution of lncRNA and mRNA regulation in muscles from mutton sheep. PMID:29666768

  6. The Transcriptome of the Reference Potato Genome Solanum tuberosum Group Phureja Clone DM1-3 516R44

    PubMed Central

    Massa, Alicia N.; Childs, Kevin L.; Lin, Haining; Bryan, Glenn J.; Giuliano, Giovanni; Buell, C. Robin

    2011-01-01

    Advances in molecular breeding in potato have been limited by its complex biological system, which includes vegetative propagation, autotetraploidy, and extreme heterozygosity. The availability of the potato genome and accompanying gene complement with corresponding gene structure, location, and functional annotation are powerful resources for understanding this complex plant and advancing molecular breeding efforts. Here, we report a reference for the potato transcriptome using 32 tissues and growth conditions from the doubled monoploid Solanum tuberosum Group Phureja clone DM1-3 516R44 for which a genome sequence is available. Analysis of greater than 550 million RNA-Seq reads permitted the detection and quantification of expression levels of over 22,000 genes. Hierarchical clustering and principal component analyses captured the biological variability that accounts for gene expression differences among tissues suggesting tissue-specific gene expression, and genes with tissue or condition restricted expression. Using gene co-expression network analysis, we identified 18 gene modules that represent tissue-specific transcriptional networks of major potato organs and developmental stages. This information provides a powerful resource for potato research as well as studies on other members of the Solanaceae family. PMID:22046362

  7. [Involvement of hydrogen peroxide in the regulation of coexpression of alternative oxidase and rotenone-insensitive NADH dehydrogenase in tomato leaves and calluses].

    PubMed

    Eprintsev, A T; Mal'tseva, E V; Shatskikh, A S; Popov, V N

    2011-01-01

    The involvement of active oxygen forms in the regulation of the expression of mitochondrial respiratory chain components, which are not related to energy storing, has been in vitro and in vivo studied in Lycopersicum esculentum L. The highest level of transcription of genes encoding alternative oxidase and NADH dehydrogenase has been observed in green tomato leaves. It has been shown that even low H2O2 concentrations activate both aoxlalpha and ndb1 genes, encoding alternative oxidase and external mitochondrial rotenone-insensitive NADH dehydrogenase, respectively. According to our results, in the case of an oxidative stress, alternative oxidase and NADH dehydrogenase are coexpressed in tomato plant tissues, and active oxygen forms serve as the secondary messengers of their coexpression.

  8. Neurobiological Signatures of Alcohol Dependence Revealed by Protein Profiling

    PubMed Central

    Gorini, Giorgio; Roberts, Amanda J.; Mayfield, R. Dayne

    2013-01-01

    Alcohol abuse causes dramatic neuroadaptations in the brain, which contribute to tolerance, dependence, and behavioral modifications. Previous proteomic studies in human alcoholics and animal models have identified candidate alcoholism-related proteins. However, recent evidences suggest that alcohol dependence is caused by changes in co-regulation that are invisible to single protein-based analysis. Here, we analyze global proteomics data to integrate differential expression, co-expression networks, and gene annotations to unveil key neurobiological rearrangements associated with the transition to alcohol dependence modeled by a Chronic Intermittent Ethanol (CIE), two-bottle choice (2BC) paradigm. We analyzed cerebral cortices (CTX) and midbrains (MB) from male C57BL/6J mice subjected to a CIE, 2BC paradigm, which induces heavy drinking and represents one of the best available animal models for alcohol dependence and relapse drinking. CIE induced significant changes in protein levels in dependent mice compared with their non-dependent controls. Multiple protein isoforms showed region-specific differential regulation as a result of post-translational modifications. Our integrative analysis identified modules of co-expressed proteins that were highly correlated with CIE treatment. We found that modules most related to the effects of CIE treatment coordinate molecular imbalances in endocytic- and energy-related pathways, with specific proteins involved, such as dynamin-1. The qRT-PCR experiments validated both differential and co-expression analyses, and the correspondence among our data and previous genomic and proteomic studies in humans and rodents substantiates our findings. The changes identified above may play a key role in the escalation of ethanol consumption associated with dependence. Our approach to alcohol addiction will advance knowledge of brain remodeling mechanisms and adaptive changes in response to drug abuse, contribute to understanding of organizational principles of CTX and MB proteomes, and define potential new molecular targets for treating alcohol addiction. The integrative analysis employed here highlight the advantages of systems approaches in studying the neurobiology of alcohol addiction. PMID:24358215

  9. Using co-expression analysis and stress-based screens to uncover Arabidopsis peroxisomal proteins involved in drought response

    DOE PAGES

    Li, Jiying; Hu, Jianping; Bassham, Diane

    2015-09-14

    Peroxisomes are essential organelles that house a wide array of metabolic reactions important for plant growth and development. However, our knowledge regarding the role of peroxisomal proteins in various biological processes, including plant stress response, is still incomplete. Recent proteomic studies of plant peroxisomes significantly increased the number of known peroxisomal proteins and greatly facilitated the study of peroxisomes at the systems level. The objectives of this study were to determine whether genes that encode peroxisomal proteins with related functions are co-expressed in Arabidopsis and identify peroxisomal proteins involved in stress response using in silico analysis and mutant screens. Usingmore » microarray data from online databases, we performed hierarchical clustering analysis to generate a comprehensive view of transcript level changes for Arabidopsis peroxisomal genes during development and under abiotic and biotic stress conditions. Many genes involved in the same metabolic pathways exhibited co-expression, some genes known to be involved in stress response are regulated by the corresponding stress conditions, and function of some peroxisomal proteins could be predicted based on their coexpression pattern. Since drought caused expression changes to the highest number of genes that encode peroxisomal proteins, we subjected a subset of Arabidopsis peroxisomal mutants to a drought stress assay. Mutants of the LON2 protease and the photorespiratory enzyme hydroxypyruvate reductase 1 (HPR1) showed enhanced susceptibility to drought, suggesting the involvement of peroxisomal quality control and photorespiration in drought resistance. Lastly, our study provided a global view of how genes that encode peroxisomal proteins respond to developmental and environmental cues and began to reveal additional peroxisomal proteins involved in stress response, thus opening up new avenues to investigate the role of peroxisomes in plant adaptation to environmental stresses.« less

  10. Differential co-expression analysis of rheumatoid arthritis with microarray data.

    PubMed

    Wang, Kunpeng; Zhao, Liqiang; Liu, Xuefeng; Hao, Zhenyong; Zhou, Yong; Yang, Chuandong; Li, Hongqiang

    2014-11-01

    The aim of the present study was to investigate the underlying molecular mechanisms of rheumatoid arthritis (RA) using microarray expression profiles from osteoarthritis and RA patients, to improve diagnosis and treatment strategies for the condition. The gene expression profile of GSE27390 was downloaded from Gene Expression Omnibus, including 19 samples from patients with RA (n=9) or osteoarthritis (n=10). Firstly, the differentially expressed genes (DEGs) were obtained with the thresholds of |logFC|>1.0 and P<0.05, using the t‑test method in LIMMA package. Then, differentially co-expressed genes (DCGs) and differentially co-expressed links (DCLs) were screened with q<0.25 by the differential coexpression analysis and differential regulation analysis of gene expression microarray data package. Secondly, pathway enrichment analysis for DCGs was performed by the Database for Annotation, Visualization and Integrated Discovery and the DCLs associated with RA were selected by comparing the obtained DCLs with known transcription factor (TF)-targets in the TRANSFAC database. Finally, the obtained TFs were mapped to the known TF-targets to construct the network using cytoscape software. A total of 1755 DEGs, 457 DCGs and 101988 DCLs were achieved and there were 20 TFs in the obtained six TF-target relations (STAT3-TNF, PBX1‑PLAU, SOCS3-STAT3, GATA1-ETS2, ETS1-ICAM4 and CEBPE‑GATA1) and 457 DCGs. A number of TF-target relations in the constructed network were not within DCLs when the TF and target gene were DCGs. The identified TFs may have an important role in the pathogenesis of RA and have the potential to be used as biomarkers for the development of novel diagnostic and therapeutic strategies for RA.

  11. A Natural Chimeric Pseudomonas Bacteriocin with Novel Pore-Forming Activity Parasitizes the Ferrichrome Transporter

    PubMed Central

    Kemland, Lieselore; Anoz-Carbonell, Ernesto; Buchanan, Susan K.; De Mot, René

    2017-01-01

    ABSTRACT Modular bacteriocins represent a major group of secreted protein toxins with a narrow spectrum of activity, involved in interference competition between Gram-negative bacteria. These antibacterial proteins include a domain for binding to the target cell and a toxin module at the carboxy terminus. Self-inhibition of producers is provided by coexpression of linked immunity genes that transiently inhibit the toxin’s activity through formation of bacteriocin-immunity complexes or by insertion in the inner membrane, depending on the type of toxin module. We demonstrate strain-specific inhibitory activity for PmnH, a Pseudomonas bacteriocin with an unprecedented dual-toxin architecture, hosting both a colicin M domain, potentially interfering with peptidoglycan synthesis, and a novel colicin N-type domain, a pore-forming module distinct from the colicin Ia-type domain in Pseudomonas aeruginosa pyocin S5. A downstream-linked gene product confers PmnH immunity upon susceptible strains. This protein, ImnH, has a transmembrane topology similar to that of Pseudomonas colicin M-like and pore-forming immunity proteins, although homology with either of these is essentially absent. The enhanced killing activity of PmnH under iron-limited growth conditions reflects parasitism of the ferrichrome-type transporter for entry into target cells, a strategy shown here to be used as well by monodomain colicin M-like bacteriocins from pseudomonads. The integration of a second type of toxin module in a bacteriocin gene could offer a competitive advantage against bacteria displaying immunity against only one of both toxic activities. PMID:28223456

  12. Enhanced in vivo selection of bone marrow cells by retroviral-mediated coexpression of mutant O6-methylguanine-DNA-methyltransferase and HOXB4.

    PubMed

    Milsom, Michael D; Woolford, Lorna B; Margison, Geoffrey P; Humphries, R Keith; Fairbairn, Leslie J

    2004-11-01

    To attain therapeutic levels of gene-modified hematopoietic stem cells, it may be necessary in the majority of disorders to provide an in vivo selective advantage that facilitates the expansion of their numbers. A popular strategy to achieve in vivo selection has been to employ drug selection while coexpressing a transgene that conveys chemoresistance, such as O6-methylguanine-DNA-methyltransferase (MGMT). An alternate approach is to confer an enhanced proliferative potential upon gene-modified hematopoietic stem cells through the delivery of the homeobox transcription factor HOXB4. By developing a novel tricistronic retroviral vector, we have facilitated the simultaneous coexpression of a mutant version of MGMT and HOXB4 in retrovirally transduced bone marrow. Using an in vivo competitive repopulation assay, we demonstrate that primary bone marrow cells containing this construct show enhanced reconstitution following transplant and improved selection subsequent to chemotherapeutic challenge in comparison to cells expressing either HOXB4 or MGMT alone. This selection advantage was evident even when HOXB4/MGMT-coexpressing cells were infused along with a large excess of unmodified cells. We propose that this selection cassette may facilitate the in vivo expansion of gene-modified hematopoietic stem cells at a level in excess of previous strategies.

  13. A resource for characterizing genome-wide binding and putative target genes of transcription factors expressed during secondary growth and wood formation in Populus.

    PubMed

    Liu, Lijun; Ramsay, Trevor; Zinkgraf, Matthew; Sundell, David; Street, Nathaniel Robert; Filkov, Vladimir; Groover, Andrew

    2015-06-01

    Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors expressed during secondary growth and wood formation. Software code (programs and scripts) for processing the Populus ChIP-seq data are provided within a publically available iPlant image, including tools for ChIP-seq data quality control and evaluation adapted from the human Encyclopedia of DNA Elements (ENCODE) project. Basic information for each transcription factor (including members of Class I KNOX, Class III HD ZIP, BEL1-like families) binding are summarized, including the number and location of binding regions, distribution of binding regions relative to gene features, associated putative target genes, and enriched functional categories of putative target genes. These ChIP-seq data have been integrated within the Populus Genome Integrative Explorer (PopGenIE) where they can be analyzed using a variety of web-based tools. We present an example analysis that shows preferential binding of transcription factor ARBORKNOX1 to the nearest neighbor genes in a pre-calculated co-expression network module, and enrichment for meristem-related genes within this module including multiple orthologs of Arabidopsis KNOTTED-like Arabidopsis 2/6. © 2015 Society for Experimental Biology and John Wiley & Sons Ltd This article has been contributed to by US Government employees and their work is in the public domain in the USA.

  14. DiRE: identifying distant regulatory elements of co-expressed genes

    PubMed Central

    Gotea, Valer; Ovcharenko, Ivan

    2008-01-01

    Regulation of gene expression in eukaryotic genomes is established through a complex cooperative activity of proximal promoters and distant regulatory elements (REs) such as enhancers, repressors and silencers. We have developed a web server named DiRE, based on the Enhancer Identification (EI) method, for predicting distant regulatory elements in higher eukaryotic genomes, namely for determining their chromosomal location and functional characteristics. The server uses gene co-expression data, comparative genomics and profiles of transcription factor binding sites (TFBSs) to determine TFBS-association signatures that can be used for discriminating specific regulatory functions. DiRE's unique feature is its ability to detect REs outside of proximal promoter regions, as it takes advantage of the full gene locus to conduct the search. DiRE can predict common REs for any set of input genes for which the user has prior knowledge of co-expression, co-function or other biologically meaningful grouping. The server predicts function-specific REs consisting of clusters of specifically-associated TFBSs and it also scores the association of individual transcription factors (TFs) with the biological function shared by the group of input genes. Its integration with the Array2BIO server allows users to start their analysis with raw microarray expression data. The DiRE web server is freely available at http://dire.dcode.org. PMID:18487623

  15. Transcriptional dynamics of a conserved gene expression network associated with craniofacial divergence in Arctic charr.

    PubMed

    Ahi, Ehsan Pashay; Kapralova, Kalina Hristova; Pálsson, Arnar; Maier, Valerie Helene; Gudbrandsson, Jóhannes; Snorrason, Sigurdur S; Jónsson, Zophonías O; Franzdóttir, Sigrídur Rut

    2014-01-01

    Understanding the molecular basis of craniofacial variation can provide insights into key developmental mechanisms of adaptive changes and their role in trophic divergence and speciation. Arctic charr (Salvelinus alpinus) is a polymorphic fish species, and, in Lake Thingvallavatn in Iceland, four sympatric morphs have evolved distinct craniofacial structures. We conducted a gene expression study on candidates from a conserved gene coexpression network, focusing on the development of craniofacial elements in embryos of two contrasting Arctic charr morphotypes (benthic and limnetic). Four Arctic charr morphs were studied: one limnetic and two benthic morphs from Lake Thingvallavatn and a limnetic reference aquaculture morph. The presence of morphological differences at developmental stages before the onset of feeding was verified by morphometric analysis. Following up on our previous findings that Mmp2 and Sparc were differentially expressed between morphotypes, we identified a network of genes with conserved coexpression across diverse vertebrate species. A comparative expression study of candidates from this network in developing heads of the four Arctic charr morphs verified the coexpression relationship of these genes and revealed distinct transcriptional dynamics strongly correlated with contrasting craniofacial morphologies (benthic versus limnetic). A literature review and Gene Ontology analysis indicated that a significant proportion of the network genes play a role in extracellular matrix organization and skeletogenesis, and motif enrichment analysis of conserved noncoding regions of network candidates predicted a handful of transcription factors, including Ap1 and Ets2, as potential regulators of the gene network. The expression of Ets2 itself was also found to associate with network gene expression. Genes linked to glucocorticoid signalling were also studied, as both Mmp2 and Sparc are responsive to this pathway. Among those, several transcriptional targets and upstream regulators showed differential expression between the contrasting morphotypes. Interestingly, although selected network genes showed overlapping expression patterns in situ and no morph differences, Timp2 expression patterns differed between morphs. Our comparative study of transcriptional dynamics in divergent craniofacial morphologies of Arctic charr revealed a conserved network of coexpressed genes sharing functional roles in structural morphogenesis. We also implicate transcriptional regulators of the network as targets for future functional studies.

  16. Recombinant fowlpox viruses coexpressing chicken type I IFN and Newcastle disease virus HN and F genes: influence of IFN on protective efficacy and humoral responses of chickens following in ovo or post-hatch administration of recombinant viruses.

    PubMed

    Karaca, K; Sharma, J M; Winslow, B J; Junker, D E; Reddy, S; Cochran, M; McMillen, J

    1998-10-01

    We have constructed recombinant (r) fowl pox viruses (FPVs) coexpressing chicken type I interferon (IFN) and/or hemagglutinin-neuraminidase (HN) and fusion (F) proteins of Newcastle disease virus (NDV). We administered rFPVs and FPV into embryonated chicken eggs at 17 days of embryonation or in chickens after hatch. Administration of FPV or rFPVs did not influence hatchability and survival of hatched chicks. In ovo or after hatch vaccination of chickens with the recombinant viruses resulted in protection against challenge with virulent FPV and NDV. Chickens vaccinated with FPV or FPV-NDV recombinant had significantly lower body weight 2 weeks following vaccination. This loss in body weight was not detected in chickens receiving FPV-IFN and FPV-NDV-IFN recombinants. Chickens vaccinated with FPV coexpressing IFN and NDV genes produced less antibodies against NDV in comparison with chickens vaccinated with FPV expressing NDV genes.

  17. Systems genetics: a paradigm to improve discovery of candidate genes and mechanisms underlying complex traits.

    PubMed

    Feltus, F Alex

    2014-06-01

    Understanding the control of any trait optimally requires the detection of causal genes, gene interaction, and mechanism of action to discover and model the biochemical pathways underlying the expressed phenotype. Functional genomics techniques, including RNA expression profiling via microarray and high-throughput DNA sequencing, allow for the precise genome localization of biological information. Powerful genetic approaches, including quantitative trait locus (QTL) and genome-wide association study mapping, link phenotype with genome positions, yet genetics is less precise in localizing the relevant mechanistic information encoded in DNA. The coupling of salient functional genomic signals with genetically mapped positions is an appealing approach to discover meaningful gene-phenotype relationships. Techniques used to define this genetic-genomic convergence comprise the field of systems genetics. This short review will address an application of systems genetics where RNA profiles are associated with genetically mapped genome positions of individual genes (eQTL mapping) or as gene sets (co-expression network modules). Both approaches can be applied for knowledge independent selection of candidate genes (and possible control mechanisms) underlying complex traits where multiple, likely unlinked, genomic regions might control specific complex traits. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  18. A Canonical Correlation Analysis of AIDS Restriction Genes and Metabolic Pathways Identifies Purine Metabolism as a Key Cooperator.

    PubMed

    Ye, Hanhui; Yuan, Jinjin; Wang, Zhengwu; Huang, Aiqiong; Liu, Xiaolong; Han, Xiao; Chen, Yahong

    2016-01-01

    Human immunodeficiency virus causes a severe disease in humans, referred to as immune deficiency syndrome. Studies on the interaction between host genetic factors and the virus have revealed dozens of genes that impact diverse processes in the AIDS disease. To resolve more genetic factors related to AIDS, a canonical correlation analysis was used to determine the correlation between AIDS restriction and metabolic pathway gene expression. The results show that HIV-1 postentry cellular viral cofactors from AIDS restriction genes are coexpressed in human transcriptome microarray datasets. Further, the purine metabolism pathway comprises novel host factors that are coexpressed with AIDS restriction genes. Using a canonical correlation analysis for expression is a reliable approach to exploring the mechanism underlying AIDS.

  19. A translational systems biology approach in both animals and humans identifies a functionally related module of accumbal genes involved in the regulation of reward processing and binge drinking in males

    PubMed Central

    Stacey, David; Lourdusamy, Anbarasu; Ruggeri, Barbara; Maroteaux, Matthieu; Jia, Tianye; Cattrell, Anna; Nymberg, Charlotte; Banaschewski, Tobias; Bhattacharyya, Sohinee; Band, Hamid; Barker, Gareth; Bokde, Arun; Buchel, Christian; Carvalho, Fabiana; Conrod, Patricia; Desrivieres, Sylvane; Easton, Alanna; Fauth-Buehler, Mira; Fernandez-Medarde, Alberto; Flor, Herta; Frouin, Vincent; Gallinat, Jurgen; Garavanh, Hugh; Heinz, Andreas; Ittermann, Bernd; Lathrop, Mark; Lawrence, Claire; Loth, Eva; Mann, Karl; Martinot, Jean-Luc; Nees, Frauke; Paus, Tomas; Pausova, Zdenka; Rietschel, Marcella; Rotter, Andrea; Santos, Eugenio; Smolka, Michael; Sommer, Wolfgang; Mameli, Manuel; Spanagel, Rainer; Girault, Jean-Antoine; Mueller, Christian; Schumann, Gunter

    2016-01-01

    Background The mesolimbic dopamine system, composed primarily of dopaminergic neurons in the ventral tegmental area that project to striatal structures, is considered to be the key mediator of reinforcement-related mechanisms in the brain. Prompted by a genome-wide association meta-analysis implicating the Ras-specific guanine nucleotide-releasing factor 2 (RASGRF2) gene in the regulation of alcohol intake in men, we have recently shown that male Rasgrf2−/− mice exhibit reduced ethanol intake and preference accompanied by a perturbed mesolimbic dopamine system. We therefore propose that these mice represent a valid model to further elucidate the precise genes and mechanisms regulating mesolimbic dopamine functioning. Methods Transcriptomic data from the nucleus accumbens (NAcc) of male Rasgrf2−/− mice and wild-type controls were analyzed by weighted gene coexpression network analysis (WGCNA). We performed follow-up genetic association tests in humans using a sample of male adolescents from the IMAGEN study characterized for binge drinking (n = 905) and ventral striatal activation during an fMRI reward task (n = 608). Results The WGCNA analyses using accumbal transcriptomic data revealed 37 distinct “modules,” or functionally related groups of genes. Two of these modules were significantly associated with Rasgrf2 knockout status: M5 (p < 0.001) and M6 (p < 0.001). In follow-up translational analyses we found that human orthologues for the M5 module were significantly (p < 0.01) enriched with genetic association signals for binge drinking in male adolescents. Furthermore, the most significant locus, originating from the EH-domain containing 4 (EHD4) gene (p < 0.001), was also significantly associated with altered ventral striatal activity in male adolescents performing an fMRI reward task (pempirical < 0.001). Limitations It was not possible to determine the extent to which the M5 module was dysregulated in Rasgrf2−/− mice by perturbed mesolimbic dopamine signalling or by the loss of Rasgrf2 function in the NAcc. Conclusion Taken together, our findings indicate that the accumbal M5 module, initially identified as being dysregulated in male Rasgrf2−/− mice, is also relevant for human alcohol-related phenotypes potentially through the modulation of reinforcement mechanisms in the NAcc. We therefore propose that the genes comprising this module represent important candidates for further elucidation within the context of alcohol-related phenotypes. PMID:26679926

  20. Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

    PubMed

    Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

    2015-06-01

    To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.

  1. Metabolic engineering of the Stevia rebaudiana ent-kaurene biosynthetic pathway in recombinant Escherichia coli.

    PubMed

    Kong, Min Kyung; Kang, Hyun-Jun; Kim, Jin Ho; Oh, Soon Hwan; Lee, Pyung Cheon

    2015-11-20

    The ent-kaurene is a dedicated precursor pool and is responsible for synthesizing natural sweeteners such as steviol glycosides. In this study, to produce ent-kaurene in Escherichia coli, we modularly constructed and expressed two ent-kaurene genes encoding ent-copalyl diphosphate synthase (CPPS) and ent-kaurene synthase (KS) from Stevia rebaudiana known as a typical plant producing steviol glycoside. The CPPS and KS from S. rebaudiana were functionally expressed in a heterologous host E. coli. Furthermore, in order to enhance ent-kaurene production in E. coli, six geranylgeranyl diphosphate synthases (GGPPS) from various microorganisms and eight strains of E. coli as host were compared by measuring ent-kaurene production. The highest ent-kaurene production of approximately 41.1mg/L was demonstrated in E. coli strain MG1655 co-expressing synthetic CPPS-KS module and GGPPS from Rhodobacter sphaeroides. The ent-kaurene production was further increased up to 179.6 mg/L by overexpression of the three key enzymes for isoprenoid precursor, 1-deoxyxylulose-5-phosphate synthase (DXS), farnesyl diphosphate synthase (IspA) and isopentenyl diphosphate isomerase (IDI) from E. coli. Finally, the highest titer of ent-kaurene (578 mg/L) with a specific yield of ent-kaurene of 143.5mg/g dry cell weight was obtained by culturing E. coli strain MG1655 co-expressing the ent-kaurene module, DXS, IDI and IspA in 1L bioreactor containing 20 g/L glycerol. Copyright © 2015 Elsevier B.V. All rights reserved.

  2. Co-expression of cystatin inhibitors OCI and OCII in transgenic potato plants alters Colorado potato beetle development

    USDA-ARS?s Scientific Manuscript database

    Oryzacystatins I and II (OCI and OCII) show potential for controlling pests that utilize cysteine proteinases for protein digestion. To strengthen individual inhibitory range and achieve an additive effect in the overall efficiency of these proteins against pests, both cystatin genes were co-express...

  3. Benchmark Dose Modeling Estimates of the Concentrations of Inorganic Arsenic That Induce Changes to the Neonatal Transcriptome, Proteome, and Epigenome in a Pregnancy Cohort.

    PubMed

    Rager, Julia E; Auerbach, Scott S; Chappell, Grace A; Martin, Elizabeth; Thompson, Chad M; Fry, Rebecca C

    2017-10-16

    Prenatal inorganic arsenic (iAs) exposure influences the expression of critical genes and proteins associated with adverse outcomes in newborns, in part through epigenetic mediators. The doses at which these genomic and epigenomic changes occur have yet to be evaluated in the context of dose-response modeling. The goal of the present study was to estimate iAs doses that correspond to changes in transcriptomic, proteomic, epigenomic, and integrated multi-omic signatures in human cord blood through benchmark dose (BMD) modeling. Genome-wide DNA methylation, microRNA expression, mRNA expression, and protein expression levels in cord blood were modeled against total urinary arsenic (U-tAs) levels from pregnant women exposed to varying levels of iAs. Dose-response relationships were modeled in BMDExpress, and BMDs representing 10% response levels were estimated. Overall, DNA methylation changes were estimated to occur at lower exposure concentrations in comparison to other molecular endpoints. Multi-omic module eigengenes were derived through weighted gene co-expression network analysis, representing co-modulated signatures across transcriptomic, proteomic, and epigenomic profiles. One module eigengene was associated with decreased gestational age occurring alongside increased iAs exposure. Genes/proteins within this module eigengene showed enrichment for organismal development, including potassium voltage-gated channel subfamily Q member 1 (KCNQ1), an imprinted gene showing differential methylation and expression in response to iAs. Modeling of this prioritized multi-omic module eigengene resulted in a BMD(BMDL) of 58(45) μg/L U-tAs, which was estimated to correspond to drinking water arsenic concentrations of 51(40) μg/L. Results are in line with epidemiological evidence supporting effects of prenatal iAs occurring at levels <100 μg As/L urine. Together, findings present a variety of BMD measures to estimate doses at which prenatal iAs exposure influences neonatal outcome-relevant transcriptomic, proteomic, and epigenomic profiles.

  4. Temporal network analysis identifies early physiological and transcriptomic indicators of mild drought in Brassica rapa

    PubMed Central

    Gehan, Malia A; Mockler, Todd C; Weinig, Cynthia; Ewers, Brent E

    2017-01-01

    The dynamics of local climates make development of agricultural strategies challenging. Yield improvement has progressed slowly, especially in drought-prone regions where annual crop production suffers from episodic aridity. Underlying drought responses are circadian and diel control of gene expression that regulate daily variations in metabolic and physiological pathways. To identify transcriptomic changes that occur in the crop Brassica rapa during initial perception of drought, we applied a co-expression network approach to associate rhythmic gene expression changes with physiological responses. Coupled analysis of transcriptome and physiological parameters over a two-day time course in control and drought-stressed plants provided temporal resolution necessary for correlation of network modules with dynamic changes in stomatal conductance, photosynthetic rate, and photosystem II efficiency. This approach enabled the identification of drought-responsive genes based on their differential rhythmic expression profiles in well-watered versus droughted networks and provided new insights into the dynamic physiological changes that occur during drought. PMID:28826479

  5. From Coexpression to Coregulation: An Approach to Inferring Transcriptional Regulation Among Gene Classes from Large-Scale Expression Data

    NASA Technical Reports Server (NTRS)

    Mjolsness, Eric; Castano, Rebecca; Mann, Tobias; Wold, Barbara

    2000-01-01

    We provide preliminary evidence that existing algorithms for inferring small-scale gene regulation networks from gene expression data can be adapted to large-scale gene expression data coming from hybridization microarrays. The essential steps are (I) clustering many genes by their expression time-course data into a minimal set of clusters of co-expressed genes, (2) theoretically modeling the various conditions under which the time-courses are measured using a continuous-time analog recurrent neural network for the cluster mean time-courses, (3) fitting such a regulatory model to the cluster mean time courses by simulated annealing with weight decay, and (4) analysing several such fits for commonalities in the circuit parameter sets including the connection matrices. This procedure can be used to assess the adequacy of existing and future gene expression time-course data sets for determining transcriptional regulatory relationships such as coregulation.

  6. Computational gene expression profiling under salt stress reveals patterns of co-expression

    PubMed Central

    Sanchita; Sharma, Ashok

    2016-01-01

    Plants respond differently to environmental conditions. Among various abiotic stresses, salt stress is a condition where excess salt in soil causes inhibition of plant growth. To understand the response of plants to the stress conditions, identification of the responsible genes is required. Clustering is a data mining technique used to group the genes with similar expression. The genes of a cluster show similar expression and function. We applied clustering algorithms on gene expression data of Solanum tuberosum showing differential expression in Capsicum annuum under salt stress. The clusters, which were common in multiple algorithms were taken further for analysis. Principal component analysis (PCA) further validated the findings of other cluster algorithms by visualizing their clusters in three-dimensional space. Functional annotation results revealed that most of the genes were involved in stress related responses. Our findings suggest that these algorithms may be helpful in the prediction of the function of co-expressed genes. PMID:26981411

  7. The structure of a gene co-expression network reveals biological functions underlying eQTLs.

    PubMed

    Villa-Vialaneix, Nathalie; Liaubet, Laurence; Laurent, Thibault; Cherel, Pierre; Gamot, Adrien; SanCristobal, Magali

    2013-01-01

    What are the commonalities between genes, whose expression level is partially controlled by eQTL, especially with regard to biological functions? Moreover, how are these genes related to a phenotype of interest? These issues are particularly difficult to address when the genome annotation is incomplete, as is the case for mammalian species. Moreover, the direct link between gene expression and a phenotype of interest may be weak, and thus difficult to handle. In this framework, the use of a co-expression network has proven useful: it is a robust approach for modeling a complex system of genetic regulations, and to infer knowledge for yet unknown genes. In this article, a case study was conducted with a mammalian species. It showed that the use of a co-expression network based on partial correlation, combined with a relevant clustering of nodes, leads to an enrichment of biological functions of around 83%. Moreover, the use of a spatial statistics approach allowed us to superimpose additional information related to a phenotype; this lead to highlighting specific genes or gene clusters that are related to the network structure and the phenotype. Three main results are worth noting: first, key genes were highlighted as a potential focus for forthcoming biological experiments; second, a set of biological functions, which support a list of genes under partial eQTL control, was set up by an overview of the global structure of the gene expression network; third, pH was found correlated with gene clusters, and then with related biological functions, as a result of a spatial analysis of the network topology.

  8. Coexpression of α6β4 Integrin and Guanine Nucleotide Exchange Factor Net1 Identifies Node-Positive Breast Cancer Patients at High Risk for Distant Metastasis

    PubMed Central

    Gilcrease, Michael Z.; Kilpatrick, Shannan K.; Woodward, Wendy A.; Zhou, Xiao; Nicolas, Marlo M.; Corley, Lynda J.; Fuller, Gregory N.; Tucker, Susan L.; Diaz, Leslie K.; Buchholz, Thomas A.; Frost, Jeffrey A.

    2009-01-01

    Preclinical data indicate that α6β4 integrin signaling through Ras homolog gene family, member A, plays an important role in tumor cell motility. The objective of this study was to determine whether the combined expression of α6β4 integrin and neuroepithelioma transforming gene 1 (Net1), a guanine nucleotide exchange factor specific for Ras homolog gene family member A, is associated with adverse clinical outcome in breast cancer patients. Immunohistochemical expression of each protein was evaluated in a tumor tissue microarray prepared from the primary tumors of 94 node-positive patients with invasive breast carcinoma treated with total mastectomy and doxorubicin-based chemotherapy without radiation with a median follow-up of 12.5 years. Associations between staining results and multiple clinicopathologic variables were investigated. Although there was no significant association between α6β4 integrin or Net1 expression and clinical outcome when each marker was considered individually, coexpression of α6β4 and Net1 was associated with decreased distant metastasis–free survival (P = 0.030). In the subset of patients with hormone receptor–positive tumors, coexpression of α6β4 and Net1 was associated with a decrease in distant metastasis–free and overall survival (P < 0.001 and P = 0.006, respectively). Although an association between human epidermal growth factor receptor 2 expression and coexpression of α6β4 and Net1 (P = 0.008) was observed, coexpression of α6β4 and Net1 (hazard ratio, 1.63; P = 0.02) and lymphovascular invasion (hazard ratio, 2.35; P = 0.02) were the only factors independently associated with the development of distant metastasis in multivariate analysis. These findings suggest that coexpression of α6β4 integrin and Net1 could be a useful biomarker for aggressive disease in node-positive breast cancer patients. PMID:19124484

  9. Coexpression of the High Molecular Weight Glutenin Subunit 1Ax1 and Puroindoline Improves Dough Mixing Properties in Durum Wheat (Triticum turgidum L. ssp. durum)

    PubMed Central

    Li, Xiaoyan; Xiao, Xin; Sun, Fusheng; Wang, Cheng; Hu, Wei; Feng, Zhijuan; Chang, Junli; Chen, Mingjie; Wang, Yuesheng; Li, Kexiu; Yang, Guangxiao; He, Guangyuan

    2012-01-01

    Wheat end-use quality mainly derives from two interrelated characteristics: the compositions of gluten proteins and grain hardness. The composition of gluten proteins determines dough rheological properties and thus confers the unique viscoelastic property on dough. One group of gluten proteins, high molecular weight glutenin subunits (HMW-GS), plays an important role in dough functional properties. On the other hand, grain hardness, which influences the milling process of flour, is controlled by Puroindoline a (Pina) and Puroindoline b (Pinb) genes. However, little is known about the combined effects of HMW-GS and PINs on dough functional properties. In this study, we crossed a Pina-expressing transgenic line with a 1Ax1-expressing line of durum wheat and screened out lines coexpressing 1Ax1 and Pina or lines expressing either 1Ax1 or Pina. Dough mixing analysis of these lines demonstrated that expression of 1Ax1 improved both dough strength and over-mixing tolerance, while expression of PINA detrimentally affected the dough resistance to extension. In lines coexpressing 1Ax1 and Pina, faster hydration of flour during mixing was observed possibly due to the lower water absorption and damaged starch caused by PINA expression. In addition, expression of 1Ax1 appeared to compensate the detrimental effect of PINA on dough resistance to extension. Consequently, coexpression of 1Ax1 and PINA in durum wheat had combined effects on dough mixing behaviors with a better dough strength and resistance to extension than those from lines expressing either 1Ax1 or Pina. The results in our study suggest that simultaneous modulation of dough strength and grain hardness in durum wheat could significantly improve its breadmaking quality and may not even impair its pastamaking potential. Therefore, coexpression of 1Ax1 and PINA in durum wheat has useful implications for breeding durum wheat with dual functionality (for pasta and bread) and may improve the economic values of durum wheat. PMID:23185532

  10. Coexpression of the high molecular weight glutenin subunit 1Ax1 and puroindoline improves dough mixing properties in durum wheat (Triticum turgidum L. ssp. durum).

    PubMed

    Li, Yin; Wang, Qiong; Li, Xiaoyan; Xiao, Xin; Sun, Fusheng; Wang, Cheng; Hu, Wei; Feng, Zhijuan; Chang, Junli; Chen, Mingjie; Wang, Yuesheng; Li, Kexiu; Yang, Guangxiao; He, Guangyuan

    2012-01-01

    Wheat end-use quality mainly derives from two interrelated characteristics: the compositions of gluten proteins and grain hardness. The composition of gluten proteins determines dough rheological properties and thus confers the unique viscoelastic property on dough. One group of gluten proteins, high molecular weight glutenin subunits (HMW-GS), plays an important role in dough functional properties. On the other hand, grain hardness, which influences the milling process of flour, is controlled by Puroindoline a (Pina) and Puroindoline b (Pinb) genes. However, little is known about the combined effects of HMW-GS and PINs on dough functional properties. In this study, we crossed a Pina-expressing transgenic line with a 1Ax1-expressing line of durum wheat and screened out lines coexpressing 1Ax1 and Pina or lines expressing either 1Ax1 or Pina. Dough mixing analysis of these lines demonstrated that expression of 1Ax1 improved both dough strength and over-mixing tolerance, while expression of PINA detrimentally affected the dough resistance to extension. In lines coexpressing 1Ax1 and Pina, faster hydration of flour during mixing was observed possibly due to the lower water absorption and damaged starch caused by PINA expression. In addition, expression of 1Ax1 appeared to compensate the detrimental effect of PINA on dough resistance to extension. Consequently, coexpression of 1Ax1 and PINA in durum wheat had combined effects on dough mixing behaviors with a better dough strength and resistance to extension than those from lines expressing either 1Ax1 or Pina. The results in our study suggest that simultaneous modulation of dough strength and grain hardness in durum wheat could significantly improve its breadmaking quality and may not even impair its pastamaking potential. Therefore, coexpression of 1Ax1 and PINA in durum wheat has useful implications for breeding durum wheat with dual functionality (for pasta and bread) and may improve the economic values of durum wheat.

  11. Co-regulation analysis of co-expressed modules under cold and pathogen stress conditions in tomato.

    PubMed

    Abedini, Davar; Rashidi Monfared, Sajad

    2018-06-01

    A primary mechanism for controlling the development of multicellular organisms is transcriptional regulation, which carried out by transcription factors (TFs) that recognize and bind to their binding sites on promoter region. The distance from translation start site, order, orientation, and spacing between cis elements are key factors in the concentration of active nuclear TFs and transcriptional regulation of target genes. In this study, overrepresented motifs in cold and pathogenesis responsive genes were scanned via Gibbs sampling method, this method is based on detection of overrepresented motifs by means of a stochastic optimization strategy that searches for all possible sets of short DNA segments. Then, identified motifs were checked by TRANSFAC, PLACE and Soft Berry databases in order to identify putative TFs which, interact to the motifs. Several cis/trans regulatory elements were found using these databases. Moreover, cross-talk between cold and pathogenesis responsive genes were confirmed. Statistical analysis was used to determine distribution of identified motifs on promoter region. In addition, co-regulation analysis results, illustrated genes in pathogenesis responsive module are divided into two main groups. Also, promoter region was crunched to six subareas in order to draw the pattern of distribution of motifs in promoter subareas. The result showed the majority of motifs are concentrated on 700 nucleotides upstream of the translational start site (ATG). In contrast, this result isn't true in another group. In other words, there was no difference between total and compartmentalized regions in cold responsive genes.

  12. Coexpression of Nuclear Receptors and Histone Methylation Modifying Genes in the Testis: Implications for Endocrine Disruptor Modes of Action

    PubMed Central

    Anderson, Alison M.; Carter, Kim W.; Anderson, Denise; Wise, Michael J.

    2012-01-01

    Background Endocrine disruptor chemicals elicit adverse health effects by perturbing nuclear receptor signalling systems. It has been speculated that these compounds may also perturb epigenetic mechanisms and thus contribute to the early origin of adult onset disease. We hypothesised that histone methylation may be a component of the epigenome that is susceptible to perturbation. We used coexpression analysis of publicly available data to investigate the combinatorial actions of nuclear receptors and genes involved in histone methylation in normal testis and when faced with endocrine disruptor compounds. Methodology/Principal Findings The expression patterns of a set of genes were profiled across testis tissue in human, rat and mouse, plus control and exposed samples from four toxicity experiments in the rat. Our results indicate that histone methylation events are a more general component of nuclear receptor mediated transcriptional regulation in the testis than previously appreciated. Coexpression patterns support the role of a gatekeeper mechanism involving the histone methylation modifiers Kdm1, Prdm2, and Ehmt1 and indicate that this mechanism is a common determinant of transcriptional integrity for genes critical to diverse physiological endpoints relevant to endocrine disruption. Coexpression patterns following exposure to vinclozolin and dibutyl phthalate suggest that coactivity of the demethylase Kdm1 in particular warrants further investigation in relation to endocrine disruptor mode of action. Conclusions/Significance This study provides proof of concept that a bioinformatics approach that profiles genes related to a specific hypothesis across multiple biological settings can provide powerful insight into coregulatory activity that would be difficult to discern at an individual experiment level or by traditional differential expression analysis methods. PMID:22496781

  13. Isolation of genes negatively or positively co-expressed with human recombination activating gene 1 (RAG1) by differential display PCR (DD RT-PCR).

    PubMed

    Verkoczy, L K; Berinstein, N L

    1998-10-01

    Differential display PCR (DD RT-PCR) has been extensively used for analysis of differential gene expression, but continues to be hampered by technical limitations that impair its effectiveness. In order to isolate novel genes co-expressing with human RAG1, we have developed an effective, multi-tiered screening/purification approach which effectively complements the standard DD RT-PCR methodology. In 'primary' screens, standard DD RT-PCR was used, detecting 22 reproducible differentially expressed amplicons between clonally related cell variants with differential constitutive expression of RAG mRNAs. 'Secondary' screens used differential display (DD) amplicons as probes in low and high stringency northern blotting. Eight of 22 independent DD amplicons detected nine independent differentially expressed transcripts. 'Tertiary' screens used reconfirmed amplicons as probes in northern analysis of multiple RAG-and RAG+sources. Reconfirmed DD amplicons detected six independent RAG co-expressing transcripts. All DD amplicons reconfirmed by northern blot were a heterogeneous mixture of cDNAs, necessitating further purification to isolate single cDNAs prior to subcloning and sequencing. To effectively select the appropriate cDNAs from DD amplicons, we excised and eluted the cDNA(s) directly from regions of prior northern blots in which differentially expressed transcripts were detected. Sequences of six purified cDNA clones specifically detecting RAG co-expressing transcripts included matches to portions of the human RAG2 and BSAP regions and to four novel partial cDNAs (three with homologies to human ESTs). Overall, our results also suggest that even when using clonally related variants from the same cell line in addition to all appropriate internal controls previously reported, further screening and purification steps are still required in order to efficiently and specifically isolate differentially expressed genes by DD RT-PCR.

  14. Visual gene-network analysis reveals the cancer gene co-expression in human endometrial cancer

    PubMed Central

    2014-01-01

    Background Endometrial cancers (ECs) are the most common form of gynecologic malignancy. Recent studies have reported that ECs reveal distinct markers for molecular pathogenesis, which in turn is linked to the various histological types of ECs. To understand further the molecular events contributing to ECs and endometrial tumorigenesis in general, a more precise identification of cancer-associated molecules and signaling networks would be useful for the detection and monitoring of malignancy, improving clinical cancer therapy, and personalization of treatments. Results ECs-specific gene co-expression networks were constructed by differential expression analysis and weighted gene co-expression network analysis (WGCNA). Important pathways and putative cancer hub genes contribution to tumorigenesis of ECs were identified. An elastic-net regularized classification model was built using the cancer hub gene signatures to predict the phenotypic characteristics of ECs. The 19 cancer hub gene signatures had high predictive power to distinguish among three key principal features of ECs: grade, type, and stage. Intriguingly, these hub gene networks seem to contribute to ECs progression and malignancy via cell-cycle regulation, antigen processing and the citric acid (TCA) cycle. Conclusions The results of this study provide a powerful biomarker discovery platform to better understand the progression of ECs and to uncover potential therapeutic targets in the treatment of ECs. This information might lead to improved monitoring of ECs and resulting improvement of treatment of ECs, the 4th most common of cancer in women. PMID:24758163

  15. Genome-wide analysis of coordinated transcript abundance during seed development in different Brassica rapa morphotypes.

    PubMed

    Basnet, Ram Kumar; Moreno-Pachon, Natalia; Lin, Ke; Bucher, Johan; Visser, Richard G F; Maliepaard, Chris; Bonnema, Guusje

    2013-12-01

    Brassica seeds are important as basic units of plant growth and sources of vegetable oil. Seed development is regulated by many dynamic metabolic processes controlled by complex networks of spatially and temporally expressed genes. We conducted a global microarray gene co-expression analysis by measuring transcript abundance of developing seeds from two diverse B. rapa morphotypes: a pak choi (leafy-type) and a yellow sarson (oil-type), and two of their doubled haploid (DH) progenies, (1) to study the timing of metabolic processes in developing seeds, (2) to explore the major transcriptional differences in developing seeds of the two morphotypes, and (3) to identify the optimum stage for a genetical genomics study in B. rapa seed. Seed developmental stages were similar in developing seeds of pak choi and yellow sarson of B. rapa; however, the colour of embryo and seed coat differed among these two morphotypes. In this study, most transcriptional changes occurred between 25 and 35 DAP, which shows that the timing of seed developmental processes in B. rapa is at later developmental stages than in the related species B. napus. Using a Weighted Gene Co-expression Network Analysis (WGCNA), we identified 47 "gene modules", of which 27 showed a significant association with temporal and/or genotypic variation. An additional hierarchical cluster analysis identified broad spectra of gene expression patterns during seed development. The predominant variation in gene expression was according to developmental stages rather than morphotype differences. Since lipids are the major storage compounds of Brassica seeds, we investigated in more detail the regulation of lipid metabolism. Four co-regulated gene clusters were identified with 17 putative cis-regulatory elements predicted in their 1000 bp upstream region, either specific or common to different lipid metabolic pathways. This is the first study of genome-wide profiling of transcript abundance during seed development in B. rapa. The identification of key physiological events, major expression patterns, and putative cis-regulatory elements provides useful information to construct gene regulatory networks in B. rapa developing seeds and provides a starting point for a genetical genomics study of seed quality traits.

  16. Exploring Plant Co-Expression and Gene-Gene Interactions with CORNET 3.0.

    PubMed

    Van Bel, Michiel; Coppens, Frederik

    2017-01-01

    Selecting and filtering a reference expression and interaction dataset when studying specific pathways and regulatory interactions can be a very time-consuming and error-prone task. In order to reduce the duplicated efforts required to amass such datasets, we have created the CORNET (CORrelation NETworks) platform which allows for easy access to a wide variety of data types: coexpression data, protein-protein interactions, regulatory interactions, and functional annotations. The CORNET platform outputs its results in either text format or through the Cytoscape framework, which is automatically launched by the CORNET website.CORNET 3.0 is the third iteration of the web platform designed for the user exploration of the coexpression space of plant genomes, with a focus on the model species Arabidopsis thaliana. Here we describe the platform: the tools, data, and best practices when using the platform. We indicate how the platform can be used to infer networks from a set of input genes, such as upregulated genes from an expression experiment. By exploring the network, new target and regulator genes can be discovered, allowing for follow-up experiments and more in-depth study. We also indicate how to avoid common pitfalls when evaluating the networks and how to avoid over interpretation of the results.All CORNET versions are available at http://bioinformatics.psb.ugent.be/cornet/ .

  17. Influence of socioeconomic status on the whole blood transcriptome in African Americans.

    PubMed

    Gaye, Amadou; Gibbons, Gary H; Barry, Charles; Quarells, Rakale; Davis, Sharon K

    2017-01-01

    The correlation between low socioeconomic status (SES) and poor health outcome or higher risk of disease has been consistently reported by many epidemiological studies across various race/ancestry groups. However, the biological mechanisms linking low SES to disease and/or disease risk factors are not well understood and remain relatively under-studied. The analysis of the blood transcriptome is a promising window for elucidating how social and environmental factors influence the molecular networks governing health and disease. To further define the mechanistic pathways between social determinants and health, this study examined the impact of SES on the blood transcriptome in a sample of African-Americans. An integrative approach leveraging three complementary methods (Weighted Gene Co-expression Network Analysis, Random Forest and Differential Expression) was adopted to identify the most predictive and robust transcriptome pathways associated with SES. We analyzed the expression of 15079 genes (RNA-seq) from whole blood across 36 samples. The results revealed a cluster of 141 co-expressed genes over-expressed in the low SES group. Three pro-inflammatory pathways (IL-8 Signaling, NF-κB Signaling and Dendritic Cell Maturation) are activated in this module and over-expressed in low SES. Random Forest analysis revealed 55 of the 141 genes that, collectively, predict SES with an area under the curve of 0.85. One third of the 141 genes are significantly over-expressed in the low SES group. Lower SES has consistently been linked to many social and environmental conditions acting as stressors and known to be correlated with vulnerability to chronic illnesses (e.g. asthma, diabetes) associated with a chronic inflammatory state. Our unbiased analysis of the blood transcriptome in African-Americans revealed evidence of a robust molecular signature of increased inflammation associated with low SES. The results provide a plausible link between the social factors and chronic inflammation.

  18. Placental protection of the fetal brain during short-term food deprivation.

    PubMed

    Broad, Kevin D; Keverne, Eric B

    2011-09-13

    The fetal genome regulates maternal physiology and behavior via its placenta, which produces hormones that act on the maternal hypothalamus. At the same time, the fetus itself develops a hypothalamus. In this study we show that many of the genes that regulate placental development also regulate the developing hypothalamus, and in mouse the coexpression of these genes is particularly high on embryonic days 12 and 13 (days E12-13). Such synchronized expression is regulated, in part, by the maternally imprinted gene, paternally expressed gene 3 (Peg3), which also is developmentally coexpressed in the hypothalamus and placenta at days E12-13. We further show that challenging this genomic linkage of hypothalamus and placenta with 24-h food deprivation results in disruption to coexpressed genes, primarily by affecting placental gene expression. Food deprivation also produces a significant decrease in Peg3 gene expression in the placenta, with consequences similar to many of the placental gene changes induced by Peg3 mutation. Such genomic dysregulation does not occur in the hypothalamus, where Peg3 expression increases with food deprivation. Thus, changes in gene expression brought about by food deprivation are consistent with the fetal genome's maintaining hypothalamic development at a cost to its placenta. This biased change to gene dysregulation in the placenta is linked to autophagy and ribosomal turnover, which sustain, in the short term, nutrient supply for the developing hypothalamus. Thus, the fetus controls its own destiny in times of acute starvation by short-term sacrifice of the placenta to preserve brain development.

  19. Germline Chd8 haploinsufficiency alters brain development in mouse

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gompers, Andrea L.; Su-Feher, Linda; Ellegood, Jacob

    The chromatin remodeling gene CHD8 represents a central node in neurodevelopmental gene networks implicated in autism. In this paper, we examined the impact of germline heterozygous frameshift Chd8 mutation on neurodevelopment in mice. Chd8 +/ del5 mice displayed normal social interactions with no repetitive behaviors but exhibited cognitive impairment correlated with increased regional brain volume, validating that phenotypes of Chd8 +/ del5 mice overlap pathology reported in humans with CHD8 mutations. We applied network analysis to characterize neurodevelopmental gene expression, revealing widespread transcriptional changes in Chd8 +/ del5 mice across pathways disrupted in neurodevelopmental disorders, including neurogenesis, synaptic processes andmore » neuroimmune signaling. We identified a co-expression module with peak expression in early brain development featuring dysregulation of RNA processing, chromatin remodeling and cell-cycle genes enriched for promoter binding by Chd8, and we validated increased neuronal proliferation and developmental splicing perturbation in Chd8 +/ del5 mice. Finally, this integrative analysis offers an initial picture of the consequences of Chd8 haploinsufficiency for brain development.« less

  20. Germline Chd8 haploinsufficiency alters brain development in mouse

    DOE PAGES

    Gompers, Andrea L.; Su-Feher, Linda; Ellegood, Jacob; ...

    2017-06-26

    The chromatin remodeling gene CHD8 represents a central node in neurodevelopmental gene networks implicated in autism. In this paper, we examined the impact of germline heterozygous frameshift Chd8 mutation on neurodevelopment in mice. Chd8 +/ del5 mice displayed normal social interactions with no repetitive behaviors but exhibited cognitive impairment correlated with increased regional brain volume, validating that phenotypes of Chd8 +/ del5 mice overlap pathology reported in humans with CHD8 mutations. We applied network analysis to characterize neurodevelopmental gene expression, revealing widespread transcriptional changes in Chd8 +/ del5 mice across pathways disrupted in neurodevelopmental disorders, including neurogenesis, synaptic processes andmore » neuroimmune signaling. We identified a co-expression module with peak expression in early brain development featuring dysregulation of RNA processing, chromatin remodeling and cell-cycle genes enriched for promoter binding by Chd8, and we validated increased neuronal proliferation and developmental splicing perturbation in Chd8 +/ del5 mice. Finally, this integrative analysis offers an initial picture of the consequences of Chd8 haploinsufficiency for brain development.« less

  1. Transcriptome profiling analysis reveals biomarkers in colon cancer samples of various differentiation

    PubMed Central

    Yu, Tonghu; Zhang, Huaping; Qi, Hong

    2018-01-01

    The aim of the present study was to investigate more colon cancer-related genes in different stages. Gene expression profile E-GEOD-62932 was extracted for differentially expressed gene (DEG) screening. Series test of cluster analysis was used to obtain significant trending models. Based on the Gene Ontology and Kyoto Encyclopedia of Genes and Genomes databases, functional and pathway enrichment analysis were processed and a pathway relation network was constructed. Gene co-expression network and gene signal network were constructed for common DEGs. The DEGs with the same trend were clustered and in total, 16 clusters with statistical significance were obtained. The screened DEGs were enriched into small molecule metabolic process and metabolic pathways. The pathway relation network was constructed with 57 nodes. A total of 328 common DEGs were obtained. Gene signal network was constructed with 71 nodes. Gene co-expression network was constructed with 161 nodes and 211 edges. ABCD3, CPT2, AGL and JAM2 are potential biomarkers for the diagnosis of colon cancer. PMID:29928385

  2. Differential co-expression and regulation analyses reveal different mechanisms underlying major depressive disorder and subsyndromal symptomatic depression.

    PubMed

    Xu, Fan; Yang, Jing; Chen, Jin; Wu, Qingyuan; Gong, Wei; Zhang, Jianguo; Shao, Weihua; Mu, Jun; Yang, Deyu; Yang, Yongtao; Li, Zhiwei; Xie, Peng

    2015-04-03

    Recent depression research has revealed a growing awareness of how to best classify depression into depressive subtypes. Appropriately subtyping depression can lead to identification of subtypes that are more responsive to current pharmacological treatment and aid in separating out depressed patients in which current antidepressants are not particularly effective. Differential co-expression analysis (DCEA) and differential regulation analysis (DRA) were applied to compare the transcriptomic profiles of peripheral blood lymphocytes from patients with two depressive subtypes: major depressive disorder (MDD) and subsyndromal symptomatic depression (SSD). Six differentially regulated genes (DRGs) (FOSL1, SRF, JUN, TFAP4, SOX9, and HLF) and 16 transcription factor-to-target differentially co-expressed gene links or pairs (TF2target DCLs) appear to be the key differential factors in MDD; in contrast, one DRG (PATZ1) and eight TF2target DCLs appear to be the key differential factors in SSD. There was no overlap between the MDD target genes and SSD target genes. Venlafaxine (Efexor™, Effexor™) appears to have a significant effect on the gene expression profile of MDD patients but no significant effect on the gene expression profile of SSD patients. DCEA and DRA revealed no apparent similarities between the differential regulatory processes underlying MDD and SSD. This bioinformatic analysis may provide novel insights that can support future antidepressant R&D efforts.

  3. Recombination–deletion between homologous cassettes in retrovirus is suppressed via a strategy of degenerate codon substitution

    PubMed Central

    Im, Eung Jun; Bais, Anthony J; Yang, Wen; Ma, Qiangzhong; Guo, Xiuyang; Sepe, Steven M; Junghans, Richard P

    2014-01-01

    Transduction and expression procedures in gene therapy protocols may optimally transfer more than a single gene to correct a defect and/or transmit new functions to recipient cells or organisms. This may be accomplished by transduction with two (or more) vectors, or, more efficiently, in a single vector. Occasionally, it may be useful to coexpress homologous genes or chimeric proteins with regions of shared homology. Retroviridae include the dominant vector systems for gene transfer (e.g., gamma-retro and lentiviruses) and are capable of such multigene expression. However, these same viruses are known for efficient recombination–deletion when domains are duplicated within the viral genome. This problem can be averted by resorting to two-vector strategies (two-chain two-vector), but at a penalty to cost, convenience, and efficiency. Employing a chimeric antigen receptor system as an example, we confirm that coexpression of two genes with homologous domains in a single gamma-retroviral vector (two-chain single-vector) leads to recombination–deletion between repeated sequences, excising the equivalent of one of the chimeric antigen receptors. Here, we show that a degenerate codon substitution strategy in the two-chain single-vector format efficiently suppressed intravector deletional loss with rescue of balanced gene coexpression by minimizing sequence homology between repeated domains and preserving the final protein sequence. PMID:25419532

  4. Dual expression of Epstein-Barr virus, latent membrane protein-1 and human papillomavirus-16 E6 transform primary mouse embryonic fibroblasts through NF-κB signaling.

    PubMed

    Shimabuku, Tetsuya; Tamanaha, Ayumi; Kitamura, Bunta; Tanabe, Yasuka; Tawata, Natsumi; Ikehara, Fukino; Arakaki, Kazunari; Kinjo, Takao

    2014-01-01

    The prevalence of Epstein-Barr virus (EBV) and high-risk human papilloma virus (HPV) infections in patients with oral cancer in Okinawa, southwest islands of Japan, has led to the hypothesis that carcinogenesis is related to EBV and HPV co-infection. To explore the mechanisms of transformation induced by EBV and HPV co-infection, we analyzed the transformation of primary mouse embryonic fibroblasts (MEFs) expressing EBV and HPV-16 genes, alone or in combination. Expression of EBV latent membrane protein-1 (LMP-1) alone or in combination with HPV-16 E6 increased cell proliferation and decreased apoptosis, whereas single expression of EBV nuclear antigen-1 (EBNA-1), or HPV-16 E6 did not. Co-expression of LMP-1 and E6 induced anchorage-independent growth and tumor formation in nude mice, whereas expression of LMP-1 alone did not. Although the singular expression of these viral genes showed increased DNA damage and DNA damage response (DDR), co-expression of LMP-1 and E6 did not induce DDR, which is frequently seen in cancer cells. Furthermore, co-expression of LMP-1 with E6 increased NF-κB signaling, and the knockdown of LMP-1 or E6 in co-expressing cells decreased cell proliferation, anchorage independent growth, and NF-κB activation. These data suggested that expression of individual viral genes is insufficient for inducing transformation and that co-expression of LMP-1 and E6, which is associated with suppression of DDR and increased NF-κB activity, lead to transformation. Our findings demonstrate the synergistic effect by the interaction of oncogenes from different viruses on the transformation of primary MEFs.

  5. Overproduction of Geranylgeraniol by Metabolically Engineered Saccharomyces cerevisiae▿

    PubMed Central

    Tokuhiro, Kenro; Muramatsu, Masayoshi; Ohto, Chikara; Kawaguchi, Toshiya; Obata, Shusei; Muramoto, Nobuhiko; Hirai, Masana; Takahashi, Haruo; Kondo, Akihiko; Sakuradani, Eiji; Shimizu, Sakayu

    2009-01-01

    (E, E, E)-Geranylgeraniol (GGOH) is a valuable starting material for perfumes and pharmaceutical products. In the yeast Saccharomyces cerevisiae, GGOH is synthesized from the end products of the mevalonate pathway through the sequential reactions of farnesyl diphosphate synthetase (encoded by the ERG20 gene), geranylgeranyl diphosphate synthase (the BTS1 gene), and some endogenous phosphatases. We demonstrated that overexpression of the diacylglycerol diphosphate phosphatase (DPP1) gene could promote GGOH production. We also found that overexpression of a BTS1-DPP1 fusion gene was more efficient for producing GGOH than coexpression of these genes separately. Overexpression of the hydroxymethylglutaryl-coenzyme A reductase (HMG1) gene, which encodes the major rate-limiting enzyme of the mevalonate pathway, resulted in overproduction of squalene (191.9 mg liter−1) rather than GGOH (0.2 mg liter−1) in test tube cultures. Coexpression of the BTS1-DPP1 fusion gene along with the HMG1 gene partially redirected the metabolic flux from squalene to GGOH. Additional expression of a BTS1-ERG20 fusion gene resulted in an almost complete shift of the flux to GGOH production (228.8 mg liter−1 GGOH and 6.5 mg liter−1 squalene). Finally, we constructed a diploid prototrophic strain coexpressing the HMG1, BTS1-DPP1, and BTS1-ERG20 genes from multicopy integration vectors. This strain attained 3.31 g liter−1 GGOH production in a 10-liter jar fermentor with gradual feeding of a mixed glucose and ethanol solution. The use of bifunctional fusion genes such as the BTS1-DPP1 and ERG20-BTS1 genes that code sequential enzymes in the metabolic pathway was an effective method for metabolic engineering. PMID:19592534

  6. Effects of threshold on the topology of gene co-expression networks.

    PubMed

    Couto, Cynthia Martins Villar; Comin, César Henrique; Costa, Luciano da Fontoura

    2017-09-26

    Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.

  7. A null model for Pearson coexpression networks.

    PubMed

    Gobbi, Andrea; Jurman, Giuseppe

    2015-01-01

    Gene coexpression networks inferred by correlation from high-throughput profiling such as microarray data represent simple but effective structures for discovering and interpreting linear gene relationships. In recent years, several approaches have been proposed to tackle the problem of deciding when the resulting correlation values are statistically significant. This is most crucial when the number of samples is small, yielding a non-negligible chance that even high correlation values are due to random effects. Here we introduce a novel hard thresholding solution based on the assumption that a coexpression network inferred by randomly generated data is expected to be empty. The threshold is theoretically derived by means of an analytic approach and, as a deterministic independent null model, it depends only on the dimensions of the starting data matrix, with assumptions on the skewness of the data distribution compatible with the structure of gene expression levels data. We show, on synthetic and array datasets, that the proposed threshold is effective in eliminating all false positive links, with an offsetting cost in terms of false negative detected edges.

  8. Genetic dissection of the Gpnmb network in the eye.

    PubMed

    Lu, Hong; Wang, Xusheng; Pullen, Matthew; Guan, Huaijin; Chen, Hui; Sahu, Shwetapadma; Zhang, Bing; Chen, Hao; Williams, Robert W; Geisert, Eldon E; Lu, Lu; Jablonski, Monica M

    2011-06-13

    To use a systematic genetics approach to investigate the regulation of Gpnmb, a gene that contributes to pigmentary dispersion syndrome (PDS) and pigmentary glaucoma (PG) in the DBA/2J (D2) mouse. Global patterns of gene expression were studied in whole eyes of a large family of BXD mouse strains (n = 67) generated by crossing the PDS- and PG-prone parent (DBA/2J) with a resistant strain (C57BL/6J). Quantitative trait locus (eQTL) mapping methods and gene set analysis were used to evaluate Gpnmb coexpression networks in wild-type and mutant cohorts. The level of Gpnmb expression was associated with a highly significant cis-eQTL at the location of the gene itself. This autocontrol of Gpnmb is likely to be a direct consequence of the known premature stop codon in exon 4. Both gene ontology and coexpression network analyses demonstrated that the mutation in Gpnmb radically modified the set of genes with which Gpnmb expression is correlated. The covariates of wild-type Gpnmb are involved in biological processes including melanin synthesis and cell migration, whereas the covariates of mutant Gpnmb are involved in the biological processes of posttranslational modification, stress activation, and sensory processing. These results demonstrated that a systematic genetics approach provides a powerful tool for constructing coexpression networks that define the biological process categories within which similarly regulated genes function. The authors showed that the R150X mutation in Gpnmb dramatically modified its list of genetic covariates, which may explain the associated ocular pathology.

  9. Co-expression of interleukin 12 enhances antitumor effects of a novel chimeric promoter-mediated suicide gene therapy in an immunocompetent mouse model

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xu, Yu, E-mail: xuyu1001@gmail.com; Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071; Liu, Zhengchun, E-mail: l135027@126.com

    Highlights: {yields} A novel chimeric promoter consisting of CArG element and hTERT promoter was developed. {yields} The promoter was characterized with radiation-inducibility and tumor-specificity. {yields} Suicide gene system driven by the promoter showed remarkable cytotoxicity in vitro. {yields} Co-expression of IL12 enhanced the promoter mediated suicide gene therapy in vivo. -- Abstract: The human telomerase reverse transcriptase (hTERT) promoter has been widely used in target gene therapy of cancer. However, low transcriptional activity limited its clinical application. Here, we designed a novel dual radiation-inducible and tumor-specific promoter system consisting of CArG elements and the hTERT promoter, resulting in increased expressionmore » of reporter genes after gamma-irradiation. Therapeutic and side effects of adenovirus-mediated horseradish peroxidase (HRP)/indole-3-acetic (IAA) system downstream of the chimeric promoter were evaluated in mice bearing Lewis lung carcinoma, combining with or without adenovirus-mediated interleukin 12 (IL12) gene driven by the cytomegalovirus promoter. The combination treatment showed more effective suppression of tumor growth than those with single agent alone, being associated with pronounced intratumoral T-lymphocyte infiltration and minor side effects. Our results suggest that the combination treatment with HRP/IAA system driven by the novel chimeric promoter and the co-expression of IL12 might be an effective and safe target gene therapy strategy of cancer.« less

  10. Construction and analysis of lncRNA-lncRNA synergistic networks to reveal clinically relevant lncRNAs in cancer.

    PubMed

    Li, Yongsheng; Chen, Juan; Zhang, Jinwen; Wang, Zishan; Shao, Tingting; Jiang, Chunjie; Xu, Juan; Li, Xia

    2015-09-22

    Long non-coding RNAs (lncRNAs) play key roles in diverse biological processes. Moreover, the development and progression of cancer often involves the combined actions of several lncRNAs. Here we propose a multi-step method for constructing lncRNA-lncRNA functional synergistic networks (LFSNs) through co-regulation of functional modules having three features: common coexpressed genes of lncRNA pairs, enrichment in the same functional category and close proximity within protein interaction networks. Applied to three cancers, we constructed cancer-specific LFSNs and found that they exhibit a scale free and modular architecture. In addition, cancer-associated lncRNAs tend to be hubs and are enriched within modules. Although there is little synergistic pairing of lncRNAs across cancers, lncRNA pairs involved in the same cancer hallmarks by regulating same or different biological processes. Finally, we identify prognostic biomarkers within cancer lncRNA expression datasets using modules derived from LFSNs. In summary, this proof-of-principle study indicates synergistic lncRNA pairs can be identified through integrative analysis of genome-wide expression data sets and functional information.

  11. Using genetic markers to orient the edges in quantitative trait networks: the NEO software.

    PubMed

    Aten, Jason E; Fuller, Tova F; Lusis, Aldons J; Horvath, Steve

    2008-04-15

    Systems genetic studies have been used to identify genetic loci that affect transcript abundances and clinical traits such as body weight. The pairwise correlations between gene expression traits and/or clinical traits can be used to define undirected trait networks. Several authors have argued that genetic markers (e.g expression quantitative trait loci, eQTLs) can serve as causal anchors for orienting the edges of a trait network. The availability of hundreds of thousands of genetic markers poses new challenges: how to relate (anchor) traits to multiple genetic markers, how to score the genetic evidence in favor of an edge orientation, and how to weigh the information from multiple markers. We develop and implement Network Edge Orienting (NEO) methods and software that address the challenges of inferring unconfounded and directed gene networks from microarray-derived gene expression data by integrating mRNA levels with genetic marker data and Structural Equation Model (SEM) comparisons. The NEO software implements several manual and automatic methods for incorporating genetic information to anchor traits. The networks are oriented by considering each edge separately, thus reducing error propagation. To summarize the genetic evidence in favor of a given edge orientation, we propose Local SEM-based Edge Orienting (LEO) scores that compare the fit of several competing causal graphs. SEM fitting indices allow the user to assess local and overall model fit. The NEO software allows the user to carry out a robustness analysis with regard to genetic marker selection. We demonstrate the utility of NEO by recovering known causal relationships in the sterol homeostasis pathway using liver gene expression data from an F2 mouse cross. Further, we use NEO to study the relationship between a disease gene and a biologically important gene co-expression module in liver tissue. The NEO software can be used to orient the edges of gene co-expression networks or quantitative trait networks if the edges can be anchored to genetic marker data. R software tutorials, data, and supplementary material can be downloaded from: http://www.genetics.ucla.edu/labs/horvath/aten/NEO.

  12. Understanding network concepts in modules

    PubMed Central

    2007-01-01

    Background Network concepts are increasingly used in biology and genetics. For example, the clustering coefficient has been used to understand network architecture; the connectivity (also known as degree) has been used to screen for cancer targets; and the topological overlap matrix has been used to define modules and to annotate genes. Dozens of potentially useful network concepts are known from graph theory. Results Here we study network concepts in special types of networks, which we refer to as approximately factorizable networks. In these networks, the pairwise connection strength (adjacency) between 2 network nodes can be factored into node specific contributions, named node 'conformity'. The node conformity turns out to be highly related to the connectivity. To provide a formalism for relating network concepts to each other, we define three types of network concepts: fundamental-, conformity-based-, and approximate conformity-based concepts. Fundamental concepts include the standard definitions of connectivity, density, centralization, heterogeneity, clustering coefficient, and topological overlap. The approximate conformity-based analogs of fundamental network concepts have several theoretical advantages. First, they allow one to derive simple relationships between seemingly disparate networks concepts. For example, we derive simple relationships between the clustering coefficient, the heterogeneity, the density, the centralization, and the topological overlap. The second advantage of approximate conformity-based network concepts is that they allow one to show that fundamental network concepts can be approximated by simple functions of the connectivity in module networks. Conclusion Using protein-protein interaction, gene co-expression, and simulated data, we show that a) many networks comprised of module nodes are approximately factorizable and b) in these types of networks, simple relationships exist between seemingly disparate network concepts. Our results are implemented in freely available R software code, which can be downloaded from the following webpage: http://www.genetics.ucla.edu/labs/horvath/ModuleConformity/ModuleNetworks PMID:17547772

  13. Altered Expression of Genes Implicated in Xylan Biosynthesis Affects Penetration Resistance against Powdery Mildew.

    PubMed

    Chowdhury, Jamil; Lück, Stefanie; Rajaraman, Jeyaraman; Douchkov, Dimitar; Shirley, Neil J; Schwerdt, Julian G; Schweizer, Patrick; Fincher, Geoffrey B; Burton, Rachel A; Little, Alan

    2017-01-01

    Heteroxylan has recently been identified as an important component of papillae, which are formed during powdery mildew infection of barley leaves. Deposition of heteroxylan near the sites of attempted fungal penetration in the epidermal cell wall is believed to enhance the physical resistance to the fungal penetration peg and hence to improve pre-invasion resistance. Several glycosyltransferase (GT) families are implicated in the assembly of heteroxylan in the plant cell wall, and are likely to work together in a multi-enzyme complex. Members of key GT families reported to be involved in heteroxylan biosynthesis are up-regulated in the epidermal layer of barley leaves during powdery mildew infection. Modulation of their expression leads to altered susceptibility levels, suggesting that these genes are important for penetration resistance. The highest level of resistance was achieved when a GT43 gene was co-expressed with a GT47 candidate gene, both of which have been predicted to be involved in xylan backbone biosynthesis. Altering the expression level of several candidate heteroxylan synthesis genes can significantly alter disease susceptibility. This is predicted to occur through changes in the amount and structure of heteroxylan in barley papillae.

  14. Engineered Biosynthesis of a Novel Amidated Polyketide, Using the Malonamyl-Specific Initiation Module from the Oxytetracycline Polyketide Synthase

    PubMed Central

    Zhang, Wenjun; Ames, Brian D.; Tsai, Shiou-Chuan; Tang, Yi

    2006-01-01

    Tetracyclines are aromatic polyketides biosynthesized by bacterial type II polyketide synthases (PKSs). Understanding the biochemistry of tetracycline PKSs is an important step toward the rational and combinatorial manipulation of tetracycline biosynthesis. To this end, we have sequenced the gene cluster of oxytetracycline (oxy and otc genes) PKS genes from Streptomyces rimosus. Sequence analysis revealed a total of 21 genes between the otrA and otrB resistance genes. We hypothesized that an amidotransferase, OxyD, synthesizes the malonamate starter unit that is a universal building block for tetracycline compounds. In vivo reconstitution using strain CH999 revealed that the minimal PKS and OxyD are necessary and sufficient for the biosynthesis of amidated polyketides. A novel alkaloid (WJ35, or compound 2) was synthesized as the major product when the oxy-encoded minimal PKS, the C-9 ketoreductase (OxyJ), and OxyD were coexpressed in CH999. WJ35 is an isoquinolone compound derived from an amidated decaketide backbone and cyclized with novel regioselectivity. The expression of OxyD with a heterologous minimal PKS did not afford similarly amidated polyketides, suggesting that the oxy-encoded minimal PKS possesses novel starter unit specificity. PMID:16597959

  15. Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data

    PubMed Central

    Kadarmideen, Haja N; Watson-haigh, Nathan S

    2012-01-01

    Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four different treatments with Metyrapone, an inhibitor of cortisol biosynthesis. We conducted several microarray quality control checks before applying GCN methods to filtered datasets. Then we compared the outputs of two methods using connectivity as a criterion, as it measures how well a node (gene) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT and node ranks in two methods were compared to identify those nodes which are highly differentially ranked (HDR). A total of 1,017 HDR nodes were identified across one or more of four networks. We investigated HDR nodes by gene enrichment analyses in relation to their biological relevance to phenotypes. We observed that, in contrast to WGCNA method, PCIT algorithm removes many of the edges of the most highly interconnected nodes. Removal of edges of most highly connected nodes or hub genes will have consequences for downstream analyses and biological interpretations. In general, for large GCN construction (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended. PMID:23144540

  16. Effect of two intermediate electron donors, NADPH and FADH(2), on Spirulina Delta (6)-desaturase co-expressed with two different immediate electron donors, cytochrome b (5) and ferredoxin, in Escherichia coli.

    PubMed

    Kurdrid, Pavinee; Subudhi, Sanjukta; Cheevadhanarak, Supapon; Tanticharoen, Morakot; Hongsthong, Apiradee

    2007-12-01

    When the gene desD encoding Spirulina Delta(6)-desaturase was heterologously expressed in E. coli, the enzyme was expressed without the ability to function. However, when this enzyme was co-expressed with an immediate electron donor, i.e. the cytochrome b (5) domain from Mucor rouxii, the results showed the production of GLA (gamma-linolenic acid), the product of the reaction catalyzed by Delta(6)-desaturase. The results revealed that in E. coli cells, where cytochrome b (5) is absent and ferredoxin, a natural electron donor of Delta(6)-desaturase, is present at a very low level, the cytochrome b (5) domain can complement for the function of ferredoxin in the host cells. In the present study, the Spirulina-ferredoxin gene was cloned and co-expressed with the Delta(6)-desaturase in E. coli. In comparison to the co-expression of cytochrome b ( 5 ) with the Delta(6)-desaturase, the co-expression with ferredoxin did not cause any differences in the GLA level. Moreover, the cultures containing the Delta(6)-desaturase co-expressed with cytochrome b (5) and ferredoxin were exogenously supplied with the intermediate electron donors, NADPH (nicotinamide adenine dinucleotide phosphate, reduced form) and FADH(2) (flavin adenine dinucleotide, reduced form), respectively. The GLA level in these host cells increased drastically, by approximately 50%, compared to the cells without the intermediate electron donors. The data indicated that besides the level of immediate electron donors, the level of intermediate electron donors is also critical for GLA production. Therefore, if the pools of the immediate and intermediate electron donors in the cells are manipulated, the GLA production in the heterologous host will be affected.

  17. Rhesus Monkey Rhadinovirus ORF57 Induces gH and gL Glycoprotein Expression through Posttranscriptional Accumulation of Target mRNAs ▿

    PubMed Central

    Shin, Young C.; Desrosiers, Ronald C.

    2011-01-01

    Open reading frame 57 (ORF57) of gamma-2 herpesviruses is a key regulator of viral gene expression. It has been reported to enhance the expression of viral genes by transcriptional, posttranscriptional, or translational activation mechanisms. Previously we have shown that the expression of gH and gL of rhesus monkey rhadinovirus (RRV), a close relative of the human Kaposi's sarcoma-associated herpesvirus (KSHV), could be dramatically rescued by codon optimization as well as by ORF57 coexpression (J. P. Bilello, J. S. Morgan, and R. C. Desrosiers, J. Virol. 82:7231–7237, 2008). We show here that ORF57 coexpression and codon optimization had similar effects, except that the rescue of expression by codon optimization was temporally delayed relative to that of ORF57 coexpression. The transfection of gL mRNA directly into cells with or without ORF57 coexpression and with or without codon optimization recapitulated the effects of these modes of induction on transfected DNA. These findings suggested an important role for the enhancement of mRNA stability and/or the translation of mRNA for these very different modes of induced expression. This conclusion was confirmed by several different measures of gH and gL mRNA stability and accumulation with or without ORF57 coexpression and with or without codon optimization. Our results indicate that RRV gH and gL expression is severely limited by the stability of the mRNA and that ORF57 coexpression and codon optimization independently induce gH and gL expression principally by allowing accumulation and translation of these mRNAs. PMID:21613403

  18. Vectors for co-expression of an unrestricted number of proteins

    PubMed Central

    Scheich, Christoph; Kümmel, Daniel; Soumailakakis, Dimitri; Heinemann, Udo; Büssow, Konrad

    2007-01-01

    A vector system is presented that allows generation of E. coli co-expression clones by a standardized, robust cloning procedure. The number of co-expressed proteins is not limited. Five ‘pQLink’ vectors for expression of His-tag and GST-tag fusion proteins as well as untagged proteins and for cloning by restriction enzymes or Gateway cloning were generated. The vectors allow proteins to be expressed individually; to achieve co-expression, two pQLink plasmids are combined by ligation-independent cloning. pQLink co-expression plasmids can accept an unrestricted number of genes. As an example, the co-expression of a heterotetrameric human transport protein particle (TRAPP) complex from a single plasmid, its isolation and analysis of its stoichiometry are shown. pQLink clones can be used directly for pull-down experiments if the proteins are expressed with different tags. We demonstrate pull-down experiments of human valosin-containing protein (VCP) with fragments of the autocrine motility factor receptor (AMFR). The cloning method avoids PCR or gel isolation of restriction fragments, and a single resistance marker and origin of replication are used, allowing over-expression of rare tRNAs from a second plasmid. It is expected that applications are not restricted to bacteria, but could include co-expression in other hosts such as Bacluovirus/insect cells. PMID:17311810

  19. Dynamic Visualization of Co-expression in Systems Genetics Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    New, Joshua Ryan; Huang, Jian; Chesler, Elissa J

    2008-01-01

    Biologists hope to address grand scientific challenges by exploring the abundance of data made available through modern microarray technology and other high-throughput techniques. The impact of this data, however, is limited unless researchers can effectively assimilate such complex information and integrate it into their daily research; interactive visualization tools are called for to support the effort. Specifically, typical studies of gene co-expression require novel visualization tools that enable the dynamic formulation and fine-tuning of hypotheses to aid the process of evaluating sensitivity of key parameters. These tools should allow biologists to develop an intuitive understanding of the structure of biologicalmore » networks and discover genes which reside in critical positions in networks and pathways. By using a graph as a universal data representation of correlation in gene expression data, our novel visualization tool employs several techniques that when used in an integrated manner provide innovative analytical capabilities. Our tool for interacting with gene co-expression data integrates techniques such as: graph layout, qualitative subgraph extraction through a novel 2D user interface, quantitative subgraph extraction using graph-theoretic algorithms or by querying an optimized b-tree, dynamic level-of-detail graph abstraction, and template-based fuzzy classification using neural networks. We demonstrate our system using a real-world workflow from a large-scale, systems genetics study of mammalian gene co-expression.« less

  20. miR-638 regulates gene expression networks associated with emphysematous lung destruction

    PubMed Central

    2013-01-01

    Background Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease characterized by varying degrees of emphysematous lung destruction and small airway disease, each with distinct effects on clinical outcomes. There is little known about how microRNAs contribute specifically to the emphysema phenotype. We examined how genome-wide microRNA expression is altered with regional emphysema severity and how these microRNAs regulate disease-associated gene expression networks. Methods We profiled microRNAs in different regions of the lung with varying degrees of emphysema from 6 smokers with COPD and 2 controls (8 regions × 8 lungs = 64 samples). Regional emphysema severity was quantified by mean linear intercept. Whole genome microRNA and gene expression data were integrated in the same samples to build co-expression networks. Candidate microRNAs were perturbed in human lung fibroblasts in order to validate these networks. Results The expression levels of 63 microRNAs (P < 0.05) were altered with regional emphysema. A subset, including miR-638, miR-30c, and miR-181d, had expression levels that were associated with those of their predicted mRNA targets. Genes correlated with these microRNAs were enriched in pathways associated with emphysema pathophysiology (for example, oxidative stress and accelerated aging). Inhibition of miR-638 expression in lung fibroblasts led to modulation of these same emphysema-related pathways. Gene targets of miR-638 in these pathways were amongst those negatively correlated with miR-638 expression in emphysema. Conclusions Our findings demonstrate that microRNAs are altered with regional emphysema severity and modulate disease-associated gene expression networks. Furthermore, miR-638 may regulate gene expression pathways related to the oxidative stress response and aging in emphysematous lung tissue and lung fibroblasts. PMID:24380442

  1. The condition-dependent transcriptional network in Escherichia coli.

    PubMed

    Lemmens, Karen; De Bie, Tijl; Dhollander, Thomas; Monsieurs, Pieter; De Moor, Bart; Collado-Vides, Julio; Engelen, Kristof; Marchal, Kathleen

    2009-03-01

    Thanks to the availability of high-throughput omics data, bioinformatics approaches are able to hypothesize thus-far undocumented genetic interactions. However, due to the amount of noise in these data, inferences based on a single data source are often unreliable. A popular approach to overcome this problem is to integrate different data sources. In this study, we describe DISTILLER, a novel framework for data integration that simultaneously analyzes microarray and motif information to find modules that consist of genes that are co-expressed in a subset of conditions, and their corresponding regulators. By applying our method on publicly available data, we evaluated the condition-specific transcriptional network of Escherichia coli. DISTILLER confirmed 62% of 736 interactions described in RegulonDB, and 278 novel interactions were predicted.

  2. Computational, Integrative, and Comparative Methods for the Elucidation of Genetic Coexpression Networks

    DOE PAGES

    Baldwin, Nicole E.; Chesler, Elissa J.; Kirov, Stefan; ...

    2005-01-01

    Gene expression microarray data can be used for the assembly of genetic coexpression network graphs. Using mRNA samples obtained from recombinant inbred Mus musculus strains, it is possible to integrate allelic variation with molecular and higher-order phenotypes. The depth of quantitative genetic analysis of microarray data can be vastly enhanced utilizing this mouse resource in combination with powerful computational algorithms, platforms, and data repositories. The resulting network graphs transect many levels of biological scale. This approach is illustrated with the extraction of cliques of putatively co-regulated genes and their annotation using gene ontology analysis and cis -regulatory element discovery. Themore » causal basis for co-regulation is detected through the use of quantitative trait locus mapping.« less

  3. Integration of miRNA and Protein Profiling Reveals Coordinated Neuroadaptations in the Alcohol-Dependent Mouse Brain

    PubMed Central

    Gorini, Giorgio; Nunez, Yury O.; Mayfield, R. Dayne

    2013-01-01

    The molecular mechanisms underlying alcohol dependence involve different neurochemical systems and are brain region-dependent. Chronic Intermittent Ethanol (CIE) procedure, combined with a Two-Bottle Choice voluntary drinking paradigm, represents one of the best available animal models for alcohol dependence and relapse drinking. MicroRNAs, master regulators of the cellular transcriptome and proteome, can regulate their targets in a cooperative, combinatorial fashion, ensuring fine tuning and control over a large number of cellular functions. We analyzed cortex and midbrain microRNA expression levels using an integrative approach to combine and relate data to previous protein profiling from the same CIE-subjected samples, and examined the significance of the data in terms of relative contribution to alcohol consumption and dependence. MicroRNA levels were significantly altered in CIE-exposed dependent mice compared with their non-dependent controls. More importantly, our integrative analysis identified modules of coexpressed microRNAs that were highly correlated with CIE effects and predicted target genes encoding differentially expressed proteins. Coexpressed CIE-relevant proteins, in turn, were often negatively correlated with specific microRNA modules. Our results provide evidence that microRNA-orchestrated translational imbalances are driving the behavioral transition from alcohol consumption to dependence. This study represents the first attempt to combine ex vivo microRNA and protein expression on a global scale from the same mammalian brain samples. The integrative systems approach used here will improve our understanding of brain adaptive changes in response to drug abuse and suggests the potential therapeutic use of microRNAs as tools to prevent or compensate multiple neuroadaptations underlying addictive behavior. PMID:24358208

  4. Enteral High Fat-Polyunsaturated Fatty Acid Blend Alters the Pathogen Composition of the Intestinal Microbiome in Premature Infants with an Enterostomy

    PubMed Central

    Younge, Noelle; Yang, Qing; Seed, Patrick C.

    2016-01-01

    Objective To determine the effect of enteral fish oil and safflower oil supplementation on the intestinal microbiome in premature infants with an enterostomy. Study design Premature infants with an enterostomy were randomized to receive early enteral supplementation with a high fat-polyunsaturated fatty acid (HF-PUFA) blend of fish oil and safflower oil versus standard nutritional therapy. We used 16S rRNA gene sequencing for longitudinal profiling of the microbiome from the time of study entry until bowel reanastomosis. We used weighted gene co-expression network analysis to identify microbial community modules that differed between study groups over time. We performed imputed metagenomic analysis to determine metabolic pathways associated with the microbial genes. Results Sixteen infants were randomized to receive enteral HF-PUFA supplementation and 16 infants received standard care. The intestinal microbiota of infants in the treatment group differed from those in the control group, with greater bacterial diversity and lower abundance of Streptococcus, Clostridium, and many pathogenic genera within the Enterobacteriaceae family. We identified four microbial community modules with significant differences between groups over time. Imputed metagenomic analysis of the microbial genes revealed metabolic pathways that differed between groups, including metabolism of amino acids, carbohydrates, fatty acids, and secondary bile acid synthesis. Conclusion Enteral HF-PUFA supplementation was associated with decreased abundance of pathogenic bacteria, greater bacterial diversity, and shifts in the potential metabolic functions of intestinal microbiota. Trial registration ClinicalTrials.gov: NCT01306838 PMID:27856001

  5. Enteral High Fat-Polyunsaturated Fatty Acid Blend Alters the Pathogen Composition of the Intestinal Microbiome in Premature Infants with an Enterostomy.

    PubMed

    Younge, Noelle; Yang, Qing; Seed, Patrick C

    2017-02-01

    To determine the effect of enteral fish oil and safflower oil supplementation on the intestinal microbiome in infants with an enterostomy born premature. Infants with an enterostomy born premature were randomized to receive early enteral supplementation with a high-fat polyunsaturated fatty acid (HF-PUFA) blend of fish oil and safflower oil vs standard nutritional therapy. We used 16S rRNA gene sequencing for longitudinal profiling of the microbiome from the time of study entry until bowel reanastomosis. We used weighted gene coexpression network analysis to identify microbial community modules that differed between study groups over time. We performed imputed metagenomic analysis to determine metabolic pathways associated with the microbial genes. Sixteen infants were randomized to receive enteral HF-PUFA supplementation, and 16 infants received standard care. The intestinal microbiota of infants in the treatment group differed from those in the control group, with greater bacterial diversity and lower abundance of Streptococcus, Clostridium, and many pathogenic genera within the Enterobacteriaceae family. We identified 4 microbial community modules with significant differences between groups over time. Imputed metagenomic analysis of the microbial genes revealed metabolic pathways that differed between groups, including metabolism of amino acids, carbohydrates, fatty acids, and secondary bile acid synthesis. Enteral HF-PUFA supplementation was associated with decreased abundance of pathogenic bacteria, greater bacterial diversity, and shifts in the potential metabolic functions of intestinal microbiota. ClinicalTrials.gov:NCT01306838. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. BiGGEsTS: integrated environment for biclustering analysis of time series gene expression data

    PubMed Central

    Gonçalves, Joana P; Madeira, Sara C; Oliveira, Arlindo L

    2009-01-01

    Background The ability to monitor changes in expression patterns over time, and to observe the emergence of coherent temporal responses using expression time series, is critical to advance our understanding of complex biological processes. Biclustering has been recognized as an effective method for discovering local temporal expression patterns and unraveling potential regulatory mechanisms. The general biclustering problem is NP-hard. In the case of time series this problem is tractable, and efficient algorithms can be used. However, there is still a need for specialized applications able to take advantage of the temporal properties inherent to expression time series, both from a computational and a biological perspective. Findings BiGGEsTS makes available state-of-the-art biclustering algorithms for analyzing expression time series. Gene Ontology (GO) annotations are used to assess the biological relevance of the biclusters. Methods for preprocessing expression time series and post-processing results are also included. The analysis is additionally supported by a visualization module capable of displaying informative representations of the data, including heatmaps, dendrograms, expression charts and graphs of enriched GO terms. Conclusion BiGGEsTS is a free open source graphical software tool for revealing local coexpression of genes in specific intervals of time, while integrating meaningful information on gene annotations. It is freely available at: . We present a case study on the discovery of transcriptional regulatory modules in the response of Saccharomyces cerevisiae to heat stress. PMID:19583847

  7. Step-by-Step Construction of Gene Co-expression Networks from High-Throughput Arabidopsis RNA Sequencing Data.

    PubMed

    Contreras-López, Orlando; Moyano, Tomás C; Soto, Daniela C; Gutiérrez, Rodrigo A

    2018-01-01

    The rapid increase in the availability of transcriptomics data generated by RNA sequencing represents both a challenge and an opportunity for biologists without bioinformatics training. The challenge is handling, integrating, and interpreting these data sets. The opportunity is to use this information to generate testable hypothesis to understand molecular mechanisms controlling gene expression and biological processes (Fig. 1). A successful strategy to generate tractable hypotheses from transcriptomics data has been to build undirected network graphs based on patterns of gene co-expression. Many examples of new hypothesis derived from network analyses can be found in the literature, spanning different organisms including plants and specific fields such as root developmental biology.In order to make the process of constructing a gene co-expression network more accessible to biologists, here we provide step-by-step instructions using published RNA-seq experimental data obtained from a public database. Similar strategies have been used in previous studies to advance root developmental biology. This guide includes basic instructions for the operation of widely used open source platforms such as Bio-Linux, R, and Cytoscape. Even though the data we used in this example was obtained from Arabidopsis thaliana, the workflow developed in this guide can be easily adapted to work with RNA-seq data from any organism.

  8. Co-Expression of Monodehydroascorbate Reductase and Dehydroascorbate Reductase from Brassica rapa Effectively Confers Tolerance to Freezing-Induced Oxidative Stress

    PubMed Central

    Shin, Sun-Young; Kim, Myung-Hee; Kim, Yul-Ho; Park, Hyang-Mi; Yoon, Ho-Sung

    2013-01-01

    Plants are exposed to various environmental stresses and have therefore developed antioxidant enzymes and molecules to protect their cellular components against toxicity derived from reactive oxygen species (ROS). Ascorbate is a very important antioxidant molecule in plants, and monodehydroascorbate reductase (MDHAR; EC 1.6.5.4) and dehydroascorbate reductase (DHAR; EC 1.8.5.1) are essential to regeneration of ascorbate for maintenance of ROS scavenging ability. The MDHAR and DHAR genes from Brassica rapa were cloned, transgenic plants overexpressing either BrMDHAR and BrDHAR were established, and then, each transgenic plant was hybridized to examine the effects of co-expression of both genes conferring tolerance to freezing. Transgenic plants co-overexpressing BrMDHAR and BrDHAR showed activated expression of relative antioxidant enzymes, and enhanced levels of glutathione and phenolics under freezing condition. Then, these alteration caused by co-expression led to alleviated redox status and lipid peroxidation and consequently conferred improved tolerance against severe freezing stress compared to transgenic plants overexpressing single gene. The results of this study suggested that although each expression of BrMDHAR or BrDHAR was available to according tolerance to freezing, the simultaneous expression of two genes generated synergistic effects conferring improved tolerance more effectively even severe freezing. PMID:24170089

  9. The use of open source bioinformatics tools to dissect transcriptomic data.

    PubMed

    Nitsche, Benjamin M; Ram, Arthur F J; Meyer, Vera

    2012-01-01

    Microarrays are a valuable technology to study fungal physiology on a transcriptomic level. Various microarray platforms are available comprising both single and two channel arrays. Despite different technologies, preprocessing of microarray data generally includes quality control, background correction, normalization, and summarization of probe level data. Subsequently, depending on the experimental design, diverse statistical analysis can be performed, including the identification of differentially expressed genes and the construction of gene coexpression networks.We describe how Bioconductor, a collection of open source and open development packages for the statistical programming language R, can be used for dissecting microarray data. We provide fundamental details that facilitate the process of getting started with R and Bioconductor. Using two publicly available microarray datasets from Aspergillus niger, we give detailed protocols on how to identify differentially expressed genes and how to construct gene coexpression networks.

  10. Estimation of the proteomic cancer co-expression sub networks by using association estimators.

    PubMed

    Erdoğan, Cihat; Kurt, Zeyneb; Diri, Banu

    2017-01-01

    In this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA). Correlation and mutual information (MI) based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators' performance, a multi-layer data integration platform on gene-disease associations (DisGeNET) and the Molecular Signatures Database (MSigDB) was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA) package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink) and 64% for Schurmann-Grassberger (SG) association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists.

  11. Estimation of the proteomic cancer co-expression sub networks by using association estimators

    PubMed Central

    Kurt, Zeyneb; Diri, Banu

    2017-01-01

    In this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA). Correlation and mutual information (MI) based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators’ performance, a multi-layer data integration platform on gene-disease associations (DisGeNET) and the Molecular Signatures Database (MSigDB) was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA) package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink) and 64% for Schurmann-Grassberger (SG) association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists. PMID:29145449

  12. Single-nucleotide polymorphism-gene intermixed networking reveals co-linkers connected to multiple gene expression phenotypes

    PubMed Central

    Gong, Bin-Sheng; Zhang, Qing-Pu; Zhang, Guang-Mei; Zhang, Shao-Jun; Zhang, Wei; Lv, Hong-Chao; Zhang, Fan; Lv, Sa-Li; Li, Chuan-Xing; Rao, Shao-Qi; Li, Xia

    2007-01-01

    Gene expression profiles and single-nucleotide polymorphism (SNP) profiles are modern data for genetic analysis. It is possible to use the two types of information to analyze the relationships among genes by some genetical genomics approaches. In this study, gene expression profiles were used as expression traits. And relationships among the genes, which were co-linked to a common SNP(s), were identified by integrating the two types of information. Further research on the co-expressions among the co-linked genes was carried out after the gene-SNP relationships were established using the Haseman-Elston sib-pair regression. The results showed that the co-expressions among the co-linked genes were significantly higher if the number of connections between the genes and a SNP(s) was more than six. Then, the genes were interconnected via one or more SNP co-linkers to construct a gene-SNP intermixed network. The genes sharing more SNPs tended to have a stronger correlation. Finally, a gene-gene network was constructed with their intensities of relationships (the number of SNP co-linkers shared) as the weights for the edges. PMID:18466544

  13. Human primitive brain displays negative mitochondrial-nuclear expression correlation of respiratory genes.

    PubMed

    Barshad, Gilad; Blumberg, Amit; Cohen, Tal; Mishmar, Dan

    2018-06-14

    Oxidative phosphorylation (OXPHOS), a fundamental energy source in all human tissues, requires interactions between mitochondrial (mtDNA)- and nuclear (nDNA)-encoded protein subunits. Although such interactions are fundamental to OXPHOS, bi-genomic coregulation is poorly understood. To address this question, we analyzed ∼8500 RNA-seq experiments from 48 human body sites. Despite well-known variation in mitochondrial activity, quantity, and morphology, we found overall positive mtDNA-nDNA OXPHOS genes' co-expression across human tissues. Nevertheless, negative mtDNA-nDNA gene expression correlation was identified in the hypothalamus, basal ganglia, and amygdala (subcortical brain regions, collectively termed the "primitive" brain). Single-cell RNA-seq analysis of mouse and human brains revealed that this phenomenon is evolutionarily conserved, and both are influenced by brain cell types (involving excitatory/inhibitory neurons and nonneuronal cells) and by their spatial brain location. As the "primitive" brain is highly oxidative, we hypothesized that such negative mtDNA-nDNA co-expression likely controls for the high mtDNA transcript levels, which enforce tight OXPHOS regulation, rather than rewiring toward glycolysis. Accordingly, we found "primitive" brain-specific up-regulation of lactate dehydrogenase B ( LDHB ), which associates with high OXPHOS activity, at the expense of LDHA , which promotes glycolysis. Analyses of co-expression, DNase-seq, and ChIP-seq experiments revealed candidate RNA-binding proteins and CEBPB as the best regulatory candidates to explain these phenomena. Finally, cross-tissue expression analysis unearthed tissue-dependent splice variants and OXPHOS subunit paralogs and allowed revising the list of canonical OXPHOS transcripts. Taken together, our analysis provides a comprehensive view of mito-nuclear gene co-expression across human tissues and provides overall insights into the bi-genomic regulation of mitochondrial activities. © 2018 Barshad et al.; Published by Cold Spring Harbor Laboratory Press.

  14. Exploring Transcription Factors-microRNAs Co-regulation Networks in Schizophrenia.

    PubMed

    Xu, Yong; Yue, Weihua; Yao Shugart, Yin; Li, Sheng; Cai, Lei; Li, Qiang; Cheng, Zaohuo; Wang, Guoqiang; Zhou, Zhenhe; Jin, Chunhui; Yuan, Jianmin; Tian, Lin; Wang, Jun; Zhang, Kai; Zhang, Kerang; Liu, Sha; Song, Yuqing; Zhang, Fuquan

    2016-07-01

    Transcriptional factors (TFs) and microRNAs (miRNAs) have been recognized as 2 classes of principal gene regulators that may be responsible for genome coexpression changes observed in schizophrenia (SZ). This study aims to (1) identify differentially coexpressed genes (DCGs) in 3 mRNA expression microarray datasets; (2) explore potential interactions among the DCGs, and differentially expressed miRNAs identified in our dataset composed of early-onset SZ patients and healthy controls; (3) validate expression levels of some key transcripts; and (4) explore the druggability of DCGs using the curated database. We detected a differential coexpression network associated with SZ and found that 9 out of the 12 regulators were replicated in either of the 2 other datasets. Leveraging the differentially expressed miRNAs identified in our previous dataset, we constructed a miRNA-TF-gene network relevant to SZ, including an EGR1-miR-124-3p-SKIL feed-forward loop. Our real-time quantitative PCR analysis indicated the overexpression of miR-124-3p, the under expression of SKIL and EGR1 in the blood of SZ patients compared with controls, and the direction of change of miR-124-3p and SKIL mRNA levels in SZ cases were reversed after a 12-week treatment cycle. Our druggability analysis revealed that many of these genes have the potential to be drug targets. Together, our results suggest that coexpression network abnormalities driven by combinatorial and interactive action from TFs and miRNAs may contribute to the development of SZ and be relevant to the clinical treatment of the disease. © The Author 2015. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  15. Exploring Transcription Factors-microRNAs Co-regulation Networks in Schizophrenia

    PubMed Central

    Xu, Yong; Yue, Weihua; Yao Shugart, Yin; Li, Sheng; Cai, Lei; Li, Qiang; Cheng, Zaohuo; Wang, Guoqiang; Zhou, Zhenhe; Jin, Chunhui; Yuan, Jianmin; Tian, Lin; Wang, Jun; Zhang, Kai; Zhang, Kerang; Liu, Sha; Song, Yuqing; Zhang, Fuquan

    2016-01-01

    Background: Transcriptional factors (TFs) and microRNAs (miRNAs) have been recognized as 2 classes of principal gene regulators that may be responsible for genome coexpression changes observed in schizophrenia (SZ). Methods: This study aims to (1) identify differentially coexpressed genes (DCGs) in 3 mRNA expression microarray datasets; (2) explore potential interactions among the DCGs, and differentially expressed miRNAs identified in our dataset composed of early-onset SZ patients and healthy controls; (3) validate expression levels of some key transcripts; and (4) explore the druggability of DCGs using the curated database. Results: We detected a differential coexpression network associated with SZ and found that 9 out of the 12 regulators were replicated in either of the 2 other datasets. Leveraging the differentially expressed miRNAs identified in our previous dataset, we constructed a miRNA–TF–gene network relevant to SZ, including an EGR1–miR-124-3p–SKIL feed-forward loop. Our real-time quantitative PCR analysis indicated the overexpression of miR-124-3p, the under expression of SKIL and EGR1 in the blood of SZ patients compared with controls, and the direction of change of miR-124-3p and SKIL mRNA levels in SZ cases were reversed after a 12-week treatment cycle. Our druggability analysis revealed that many of these genes have the potential to be drug targets. Conclusions: Together, our results suggest that coexpression network abnormalities driven by combinatorial and interactive action from TFs and miRNAs may contribute to the development of SZ and be relevant to the clinical treatment of the disease. PMID:26609121

  16. Transcriptome comparison and gene coexpression network analysis provide a systems view of citrus response to ‘Candidatus Liberibacter asiaticus’ infection

    PubMed Central

    2013-01-01

    Background Huanglongbing (HLB) is arguably the most destructive disease for the citrus industry. HLB is caused by infection of the bacterium, Candidatus Liberibacter spp. Several citrus GeneChip studies have revealed thousands of genes that are up- or down-regulated by infection with Ca. Liberibacter asiaticus. However, whether and how these host genes act to protect against HLB remains poorly understood. Results As a first step towards a mechanistic view of citrus in response to the HLB bacterial infection, we performed a comparative transcriptome analysis and found that a total of 21 Probesets are commonly up-regulated by the HLB bacterial infection. In addition, a number of genes are likely regulated specifically at early, late or very late stages of the infection. Furthermore, using Pearson correlation coefficient-based gene coexpression analysis, we constructed a citrus HLB response network consisting of 3,507 Probesets and 56,287 interactions. Genes involved in carbohydrate and nitrogen metabolic processes, transport, defense, signaling and hormone response were overrepresented in the HLB response network and the subnetworks for these processes were constructed. Analysis of the defense and hormone response subnetworks indicates that hormone response is interconnected with defense response. In addition, mapping the commonly up-regulated HLB responsive genes into the HLB response network resulted in a core subnetwork where transport plays a key role in the citrus response to the HLB bacterial infection. Moreover, analysis of a phloem protein subnetwork indicates a role for this protein and zinc transporters or zinc-binding proteins in the citrus HLB defense response. Conclusion Through integrating transcriptome comparison and gene coexpression network analysis, we have provided for the first time a systems view of citrus in response to the Ca. Liberibacter spp. infection causing HLB. PMID:23324561

  17. Analysis of differentially co-expressed genes based on microarray data of hepatocellular carcinoma.

    PubMed

    Wang, Y; Jiang, T; Li, Z; Lu, L; Zhang, R; Zhang, D; Wang, X; Tan, J

    2017-01-01

    Hepatocellular carcinoma (HCC) is the third leading cause of cancer related death worldwide. Although great progress in diagnosis and management of HCC have been made, the exact molecular mechanisms remain poorly understood. The study aims to identify potential biomarkers for HCC progression, mainly at transcription level. In this study, chip data GSE 29721 was utilized, which contains 10 HCC samples and 10 normal adjacent tissue samples. Differentially expressed genes (DEGs) between two sample types were selected by t-test method. Following, the differentially co-expressed genes (DCGs) and differentially co-expressed Links (DCLs) were identified by DCGL package in R with the threshold of q < 0.25. Afterwards, pathway enrichment analysis of the DCGs was carried out by DAVID. Then, DCLs were mapped to TRANSFAC database to reveal associations between relevant transcriptional factors (TFs) and their target genes. Quantitative real-time RT-PCR was performed for TFs or genes of interest. As a result, a total of 388 DCGs and 35,771 DCLs were obtained. The predominant pathways enriched by these genes were Cytokine-cytokine receptor interaction, ECM-receptor interaction and TGF-β signaling pathway. Three TF-target interactions, LEF1-NCAM1, EGR1-FN1 and FOS-MT2A were predicted. Compared with control, expressions of the TF genes EGR1, FOS and ETS2 were all up-regulated in the HCC cell line, HepG2; while LEF1 was down-regulated. Except NCAM1, all the target genes were up-regulated in HepG2. Our findings suggest these TFs and genes might play important roles in the pathogenesis of HCC and may be used as therapeutic targets for HCC management.

  18. Transcriptome profiling of equine vitamin E deficient neuroaxonal dystrophy identifies upregulation of liver X receptor target genes

    PubMed Central

    Finno, Carrie J.; Bordbari, Matthew H.; Valberg, Stephanie J.; Lee, David; Herron, Josi; Hines, Kelly; Monsour, Tamer; Scott, Erica; Bannasch, Danika L.; Mickelson, James; Xu, Libin

    2016-01-01

    Specific spontaneous heritable neurodegenerative diseases have been associated with lower serum and cerebrospinal fluid α-tocopherol (α-TOH) concentrations. Equine neuroaxonal dystrophy (eNAD) has similar histologic lesions to human ataxia with vitamin E deficiency caused by mutations in the α-TOH transfer protein gene (TTPA). Mutations in TTPA are not present with eNAD and the molecular basis remains unknown. Given the neuropathologic phenotypic similarity of the conditions, we assessed the molecular basis of eNAD by global transcriptome sequencing of the cervical spinal cord. Differential gene expression analysis identified 157 significantly (FDR<0.05) dysregulated transcripts within the spinal cord of eNAD-affected horses. Statistical enrichment analysis identified significant downregulation of the ionotropic and metabotropic group III glutamate receptor, synaptic vesicle trafficking and cholesterol biosynthesis pathways. Gene co-expression analysis identified one module of upregulated genes significantly associated with the eNAD phenotype that included the liver X receptor (LXR) targets CYP7A1, APOE, PLTP and ABCA1. Validation of CYP7A1 and APOE dysregulation was performed in an independent biologic group and CYP7A1 was found to be additionally upregulated in the medulla oblongata of eNAD horses. Evidence of LXR activation supports a role for modulation of oxysterol-dependent LXR transcription factor activity by tocopherols. We hypothesize that the protective role of α-TOH in eNAD may reside in its ability to prevent oxysterol accumulation and subsequent activation of the LXR in order to decrease lipid peroxidation associated neurodegeneration. PMID:27751910

  19. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature.

    PubMed

    Ye, Ning; Yin, Hengfu; Liu, Jingjing; Dai, Xiaogang; Yin, Tongming

    2015-01-01

    The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.

  20. The Competence of Maize Shoot Meristems for Integrative Transformation and Inherited Expression of Transgenes.

    PubMed Central

    Zhong, H.; Sun, B.; Warkentin, D.; Zhang, S.; Wu, R.; Wu, T.; Sticklen, M. B.

    1996-01-01

    We have developed a novel and reproducible system for recovery of fertile transgenic maize (Zea mays L.) plants. The transformation was performed using microprojectile bombardment of cultured shoot apices of maize with a plasmid carrying two linked genes, the Streptomyces hygroscopicus phosphinothricin acetyltransferase gene (bar) and the potato proteinase inhibitor II gene, either alone or in combination with another plasmid containing the 5[prime] region of the rice actin 1 gene fused to the Escherichia coli [beta]-glucuronidase gene (gus). Bombarded shoot apices were subsequently multiplied and selected under 3 to 5 mg/L glufosinate ammonium. Co-transformation frequency was 100% (146/146) for linked genes and 80% (41/51) for unlinked genes. Co-expression frequency of the bar and gus genes was 57% (29/51). The co-integration, co-inheritance, and co-expression of bar, the potato proteinase inhibitor II gene, and gus in transgenic R0, R1, and R2 plants were confirmed. Localized expression of the actin 1-GUS protein in the R0 and R1 plants was extensively analyzed by histochemical and fluorometric assays. PMID:12226244

  1. From protein-protein interactions to protein co-expression networks: a new perspective to evaluate large-scale proteomic data.

    PubMed

    Vella, Danila; Zoppis, Italo; Mauri, Giancarlo; Mauri, Pierluigi; Di Silvestre, Dario

    2017-12-01

    The reductionist approach of dissecting biological systems into their constituents has been successful in the first stage of the molecular biology to elucidate the chemical basis of several biological processes. This knowledge helped biologists to understand the complexity of the biological systems evidencing that most biological functions do not arise from individual molecules; thus, realizing that the emergent properties of the biological systems cannot be explained or be predicted by investigating individual molecules without taking into consideration their relations. Thanks to the improvement of the current -omics technologies and the increasing understanding of the molecular relationships, even more studies are evaluating the biological systems through approaches based on graph theory. Genomic and proteomic data are often combined with protein-protein interaction (PPI) networks whose structure is routinely analyzed by algorithms and tools to characterize hubs/bottlenecks and topological, functional, and disease modules. On the other hand, co-expression networks represent a complementary procedure that give the opportunity to evaluate at system level including organisms that lack information on PPIs. Based on these premises, we introduce the reader to the PPI and to the co-expression networks, including aspects of reconstruction and analysis. In particular, the new idea to evaluate large-scale proteomic data by means of co-expression networks will be discussed presenting some examples of application. Their use to infer biological knowledge will be shown, and a special attention will be devoted to the topological and module analysis.

  2. Social Plasticity Relies on Different Neuroplasticity Mechanisms across the Brain Social Decision-Making Network in Zebrafish.

    PubMed

    Teles, Magda C; Cardoso, Sara D; Oliveira, Rui F

    2016-01-01

    Social living animals need to adjust the expression of their behavior to their status within the group and to changes in social context and this ability (social plasticity) has an impact on their Darwinian fitness. At the proximate level social plasticity must rely on neuroplasticity in the brain social decision-making network (SDMN) that underlies the expression of social behavior, such that the same neural circuit may underlie the expression of different behaviors depending on social context. Here we tested this hypothesis in zebrafish by characterizing the gene expression response in the SDMN to changes in social status of a set of genes involved in different types of neural plasticity: bdnf, involved in changes in synaptic strength; npas4, involved in contextual learning and dependent establishment of GABAergic synapses; neuroligins (nlgn1 and nlgn2) as synaptogenesis markers; and genes involved in adult neurogenesis (wnt3 and neurod). Four social phenotypes were experimentally induced: Winners and Losers of a real-opponent interaction; Mirror-fighters, that fight their own image in a mirror and thus do not experience a change in social status despite the expression of aggressive behavior; and non-interacting fish, which were used as a reference group. Our results show that each social phenotype (i.e., Winners, Losers, and Mirror-fighters) present specific patterns of gene expression across the SDMN, and that different neuroplasticity genes are differentially expressed in different nodes of the network (e.g., BDNF in the dorsolateral telencephalon, which is a putative teleost homolog of the mammalian hippocampus). Winners expressed unique patterns of gene co-expression across the SDMN, whereas in Losers and Mirror-fighters the co-expression patterns were similar in the dorsal regions of the telencephalon and in the supracommissural nucleus of the ventral telencephalic area, but differents in the remaining regions of the ventral telencephalon. These results indicate that social plasticity relies on multiple neuroplasticity mechanisms across the SDMN, and that there is not a single neuromolecular module underlying this type of behavioral flexibility.

  3. Social Plasticity Relies on Different Neuroplasticity Mechanisms across the Brain Social Decision-Making Network in Zebrafish

    PubMed Central

    Teles, Magda C.; Cardoso, Sara D.; Oliveira, Rui F.

    2016-01-01

    Social living animals need to adjust the expression of their behavior to their status within the group and to changes in social context and this ability (social plasticity) has an impact on their Darwinian fitness. At the proximate level social plasticity must rely on neuroplasticity in the brain social decision-making network (SDMN) that underlies the expression of social behavior, such that the same neural circuit may underlie the expression of different behaviors depending on social context. Here we tested this hypothesis in zebrafish by characterizing the gene expression response in the SDMN to changes in social status of a set of genes involved in different types of neural plasticity: bdnf, involved in changes in synaptic strength; npas4, involved in contextual learning and dependent establishment of GABAergic synapses; neuroligins (nlgn1 and nlgn2) as synaptogenesis markers; and genes involved in adult neurogenesis (wnt3 and neurod). Four social phenotypes were experimentally induced: Winners and Losers of a real-opponent interaction; Mirror-fighters, that fight their own image in a mirror and thus do not experience a change in social status despite the expression of aggressive behavior; and non-interacting fish, which were used as a reference group. Our results show that each social phenotype (i.e., Winners, Losers, and Mirror-fighters) present specific patterns of gene expression across the SDMN, and that different neuroplasticity genes are differentially expressed in different nodes of the network (e.g., BDNF in the dorsolateral telencephalon, which is a putative teleost homolog of the mammalian hippocampus). Winners expressed unique patterns of gene co-expression across the SDMN, whereas in Losers and Mirror-fighters the co-expression patterns were similar in the dorsal regions of the telencephalon and in the supracommissural nucleus of the ventral telencephalic area, but differents in the remaining regions of the ventral telencephalon. These results indicate that social plasticity relies on multiple neuroplasticity mechanisms across the SDMN, and that there is not a single neuromolecular module underlying this type of behavioral flexibility. PMID:26909029

  4. Intra- and interspecies gene expression models for predicting drug response in canine osteosarcoma.

    PubMed

    Fowles, Jared S; Brown, Kristen C; Hess, Ann M; Duval, Dawn L; Gustafson, Daniel L

    2016-02-19

    Genomics-based predictors of drug response have the potential to improve outcomes associated with cancer therapy. Osteosarcoma (OS), the most common primary bone cancer in dogs, is commonly treated with adjuvant doxorubicin or carboplatin following amputation of the affected limb. We evaluated the use of gene-expression based models built in an intra- or interspecies manner to predict chemosensitivity and treatment outcome in canine OS. Models were built and evaluated using microarray gene expression and drug sensitivity data from human and canine cancer cell lines, and canine OS tumor datasets. The "COXEN" method was utilized to filter gene signatures between human and dog datasets based on strong co-expression patterns. Models were built using linear discriminant analysis via the misclassification penalized posterior algorithm. The best doxorubicin model involved genes identified in human lines that were co-expressed and trained on canine OS tumor data, which accurately predicted clinical outcome in 73 % of dogs (p = 0.0262, binomial). The best carboplatin model utilized canine lines for gene identification and model training, with canine OS tumor data for co-expression. Dogs whose treatment matched our predictions had significantly better clinical outcomes than those that didn't (p = 0.0006, Log Rank), and this predictor significantly associated with longer disease free intervals in a Cox multivariate analysis (hazard ratio = 0.3102, p = 0.0124). Our data show that intra- and interspecies gene expression models can successfully predict response in canine OS, which may improve outcome in dogs and serve as pre-clinical validation for similar methods in human cancer research.

  5. Coexpression networks implicate human midfetal deep cortical projection neurons in the pathogenesis of autism

    PubMed Central

    Willsey, A. Jeremy; Sanders, Stephan J.; Li, Mingfeng; Dong, Shan; Tebbenkamp, Andrew T.; Muhle, Rebecca A.; Reilly, Steven K.; Lin, Leon; Fertuzinhos, Sofia; Miller, Jeremy A.; Murtha, Michael T.; Bichsel, Candace; Niu, Wei; Cotney, Justin; Ercan-Sencicek, A. Gulhan; Gockley, Jake; Gupta, Abha; Han, Wenqi; He, Xin; Hoffman, Ellen; Klei, Lambertus; Lei, Jing; Liu, Wenzhong; Liu, Li; Lu, Cong; Xu, Xuming; Zhu, Ying; Mane, Shrikant M.; Lein, Edward S.; Wei, Liping; Noonan, James P.; Roeder, Kathryn; Devlin, Bernie; Šestan, Nenad; State, Matthew W.

    2013-01-01

    SUMMARY Autism spectrum disorder (ASD) is a complex developmental syndrome of unknown etiology. Recent studies employing exome- and genome-wide sequencing have identified nine high-confidence ASD (hcASD) genes. Working from the hypothesis that ASD-associated mutations in these biologically pleiotropic genes will disrupt intersecting developmental processes to contribute to a common phenotype, we have attempted to identify time periods, brain regions, and cell types in which these genes converge. We have constructed coexpression networks based on the hcASD “seed” genes, leveraging a rich expression data set encompassing multiple human brain regions across human development and into adulthood. By assessing enrichment of an independent set of probable ASD (pASD) genes, derived from the same sequencing studies, we demonstrate a key point of convergence in midfetal layer 5/6 cortical projection neurons. This approach informs when, where, and in what cell types mutations in these specific genes may be productively studied to clarify ASD pathophysiology. PMID:24267886

  6. Transcriptome Analysis of CD4+ T Cells in Coeliac Disease Reveals Imprint of BACH2 and IFNγ Regulation

    PubMed Central

    Molloy, Ben; Dominguez Castro, Patricia; Cormican, Paul; Trimble, Valerie; Mahmud, Nasir; McManus, Ross

    2015-01-01

    Genetic studies have to date identified 43 genome wide significant coeliac disease susceptibility (CD) loci comprising over 70 candidate genes. However, how altered regulation of such disease associated genes contributes to CD pathogenesis remains to be elucidated. Recently there has been considerable emphasis on characterising cell type specific and stimulus dependent genetic variants. Therefore in this study we used RNA sequencing to profile over 70 transcriptomes of CD4+ T cells, a cell type crucial for CD pathogenesis, in both stimulated and resting samples from individuals with CD and unaffected controls. We identified extensive transcriptional changes across all conditions, with the previously established CD gene IFNy the most strongly up-regulated gene (log2 fold change 4.6; Padjusted = 2.40x10-11) in CD4+ T cells from CD patients compared to controls. We show a significant correlation of differentially expressed genes with genetic studies of the disease to date (Padjusted = 0.002), and 21 CD candidate susceptibility genes are differentially expressed under one or more of the conditions used in this study. Pathway analysis revealed significant enrichment of immune related processes. Co-expression network analysis identified several modules of coordinately expressed CD genes. Two modules were particularly highly enriched for differentially expressed genes (P<2.2x10-16) and highlighted IFNy and the genetically associated transcription factor BACH2 which showed significantly reduced expression in coeliac samples (log2FC -1.75; Padjusted = 3.6x10-3) as key regulatory genes in CD. Genes regulated by BACH2 were very significantly over-represented among our differentially expressed genes (P<2.2x10-16) indicating that reduced expression of this master regulator of T cell differentiation promotes a pro-inflammatory response and strongly corroborates genetic evidence that BACH2 plays an important role in CD pathogenesis. PMID:26444573

  7. The modularity and dynamicity of miRNA-mRNA interactions in high-grade serous ovarian carcinomas and the prognostic implication.

    PubMed

    Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Flemington, Erik K; Zhang, Kun

    2016-08-01

    Ovarian carcinoma is the fifth-leading cause of cancer death among women in the United States. Major reasons for this persistent mortality include the poor understanding of the underlying biology and a lack of reliable biomarkers. Previous studies have shown that aberrantly expressed MicroRNAs (miRNAs) are involved in carcinogenesis and tumor progression by post-transcriptionally regulating gene expression. However, the interference of miRNAs in tumorigenesis is quite complicated and far from being fully understood. In this work, by an integrative analysis of mRNA expression, miRNA expression and clinical data published by The Cancer Genome Atlas (TCGA), we studied the modularity and dynamicity of miRNA-mRNA interactions and the prognostic implications in high-grade serous ovarian carcinomas. With the top transcriptional correlations (Bonferroni-adjusted p-value<0.01) as inputs, we identified five miRNA-mRNA module pairs (MPs), each of which included one positive-connection (correlation) module and one negative-connection (correlation) module. The number of miRNAs or mRNAs in each module varied from 3 to 7 or from 2 to 873. Among the four major negative-connection modules, three fit well with the widely accepted miRNA-mediated post-transcriptional regulation theory. These modules were enriched with the genes relevant to cell cycle and immune response. Moreover, we proposed two novel algorithms to reveal the group or sample specific dynamic regulations between these two RNA classes. The obtained miRNA-mRNA dynamic network contains 3350 interactions captured across different cancer progression stages or tumor grades. We found that those dynamic interactions tended to concentrate on a few miRNAs (e.g. miRNA-936), and were more likely present on the miRNA-mRNA pairs outside the discovered modules. In addition, we also pinpointed a robust prognostic signature consisting of 56 modular protein-coding genes, whose co-expression patterns were predictive for the survival time of ovarian cancer patients in multiple independent cohorts. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters.

    PubMed

    Gonzalez-Dominguez, Jorge; Martin, Maria J

    2017-10-10

    In this work we present MPIGeneNet, a parallel tool that applies Pearson's correlation and Random Matrix Theory to construct gene co-expression networks. It is based on the state-of-the-art sequential tool RMTGeneNet, which provides networks with high robustness and sensitivity at the expenses of relatively long runtimes for large scale input datasets. MPIGeneNet returns the same results as RMTGeneNet but improves the memory management, reduces the I/O cost, and accelerates the two most computationally demanding steps of co-expression network construction by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on two different systems using three typical input datasets shows that MPIGeneNet is significantly faster than RMTGeneNet. As an example, our tool is up to 175.41 times faster on a cluster with eight nodes, each one containing two 12-core Intel Haswell processors. Source code of MPIGeneNet, as well as a reference manual, are available at https://sourceforge.net/projects/mpigenenet/.

  9. Improved synthesis of chiral alcohols with Escherichia coli cells co-expressing pyridine nucleotide transhydrogenase, NADP+-dependent alcohol dehydrogenase and NAD+-dependent formate dehydrogenase.

    PubMed

    Weckbecker, Andrea; Hummel, Werner

    2004-11-01

    Recombinant pyridine nucleotide transhydrogenase (PNT) from Escherichia coli has been used to regenerate NAD+ and NADPH. The pnta and pntb genes encoding for the alpha- and beta-subunits were cloned and co-expressed with NADP+-dependent alcohol dehydrogenase (ADH) from Lactobacillus kefir and NAD+-dependent formate dehydrogenase (FDH) from Candida boidinii. Using this whole-cell biocatalyst, efficient conversion of prochiral ketones to chiral alcohols was achieved: 66% acetophenone was reduced to (R)-phenylethanol over 12 h, whereas only 19% (R)-phenylethanol was formed under the same conditions with cells containing ADH and FDH genes but without PNT genes. Cells that were permeabilized with toluene showed ketone reduction only if both cofactors were present.

  10. When is hub gene selection better than standard meta-analysis?

    PubMed

    Langfelder, Peter; Mischel, Paul S; Horvath, Steve

    2013-01-01

    Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to gene expression data and presents novel R functions for carrying out consensus network analysis, network based screening, and meta analysis.

  11. Integrated Systems Biology Analysis of Transcriptomes Reveals Candidate Genes for Acidity Control in Developing Fruits of Sweet Orange (Citrus sinensis L. Osbeck).

    PubMed

    Huang, Dingquan; Zhao, Yihong; Cao, Minghao; Qiao, Liang; Zheng, Zhi-Liang

    2016-01-01

    Organic acids, such as citrate and malate, are important contributors for the sensory traits of fleshy fruits. Although their biosynthesis has been illustrated, regulatory mechanisms of acid accumulation remain to be dissected. To provide transcriptional architecture and identify candidate genes for citrate accumulation in fruits, we have selected for transcriptome analysis four varieties of sweet orange (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). Fruits of these varieties at 45 days post anthesis (DPA), which corresponds to Stage I (cell division), had similar acidity, but they displayed differential acid accumulation at 142 DPA (Stage II, cell expansion). Transcriptomes of fruits at 45 and 142 DPA were profiled using RNA sequencing and analyzed with three different algorithms (Pearson correlation, gene coexpression network and surrogate variable analysis). Our network analysis shows that the acid-correlated genes belong to three distinct network modules. Several of these candidate fruit acidity genes encode regulatory proteins involved in transport (such as AHA10), degradation (such as APD2) and transcription (such as AIL6) and act as hubs in the citrate accumulation gene networks. Taken together, our integrated systems biology analysis has provided new insights into the fruit citrate accumulation gene network and led to the identification of candidate genes likely associated with the fruit acidity control.

  12. Integrated Systems Biology Analysis of Transcriptomes Reveals Candidate Genes for Acidity Control in Developing Fruits of Sweet Orange (Citrus sinensis L. Osbeck)

    PubMed Central

    Huang, Dingquan; Zhao, Yihong; Cao, Minghao; Qiao, Liang; Zheng, Zhi-Liang

    2016-01-01

    Organic acids, such as citrate and malate, are important contributors for the sensory traits of fleshy fruits. Although their biosynthesis has been illustrated, regulatory mechanisms of acid accumulation remain to be dissected. To provide transcriptional architecture and identify candidate genes for citrate accumulation in fruits, we have selected for transcriptome analysis four varieties of sweet orange (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). Fruits of these varieties at 45 days post anthesis (DPA), which corresponds to Stage I (cell division), had similar acidity, but they displayed differential acid accumulation at 142 DPA (Stage II, cell expansion). Transcriptomes of fruits at 45 and 142 DPA were profiled using RNA sequencing and analyzed with three different algorithms (Pearson correlation, gene coexpression network and surrogate variable analysis). Our network analysis shows that the acid-correlated genes belong to three distinct network modules. Several of these candidate fruit acidity genes encode regulatory proteins involved in transport (such as AHA10), degradation (such as APD2) and transcription (such as AIL6) and act as hubs in the citrate accumulation gene networks. Taken together, our integrated systems biology analysis has provided new insights into the fruit citrate accumulation gene network and led to the identification of candidate genes likely associated with the fruit acidity control. PMID:27092171

  13. Salmonid genomes have a remarkably expanded akirin family, coexpressed with genes from conserved pathways governing skeletal muscle growth and catabolism.

    PubMed

    Macqueen, Daniel J; Kristjánsson, Bjarni K; Johnston, Ian A

    2010-06-01

    Metazoan akirin genes regulate innate immunity, myogenesis, and carcinogenesis. Invertebrates typically have one family member, while most tetrapod and teleost vertebrates have one to three. We demonstrate an expanded repertoire of eight family members in genomes of four salmonid fishes, owing to paralog preservation after three tetraploidization events. Retention of paralogs secondarily lost in other teleosts may be related to functional diversification and posttranslational regulation. We hypothesized that salmonid akirins would be transcriptionally regulated in fast-twitch skeletal muscle during activation of conserved pathways governing catabolism and growth. The in vivo nutritional state of Arctic charr (Salvelinus alpinus L.) was experimentally manipulated, and transcript levels for akirin family members and 26 other genes were measured by quantitative real-time PCR (qPCR), allowing the establishment of a similarity network of expression profiles. In fasted muscle, a class of akirins was upregulated, with one family member showing high coexpression with catabolic genes coding the NF-kappaB p65 subunit, E2 ubiquitin-conjugating enzymes, E3 ubiquitin ligases, and IGF-I receptors. Another class of akirin was upregulated with subsequent feeding, coexpressed with 14-3-3 protein genes. There was no similarity between expression profiles of akirins with IGF hormones or binding protein genes. The level of phylogenetic relatedness of akirin family members was not a strong predictor of transcriptional responses to nutritional state, or differences in transcript abundance levels, indicating a complex pattern of regulatory evolution. The salmonid akirins epitomize the complexity linking the genome to physiological phenotypes of vertebrates with a history of tetraploidization.

  14. Cotransfecting norepinephrine transporter and vesicular monoamine transporter 2 genes for increased retention of metaiodobenzylguanidine labeled with iodine 131 in malignant hepatocarcinoma cells.

    PubMed

    Zhao, Yanlin; Zhong, Xiao; Ou, Xiaohong; Cai, Huawei; Wu, Xiaoai; Huang, Rui

    2017-03-01

    Norepinephrine transporter (NET) transfection leads to significant uptake of iodine-131-labeled metaiodobenzylguanidine ( 131 I-MIBG) in non-neuroendocrine tumors. However, the use of 131 I-MIBG is limited by its short retention time in target cells. To prolong the retention of 131 I-MIBG in target cells, we infected hepatocarcinoma (HepG2) cells with Lentivirus-encoding human NET and vesicular monoamine transporter 2 (VMAT2) genes to obtain NET-expressing, NET-VMAT2-coexpressing, and negative-control cell lines. We evaluated the uptake and efflux of 131 I-MIBG both in vitro and in vivo in mice bearing transfected tumors. NET-expressing and NET-VMAT2-coexpressing cells respectively showed 2.24 and 2.22 times higher 131 I-MIBG uptake than controls. Two hours after removal of 131 I-MIBG-containing medium, 25.4% efflux was observed in NET-VMAT2-coexpressing cells and 38.6% in NET-expressing cells. In vivo experiments were performed in nude mice bearing transfected tumors; results revealed that NET-VMAT2-coexpressing tumors had longer 131 I-MIBG retention time than NET-expressing tumors. Meanwhile, NET-VMAT2-coexpressing and NET-expressing tumors displayed 0.54% and 0.19%, respectively, of the injected dose per gram of tissue 24 h after 131 I-MIBG administration. Cotransfection of HepG2 cells with NET and VMAT2 resulted in increased 131 I-MIBG uptake and retention. However, the degree of increase was insufficient to be therapeutically effective in target cells.

  15. NorWood: a gene expression resource for evo-devo studies of conifer wood development.

    PubMed

    Jokipii-Lukkari, Soile; Sundell, David; Nilsson, Ove; Hvidsten, Torgeir R; Street, Nathaniel R; Tuominen, Hannele

    2017-10-01

    The secondary xylem of conifers is composed mainly of tracheids that differ anatomically and chemically from angiosperm xylem cells. There is currently no high-spatial-resolution data available profiling gene expression during wood formation for any coniferous species, which limits insight into tracheid development. RNA-sequencing data from replicated, high-spatial-resolution section series throughout the cambial and woody tissues of Picea abies were used to generate the NorWood.conGenIE.org web resource, which facilitates exploration of the associated gene expression profiles and co-expression networks. Integration within PlantGenIE.org enabled a comparative regulomics analysis, revealing divergent co-expression networks between P. abies and the two angiosperm species Arabidopsis thaliana and Populus tremula for the secondary cell wall (SCW) master regulator NAC Class IIB transcription factors. The SCW cellulose synthase genes (CesAs) were located in the neighbourhoods of the NAC factors in A. thaliana and P. tremula, but not in P. abies. The NorWood co-expression network enabled identification of potential SCW CesA regulators in P. abies. The NorWood web resource represents a powerful community tool for generating evo-devo insights into the divergence of wood formation between angiosperms and gymnosperms and for advancing understanding of the regulation of wood development in P. abies. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  16. Comparative transcriptome and gene co-expression network analysis reveal genes and signaling pathways adaptively responsive to varied adverse stresses in the insect fungal pathogen, Beauveria bassiana.

    PubMed

    He, Zhangjiang; Zhao, Xin; Lu, Zhuoyue; Wang, Huifang; Liu, Pengfei; Zeng, Fanqin; Zhang, Yongjun

    2018-01-01

    Sensing, responding, and adapting to the surrounding environment are crucial for all living organisms to survive, proliferate, and differentiate in their biological niches. Beauveria bassiana is an economically important insect-pathogenic fungus which is widely used as a biocontrol agent to control a variety of insect pests. The fungal pathogen unavoidably encounters a variety of adverse environmental stresses and defense response from the host insects during application of the fungal agents. However, few are known about the transcription response of the fungus to respond or adapt varied adverse stresses. Here, we comparatively analyzed the transcriptome of B. bassiana in globe genome under the varied stationary-phase stresses including osmotic agent (0.8 M NaCl), high temperature (32 °C), cell wall-perturbing agent (Congo red), and oxidative agents (H 2 O 2 or menadione). Total of 12,412 reads were obtained, and mapped to the 6767 genes of the B. bassiana. All of these stresses caused transcription responses involved in basal metabolism, cell wall construction, stress response or cell rescue/detoxification, signaling transduction and gene transcription regulation, and likely other cellular processes. An array of genes displayed similar transcription patterns in response to at least two of the five stresses, suggesting a shared transcription response to varied adverse stresses. Gene co-expression network analysis revealed that mTOR signaling pathway, but not HOG1 MAP kinase pathway, played a central role in regulation the varied adverse stress responses, which was verified by RNAi-mediated knockdown of TOR1. Our findings provided an insight of transcription response and gene co-expression network of B. bassiana in adaptation to varied environments. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Caveolin-1 and Caveolin-2 Can Be Antagonistic Partners in Inflammation and Beyond

    PubMed Central

    de Almeida, Cecília Jacques Gonçalves

    2017-01-01

    Caveolins, encoded by the CAV gene family, are the main protein components of caveolae. In most tissues, caveolin-1 (Cav-1) and caveolin-2 (Cav-2) are co-expressed, and Cav-2 targeting to caveolae depends on the formation of heterooligomers with Cav-1. Notwithstanding, Cav-2 has unpredictable activities, opposing Cav-1 in the regulation of some cellular processes. While the major roles of Cav-1 as a modulator of cell signaling in inflammatory processes and in immune responses have been extensively discussed elsewhere, the aim of this review is to focus on data revealing the distinct activity of Cav-1 and Cav-2, which suggest that these proteins act antagonistically to fine-tune a variety of cellular processes relevant to inflammation. PMID:29250058

  18. Identification of cancer cytotoxic modulators of PDE3A by predictive chemogenomics

    PubMed Central

    de Waal, Luc; Lewis, Timothy A.; Rees, Matthew G.; Tsherniak, Aviad; Wu, Xiaoyun; Choi, Peter S.; Gechijian, Lara; Hartigan, Christina; Faloon, Patrick W.; Hickey, Mark J.; Tolliday, Nicola; Carr, Steven A.; Clemons, Paul A.; Munoz, Benito; Wagner, Bridget K.; Shamji, Alykhan F.; Koehler, Angela N.; Schenone, Monica; Burgin, Alex B.; Schreiber, Stuart L.; Greulich, Heidi; Meyerson, Matthew

    2015-01-01

    High cancer death rates indicate the need for new anti-cancer therapeutic agents. Approaches to discover new cancer drugs include target-based drug discovery and phenotypic screening. Here, we identified phosphodiesterase 3A modulators as cell-selective cancer cytotoxic compounds by phenotypic compound library screening and target deconvolution by predictive chemogenomics. We found that sensitivity to 6-(4-(diethylamino)-3-nitrophenyl)-5-methyl-4,5-dihydropyridazin-3(2H)-one, or DNMDP, across 766 cancer cell lines correlates with expression of the phosphodiesterase 3A gene, PDE3A. Like DNMDP, a subset of known PDE3A inhibitors kill selected cancer cells while others do not. Furthermore, PDE3A depletion leads to DNMDP resistance. We demonstrated that DNMDP binding to PDE3A promotes an interaction between PDE3A and Schlafen 12 (SLFN12), suggesting a neomorphic activity. Co-expression of SLFN12 with PDE3A correlates with DNMDP sensitivity, while depletion of SLFN12 results in decreased DNMDP sensitivity. Our results implicate PDE3A modulators as candidate cancer therapeutic agents and demonstrate the power of predictive chemogenomics in small-molecule discovery. PMID:26656089

  19. Global Profiling of Rice and Poplar Transcriptomes Highlights Key Conserved Circadian-Controlled Pathways and cis-Regulatory Modules

    PubMed Central

    Filichkin, Sergei A.; Breton, Ghislain; Priest, Henry D.; Dharmawardhana, Palitha; Jaiswal, Pankaj; Fox, Samuel E.; Michael, Todd P.; Chory, Joanne; Kay, Steve A.; Mockler, Todd C.

    2011-01-01

    Background Circadian clocks provide an adaptive advantage through anticipation of daily and seasonal environmental changes. In plants, the central clock oscillator is regulated by several interlocking feedback loops. It was shown that a substantial proportion of the Arabidopsis genome cycles with phases of peak expression covering the entire day. Synchronized transcriptome cycling is driven through an extensive network of diurnal and clock-regulated transcription factors and their target cis-regulatory elements. Study of the cycling transcriptome in other plant species could thus help elucidate the similarities and differences and identify hubs of regulation common to monocot and dicot plants. Methodology/Principal Findings Using a combination of oligonucleotide microarrays and data mining pipelines, we examined daily rhythms in gene expression in one monocotyledonous and one dicotyledonous plant, rice and poplar, respectively. Cycling transcriptomes were interrogated under different diurnal (driven) and circadian (free running) light and temperature conditions. Collectively, photocycles and thermocycles regulated about 60% of the expressed nuclear genes in rice and poplar. Depending on the condition tested, up to one third of oscillating Arabidopsis-poplar-rice orthologs were phased within three hours of each other suggesting a high degree of conservation in terms of rhythmic gene expression. We identified clusters of rhythmically co-expressed genes and searched their promoter sequences to identify phase-specific cis-elements, including elements that were conserved in the promoters of Arabidopsis, poplar, and rice. Conclusions/Significance Our results show that the cycling patterns of many circadian clock genes are highly conserved across poplar, rice, and Arabidopsis. The expression of many orthologous genes in key metabolic and regulatory pathways is diurnal and/or circadian regulated and phased to similar times of day. Our results confirm previous findings in Arabidopsis of three major classes of cis-regulatory modules within the plant circadian network: the morning (ME, GBOX), evening (EE, GATA), and midnight (PBX/TBX/SBX) modules. Identification of identical overrepresented motifs in the promoters of cycling genes from different species suggests that the core diurnal/circadian cis-regulatory network is deeply conserved between mono- and dicotyledonous species. PMID:21694767

  20. The biosynthetic gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor contains its co-expressed vacuolar MATE transporter

    PubMed Central

    Darbani, Behrooz; Motawia, Mohammed Saddik; Olsen, Carl Erik; Nour-Eldin, Hussam H.; Møller, Birger Lindberg; Rook, Fred

    2016-01-01

    Genomic gene clusters for the biosynthesis of chemical defence compounds are increasingly identified in plant genomes. We previously reported the independent evolution of biosynthetic gene clusters for cyanogenic glucoside biosynthesis in three plant lineages. Here we report that the gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor additionally contains a gene, SbMATE2, encoding a transporter of the multidrug and toxic compound extrusion (MATE) family, which is co-expressed with the biosynthetic genes. The predicted localisation of SbMATE2 to the vacuolar membrane was demonstrated experimentally by transient expression of a SbMATE2-YFP fusion protein and confocal microscopy. Transport studies in Xenopus laevis oocytes demonstrate that SbMATE2 is able to transport dhurrin. In addition, SbMATE2 was able to transport non-endogenous cyanogenic glucosides, but not the anthocyanin cyanidin 3-O-glucoside or the glucosinolate indol-3-yl-methyl glucosinolate. The genomic co-localisation of a transporter gene with the biosynthetic genes producing the transported compound is discussed in relation to the role self-toxicity of chemical defence compounds may play in the formation of gene clusters. PMID:27841372

  1. Arabidopsis VASCULAR-RELATED UNKNOWN PROTEIN1 Regulates Xylem Development and Growth by a Conserved Mechanism That Modulates Hormone Signaling1[W][OPEN

    PubMed Central

    Grienenberger, Etienne; Douglas, Carl J.

    2014-01-01

    Despite a strict conservation of the vascular tissues in vascular plants (tracheophytes), our understanding of the genetic basis underlying the differentiation of secondary cell wall-containing cells in the xylem of tracheophytes is still far from complete. Using coexpression analysis and phylogenetic conservation across sequenced tracheophyte genomes, we identified a number of Arabidopsis (Arabidopsis thaliana) genes of unknown function whose expression is correlated with secondary cell wall deposition. Among these, the Arabidopsis VASCULAR-RELATED UNKNOWN PROTEIN1 (VUP1) gene encodes a predicted protein of 24 kD with no annotated functional domains but containing domains that are highly conserved in tracheophytes. Here, we show that the VUP1 expression pattern, determined by promoter-β-glucuronidase reporter gene expression, is associated with vascular tissues, while vup1 loss-of-function mutants exhibit collapsed morphology of xylem vessel cells. Constitutive overexpression of VUP1 caused dramatic and pleiotropic developmental defects, including severe dwarfism, dark green leaves, reduced apical dominance, and altered photomorphogenesis, resembling brassinosteroid-deficient mutants. Constitutive overexpression of VUP homologs from multiple tracheophyte species induced similar defects. Whole-genome transcriptome analysis revealed that overexpression of VUP1 represses the expression of many brassinosteroid- and auxin-responsive genes. Additionally, deletion constructs and site-directed mutagenesis were used to identify critical domains and amino acids required for VUP1 function. Altogether, our data suggest a conserved role for VUP1 in regulating secondary wall formation during vascular development by tissue- or cell-specific modulation of hormone signaling pathways. PMID:24567189

  2. Novel insights into the response of Atlantic salmon (Salmo salar) to Piscirickettsia salmonis: Interplay of coding genes and lncRNAs during bacterial infection.

    PubMed

    Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian

    2016-12-01

    Despite the high prevalence and impact to Chilean salmon aquaculture of the intracellular bacterium Piscirickettsia salmonis, the molecular underpinnings of host-pathogen interactions remain unclear. Herein, the interplay of coding and non-coding transcripts has been proposed as a key mechanism involved in immune response. Therefore, the aim of this study was to evidence how coding and non-coding transcripts are modulated during the infection process of Atlantic salmon with P. salmonis. For this, RNA-seq was conducted in brain, spleen, and head kidney samples, revealing different transcriptional profiles according to bacterial load. Additionally, while most of the regulated genes annotated for diverse biological processes during infection, a common response associated with clathrin-mediated endocytosis and iron homeostasis was present in all tissues. Interestingly, while endocytosis-promoting factors and clathrin inductions were upregulated, endocytic receptors were mainly downregulated. Furthermore, the regulation of genes related to iron homeostasis suggested an intracellular accumulation of iron, a process in which heme biosynthesis/degradation pathways might play an important role. Regarding the non-coding response, 918 putative long non-coding RNAs were identified, where 425 were newly characterized for S. salar. Finally, co-localization and co-expression analyses revealed a strong correlation between the modulations of long non-coding RNAs and genes associated with endocytosis and iron homeostasis. These results represent the first comprehensive study of putative interplaying mechanisms of coding and non-coding RNAs during bacterial infection in salmonids. Copyright © 2016 Elsevier Ltd. All rights reserved.

  3. The Parkinson disease-related protein DJ-1 counteracts mitochondrial impairment induced by the tumour suppressor protein p53 by enhancing endoplasmic reticulum-mitochondria tethering.

    PubMed

    Ottolini, Denis; Calì, Tito; Negro, Alessandro; Brini, Marisa

    2013-06-01

    DJ-1 was first identified as an oncogene. More recently, mutations in its gene have been found causative for autosomal recessive familial Parkinson disease. Numerous studies support the DJ-1 role in the protection against oxidative stress and maintenance of mitochondria structure; however, the mechanism of its protective function remains largely unknown. We investigated whether mitochondrial Ca(2+) homeostasis, a key parameter in cell physiology, could be a target for DJ-1 action. Here, we show that DJ-1 modulates mitochondrial Ca(2+) transients induced upon cell stimulation with an 1,4,5-inositol-tris-phosphate agonist by favouring the endoplasmic reticulum (ER)-mitochondria tethering. A reduction of DJ-1 levels results in mitochondria fragmentation and decreased mitochondrial Ca(2+) uptake in stimulated cells. To functionally couple these effects with the well-recognized cytoprotective role of DJ-1, we investigated its action in respect to the tumour suppressor p53. p53 overexpression in HeLa cells impairs their ability to accumulate Ca(2+) in the mitochondrial matrix, causes alteration of the mitochondrial morphology and reduces ER-mitochondria contact sites. Mitochondrial impairments are independent from Drp1 activation, since the co-expression of the dominant negative mutant of Drp1 failed to abolish them. DJ-1 overexpression prevents these alterations by re-establishing the ER-mitochondria tethering. Similarly, the co-expression of the pro-fusion protein Mitofusin 2 blocks the effects induced by p53 on mitochondria, confirming that the modulation of the ER-mitochondria contact sites is critical to mitochondria integrity. Thus, the impairment of ER-mitochondria communication, as a consequence of DJ-1 loss-of-function, may be detrimental for mitochondria-related processes and be at the basis of mitochondrial dysfunction observed in Parkinson disease.

  4. Transcriptome analysis reveals determinant stages controlling human embryonic stem cell commitment to neuronal cells.

    PubMed

    Li, Yuanyuan; Wang, Ran; Qiao, Nan; Peng, Guangdun; Zhang, Ke; Tang, Ke; Han, Jing-Dong J; Jing, Naihe

    2017-12-01

    Proper neural commitment is essential for ensuring the appropriate development of the human brain and for preventing neurodevelopmental diseases such as autism spectrum disorders, schizophrenia, and intellectual disorders. However, the molecular mechanisms underlying the neural commitment in humans remain elusive. Here, we report the establishment of a neural differentiation system based on human embryonic stem cells (hESCs) and on comprehensive RNA sequencing analysis of transcriptome dynamics during early hESC differentiation. Using weighted gene co-expression network analysis, we reveal that the hESC neurodevelopmental trajectory has five stages: pluripotency (day 0); differentiation initiation (days 2, 4, and 6); neural commitment (days 8-10); neural progenitor cell proliferation (days 12, 14, and 16); and neuronal differentiation (days 18, 20, and 22). These stages were characterized by unique module genes, which may recapitulate the early human cortical development. Moreover, a comparison of our RNA-sequencing data with several other transcriptome profiling datasets from mice and humans indicated that Module 3 associated with the day 8-10 stage is a critical window of fate switch from the pluripotency to the neural lineage. Interestingly, at this stage, no key extrinsic signals were activated. In contrast, using CRISPR/Cas9-mediated gene knockouts, we also found that intrinsic hub transcription factors, including the schizophrenia-associated SIX3 gene and septo-optic dysplasia-related HESX1 gene, are required to program hESC neural determination. Our results improve the understanding of the mechanism of neural commitment in the human brain and may help elucidate the etiology of human mental disorders and advance therapies for managing these conditions. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  5. Genomic and Network Patterns of Schizophrenia Genetic Variation in Human Evolutionary Accelerated Regions

    PubMed Central

    Xu, Ke; Schadt, Eric E.; Pollard, Katherine S.; Roussos, Panos; Dudley, Joel T.

    2015-01-01

    The population persistence of schizophrenia despite associated reductions in fitness and fecundity suggests that the genetic basis of schizophrenia has a complex evolutionary history. A recent meta-analysis of schizophrenia genome-wide association studies offers novel opportunities for assessment of the evolutionary trajectories of schizophrenia-associated loci. In this study, we hypothesize that components of the genetic architecture of schizophrenia are attributable to human lineage-specific evolution. Our results suggest that schizophrenia-associated loci enrich in genes near previously identified human accelerated regions (HARs). Specifically, we find that genes near HARs conserved in nonhuman primates (pHARs) are enriched for schizophrenia-associated loci, and that pHAR-associated schizophrenia genes are under stronger selective pressure than other schizophrenia genes and other pHAR-associated genes. We further evaluate pHAR-associated schizophrenia genes in regulatory network contexts to investigate associated molecular functions and mechanisms. We find that pHAR-associated schizophrenia genes significantly enrich in a GABA-related coexpression module that was previously found to be differentially regulated in schizophrenia affected individuals versus healthy controls. In another two independent networks constructed from gene expression profiles from prefrontal cortex samples, we find that pHAR-associated schizophrenia genes are located in more central positions and their average path lengths to the other nodes are significantly shorter than those of other schizophrenia genes. Together, our results suggest that HARs are associated with potentially important functional roles in the genetic architecture of schizophrenia. PMID:25681384

  6. Co-expression of HoxA9 and bcr-abl genes in chronic myeloid leukemia.

    PubMed

    Tedeschi, Fabián A; Cardozo, Maria A; Valentini, Rosanna; Zalazar, Fabián E

    2010-05-01

    We have analyzed the co-expression of the bcr-abl and HoxA9 genes in the follow-up of patients with chronic myeloid leukemia (CML). In the present work we measured the HoxA9 and bcr-abl gene expression in sequential samples. In all patients, bcr-abl and HoxA9 were expressed at detectable levels in every sample. When the results were expressed in relation to abl, two different situations were found: (a) patients clinically stable at second sampling, with low relative risk at diagnosis (low Sokal's score), did not show significant differences in both bcr-abl and HoxA9 levels in the sequential samples analyzed, and (b) patients with poor prognosis (showing intermediate or high Sokal's score at diagnosis) had increased expression of bcr-abl as well as HoxA9 genes (p < 0.05). Since HoxA9 gene expression remains at relatively constant levels throughout adult life, our results could reflect actual changes in the expression rate of this gene associated with bcr-abl during the progression of CML.

  7. Neutralization of Bacterial YoeBSpn Toxicity and Enhanced Plant Growth in Arabidopsis thaliana via Co-Expression of the Toxin-Antitoxin Genes

    PubMed Central

    Abu Bakar, Fauziah; Yeo, Chew Chieng; Harikrishna, Jennifer Ann

    2016-01-01

    Bacterial toxin-antitoxin (TA) systems have various cellular functions, including as part of the general stress response. The genome of the Gram-positive human pathogen Streptococcus pneumoniae harbors several putative TA systems, including yefM-yoeBSpn, which is one of four systems that had been demonstrated to be biologically functional. Overexpression of the yoeBSpn toxin gene resulted in cell stasis and eventually cell death in its native host, as well as in Escherichia coli. Our previous work showed that induced expression of a yoeBSpn toxin-Green Fluorescent Protein (GFP) fusion gene apparently triggered apoptosis and was lethal in the model plant, Arabidopsis thaliana. In this study, we investigated the effects of co-expression of the yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic A. thaliana. When co-expressed in Arabidopsis, the YefMSpn antitoxin was found to neutralize the toxicity of YoeBSpn-GFP. Interestingly, the inducible expression of both yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic hybrid Arabidopsis resulted in larger rosette leaves and taller plants with a higher number of inflorescence stems and increased silique production. To our knowledge, this is the first demonstration of a prokaryotic antitoxin neutralizing its cognate toxin in plant cells. PMID:27104531

  8. PlaNet: Combined Sequence and Expression Comparisons across Plant Networks Derived from Seven Species[W][OA

    PubMed Central

    Mutwil, Marek; Klie, Sebastian; Tohge, Takayuki; Giorgi, Federico M.; Wilkins, Olivia; Campbell, Malcolm M.; Fernie, Alisdair R.; Usadel, Björn; Nikoloski, Zoran; Persson, Staffan

    2011-01-01

    The model organism Arabidopsis thaliana is readily used in basic research due to resource availability and relative speed of data acquisition. A major goal is to transfer acquired knowledge from Arabidopsis to crop species. However, the identification of functional equivalents of well-characterized Arabidopsis genes in other plants is a nontrivial task. It is well documented that transcriptionally coordinated genes tend to be functionally related and that such relationships may be conserved across different species and even kingdoms. To exploit such relationships, we constructed whole-genome coexpression networks for Arabidopsis and six important plant crop species. The interactive networks, clustered using the HCCA algorithm, are provided under the banner PlaNet (http://aranet.mpimp-golm.mpg.de). We implemented a comparative network algorithm that estimates similarities between network structures. Thus, the platform can be used to swiftly infer similar coexpressed network vicinities within and across species and can predict the identity of functional homologs. We exemplify this using the PSA-D and chalcone synthase-related gene networks. Finally, we assessed how ontology terms are transcriptionally connected in the seven species and provide the corresponding MapMan term coexpression networks. The data support the contention that this platform will considerably improve transfer of knowledge generated in Arabidopsis to valuable crop species. PMID:21441431

  9. Involvement of an ABI-like protein and a Ca2+-ATPase in drought tolerance as revealed by transcript profiling of a sweetpotato somatic hybrid and its parents Ipomoea batatas (L.) Lam. and I. triloba L.

    PubMed

    Yang, Yufeng; Wang, Yannan; Jia, Licong; Yang, Guohong; Xu, Xinzhi; Zhai, Hong; He, Shaozhen; Li, Junxia; Dai, Xiaodong; Qin, Na; Zhu, Cancan; Liu, Qingchang

    2018-01-01

    Previously, we obtained the sweetpotato somatic hybrid KT1 from a cross between sweetpotato (Ipomoea batatas (L.) Lam.) cv. Kokei No. 14 and its drought-tolerant wild relative I. triloba L. KT1 not only inherited the thick storage root characteristic of Kokei No. 14 but also the drought-tolerance trait of I. triloba L. The aim of this study was to explore the molecular mechanism of the drought tolerance of KT1. Four-week-old in vitro-grown plants of KT1, Kokei No. 14, and I. triloba L. were subjected to a simulated drought stress treatment (30% PEG6000) for 0, 6, 12 and 24 h. Total RNA was extracted from samples at each time point, and then used for transcriptome sequencing. The gene transcript profiles of KT1 and its parents were compared to identify differentially expressed genes, and drought-related modules were screened by a weighted gene co-expression network analysis. The functions of ABI-like protein and Ca2+-ATPase, two proteins screened from the cyan and light yellow modules, were analyzed in terms of their potential roles in drought tolerance in KT1 and its parents. These analyses of the drought responses of KT1 and its somatic donors at the transcriptional level provide new annotations for the molecular mechanism of drought tolerance in the somatic hybrid KT1 and its parents.

  10. Covariance Structure Models for Gene Expression Microarray Data

    ERIC Educational Resources Information Center

    Xie, Jun; Bentler, Peter M.

    2003-01-01

    Covariance structure models are applied to gene expression data using a factor model, a path model, and their combination. The factor model is based on a few factors that capture most of the expression information. A common factor of a group of genes may represent a common protein factor for the transcript of the co-expressed genes, and hence, it…

  11. Improving coenzyme Q8 production in Escherichia coli employing multiple strategies.

    PubMed

    Xu, Wen; Yang, Shuiyun; Zhao, Junchao; Su, Tingting; Zhao, Liangrui; Liu, Jiankang

    2014-08-01

    Coenzyme Q (CoQ) is a medically valuable compound and a high yielding strain for CoQ will have several benefits for the industrial production of CoQ. To increase the CoQ(8) content of E. coli, we blocked the pathway for the synthesis of menaquinone by deleting the menA gene. The blocking of menaquinone pathway increased the CoQ(8) content by 81 % in E. coli (ΔmenA). To study the CoQ producing potential of E. coli, we employed previous known increasing strategies for systematic metabolic engineering. These include the supplementation with substrate precursors and the co-expression of rate-limiting genes. The co-expression of dxs-ubiA and the supplementation with substrate precursors such as pyruvate (PYR) and parahydroxybenzoic acid (pHBA) increased the content of CoQ(8) in E. coli (ΔmenA) by 125 and 59 %, respectively. Moreover, a 180 % increase in the CoQ(8) content in E. coli (ΔmenA) was realized by the combination of the co-expression of dxs-ubiA and the supplementation with PYR and pHBA. All in all, CoQ(8) content in E. coli increased 4.06 times by blocking the menaquinone pathway, dxs-ubiA co-expression and the addition of sodium pyruvate and parahydroxybenzoic acid to the medium. Results suggested a synergistic effect among different metabolic engineering strategies.

  12. Retransformation of marker-free potato for enhanced resistance against fungal pathogens by pyramiding chitinase and wasabi defensin genes.

    PubMed

    Khan, Raham Sher; Darwish, Nader Ahmed; Khattak, Bushra; Ntui, Valentine Otang; Kong, Kynet; Shimomae, Kazuki; Nakamura, Ikuo; Mii, Masahiro

    2014-09-01

    Multi-auto-transformation vector system has been one of the strategies to produce marker-free transgenic plants without using selective chemicals and plant growth regulators and thus facilitating transgene stacking. In the study reported here, retransformation was carried out in marker-free transgenic potato CV. May Queen containing ChiC gene (isolated from Streptomyces griseus strain HUT 6037) with wasabi defensin (WD) gene (isolated from Wasabia japonica) to pyramid the two disease resistant genes. Molecular analyses of the developed shoots confirmed the existence of both the genes of interest (ChiC and WD) in transgenic plants. Co-expression of the genes was confirmed by RT-PCR, northern blot, and western blot analyses. Disease resistance assay of in vitro plants showed that the transgenic lines co-expressing both the ChiC and WD genes had higher resistance against the fungal pathogens, Fusarium oxysporum (Fusarium wilt) and Alternaria solani (early blight) compared to the non-transformed control and the transgenic lines expressing either of the ChiC or WD genes. The disease resistance potential of the transgenic plants could be increased by transgene stacking or multiple transformations.

  13. Exploring the Genomic Roadmap and Molecular Phylogenetics Associated with MODY Cascades Using Computational Biology.

    PubMed

    Chakraborty, Chiranjib; Bandyopadhyay, Sanghamitra; Doss, C George Priya; Agoramoorthy, Govindasamy

    2015-04-01

    Maturity onset diabetes of the young (MODY) is a metabolic and genetic disorder. It is different from type 1 and type 2 diabetes with low occurrence level (1-2%) among all diabetes. This disorder is a consequence of β-cell dysfunction. Till date, 11 subtypes of MODY have been identified, and all of them can cause gene mutations. However, very little is known about the gene mapping, molecular phylogenetics, and co-expression among MODY genes and networking between cascades. This study has used latest servers and software such as VarioWatch, ClustalW, MUSCLE, G Blocks, Phylogeny.fr, iTOL, WebLogo, STRING, and KEGG PATHWAY to perform comprehensive analyses of gene mapping, multiple sequences alignment, molecular phylogenetics, protein-protein network design, co-expression analysis of MODY genes, and pathway development. The MODY genes are located in chromosomes-2, 7, 8, 9, 11, 12, 13, 17, and 20. Highly aligned block shows Pro, Gly, Leu, Arg, and Pro residues are highly aligned in the positions of 296, 386, 437, 455, 456 and 598, respectively. Alignment scores inform us that HNF1A and HNF1B proteins have shown high sequence similarity among MODY proteins. Protein-protein network design shows that HNF1A, HNF1B, HNF4A, NEUROD1, PDX1, PAX4, INS, and GCK are strongly connected, and the co-expression analyses between MODY genes also show distinct association between HNF1A and HNF4A genes. This study has used latest tools of bioinformatics to develop a rapid method to assess the evolutionary relationship, the network development, and the associations among eleven MODY genes and cascades. The prediction of sequence conservation, molecular phylogenetics, protein-protein network and the association between the MODY cascades enhances opportunities to get more insights into the less-known MODY disease.

  14. Integration of heterogeneous molecular networks to unravel gene-regulation in Mycobacterium tuberculosis.

    PubMed

    van Dam, Jesse C J; Schaap, Peter J; Martins dos Santos, Vitor A P; Suárez-Diez, María

    2014-09-26

    Different methods have been developed to infer regulatory networks from heterogeneous omics datasets and to construct co-expression networks. Each algorithm produces different networks and efforts have been devoted to automatically integrate them into consensus sets. However each separate set has an intrinsic value that is diluted and partly lost when building a consensus network. Here we present a methodology to generate co-expression networks and, instead of a consensus network, we propose an integration framework where the different networks are kept and analysed with additional tools to efficiently combine the information extracted from each network. We developed a workflow to efficiently analyse information generated by different inference and prediction methods. Our methodology relies on providing the user the means to simultaneously visualise and analyse the coexisting networks generated by different algorithms, heterogeneous datasets, and a suite of analysis tools. As a show case, we have analysed the gene co-expression networks of Mycobacterium tuberculosis generated using over 600 expression experiments. Regarding DNA damage repair, we identified SigC as a key control element, 12 new targets for LexA, an updated LexA binding motif, and a potential mismatch repair system. We expanded the DevR regulon with 27 genes while identifying 9 targets wrongly assigned to this regulon. We discovered 10 new genes linked to zinc uptake and a new regulatory mechanism for ZuR. The use of co-expression networks to perform system level analysis allows the development of custom made methodologies. As show cases we implemented a pipeline to integrate ChIP-seq data and another method to uncover multiple regulatory layers. Our workflow is based on representing the multiple types of information as network representations and presenting these networks in a synchronous framework that allows their simultaneous visualization while keeping specific associations from the different networks. By simultaneously exploring these networks and metadata, we gained insights into regulatory mechanisms in M. tuberculosis that could not be obtained through the separate analysis of each data type.

  15. Convergent roles of de novo mutations and common variants in schizophrenia in tissue-specific and spatiotemporal co-expression network.

    PubMed

    Jia, Peilin; Chen, Xiangning; Fanous, Ayman H; Zhao, Zhongming

    2018-05-24

    Genetic components susceptible to complex disease such as schizophrenia include a wide spectrum of variants, including common variants (CVs) and de novo mutations (DNMs). Although CVs and DNMs differ by origin, it remains elusive whether and how they interact at the gene, pathway, and network levels that leads to the disease. In this work, we characterized the genes harboring schizophrenia-associated CVs (CVgenes) and the genes harboring DNMs (DNMgenes) using measures from network, tissue-specific expression profile, and spatiotemporal brain expression profile. We developed an algorithm to link the DNMgenes and CVgenes in spatiotemporal brain co-expression networks. DNMgenes tended to have central roles in the human protein-protein interaction (PPI) network, evidenced in their high degree and high betweenness values. DNMgenes and CVgenes connected with each other significantly more often than with other genes in the networks. However, only CVgenes remained significantly connected after adjusting for their degree. In our gene co-expression PPI network, we found DNMgenes and CVgenes connected in a tissue-specific fashion, and such a pattern was similar to that in GTEx brain but not in other GTEx tissues. Importantly, DNMgene-CVgene subnetworks were enriched with pathways of chromatin remodeling, MHC protein complex binding, and neurotransmitter activities. In summary, our results unveiled that both DNMgenes and CVgenes contributed to a core set of biologically important pathways and networks, and their interactions may attribute to the risk for schizophrenia. Our results also suggested a stronger biological effect of DNMgenes than CVgenes in schizophrenia.

  16. Enhanced Transgene Expression in Sugarcane by Co-Expression of Virus-Encoded RNA Silencing Suppressors

    PubMed Central

    Park, Jong-Won; Beyene, Getu; Buenrostro-Nava, Marco T.; Molina, Joe; Wang, Xiaofeng; Ciomperlik, Jessica J.; Manabayeva, Shuga A.; Alvarado, Veria Y.; Rathore, Keerti S.; Scholthof, Herman B.; Mirkov, T. Erik

    2013-01-01

    Post-transcriptional gene silencing is commonly observed in polyploid species and often poses a major limitation to plant improvement via biotechnology. Five plant viral suppressors of RNA silencing were evaluated for their ability to counteract gene silencing and enhance the expression of the Enhanced Yellow Fluorescent Protein (EYFP) or the β-glucuronidase (GUS) reporter gene in sugarcane, a major sugar and biomass producing polyploid. Functionality of these suppressors was first verified in Nicotiana benthamiana and onion epidermal cells, and later tested by transient expression in sugarcane young leaf segments and protoplasts. In young leaf segments co-expressing a suppressor, EYFP reached its maximum expression at 48–96 h post-DNA introduction and maintained its peak expression for a longer time compared with that in the absence of a suppressor. Among the five suppressors, Tomato bushy stunt virus-encoded P19 and Barley stripe mosaic virus-encoded γb were the most efficient. Co-expression with P19 and γb enhanced EYFP expression 4.6-fold and 3.6-fold in young leaf segments, and GUS activity 2.3-fold and 2.4-fold in protoplasts compared with those in the absence of a suppressor, respectively. In transgenic sugarcane, co-expression of GUS and P19 suppressor showed the highest accumulation of GUS levels with an average of 2.7-fold more than when GUS was expressed alone, with no detrimental phenotypic effects. The two established transient expression assays, based on young leaf segments and protoplasts, and confirmed by stable transgene expression, offer a rapid versatile system to verify the efficiency of RNA silencing suppressors that proved to be valuable in enhancing and stabilizing transgene expression in sugarcane. PMID:23799071

  17. Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation

    DOE PAGES

    Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; ...

    2016-11-24

    Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. A multitude of technologies, abstractions, and interpretive frameworks have emerged to answer the challenges presented by genome function and regulatory network inference. Here, we propose a new approach for producing biologically meaningful clusters of coexpressed genes, called Atomic Regulons (ARs), based on expression data, gene context, and functional relationships. We demonstrate this new approach by computing ARs for Escherichia coli, which we compare with the coexpressed gene clusters predicted by two prevalent existing methods: hierarchical clustering and k-meansmore » clustering. We test the consistency of ARs predicted by all methods against expected interactions predicted by the Context Likelihood of Relatedness (CLR) mutual information based method, finding that the ARs produced by our approach show better agreement with CLR interactions. We then apply our method to compute ARs for four other genomes: Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus. We compare the AR clusters from all genomes to study the similarity of coexpression among a phylogenetically diverse set of species, identifying subsystems that show remarkable similarity over wide phylogenetic distances. We also study the sensitivity of our method for computing ARs to the expression data used in the computation, showing that our new approach requires less data than competing approaches to converge to a near final configuration of ARs. We go on to use our sensitivity analysis to identify the specific experiments that lead most rapidly to the final set of ARs for E. coli. As a result, this analysis produces insights into improving the design of gene expression experiments.« less

  18. Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.

    Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. A multitude of technologies, abstractions, and interpretive frameworks have emerged to answer the challenges presented by genome function and regulatory network inference. Here, we propose a new approach for producing biologically meaningful clusters of coexpressed genes, called Atomic Regulons (ARs), based on expression data, gene context, and functional relationships. We demonstrate this new approach by computing ARs for Escherichia coli, which we compare with the coexpressed gene clusters predicted by two prevalent existing methods: hierarchical clustering and k-meansmore » clustering. We test the consistency of ARs predicted by all methods against expected interactions predicted by the Context Likelihood of Relatedness (CLR) mutual information based method, finding that the ARs produced by our approach show better agreement with CLR interactions. We then apply our method to compute ARs for four other genomes: Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus. We compare the AR clusters from all genomes to study the similarity of coexpression among a phylogenetically diverse set of species, identifying subsystems that show remarkable similarity over wide phylogenetic distances. We also study the sensitivity of our method for computing ARs to the expression data used in the computation, showing that our new approach requires less data than competing approaches to converge to a near final configuration of ARs. We go on to use our sensitivity analysis to identify the specific experiments that lead most rapidly to the final set of ARs for E. coli. As a result, this analysis produces insights into improving the design of gene expression experiments.« less

  19. The Pathway Coexpression Network: Revealing pathway relationships

    PubMed Central

    Tanzi, Rudolph E.

    2018-01-01

    A goal of genomics is to understand the relationships between biological processes. Pathways contribute to functional interplay within biological processes through complex but poorly understood interactions. However, limited functional references for global pathway relationships exist. Pathways from databases such as KEGG and Reactome provide discrete annotations of biological processes. Their relationships are currently either inferred from gene set enrichment within specific experiments, or by simple overlap, linking pathway annotations that have genes in common. Here, we provide a unifying interpretation of functional interaction between pathways by systematically quantifying coexpression between 1,330 canonical pathways from the Molecular Signatures Database (MSigDB) to establish the Pathway Coexpression Network (PCxN). We estimated the correlation between canonical pathways valid in a broad context using a curated collection of 3,207 microarrays from 72 normal human tissues. PCxN accounts for shared genes between annotations to estimate significant correlations between pathways with related functions rather than with similar annotations. We demonstrate that PCxN provides novel insight into mechanisms of complex diseases using an Alzheimer’s Disease (AD) case study. PCxN retrieved pathways significantly correlated with an expert curated AD gene list. These pathways have known associations with AD and were significantly enriched for genes independently associated with AD. As a further step, we show how PCxN complements the results of gene set enrichment methods by revealing relationships between enriched pathways, and by identifying additional highly correlated pathways. PCxN revealed that correlated pathways from an AD expression profiling study include functional clusters involved in cell adhesion and oxidative stress. PCxN provides expanded connections to pathways from the extracellular matrix. PCxN provides a powerful new framework for interrogation of global pathway relationships. Comprehensive exploration of PCxN can be performed at http://pcxn.org/. PMID:29554099

  20. Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

    PubMed

    Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

    2018-05-09

    Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.

  1. Differentiation of embryonic stem cells into hepatocytes that coexpress coagulation factors VIII and IX.

    PubMed

    Cao, Jun; Shang, Chang-zhen; Lü, Li-hong; Qiu, De-chuan; Ren, Meng; Chen, Ya-jin; Min, Jun

    2010-11-01

    To establish an efficient culture system to support embryonic stem (ES) cell differentiation into hepatocytes that coexpress F-VIII and F-IX. Mouse E14 ES cells were cultured in differentiation medium containing sodium butyrate (SB), basic fibroblast growth factor (bFGF), and/or bone morphogenetic protein 4 (BMP4) to induce the differentiation of endoderm cells and hepatic progenitor cells. Hepatocyte growth factor, oncostatin M, and dexamethasone were then used to induce the maturation of ES cell-derived hepatocytes. The mRNA expression levels of endoderm-specific genes and hepatocyte-specific genes, including the levels of F-VIII and F-IX, were detected by RT-PCR and real-time PCR during various stages of differentiation. Protein expression was examined by immunofluorescence and Western blot. At the final stage of differentiation, flow cytometry was performed to determine the percentage of cells coexpressing F-VIII and F-IX, and ELISA was used to detect the levels of F-VIII and F-IX protein secreted into the culture medium. The expression of endoderm-specific and hepatocyte-specific markers was upregulated to highest level in response to the combination of SB, bFGF, and BMP4. Treatment with the three inducers during hepatic progenitor differentiation significantly enhanced the mRNA and protein levels of F-VIII and F-IX in ES cell-derived hepatocytes. More importantly, F-VIII and F-IX were coexpressed with high efficiency at the final stage of differentiation, and they were also secreted into the culture medium. We have established a novel in vitro differentiation protocol for ES-derived hepatocytes that coexpress F-VIII and F-IX that may provide a foundation for stem cell replacement therapy for hemophilia.

  2. Tumor suppressor BRCA1 is expressed in prostate cancer and controls IGF-I receptor (IGF-IR) gene transcription in an androgen receptor-dependent manner

    PubMed Central

    Schayek, Hagit; Haugk, Kathy; Sun, Shihua; True, Lawrence D.; Plymate, Stephen R.; Werner, Haim

    2010-01-01

    Purpose The insulin-like growth factor (IGF) system plays an important role in prostate cancer. The BRCA1 gene encodes a transcription factor with tumor suppressor activity. The involvement of BRCA1 in prostate cancer, however, has not yet been elucidated. The purpose of the present study was to examine the functional correlations between BRCA1 and the IGF system in prostate cancer. Experimental Design An immunohistochemical analysis of BRCA1 was performed on Tissue Microarrays comprising 203 primary prostate cancer specimens. In addition, BRCA1 levels were measured in prostate cancer xenografts and in cell lines representing early stages of the disease (P69 cells) and advanced stages (M12 cells). The ability of BRCA1 to regulate IGF-IR expression was studied by coexpression experiments using a BRCA1 expression vector along with an IGF-IR promoter-luciferase reporter. Results We found significantly elevated BRCA1 levels in prostate cancer in comparison to histologically normal prostate tissue (p < 0.001). In addition, an inverse correlation between BRCA1 and IGF-IR levels was observed in the AR-negative P69 and M12 prostate cancer-derived cell lines. Coexpression experiments in M12 cells revealed that BRCA1 was able to suppress IGF-IR promoter activity and endogenous IGF-IR levels. On the other hand, BRCA1 enhanced IGF-IR levels in LnCaP C4-2 cells expressing an endogenous AR. Conclusions We provide evidence that BRCA1 differentially regulates IGF-IR expression in AR positive and negative prostate cancer cells. The mechanism of action of BRCA1 involves modulation of IGF-IR gene transcription. In addition, immunohistochemical data is consistent with a potential survival role of BRCA1 in prostate cancer. PMID:19223505

  3. A synthetic cGMP-sensitive gene switch providing Viagra(®)-controlled gene expression in mammalian cells and mice.

    PubMed

    Kim, Taeuk; Folcher, Marc; Charpin-El Hamri, Ghislaine; Fussenegger, Martin

    2015-05-01

    Cyclic guanosine monophosphate (cGMP) is a universal second messenger that is synthesized from guanosine triphosphate (GTP) by guanylyl cyclases (GCs) and hydrolyzed into guanosine monophosphate (GMP) by phosphodiesterases (PDEs). Small-molecule drugs that induce high cGMP levels in specialized tissues by boosting GC activity or inhibiting PDE activity have become the predominant treatment strategy for a wide range of medical conditions, including congestive heart failure, pulmonary hypertension, atherosclerosis-based claudication and erectile dysfunction. By fusing the cGMP receptor protein (CRP) of Rhodospirillum centenum to the Herpes simplex-derived transactivation domain VP16, we created a novel synthetic mammalian cGMP-sensing transcription factor (GTA) that activates synthetic promoters (PGTA) containing newly identified GTA-specific operator sites in a concentration-dependent manner. In cell lines expressing endogenous natriuretic peptide receptor A (NPR-A) (HeLa), GTA/PGTA-driven transgene expression was induced by B-type natriuretic peptide (BNP; Nesiritide(®)) in a concentration-dependent manner, which activated NPR-A׳s intracellular GC domain and triggered a corresponding cGMP surge. Ectopic expression of NPR-A in NPR-A-negative cell lines (HEK-293T) produced high cGMP levels and mediated maximum GTA/PGTA-driven transgene expression, which was suppressed by co-expression of PDEs (PDE-3A, PDE-5A and PDE-9A) and was re-triggered by the corresponding PDE inhibitor drugs (Pletal(®), Perfan(®), Primacor(®) (PDE-3A), Viagra(®), Levitra(®), Cialis(®) (PDE-5A) and BAY73-6691 (PDE-9A)). Mice implanted with microencapsulated designer cells co-expressing the GTA/PGTA device with NPR-A and PDE-5A showed control of blood SEAP levels through administration of sildenafil (Viagra(®)). Designer cells engineered for PDE inhibitor-modulated transgene expression may provide a cell-based PDE-targeting drug discovery platform and enable drug-adjusted gene- and cell-based therapies. Copyright © 2015 International Metabolic Engineering Society. Published by Elsevier Inc. All rights reserved.

  4. Transformation and model choice for RNA-seq co-expression analysis.

    PubMed

    Rau, Andrea; Maugis-Rabusseau, Cathy

    2018-05-01

    Although a large number of clustering algorithms have been proposed to identify groups of co-expressed genes from microarray data, the question of if and how such methods may be applied to RNA sequencing (RNA-seq) data remains unaddressed. In this work, we investigate the use of data transformations in conjunction with Gaussian mixture models for RNA-seq co-expression analyses, as well as a penalized model selection criterion to select both an appropriate transformation and number of clusters present in the data. This approach has the advantage of accounting for per-cluster correlation structures among samples, which can be strong in RNA-seq data. In addition, it provides a rigorous statistical framework for parameter estimation, an objective assessment of data transformations and number of clusters and the possibility of performing diagnostic checks on the quality and homogeneity of the identified clusters. We analyze four varied RNA-seq data sets to illustrate the use of transformations and model selection in conjunction with Gaussian mixture models. Finally, we propose a Bioconductor package coseq (co-expression of RNA-seq data) to facilitate implementation and visualization of the recommended RNA-seq co-expression analyses.

  5. TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES

    PubMed Central

    Zhu, Lingxue; Lei, Jing; Devlin, Bernie; Roeder, Kathryn

    2017-01-01

    Scientists routinely compare gene expression levels in cases versus controls in part to determine genes associated with a disease. Similarly, detecting case-control differences in co-expression among genes can be critical to understanding complex human diseases; however statistical methods have been limited by the high dimensional nature of this problem. In this paper, we construct a sparse-Leading-Eigenvalue-Driven (sLED) test for comparing two high-dimensional covariance matrices. By focusing on the spectrum of the differential matrix, sLED provides a novel perspective that accommodates what we assume to be common, namely sparse and weak signals in gene expression data, and it is closely related with Sparse Principal Component Analysis. We prove that sLED achieves full power asymptotically under mild assumptions, and simulation studies verify that it outperforms other existing procedures under many biologically plausible scenarios. Applying sLED to the largest gene-expression dataset obtained from post-mortem brain tissue from Schizophrenia patients and controls, we provide a novel list of genes implicated in Schizophrenia and reveal intriguing patterns in gene co-expression change for Schizophrenia subjects. We also illustrate that sLED can be generalized to compare other gene-gene “relationship” matrices that are of practical interest, such as the weighted adjacency matrices. PMID:29081874

  6. TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES.

    PubMed

    Zhu, Lingxue; Lei, Jing; Devlin, Bernie; Roeder, Kathryn

    2017-09-01

    Scientists routinely compare gene expression levels in cases versus controls in part to determine genes associated with a disease. Similarly, detecting case-control differences in co-expression among genes can be critical to understanding complex human diseases; however statistical methods have been limited by the high dimensional nature of this problem. In this paper, we construct a sparse-Leading-Eigenvalue-Driven (sLED) test for comparing two high-dimensional covariance matrices. By focusing on the spectrum of the differential matrix, sLED provides a novel perspective that accommodates what we assume to be common, namely sparse and weak signals in gene expression data, and it is closely related with Sparse Principal Component Analysis. We prove that sLED achieves full power asymptotically under mild assumptions, and simulation studies verify that it outperforms other existing procedures under many biologically plausible scenarios. Applying sLED to the largest gene-expression dataset obtained from post-mortem brain tissue from Schizophrenia patients and controls, we provide a novel list of genes implicated in Schizophrenia and reveal intriguing patterns in gene co-expression change for Schizophrenia subjects. We also illustrate that sLED can be generalized to compare other gene-gene "relationship" matrices that are of practical interest, such as the weighted adjacency matrices.

  7. Co-expression of Exo-inulinase and Endo-inulinase Genes in the Oleaginous Yeast Yarrowia lipolytica for Efficient Single Cell Oil Production from Inulin.

    PubMed

    Shi, Nianci; Mao, Weian; He, Xiaoxia; Chi, Zhe; Chi, Zhenming; Liu, Guanglei

    2018-05-01

    Yarrowia lipolytica is a promising platform for the single cell oil (SCO) production. In this study, a transformant X+N8 in which exo- and endo-inulinase genes were co-expressed could produce an inulinase activity of 124.33 U/mL within 72 h. However, the inulinase activity of a transformant X2 carrying a single exo-inulinase gene was only 47.33 U/mL within 72 h. Moreover, the transformant X+N8 could accumulate 48.13% (w/w) SCO from inulin and the cell dry weight reached 13.63 g/L within 78 h, which were significantly higher than those of the transformant X2 (41.87% (w/w) and 11.23 g/L) under the same conditions. In addition, inulin hydrolysis and utilization of the transformant X+N8 were also more efficient than those of the transformant X2 during the fermentation process. These results demonstrated that the co-expression of the exo- and endo-inulinase genes significantly enhanced the SCO production from inulin due to the improvement of the inulinase activity and the synergistic action of exo- and endo-inulinase. Besides, over 95.01% of the fatty acids from the transformant X+N8 were C16-C18, especially C18:1 (53.10%), suggesting that the fatty acids could be used as feedstock for biodiesel production.

  8. Use of transcriptomics and co-expression networks to analyze the interconnections between nitrogen assimilation and photorespiratory metabolism

    PubMed Central

    Pérez-Delgado, Carmen M.; Moyano, Tomás C.; García-Calderón, Margarita; Canales, Javier; Gutiérrez, Rodrigo A.; Márquez, Antonio J.; Betti, Marco

    2016-01-01

    Nitrogen is one of the most important nutrients for plants and, in natural soils, its availability is often a major limiting factor for plant growth. Here we examine the effect of different forms of nitrogen nutrition and of photorespiration on gene expression in the model legume Lotus japonicus with the aim of identifying regulatory candidate genes co-ordinating primary nitrogen assimilation and photorespiration. The transcriptomic changes produced by the use of different nitrogen sources in leaves of L. japonicus plants combined with the transcriptomic changes produced in the same tissue by different photorespiratory conditions were examined. The results obtained provide novel information on the possible role of plastidic glutamine synthetase in the response to different nitrogen sources and in the C/N balance of L. japonicus plants. The use of gene co-expression networks establishes a clear relationship between photorespiration and primary nitrogen assimilation and identifies possible transcription factors connected to the genes of both routes. PMID:27117340

  9. Molecular characterization and silk gland expression of Bombyx engrailed and invected genes.

    PubMed Central

    Hui, C C; Matsuno, K; Ueno, K; Suzuki, Y

    1992-01-01

    Genetic analysis in Drosophila has shown that engrailed (en) plays an important role in segmentation and neurogenesis. A closely related gene, invected (in), is coexpressed with en in the posterior developmental compartments where en is known to specify cell state. We report here the isolation of two en-like cDNAs from the middle silk glands of Bombyx mori larvae. Sequence analysis revealed that they are the counterparts of Drosophila en and in. Four highly conserved domains, including the homeodomain, were identified in these En and In proteins from Bombyx and Drosophila. In addition, two en-specific and one in-specific domains could also be found. These structurally homologous genes might share a similar role in Bombyx development. They were found to be coexpressed in the middle silk gland but not in the posterior silk gland during the fourth molt/fifth intermolt period. We speculate that these Bombyx en-like genes might be involved in the compartmentalization of the silk gland. Images PMID:1346065

  10. Using the shared genetics of dystonia and ataxia to unravel their pathogenesis

    PubMed Central

    Nibbeling, Esther A.R.; Delnooz, Cathérine C.S.; de Koning, Tom J.; Sinke, Richard J.; Jinnah, Hyder A.; Tijssen, Marina A.J.; Verbeek, Dineke S.

    2018-01-01

    In this review we explore the similarities between spinocerebellar ataxias and dystonias, and suggest potentially shared molecular pathways using a gene co-expression network approach. The spinocerebellar ataxias are a group of neurodegenerative disorders characterized by coordination problems caused mainly by atrophy of the cerebellum. The dystonias are another group of neurological movement disorders linked to basal ganglia dysfunction, although evidence is now pointing to cerebellar involvement as well. Our gene co-expression network approach identified 99 shared genes and showed the involvement of two major pathways: synaptic transmission and neurodevelopment. These pathways overlapped in the two disorders, with a large role for GABAergic signaling in both. The overlapping pathways may provide novel targets for disease therapies. We need to prioritize variants obtained by whole exome sequencing in the genes associated with these pathways in the search for new pathogenic variants, which can than be used to help in the genetic counseling of patients and their families. PMID:28143763

  11. oPOSSUM: integrated tools for analysis of regulatory motif over-representation

    PubMed Central

    Ho Sui, Shannan J.; Fulton, Debra L.; Arenillas, David J.; Kwon, Andrew T.; Wasserman, Wyeth W.

    2007-01-01

    The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/ PMID:17576675

  12. Controlled Aggregation of Ferritin to Modulate MRI Relaxivity

    PubMed Central

    Bennett, Kevin M.; Shapiro, Erik M.; Sotak, Christopher H.; Koretsky, Alan P.

    2008-01-01

    Ferritin is an iron storage protein expressed in varying concentrations in mammalian cells. The deposition of ferric iron in the core of ferritin makes it a magnetic resonance imaging contrast agent, and ferritin has recently been proposed as a gene expression reporter protein for magnetic resonance imaging. To date, ferritin has been overexpressed in vivo and has been coexpressed with transferrin receptor to increase iron loading in cells. However, ferritin has a relatively low T2 relaxivity (R2 ≈ 1 mM−1s−1) at typical magnetic field strengths and so requires high levels of expression to be detected. One way to modulate the transverse relaxivity of a superparamagnetic agent is to cause it to aggregate, thereby manipulating the magnetic field gradients through which water diffuses. In this work, it is demonstrated by computer simulation and in vitro that aggregation of ferritin can alter relaxivity. The effects of aggregate size and intraaggregate perturber spacing on R2 are studied. Computer modeling indicates that the optimal spacing of the ferritin molecules in aggregate for increasing R2 is 100–200 nm for a typical range of water diffusion rates. Chemical cross-linking of ferritin at 12 Å spacing led to a 70% increase in R2 compared to uncross-linked ferritin controls. To modulate ferritin aggregation in a potentially biologically relevant manner, ferritin was attached to actin and polymerized in vitro. The polymerization of ferritin-F-actin caused a 20% increase in R2 compared to unpolymerized ferritin-G-actin. The R2-value was increased by another 10% by spacing the ferritin farther apart on the actin filaments. The modulation of ferritin aggregation by binding to cytoskeletal elements may be a useful strategy to make a functional reporter gene for magnetic resonance imaging. PMID:18326661

  13. Differential Network Analyses of Alzheimer’s Disease Identify Early Events in Alzheimer’s Disease Pathology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xia, Jing; Rocke, David M.; Perry, George

    In late-onset Alzheimer’s disease (AD), multiple brain regions are not affected simultaneously. Comparing the gene expression of the affected regions to identify the differences in the biological processes perturbed can lead to greater insight into AD pathogenesis and early characteristics. We identified differentially expressed (DE) genes from single cell microarray data of four AD affected brain regions: entorhinal cortex (EC), hippocampus (HIP), posterior cingulate cortex (PCC), and middle temporal gyrus (MTG). We organized the DE genes in the four brain regions into region-specific gene coexpression networks. Differential neighborhood analyses in the coexpression networks were performed to identify genes with lowmore » topological overlap (TO) of their direct neighbors. The low TO genes were used to characterize the biological differences between two regions. Our analyses show that increased oxidative stress, along with alterations in lipid metabolism in neurons, may be some of the very early events occurring in AD pathology. Cellular defense mechanisms try to intervene but fail, finally resulting in AD pathology as the disease progresses. Furthermore, disease annotation of the low TO genes in two independent protein interaction networks has resulted in association between cancer, diabetes, renal diseases, and cardiovascular diseases.« less

  14. NFκB pathway analysis: An approach to analyze gene co-expression networks employing feedback cycles.

    PubMed

    Dillenburg, Fabiane Cristine; Zanotto-Filho, Alfeu; Fonseca Moreira, José Cláudio; Ribeiro, Leila; Carro, Luigi

    2018-02-01

    The genes of the NFκB pathway are involved in the control of a plethora of biological processes ranking from inhibition of apoptosis to metastasis in cancer. It has been described that Gliobastoma multiforme (GBM) patients carry aberrant NFκB activation, but the molecular mechanisms are not completely understood. Here, we present a NFκB pathway analysis in tumor specimens of GBM compared to non-neoplasic brain tissues, based on the different kind of cycles found among genes of a gene co-expression network constructed using quantized data obtained from the microarrays. A cycle is a closed walk with all vertices distinct (except the first and last). Thanks to this way of finding relations among genes, a more robust interpretation of gene correlations is possible, because the cycles are associated with feedback mechanisms that are very common in biological networks. In GBM samples, we could conclude that the stoichiometric relationship between genes involved in NFκB pathway regulation is unbalanced. This can be measured and explained by the identification of a cycle. This conclusion helps to understand more about the biology of this type of tumor. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Differential Network Analyses of Alzheimer’s Disease Identify Early Events in Alzheimer’s Disease Pathology

    DOE PAGES

    Xia, Jing; Rocke, David M.; Perry, George; ...

    2014-01-01

    In late-onset Alzheimer’s disease (AD), multiple brain regions are not affected simultaneously. Comparing the gene expression of the affected regions to identify the differences in the biological processes perturbed can lead to greater insight into AD pathogenesis and early characteristics. We identified differentially expressed (DE) genes from single cell microarray data of four AD affected brain regions: entorhinal cortex (EC), hippocampus (HIP), posterior cingulate cortex (PCC), and middle temporal gyrus (MTG). We organized the DE genes in the four brain regions into region-specific gene coexpression networks. Differential neighborhood analyses in the coexpression networks were performed to identify genes with lowmore » topological overlap (TO) of their direct neighbors. The low TO genes were used to characterize the biological differences between two regions. Our analyses show that increased oxidative stress, along with alterations in lipid metabolism in neurons, may be some of the very early events occurring in AD pathology. Cellular defense mechanisms try to intervene but fail, finally resulting in AD pathology as the disease progresses. Furthermore, disease annotation of the low TO genes in two independent protein interaction networks has resulted in association between cancer, diabetes, renal diseases, and cardiovascular diseases.« less

  16. [Weighted gene co-expression network analysis in biomedicine research].

    PubMed

    Liu, Wei; Li, Li; Ye, Hua; Tu, Wei

    2017-11-25

    High-throughput biological technologies are now widely applied in biology and medicine, allowing scientists to monitor thousands of parameters simultaneously in a specific sample. However, it is still an enormous challenge to mine useful information from high-throughput data. The emergence of network biology provides deeper insights into complex bio-system and reveals the modularity in tissue/cellular networks. Correlation networks are increasingly used in bioinformatics applications. Weighted gene co-expression network analysis (WGCNA) tool can detect clusters of highly correlated genes. Therefore, we systematically reviewed the application of WGCNA in the study of disease diagnosis, pathogenesis and other related fields. First, we introduced principle, workflow, advantages and disadvantages of WGCNA. Second, we presented the application of WGCNA in disease, physiology, drug, evolution and genome annotation. Then, we indicated the application of WGCNA in newly developed high-throughput methods. We hope this review will help to promote the application of WGCNA in biomedicine research.

  17. Putative carotenoid genes expressed under the regulation of Shine-Dalgarno regions in Escherichia coli for efficient lycopene production.

    PubMed

    Jin, Weiyue; Xu, Xian; Jiang, Ling; Zhang, Zhidong; Li, Shuang; Huang, He

    2015-11-01

    Putative genes crtE, crtB, and crtI from Deinococcus wulumiqiensis R12, a novel species, were identified by genome mining and were co-expressed using the optimized Shine-Dalgarno (SD) regions to improve lycopene yield. A lycopene biosynthesis pathway was constructed by co-expressing these three genes in Escherichia coli. After optimizing the upstream SD regions and the culture medium, the recombinant strain EDW11 produced 88 mg lycopene g(-1) dry cell wt (780 mg lycopene l(-1)) after 40 h fermentation without IPTG induction, while the strain EDW without optimized SD regions only produced 49 mg lycopene g(-1) dry cell wt (417 mg lycopene l(-1)). Based on the optimization of the upstream SD regions and culture medium, the yield of the strain EDW11 reached a high level during microbial lycopene production until now.

  18. Molecular networks and the evolution of human cognitive specializations.

    PubMed

    Fontenot, Miles; Konopka, Genevieve

    2014-12-01

    Inroads into elucidating the origins of human cognitive specializations have taken many forms, including genetic, genomic, anatomical, and behavioral assays that typically compare humans to non-human primates. While the integration of all of these approaches is essential for ultimately understanding human cognition, here, we review the usefulness of coexpression network analysis for specifically addressing this question. An increasing number of studies have incorporated coexpression networks into brain expression studies comparing species, disease versus control tissue, brain regions, or developmental time periods. A clearer picture has emerged of the key genes driving brain evolution, as well as the developmental and regional contributions of gene expression patterns important for normal brain development and those misregulated in cognitive diseases. Copyright © 2014 Elsevier Ltd. All rights reserved.

  19. An atlas of gene expression and gene co-regulation in the human retina.

    PubMed

    Pinelli, Michele; Carissimo, Annamaria; Cutillo, Luisa; Lai, Ching-Hung; Mutarelli, Margherita; Moretti, Maria Nicoletta; Singh, Marwah Veer; Karali, Marianthi; Carrella, Diego; Pizzo, Mariateresa; Russo, Francesco; Ferrari, Stefano; Ponzin, Diego; Angelini, Claudia; Banfi, Sandro; di Bernardo, Diego

    2016-07-08

    The human retina is a specialized tissue involved in light stimulus transduction. Despite its unique biology, an accurate reference transcriptome is still missing. Here, we performed gene expression analysis (RNA-seq) of 50 retinal samples from non-visually impaired post-mortem donors. We identified novel transcripts with high confidence (Observed Transcriptome (ObsT)) and quantified the expression level of known transcripts (Reference Transcriptome (RefT)). The ObsT included 77 623 transcripts (23 960 genes) covering 137 Mb (35 Mb new transcribed genome). Most of the transcripts (92%) were multi-exonic: 81% with known isoforms, 16% with new isoforms and 3% belonging to new genes. The RefT included 13 792 genes across 94 521 known transcripts. Mitochondrial genes were among the most highly expressed, accounting for about 10% of the reads. Of all the protein-coding genes in Gencode, 65% are expressed in the retina. We exploited inter-individual variability in gene expression to infer a gene co-expression network and to identify genes specifically expressed in photoreceptor cells. We experimentally validated the photoreceptors localization of three genes in human retina that had not been previously reported. RNA-seq data and the gene co-expression network are available online (http://retina.tigem.it). © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns.

    PubMed

    Gruel, Jérémy; LeBorgne, Michel; LeMeur, Nolwenn; Théret, Nathalie

    2011-09-12

    Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks.

  1. Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns

    PubMed Central

    2011-01-01

    Background Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Results Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Conclusions Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks. PMID:21910886

  2. Dissecting the chromatin interactome of microRNA genes.

    PubMed

    Chen, Dijun; Fu, Liang-Yu; Zhang, Zhao; Li, Guoliang; Zhang, Hang; Jiang, Li; Harrison, Andrew P; Shanahan, Hugh P; Klukas, Christian; Zhang, Hong-Yu; Ruan, Yijun; Chen, Ling-Ling; Chen, Ming

    2014-03-01

    Our knowledge of the role of higher-order chromatin structures in transcription of microRNA genes (MIRs) is evolving rapidly. Here we investigate the effect of 3D architecture of chromatin on the transcriptional regulation of MIRs. We demonstrate that MIRs have transcriptional features that are similar to protein-coding genes. RNA polymerase II-associated ChIA-PET data reveal that many groups of MIRs and protein-coding genes are organized into functionally compartmentalized chromatin communities and undergo coordinated expression when their genomic loci are spatially colocated. We observe that MIRs display widespread communication in those transcriptionally active communities. Moreover, miRNA-target interactions are significantly enriched among communities with functional homogeneity while depleted from the same community from which they originated, suggesting MIRs coordinating function-related pathways at posttranscriptional level. Further investigation demonstrates the existence of spatial MIR-MIR chromatin interacting networks. We show that groups of spatially coordinated MIRs are frequently from the same family and involved in the same disease category. The spatial interaction network possesses both common and cell-specific subnetwork modules that result from the spatial organization of chromatin within different cell types. Together, our study unveils an entirely unexplored layer of MIR regulation throughout the human genome that links the spatial coordination of MIRs to their co-expression and function.

  3. E-cadherin determines Caveolin-1 tumor suppression or metastasis enhancing function in melanoma cells

    PubMed Central

    Lobos-González, L; Aguilar, L; Diaz, J; Diaz, N; Urra, H; Torres, V; Silva, V; Fitzpatrick, C; Lladser, A; Hoek, K.S.; Leyton, L; Quest, AFG

    2013-01-01

    SUMMARY The role of caveolin-1 (CAV1) in cancer is highly controversial. CAV1 suppresses genes that favor tumor development, yet also promotes focal adhesion turnover and migration of metastatic cells. How these contrasting observations relate to CAV1 function in vivo is unclear. Our previous studies implicate E-cadherin in CAV1-dependent tumor suppression. Here we use murine melanoma B16F10 cells, with low levels of endogenous CAV1 and E-cadherin, to unravel how CAV1 affects tumor growth and metastasis, and to assess how co-expression of E-cadherin modulates CAV1 function in vivo in C57BL/6 mice. We find that overexpression of CAV1 in B16F10(cav-1) cells reduces subcutaneous tumor formation, but enhances metastasis relative to control cells. Furthermore, E-cadherin expression in B16F10(E-cad) cells reduces subcutaneous tumor formation, and lung metastasis when intravenously injected. Importantly, co-expression of CAV1 and E-cadherin in B16F10(cav1/E-cad) cells abolishes tumor formation, lung metastasis, increased Rac-1 activity and cell migration observed with B16F10(cav-1) cells. Finally, consistent with the notion that CAV1 participates in switching human melanomas to a more malignant phenotype, elevated levels of CAV1 expression correlated with enhanced migration and Rac-1 activation in these cells. PMID:23470013

  4. Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice.

    PubMed

    Smita, Shuchi; Katiyar, Amit; Chinnusamy, Viswanathan; Pandey, Dev M; Bansal, Kailash C

    2015-01-01

    MYB transcription factor (TF) is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by "top-down" and "guide-gene" approaches. More than 50% of OsMYBs were strongly correlated under 50 experimental conditions with 51 hub genes via "top-down" approach. Further, clusters were identified using Markov Clustering (MCL). To maximize the clustering performance, parameter evaluation of the MCL inflation score (I) was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by "guide-gene" approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. Specially, enrichment of MYB binding regions involved in drought-inducibility implying their regulatory role in drought response in rice. Thus, the co-regulatory network analysis facilitated the identification of complex OsMYB regulatory networks, and candidate target regulon genes of selected guide MYB genes. The results contribute to the candidate gene screening, and experimentally testable hypotheses for potential regulatory MYB TFs, and their targets under stress conditions.

  5. MiRNA-miRNA synergistic network: construction via co-regulating functional modules and disease miRNA topological features.

    PubMed

    Xu, Juan; Li, Chuan-Xing; Li, Yong-Sheng; Lv, Jun-Ying; Ma, Ye; Shao, Ting-Ting; Xu, Liang-De; Wang, Ying-Ying; Du, Lei; Zhang, Yun-Peng; Jiang, Wei; Li, Chun-Quan; Xiao, Yun; Li, Xia

    2011-02-01

    Synergistic regulations among multiple microRNAs (miRNAs) are important to understand the mechanisms of complex post-transcriptional regulations in humans. Complex diseases are affected by several miRNAs rather than a single miRNA. So, it is a challenge to identify miRNA synergism and thereby further determine miRNA functions at a system-wide level and investigate disease miRNA features in the miRNA-miRNA synergistic network from a new view. Here, we constructed a miRNA-miRNA functional synergistic network (MFSN) via co-regulating functional modules that have three features: common targets of corresponding miRNA pairs, enriched in the same gene ontology category and close proximity in the protein interaction network. Predicted miRNA synergism is validated by significantly high co-expression of functional modules and significantly negative regulation to functional modules. We found that the MFSN exhibits a scale free, small world and modular architecture. Furthermore, the topological features of disease miRNAs in the MFSN are distinct from non-disease miRNAs. They have more synergism, indicating their higher complexity of functions and are the global central cores of the MFSN. In addition, miRNAs associated with the same disease are close to each other. The structure of the MFSN and the features of disease miRNAs are validated to be robust using different miRNA target data sets.

  6. Gene co-expression network analysis identifies porcine genes associated with variation in Salmonella shedding

    USDA-ARS?s Scientific Manuscript database

    Salmonella enterica serovar Typhimurium is a gram-negative bacterium that can colonize the gut of humans and several species of food producing farm animals to cause enteric or septicaemic salmonellosis. While many studies have looked into the host genetic response to Salmonella infection, relatively...

  7. [Construction of high-yield strain by optimizing lycopene cyclase for β-carotene production].

    PubMed

    Jin, Yingfu; Han, Li; Zhang, Shasha; Li, Shizhong; Liu, Weifeng; Tao, Yong

    2017-11-25

    To optimize key enzymes, such as to explore the gene resources and to modify the expression level, can maximize metabolic pathways of target products. β-carotene is a terpenoid compound with important application value. Lycopene cyclase (CrtY) is the key enzyme in β-carotene biosynthesis pathway, catalyzing flavin adenine dinucleotide (FAD)-dependent cyclization reaction and β-carotene synthesis from lycopene precursor. We optimized lycopene cyclase (CrtY) to improve the synthesis of β-carotene and determined the effect of CrtY expression on metabolic pathways. Frist, we developed a β-carotene synthesis module by coexpressing the lycopene β-cyclase gene crtY with crtEBI module in Escherichia coli. Then we simultaneously optimized the ribosome-binding site (RBS) intensity and the species of crtY using oligo-linker mediated DNA assembly method (OLMA). Five strains with high β-carotene production capacity were screened out from the OLMA library. The β-carotene yields of these strains were up to 15.79-18.90 mg/g DCW (Dry cell weight), 65% higher than that of the original strain at shake flask level. The optimal strain CP12 was further identified and evaluated for β-carotene production at 5 L fermentation level. After process optimization, the final β-carotene yield could reach to 1.9 g/L. The results of RBS strength and metabolic intermediate analysis indicated that an appropriate expression level of CrtY could be beneficial for the function of the β-carotene synthesis module. The results of this study provide important insight into the optimization of β-carotene synthesis pathway in metabolic engineering.

  8. Gene Coexpression Analysis Reveals Complex Metabolism of the Monoterpene Alcohol Linalool in Arabidopsis Flowers[W][OPEN

    PubMed Central

    Ginglinger, Jean-François; Boachon, Benoit; Höfer, René; Paetz, Christian; Köllner, Tobias G.; Miesch, Laurence; Lugan, Raphael; Baltenweck, Raymonde; Mutterer, Jérôme; Ullmann, Pascaline; Beran, Franziska; Claudel, Patricia; Verstappen, Francel; Fischer, Marc J.C.; Karst, Francis; Bouwmeester, Harro; Miesch, Michel; Schneider, Bernd; Gershenzon, Jonathan; Ehlting, Jürgen; Werck-Reichhart, Danièle

    2013-01-01

    The cytochrome P450 family encompasses the largest family of enzymes in plant metabolism, and the functions of many of its members in Arabidopsis thaliana are still unknown. Gene coexpression analysis pointed to two P450s that were coexpressed with two monoterpene synthases in flowers and were thus predicted to be involved in monoterpenoid metabolism. We show that all four selected genes, the two terpene synthases (TPS10 and TPS14) and the two cytochrome P450s (CYP71B31 and CYP76C3), are simultaneously expressed at anthesis, mainly in upper anther filaments and in petals. Upon transient expression in Nicotiana benthamiana, the TPS enzymes colocalize in vesicular structures associated with the plastid surface, whereas the P450 proteins were detected in the endoplasmic reticulum. Whether they were expressed in Saccharomyces cerevisiae or in N. benthamiana, the TPS enzymes formed two different enantiomers of linalool: (−)-(R)-linalool for TPS10 and (+)-(S)-linalool for TPS14. Both P450 enzymes metabolize the two linalool enantiomers to form different but overlapping sets of hydroxylated or epoxidized products. These oxygenated products are not emitted into the floral headspace, but accumulate in floral tissues as further converted or conjugated metabolites. This work reveals complex linalool metabolism in Arabidopsis flowers, the ecological role of which remains to be determined. PMID:24285789

  9. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Harwood, Caroline S

    The goal of this project is to identify gene networks that are critical for efficient biohydrogen production by leveraging variation in gene content and gene expression in independently isolated Rhodopseudomonas palustris strains. Coexpression methods were applied to large data sets that we have collected to define probabilistic causal gene networks. To our knowledge this a first systems level approach that takes advantage of strain-to strain variability to computationally define networks critical for a particular bacterial phenotypic trait.

  10. Modulation of mutant Huntingtin aggregates and toxicity by human myeloid leukemia factors.

    PubMed

    Banerjee, Manisha; Datta, Moumita; Bhattacharyya, Nitai P

    2017-01-01

    Increased poly glutamine (polyQ) stretch at N-terminal of Huntingtin (HTT) causes Huntington's disease. HTT interacts with large number of proteins, although the preference for such interactions with wild type or mutated HTT protein remains largely unknown. HYPK, an intrinsically unstructured protein chaperone and interactor of mutant HTT was found to interact with myeloid leukemia factor 1 (MLF1) and 2 (MLF2). To identify the role of these two proteins in mutant HTT mediated aggregate formation and toxicity in a cell model, both the proteins were found to preferentially interact with the mutated N-terminal HTT. They significantly reduced the number of cells containing mutant HTT aggregates and subsequent apoptosis in Neuro2A cells. Additionally, in FRAP assay, mobile fraction of mutant HTT aggregates was increased in the presence of MLF1 or MLF2. Further, MLF1 could release transcription factors like p53, CBP and CREB from mutant HTT aggregates. Moreover, in HeLa cell co-expressing mutant HTT exon1 and full length MLF1, p53 was released from the aggregates, leading to the recovery of the expression of the GADD45A transcript, a p53 regulated gene. Taking together, these results showed that MLF1 and MLF2 modulated the formation of aggregates and induction of apoptosis as well as the expressions of genes indirectly. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Effects of Trx2p and Sec23p expression on stable production of hepatitis B surface antigen S domain in recombinant Saccharomyces cerevisiae.

    PubMed

    Park, Young-Kyoung; Jung, Sang-Min; Lim, Hyung-Kwon; Son, Young-Jin; Park, Yong-Cheol; Seo, Jin-Ho

    2012-08-31

    The S domain of hepatitis B virus surface antigen (sHBsAg) is the primary component for vaccine development against virus infection. For stable expression of sHBsAg in recombinant Saccharomyces cerevisiae, new accessory genes necessary for foreign protein expression were screened by DNA microarray. Among 600 genes of interest, genes coding for an activating protein of ATPase in Hsp90 (Aha1p), S. cerevisiae DnaJ (Scj1p), thioredoxin 2 (Trx2p) and a GTPase-activator specific for Sar1 (Sec23p) as well as Pdi1p were selected in transcriptome analysis, which are known to facilitate disulfide bond formation or induce protein transport in the secretion pathway. Individual and combinatorial expression of SEC23, TRX2 and PDI1 increased total sHBsAg concentration by 1.9-6.5-fold, relative to the control strain expressing sHBsAg only. Additionally, moderate expression of Kex2p protease able to cut off the signal peptide enhanced the portion of the authentic sHBsAg to total sHBsAg. Fed-batch fermentation of the S. cerevisiae 2805 strain coexpressing the sHBsAg, SEC23, PDI1 and KEX2 genes resulted in 70.6mg/L final sHBsAg concentration which was 5.6 times higher than that of the control. Transmission electron microscopic analysis of the yeast cells elucidated the effects of the accessory gene coexpression on the intracellular localization of sHBsAg. Like PDI1, coexpression of both SEC23 and/or TRX2 newly isolated in this study is expected to improve the target protein expression in S. cerevisiae. Copyright © 2012 Elsevier B.V. All rights reserved.

  12. Temporal and spatial transcriptomic and microRNA dynamics of CAM photosynthesis in pineapple.

    PubMed

    Wai, Ching M; VanBuren, Robert; Zhang, Jisen; Huang, Lixian; Miao, Wenjing; Edger, Patrick P; Yim, Won C; Priest, Henry D; Meyers, Blake C; Mockler, Todd; Smith, J Andrew C; Cushman, John C; Ming, Ray

    2017-10-01

    The altered carbon assimilation pathway of crassulacean acid metabolism (CAM) photosynthesis results in an up to 80% higher water-use efficiency than C 3 photosynthesis in plants making it a potentially useful pathway for engineering crop plants with improved drought tolerance. Here we surveyed detailed temporal (diel time course) and spatial (across a leaf gradient) gene and microRNA (miRNA) expression patterns in the obligate CAM plant pineapple [Ananas comosus (L.) Merr.]. The high-resolution transcriptome atlas allowed us to distinguish between CAM-related and non-CAM gene copies. A differential gene co-expression network across green and white leaf diel datasets identified genes with circadian oscillation, CAM-related functions, and source-sink relations. Gene co-expression clusters containing CAM pathway genes are enriched with clock-associated cis-elements, suggesting circadian regulation of CAM. About 20% of pineapple microRNAs have diel expression patterns, with several that target key CAM-related genes. Expression and physiology data provide a model for CAM-specific carbohydrate flux and long-distance hexose transport. Together these resources provide a list of candidate genes for targeted engineering of CAM into C 3 photosynthesis crop species. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  13. Integrated in silico analyses of regulatory and metabolic networks of Synechococcus sp. PCC 7002 reveal relationships between gene centrality and essentiality

    DOE PAGES

    Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.; ...

    2015-03-27

    Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termedmore » as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.« less

  14. Integrated in silico analyses of regulatory and metabolic networks of Synechococcus sp. PCC 7002 reveal relationships between gene centrality and essentiality

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.

    Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termedmore » as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.« less

  15. A mental retardation gene, motopsin/prss12, modulates cell morphology by interaction with seizure-related gene 6.

    PubMed

    Mitsui, Shinichi; Hidaka, Chiharu; Furihata, Mutsuo; Osako, Yoji; Yuri, Kazunari

    2013-07-12

    A serine protease, motopsin (prss12), plays a significant role in cognitive function and the development of the brain, since the loss of motopsin function causes severe mental retardation in humans and enhances social behavior in mice. Motopsin is activity-dependently secreted from neuronal cells, is captured around the synaptic cleft, and cleaves a proteoglycan, agrin. The multi-domain structure of motopsin, consisting of a signal peptide, a proline-rich domain, a kringle domain, three scavenger receptor cysteine-rich domains, and a protease domain at the C-terminal, suggests the interaction with other molecules through these domains. To identify a protein interacting with motopsin, we performed yeast two-hybrid screening and found that seizure-related gene 6 (sez-6), a transmembrane protein on the plasma membrane of neuronal cells, bound to the proline-rich/kringle domain of motopsin. Pull-down and immunoprecipitation analyses indicated the interaction between these proteins. Immunocytochemical and immunohistochemical analyses suggested the co-localization of motopsin and sez-6 at neuronal cells in the developmental mouse brain and at motor neurons in the anterior horn of human spinal cords. Transient expression of motopsin in neuro2a cells increased the number and length of neurites as well as the level of neurite branching. Interestingly, co-expression of sez-6 with motopsin restored the effect of motopsin at the basal level, while sez-6 expression alone showed no effects on cell morphology. Our results suggest that the interaction of motopsin and sez-6 modulates the neuronal cell morphology. Copyright © 2013 Elsevier Inc. All rights reserved.

  16. Involvement of an ABI-like protein and a Ca2+-ATPase in drought tolerance as revealed by transcript profiling of a sweetpotato somatic hybrid and its parents Ipomoea batatas (L.) Lam. and I. triloba L.

    PubMed Central

    Jia, Licong; Yang, Guohong; Xu, Xinzhi; Zhai, Hong; He, Shaozhen; Li, Junxia; Dai, Xiaodong; Qin, Na; Zhu, Cancan

    2018-01-01

    Previously, we obtained the sweetpotato somatic hybrid KT1 from a cross between sweetpotato (Ipomoea batatas (L.) Lam.) cv. Kokei No. 14 and its drought-tolerant wild relative I. triloba L. KT1 not only inherited the thick storage root characteristic of Kokei No. 14 but also the drought-tolerance trait of I. triloba L. The aim of this study was to explore the molecular mechanism of the drought tolerance of KT1. Four-week-old in vitro-grown plants of KT1, Kokei No. 14, and I. triloba L. were subjected to a simulated drought stress treatment (30% PEG6000) for 0, 6, 12 and 24 h. Total RNA was extracted from samples at each time point, and then used for transcriptome sequencing. The gene transcript profiles of KT1 and its parents were compared to identify differentially expressed genes, and drought-related modules were screened by a weighted gene co-expression network analysis. The functions of ABI-like protein and Ca2+-ATPase, two proteins screened from the cyan and light yellow modules, were analyzed in terms of their potential roles in drought tolerance in KT1 and its parents. These analyses of the drought responses of KT1 and its somatic donors at the transcriptional level provide new annotations for the molecular mechanism of drought tolerance in the somatic hybrid KT1 and its parents. PMID:29466419

  17. Comparative transcript profiling of alloplasmic male-sterile lines revealed altered gene expression related to pollen development in rice (Oryza sativa L.).

    PubMed

    Hu, Jihong; Chen, Guanglong; Zhang, Hongyuan; Qian, Qian; Ding, Yi

    2016-08-05

    Cytoplasmic male sterility (CMS) is an ideal model for investigating the mitochondrial-nuclear interaction and down-regulated genes in CMS lines which might be the candidate genes for pollen development in rice. In this study, a set of rice alloplasmic sporophytic CMS lines was obtained by successive backcrossing of Meixiang B, with three different cytoplasmic types: D62A (D type), ZS97A (WA type) and XQZ-A (DA type). Using microarray, the anther transcript profiles of the three indica rice CMS lines revealed 622 differentially expressed genes (DEGs) in each of the three CMS lines compared with the maintainer line Meixiang B. GO and MapMan analysis indicated that these DEGs were mainly involved in lipid metabolism and cell wall organization. Compared with the gene expression of sporophytic and gametophytic CMS lines, 303 DEGs were identified and 56 of them were down-regulated in all the CMS lines of rice. These down-regulated DEGs in the CMS lines were found to be involved in tapetum or cell wall formation and their suppressed expression might be related to male sterility. Weighted gene co-expression network analysis (WGCNA) revealed that two modules were significantly associated with male sterility and many hub genes that were differentially expressed in the CMS lines. A large set of putative genes involved in anther development was identified in the present study. The results will give some information for the nuclear gene regulation by different cytoplasmic genotypes and provide a rich resource for further functional research on the pollen development in rice.

  18. IQ-domain GTPase-activating protein 1 promotes the malignant phenotype of invasive ductal breast carcinoma via canonical Wnt pathway.

    PubMed

    Zhao, Huan-Yu; Han, Yang; Wang, Jian; Yang, Lian-He; Zheng, Xiao-Ying; Du, Jiang; Wu, Guang-Ping; Wang, En-Hua

    2017-06-01

    IQ-domain GTPase-activating protein 1 is a scaffolding protein with multidomain which plays a role in modulating dishevelled (Dvl) nuclear translocation in canonical Wnt pathway. However, the biological function and mechanism of IQ-domain GTPase-activating protein 1 in invasive ductal carcinoma (IDC) remain unknown. In this study, we found that IQ-domain GTPase-activating protein 1 expression was elevated in invasive ductal carcinoma, which was positively correlated with tumor grade, lymphatic metastasis, and poor prognosis. Coexpression of IQ-domain GTPase-activating protein 1 and Dvl in the nucleus and cytoplasm of invasive ductal carcinoma was significantly correlated but not in the membrane. Postoperative survival in the patients with their coexpression in the nucleus and cytoplasm was obviously lower than that without coexpression. The positive expression rates of c-myc and cyclin D1 were significantly higher in the patients with nuclear coexpression of Dvl and IQ-domain GTPase-activating protein 1 than that with cytoplasmic coexpression, correlating with poor prognosis. IQ-domain GTPase-activating protein 1 significantly enhanced cell proliferation and invasion in invasive ductal carcinoma cell lines by interacting with Dvl in cytoplasm to promote Dvl nuclear translocation so as to upregulate the expression of c-myc and cyclin D1. Collectively, our data suggest that IQ-domain GTPase-activating protein 1 may promote the malignant phenotype of invasive ductal carcinoma via canonical Wnt signaling, and it could be used as a potential prognostic biomarker for breast cancer patients.

  19. Combinatorial Regulation of Stilbene Synthase Genes by WRKY and MYB Transcription Factors in Grapevine (Vitis vinifera L.).

    PubMed

    Vannozzi, Alessandro; Wong, Darren Chern Jan; Höll, Janine; Hmmam, Ibrahim; Matus, José Tomás; Bogs, Jochen; Ziegler, Tobias; Dry, Ian; Barcaccia, Gianni; Lucchin, Margherita

    2018-05-01

    Stilbene synthase (STS) is the key enzyme leading to the biosynthesis of resveratrol. Recently we reported two R2R3-MYB transcription factor (TF) genes that regulate the stilbene biosynthetic pathway in grapevine: VviMYB14 and VviMYB15. These genes are strongly co-expressed with STS genes under a range of stress and developmental conditions, in agreement with the specific activation of STS promoters by these TFs. Genome-wide gene co-expression analysis using two separate transcriptome compendia based on microarray and RNA sequencing data revealed that WRKY TFs were the top TF family correlated with STS genes. On the basis of correlation frequency, four WRKY genes, namely VviWRKY03, VviWRKY24, VviWRKY43 and VviWRKY53, were further shortlisted and functionally validated. Expression analyses under both unstressed and stressed conditions, together with promoter-luciferase reporter assays, suggested different hierarchies for these TFs in the regulation of the stilbene biosynthetic pathway. In particular, VviWRKY24 seems to act as a singular effector in the activation of the VviSTS29 promoter, while VviWRKY03 acts through a combinatorial effect with VviMYB14, suggesting that these two regulators may interact at the protein level as previously reported in other species.

  20. Network-based expression analyses and experimental validations revealed high co-expression between Yap1 and stem cell markers compared to differentiated cells.

    PubMed

    Dehghanian, Fariba; Hojati, Zohreh; Esmaeili, Fariba; Masoudi-Nejad, Ali

    2018-05-21

    The Hippo signaling pathway is identified as a potential regulatory pathway which plays critical roles in differentiation and stem cell self-renewal. Yap1 is a primary transcriptional effector of this pathway. The importance of Yap1 in embryonic stem cells (ESCs) and differentiation procedure remains a challenging question, since two different observations have been reported. To answer this question we used co-expression network and differential co-expression analyses followed by experimental validations. Our results indicate that Yap1 is highly co-expressed with stem cell markers in ESCs but not in differentiated cells (DCs). The significant Yap1 down-regulation and also translocation of Yap1 into the cytoplasm during P19 differentiation was also detected. Moreover, our results suggest the E2f7, Lin28a and Dppa4 genes as possible regulatory nuclear factors of Hippo pathway in stem cells. The present findings are actively consistent with studies that suggested Yap1 as an essential factor for stem cell self-renewal. Copyright © 2018 Elsevier Inc. All rights reserved.

Top