general co-expression network-based: Topics by Science.gov

Sample records for general co-expression network-based

A novel method to identify pathways associated with renal cell carcinoma based on a gene co-expression network

PubMed Central

RUAN, XIYUN; LI, HONGYUN; LIU, BO; CHEN, JIE; ZHANG, SHIBAO; SUN, ZEQIANG; LIU, SHUANGQING; SUN, FAHAI; LIU, QINGYONG

2015-01-01

The aim of the present study was to develop a novel method for identifying pathways associated with renal cell carcinoma (RCC) based on a gene co-expression network. A framework was established where a co-expression network was derived from the database as well as various co-expression approaches. First, the backbone of the network based on differentially expressed (DE) genes between RCC patients and normal controls was constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database. The differentially co-expressed links were detected by Pearson’s correlation, the empirical Bayesian (EB) approach and Weighted Gene Co-expression Network Analysis (WGCNA). The co-expressed gene pairs were merged by a rank-based algorithm. We obtained 842; 371; 2,883 and 1,595 co-expressed gene pairs from the co-expression networks of the STRING database, Pearson’s correlation EB method and WGCNA, respectively. Two hundred and eighty-one differentially co-expressed (DC) gene pairs were obtained from the merged network using this novel method. Pathway enrichment analysis based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and the network enrichment analysis (NEA) method were performed to verify feasibility of the merged method. Results of the KEGG and NEA pathway analyses showed that the network was associated with RCC. The suggested method was computationally efficient to identify pathways associated with RCC and has been identified as a useful complement to traditional co-expression analysis. PMID:26058425
Analysis of bHLH coding genes using gene co-expression network approach.

PubMed

Srivastava, Swati; Sanchita; Singh, Garima; Singh, Noopur; Srivastava, Gaurava; Sharma, Ashok

2016-07-01

Network analysis provides a powerful framework for the interpretation of data. It uses novel reference network-based metrices for module evolution. These could be used to identify module of highly connected genes showing variation in co-expression network. In this study, a co-expression network-based approach was used for analyzing the genes from microarray data. Our approach consists of a simple but robust rank-based network construction. The publicly available gene expression data of Solanum tuberosum under cold and heat stresses were considered to create and analyze a gene co-expression network. The analysis provide highly co-expressed module of bHLH coding genes based on correlation values. Our approach was to analyze the variation of genes expression, according to the time period of stress through co-expression network approach. As the result, the seed genes were identified showing multiple connections with other genes in the same cluster. Seed genes were found to be vary in different time periods of stress. These analyzed seed genes may be utilized further as marker genes for developing the stress tolerant plant species.
Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.

PubMed

Xiao, Xiaolin; Moreno-Moral, Aida; Rotival, Maxime; Bottolo, Leonardo; Petretto, Enrico

2014-01-01

Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states). Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks) that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted) networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based) and humans (mRNA-sequencing-based) and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi-tissue subnetwork of co-expressed heat shock protein (Hsp) and cardiomyopathy genes (Bag3, Cryab, Kras, Emd, Plec), which was significantly replicated using separate failing heart and liver gene expression datasets in humans, thus revealing a conserved functional role for Hsp genes in cardiovascular disease.
Comparison of co-expression measures: mutual information, correlation, and model based indices.

PubMed

Song, Lin; Langfelder, Peter; Horvath, Steve

2012-12-09

Co-expression measures are often used to define networks among genes. Mutual information (MI) is often used as a generalized correlation measure. It is not clear how much MI adds beyond standard (robust) correlation measures or regression model based association measures. Further, it is important to assess what transformations of these and other co-expression measures lead to biologically meaningful modules (clusters of genes). We provide a comprehensive comparison between mutual information and several correlation measures in 8 empirical data sets and in simulations. We also study different approaches for transforming an adjacency matrix, e.g. using the topological overlap measure. Overall, we confirm close relationships between MI and correlation in all data sets which reflects the fact that most gene pairs satisfy linear or monotonic relationships. We discuss rare situations when the two measures disagree. We also compare correlation and MI based approaches when it comes to defining co-expression network modules. We show that a robust measure of correlation (the biweight midcorrelation transformed via the topological overlap transformation) leads to modules that are superior to MI based modules and maximal information coefficient (MIC) based modules in terms of gene ontology enrichment. We present a function that relates correlation to mutual information which can be used to approximate the mutual information from the corresponding correlation coefficient. We propose the use of polynomial or spline regression models as an alternative to MI for capturing non-linear relationships between quantitative variables. The biweight midcorrelation outperforms MI in terms of elucidating gene pairwise relationships. Coupled with the topological overlap matrix transformation, it often leads to more significantly enriched co-expression modules. Spline and polynomial networks form attractive alternatives to MI in case of non-linear relationships. Our results indicate that MI networks can safely be replaced by correlation networks when it comes to measuring co-expression relationships in stationary data.
Tissue and cell-type co-expression networks of transcription factors and wood component genes in Populus trichocarpa.

PubMed

Shi, Rui; Wang, Jack P; Lin, Ying-Chung; Li, Quanzi; Sun, Ying-Hsuan; Chen, Hao; Sederoff, Ronald R; Chiang, Vincent L

2017-05-01

Co-expression networks based on transcriptomes of Populus trichocarpa major tissues and specific cell types suggest redundant control of cell wall component biosynthetic genes by transcription factors in wood formation. We analyzed the transcriptomes of five tissues (xylem, phloem, shoot, leaf, and root) and two wood forming cell types (fiber and vessel) of Populus trichocarpa to assemble gene co-expression subnetworks associated with wood formation. We identified 165 transcription factors (TFs) that showed xylem-, fiber-, and vessel-specific expression. Of these 165 TFs, 101 co-expressed (correlation coefficient, r > 0.7) with the 45 secondary cell wall cellulose, hemicellulose, and lignin biosynthetic genes. Each cell wall component gene co-expressed on average with 34 TFs, suggesting redundant control of the cell wall component gene expression. Co-expression analysis showed that the 101 TFs and the 45 cell wall component genes each has two distinct groups (groups 1 and 2), based on their co-expression patterns. The group 1 TFs (44 members) are predominantly xylem and fiber specific, and are all highly positively co-expressed with the group 1 cell wall component genes (30 members), suggesting their roles as major wood formation regulators. Group 1 TFs include a lateral organ boundary domain gene (LBD) that has the highest number of positively correlated cell wall component genes (36) and TFs (47). The group 2 TFs have 57 members, including 14 vessel-specific TFs, and are generally less correlated with the cell wall component genes. An exception is a vessel-specific basic helix-loop-helix (bHLH) gene that negatively correlates with 20 cell wall component genes, and may function as a key transcriptional suppressor. The co-expression networks revealed here suggest a well-structured transcriptional homeostasis for cell wall component biosynthesis during wood formation.
An iterative network partition algorithm for accurate identification of dense network modules

PubMed Central

Sun, Siqi; Dong, Xinran; Fu, Yao; Tian, Weidong

2012-01-01

A key step in network analysis is to partition a complex network into dense modules. Currently, modularity is one of the most popular benefit functions used to partition network modules. However, recent studies suggested that it has an inherent limitation in detecting dense network modules. In this study, we observed that despite the limitation, modularity has the advantage of preserving the primary network structure of the undetected modules. Thus, we have developed a simple iterative Network Partition (iNP) algorithm to partition a network. The iNP algorithm provides a general framework in which any modularity-based algorithm can be implemented in the network partition step. Here, we tested iNP with three modularity-based algorithms: multi-step greedy (MSG), spectral clustering and Qcut. Compared with the original three methods, iNP achieved a significant improvement in the quality of network partition in a benchmark study with simulated networks, identified more modules with significantly better enrichment of functionally related genes in both yeast protein complex network and breast cancer gene co-expression network, and discovered more cancer-specific modules in the cancer gene co-expression network. As such, iNP should have a broad application as a general method to assist in the analysis of biological networks. PMID:22121225
Co-expression network with protein-protein interaction and transcription regulation in malaria parasite Plasmodium falciparum.

PubMed

Yu, Fu-Dong; Yang, Shao-You; Li, Yuan-Yuan; Hu, Wei

2013-04-10

Malaria continues to be one of the most severe global infectious diseases, as a major threat to human health and economic development. Network-based biological analysis is a promising approach to uncover key genes and biological processes from a network viewpoint, which could not be recognized from individual gene-based signatures. We integrated gene co-expression profile with protein-protein interaction and transcriptional regulation information to construct a comprehensive gene co-expression network of Plasmodium falciparum. Based on this network, we identified 10 core modules by using ICE (Iterative Clique Enumeration) algorithm, which were essential for malaria parasite development in intraerythrocytic developmental cycle (IDC) stages. In each module, all genes were highly correlated probably due to co-regulation or formation of a protein complex. Some of these genes were recognized to be differentially coexpressed among three close-by IDC stages. The gene of prpf8 (PFD0265w) encoding pre-mRNA processing splicing factor 8 product was identified as DCGs (differentially co-expressed genes) among IDC stages, although this gene function was seldom reported in previous researches. Integrating the species-specific gene prediction and differential co-expression gene detection, we found some modules could perform species-specific functions according to some of genes in these modules were species-specific genes, like the module 10. Furthermore, in order to reveal the underlying mechanisms of the erythrocyte invasion by P. falciparum, Steiner Tree algorithm was employed to identify the invasion subnetwork from our gene co-expression network. The subnetwork-based analysis indicated that some important Plasmodium parasite specific genes could corporate with each other and be co-regulated during the parasite invasion process, which including a head-to-head gene pair of PfRH2a (PF13_0198) and PfRH2b (MAL13P1.176). This study based on gene co-expression network could shed new insights on the mechanisms of pathogenesis, even virulence and P. falciparum development. Crown Copyright © 2012. Published by Elsevier B.V. All rights reserved.
General expressions for downlink signal to interference and noise ratio in homogeneous and heterogeneous LTE-Advanced networks.

PubMed

Ali, Nora A; Mourad, Hebat-Allah M; ElSayed, Hany M; El-Soudani, Magdy; Amer, Hassanein H; Daoud, Ramez M

2016-11-01

The interference is the most important problem in LTE or LTE-Advanced networks. In this paper, the interference was investigated in terms of the downlink signal to interference and noise ratio (SINR). In order to compare the different frequency reuse methods that were developed to enhance the SINR, it would be helpful to have a generalized expression to study the performance of the different methods. Therefore, this paper introduces general expressions for the SINR in homogeneous and in heterogeneous networks. In homogeneous networks, the expression was applied for the most common types of frequency reuse techniques: soft frequency reuse (SFR) and fractional frequency reuse (FFR). The expression was examined by comparing it with previously developed ones in the literature and the comparison showed that the expression is valid for any type of frequency reuse scheme and any network topology. Furthermore, the expression was extended to include the heterogeneous network; the expression includes the problem of co-tier and cross-tier interference in heterogeneous networks (HetNet) and it was examined by the same method of the homogeneous one.
Discovery and validation of a glioblastoma co-expressed gene module

PubMed Central

Dunwoodie, Leland J.; Poehlman, William L.; Ficklin, Stephen P.; Feltus, Frank Alexander

2018-01-01

Tumors exhibit complex patterns of aberrant gene expression. Using a knowledge-independent, noise-reducing gene co-expression network construction software called KINC, we created multiple RNAseq-based gene co-expression networks relevant to brain and glioblastoma biology. In this report, we describe the discovery and validation of a glioblastoma-specific gene module that contains 22 co-expressed genes. The genes are upregulated in glioblastoma relative to normal brain and lower grade glioma samples; they are also hypo-methylated in glioblastoma relative to lower grade glioma tumors. Among the proneural, neural, mesenchymal, and classical glioblastoma subtypes, these genes are most-highly expressed in the mesenchymal subtype. Furthermore, high expression of these genes is associated with decreased survival across each glioblastoma subtype. These genes are of interest to glioblastoma biology and our gene interaction discovery and validation workflow can be used to discover and validate co-expressed gene modules derived from any co-expression network. PMID:29541392
Discovery and validation of a glioblastoma co-expressed gene module.

PubMed

Dunwoodie, Leland J; Poehlman, William L; Ficklin, Stephen P; Feltus, Frank Alexander

2018-02-16

Tumors exhibit complex patterns of aberrant gene expression. Using a knowledge-independent, noise-reducing gene co-expression network construction software called KINC, we created multiple RNAseq-based gene co-expression networks relevant to brain and glioblastoma biology. In this report, we describe the discovery and validation of a glioblastoma-specific gene module that contains 22 co-expressed genes. The genes are upregulated in glioblastoma relative to normal brain and lower grade glioma samples; they are also hypo-methylated in glioblastoma relative to lower grade glioma tumors. Among the proneural, neural, mesenchymal, and classical glioblastoma subtypes, these genes are most-highly expressed in the mesenchymal subtype. Furthermore, high expression of these genes is associated with decreased survival across each glioblastoma subtype. These genes are of interest to glioblastoma biology and our gene interaction discovery and validation workflow can be used to discover and validate co-expressed gene modules derived from any co-expression network.
Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data

PubMed Central

Kadarmideen, Haja N; Watson-haigh, Nathan S

2012-01-01

Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four different treatments with Metyrapone, an inhibitor of cortisol biosynthesis. We conducted several microarray quality control checks before applying GCN methods to filtered datasets. Then we compared the outputs of two methods using connectivity as a criterion, as it measures how well a node (gene) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT and node ranks in two methods were compared to identify those nodes which are highly differentially ranked (HDR). A total of 1,017 HDR nodes were identified across one or more of four networks. We investigated HDR nodes by gene enrichment analyses in relation to their biological relevance to phenotypes. We observed that, in contrast to WGCNA method, PCIT algorithm removes many of the edges of the most highly interconnected nodes. Removal of edges of most highly connected nodes or hub genes will have consequences for downstream analyses and biological interpretations. In general, for large GCN construction (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended. PMID:23144540
Comparison of gene co-networks reveals the molecular mechanisms of the rice (Oryza sativa L.) response to Rhizoctonia solani AG1 IA infection.

PubMed

Zhang, Jinfeng; Zhao, Wenjuan; Fu, Rong; Fu, Chenglin; Wang, Lingxia; Liu, Huainian; Li, Shuangcheng; Deng, Qiming; Wang, Shiquan; Zhu, Jun; Liang, Yueyang; Li, Ping; Zheng, Aiping

2018-05-05

Rhizoctonia solani causes rice sheath blight, an important disease affecting the growth of rice (Oryza sativa L.). Attempts to control the disease have met with little success. Based on transcriptional profiling, we previously identified more than 11,947 common differentially expressed genes (TPM > 10) between the rice genotypes TeQing and Lemont. In the current study, we extended these findings by focusing on an analysis of gene co-expression in response to R. solani AG1 IA and identified gene modules within the networks through weighted gene co-expression network analysis (WGCNA). We compared the different genes assigned to each module and the biological interpretations of gene co-expression networks at early and later modules in the two rice genotypes to reveal differential responses to AG1 IA. Our results show that different changes occurred in the two rice genotypes and that the modules in the two groups contain a number of candidate genes possibly involved in pathogenesis, such as the VQ protein. Furthermore, these gene co-expression networks provide comprehensive transcriptional information regarding gene expression in rice in response to AG1 IA. The co-expression networks derived from our data offer ideas for follow-up experimentation that will help advance our understanding of the translational regulation of rice gene expression changes in response to AG1 IA.
An intersection network based on combining SNP co-association and RNA co-expression networks for feed utilization traits in Japanese Black cattle.

PubMed

Okada, D; Endo, S; Matsuda, H; Ogawa, S; Taniguchi, Y; Katsuta, T; Watanabe, T; Iwaisaki, H

2018-05-12

Genome-wide association studies (GWAS) of quantitative traits have detected numerous genetic associations, but they encounter difficulties in pinpointing prominent candidate genes and inferring gene networks. The present study used a systems genetics approach integrating GWAS results with external RNA-expression data to detect candidate gene networks in feed utilization and growth traits of Japanese Black cattle, which are matters of concern. A SNP co-association network was derived from significant correlations between SNPs with effects estimated by GWAS across seven phenotypic traits. The resulting network genes contained significant numbers of annotations related to the traits. Using bovine transcriptome data from a public database, an RNA co-expression network was inferred based on the similarity of expression patterns across different tissues. An intersection network was then generated by superimposing the SNP and RNA networks and extracting shared interactions. This intersection network contained four tissue-specific modules: nervous system, reproductive system, muscular system, and glands. To characterize the structure (topographical properties) of the three networks, their scale-free properties were evaluated, which revealed that the intersection network was the most scale-free. In the sub-network containing the most connected transcription factors (URI1, ROCK2 and ETV6), most genes were widely expressed across tissues, and genes previously shown to be involved in the traits were found. Results indicated that the current approach might be used to construct a gene network that better reflects biological information, providing encouragement for the genetic dissection of economically important quantitative traits.
Integrative Analysis of Many Weighted Co-Expression Networks Using Tensor Computation

PubMed Central

Li, Wenyuan; Liu, Chun-Chi; Zhang, Tong; Li, Haifeng; Waterman, Michael S.; Zhou, Xianghong Jasmine

2011-01-01

The rapid accumulation of biological networks poses new challenges and calls for powerful integrative analysis tools. Most existing methods capable of simultaneously analyzing a large number of networks were primarily designed for unweighted networks, and cannot easily be extended to weighted networks. However, it is known that transforming weighted into unweighted networks by dichotomizing the edges of weighted networks with a threshold generally leads to information loss. We have developed a novel, tensor-based computational framework for mining recurrent heavy subgraphs in a large set of massive weighted networks. Specifically, we formulate the recurrent heavy subgraph identification problem as a heavy 3D subtensor discovery problem with sparse constraints. We describe an effective approach to solving this problem by designing a multi-stage, convex relaxation protocol, and a non-uniform edge sampling technique. We applied our method to 130 co-expression networks, and identified 11,394 recurrent heavy subgraphs, grouped into 2,810 families. We demonstrated that the identified subgraphs represent meaningful biological modules by validating against a large set of compiled biological knowledge bases. We also showed that the likelihood for a heavy subgraph to be meaningful increases significantly with its recurrence in multiple networks, highlighting the importance of the integrative approach to biological network analysis. Moreover, our approach based on weighted graphs detects many patterns that would be overlooked using unweighted graphs. In addition, we identified a large number of modules that occur predominately under specific phenotypes. This analysis resulted in a genome-wide mapping of gene network modules onto the phenome. Finally, by comparing module activities across many datasets, we discovered high-order dynamic cooperativeness in protein complex networks and transcriptional regulatory networks. PMID:21698123
Analysis of genetic association using hierarchical clustering and cluster validation indices.

PubMed

Pagnuco, Inti A; Pastore, Juan I; Abras, Guillermo; Brun, Marcel; Ballarin, Virginia L

2017-10-01

It is usually assumed that co-expressed genes suggest co-regulation in the underlying regulatory network. Determining sets of co-expressed genes is an important task, based on some criteria of similarity. This task is usually performed by clustering algorithms, where the genes are clustered into meaningful groups based on their expression values in a set of experiment. In this work, we propose a method to find sets of co-expressed genes, based on cluster validation indices as a measure of similarity for individual gene groups, and a combination of variants of hierarchical clustering to generate the candidate groups. We evaluated its ability to retrieve significant sets on simulated correlated and real genomics data, where the performance is measured based on its detection ability of co-regulated sets against a full search. Additionally, we analyzed the quality of the best ranked groups using an online bioinformatics tool that provides network information for the selected genes. Copyright © 2017 Elsevier Inc. All rights reserved.
LEAP: constructing gene co-expression networks for single-cell RNA-sequencing data using pseudotime ordering.

PubMed

Specht, Alicia T; Li, Jun

2017-03-01

To construct gene co-expression networks based on single-cell RNA-Sequencing data, we present an algorithm called LEAP, which utilizes the estimated pseudotime of the cells to find gene co-expression that involves time delay. R package LEAP available on CRAN. jun.li@nd.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Genetic architecture of wood properties based on association analysis and co-expression networks in white spruce.

PubMed

Lamara, Mebarek; Raherison, Elie; Lenz, Patrick; Beaulieu, Jean; Bousquet, Jean; MacKay, John

2016-04-01

Association studies are widely utilized to analyze complex traits but their ability to disclose genetic architectures is often limited by statistical constraints, and functional insights are usually minimal in nonmodel organisms like forest trees. We developed an approach to integrate association mapping results with co-expression networks. We tested single nucleotide polymorphisms (SNPs) in 2652 candidate genes for statistical associations with wood density, stiffness, microfibril angle and ring width in a population of 1694 white spruce trees (Picea glauca). Associations mapping identified 229-292 genes per wood trait using a statistical significance level of P < 0.05 to maximize discovery. Over-representation of genes associated for nearly all traits was found in a xylem preferential co-expression group developed in independent experiments. A xylem co-expression network was reconstructed with 180 wood associated genes and several known MYB and NAC regulators were identified as network hubs. The network revealed a link between the gene PgNAC8, wood stiffness and microfibril angle, as well as considerable within-season variation for both genetic control of wood traits and gene expression. Trait associations were distributed throughout the network suggesting complex interactions and pleiotropic effects. Our findings indicate that integration of association mapping and co-expression networks enhances our understanding of complex wood traits. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

PubMed

Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

2015-06-01

To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.
Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex

PubMed Central

Hulsman, Marc; Lelieveldt, Boudewijn P. F.; de Ridder, Jeroen; Reinders, Marcel

2015-01-01

The three dimensional conformation of the genome in the cell nucleus influences important biological processes such as gene expression regulation. Recent studies have shown a strong correlation between chromatin interactions and gene co-expression. However, predicting gene co-expression from frequent long-range chromatin interactions remains challenging. We address this by characterizing the topology of the cortical chromatin interaction network using scale-aware topological measures. We demonstrate that based on these characterizations it is possible to accurately predict spatial co-expression between genes in the mouse cortex. Consistent with previous findings, we find that the chromatin interaction profile of a gene-pair is a good predictor of their spatial co-expression. However, the accuracy of the prediction can be substantially improved when chromatin interactions are described using scale-aware topological measures of the multi-resolution chromatin interaction network. We conclude that, for co-expression prediction, it is necessary to take into account different levels of chromatin interactions ranging from direct interaction between genes (i.e. small-scale) to chromatin compartment interactions (i.e. large-scale). PMID:25965262
VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

PubMed

Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

2013-12-16

Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis) whereby the recovered sub-networks reconfirm established plant gene functions and also identify novel associations. Together, we present valuable insights into grapevine transcriptional regulation by developing network models applicable to researchers in their prioritisation of gene candidates, for on-going study of biological processes related to grapevine development, metabolism and stress responses.

Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.

PubMed

Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J

2016-11-04

Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred-B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks - maize1 and maize2 exhibit similar levels of intra subgenome correlations. Intriguingly, the level of inter subgenome co-expression was similar to the level of intra subgenome correlations, and genes from specific subgenomes were not likely to be the enriched in co-expression network modules and the hub genes were not predominantly from any specific subgenomes in maize. Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types, duplication ages and co-expression consequences.
Exploring of the molecular mechanism of rhinitis via bioinformatics methods

PubMed Central

Song, Yufen; Yan, Zhaohui

2018-01-01

The aim of this study was to analyze gene expression profiles for exploring the function and regulatory network of differentially expressed genes (DEGs) in pathogenesis of rhinitis by a bioinformatics method. The gene expression profile of GSE43523 was downloaded from the Gene Expression Omnibus database. The dataset contained 7 seasonal allergic rhinitis samples and 5 non-allergic normal samples. DEGs between rhinitis samples and normal samples were identified via the limma package of R. The webGestal database was used to identify enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of the DEGs. The differentially co-expressed pairs of the DEGs were identified via the DCGL package in R, and the differential co-expression network was constructed based on these pairs. A protein-protein interaction (PPI) network of the DEGs was constructed based on the Search Tool for the Retrieval of Interacting Genes database. A total of 263 DEGs were identified in rhinitis samples compared with normal samples, including 125 downregulated ones and 138 upregulated ones. The DEGs were enriched in 7 KEGG pathways. 308 differential co-expression gene pairs were obtained. A differential co-expression network was constructed, containing 212 nodes. In total, 148 PPI pairs of the DEGs were identified, and a PPI network was constructed based on these pairs. Bioinformatics methods could help us identify significant genes and pathways related to the pathogenesis of rhinitis. Steroid biosynthesis pathway and metabolic pathways might play important roles in the development of allergic rhinitis (AR). Genes such as CDC42 effector protein 5, solute carrier family 39 member A11 and PR/SET domain 10 might be also associated with the pathogenesis of AR, which provided references for the molecular mechanisms of AR. PMID:29257233
Multiscale Embedded Gene Co-expression Network Analysis

PubMed Central

Song, Won-Min; Zhang, Bin

2015-01-01

Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma. PMID:26618778
Multiscale Embedded Gene Co-expression Network Analysis.

PubMed

Song, Won-Min; Zhang, Bin

2015-11-01

Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
From Saccharomyces cerevisiae to human: The important gene co-expression modules.

PubMed

Liu, Wei; Li, Li; Ye, Hua; Chen, Haiwei; Shen, Weibiao; Zhong, Yuexian; Tian, Tian; He, Huaqin

2017-08-01

Network-based systems biology has become an important method for analyzing high-throughput gene expression data and gene function mining. Yeast has long been a popular model organism for biomedical research. In the current study, a weighted gene co-expression network analysis algorithm was applied to construct a gene co-expression network in Saccharomyces cerevisiae . Seventeen stable gene co-expression modules were detected from 2,814 S. cerevisiae microarray data. Further characterization of these modules with the Database for Annotation, Visualization and Integrated Discovery tool indicated that these modules were associated with certain biological processes, such as heat response, cell cycle, translational regulation, mitochondrion oxidative phosphorylation, amino acid metabolism and autophagy. Hub genes were also screened by intra-modular connectivity. Finally, the module conservation was evaluated in a human disease microarray dataset. Functional modules were identified in budding yeast, some of which are associated with patient survival. The current study provided a paradigm for single cell microorganisms and potentially other organisms.
Analysis of genetic association in Listeria and Diabetes using Hierarchical Clustering and Silhouette Index

NASA Astrophysics Data System (ADS)

Pagnuco, Inti A.; Pastore, Juan I.; Abras, Guillermo; Brun, Marcel; Ballarin, Virginia L.

2016-04-01

It is usually assumed that co-expressed genes suggest co-regulation in the underlying regulatory network. Determining sets of co-expressed genes is an important task, where significative groups of genes are defined based on some criteria. This task is usually performed by clustering algorithms, where the whole family of genes, or a subset of them, are clustered into meaningful groups based on their expression values in a set of experiment. In this work we used a methodology based on the Silhouette index as a measure of cluster quality for individual gene groups, and a combination of several variants of hierarchical clustering to generate the candidate groups, to obtain sets of co-expressed genes for two real data examples. We analyzed the quality of the best ranked groups, obtained by the algorithm, using an online bioinformatics tool that provides network information for the selected genes. Moreover, to verify the performance of the algorithm, considering the fact that it doesn’t find all possible subsets, we compared its results against a full search, to determine the amount of good co-regulated sets not detected.
Estimation of the proteomic cancer co-expression sub networks by using association estimators.

PubMed

Erdoğan, Cihat; Kurt, Zeyneb; Diri, Banu

2017-01-01

In this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA). Correlation and mutual information (MI) based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators' performance, a multi-layer data integration platform on gene-disease associations (DisGeNET) and the Molecular Signatures Database (MSigDB) was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA) package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink) and 64% for Schurmann-Grassberger (SG) association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists.
Estimation of the proteomic cancer co-expression sub networks by using association estimators

PubMed Central

Kurt, Zeyneb; Diri, Banu

2017-01-01

In this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA). Correlation and mutual information (MI) based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators’ performance, a multi-layer data integration platform on gene-disease associations (DisGeNET) and the Molecular Signatures Database (MSigDB) was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA) package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink) and 64% for Schurmann-Grassberger (SG) association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists. PMID:29145449
Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

PubMed

Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

2015-01-01

In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.
Differentially co-expressed interacting protein pairs discriminate samples under distinct stages of HIV type 1 infection.

PubMed

Yoon, Dukyong; Kim, Hyosil; Suh-Kim, Haeyoung; Park, Rae Woong; Lee, KiYoung

2011-01-01

Microarray analyses based on differentially expressed genes (DEGs) have been widely used to distinguish samples across different cellular conditions. However, studies based on DEGs have not been able to clearly determine significant differences between samples of pathophysiologically similar HIV-1 stages, e.g., between acute and chronic progressive (or AIDS) or between uninfected and clinically latent stages. We here suggest a novel approach to allow such discrimination based on stage-specific genetic features of HIV-1 infection. Our approach is based on co-expression changes of genes known to interact. The method can identify a genetic signature for a single sample as contrasted with existing protein-protein-based analyses with correlational designs. Our approach distinguishes each sample using differentially co-expressed interacting protein pairs (DEPs) based on co-expression scores of individual interacting pairs within a sample. The co-expression score has positive value if two genes in a sample are simultaneously up-regulated or down-regulated. And the score has higher absolute value if expression-changing ratios are similar between the two genes. We compared characteristics of DEPs with that of DEGs by evaluating their usefulness in separation of HIV-1 stage. And we identified DEP-based network-modules and their gene-ontology enrichment to find out the HIV-1 stage-specific gene signature. Based on the DEP approach, we observed clear separation among samples from distinct HIV-1 stages using clustering and principal component analyses. Moreover, the discrimination power of DEPs on the samples (70-100% accuracy) was much higher than that of DEGs (35-45%) using several well-known classifiers. DEP-based network analysis also revealed the HIV-1 stage-specific network modules; the main biological processes were related to "translation," "RNA splicing," "mRNA, RNA, and nucleic acid transport," and "DNA metabolism." Through the HIV-1 stage-related modules, changing stage-specific patterns of protein interactions could be observed. DEP-based method discriminated the HIV-1 infection stages clearly, and revealed a HIV-1 stage-specific gene signature. The proposed DEP-based method might complement existing DEG-based approaches in various microarray expression analyses.
Biological mechanism analysis of acute renal allograft rejection: integrated of mRNA and microRNA expression profiles.

PubMed

Huang, Shi-Ming; Zhao, Xia; Zhao, Xue-Mei; Wang, Xiao-Ying; Li, Shan-Shan; Zhu, Yu-Hui

2014-01-01

Renal transplantation is the preferred method for most patients with end-stage renal disease, however, acute renal allograft rejection is still a major risk factor for recipients leading to renal injury. To improve the early diagnosis and treatment of acute rejection, study on the molecular mechanism of it is urgent. MicroRNA (miRNA) expression profile and mRNA expression profile of acute renal allograft rejection and well-functioning allograft downloaded from ArrayExpress database were applied to identify differentially expressed (DE) miRNAs and DE mRNAs. DE miRNAs targets were predicted by combining five algorithm. By overlapping the DE mRNAs and DE miRNAs targets, common genes were obtained. Differentially co-expressed genes (DCGs) were identified by differential co-expression profile (DCp) and differential co-expression enrichment (DCe) methods in Differentially Co-expressed Genes and Links (DCGL) package. Then, co-expression network of DCGs and the cluster analysis were performed. Functional enrichment analysis for DCGs was undergone. A total of 1270 miRNA targets were predicted and 698 DE mRNAs were obtained. While overlapping miRNA targets and DE mRNAs, 59 common genes were gained. We obtained 103 DCGs and 5 transcription factors (TFs) based on regulatory impact factors (RIF), then built the regulation network of miRNA targets and DE mRNAs. By clustering the co-expression network, 5 modules were obtained. Thereinto, module 1 had the highest degree and module 2 showed the most number of DCGs and common genes. TF CEBPB and several common genes, such as RXRA, BASP1 and AKAP10, were mapped on the co-expression network. C1R showed the highest degree in the network. These genes might be associated with human acute renal allograft rejection. We conducted biological analysis on integration of DE mRNA and DE miRNA in acute renal allograft rejection, displayed gene expression patterns and screened out genes and TFs that may be related to acute renal allograft rejection.
Biological mechanism analysis of acute renal allograft rejection: integrated of mRNA and microRNA expression profiles

PubMed Central

Huang, Shi-Ming; Zhao, Xia; Zhao, Xue-Mei; Wang, Xiao-Ying; Li, Shan-Shan; Zhu, Yu-Hui

2014-01-01

Objectives: Renal transplantation is the preferred method for most patients with end-stage renal disease, however, acute renal allograft rejection is still a major risk factor for recipients leading to renal injury. To improve the early diagnosis and treatment of acute rejection, study on the molecular mechanism of it is urgent. Methods: MicroRNA (miRNA) expression profile and mRNA expression profile of acute renal allograft rejection and well-functioning allograft downloaded from ArrayExpress database were applied to identify differentially expressed (DE) miRNAs and DE mRNAs. DE miRNAs targets were predicted by combining five algorithm. By overlapping the DE mRNAs and DE miRNAs targets, common genes were obtained. Differentially co-expressed genes (DCGs) were identified by differential co-expression profile (DCp) and differential co-expression enrichment (DCe) methods in Differentially Co-expressed Genes and Links (DCGL) package. Then, co-expression network of DCGs and the cluster analysis were performed. Functional enrichment analysis for DCGs was undergone. Results: A total of 1270 miRNA targets were predicted and 698 DE mRNAs were obtained. While overlapping miRNA targets and DE mRNAs, 59 common genes were gained. We obtained 103 DCGs and 5 transcription factors (TFs) based on regulatory impact factors (RIF), then built the regulation network of miRNA targets and DE mRNAs. By clustering the co-expression network, 5 modules were obtained. Thereinto, module 1 had the highest degree and module 2 showed the most number of DCGs and common genes. TF CEBPB and several common genes, such as RXRA, BASP1 and AKAP10, were mapped on the co-expression network. C1R showed the highest degree in the network. These genes might be associated with human acute renal allograft rejection. Conclusions: We conducted biological analysis on integration of DE mRNA and DE miRNA in acute renal allograft rejection, displayed gene expression patterns and screened out genes and TFs that may be related to acute renal allograft rejection. PMID:25664019
Discovering mutated driver genes through a robust and sparse co-regularized matrix factorization framework with prior information from mRNA expression patterns and interaction network.

PubMed

Xi, Jianing; Wang, Minghui; Li, Ao

2018-06-05

Discovery of mutated driver genes is one of the primary objective for studying tumorigenesis. To discover some relatively low frequently mutated driver genes from somatic mutation data, many existing methods incorporate interaction network as prior information. However, the prior information of mRNA expression patterns are not exploited by these existing network-based methods, which is also proven to be highly informative of cancer progressions. To incorporate prior information from both interaction network and mRNA expressions, we propose a robust and sparse co-regularized nonnegative matrix factorization to discover driver genes from mutation data. Furthermore, our framework also conducts Frobenius norm regularization to overcome overfitting issue. Sparsity-inducing penalty is employed to obtain sparse scores in gene representations, of which the top scored genes are selected as driver candidates. Evaluation experiments by known benchmarking genes indicate that the performance of our method benefits from the two type of prior information. Our method also outperforms the existing network-based methods, and detect some driver genes that are not predicted by the competing methods. In summary, our proposed method can improve the performance of driver gene discovery by effectively incorporating prior information from interaction network and mRNA expression patterns into a robust and sparse co-regularized matrix factorization framework.
Uncovering robust patterns of microRNA co-expression across cancers using Bayesian Relevance Networks

PubMed Central

2017-01-01

Co-expression networks have long been used as a tool for investigating the molecular circuitry governing biological systems. However, most algorithms for constructing co-expression networks were developed in the microarray era, before high-throughput sequencing—with its unique statistical properties—became the norm for expression measurement. Here we develop Bayesian Relevance Networks, an algorithm that uses Bayesian reasoning about expression levels to account for the differing levels of uncertainty in expression measurements between highly- and lowly-expressed entities, and between samples with different sequencing depths. It combines data from groups of samples (e.g., replicates) to estimate group expression levels and confidence ranges. It then computes uncertainty-moderated estimates of cross-group correlations between entities, and uses permutation testing to assess their statistical significance. Using large scale miRNA data from The Cancer Genome Atlas, we show that our Bayesian update of the classical Relevance Networks algorithm provides improved reproducibility in co-expression estimates and lower false discovery rates in the resulting co-expression networks. Software is available at www.perkinslab.ca. PMID:28817636
Uncovering robust patterns of microRNA co-expression across cancers using Bayesian Relevance Networks.

PubMed

Ramachandran, Parameswaran; Sánchez-Taltavull, Daniel; Perkins, Theodore J

2017-01-01

Co-expression networks have long been used as a tool for investigating the molecular circuitry governing biological systems. However, most algorithms for constructing co-expression networks were developed in the microarray era, before high-throughput sequencing-with its unique statistical properties-became the norm for expression measurement. Here we develop Bayesian Relevance Networks, an algorithm that uses Bayesian reasoning about expression levels to account for the differing levels of uncertainty in expression measurements between highly- and lowly-expressed entities, and between samples with different sequencing depths. It combines data from groups of samples (e.g., replicates) to estimate group expression levels and confidence ranges. It then computes uncertainty-moderated estimates of cross-group correlations between entities, and uses permutation testing to assess their statistical significance. Using large scale miRNA data from The Cancer Genome Atlas, we show that our Bayesian update of the classical Relevance Networks algorithm provides improved reproducibility in co-expression estimates and lower false discovery rates in the resulting co-expression networks. Software is available at www.perkinslab.ca.
From protein-protein interactions to protein co-expression networks: a new perspective to evaluate large-scale proteomic data.

PubMed

Vella, Danila; Zoppis, Italo; Mauri, Giancarlo; Mauri, Pierluigi; Di Silvestre, Dario

2017-12-01

The reductionist approach of dissecting biological systems into their constituents has been successful in the first stage of the molecular biology to elucidate the chemical basis of several biological processes. This knowledge helped biologists to understand the complexity of the biological systems evidencing that most biological functions do not arise from individual molecules; thus, realizing that the emergent properties of the biological systems cannot be explained or be predicted by investigating individual molecules without taking into consideration their relations. Thanks to the improvement of the current -omics technologies and the increasing understanding of the molecular relationships, even more studies are evaluating the biological systems through approaches based on graph theory. Genomic and proteomic data are often combined with protein-protein interaction (PPI) networks whose structure is routinely analyzed by algorithms and tools to characterize hubs/bottlenecks and topological, functional, and disease modules. On the other hand, co-expression networks represent a complementary procedure that give the opportunity to evaluate at system level including organisms that lack information on PPIs. Based on these premises, we introduce the reader to the PPI and to the co-expression networks, including aspects of reconstruction and analysis. In particular, the new idea to evaluate large-scale proteomic data by means of co-expression networks will be discussed presenting some examples of application. Their use to infer biological knowledge will be shown, and a special attention will be devoted to the topological and module analysis.
Identification of differential pathways in papillary thyroid carcinoma utilizing pathway co-expression analysis.

PubMed

Qiu, Wei-Hai; Chen, Gui-Yan; Cui, Lu; Zhang, Ting-Ming; Wei, Feng; Yang, Yong

2016-01-01

To identify differential pathways between papillary thyroid carcinoma (PTC) patients and normal controls utilizing a novel method which combined pathway with co-expression network. The proposed method included three steps. In the first step, we conducted pretreatments for background pathways and gained representative pathways in PTC. Subsequently, a co-expression network for representative pathways was constructed using empirical Bayes (EB) approach to assign a weight value for each pathway. Finally, random model was extracted to set the thresholds of identifying differential pathways. We obtained 1267 representative pathways and their weight values based on the co-expressed pathway network, and then by meeting the criterion (Weight > 0.0296), 87 differential pathways in total across PTC patients and normal controls were identified. The top three ranked differential pathways were CREB phosphorylation, attachment of GPI anchor to urokinase plasminogen activator receptor (uPAR) and loss of function of SMAD2/3 in cancer. In conclusion, we successfully identified differential pathways (such as CREB phosphorylation, attachment of GPI anchor to uPAR and post-translational modification: synthesis of GPI-anchored proteins) for PTC using the proposed pathway co-expression method, and these pathways might be potential biomarkers for target therapy and detection of PTC.
Integration of heterogeneous molecular networks to unravel gene-regulation in Mycobacterium tuberculosis.

PubMed

van Dam, Jesse C J; Schaap, Peter J; Martins dos Santos, Vitor A P; Suárez-Diez, María

2014-09-26

Different methods have been developed to infer regulatory networks from heterogeneous omics datasets and to construct co-expression networks. Each algorithm produces different networks and efforts have been devoted to automatically integrate them into consensus sets. However each separate set has an intrinsic value that is diluted and partly lost when building a consensus network. Here we present a methodology to generate co-expression networks and, instead of a consensus network, we propose an integration framework where the different networks are kept and analysed with additional tools to efficiently combine the information extracted from each network. We developed a workflow to efficiently analyse information generated by different inference and prediction methods. Our methodology relies on providing the user the means to simultaneously visualise and analyse the coexisting networks generated by different algorithms, heterogeneous datasets, and a suite of analysis tools. As a show case, we have analysed the gene co-expression networks of Mycobacterium tuberculosis generated using over 600 expression experiments. Regarding DNA damage repair, we identified SigC as a key control element, 12 new targets for LexA, an updated LexA binding motif, and a potential mismatch repair system. We expanded the DevR regulon with 27 genes while identifying 9 targets wrongly assigned to this regulon. We discovered 10 new genes linked to zinc uptake and a new regulatory mechanism for ZuR. The use of co-expression networks to perform system level analysis allows the development of custom made methodologies. As show cases we implemented a pipeline to integrate ChIP-seq data and another method to uncover multiple regulatory layers. Our workflow is based on representing the multiple types of information as network representations and presenting these networks in a synchronous framework that allows their simultaneous visualization while keeping specific associations from the different networks. By simultaneously exploring these networks and metadata, we gained insights into regulatory mechanisms in M. tuberculosis that could not be obtained through the separate analysis of each data type.
Transcriptome profiling analysis reveals biomarkers in colon cancer samples of various differentiation

PubMed Central

Yu, Tonghu; Zhang, Huaping; Qi, Hong

2018-01-01

The aim of the present study was to investigate more colon cancer-related genes in different stages. Gene expression profile E-GEOD-62932 was extracted for differentially expressed gene (DEG) screening. Series test of cluster analysis was used to obtain significant trending models. Based on the Gene Ontology and Kyoto Encyclopedia of Genes and Genomes databases, functional and pathway enrichment analysis were processed and a pathway relation network was constructed. Gene co-expression network and gene signal network were constructed for common DEGs. The DEGs with the same trend were clustered and in total, 16 clusters with statistical significance were obtained. The screened DEGs were enriched into small molecule metabolic process and metabolic pathways. The pathway relation network was constructed with 57 nodes. A total of 328 common DEGs were obtained. Gene signal network was constructed with 71 nodes. Gene co-expression network was constructed with 161 nodes and 211 edges. ABCD3, CPT2, AGL and JAM2 are potential biomarkers for the diagnosis of colon cancer. PMID:29928385
Maximizing capture of gene co-expression relationships through pre-clustering of input expression samples: an Arabidopsis case study.

PubMed

Feltus, F Alex; Ficklin, Stephen P; Gibson, Scott M; Smith, Melissa C

2013-06-05

In genomics, highly relevant gene interaction (co-expression) networks have been constructed by finding significant pair-wise correlations between genes in expression datasets. These networks are then mined to elucidate biological function at the polygenic level. In some cases networks may be constructed from input samples that measure gene expression under a variety of different conditions, such as for different genotypes, environments, disease states and tissues. When large sets of samples are obtained from public repositories it is often unmanageable to associate samples into condition-specific groups, and combining samples from various conditions has a negative effect on network size. A fixed significance threshold is often applied also limiting the size of the final network. Therefore, we propose pre-clustering of input expression samples to approximate condition-specific grouping of samples and individual network construction of each group as a means for dynamic significance thresholding. The net effect is increase sensitivity thus maximizing the total co-expression relationships in the final co-expression network compendium. A total of 86 Arabidopsis thaliana co-expression networks were constructed after k-means partitioning of 7,105 publicly available ATH1 Affymetrix microarray samples. We term each pre-sorted network a Gene Interaction Layer (GIL). Random Matrix Theory (RMT), an un-supervised thresholding method, was used to threshold each of the 86 networks independently, effectively providing a dynamic (non-global) threshold for the network. The overall gene count across all GILs reached 19,588 genes (94.7% measured gene coverage) and 558,022 unique co-expression relationships. In comparison, network construction without pre-sorting of input samples yielded only 3,297 genes (15.9%) and 129,134 relationships. in the global network. Here we show that pre-clustering of microarray samples helps approximate condition-specific networks and allows for dynamic thresholding using un-supervised methods. Because RMT ensures only highly significant interactions are kept, the GIL compendium consists of 558,022 unique high quality A. thaliana co-expression relationships across almost all of the measurable genes on the ATH1 array. For A. thaliana, these networks represent the largest compendium to date of significant gene co-expression relationships, and are a means to explore complex pathway, polygenic, and pleiotropic relationships for this focal model plant. The networks can be explored at sysbio.genome.clemson.edu. Finally, this method is applicable to any large expression profile collection for any organism and is best suited where a knowledge-independent network construction method is desired.

Maximizing capture of gene co-expression relationships through pre-clustering of input expression samples: an Arabidopsis case study

PubMed Central

2013-01-01

Background In genomics, highly relevant gene interaction (co-expression) networks have been constructed by finding significant pair-wise correlations between genes in expression datasets. These networks are then mined to elucidate biological function at the polygenic level. In some cases networks may be constructed from input samples that measure gene expression under a variety of different conditions, such as for different genotypes, environments, disease states and tissues. When large sets of samples are obtained from public repositories it is often unmanageable to associate samples into condition-specific groups, and combining samples from various conditions has a negative effect on network size. A fixed significance threshold is often applied also limiting the size of the final network. Therefore, we propose pre-clustering of input expression samples to approximate condition-specific grouping of samples and individual network construction of each group as a means for dynamic significance thresholding. The net effect is increase sensitivity thus maximizing the total co-expression relationships in the final co-expression network compendium. Results A total of 86 Arabidopsis thaliana co-expression networks were constructed after k-means partitioning of 7,105 publicly available ATH1 Affymetrix microarray samples. We term each pre-sorted network a Gene Interaction Layer (GIL). Random Matrix Theory (RMT), an un-supervised thresholding method, was used to threshold each of the 86 networks independently, effectively providing a dynamic (non-global) threshold for the network. The overall gene count across all GILs reached 19,588 genes (94.7% measured gene coverage) and 558,022 unique co-expression relationships. In comparison, network construction without pre-sorting of input samples yielded only 3,297 genes (15.9%) and 129,134 relationships. in the global network. Conclusions Here we show that pre-clustering of microarray samples helps approximate condition-specific networks and allows for dynamic thresholding using un-supervised methods. Because RMT ensures only highly significant interactions are kept, the GIL compendium consists of 558,022 unique high quality A. thaliana co-expression relationships across almost all of the measurable genes on the ATH1 array. For A. thaliana, these networks represent the largest compendium to date of significant gene co-expression relationships, and are a means to explore complex pathway, polygenic, and pleiotropic relationships for this focal model plant. The networks can be explored at sysbio.genome.clemson.edu. Finally, this method is applicable to any large expression profile collection for any organism and is best suited where a knowledge-independent network construction method is desired. PMID:23738693
Detection of gene communities in multi-networks reveals cancer drivers

NASA Astrophysics Data System (ADS)

Cantini, Laura; Medico, Enzo; Fortunato, Santo; Caselle, Michele

2015-12-01

We propose a new multi-network-based strategy to integrate different layers of genomic information and use them in a coordinate way to identify driving cancer genes. The multi-networks that we consider combine transcription factor co-targeting, microRNA co-targeting, protein-protein interaction and gene co-expression networks. The rationale behind this choice is that gene co-expression and protein-protein interactions require a tight coregulation of the partners and that such a fine tuned regulation can be obtained only combining both the transcriptional and post-transcriptional layers of regulation. To extract the relevant biological information from the multi-network we studied its partition into communities. To this end we applied a consensus clustering algorithm based on state of art community detection methods. Even if our procedure is valid in principle for any pathology in this work we concentrate on gastric, lung, pancreas and colorectal cancer and identified from the enrichment analysis of the multi-network communities a set of candidate driver cancer genes. Some of them were already known oncogenes while a few are new. The combination of the different layers of information allowed us to extract from the multi-network indications on the regulatory pattern and functional role of both the already known and the new candidate driver genes.
Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules

PubMed Central

Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex

2012-01-01

Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789
Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.

PubMed

Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex

2012-01-01

Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.
Network-based co-expression analysis for exploring the potential diagnostic biomarkers of metastatic melanoma.

PubMed

Wang, Li-Xin; Li, Yang; Chen, Guan-Zhi

2018-01-01

Metastatic melanoma is an aggressive skin cancer and is one of the global malignancies with high mortality and morbidity. It is essential to identify and verify diagnostic biomarkers of early metastatic melanoma. Previous studies have systematically assessed protein biomarkers and mRNA-based expression characteristics. However, molecular markers for the early diagnosis of metastatic melanoma have not been identified. To explore potential regulatory targets, we have analyzed the gene microarray expression profiles of malignant melanoma samples by co-expression analysis based on the network approach. The differentially expressed genes (DEGs) were screened by the EdgeR package of R software. A weighted gene co-expression network analysis (WGCNA) was used for the identification of DEGs in the special gene modules and hub genes. Subsequently, a protein-protein interaction network was constructed to extract hub genes associated with gene modules. Finally, twenty-four important hub genes (RASGRP2, IKZF1, CXCR5, LTB, BLK, LINGO3, CCR6, P2RY10, RHOH, JUP, KRT14, PLA2G3, SPRR1A, KRT78, SFN, CLDN4, IL1RN, PKP3, CBLC, KRT16, TMEM79, KLK8, LYPD3 and LYPD5) were treated as valuable factors involved in the immune response and tumor cell development in tumorigenesis. In addition, a transcriptional regulatory network was constructed for these specific modules or hub genes, and a few core transcriptional regulators were found to be mostly associated with our hub genes, including GATA1, STAT1, SP1, and PSG1. In summary, our findings enhance our understanding of the biological process of malignant melanoma metastasis, enabling us to identify specific genes to use for diagnostic and prognostic markers and possibly for targeted therapy.
Pan- and core- network analysis of co-expression genes in a model plant

DOE PAGES

He, Fei; Maslov, Sergei

2016-12-16

Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less
Pan- and core- network analysis of co-expression genes in a model plant

DOE Office of Scientific and Technical Information (OSTI.GOV)

He, Fei; Maslov, Sergei

Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less
ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

PubMed

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes

PubMed Central

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614
Microarray-based bioinformatics analysis of the combined effects of SiNPs and PbAc on cardiovascular system in zebrafish.

PubMed

Hu, Hejing; Zhang, Yannan; Shi, Yanfeng; Feng, Lin; Duan, Junchao; Sun, Zhiwei

2017-10-01

With rapid development of nanotechnology and growing environmental pollution, the combined toxic effects of SiNPs and pollutants of heavy metals like lead have received global attentions. The aim of this study was to explore the cardiovascular effects of the co-exposure of SiNPs and lead acetate (PbAc) in zebrafish using microarray and bioinformatics analysis. Although there was no other obvious cardiovascular malformation except bleeding phenotype, bradycardia, angiogenesis inhibition and declined cardiac output in zebrafish co-exposed of SiNPs and PbAc at NOAEL level, significant changes were observed in mRNA and microRNA (miRNA) expression patterns. STC-GO analysis indicated that the co-exposure might have more toxic effects on cardiovascular system than that exposure alone. Key differentially expressed genes were discerned out based on the Dynamic-gene-network, including stxbp1a, ndfip2, celf4 and gsk3b. Furthermore, several miRNAs obtained from the miRNA-Gene-Network might play crucial roles in cardiovascular disease, such as dre-miR-93, dre-miR-34a, dre-miR-181c, dre-miR-7145, dre-miR-730, dre-miR-129-5p, dre-miR-19d, dre-miR-218b, dre-miR-221. Besides, the analysis of miRNA-pathway-network indicated that the zebrafish were stimulated by the co-exposure of SiNPs and PbAc, which might cause the disturbance of calcium homeostasis and endoplasmic reticulum stress. As a result, cardiac muscle contraction might be deteriorated. In general, our data provide abundant fundamental research clues to the combined toxicity of environmental pollutants and further in-depth verifications are needed. Copyright © 2017 Elsevier Ltd. All rights reserved.
A Functional and Regulatory Network Associated with PIP Expression in Human Breast Cancer

PubMed Central

Debily, Marie-Anne; Marhomy, Sandrine El; Boulanger, Virginie; Eveno, Eric; Mariage-Samson, Régine; Camarca, Alessandra; Auffray, Charles; Piatier-Tonneau, Dominique; Imbeaud, Sandrine

2009-01-01

Background The PIP (prolactin-inducible protein) gene has been shown to be expressed in breast cancers, with contradictory results concerning its implication. As both the physiological role and the molecular pathways in which PIP is involved are poorly understood, we conducted combined gene expression profiling and network analysis studies on selected breast cancer cell lines presenting distinct PIP expression levels and hormonal receptor status, to explore the functional and regulatory network of PIP co-modulated genes. Principal Findings Microarray analysis allowed identification of genes co-modulated with PIP independently of modulations resulting from hormonal treatment or cell line heterogeneity. Relevant clusters of genes that can discriminate between [PIP+] and [PIP−] cells were identified. Functional and regulatory network analyses based on a knowledge database revealed a master network of PIP co-modulated genes, including many interconnecting oncogenes and tumor suppressor genes, half of which were detected as differentially expressed through high-precision measurements. The network identified appears associated with an inhibition of proliferation coupled with an increase of apoptosis and an enhancement of cell adhesion in breast cancer cell lines, and contains many genes with a STAT5 regulatory motif in their promoters. Conclusions Our global exploratory approach identified biological pathways modulated along with PIP expression, providing further support for its good prognostic value of disease-free survival in breast cancer. Moreover, our data pointed to the importance of a regulatory subnetwork associated with PIP expression in which STAT5 appears as a potential transcriptional regulator. PMID:19262752
Diurnal Transcriptome and Gene Network Represented through Sparse Modeling in Brachypodium distachyon.

PubMed

Koda, Satoru; Onda, Yoshihiko; Matsui, Hidetoshi; Takahagi, Kotaro; Yamaguchi-Uehara, Yukiko; Shimizu, Minami; Inoue, Komaki; Yoshida, Takuhiro; Sakurai, Tetsuya; Honda, Hiroshi; Eguchi, Shinto; Nishii, Ryuei; Mochida, Keiichi

2017-01-01

We report the comprehensive identification of periodic genes and their network inference, based on a gene co-expression analysis and an Auto-Regressive eXogenous (ARX) model with a group smoothly clipped absolute deviation (SCAD) method using a time-series transcriptome dataset in a model grass, Brachypodium distachyon . To reveal the diurnal changes in the transcriptome in B. distachyon , we performed RNA-seq analysis of its leaves sampled through a diurnal cycle of over 48 h at 4 h intervals using three biological replications, and identified 3,621 periodic genes through our wavelet analysis. The expression data are feasible to infer network sparsity based on ARX models. We found that genes involved in biological processes such as transcriptional regulation, protein degradation, and post-transcriptional modification and photosynthesis are significantly enriched in the periodic genes, suggesting that these processes might be regulated by circadian rhythm in B. distachyon . On the basis of the time-series expression patterns of the periodic genes, we constructed a chronological gene co-expression network and identified putative transcription factors encoding genes that might be involved in the time-specific regulatory transcriptional network. Moreover, we inferred a transcriptional network composed of the periodic genes in B. distachyon , aiming to identify genes associated with other genes through variable selection by grouping time points for each gene. Based on the ARX model with the group SCAD regularization using our time-series expression datasets of the periodic genes, we constructed gene networks and found that the networks represent typical scale-free structure. Our findings demonstrate that the diurnal changes in the transcriptome in B. distachyon leaves have a sparse network structure, demonstrating the spatiotemporal gene regulatory network over the cyclic phase transitions in B. distachyon diurnal growth.
A statistical method for measuring activation of gene regulatory networks.

PubMed

Esteves, Gustavo H; Reis, Luiz F L

2018-06-13

Gene expression data analysis is of great importance for modern molecular biology, given our ability to measure the expression profiles of thousands of genes and enabling studies rooted in systems biology. In this work, we propose a simple statistical model for the activation measuring of gene regulatory networks, instead of the traditional gene co-expression networks. We present the mathematical construction of a statistical procedure for testing hypothesis regarding gene regulatory network activation. The real probability distribution for the test statistic is evaluated by a permutation based study. To illustrate the functionality of the proposed methodology, we also present a simple example based on a small hypothetical network and the activation measuring of two KEGG networks, both based on gene expression data collected from gastric and esophageal samples. The two KEGG networks were also analyzed for a public database, available through NCBI-GEO, presented as Supplementary Material. This method was implemented in an R package that is available at the BioConductor project website under the name maigesPack.
MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters.

PubMed

Gonzalez-Dominguez, Jorge; Martin, Maria J

2017-10-10

In this work we present MPIGeneNet, a parallel tool that applies Pearson's correlation and Random Matrix Theory to construct gene co-expression networks. It is based on the state-of-the-art sequential tool RMTGeneNet, which provides networks with high robustness and sensitivity at the expenses of relatively long runtimes for large scale input datasets. MPIGeneNet returns the same results as RMTGeneNet but improves the memory management, reduces the I/O cost, and accelerates the two most computationally demanding steps of co-expression network construction by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on two different systems using three typical input datasets shows that MPIGeneNet is significantly faster than RMTGeneNet. As an example, our tool is up to 175.41 times faster on a cluster with eight nodes, each one containing two 12-core Intel Haswell processors. Source code of MPIGeneNet, as well as a reference manual, are available at https://sourceforge.net/projects/mpigenenet/.
Visual gene-network analysis reveals the cancer gene co-expression in human endometrial cancer

PubMed Central

2014-01-01

Background Endometrial cancers (ECs) are the most common form of gynecologic malignancy. Recent studies have reported that ECs reveal distinct markers for molecular pathogenesis, which in turn is linked to the various histological types of ECs. To understand further the molecular events contributing to ECs and endometrial tumorigenesis in general, a more precise identification of cancer-associated molecules and signaling networks would be useful for the detection and monitoring of malignancy, improving clinical cancer therapy, and personalization of treatments. Results ECs-specific gene co-expression networks were constructed by differential expression analysis and weighted gene co-expression network analysis (WGCNA). Important pathways and putative cancer hub genes contribution to tumorigenesis of ECs were identified. An elastic-net regularized classification model was built using the cancer hub gene signatures to predict the phenotypic characteristics of ECs. The 19 cancer hub gene signatures had high predictive power to distinguish among three key principal features of ECs: grade, type, and stage. Intriguingly, these hub gene networks seem to contribute to ECs progression and malignancy via cell-cycle regulation, antigen processing and the citric acid (TCA) cycle. Conclusions The results of this study provide a powerful biomarker discovery platform to better understand the progression of ECs and to uncover potential therapeutic targets in the treatment of ECs. This information might lead to improved monitoring of ECs and resulting improvement of treatment of ECs, the 4th most common of cancer in women. PMID:24758163
A gene co-expression network model identifies yield-related vicinity networks in Jatropha curcas shoot system.

PubMed

Govender, Nisha; Senan, Siju; Mohamed-Hussein, Zeti-Azura; Wickneswari, Ratnam

2018-06-15

The plant shoot system consists of reproductive organs such as inflorescences, buds and fruits, and the vegetative leaves and stems. In this study, the reproductive part of the Jatropha curcas shoot system, which includes the aerial shoots, shoots bearing the inflorescence and inflorescence were investigated in regard to gene-to-gene interactions underpinning yield-related biological processes. An RNA-seq based sequencing of shoot tissues performed on an Illumina HiSeq. 2500 platform generated 18 transcriptomes. Using the reference genome-based mapping approach, a total of 64 361 genes was identified in all samples and the data was annotated against the non-redundant database by the BLAST2GO Pro. Suite. After removing the outlier genes and samples, a total of 12 734 genes across 17 samples were subjected to gene co-expression network construction using petal, an R library. A gene co-expression network model built with scale-free and small-world properties extracted four vicinity networks (VNs) with putative involvement in yield-related biological processes as follow; heat stress tolerance, floral and shoot meristem differentiation, biosynthesis of chlorophyll molecules and laticifers, cell wall metabolism and epigenetic regulations. Our VNs revealed putative key players that could be adapted in breeding strategies for J. curcas shoot system improvements.
Functional modules by relating protein interaction networks and gene expression.

PubMed

Tornow, Sabine; Mewes, H W

2003-11-01

Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.
Functional modules by relating protein interaction networks and gene expression

PubMed Central

Tornow, Sabine; Mewes, H. W.

2003-01-01

Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships. PMID:14576317
CoNekT: an open-source framework for comparative genomic and transcriptomic network analyses.

PubMed

Proost, Sebastian; Mutwil, Marek

2018-05-01

The recent accumulation of gene expression data in the form of RNA sequencing creates unprecedented opportunities to study gene regulation and function. Furthermore, comparative analysis of the expression data from multiple species can elucidate which functional gene modules are conserved across species, allowing the study of the evolution of these modules. However, performing such comparative analyses on raw data is not feasible for many biologists. Here, we present CoNekT (Co-expression Network Toolkit), an open source web server, that contains user-friendly tools and interactive visualizations for comparative analyses of gene expression data and co-expression networks. These tools allow analysis and cross-species comparison of (i) gene expression profiles; (ii) co-expression networks; (iii) co-expressed clusters involved in specific biological processes; (iv) tissue-specific gene expression; and (v) expression profiles of gene families. To demonstrate these features, we constructed CoNekT-Plants for green alga, seed plants and flowering plants (Picea abies, Chlamydomonas reinhardtii, Vitis vinifera, Arabidopsis thaliana, Oryza sativa, Zea mays and Solanum lycopersicum) and thus provide a web-tool with the broadest available collection of plant phyla. CoNekT-Plants is freely available from http://conekt.plant.tools, while the CoNekT source code and documentation can be found at https://github.molgen.mpg.de/proost/CoNekT/.
Gene co-expression network analysis in Rhodobacter capsulatus and application to comparative expression analysis of Rhodobacter sphaeroides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pena-Castillo, Lourdes; Mercer, Ryan; Gurinovich, Anastasia

2014-08-28

The genus Rhodobacter contains purple nonsulfur bacteria found mostly in freshwater environments. Representative strains of two Rhodobacter species, R. capsulatus and R. sphaeroides, have had their genomes fully sequenced and both have been the subject of transcriptional profiling studies. Gene co-expression networks can be used to identify modules of genes with similar expression profiles. Functional analysis of gene modules can then associate co-expressed genes with biological pathways, and network statistics can determine the degree of module preservation in related networks. In this paper, we constructed an R. capsulatus gene co-expression network, performed functional analysis of identified gene modules, and investigatedmore » preservation of these modules in R. capsulatus proteomics data and in R. sphaeroides transcriptomics data. Results: The analysis identified 40 gene co-expression modules in R. capsulatus. Investigation of the module gene contents and expression profiles revealed patterns that were validated based on previous studies supporting the biological relevance of these modules. We identified two R. capsulatus gene modules preserved in the protein abundance data. We also identified several gene modules preserved between both Rhodobacter species, which indicate that these cellular processes are conserved between the species and are candidates for functional information transfer between species. Many gene modules were non-preserved, providing insight into processes that differentiate the two species. In addition, using Local Network Similarity (LNS), a recently proposed metric for expression divergence, we assessed the expression conservation of between-species pairs of orthologs, and within-species gene-protein expression profiles. Conclusions: Our analyses provide new sources of information for functional annotation in R. capsulatus because uncharacterized genes in modules are now connected with groups of genes that constitute a joint functional annotation. We identified R. capsulatus modules enriched with genes for ribosomal proteins, porphyrin and bacteriochlorophyll anabolism, and biosynthesis of secondary metabolites to be preserved in R. sphaeroides whereas modules related to RcGTA production and signalling showed lack of preservation in R. sphaeroides. In addition, we demonstrated that network statistics may also be applied within-species to identify congruence between mRNA expression and protein abundance data for which simple correlation measurements have previously had mixed results.« less

ARNetMiT R Package: association rules based gene co-expression networks of miRNA targets.

PubMed

Özgür Cingiz, M; Biricik, G; Diri, B

2017-03-31

miRNAs are key regulators that bind to target genes to suppress their gene expression level. The relations between miRNA-target genes enable users to derive co-expressed genes that may be involved in similar biological processes and functions in cells. We hypothesize that target genes of miRNAs are co-expressed, when they are regulated by multiple miRNAs. With the usage of these co-expressed genes, we can theoretically construct co-expression networks (GCNs) related to 152 diseases. In this study, we introduce ARNetMiT that utilize a hash based association rule algorithm in a novel way to infer the GCNs on miRNA-target genes data. We also present R package of ARNetMiT, which infers and visualizes GCNs of diseases that are selected by users. Our approach assumes miRNAs as transactions and target genes as their items. Support and confidence values are used to prune association rules on miRNA-target genes data to construct support based GCNs (sGCNs) along with support and confidence based GCNs (scGCNs). We use overlap analysis and the topological features for the performance analysis of GCNs. We also infer GCNs with popular GNI algorithms for comparison with the GCNs of ARNetMiT. Overlap analysis results show that ARNetMiT outperforms the compared GNI algorithms. We see that using high confidence values in scGCNs increase the ratio of the overlapped gene-gene interactions between the compared methods. According to the evaluation of the topological features of ARNetMiT based GCNs, the degrees of nodes have power-law distribution. The hub genes discovered by ARNetMiT based GCNs are consistent with the literature.
Is My Network Module Preserved and Reproducible?

PubMed Central

Langfelder, Peter; Luo, Rui; Oldham, Michael C.; Horvath, Steve

2011-01-01

In many applications, one is interested in determining which of the properties of a network module change across conditions. For example, to validate the existence of a module, it is desirable to show that it is reproducible (or preserved) in an independent test network. Here we study several types of network preservation statistics that do not require a module assignment in the test network. We distinguish network preservation statistics by the type of the underlying network. Some preservation statistics are defined for a general network (defined by an adjacency matrix) while others are only defined for a correlation network (constructed on the basis of pairwise correlations between numeric variables). Our applications show that the correlation structure facilitates the definition of particularly powerful module preservation statistics. We illustrate that evaluating module preservation is in general different from evaluating cluster preservation. We find that it is advantageous to aggregate multiple preservation statistics into summary preservation statistics. We illustrate the use of these methods in six gene co-expression network applications including 1) preservation of cholesterol biosynthesis pathway in mouse tissues, 2) comparison of human and chimpanzee brain networks, 3) preservation of selected KEGG pathways between human and chimpanzee brain networks, 4) sex differences in human cortical networks, 5) sex differences in mouse liver networks. While we find no evidence for sex specific modules in human cortical networks, we find that several human cortical modules are less preserved in chimpanzees. In particular, apoptosis genes are differentially co-expressed between humans and chimpanzees. Our simulation studies and applications show that module preservation statistics are useful for studying differences between the modular structure of networks. Data, R software and accompanying tutorials can be downloaded from the following webpage: http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/ModulePreservation. PMID:21283776
Effects of threshold on the topology of gene co-expression networks.

PubMed

Couto, Cynthia Martins Villar; Comin, César Henrique; Costa, Luciano da Fontoura

2017-09-26

Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.
Massive-scale gene co-expression network construction and robustness testing using random matrix theory.

PubMed

Gibson, Scott M; Ficklin, Stephen P; Isaacson, Sven; Luo, Feng; Feltus, Frank A; Smith, Melissa C

2013-01-01

The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust.
Module discovery by exhaustive search for densely connected, co-expressed regions in biomolecular interaction networks.

PubMed

Colak, Recep; Moser, Flavia; Chu, Jeffrey Shih-Chieh; Schönhuth, Alexander; Chen, Nansheng; Ester, Martin

2010-10-25

Computational prediction of functionally related groups of genes (functional modules) from large-scale data is an important issue in computational biology. Gene expression experiments and interaction networks are well studied large-scale data sources, available for many not yet exhaustively annotated organisms. It has been well established, when analyzing these two data sources jointly, modules are often reflected by highly interconnected (dense) regions in the interaction networks whose participating genes are co-expressed. However, the tractability of the problem had remained unclear and methods by which to exhaustively search for such constellations had not been presented. We provide an algorithmic framework, referred to as Densely Connected Biclustering (DECOB), by which the aforementioned search problem becomes tractable. To benchmark the predictive power inherent to the approach, we computed all co-expressed, dense regions in physical protein and genetic interaction networks from human and yeast. An automatized filtering procedure reduces our output which results in smaller collections of modules, comparable to state-of-the-art approaches. Our results performed favorably in a fair benchmarking competition which adheres to standard criteria. We demonstrate the usefulness of an exhaustive module search, by using the unreduced output to more quickly perform GO term related function prediction tasks. We point out the advantages of our exhaustive output by predicting functional relationships using two examples. We demonstrate that the computation of all densely connected and co-expressed regions in interaction networks is an approach to module discovery of considerable value. Beyond confirming the well settled hypothesis that such co-expressed, densely connected interaction network regions reflect functional modules, we open up novel computational ways to comprehensively analyze the modular organization of an organism based on prevalent and largely available large-scale datasets. Software and data sets are available at http://www.sfu.ca/~ester/software/DECOB.zip.
Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment

PubMed Central

Uddin, Raihan; Singh, Shiva M.

2017-01-01

As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in “learning and memory” related functions and pathways. Subsequent differential network analysis of this “learning and memory” module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning. PMID:29066959
Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

PubMed

Uddin, Raihan; Singh, Shiva M

2017-01-01

As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning.
The Role of Vitamin D in the Transcriptional Program of Human Pregnancy

PubMed Central

Al-Garawi, Amal; Carey, Vincent J.; Chhabra, Divya; Morrow, Jarrett; Lasky-Su, Jessica; Qiu, Weiliang; Laranjo, Nancy; Litonjua, Augusto A.; Weiss, Scott T.

2016-01-01

Background Patterns of gene expression of human pregnancy are poorly understood. In a trial of vitamin D supplementation in pregnant women, peripheral blood transcriptomes were measured longitudinally on 30 women and used to characterize gene co-expression networks. Objective Studies suggest that increased maternal Vitamin D levels may reduce the risk of asthma in early life, yet the underlying mechanisms have not been examined. In this study, we used a network-based approach to examine changes in gene expression profiles during the course of normal pregnancy and evaluated their association with maternal Vitamin D levels. Design The VDAART study is a randomized clinical trial of vitamin D supplementation in pregnancy for reduction of pediatric asthma risk. The trial enrolled 881 women at 10–18 weeks of gestation. Longitudinal gene expression measures were obtained on thirty pregnant women, using RNA isolated from peripheral blood samples obtained in the first and third trimesters. Differentially expressed genes were identified using significance of analysis of microarrays (SAM), and clustered using a weighted gene co-expression network analysis (WGCNA). Gene-set enrichment was performed to identify major biological pathways. Results Comparison of transcriptional profiles between first and third trimesters of pregnancy identified 5839 significantly differentially expressed genes (FDR<0.05). Weighted gene co-expression network analysis clustered these transcripts into 14 co-expression modules of which two showed significant correlation with maternal vitamin D levels. Pathway analysis of these two modules revealed genes enriched in immune defense pathways and extracellular matrix reorganization as well as genes enriched in notch signaling and transcription factor networks. Conclusion Our data show that gene expression profiles of healthy pregnant women change during the course of pregnancy and suggest that maternal Vitamin D levels influence transcriptional profiles. These alterations of the maternal transcriptome may contribute to fetal immune imprinting and reduce allergic sensitization in early life. Trial Registration clinicaltrials.gov NCT00920621 PMID:27711190
Assessing the utility of gene co-expression stability in combination with correlation in the analysis of protein-protein interaction networks

PubMed Central

2011-01-01

Background Gene co-expression, in the form of a correlation coefficient, has been valuable in the analysis, classification and prediction of protein-protein interactions. However, it is susceptible to bias from a few samples having a large effect on the correlation coefficient. Gene co-expression stability is a means of quantifying this bias, with high stability indicating robust, unbiased co-expression correlation coefficients. We assess the utility of gene co-expression stability as an additional measure to support the co-expression correlation in the analysis of protein-protein interaction networks. Results We studied the patterns of co-expression correlation and stability in interacting proteins with respect to their interaction promiscuity, levels of intrinsic disorder, and essentiality or disease-relatedness. Co-expression stability, along with co-expression correlation, acts as a better classifier of hub proteins in interaction networks, than co-expression correlation alone, enabling the identification of a class of hubs that are functionally distinct from the widely accepted transient (date) and obligate (party) hubs. Proteins with high levels of intrinsic disorder have low co-expression correlation and high stability with their interaction partners suggesting their involvement in transient interactions, except for a small group that have high co-expression correlation and are typically subunits of stable complexes. Similar behavior was seen for disease-related and essential genes. Interacting proteins that are both disordered have higher co-expression stability than ordered protein pairs. Using co-expression correlation and stability, we found that transient interactions are more likely to occur between an ordered and a disordered protein while obligate interactions primarily occur between proteins that are either both ordered, or disordered. Conclusions We observe that co-expression stability shows distinct patterns in structurally and functionally different groups of proteins and interactions. We conclude that it is a useful and important measure to be used in concert with gene co-expression correlation for further insights into the characteristics of proteins in the context of their interaction network. PMID:22369639
The structure of a gene co-expression network reveals biological functions underlying eQTLs.

PubMed

Villa-Vialaneix, Nathalie; Liaubet, Laurence; Laurent, Thibault; Cherel, Pierre; Gamot, Adrien; SanCristobal, Magali

2013-01-01

What are the commonalities between genes, whose expression level is partially controlled by eQTL, especially with regard to biological functions? Moreover, how are these genes related to a phenotype of interest? These issues are particularly difficult to address when the genome annotation is incomplete, as is the case for mammalian species. Moreover, the direct link between gene expression and a phenotype of interest may be weak, and thus difficult to handle. In this framework, the use of a co-expression network has proven useful: it is a robust approach for modeling a complex system of genetic regulations, and to infer knowledge for yet unknown genes. In this article, a case study was conducted with a mammalian species. It showed that the use of a co-expression network based on partial correlation, combined with a relevant clustering of nodes, leads to an enrichment of biological functions of around 83%. Moreover, the use of a spatial statistics approach allowed us to superimpose additional information related to a phenotype; this lead to highlighting specific genes or gene clusters that are related to the network structure and the phenotype. Three main results are worth noting: first, key genes were highlighted as a potential focus for forthcoming biological experiments; second, a set of biological functions, which support a list of genes under partial eQTL control, was set up by an overview of the global structure of the gene expression network; third, pH was found correlated with gene clusters, and then with related biological functions, as a result of a spatial analysis of the network topology.
Annotation of gene function in citrus using gene expression information and co-expression networks

PubMed Central

2014-01-01

Background The genus Citrus encompasses major cultivated plants such as sweet orange, mandarin, lemon and grapefruit, among the world’s most economically important fruit crops. With increasing volumes of transcriptomics data available for these species, Gene Co-expression Network (GCN) analysis is a viable option for predicting gene function at a genome-wide scale. GCN analysis is based on a “guilt-by-association” principle whereby genes encoding proteins involved in similar and/or related biological processes may exhibit similar expression patterns across diverse sets of experimental conditions. While bioinformatics resources such as GCN analysis are widely available for efficient gene function prediction in model plant species including Arabidopsis, soybean and rice, in citrus these tools are not yet developed. Results We have constructed a comprehensive GCN for citrus inferred from 297 publicly available Affymetrix Genechip Citrus Genome microarray datasets, providing gene co-expression relationships at a genome-wide scale (33,000 transcripts). The comprehensive citrus GCN consists of a global GCN (condition-independent) and four condition-dependent GCNs that survey the sweet orange species only, all citrus fruit tissues, all citrus leaf tissues, or stress-exposed plants. All of these GCNs are clustered using genome-wide, gene-centric (guide) and graph clustering algorithms for flexibility of gene function prediction. For each putative cluster, gene ontology (GO) enrichment and gene expression specificity analyses were performed to enhance gene function, expression and regulation pattern prediction. The guide-gene approach was used to infer novel roles of genes involved in disease susceptibility and vitamin C metabolism, and graph-clustering approaches were used to investigate isoprenoid/phenylpropanoid metabolism in citrus peel, and citric acid catabolism via the GABA shunt in citrus fruit. Conclusions Integration of citrus gene co-expression networks, functional enrichment analysis and gene expression information provide opportunities to infer gene function in citrus. We present a publicly accessible tool, Network Inference for Citrus Co-Expression (NICCE, http://citrus.adelaide.edu.au/nicce/home.aspx), for the gene co-expression analysis in citrus. PMID:25023870
NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples.

PubMed

Yu, Hua; Jiao, Bingke; Lu, Lu; Wang, Pengfei; Chen, Shuangcheng; Liang, Chengzhi; Liu, Wei

2018-01-01

Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression network have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Network (GCN) by integrating three frequently used inference algorithms. We constructed a RNA-seq-based GCN in one species of monocot rice. The quality of network obtained by our method was verified and evaluated by the curated gene functional association data sets, which obviously outperformed each single method. In addition, the powerful capability of network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method to predict the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provided a valuable and highly reliable data source to select key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner.
Massive-Scale Gene Co-Expression Network Construction and Robustness Testing Using Random Matrix Theory

PubMed Central

Isaacson, Sven; Luo, Feng; Feltus, Frank A.; Smith, Melissa C.

2013-01-01

The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust. PMID:23409071
Network-based expression analyses and experimental validations revealed high co-expression between Yap1 and stem cell markers compared to differentiated cells.

PubMed

Dehghanian, Fariba; Hojati, Zohreh; Esmaeili, Fariba; Masoudi-Nejad, Ali

2018-05-21

The Hippo signaling pathway is identified as a potential regulatory pathway which plays critical roles in differentiation and stem cell self-renewal. Yap1 is a primary transcriptional effector of this pathway. The importance of Yap1 in embryonic stem cells (ESCs) and differentiation procedure remains a challenging question, since two different observations have been reported. To answer this question we used co-expression network and differential co-expression analyses followed by experimental validations. Our results indicate that Yap1 is highly co-expressed with stem cell markers in ESCs but not in differentiated cells (DCs). The significant Yap1 down-regulation and also translocation of Yap1 into the cytoplasm during P19 differentiation was also detected. Moreover, our results suggest the E2f7, Lin28a and Dppa4 genes as possible regulatory nuclear factors of Hippo pathway in stem cells. The present findings are actively consistent with studies that suggested Yap1 as an essential factor for stem cell self-renewal. Copyright © 2018 Elsevier Inc. All rights reserved.
Analysis of co-occurrence toponyms in web pages based on complex networks

NASA Astrophysics Data System (ADS)

Zhong, Xiang; Liu, Jiajun; Gao, Yong; Wu, Lun

2017-01-01

A large number of geographical toponyms exist in web pages and other documents, providing abundant geographical resources for GIS. It is very common for toponyms to co-occur in the same documents. To investigate these relations associated with geographic entities, a novel complex network model for co-occurrence toponyms is proposed. Then, 12 toponym co-occurrence networks are constructed from the toponym sets extracted from the People's Daily Paper documents of 2010. It is found that two toponyms have a high co-occurrence probability if they are at the same administrative level or if they possess a part-whole relationship. By applying complex network analysis methods to toponym co-occurrence networks, we find the following characteristics. (1) The navigation vertices of the co-occurrence networks can be found by degree centrality analysis. (2) The networks express strong cluster characteristics, and it takes only several steps to reach one vertex from another one, implying that the networks are small-world graphs. (3) The degree distribution satisfies the power law with an exponent of 1.7, so the networks are free-scale. (4) The networks are disassortative and have similar assortative modes, with assortative exponents of approximately 0.18 and assortative indexes less than 0. (5) The frequency of toponym co-occurrence is weakly negatively correlated with geographic distance, but more strongly negatively correlated with administrative hierarchical distance. Considering the toponym frequencies and co-occurrence relationships, a novel method based on link analysis is presented to extract the core toponyms from web pages. This method is suitable and effective for geographical information retrieval.
Enhancing biological relevance of a weighted gene co-expression network for functional module identification.

PubMed

Prom-On, Santitham; Chanthaphan, Atthawut; Chan, Jonathan Hoyin; Meechai, Asawin

2011-02-01

Relationships among gene expression levels may be associated with the mechanisms of the disease. While identifying a direct association such as a difference in expression levels between case and control groups links genes to disease mechanisms, uncovering an indirect association in the form of a network structure may help reveal the underlying functional module associated with the disease under scrutiny. This paper presents a method to improve the biological relevance in functional module identification from the gene expression microarray data by enhancing the structure of a weighted gene co-expression network using minimum spanning tree. The enhanced network, which is called a backbone network, contains only the essential structural information to represent the gene co-expression network. The entire backbone network is decoupled into a number of coherent sub-networks, and then the functional modules are reconstructed from these sub-networks to ensure minimum redundancy. The method was tested with a simulated gene expression dataset and case-control expression datasets of autism spectrum disorder and colorectal cancer studies. The results indicate that the proposed method can accurately identify clusters in the simulated dataset, and the functional modules of the backbone network are more biologically relevant than those obtained from the original approach.
Correlated mRNAs and miRNAs from co-expression and regulatory networks affect porcine muscle and finally meat properties.

PubMed

Ponsuksili, Siriluck; Du, Yang; Hadlich, Frieder; Siengdee, Puntita; Murani, Eduard; Schwerin, Manfred; Wimmers, Klaus

2013-08-05

Physiological processes aiding the conversion of muscle to meat involve many genes associated with muscle structure and metabolic processes. MicroRNAs regulate networks of genes to orchestrate cellular functions, in turn regulating phenotypes. We applied weighted gene co-expression network analysis to identify co-expression modules that correlated to meat quality phenotypes and were highly enriched for genes involved in glucose metabolism, response to wounding, mitochondrial ribosome, mitochondrion, and extracellular matrix. Negative correlation of miRNA with mRNA and target prediction were used to select transcripts out of the modules of trait-associated mRNAs to further identify those genes that are correlated with post mortem traits. Porcine muscle co-expression transcript networks that correlated to post mortem traits were identified. The integration of miRNA and mRNA expression analyses, as well as network analysis, enabled us to interpret the differentially-regulated genes from a systems perspective. Linking co-expression networks of transcripts and hierarchically organized pairs of miRNAs and mRNAs to meat properties yields new insight into several biological pathways underlying phenotype differences. These pathways may also be diagnostic for many myopathies, which are accompanied by deficient nutrient and oxygen supply of muscle fibers.
Global Landscape of a Co-Expressed Gene Network in Barley and its Application to Gene Discovery in Triticeae Crops

PubMed Central

Mochida, Keiichi; Uehara-Yamaguchi, Yukiko; Yoshida, Takuhiro; Sakurai, Tetsuya; Shinozaki, Kazuo

2011-01-01

Accumulated transcriptome data can be used to investigate regulatory networks of genes involved in various biological systems. Co-expression analysis data sets generated from comprehensively collected transcriptome data sets now represent efficient resources that are capable of facilitating the discovery of genes with closely correlated expression patterns. In order to construct a co-expression network for barley, we analyzed 45 publicly available experimental series, which are composed of 1,347 sets of GeneChip data for barley. On the basis of a gene-to-gene weighted correlation coefficient, we constructed a global barley co-expression network and classified it into clusters of subnetwork modules. The resulting clusters are candidates for functional regulatory modules in the barley transcriptome. To annotate each of the modules, we performed comparative annotation using genes in Arabidopsis and Brachypodium distachyon. On the basis of a comparative analysis between barley and two model species, we investigated functional properties from the representative distributions of the gene ontology (GO) terms. Modules putatively involved in drought stress response and cellulose biogenesis have been identified. These modules are discussed to demonstrate the effectiveness of the co-expression analysis. Furthermore, we applied the data set of co-expressed genes coupled with comparative analysis in attempts to discover potentially Triticeae-specific network modules. These results demonstrate that analysis of the co-expression network of the barley transcriptome together with comparative analysis should promote the process of gene discovery in barley. Furthermore, the insights obtained should be transferable to investigations of Triticeae plants. The associated data set generated in this analysis is publicly accessible at http://coexpression.psc.riken.jp/barley/. PMID:21441235
From SNP co-association to RNA co-expression: novel insights into gene networks for intramuscular fatty acid composition in porcine.

PubMed

Ramayo-Caldas, Yuliaxis; Ballester, Maria; Fortes, Marina R S; Esteve-Codina, Anna; Castelló, Anna; Noguera, Jose L; Fernández, Ana I; Pérez-Enciso, Miguel; Reverter, Antonio; Folch, Josep M

2014-03-26

Fatty acids (FA) play a critical role in energy homeostasis and metabolic diseases; in the context of livestock species, their profile also impacts on meat quality for healthy human consumption. Molecular pathways controlling lipid metabolism are highly interconnected and are not fully understood. Elucidating these molecular processes will aid technological development towards improvement of pork meat quality and increased knowledge of FA metabolism, underpinning metabolic diseases in humans. The results from genome-wide association studies (GWAS) across 15 phenotypes were subjected to an Association Weight Matrix (AWM) approach to predict a network of 1,096 genes related to intramuscular FA composition in pigs. To identify the key regulators of FA metabolism, we focused on the minimal set of transcription factors (TF) that the explored the majority of the network topology. Pathway and network analyses pointed towards a trio of TF as key regulators of FA metabolism: NCOA2, FHL2 and EP300. Promoter sequence analyses confirmed that these TF have binding sites for some well-know regulators of lipid and carbohydrate metabolism. For the first time in a non-model species, some of the co-associations observed at the genetic level were validated through co-expression at the transcriptomic level based on real-time PCR of 40 genes in adipose tissue, and a further 55 genes in liver. In particular, liver expression of NCOA2 and EP300 differed between pig breeds (Iberian and Landrace) extreme in terms of fat deposition. Highly clustered co-expression networks in both liver and adipose tissues were observed. EP300 and NCOA2 showed centrality parameters above average in the both networks. Over all genes, co-expression analyses confirmed 28.9% of the AWM predicted gene-gene interactions in liver and 33.0% in adipose tissue. The magnitude of this validation varied across genes, with up to 60.8% of the connections of NCOA2 in adipose tissue being validated via co-expression. Our results recapitulate the known transcriptional regulation of FA metabolism, predict gene interactions that can be experimentally validated, and suggest that genetic variants mapped to EP300, FHL2, and NCOA2 modulate lipid metabolism and control energy homeostasis in pigs.
Pleiotropic and Epistatic Network-Based Discovery: Integrated Networks for Target Gene Discovery

DOE Office of Scientific and Technical Information (OSTI.GOV)

Weighill, Deborah; Jones, Piet; Shah, Manesh

Biological organisms are complex systems that are composed of functional networks of interacting molecules and macro-molecules. Complex phenotypes are the result of orchestrated, hierarchical, heterogeneous collections of expressed genomic variants. However, the effects of these variants are the result of historic selective pressure and current environmental and epigenetic signals, and, as such, their co-occurrence can be seen as genome-wide correlations in a number of different manners. Biomass recalcitrance (i.e., the resistance of plants to degradation or deconstruction, which ultimately enables access to a plant's sugars) is a complex polygenic phenotype of high importance to biofuels initiatives. This study makes usemore » of data derived from the re-sequenced genomes from over 800 different Populus trichocarpa genotypes in combination with metabolomic and pyMBMS data across this population, as well as co-expression and co-methylation networks in order to better understand the molecular interactions involved in recalcitrance, and identify target genes involved in lignin biosynthesis/degradation. A Lines Of Evidence (LOE) scoring system is developed to integrate the information in the different layers and quantify the number of lines of evidence linking genes to target functions. This new scoring system was applied to quantify the lines of evidence linking genes to lignin-related genes and phenotypes across the network layers, and allowed for the generation of new hypotheses surrounding potential new candidate genes involved in lignin biosynthesis in P. trichocarpa, including various AGAMOUS-LIKE genes. Lastly, the resulting Genome Wide Association Study networks, integrated with Single Nucleotide Polymorphism (SNP) correlation, co-methylation, and co-expression networks through the LOE scores are proving to be a powerful approach to determine the pleiotropic and epistatic relationships underlying cellular functions and, as such, the molecular basis for complex phenotypes, such as recalcitrance.« less

Pleiotropic and Epistatic Network-Based Discovery: Integrated Networks for Target Gene Discovery

DOE PAGES

Weighill, Deborah; Jones, Piet; Shah, Manesh; ...

2018-05-11

Biological organisms are complex systems that are composed of functional networks of interacting molecules and macro-molecules. Complex phenotypes are the result of orchestrated, hierarchical, heterogeneous collections of expressed genomic variants. However, the effects of these variants are the result of historic selective pressure and current environmental and epigenetic signals, and, as such, their co-occurrence can be seen as genome-wide correlations in a number of different manners. Biomass recalcitrance (i.e., the resistance of plants to degradation or deconstruction, which ultimately enables access to a plant's sugars) is a complex polygenic phenotype of high importance to biofuels initiatives. This study makes usemore » of data derived from the re-sequenced genomes from over 800 different Populus trichocarpa genotypes in combination with metabolomic and pyMBMS data across this population, as well as co-expression and co-methylation networks in order to better understand the molecular interactions involved in recalcitrance, and identify target genes involved in lignin biosynthesis/degradation. A Lines Of Evidence (LOE) scoring system is developed to integrate the information in the different layers and quantify the number of lines of evidence linking genes to target functions. This new scoring system was applied to quantify the lines of evidence linking genes to lignin-related genes and phenotypes across the network layers, and allowed for the generation of new hypotheses surrounding potential new candidate genes involved in lignin biosynthesis in P. trichocarpa, including various AGAMOUS-LIKE genes. Lastly, the resulting Genome Wide Association Study networks, integrated with Single Nucleotide Polymorphism (SNP) correlation, co-methylation, and co-expression networks through the LOE scores are proving to be a powerful approach to determine the pleiotropic and epistatic relationships underlying cellular functions and, as such, the molecular basis for complex phenotypes, such as recalcitrance.« less
Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

PubMed

Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

2017-08-01

This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
Further Investigation of Receding Horizion-Based Controllers and Neural Network-Based Systems

NASA Technical Reports Server (NTRS)

Kelkar, Atul G.; Haley, Pamela J. (Technical Monitor)

2000-01-01

This report provides a comprehensive summary of the research work performed over the entire duration of the co-operative research agreement between NASA Langley Research Center and Kansas State University. This summary briefly lists the findings and also suggests possible future directions for the continuation of the subject research in the area of Generalized Predictive Control (GPC) and Network Based Generalized Predictive Control (NGPC).
Step-by-Step Construction of Gene Co-expression Networks from High-Throughput Arabidopsis RNA Sequencing Data.

PubMed

Contreras-López, Orlando; Moyano, Tomás C; Soto, Daniela C; Gutiérrez, Rodrigo A

2018-01-01

The rapid increase in the availability of transcriptomics data generated by RNA sequencing represents both a challenge and an opportunity for biologists without bioinformatics training. The challenge is handling, integrating, and interpreting these data sets. The opportunity is to use this information to generate testable hypothesis to understand molecular mechanisms controlling gene expression and biological processes (Fig. 1). A successful strategy to generate tractable hypotheses from transcriptomics data has been to build undirected network graphs based on patterns of gene co-expression. Many examples of new hypothesis derived from network analyses can be found in the literature, spanning different organisms including plants and specific fields such as root developmental biology.In order to make the process of constructing a gene co-expression network more accessible to biologists, here we provide step-by-step instructions using published RNA-seq experimental data obtained from a public database. Similar strategies have been used in previous studies to advance root developmental biology. This guide includes basic instructions for the operation of widely used open source platforms such as Bio-Linux, R, and Cytoscape. Even though the data we used in this example was obtained from Arabidopsis thaliana, the workflow developed in this guide can be easily adapted to work with RNA-seq data from any organism.
Network of proteins, enzymes and genes linked to biomass degradation shared by Trichoderma species.

PubMed

Horta, Maria Augusta Crivelente; Filho, Jaire Alves Ferreira; Murad, Natália Faraj; de Oliveira Santos, Eidy; Dos Santos, Clelton Aparecido; Mendes, Juliano Sales; Brandão, Marcelo Mendes; Azzoni, Sindelia Freitas; de Souza, Anete Pereira

2018-01-22

Understanding relationships between genes responsible for enzymatic hydrolysis of cellulose and synergistic reactions is fundamental for improving biomass biodegradation technologies. To reveal synergistic reactions, the transcriptome, exoproteome, and enzymatic activities of extracts from Trichoderma harzianum, Trichoderma reesei and Trichoderma atroviride under biodegradation conditions were examined. This work revealed co-regulatory networks across carbohydrate-active enzyme (CAZy) genes and secreted proteins in extracts. A set of 80 proteins and respective genes that might correspond to a common system for biodegradation from the studied species were evaluated to elucidate new co-regulated genes. Differences such as one unique base pair between fungal genomes might influence enzyme-substrate binding sites and alter fungal gene expression responses, explaining the enzymatic activities specific to each species observed in the corresponding extracts. These differences are also responsible for the different architectures observed in the co-expression networks.
STARNET 2: a web-based tool for accelerating discovery of gene regulatory networks using microarray co-expression data

PubMed Central

Jupiter, Daniel; Chen, Hailin; VanBuren, Vincent

2009-01-01

Background Although expression microarrays have become a standard tool used by biologists, analysis of data produced by microarray experiments may still present challenges. Comparison of data from different platforms, organisms, and labs may involve complicated data processing, and inferring relationships between genes remains difficult. Results STARNET 2 is a new web-based tool that allows post hoc visual analysis of correlations that are derived from expression microarray data. STARNET 2 facilitates user discovery of putative gene regulatory networks in a variety of species (human, rat, mouse, chicken, zebrafish, Drosophila, C. elegans, S. cerevisiae, Arabidopsis and rice) by graphing networks of genes that are closely co-expressed across a large heterogeneous set of preselected microarray experiments. For each of the represented organisms, raw microarray data were retrieved from NCBI's Gene Expression Omnibus for a selected Affymetrix platform. All pairwise Pearson correlation coefficients were computed for expression profiles measured on each platform, respectively. These precompiled results were stored in a MySQL database, and supplemented by additional data retrieved from NCBI. A web-based tool allows user-specified queries of the database, centered at a gene of interest. The result of a query includes graphs of correlation networks, graphs of known interactions involving genes and gene products that are present in the correlation networks, and initial statistical analyses. Two analyses may be performed in parallel to compare networks, which is facilitated by the new HEATSEEKER module. Conclusion STARNET 2 is a useful tool for developing new hypotheses about regulatory relationships between genes and gene products, and has coverage for 10 species. Interpretation of the correlation networks is supported with a database of previously documented interactions, a test for enrichment of Gene Ontology terms, and heat maps of correlation distances that may be used to compare two networks. The list of genes in a STARNET network may be useful in developing a list of candidate genes to use for the inference of causal networks. The tool is freely available at , and does not require user registration. PMID:19828039
Dynamic Visualization of Co-expression in Systems Genetics Data

DOE Office of Scientific and Technical Information (OSTI.GOV)

New, Joshua Ryan; Huang, Jian; Chesler, Elissa J

2008-01-01

Biologists hope to address grand scientific challenges by exploring the abundance of data made available through modern microarray technology and other high-throughput techniques. The impact of this data, however, is limited unless researchers can effectively assimilate such complex information and integrate it into their daily research; interactive visualization tools are called for to support the effort. Specifically, typical studies of gene co-expression require novel visualization tools that enable the dynamic formulation and fine-tuning of hypotheses to aid the process of evaluating sensitivity of key parameters. These tools should allow biologists to develop an intuitive understanding of the structure of biologicalmore » networks and discover genes which reside in critical positions in networks and pathways. By using a graph as a universal data representation of correlation in gene expression data, our novel visualization tool employs several techniques that when used in an integrated manner provide innovative analytical capabilities. Our tool for interacting with gene co-expression data integrates techniques such as: graph layout, qualitative subgraph extraction through a novel 2D user interface, quantitative subgraph extraction using graph-theoretic algorithms or by querying an optimized b-tree, dynamic level-of-detail graph abstraction, and template-based fuzzy classification using neural networks. We demonstrate our system using a real-world workflow from a large-scale, systems genetics study of mammalian gene co-expression.« less
Transcriptomic changes during maize roots development responsive to Cadmium (Cd) pollution using comparative RNAseq-based approach

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peng, Hua; Sichuan Tourism College, Chengdu, 610000, Sichuan; He, Xiujing

The heavy metal cadmium (Cd), acts as a widespread environmental contaminant, which has shown to adversely affect human health, food safety and ecosystem safety in recent years. However, research on how plant respond to various kinds of heavy metal stress is scarcely reported, especially for understanding of complex molecular regulatory mechanisms and elucidating the gene networks of plant respond to Cd stress. Here, transcriptomic changes during Mo17 and B73 seedlings development responsive to Cd pollution were investigated and comparative RNAseq-based approach in both genotypes were performed. 115 differential expression genes (DEGs) with significant alteration in expression were found co-modulated inmore » both genotypes during the maize seedling development; of those, most of DGEs were found comprised of stress and defense responses proteins, transporters, as well as transcription factors, such as thaumatin-like protein, ZmOPR2 and ZmOPR5. More interestingly, genotype-specific transcriptional factors changes induced by Cd stress were found contributed to the regulatory mechanism of Cd sensitivity in both different genotypes. Moreover, 12 co-expression modules associated with specific biological processes or pathways (M1 to M12) were identified by consensus co-expression network. These results will expand our understanding of complex molecular mechanism of response and defense to Cd exposure in maize seedling roots. - Highlights: • Transcriptomic changes responsive to Cd pollution using comparative RNAseq-based approach. • 115 differential expression genes (DEGs) were found co-modulated in both genotypes. • Most of DGEs belong to stress and defense responses proteins, transporters, transcription factors. • 12 co-expression modules associated with specific biological processes or pathways. • Genotype-specific transcriptional factors changes induced by Cd stress were found.« less
Functional networks inference from rule-based machine learning models.

PubMed

Lazzarini, Nicola; Widera, Paweł; Williamson, Stuart; Heer, Rakesh; Krasnogor, Natalio; Bacardit, Jaume

2016-01-01

Functional networks play an important role in the analysis of biological processes and systems. The inference of these networks from high-throughput (-omics) data is an area of intense research. So far, the similarity-based inference paradigm (e.g. gene co-expression) has been the most popular approach. It assumes a functional relationship between genes which are expressed at similar levels across different samples. An alternative to this paradigm is the inference of relationships from the structure of machine learning models. These models are able to capture complex relationships between variables, that often are different/complementary to the similarity-based methods. We propose a protocol to infer functional networks from machine learning models, called FuNeL. It assumes, that genes used together within a rule-based machine learning model to classify the samples, might also be functionally related at a biological level. The protocol is first tested on synthetic datasets and then evaluated on a test suite of 8 real-world datasets related to human cancer. The networks inferred from the real-world data are compared against gene co-expression networks of equal size, generated with 3 different methods. The comparison is performed from two different points of view. We analyse the enriched biological terms in the set of network nodes and the relationships between known disease-associated genes in a context of the network topology. The comparison confirms both the biological relevance and the complementary character of the knowledge captured by the FuNeL networks in relation to similarity-based methods and demonstrates its potential to identify known disease associations as core elements of the network. Finally, using a prostate cancer dataset as a case study, we confirm that the biological knowledge captured by our method is relevant to the disease and consistent with the specialised literature and with an independent dataset not used in the inference process. The implementation of our network inference protocol is available at: http://ico2s.org/software/funel.html.
System-wide analysis of the transcriptional network of human myelomonocytic leukemia cells predicts attractor structure and phorbol-ester-induced differentiation and dedifferentiation transitions

NASA Astrophysics Data System (ADS)

Sakata, Katsumi; Ohyanagi, Hajime; Sato, Shinji; Nobori, Hiroya; Hayashi, Akiko; Ishii, Hideshi; Daub, Carsten O.; Kawai, Jun; Suzuki, Harukazu; Saito, Toshiyuki

2015-02-01

We present a system-wide transcriptional network structure that controls cell types in the context of expression pattern transitions that correspond to cell type transitions. Co-expression based analyses uncovered a system-wide, ladder-like transcription factor cluster structure composed of nearly 1,600 transcription factors in a human transcriptional network. Computer simulations based on a transcriptional regulatory model deduced from the system-wide, ladder-like transcription factor cluster structure reproduced expression pattern transitions when human THP-1 myelomonocytic leukaemia cells cease proliferation and differentiate under phorbol myristate acetate stimulation. The behaviour of MYC, a reprogramming Yamanaka factor that was suggested to be essential for induced pluripotent stem cells during dedifferentiation, could be interpreted based on the transcriptional regulation predicted by the system-wide, ladder-like transcription factor cluster structure. This study introduces a novel system-wide structure to transcriptional networks that provides new insights into network topology.
Broad Integration of Expression Maps and Co-Expression Networks Compassing Novel Gene Functions in the Brain

PubMed Central

Okamura-Oho, Yuko; Shimokawa, Kazuro; Nishimura, Masaomi; Takemoto, Satoko; Sato, Akira; Furuichi, Teiichi; Yokota, Hideo

2014-01-01

Using a recently invented technique for gene expression mapping in the whole-anatomy context, termed transcriptome tomography, we have generated a dataset of 36,000 maps of overall gene expression in the adult-mouse brain. Here, using an informatics approach, we identified a broad co-expression network that follows an inverse power law and is rich in functional interaction and gene-ontology terms. Our framework for the integrated analysis of expression maps and graphs of co-expression networks revealed that groups of combinatorially expressed genes, which regulate cell differentiation during development, were present in the adult brain and each of these groups was associated with a discrete cell types. These groups included non-coding genes of unknown function. We found that these genes specifically linked developmentally conserved groups in the network. A previously unrecognized robust expression pattern covering the whole brain was related to the molecular anatomy of key biological processes occurring in particular areas. PMID:25382412
Detection of Significant Pneumococcal Meningitis Biomarkers by Ego Network.

PubMed

Wang, Qian; Lou, Zhifeng; Zhai, Liansuo; Zhao, Haibin

2017-06-01

To identify significant biomarkers for detection of pneumococcal meningitis based on ego network. Based on the gene expression data of pneumococcal meningitis and global protein-protein interactions (PPIs) data recruited from open access databases, the authors constructed a differential co-expression network (DCN) to identify pneumococcal meningitis biomarkers in a network view. Here EgoNet algorithm was employed to screen the significant ego networks that could accurately distinguish pneumococcal meningitis from healthy controls, by sequentially seeking ego genes, searching candidate ego networks, refinement of candidate ego networks and significance analysis to identify ego networks. Finally, the functional inference of the ego networks was performed to identify significant pathways for pneumococcal meningitis. By differential co-expression analysis, the authors constructed the DCN that covered 1809 genes and 3689 interactions. From the DCN, a total of 90 ego genes were identified. Starting from these ego genes, three significant ego networks (Module 19, Module 70 and Module 71) that could predict clinical outcomes for pneumococcal meningitis were identified by EgoNet algorithm, and the corresponding ego genes were GMNN, MAD2L1 and TPX2, respectively. Pathway analysis showed that these three ego networks were related to CDT1 association with the CDC6:ORC:origin complex, inactivation of APC/C via direct inhibition of the APC/C complex pathway, and DNA strand elongation, respectively. The authors successfully screened three significant ego modules which could accurately predict the clinical outcomes for pneumococcal meningitis and might play important roles in host response to pathogen infection in pneumococcal meningitis.
Single-nucleotide polymorphism-gene intermixed networking reveals co-linkers connected to multiple gene expression phenotypes

PubMed Central

Gong, Bin-Sheng; Zhang, Qing-Pu; Zhang, Guang-Mei; Zhang, Shao-Jun; Zhang, Wei; Lv, Hong-Chao; Zhang, Fan; Lv, Sa-Li; Li, Chuan-Xing; Rao, Shao-Qi; Li, Xia

2007-01-01

Gene expression profiles and single-nucleotide polymorphism (SNP) profiles are modern data for genetic analysis. It is possible to use the two types of information to analyze the relationships among genes by some genetical genomics approaches. In this study, gene expression profiles were used as expression traits. And relationships among the genes, which were co-linked to a common SNP(s), were identified by integrating the two types of information. Further research on the co-expressions among the co-linked genes was carried out after the gene-SNP relationships were established using the Haseman-Elston sib-pair regression. The results showed that the co-expressions among the co-linked genes were significantly higher if the number of connections between the genes and a SNP(s) was more than six. Then, the genes were interconnected via one or more SNP co-linkers to construct a gene-SNP intermixed network. The genes sharing more SNPs tended to have a stronger correlation. Finally, a gene-gene network was constructed with their intensities of relationships (the number of SNP co-linkers shared) as the weights for the edges. PMID:18466544
Modeling Bi-modality Improves Characterization of Cell Cycle on Gene Expression in Single Cells

PubMed Central

Danaher, Patrick; Finak, Greg; Krouse, Michael; Wang, Alice; Webster, Philippa; Beechem, Joseph; Gottardo, Raphael

2014-01-01

Advances in high-throughput, single cell gene expression are allowing interrogation of cell heterogeneity. However, there is concern that the cell cycle phase of a cell might bias characterizations of gene expression at the single-cell level. We assess the effect of cell cycle phase on gene expression in single cells by measuring 333 genes in 930 cells across three phases and three cell lines. We determine each cell's phase non-invasively without chemical arrest and use it as a covariate in tests of differential expression. We observe bi-modal gene expression, a previously-described phenomenon, wherein the expression of otherwise abundant genes is either strongly positive, or undetectable within individual cells. This bi-modality is likely both biologically and technically driven. Irrespective of its source, we show that it should be modeled to draw accurate inferences from single cell expression experiments. To this end, we propose a semi-continuous modeling framework based on the generalized linear model, and use it to characterize genes with consistent cell cycle effects across three cell lines. Our new computational framework improves the detection of previously characterized cell-cycle genes compared to approaches that do not account for the bi-modality of single-cell data. We use our semi-continuous modelling framework to estimate single cell gene co-expression networks. These networks suggest that in addition to having phase-dependent shifts in expression (when averaged over many cells), some, but not all, canonical cell cycle genes tend to be co-expressed in groups in single cells. We estimate the amount of single cell expression variability attributable to the cell cycle. We find that the cell cycle explains only 5%–17% of expression variability, suggesting that the cell cycle will not tend to be a large nuisance factor in analysis of the single cell transcriptome. PMID:25032992
Preservation affinity in consensus modules among stages of HIV-1 progression.

PubMed

Mosaddek Hossain, Sk Md; Ray, Sumanta; Mukhopadhyay, Anirban

2017-03-20

Analysis of gene expression data provides valuable insights into disease mechanism. Investigating relationship among co-expression modules of different stages is a meaningful tool to understand the way in which a disease progresses. Identifying topological preservation of modular structure also contributes to that understanding. HIV-1 disease provides a well-documented progression pattern through three stages of infection: acute, chronic and non-progressor. In this article, we have developed a novel framework to describe the relationship among the consensus (or shared) co-expression modules for each pair of HIV-1 infection stages. The consensus modules are identified to assess the preservation of network properties. We have investigated the preservation patterns of co-expression networks during HIV-1 disease progression through an eigengene-based approach. We discovered that the expression patterns of consensus modules have a strong preservation during the transitions of three infection stages. In particular, it is noticed that between acute and non-progressor stages the preservation is slightly more than the other pair of stages. Moreover, we have constructed eigengene networks for the identified consensus modules and observed the preservation structure among them. Some consensus modules are marked as preserved in two pairs of stages and are analyzed further to form a higher order meta-network consisting of a group of preserved modules. Additionally, we observed that module membership (MM) values of genes within a module are consistent with the preservation characteristics. The MM values of genes within a pair of preserved modules show strong correlation patterns across two infection stages. We have performed an extensive analysis to discover preservation pattern of co-expression network constructed from microarray gene expression data of three different HIV-1 progression stages. The preservation pattern is investigated through identification of consensus modules in each pair of infection stages. It is observed that the preservation of the expression pattern of consensus modules remains more prominent during the transition of infection from acute stage to non-progressor stage. Additionally, we observed that the module membership values of genes are coherent with preserved modules across the HIV-1 progression stages.
Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model.

PubMed

Kogelman, Lisette J A; Cirera, Susanna; Zhernakova, Daria V; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N

2014-09-30

Obesity is a complex metabolic condition in strong association with various diseases, like type 2 diabetes, resulting in major public health and economic implications. Obesity is the result of environmental and genetic factors and their interactions, including genome-wide genetic interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model for human obesity, offering the possibility to study in-depth organ-level transcriptomic regulations of obesity, unfeasible in humans. Our aim was to reveal adipose tissue co-expression networks, pathways and transcriptional regulations of obesity using RNA Sequencing based systems biology approaches in a porcine model. We selected 36 animals for RNA Sequencing from a previously created F2 pig population representing three extreme groups based on their predicted genetic risks for obesity. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to detect clusters of highly co-expressed genes (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P < 0.001). Functional annotation identified pathways enlightening the association between obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using confident scores, for the WGCNA module which was associated with osteoclast differentiation: CCR1, MSR1 and SI1 (probability scores respectively 95.30, 62.28, and 34.58). Moreover, detection of differentially connected genes identified various genes previously identified to be associated with obesity in humans and rodents, e.g. CSF1R and MARC2. To our knowledge, this is the first study to apply systems biology approaches using porcine adipose tissue RNA-Sequencing data in a genetically characterized porcine model for obesity. We revealed complex networks, pathways, candidate and regulatory genes related to obesity, confirming the complexity of obesity and its association with immune-related disorders and osteoporosis.
A network approach of gene co-expression in the zea mays/Aspergillus flavus pathosystem to map host/pathogen interaction pathways

USDA-ARS?s Scientific Manuscript database

A gene co-expression network was generated using a dual RNA-seq study with the fungal pathogen A. flavus and its plant host Z. mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network reveal...
Identification of PEG-induced water stress responsive transcripts using co-expression network in Eucalyptus grandis.

PubMed

Ghosh Dasgupta, Modhumita; Dharanishanthi, Veeramuthu

2017-09-05

Ecophysiological studies in Eucalyptus have shown that water is the principal factor limiting stem growth. Effect of water deficit conditions on physiological and biochemical parameters has been extensively reported in Eucalyptus. The present study was conducted to identify major polyethylene glycol induced water stress responsive transcripts in Eucalyptus grandis using gene co-expression network. A customized array representing 3359 water stress responsive genes was designed to document their expression in leaves of E. grandis cuttings subjected to -0.225MPa of PEG treatment. The differentially expressed transcripts were documented and significantly co-expressed transcripts were used for construction of network. The co-expression network was constructed with 915 nodes and 3454 edges with degree ranging from 2 to 45. Ninety four GO categories and 117 functional pathways were identified in the network. MCODE analysis generated 27 modules and module 6 with 479 nodes and 1005 edges was identified as the biologically relevant network. The major water responsive transcripts represented in the module included dehydrin, osmotin, LEA protein, expansin, arabinogalactans, heat shock proteins, major facilitator proteins, ARM repeat proteins, raffinose synthase, tonoplast intrinsic protein and transcription factors like DREB2A, ARF9, AGL24, UNE12, WLIM1 and MYB66, MYB70, MYB 55, MYB 16 and MYB 103. The coordinated analysis of gene expression patterns and coexpression networks developed in this study identified an array of transcripts that may regulate PEG induced water stress responses in E. grandis. Copyright © 2017 Elsevier B.V. All rights reserved.
Identifying osteosarcoma metastasis associated genes by weighted gene co-expression network analysis (WGCNA).

PubMed

Tian, Honglai; Guan, Donghui; Li, Jianmin

2018-06-01

Osteosarcoma (OS), the most common malignant bone tumor, accounts for the heavy healthy threat in the period of children and adolescents. OS occurrence usually correlates with early metastasis and high death rate. This study aimed to better understand the mechanism of OS metastasis.Based on Gene Expression Omnibus (GEO) database, we downloaded 4 expression profile data sets associated with OS metastasis, and selected differential expressed genes. Weighted gene co-expression network analysis (WGCNA) approach allowed us to investigate the most OS metastasis-correlated module. Gene Ontology functional and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were used to give annotation of selected OS metastasis-associated genes.We select 897 differential expressed genes from OS metastasis and OS non-metastasis groups. Based on these selected genes, WGCNA further explored 142 genes included in the most OS metastasis-correlated module. Gene Ontology functional and KEGG pathway enrichment analyses showed that significantly OS metastasis-associated genes were involved in pathway correlated with insulin-like growth factor binding.Our research figured out several potential molecules participating in metastasis process and factors acting as biomarker. With this study, we could better explore the mechanism of OS metastasis and further discover more therapy targets.
Abnormally high expression of POLD1, MCM2, and PLK4 promotes relapse of acute lymphoblastic leukemia.

PubMed

Li, Sheng; Wang, Chengzhong; Wang, Weikai; Liu, Weidong; Zhang, Guiqin

2018-05-01

This study aimed to explore the underlying mechanism of relapsed acute lymphoblastic leukemia (ALL).Datasets of GSE28460 and GSE18497 were downloaded from Gene Expression Omnibus (GEO). Differentially expressed genes (DEGs) between diagnostic and relapsed ALL samples were identified using Limma package in R, and a Venn diagram was drawn. Next, functional enrichment analyses of co-regulated DEGs were performed. Based on the String database, protein-protein interaction network and module analyses were also conducted. Moreover, transcription factors and miRNAs targeting co-regulated DEGs were predicted using the WebGestalt online tool.A total of 71 co-regulated DEGs were identified, including 56 co-upregulated genes and 15 co-downregulated genes. Functional enrichment analyses showed that upregulated DEGs were significantly enriched in the cell cycle, and DNA replication, and repair related pathways. POLD1, MCM2, and PLK4 were hub proteins in both protein-protein interaction network and module, and might be potential targets of E2F. Additionally, POLD1 and MCM2 were found to be regulated by miR-520H via E2F1.High expression of POLD1, MCM2, and PLK4 might play positive roles in the recurrence of ALL, and could serve as potential therapeutic targets for the treatment of relapsed ALL.

WGCNA: an R package for weighted correlation network analysis.

PubMed

Langfelder, Peter; Horvath, Steve

2008-12-29

Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e.g. cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial. The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings. The WGCNA package provides R functions for weighted correlation network analysis, e.g. co-expression network analysis of gene expression data. The R package along with its source code and additional material are freely available at http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/Rpackages/WGCNA.
FastGCN: A GPU Accelerated Tool for Fast Gene Co-Expression Networks

PubMed Central

Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun

2015-01-01

Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out. PMID:25602758
FastGCN: a GPU accelerated tool for fast gene co-expression networks.

PubMed

Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun

2015-01-01

Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.
Network Analysis Implicates Alpha-Synuclein (Snca) in the Regulation of Ovariectomy-Induced Bone Loss

PubMed Central

Calabrese, Gina; Mesner, Larry D.; Foley, Patricia L.; Rosen, Clifford J.; Farber, Charles R.

2016-01-01

The postmenopausal period in women is associated with decreased circulating estrogen levels, which accelerate bone loss and increase the risk of fracture. Here, we gained novel insight into the molecular mechanisms mediating bone loss in ovariectomized (OVX) mice, a model of human menopause, using co-expression network analysis. Specifically, we generated a co-expression network consisting of 53 gene modules using expression profiles from intact and OVX mice from a panel of inbred strains. The expression of four modules was altered by OVX, including module 23 whose expression was decreased by OVX across all strains. Module 23 was enriched for genes involved in the response to oxidative stress, a process known to be involved in OVX-induced bone loss. Additionally, module 23 homologs were co-expressed in human bone marrow. Alpha synuclein (Snca) was one of the most highly connected “hub” genes in module 23. We characterized mice deficient in Snca and observed a 40% reduction in OVX-induced bone loss. Furthermore, protection was associated with the altered expression of specific network modules, including module 23. In summary, the results of this study suggest that Snca regulates bone network homeostasis and ovariectomy-induced bone loss. PMID:27378017
Gene networks and the evolution of plant morphology.

PubMed

Das Gupta, Mainak; Tsiantis, Miltos

2018-06-06

Elaboration of morphology depends on the precise orchestration of gene expression by key regulatory genes. The hierarchy and relationship among the participating genes is commonly known as gene regulatory network (GRN). Therefore, the evolution of morphology ultimately occurs by the rewiring of gene network structures or by the co-option of gene networks to novel domains. The availability of high-resolution expression data combined with powerful statistical tools have opened up new avenues to formulate and test hypotheses on how diverse gene networks influence trait development and diversity. Here we summarize recent studies based on both big-data and genetics approaches to understand the evolution of plant form and physiology. We also discuss recent genome-wide investigations on how studying open-chromatin regions may help study the evolution of gene expression patterns. Copyright © 2018. Published by Elsevier Ltd.
Modality-Spanning Deficits in Attention-Deficit/Hyperactivity Disorder in Functional Networks, Gray Matter, and White Matter

PubMed Central

Kessler, Daniel; Angstadt, Michael; Welsh, Robert C.

2014-01-01

Previous neuroimaging investigations in attention-deficit/hyperactivity disorder (ADHD) have separately identified distributed structural and functional deficits, but interconnections between these deficits have not been explored. To unite these modalities in a common model, we used joint independent component analysis, a multivariate, multimodal method that identifies cohesive components that span modalities. Based on recent network models of ADHD, we hypothesized that altered relationships between large-scale networks, in particular, default mode network (DMN) and task-positive networks (TPNs), would co-occur with structural abnormalities in cognitive regulation regions. For 756 human participants in the ADHD-200 sample, we produced gray and white matter volume maps with voxel-based morphometry, as well as whole-brain functional connectomes. Joint independent component analysis was performed, and the resulting transmodal components were tested for differential expression in ADHD versus healthy controls. Four components showed greater expression in ADHD. Consistent with our a priori hypothesis, we observed reduced DMN-TPN segregation co-occurring with structural abnormalities in dorsolateral prefrontal cortex and anterior cingulate cortex, two important cognitive control regions. We also observed altered intranetwork connectivity in DMN, dorsal attention network, and visual network, with co-occurring distributed structural deficits. There was strong evidence of spatial correspondence across modalities: For all four components, the impact of the respective component on gray matter at a region strongly predicted the impact on functional connectivity at that region. Overall, our results demonstrate that ADHD involves multiple, cohesive modality spanning deficits, each one of which exhibits strong spatial overlap in the pattern of structural and functional alterations. PMID:25505309
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery.

PubMed

Feng, Chunlai; Araki, Michihiro; Kunimoto, Ryo; Tamon, Akiko; Makiguchi, Hiroki; Niijima, Satoshi; Tsujimoto, Gozoh; Okuno, Yasushi

2009-09-03

DNA microarray technology provides us with a first step toward the goal of uncovering gene functions on a genomic scale. In recent years, vast amounts of gene expression data have been collected, much of which are available in public databases, such as the Gene Expression Omnibus (GEO). To date, most researchers have been manually retrieving data from databases through web browsers using accession numbers (IDs) or keywords, but gene-expression patterns are not considered when retrieving such data. The Connectivity Map was recently introduced to compare gene expression data by introducing gene-expression signatures (represented by a set of genes with up- or down-regulated labels according to their biological states) and is available as a web tool for detecting similar gene-expression signatures from a limited data set (approximately 7,000 expression profiles representing 1,309 compounds). In order to support researchers to utilize the public gene expression data more effectively, we developed a web tool for finding similar gene expression data and generating its co-expression networks from a publicly available database. GEM-TREND, a web tool for searching gene expression data, allows users to search data from GEO using gene-expression signatures or gene expression ratio data as a query and retrieve gene expression data by comparing gene-expression pattern between the query and GEO gene expression data. The comparison methods are based on the nonparametric, rank-based pattern matching approach of Lamb et al. (Science 2006) with the additional calculation of statistical significance. The web tool was tested using gene expression ratio data randomly extracted from the GEO and with in-house microarray data, respectively. The results validated the ability of GEM-TREND to retrieve gene expression entries biologically related to a query from GEO. For further analysis, a network visualization interface is also provided, whereby genes and gene annotations are dynamically linked to external data repositories. GEM-TREND was developed to retrieve gene expression data by comparing query gene-expression pattern with those of GEO gene expression data. It could be a very useful resource for finding similar gene expression profiles and constructing its gene co-expression networks from a publicly available database. GEM-TREND was designed to be user-friendly and is expected to support knowledge discovery. GEM-TREND is freely available at http://cgs.pharm.kyoto-u.ac.jp/services/network.
WGCNA: an R package for weighted correlation network analysis

PubMed Central

Langfelder, Peter; Horvath, Steve

2008-01-01

Background Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e.g. cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial. Results The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings. Conclusion The WGCNA package provides R functions for weighted correlation network analysis, e.g. co-expression network analysis of gene expression data. The R package along with its source code and additional material are freely available at . PMID:19114008
Weighted gene co-expression network analysis reveals potential genes involved in early metamorphosis process in sea cucumber Apostichopus japonicus.

PubMed

Li, Yongxin; Kikuchi, Mani; Li, Xueyan; Gao, Qionghua; Xiong, Zijun; Ren, Yandong; Zhao, Ruoping; Mao, Bingyu; Kondo, Mariko; Irie, Naoki; Wang, Wen

2018-01-01

Sea cucumbers, one main class of Echinoderms, have a very fast and drastic metamorphosis process during their development. However, the molecular basis under this process remains largely unknown. Here we systematically examined the gene expression profiles of Japanese common sea cucumber (Apostichopus japonicus) for the first time by RNA sequencing across 16 developmental time points from fertilized egg to juvenile stage. Based on the weighted gene co-expression network analysis (WGCNA), we identified 21 modules. Among them, MEdarkmagenta was highly expressed and correlated with the early metamorphosis process from late auricularia to doliolaria larva. Furthermore, gene enrichment and differentially expressed gene analysis identified several genes in the module that may play key roles in the metamorphosis process. Our results not only provide a molecular basis for experimentally studying the development and morphological complexity of sea cucumber, but also lay a foundation for improving its emergence rate. Copyright © 2017 Elsevier Inc. All rights reserved.
CrosstalkNet: A Visualization Tool for Differential Co-expression Networks and Communities.

PubMed

Manem, Venkata; Adam, George Alexandru; Gruosso, Tina; Gigoux, Mathieu; Bertos, Nicholas; Park, Morag; Haibe-Kains, Benjamin

2018-04-15

Variations in physiological conditions can rewire molecular interactions between biological compartments, which can yield novel insights into gain or loss of interactions specific to perturbations of interest. Networks are a promising tool to elucidate intercellular interactions, yet exploration of these large-scale networks remains a challenge due to their high dimensionality. To retrieve and mine interactions, we developed CrosstalkNet, a user friendly, web-based network visualization tool that provides a statistical framework to infer condition-specific interactions coupled with a community detection algorithm for bipartite graphs to identify significantly dense subnetworks. As a case study, we used CrosstalkNet to mine a set of 54 and 22 gene-expression profiles from breast tumor and normal samples, respectively, with epithelial and stromal compartments extracted via laser microdissection. We show how CrosstalkNet can be used to explore large-scale co-expression networks and to obtain insights into the biological processes that govern cross-talk between different tumor compartments. Significance: This web application enables researchers to mine complex networks and to decipher novel biological processes in tumor epithelial-stroma cross-talk as well as in other studies of intercompartmental interactions. Cancer Res; 78(8); 2140-3. ©2018 AACR . ©2018 American Association for Cancer Research.
An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

PubMed

Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

2017-04-12

Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs) (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.
Network Compression as a Quality Measure for Protein Interaction Networks

PubMed Central

Royer, Loic; Reimann, Matthias; Stewart, A. Francis; Schroeder, Michael

2012-01-01

With the advent of large-scale protein interaction studies, there is much debate about data quality. Can different noise levels in the measurements be assessed by analyzing network structure? Because proteomic regulation is inherently co-operative, modular and redundant, it is inherently compressible when represented as a network. Here we propose that network compression can be used to compare false positive and false negative noise levels in protein interaction networks. We validate this hypothesis by first confirming the detrimental effect of false positives and false negatives. Second, we show that gold standard networks are more compressible. Third, we show that compressibility correlates with co-expression, co-localization, and shared function. Fourth, we also observe correlation with better protein tagging methods, physiological expression in contrast to over-expression of tagged proteins, and smart pooling approaches for yeast two-hybrid screens. Overall, this new measure is a proxy for both sensitivity and specificity and gives complementary information to standard measures such as average degree and clustering coefficients. PMID:22719828
Identification of dysregulated long non-coding RNAs/microRNAs/mRNAs in TNM I stage lung adenocarcinoma

PubMed Central

Tian, Ziqiang; Wen, Shiwang; Zhang, Yuefeng; Shi, Xinqiang; Zhu, Yonggang; Xu, Yanzhao; Lv, Huilai; Wang, Guiying

2017-01-01

Lung adenocarcinoma (LUAD) is the primary subtype in lung cancer, which is the leading cause of cancer-related death worldwide. This study aimed to investigate the aberrant expression profiling of long non-coding RNA (lncRNA) in TNM I stage (stage I) LUAD. The lncRNA/mRNA/miRNA expression profiling of stage I LUAD and adjacent non-tumor tissues from 4 patients were measured by RNA-sequencing. Total of 175 differentially expressed lncRNAs (DELs), 1321 differentially expressed mRNAs (DEMs) and 94 differentially expressed microRNAs (DEMIs) were identified in stage I LUAD. DEMI-DEM regulatory network consisted of 544 nodes and 1123 edge; miR-200 family members had high connectivity with DEMs. In DEL-DEM co-expression network, CDKN2B-AS1, FENDRR and LINC00312 had the high connectivity with DEMs, which co-expressed with 105, 63 and 61 DEMs, respectively. DEL-DEMI-DEM network depicted the links among DELs, DEMI and DEMs. Identified DEMs were significantly enriched in cell adhesion molecules, focal adhesion and tight junction of Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways; and enriched in cell adhesion, angiogenesis and regulation of cell proliferation of Gene Ontology biological processes. Quantitative real-time polymerase chain reaction results were generally consistent with our bioinformatics analyses. LINC00312 and FENDRR had diagnostic value for LUAD patients in The Cancer Genome Atlas database. Our study might lay the foundation for illumination of pathogenesis of LUAD and identification of potential therapeutic targets and novel diagnosis biomarkers for LUAD patients. PMID:28881680
Orthoscape: a cytoscape application for grouping and visualization KEGG based gene networks by taxonomy and homology principles.

PubMed

Mustafin, Zakhar Sergeevich; Lashin, Sergey Alexandrovich; Matushkin, Yury Georgievich; Gunbin, Konstantin Vladimirovich; Afonnikov, Dmitry Arkadievich

2017-01-27

There are many available software tools for visualization and analysis of biological networks. Among them, Cytoscape ( http://cytoscape.org/ ) is one of the most comprehensive packages, with many plugins and applications which extends its functionality by providing analysis of protein-protein interaction, gene regulatory and gene co-expression networks, metabolic, signaling, neural as well as ecological-type networks including food webs, communities networks etc. Nevertheless, only three plugins tagged 'network evolution' found in Cytoscape official app store and in literature. We have developed a new Cytoscape 3.0 application Orthoscape aimed to facilitate evolutionary analysis of gene networks and visualize the results. Orthoscape aids in analysis of evolutionary information available for gene sets and networks by highlighting: (1) the orthology relationships between genes; (2) the evolutionary origin of gene network components; (3) the evolutionary pressure mode (diversifying or stabilizing, negative or positive selection) of orthologous groups in general and/or branch-oriented mode. The distinctive feature of Orthoscape is the ability to control all data analysis steps via user-friendly interface. Orthoscape allows its users to analyze gene networks or separated gene sets in the context of evolution. At each step of data analysis, Orthoscape also provides for convenient visualization and data manipulation.
TRACING CO-REGULATORY NETWORK DYNAMICS IN NOISY, SINGLE-CELL TRANSCRIPTOME TRAJECTORIES.

PubMed

Cordero, Pablo; Stuart, Joshua M

2017-01-01

The availability of gene expression data at the single cell level makes it possible to probe the molecular underpinnings of complex biological processes such as differentiation and oncogenesis. Promising new methods have emerged for reconstructing a progression 'trajectory' from static single-cell transcriptome measurements. However, it remains unclear how to adequately model the appreciable level of noise in these data to elucidate gene regulatory network rewiring. Here, we present a framework called Single Cell Inference of MorphIng Trajectories and their Associated Regulation (SCIMITAR) that infers progressions from static single-cell transcriptomes by employing a continuous parametrization of Gaussian mixtures in high-dimensional curves. SCIMITAR yields rich models from the data that highlight genes with expression and co-expression patterns that are associated with the inferred progression. Further, SCIMITAR extracts regulatory states from the implicated trajectory-evolvingco-expression networks. We benchmark the method on simulated data to show that it yields accurate cell ordering and gene network inferences. Applied to the interpretation of a single-cell human fetal neuron dataset, SCIMITAR finds progression-associated genes in cornerstone neural differentiation pathways missed by standard differential expression tests. Finally, by leveraging the rewiring of gene-gene co-expression relations across the progression, the method reveals the rise and fall of co-regulatory states and trajectory-dependent gene modules. These analyses implicate new transcription factors in neural differentiation including putative co-factors for the multi-functional NFAT pathway.
Preliminary Evaluation, Texas State Library Communication Network, 1968.

ERIC Educational Resources Information Center

Texas State Library, Austin. Field Services Div.

In 1968 the Texas State Library established a library communications network under Title III of the Librar y Services and Construction Act. The objective of this study was to evaluate the network after six months of operation. Part I of the study consists of a general evaluation by Peat, Marwick, Mitchell and Co., based on operational data…
Gene co-expression analysis identifies gene clusters associated with isotropic and polarized growth in Aspergillus fumigatus conidia.

PubMed

Baltussen, Tim J H; Coolen, Jordy P M; Zoll, Jan; Verweij, Paul E; Melchers, Willem J G

2018-04-26

Aspergillus fumigatus is a saprophytic fungus that extensively produces conidia. These microscopic asexually reproductive structures are small enough to reach the lungs. Germination of conidia followed by hyphal growth inside human lungs is a key step in the establishment of infection in immunocompromised patients. RNA-Seq was used to analyze the transcriptome of dormant and germinating A. fumigatus conidia. Construction of a gene co-expression network revealed four gene clusters (modules) correlated with a growth phase (dormant, isotropic growth, polarized growth). Transcripts levels of genes encoding for secondary metabolites were high in dormant conidia. During isotropic growth, transcript levels of genes involved in cell wall modifications increased. Two modules encoding for growth and cell cycle/DNA processing were associated with polarized growth. In addition, the co-expression network was used to identify highly connected intermodular hub genes. These genes may have a pivotal role in the respective module and could therefore be compelling therapeutic targets. Generally, cell wall remodeling is an important process during isotropic and polarized growth, characterized by an increase of transcripts coding for hyphal growth and cell cycle/DNA processing when polarized growth is initiated. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Network-based integration of GWAS and gene expression identifies a HOX-centric network associated with serous ovarian cancer risk

PubMed Central

Kar, Siddhartha P.; Tyrer, Jonathan P.; Li, Qiyuan; Lawrenson, Kate; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Chenevix-Trench, Georgia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie T.; Beckmann, Matthias W.; Berchuck, Andrew; Bisogna, Maria; Bjørge, Line; Bogdanova, Natalia; Brinton, Louise; Brooks-Wilson, Angela; Butzow, Ralf; Campbell, Ian; Carty, Karen; Chang-Claude, Jenny; Chen, Yian Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas F.; Edwards, Robert P.; Ekici, Arif B.; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus K.; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Paul, James; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kjaer, Susanne K.; Kelemen, Linda E.; Kellar, Melissa; Kelley, Joseph; Kiemeney, Lambertus A.; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain A.; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Phelan, Catherine M.; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston-Campbell, Lara E.; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Monteiro, Alvaro N. A.; Freedman, Matthew L.; Gayther, Simon A.; Pharoah, Paul D. P.

2015-01-01

Background Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by co-expression may also be enriched for additional EOC risk associations. Methods We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly co-expressed with each selected TF gene in the unified microarray data set of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this data set were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). Results Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P<0.05 and FDR<0.05). These results were replicated (P<0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network. Conclusion We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development. Impact Network analysis integrating large, context-specific data sets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization. PMID:26209509
A CoAP-Based Network Access Authentication Service for Low-Power Wide Area Networks: LO-CoAP-EAP.

PubMed

Garcia-Carrillo, Dan; Marin-Lopez, Rafael; Kandasamy, Arunprabhu; Pelov, Alexander

2017-11-17

The Internet-of-Things (IoT) landscape is expanding with new radio technologies. In addition to the Low-Rate Wireless Personal Area Network (LR-WPAN), the recent set of technologies conforming the so-called Low-Power Wide Area Networks (LP-WAN) offers long-range communications, allowing one to send small pieces of information at a reduced energy cost, which promotes the creation of new IoT applications and services. However, LP-WAN technologies pose new challenges since they have strong limitations in the available bandwidth. In general, a first step prior to a smart object being able to gain access to the network is the process of network access authentication. It involves authentication, authorization and key management operations. This process is of vital importance for operators to control network resources. However, proposals for managing network access authentication in LP-WAN are tailored to the specifics of each technology, which could introduce interoperability problems in the future. In this sense, little effort has been put so far into providing a wireless-independent solution for network access authentication in the area of LP-WAN. To fill this gap, we propose a service named Low-Overhead CoAP-EAP (LO-CoAP-EAP), which is based on previous work designed for LR-WPAN. LO-CoAP-EAP integrates the use of Authentication, Authorization and Accounting (AAA) infrastructures and the Extensible Authentication Protocol (EAP) protocol. For this integration, we use the Constrained Application Protocol (CoAP) to design a network authentication service independent of the type of LP-WAN technology. LO-CoAP-EAP represents a trade-off between flexibility, wireless technology independence, scalability and performance in LP-WAN.
A CoAP-Based Network Access Authentication Service for Low-Power Wide Area Networks: LO-CoAP-EAP

PubMed Central

Garcia-Carrillo, Dan; Marin-Lopez, Rafael; Kandasamy, Arunprabhu; Pelov, Alexander

2017-01-01

The Internet-of-Things (IoT) landscape is expanding with new radio technologies. In addition to the Low-Rate Wireless Personal Area Network (LR-WPAN), the recent set of technologies conforming the so-called Low-Power Wide Area Networks (LP-WAN) offers long-range communications, allowing one to send small pieces of information at a reduced energy cost, which promotes the creation of new IoT applications and services. However, LP-WAN technologies pose new challenges since they have strong limitations in the available bandwidth. In general, a first step prior to a smart object being able to gain access to the network is the process of network access authentication. It involves authentication, authorization and key management operations. This process is of vital importance for operators to control network resources. However, proposals for managing network access authentication in LP-WAN are tailored to the specifics of each technology, which could introduce interoperability problems in the future. In this sense, little effort has been put so far into providing a wireless-independent solution for network access authentication in the area of LP-WAN. To fill this gap, we propose a service named Low-Overhead CoAP-EAP (LO-CoAP-EAP), which is based on previous work designed for LR-WPAN. LO-CoAP-EAP integrates the use of Authentication, Authorization and Accounting (AAA) infrastructures and the Extensible Authentication Protocol (EAP) protocol. For this integration, we use the Constrained Application Protocol (CoAP) to design a network authentication service independent of the type of LP-WAN technology. LO-CoAP-EAP represents a trade-off between flexibility, wireless technology independence, scalability and performance in LP-WAN. PMID:29149040

Scale-space measures for graph topology link protein network architecture to function.

PubMed

Hulsman, Marc; Dimitrakopoulos, Christos; de Ridder, Jeroen

2014-06-15

The network architecture of physical protein interactions is an important determinant for the molecular functions that are carried out within each cell. To study this relation, the network architecture can be characterized by graph topological characteristics such as shortest paths and network hubs. These characteristics have an important shortcoming: they do not take into account that interactions occur across different scales. This is important because some cellular functions may involve a single direct protein interaction (small scale), whereas others require more and/or indirect interactions, such as protein complexes (medium scale) and interactions between large modules of proteins (large scale). In this work, we derive generalized scale-aware versions of known graph topological measures based on diffusion kernels. We apply these to characterize the topology of networks across all scales simultaneously, generating a so-called graph topological scale-space. The comprehensive physical interaction network in yeast is used to show that scale-space based measures consistently give superior performance when distinguishing protein functional categories and three major types of functional interactions-genetic interaction, co-expression and perturbation interactions. Moreover, we demonstrate that graph topological scale spaces capture biologically meaningful features that provide new insights into the link between function and protein network architecture. Matlab(TM) code to calculate the scale-aware topological measures (STMs) is available at http://bioinformatics.tudelft.nl/TSSA © The Author 2014. Published by Oxford University Press.
Co-expression Network Approach to Studying the Effects of Botulinum Neurotoxin-A.

PubMed

Mukund, Kavitha; Ward, Samuel R; Lieber, Richard L; Subramaniam, Shankar

2017-10-16

Botulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules, further hierarchically clustered into 5 groups. Quantifying average expression and co-expression patterns across these groups revealed temporal aspects of muscle's response to BoNT-A. Functional analysis revealed enrichment of group 1 with metabolism; group 5 with contradictory functions of atrophy and cellular recovery; and groups 2 and 3 with extracellular matrix (ECM) and non-fast fiber isoforms. Topological positioning of two highly ranked, significantly expressed genes- Dclk1 and Ostalpha within group 5 suggested possible mechanistic roles in recovery from BoNT-A induced atrophy. Phenotypic correlations of groups with titin and myosin protein content further emphasized the effect of BoNT-A on the sarcomeric contraction machinery in early phase of chemodenervation. In summary, our approach revealed a hierarchical functional response to BoNT-A induced paralysis with early metabolic and later ECM responses and identified putative biomarkers associated with chemodenervation. Additionally, our results provide an unbiased validation of the response documented in our previous workBotulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules, further hierarchically clustered into 5 groups. Quantifying average expression and co-expression patterns across these groups revealed temporal aspects of muscle's response to BoNT-A. Functional analysis revealed enrichment of group 1 with metabolism; group 5 with contradictory functions of atrophy and cellular recovery; and groups 2 and 3 with extracellular matrix (ECM) and non-fast fiber isoforms. Topological positioning of two highly ranked, significantly expressed genes- Dclk1 and Ostalpha within group 5 suggested possible mechanistic roles in recovery from BoNT-A induced atrophy. Phenotypic correlations of groups with titin and myosin protein content further emphasized the effect of BoNT-A on the sarcomeric contraction machinery in early phase of chemodenervation. In summary, our approach revealed a hierarchical functional response to BoNT-A induced paralysis with early metabolic and later ECM responses and identified putative biomarkers associated with chemodenervation. Additionally, our results provide an unbiased validation of the response documented in our previous work.
Differential Gene Expression (DEX) and Alternative Splicing Events (ASE) for Temporal Dynamic Processes Using HMMs and Hierarchical Bayesian Modeling Approaches.

PubMed

Oh, Sunghee; Song, Seongho

2017-01-01

In gene expression profile, data analysis pipeline is categorized into four levels, major downstream tasks, i.e., (1) identification of differential expression; (2) clustering co-expression patterns; (3) classification of subtypes of samples; and (4) detection of genetic regulatory networks, are performed posterior to preprocessing procedure such as normalization techniques. To be more specific, temporal dynamic gene expression data has its inherent feature, namely, two neighboring time points (previous and current state) are highly correlated with each other, compared to static expression data which samples are assumed as independent individuals. In this chapter, we demonstrate how HMMs and hierarchical Bayesian modeling methods capture the horizontal time dependency structures in time series expression profiles by focusing on the identification of differential expression. In addition, those differential expression genes and transcript variant isoforms over time detected in core prerequisite steps can be generally further applied in detection of genetic regulatory networks to comprehensively uncover dynamic repertoires in the aspects of system biology as the coupled framework.
Comparison of directed and weighted co-occurrence networks of six languages

NASA Astrophysics Data System (ADS)

Gao, Yuyang; Liang, Wei; Shi, Yuming; Huang, Qiuling

2014-01-01

To study commonalities and differences among different languages, we select 100 reports from the documents of the United Nations, each of which was written in Arabic, Chinese, English, French, Russian and Spanish languages, separately. Based on these corpora, we construct 6 weighted and directed word co-occurrence networks. Besides all the networks exhibit scale-free and small-world features, we find several new non-trivial results, including connections among English words are denser, and the expression of English language is more flexible and powerful; the connection way among Spanish words is more stringent and this indicates that the Spanish grammar is more rigorous; values of many statistical parameters of the French and Spanish networks are very approximate and this shows that these two languages share many commonalities; Arabic and Russian words have many varieties, which result in rich types of words and a sparse connection among words; connections among Chinese words obey a more uniform distribution, and one inclines to use the least number of Chinese words to express the same complex information as those in other five languages. This shows that the expression of Chinese language is quite concise. In addition, several topics worth further investigating by the complex network approach have been observed in this study.
CoGAPS matrix factorization algorithm identifies transcriptional changes in AP-2alpha target genes in feedback from therapeutic inhibition of the EGFR network

PubMed Central

Thakar, Manjusha; Howard, Jason D.; Kagohara, Luciane T.; Krigsfeld, Gabriel; Ranaweera, Ruchira S.; Hughes, Robert M.; Perez, Jimena; Jones, Siân; Favorov, Alexander V.; Carey, Jacob; Stein-O'Brien, Genevieve; Gaykalova, Daria A.; Ochs, Michael F.; Chung, Christine H.

2016-01-01

Patients with oncogene driven tumors are treated with targeted therapeutics including EGFR inhibitors. Genomic data from The Cancer Genome Atlas (TCGA) demonstrates molecular alterations to EGFR, MAPK, and PI3K pathways in previously untreated tumors. Therefore, this study uses bioinformatics algorithms to delineate interactions resulting from EGFR inhibitor use in cancer cells with these genetic alterations. We modify the HaCaT keratinocyte cell line model to simulate cancer cells with constitutive activation of EGFR, HRAS, and PI3K in a controlled genetic background. We then measure gene expression after treating modified HaCaT cells with gefitinib, afatinib, and cetuximab. The CoGAPS algorithm distinguishes a gene expression signature associated with the anticipated silencing of the EGFR network. It also infers a feedback signature with EGFR gene expression itself increasing in cells that are responsive to EGFR inhibitors. This feedback signature has increased expression of several growth factor receptors regulated by the AP-2 family of transcription factors. The gene expression signatures for AP-2alpha are further correlated with sensitivity to cetuximab treatment in HNSCC cell lines and changes in EGFR expression in HNSCC tumors with low CDKN2A gene expression. In addition, the AP-2alpha gene expression signatures are also associated with inhibition of MEK, PI3K, and mTOR pathways in the Library of Integrated Network-Based Cellular Signatures (LINCS) data. These results suggest that AP-2 transcription factors are activated as feedback from EGFR network inhibition and may mediate EGFR inhibitor resistance. PMID:27650546
Template-based procedures for neural network interpretation.

PubMed

Alexander, J A.; Mozer, M C.

1999-04-01

Although neural networks often achieve impressive learning and generalization performance, their internal workings are typically all but impossible to decipher. This characteristic of the networks, their opacity, is one of the disadvantages of connectionism compared to more traditional, rule-oriented approaches to artificial intelligence. Without a thorough understanding of the network behavior, confidence in a system's results is lowered, and the transfer of learned knowledge to other processing systems - including humans - is precluded. Methods that address the opacity problem by casting network weights in symbolic terms are commonly referred to as rule extraction techniques. This work describes a principled approach to symbolic rule extraction from standard multilayer feedforward networks based on the notion of weight templates, parameterized regions of weight space corresponding to specific symbolic expressions. With an appropriate choice of representation, we show how template parameters may be efficiently identified and instantiated to yield the optimal match to the actual weights of a unit. Depending on the requirements of the application domain, the approach can accommodate n-ary disjunctions and conjunctions with O(k) complexity, simple n-of-m expressions with O(k(2)) complexity, or more general classes of recursive n-of-m expressions with O(k(L+2)) complexity, where k is the number of inputs to an unit and L the recursion level of the expression class. Compared to other approaches in the literature, our method of rule extraction offers benefits in simplicity, computational performance, and overall flexibility. Simulation results on a variety of problems demonstrate the application of our procedures as well as the strengths and the weaknesses of our general approach.
Gene regulatory networks reused to build novel traits: co-option of an eye-related gene regulatory network in eye-like organs and red wing patches on insect wings is suggested by optix expression.

PubMed

Monteiro, Antónia

2012-03-01

Co-option of the eye developmental gene regulatory network may have led to the appearance of novel functional traits on the wings of flies and butterflies. The first trait is a recently described wing organ in a species of extinct midge resembling the outer layers of the midge's own compound eye. The second trait is red pigment patches on Heliconius butterfly wings connected to the expression of an eye selector gene, optix. These examples, as well as others, are discussed regarding the type of empirical evidence and burden of proof that have been used to infer gene network co-option underlying the origin of novel traits. A conceptual framework describing increasing confidence in inference of network co-option is proposed. Novel research directions to facilitate inference of network co-option are also highlighted, especially in cases where the pre-existent and novel traits do not resemble each other. Copyright © 2012 WILEY Periodicals, Inc.
THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer's Disease.

PubMed

Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K

2016-11-30

There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer's disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer's disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer's disease brains. The biological pathways associated with Alzheimer's disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature.
THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer’s Disease

PubMed Central

Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K.

2016-01-01

There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer’s disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer’s disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer’s disease brains. The biological pathways associated with Alzheimer’s disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature. PMID:27901073
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery

PubMed Central

Feng, Chunlai; Araki, Michihiro; Kunimoto, Ryo; Tamon, Akiko; Makiguchi, Hiroki; Niijima, Satoshi; Tsujimoto, Gozoh; Okuno, Yasushi

2009-01-01

Background DNA microarray technology provides us with a first step toward the goal of uncovering gene functions on a genomic scale. In recent years, vast amounts of gene expression data have been collected, much of which are available in public databases, such as the Gene Expression Omnibus (GEO). To date, most researchers have been manually retrieving data from databases through web browsers using accession numbers (IDs) or keywords, but gene-expression patterns are not considered when retrieving such data. The Connectivity Map was recently introduced to compare gene expression data by introducing gene-expression signatures (represented by a set of genes with up- or down-regulated labels according to their biological states) and is available as a web tool for detecting similar gene-expression signatures from a limited data set (approximately 7,000 expression profiles representing 1,309 compounds). In order to support researchers to utilize the public gene expression data more effectively, we developed a web tool for finding similar gene expression data and generating its co-expression networks from a publicly available database. Results GEM-TREND, a web tool for searching gene expression data, allows users to search data from GEO using gene-expression signatures or gene expression ratio data as a query and retrieve gene expression data by comparing gene-expression pattern between the query and GEO gene expression data. The comparison methods are based on the nonparametric, rank-based pattern matching approach of Lamb et al. (Science 2006) with the additional calculation of statistical significance. The web tool was tested using gene expression ratio data randomly extracted from the GEO and with in-house microarray data, respectively. The results validated the ability of GEM-TREND to retrieve gene expression entries biologically related to a query from GEO. For further analysis, a network visualization interface is also provided, whereby genes and gene annotations are dynamically linked to external data repositories. Conclusion GEM-TREND was developed to retrieve gene expression data by comparing query gene-expression pattern with those of GEO gene expression data. It could be a very useful resource for finding similar gene expression profiles and constructing its gene co-expression networks from a publicly available database. GEM-TREND was designed to be user-friendly and is expected to support knowledge discovery. GEM-TREND is freely available at . PMID:19728865
Co-LncRNA: investigating the lncRNA combinatorial effects in GO annotations and KEGG pathways based on human RNA-Seq data

PubMed Central

Zhao, Zheng; Bai, Jing; Wu, Aiwei; Wang, Yuan; Zhang, Jinwen; Wang, Zishan; Li, Yongsheng; Xu, Juan; Li, Xia

2015-01-01

Long non-coding RNAs (lncRNAs) are emerging as key regulators of diverse biological processes and diseases. However, the combinatorial effects of these molecules in a specific biological function are poorly understood. Identifying co-expressed protein-coding genes of lncRNAs would provide ample insight into lncRNA functions. To facilitate such an effort, we have developed Co-LncRNA, which is a web-based computational tool that allows users to identify GO annotations and KEGG pathways that may be affected by co-expressed protein-coding genes of a single or multiple lncRNAs. LncRNA co-expressed protein-coding genes were first identified in publicly available human RNA-Seq datasets, including 241 datasets across 6560 total individuals representing 28 tissue types/cell lines. Then, the lncRNA combinatorial effects in a given GO annotations or KEGG pathways are taken into account by the simultaneous analysis of multiple lncRNAs in user-selected individual or multiple datasets, which is realized by enrichment analysis. In addition, this software provides a graphical overview of pathways that are modulated by lncRNAs, as well as a specific tool to display the relevant networks between lncRNAs and their co-expressed protein-coding genes. Co-LncRNA also supports users in uploading their own lncRNA and protein-coding gene expression profiles to investigate the lncRNA combinatorial effects. It will be continuously updated with more human RNA-Seq datasets on an annual basis. Taken together, Co-LncRNA provides a web-based application for investigating lncRNA combinatorial effects, which could shed light on their biological roles and could be a valuable resource for this community. Database URL: http://www.bio-bigdata.com/Co-LncRNA/ PMID:26363020
Structural covariance networks are coupled to expression of genes enriched in supragranular layers of the human cortex.

PubMed

Romero-Garcia, Rafael; Whitaker, Kirstie J; Váša, František; Seidlitz, Jakob; Shinn, Maxwell; Fonagy, Peter; Dolan, Raymond J; Jones, Peter B; Goodyer, Ian M; Bullmore, Edward T; Vértes, Petra E

2018-05-01

Complex network topology is characteristic of many biological systems, including anatomical and functional brain networks (connectomes). Here, we first constructed a structural covariance network from MRI measures of cortical thickness on 296 healthy volunteers, aged 14-24 years. Next, we designed a new algorithm for matching sample locations from the Allen Brain Atlas to the nodes of the SCN. Subsequently we used this to define, transcriptomic brain networks by estimating gene co-expression between pairs of cortical regions. Finally, we explored the hypothesis that transcriptional networks and structural MRI connectomes are coupled. A transcriptional brain network (TBN) and a structural covariance network (SCN) were correlated across connection weights and showed qualitatively similar complex topological properties: assortativity, small-worldness, modularity, and a rich-club. In both networks, the weight of an edge was inversely related to the anatomical (Euclidean) distance between regions. There were differences between networks in degree and distance distributions: the transcriptional network had a less fat-tailed degree distribution and a less positively skewed distance distribution than the SCN. However, cortical areas connected to each other within modules of the SCN had significantly higher levels of whole genome co-expression than expected by chance. Nodes connected in the SCN had especially high levels of expression and co-expression of a human supragranular enriched (HSE) gene set that has been specifically located to supragranular layers of human cerebral cortex and is known to be important for large-scale, long-distance cortico-cortical connectivity. This coupling of brain transcriptome and connectome topologies was largely but not entirely accounted for by the common constraint of physical distance on both networks. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Microfluidics-Based PCR for Fusion Transcript Detection.

PubMed

Chen, Hui

2016-01-01

The microfluidic technology allows the production of network of submillimeter-size fluidic channels and reservoirs in a variety of material systems. The microfluidic-based polymerase chain reaction (PCR) allows automated multiplexing of multiple samples and multiple assays simultaneously within a network of microfluidic channels and chambers that are co-ordinated in controlled fashion by the valves. The individual PCR reaction is performed in nanoliter volume, which allows testing on samples with limited DNA and RNA. The microfluidics devices are used in various types of PCR such as digital PCR and single molecular emulsion PCR for genotyping, gene expression, and miRNA expression. In this chapter, the use of a microfluidics-based PCR for simultaneous screening of 14 known fusion transcripts in patients with leukemia is described.
Temporal network analysis identifies early physiological and transcriptomic indicators of mild drought in Brassica rapa

PubMed Central

Gehan, Malia A; Mockler, Todd C; Weinig, Cynthia; Ewers, Brent E

2017-01-01

The dynamics of local climates make development of agricultural strategies challenging. Yield improvement has progressed slowly, especially in drought-prone regions where annual crop production suffers from episodic aridity. Underlying drought responses are circadian and diel control of gene expression that regulate daily variations in metabolic and physiological pathways. To identify transcriptomic changes that occur in the crop Brassica rapa during initial perception of drought, we applied a co-expression network approach to associate rhythmic gene expression changes with physiological responses. Coupled analysis of transcriptome and physiological parameters over a two-day time course in control and drought-stressed plants provided temporal resolution necessary for correlation of network modules with dynamic changes in stomatal conductance, photosynthetic rate, and photosystem II efficiency. This approach enabled the identification of drought-responsive genes based on their differential rhythmic expression profiles in well-watered versus droughted networks and provided new insights into the dynamic physiological changes that occur during drought. PMID:28826479
Towards Reliable and Energy-Efficient Incremental Cooperative Communication for Wireless Body Area Networks.

PubMed

Yousaf, Sidrah; Javaid, Nadeem; Qasim, Umar; Alrajeh, Nabil; Khan, Zahoor Ali; Ahmed, Mansoor

2016-02-24

In this study, we analyse incremental cooperative communication for wireless body area networks (WBANs) with different numbers of relays. Energy efficiency (EE) and the packet error rate (PER) are investigated for different schemes. We propose a new cooperative communication scheme with three-stage relaying and compare it to existing schemes. Our proposed scheme provides reliable communication with less PER at the cost of surplus energy consumption. Analytical expressions for the EE of the proposed three-stage cooperative communication scheme are also derived, taking into account the effect of PER. Later on, the proposed three-stage incremental cooperation is implemented in a network layer protocol; enhanced incremental cooperative critical data transmission in emergencies for static WBANs (EInCo-CEStat). Extensive simulations are conducted to validate the proposed scheme. Results of incremental relay-based cooperative communication protocols are compared to two existing cooperative routing protocols: cooperative critical data transmission in emergencies for static WBANs (Co-CEStat) and InCo-CEStat. It is observed from the simulation results that incremental relay-based cooperation is more energy efficient than the existing conventional cooperation protocol, Co-CEStat. The results also reveal that EInCo-CEStat proves to be more reliable with less PER and higher throughput than both of the counterpart protocols. However, InCo-CEStat has less throughput with a greater stability period and network lifetime. Due to the availability of more redundant links, EInCo-CEStat achieves a reduced packet drop rate at the cost of increased energy consumption.
Towards Reliable and Energy-Efficient Incremental Cooperative Communication for Wireless Body Area Networks

PubMed Central

Yousaf, Sidrah; Javaid, Nadeem; Qasim, Umar; Alrajeh, Nabil; Khan, Zahoor Ali; Ahmed, Mansoor

2016-01-01

In this study, we analyse incremental cooperative communication for wireless body area networks (WBANs) with different numbers of relays. Energy efficiency (EE) and the packet error rate (PER) are investigated for different schemes. We propose a new cooperative communication scheme with three-stage relaying and compare it to existing schemes. Our proposed scheme provides reliable communication with less PER at the cost of surplus energy consumption. Analytical expressions for the EE of the proposed three-stage cooperative communication scheme are also derived, taking into account the effect of PER. Later on, the proposed three-stage incremental cooperation is implemented in a network layer protocol; enhanced incremental cooperative critical data transmission in emergencies for static WBANs (EInCo-CEStat). Extensive simulations are conducted to validate the proposed scheme. Results of incremental relay-based cooperative communication protocols are compared to two existing cooperative routing protocols: cooperative critical data transmission in emergencies for static WBANs (Co-CEStat) and InCo-CEStat. It is observed from the simulation results that incremental relay-based cooperation is more energy efficient than the existing conventional cooperation protocol, Co-CEStat. The results also reveal that EInCo-CEStat proves to be more reliable with less PER and higher throughput than both of the counterpart protocols. However, InCo-CEStat has less throughput with a greater stability period and network lifetime. Due to the availability of more redundant links, EInCo-CEStat achieves a reduced packet drop rate at the cost of increased energy consumption. PMID:26927104
Network-Based Comparative Analysis of Arabidopsis Immune Responses to Golovinomyces orontii and Botrytis cinerea Infections.

PubMed

Jiang, Zhenhong; Dong, Xiaobao; Zhang, Ziding

2016-01-11

A comprehensive exploration of common and specific plant responses to biotrophs and necrotrophs is necessary for a better understanding of plant immunity. Here, we compared the Arabidopsis defense responses evoked by the biotrophic fungus Golovinomyces orontii and the necrotrophic fungus Botrytis cinerea through integrative network analysis. Two time-course transcriptional datasets were integrated with an Arabidopsis protein-protein interaction (PPI) network to construct a G. orontii conditional PPI sub-network (gCPIN) and a B. cinerea conditional PPI sub-network (bCPIN). We found that hubs in gCPIN and bCPIN played important roles in disease resistance. Hubs in bCPIN evolved faster than hubs in gCPIN, indicating the different selection pressures imposed on plants by different pathogens. By analyzing the common network from gCPIN and bCPIN, we identified two network components in which the genes were heavily involved in defense and development, respectively. The co-expression relationships between interacting proteins connecting the two components were different under G. orontii and B. cinerea infection conditions. Closer inspection revealed that auxin-related genes were overrepresented in the interactions connecting these two components, suggesting a critical role of auxin signaling in regulating the different co-expression relationships. Our work may provide new insights into plant defense responses against pathogens with different lifestyles.
A reverse engineering approach to optimize experiments for the construction of biological regulatory networks.

PubMed

Zhang, Xiaomeng; Shao, Bin; Wu, Yangle; Qi, Ouyang

2013-01-01

One of the major objectives in systems biology is to understand the relation between the topological structures and the dynamics of biological regulatory networks. In this context, various mathematical tools have been developed to deduct structures of regulatory networks from microarray expression data. In general, from a single data set, one cannot deduct the whole network structure; additional expression data are usually needed. Thus how to design a microarray expression experiment in order to get the most information is a practical problem in systems biology. Here we propose three methods, namely, maximum distance method, trajectory entropy method, and sampling method, to derive the optimal initial conditions for experiments. The performance of these methods is tested and evaluated in three well-known regulatory networks (budding yeast cell cycle, fission yeast cell cycle, and E. coli. SOS network). Based on the evaluation, we propose an efficient strategy for the design of microarray expression experiments.
Cluster and propensity based approximation of a network

PubMed Central

2013-01-01

Background The models in this article generalize current models for both correlation networks and multigraph networks. Correlation networks are widely applied in genomics research. In contrast to general networks, it is straightforward to test the statistical significance of an edge in a correlation network. It is also easy to decompose the underlying correlation matrix and generate informative network statistics such as the module eigenvector. However, correlation networks only capture the connections between numeric variables. An open question is whether one can find suitable decompositions of the similarity measures employed in constructing general networks. Multigraph networks are attractive because they support likelihood based inference. Unfortunately, it is unclear how to adjust current statistical methods to detect the clusters inherent in many data sets. Results Here we present an intuitive and parsimonious parametrization of a general similarity measure such as a network adjacency matrix. The cluster and propensity based approximation (CPBA) of a network not only generalizes correlation network methods but also multigraph methods. In particular, it gives rise to a novel and more realistic multigraph model that accounts for clustering and provides likelihood based tests for assessing the significance of an edge after controlling for clustering. We present a novel Majorization-Minimization (MM) algorithm for estimating the parameters of the CPBA. To illustrate the practical utility of the CPBA of a network, we apply it to gene expression data and to a bi-partite network model for diseases and disease genes from the Online Mendelian Inheritance in Man (OMIM). Conclusions The CPBA of a network is theoretically appealing since a) it generalizes correlation and multigraph network methods, b) it improves likelihood based significance tests for edge counts, c) it directly models higher-order relationships between clusters, and d) it suggests novel clustering algorithms. The CPBA of a network is implemented in Fortran 95 and bundled in the freely available R package PropClust. PMID:23497424
A statistically inferred microRNA network identifies breast cancer target miR-940 as an actin cytoskeleton regulator

NASA Astrophysics Data System (ADS)

Bhajun, Ricky; Guyon, Laurent; Pitaval, Amandine; Sulpice, Eric; Combe, Stéphanie; Obeid, Patricia; Haguet, Vincent; Ghorbel, Itebeddine; Lajaunie, Christian; Gidrol, Xavier

2015-02-01

MiRNAs are key regulators of gene expression. By binding to many genes, they create a complex network of gene co-regulation. Here, using a network-based approach, we identified miRNA hub groups by their close connections and common targets. In one cluster containing three miRNAs, miR-612, miR-661 and miR-940, the annotated functions of the co-regulated genes suggested a role in small GTPase signalling. Although the three members of this cluster targeted the same subset of predicted genes, we showed that their overexpression impacted cell fates differently. miR-661 demonstrated enhanced phosphorylation of myosin II and an increase in cell invasion, indicating a possible oncogenic miRNA. On the contrary, miR-612 and miR-940 inhibit phosphorylation of myosin II and cell invasion. Finally, expression profiling in human breast tissues showed that miR-940 was consistently downregulated in breast cancer tissues

RNA sequencing analysis reveals new findings of hyperbaric oxygen treatment on rats with acute carbon monoxide poisoning.

PubMed

Wang, Wenlan; Xue, Li; Li, Ya; Li, Rong; Xie, Xiaoping; Bao, Junxiang; Hai, Chunxu; Li, Jinsheng

2016-01-01

To elucidate the altered gene network in the brains of carbon monoxide (CO) poisoned rats after treatment with hyperbaric oxygen (HBO₂). RNA sequencing (RNA-seq) analysis was performed to examine differentially expressed genes (DEGs) in brain tissue samples from nine male rats: a normal control group; a CO poisoning group; and an HBO₂ treatment group (three rats/group). Reverse transcription polymerase chain reaction (RT-PCR) and real-time quantitative PCR were used for validation of the DEGs in another 18 male rats (six rats/group). RNA-seq revealed that two genes were upregulated (4.18 and 8.76 log to the base 2 fold change) (p⟨0.05) in the CO-poisoned rats relative to the control rats; two genes were upregulated (3.88 and 7.69 log to the base 2 fold change); and 23 genes were downregulated (3.49-15.12 log to the base 2 fold change) (p⟨0.05) in the brains of the HBO₂-treated rats relative to the CO-poisoned rats. Target prediction of DEGs by gene network analysis and analysis of pathways affected suggested that regulation of gene expressions of dopamine metabolism and nitric oxide (NO) synthesis were significantly affected by CO poisoning and HBO₂ treatment. Results of RT-PCR and real-time quantitative PCR indicated that four genes (Pomc, GH-1, Pr1 and Fshβ) associated with hormone secretion in the hypothalamic-pituitary system have potential as markers for prognosis of CO. This study is the first RNA-seq analysis profile of HBO₂ treatment on rats with acute CO poisoning. It concludes that changes of hormone secretion in the hypothalamic-pituitary system, dopamine metabolism and NO synthesis involved in brain damage and behavior abnormalities after CO poisoning and HBO₂ therapy may regulate these changes.
Porcine Tissue-Specific Regulatory Networks Derived from Meta-Analysis of the Transcriptome

PubMed Central

Pérez-Montarelo, Dafne; Hudson, Nicholas J.; Fernández, Ana I.; Ramayo-Caldas, Yuliaxis; Dalrymple, Brian P.; Reverter, Antonio

2012-01-01

The processes that drive tissue identity and differentiation remain unclear for most tissue types. So are the gene networks and transcription factors (TF) responsible for the differential structure and function of each particular tissue, and this is particularly true for non model species with incomplete genomic resources. To better understand the regulation of genes responsible for tissue identity in pigs, we have inferred regulatory networks from a meta-analysis of 20 gene expression studies spanning 480 Porcine Affymetrix chips for 134 experimental conditions on 27 distinct tissues. We developed a mixed-model normalization approach with a covariance structure that accommodated the disparity in the origin of the individual studies, and obtained the normalized expression of 12,320 genes across the 27 tissues. Using this resource, we constructed a network, based on the co-expression patterns of 1,072 TF and 1,232 tissue specific genes. The resulting network is consistent with the known biology of tissue development. Within the network, genes clustered by tissue and tissues clustered by site of embryonic origin. These clusters were significantly enriched for genes annotated in key relevant biological processes and confirm gene functions and interactions from the literature. We implemented a Regulatory Impact Factor (RIF) metric to identify the key regulators in skeletal muscle and tissues from the central nervous systems. The normalization of the meta-analysis, the inference of the gene co-expression network and the RIF metric, operated synergistically towards a successful search for tissue-specific regulators. Novel among these findings are evidence suggesting a novel key role of ERCC3 as a muscle regulator. Together, our results recapitulate the known biology behind tissue specificity and provide new valuable insights in a less studied but valuable model species. PMID:23049964
Informed walks: whispering hints to gene hunters inside networks' jungle.

PubMed

Bourdakou, Marilena M; Spyrou, George M

2017-10-11

Systemic approaches offer a different point of view on the analysis of several types of molecular associations as well as on the identification of specific gene communities in several cancer types. However, due to lack of sufficient data needed to construct networks based on experimental evidence, statistical gene co-expression networks are widely used instead. Many efforts have been made to exploit the information hidden in these networks. However, these approaches still need to capitalize comprehensively the prior knowledge encrypted into molecular pathway associations and improve their efficiency regarding the discovery of both exclusive subnetworks as candidate biomarkers and conserved subnetworks that may uncover common origins of several cancer types. In this study we present the development of the Informed Walks model based on random walks that incorporate information from molecular pathways to mine candidate genes and gene-gene links. The proposed model has been applied to TCGA (The Cancer Genome Atlas) datasets from seven different cancer types, exploring the reconstructed co-expression networks of the whole set of genes and driving to highlighted sub-networks for each cancer type. In the sequel, we elucidated the impact of each subnetwork on the indication of underlying exclusive and common molecular mechanisms as well as on the short-listing of drugs that have the potential to suppress the corresponding cancer type through a drug-repurposing pipeline. We have developed a method of gene subnetwork highlighting based on prior knowledge, capable to give fruitful insights regarding the underlying molecular mechanisms and valuable input to drug-repurposing pipelines for a variety of cancer types.
Co-expression networks reveal the tissue-specific regulation of transcription and splicing

PubMed Central

Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D.H.; Jo, Brian; Gao, Chuan; McDowell, Ian C.; Engelhardt, Barbara E.

2017-01-01

Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. PMID:29021288
Differential Network Analysis Reveals Evolutionary Complexity in Secondary Metabolism of Rauvolfia serpentina over Catharanthus roseus

PubMed Central

Pathania, Shivalika; Bagler, Ganesh; Ahuja, Paramvir S.

2016-01-01

Comparative co-expression analysis of multiple species using high-throughput data is an integrative approach to determine the uniformity as well as diversification in biological processes. Rauvolfia serpentina and Catharanthus roseus, both members of Apocyanacae family, are reported to have remedial properties against multiple diseases. Despite of sharing upstream of terpenoid indole alkaloid pathway, there is significant diversity in tissue-specific synthesis and accumulation of specialized metabolites in these plants. This led us to implement comparative co-expression network analysis to investigate the modules and genes responsible for differential tissue-specific expression as well as species-specific synthesis of metabolites. Toward these goals differential network analysis was implemented to identify candidate genes responsible for diversification of metabolites profile. Three genes were identified with significant difference in connectivity leading to differential regulatory behavior between these plants. These genes may be responsible for diversification of secondary metabolism, and thereby for species-specific metabolite synthesis. The network robustness of R. serpentina, determined based on topological properties, was also complemented by comparison of gene-metabolite networks of both plants, and may have evolved to have complex metabolic mechanisms as compared to C. roseus under the influence of various stimuli. This study reveals evolution of complexity in secondary metabolism of R. serpentina, and key genes that contribute toward diversification of specific metabolites. PMID:27588023
Differential Network Analysis Reveals Evolutionary Complexity in Secondary Metabolism of Rauvolfia serpentina over Catharanthus roseus.

PubMed

Pathania, Shivalika; Bagler, Ganesh; Ahuja, Paramvir S

2016-01-01

Comparative co-expression analysis of multiple species using high-throughput data is an integrative approach to determine the uniformity as well as diversification in biological processes. Rauvolfia serpentina and Catharanthus roseus, both members of Apocyanacae family, are reported to have remedial properties against multiple diseases. Despite of sharing upstream of terpenoid indole alkaloid pathway, there is significant diversity in tissue-specific synthesis and accumulation of specialized metabolites in these plants. This led us to implement comparative co-expression network analysis to investigate the modules and genes responsible for differential tissue-specific expression as well as species-specific synthesis of metabolites. Toward these goals differential network analysis was implemented to identify candidate genes responsible for diversification of metabolites profile. Three genes were identified with significant difference in connectivity leading to differential regulatory behavior between these plants. These genes may be responsible for diversification of secondary metabolism, and thereby for species-specific metabolite synthesis. The network robustness of R. serpentina, determined based on topological properties, was also complemented by comparison of gene-metabolite networks of both plants, and may have evolved to have complex metabolic mechanisms as compared to C. roseus under the influence of various stimuli. This study reveals evolution of complexity in secondary metabolism of R. serpentina, and key genes that contribute toward diversification of specific metabolites.
The solvability of quantum k-pair network in a measurement-based way.

PubMed

Li, Jing; Xu, Gang; Chen, Xiu-Bo; Qu, Zhiguo; Niu, Xin-Xin; Yang, Yi-Xian

2017-12-01

Network coding is an effective means to enhance the communication efficiency. The characterization of network solvability is one of the most important topic in this field. However, for general network, the solvability conditions are still a challenge. In this paper, we consider the solvability of general quantum k-pair network in measurement-based framework. For the first time, a detailed account of measurement-based quantum network coding(MB-QNC) is specified systematically. Differing from existing coding schemes, single qubit measurements on a pre-shared graph state are the only allowed coding operations. Since no control operations are concluded, it makes MB-QNC schemes more feasible. Further, the sufficient conditions formulating by eigenvalue equations and stabilizer matrix are presented, which build an unambiguous relation among the solvability and the general network. And this result can also analyze the feasibility of sharing k EPR pairs task in large-scale networks. Finally, in the presence of noise, we analyze the advantage of MB-QNC in contrast to gate-based way. By an instance network [Formula: see text], we show that MB-QNC allows higher error thresholds. Specially, for X error, the error threshold is about 30% higher than 10% in gate-based way. In addition, the specific expressions of fidelity subject to some constraint conditions are given.
Discretization provides a conceptually simple tool to build expression networks.

PubMed

Vass, J Keith; Higham, Desmond J; Mudaliar, Manikhandan A V; Mao, Xuerong; Crowther, Daniel J

2011-04-18

Biomarker identification, using network methods, depends on finding regular co-expression patterns; the overall connectivity is of greater importance than any single relationship. A second requirement is a simple algorithm for ranking patients on how relevant a gene-set is. For both of these requirements discretized data helps to first identify gene cliques, and then to stratify patients.We explore a biologically intuitive discretization technique which codes genes as up- or down-regulated, with values close to the mean set as unchanged; this allows a richer description of relationships between genes than can be achieved by positive and negative correlation. We find a close agreement between our results and the template gene-interactions used to build synthetic microarray-like data by SynTReN, which synthesizes "microarray" data using known relationships which are successfully identified by our method.We are able to split positive co-regulation into up-together and down-together and negative co-regulation is considered as directed up-down relationships. In some cases these exist in only one direction, with real data, but not with the synthetic data. We illustrate our approach using two studies on white blood cells and derived immortalized cell lines and compare the approach with standard correlation-based computations. No attempt is made to distinguish possible causal links as the search for biomarkers would be crippled by losing highly significant co-expression relationships. This contrasts with approaches like ARACNE and IRIS.The method is illustrated with an analysis of gene-expression for energy metabolism pathways. For each discovered relationship we are able to identify the samples on which this is based in the discretized sample-gene matrix, along with a simplified view of the patterns of gene expression; this helps to dissect the gene-sample relevant to a research topic--identifying sets of co-regulated and anti-regulated genes and the samples or patients in which this relationship occurs.
Convergent roles of de novo mutations and common variants in schizophrenia in tissue-specific and spatiotemporal co-expression network.

PubMed

Jia, Peilin; Chen, Xiangning; Fanous, Ayman H; Zhao, Zhongming

2018-05-24

Genetic components susceptible to complex disease such as schizophrenia include a wide spectrum of variants, including common variants (CVs) and de novo mutations (DNMs). Although CVs and DNMs differ by origin, it remains elusive whether and how they interact at the gene, pathway, and network levels that leads to the disease. In this work, we characterized the genes harboring schizophrenia-associated CVs (CVgenes) and the genes harboring DNMs (DNMgenes) using measures from network, tissue-specific expression profile, and spatiotemporal brain expression profile. We developed an algorithm to link the DNMgenes and CVgenes in spatiotemporal brain co-expression networks. DNMgenes tended to have central roles in the human protein-protein interaction (PPI) network, evidenced in their high degree and high betweenness values. DNMgenes and CVgenes connected with each other significantly more often than with other genes in the networks. However, only CVgenes remained significantly connected after adjusting for their degree. In our gene co-expression PPI network, we found DNMgenes and CVgenes connected in a tissue-specific fashion, and such a pattern was similar to that in GTEx brain but not in other GTEx tissues. Importantly, DNMgene-CVgene subnetworks were enriched with pathways of chromatin remodeling, MHC protein complex binding, and neurotransmitter activities. In summary, our results unveiled that both DNMgenes and CVgenes contributed to a core set of biologically important pathways and networks, and their interactions may attribute to the risk for schizophrenia. Our results also suggested a stronger biological effect of DNMgenes than CVgenes in schizophrenia.
Modeling genome-wide dynamic regulatory network in mouse lungs with influenza infection using high-dimensional ordinary differential equations.

PubMed

Wu, Shuang; Liu, Zhi-Ping; Qiu, Xing; Wu, Hulin

2014-01-01

The immune response to viral infection is regulated by an intricate network of many genes and their products. The reverse engineering of gene regulatory networks (GRNs) using mathematical models from time course gene expression data collected after influenza infection is key to our understanding of the mechanisms involved in controlling influenza infection within a host. A five-step pipeline: detection of temporally differentially expressed genes, clustering genes into co-expressed modules, identification of network structure, parameter estimate refinement, and functional enrichment analysis, is developed for reconstructing high-dimensional dynamic GRNs from genome-wide time course gene expression data. Applying the pipeline to the time course gene expression data from influenza-infected mouse lungs, we have identified 20 distinct temporal expression patterns in the differentially expressed genes and constructed a module-based dynamic network using a linear ODE model. Both intra-module and inter-module annotations and regulatory relationships of our inferred network show some interesting findings and are highly consistent with existing knowledge about the immune response in mice after influenza infection. The proposed method is a computationally efficient, data-driven pipeline bridging experimental data, mathematical modeling, and statistical analysis. The application to the influenza infection data elucidates the potentials of our pipeline in providing valuable insights into systematic modeling of complicated biological processes.
Analysis of Temporal-spatial Co-variation within Gene Expression Microarray Data in an Organogenesis Model

NASA Astrophysics Data System (ADS)

Ehler, Martin; Rajapakse, Vinodh; Zeeberg, Barry; Brooks, Brian; Brown, Jacob; Czaja, Wojciech; Bonner, Robert F.

The gene networks underlying closure of the optic fissure during vertebrate eye development are poorly understood. We used a novel clustering method based on Laplacian Eigenmaps, a nonlinear dimension reduction method, to analyze microarray data from laser capture microdissected (LCM) cells at the site and developmental stages (days 10.5 to 12.5) of optic fissure closure. Our new method provided greater biological specificity than classical clustering algorithms in terms of identifying more biological processes and functions related to eye development as defined by Gene Ontology at lower false discovery rates. This new methodology builds on the advantages of LCM to isolate pure phenotypic populations within complex tissues and allows improved ability to identify critical gene products expressed at lower copy number. The combination of LCM of embryonic organs, gene expression microarrays, and extracting spatial and temporal co-variations appear to be a powerful approach to understanding the gene regulatory networks that specify mammalian organogenesis.
Bayesian estimation of the discrete coefficient of determination.

PubMed

Chen, Ting; Braga-Neto, Ulisses M

2016-12-01

The discrete coefficient of determination (CoD) measures the nonlinear interaction between discrete predictor and target variables and has had far-reaching applications in Genomic Signal Processing. Previous work has addressed the inference of the discrete CoD using classical parametric and nonparametric approaches. In this paper, we introduce a Bayesian framework for the inference of the discrete CoD. We derive analytically the optimal minimum mean-square error (MMSE) CoD estimator, as well as a CoD estimator based on the Optimal Bayesian Predictor (OBP). For the latter estimator, exact expressions for its bias, variance, and root-mean-square (RMS) are given. The accuracy of both Bayesian CoD estimators with non-informative and informative priors, under fixed or random parameters, is studied via analytical and numerical approaches. We also demonstrate the application of the proposed Bayesian approach in the inference of gene regulatory networks, using gene-expression data from a previously published study on metastatic melanoma.
Human-computer interaction reflected in the design of user interfaces for general practitioners.

PubMed

Stoicu-Tivadar, Lacramioara; Stoicu-Tivadar, Vasile

2006-01-01

To address the problem of properly built health information systems in general practice as an important issue for their approval and use in clinical practice. We present how a national general practitioner (GP) network was built, put in practice and several results of its activity seen from the clinician's and the software application team's points of view. We used a multi-level incremental development appropriate for the conditions of the required information system. After the development of the first version of the software components (based on rapid prototyping) of the sentinel network, a questionnaire addressed the needs and improvements required by the health professionals. Based on the answers, the functionality of the system and the interface were improved regarding the real needs expressed by the end-users. The network is functional and the collected data from the network are being processed using statistical methods. The academic software team developed a GP application that is well received by the GPs in the network, as resulted from the survey and discussions during the training period. As an added confirmation, several GPs outside the network enrolled after seeing the software at work. Another confirmation that we did a good job was that after the final presentation of the results of the project a representative from the Romanian Society for Cardiology expressed the wish of this society to access the data yielded by the network.
Integrated Module and Gene-Specific Regulatory Inference Implicates Upstream Signaling Networks

PubMed Central

Roy, Sushmita; Lagree, Stephen; Hou, Zhonggang; Thomson, James A.; Stewart, Ron; Gasch, Audrey P.

2013-01-01

Regulatory networks that control gene expression are important in diverse biological contexts including stress response and development. Each gene's regulatory program is determined by module-level regulation (e.g. co-regulation via the same signaling system), as well as gene-specific determinants that can fine-tune expression. We present a novel approach, Modular regulatory network learning with per gene information (MERLIN), that infers regulatory programs for individual genes while probabilistically constraining these programs to reveal module-level organization of regulatory networks. Using edge-, regulator- and module-based comparisons of simulated networks of known ground truth, we find MERLIN reconstructs regulatory programs of individual genes as well or better than existing approaches of network reconstruction, while additionally identifying modular organization of the regulatory networks. We use MERLIN to dissect global transcriptional behavior in two biological contexts: yeast stress response and human embryonic stem cell differentiation. Regulatory modules inferred by MERLIN capture co-regulatory relationships between signaling proteins and downstream transcription factors thereby revealing the upstream signaling systems controlling transcriptional responses. The inferred networks are enriched for regulators with genetic or physical interactions, supporting the inference, and identify modules of functionally related genes bound by the same transcriptional regulators. Our method combines the strengths of per-gene and per-module methods to reveal new insights into transcriptional regulation in stress and development. PMID:24146602
Differential co-expression analysis reveals a novel prognostic gene module in ovarian cancer.

PubMed

Gov, Esra; Arga, Kazim Yalcin

2017-07-10

Ovarian cancer is one of the most significant disease among gynecological disorders that women suffered from over the centuries. However, disease-specific and effective biomarkers were still not available, since studies have focused on individual genes associated with ovarian cancer, ignoring the interactions and associations among the gene products. Here, ovarian cancer differential co-expression networks were reconstructed via meta-analysis of gene expression data and co-expressed gene modules were identified in epithelial cells from ovarian tumor and healthy ovarian surface epithelial samples to propose ovarian cancer associated genes and their interactions. We propose a novel, highly interconnected, differentially co-expressed, and co-regulated gene module in ovarian cancer consisting of 84 prognostic genes. Furthermore, the specificity of the module to ovarian cancer was shown through analyses of datasets in nine other cancers. These observations underscore the importance of transcriptome based systems biomarkers research in deciphering the elusive pathophysiology of ovarian cancer, and here, we present reciprocal interplay between candidate ovarian cancer genes and their transcriptional regulatory dynamics. The corresponding gene module might provide new insights on ovarian cancer prognosis and treatment strategies that continue to place a significant burden on global health.
GEM2Net: from gene expression modeling to -omics networks, a new CATdb module to investigate Arabidopsis thaliana genes involved in stress response.

PubMed

Zaag, Rim; Tamby, Jean Philippe; Guichard, Cécile; Tariq, Zakia; Rigaill, Guillem; Delannoy, Etienne; Renou, Jean-Pierre; Balzergue, Sandrine; Mary-Huard, Tristan; Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Brunaud, Véronique

2015-01-01

CATdb (http://urgv.evry.inra.fr/CATdb) is a database providing a public access to a large collection of transcriptomic data, mainly for Arabidopsis but also for other plants. This resource has the rare advantage to contain several thousands of microarray experiments obtained with the same technical protocol and analyzed by the same statistical pipelines. In this paper, we present GEM2Net, a new module of CATdb that takes advantage of this homogeneous dataset to mine co-expression units and decipher Arabidopsis gene functions. GEM2Net explores 387 stress conditions organized into 18 biotic and abiotic stress categories. For each one, a model-based clustering is applied on expression differences to identify clusters of co-expressed genes. To characterize functions associated with these clusters, various resources are analyzed and integrated: Gene Ontology, subcellular localization of proteins, Hormone Families, Transcription Factor Families and a refined stress-related gene list associated to publications. Exploiting protein-protein interactions and transcription factors-targets interactions enables to display gene networks. GEM2Net presents the analysis of the 18 stress categories, in which 17,264 genes are involved and organized within 681 co-expression clusters. The meta-data analyses were stored and organized to compose a dynamic Web resource. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Construction and comparison of gene co-expression networks shows complex plant immune responses

PubMed Central

López, Camilo; López-Kleine, Liliana

2014-01-01

Gene co-expression networks (GCNs) are graphic representations that depict the coordinated transcription of genes in response to certain stimuli. GCNs provide functional annotations of genes whose function is unknown and are further used in studies of translational functional genomics among species. In this work, a methodology for the reconstruction and comparison of GCNs is presented. This approach was applied using gene expression data that were obtained from immunity experiments in Arabidopsis thaliana, rice, soybean, tomato and cassava. After the evaluation of diverse similarity metrics for the GCN reconstruction, we recommended the mutual information coefficient measurement and a clustering coefficient-based method for similarity threshold selection. To compare GCNs, we proposed a multivariate approach based on the Principal Component Analysis (PCA). Branches of plant immunity that were exemplified by each experiment were analyzed in conjunction with the PCA results, suggesting both the robustness and the dynamic nature of the cellular responses. The dynamic of molecular plant responses produced networks with different characteristics that are differentiable using our methodology. The comparison of GCNs from plant pathosystems, showed that in response to similar pathogens plants could activate conserved signaling pathways. The results confirmed that the closeness of GCNs projected on the principal component space is an indicative of similarity among GCNs. This also can be used to understand global patterns of events triggered during plant immune responses. PMID:25320678
Network analysis reveals stage-specific changes in zebrafish embryo development using time course whole transcriptome profiling and prior biological knowledge.

PubMed

Zhang, Yuji

2015-01-01

Molecular networks act as the backbone of molecular activities within cells, offering a unique opportunity to better understand the mechanism of diseases. While network data usually constitute only static network maps, integrating them with time course gene expression information can provide clues to the dynamic features of these networks and unravel the mechanistic driver genes characterizing cellular responses. Time course gene expression data allow us to broadly "watch" the dynamics of the system. However, one challenge in the analysis of such data is to establish and characterize the interplay among genes that are altered at different time points in the context of a biological process or functional category. Integrative analysis of these data sources will lead us a more complete understanding of how biological entities (e.g., genes and proteins) coordinately perform their biological functions in biological systems. In this paper, we introduced a novel network-based approach to extract functional knowledge from time-dependent biological processes at a system level using time course mRNA sequencing data in zebrafish embryo development. The proposed method was applied to investigate 1α, 25(OH)2D3-altered mechanisms in zebrafish embryo development. We applied the proposed method to a public zebrafish time course mRNA-Seq dataset, containing two different treatments along four time points. We constructed networks between gene ontology biological process categories, which were enriched in differential expressed genes between consecutive time points and different conditions. The temporal propagation of 1α, 25-Dihydroxyvitamin D3-altered transcriptional changes started from a few genes that were altered initially at earlier stage, to large groups of biological coherent genes at later stages. The most notable biological processes included neuronal and retinal development and generalized stress response. In addition, we also investigated the relationship among biological processes enriched in co-expressed genes under different conditions. The enriched biological processes include translation elongation, nucleosome assembly, and retina development. These network dynamics provide new insights into the impact of 1α, 25-Dihydroxyvitamin D3 treatment in bone and cartilage development. We developed a network-based approach to analyzing the DEGs at different time points by integrating molecular interactions and gene ontology information. These results demonstrate that the proposed approach can provide insight on the molecular mechanisms taking place in vertebrate embryo development upon treatment with 1α, 25(OH)2D3. Our approach enables the monitoring of biological processes that can serve as a basis for generating new testable hypotheses. Such network-based integration approach can be easily extended to any temporal- or condition-dependent genomic data analyses.
Identification of the transcriptional regulators by expression profiling infected with hepatitis B virus.

PubMed

Chai, Xiaoqiang; Han, Yanan; Yang, Jian; Zhao, Xianxian; Liu, Yewang; Hou, Xugang; Tang, Yiheng; Zhao, Shirong; Li, Xiao

2016-02-01

The molecular pathogenesis of infection by hepatitis B virus with human is extremely complex and heterogeneous. To date the molecular information is not clearly defined despite intensive research efforts. Thus, studies aimed at transcription and regulation during virus infection or combined researches of those already known to be beneficial are needed. With the purpose of identifying the transcriptional regulators related to infection of hepatitis B virus in gene level, the gene expression profiles from some normal individuals and hepatitis B patients were analyzed in our study. In this work, the differential expressed genes were selected primarily. The several genes among those were validated in an independent set by qRT-PCR. Then the differentially co-expression analysis was conducted to identify differentially co-expressed links and differential co-expressed genes. Next, the analysis of the regulatory impact factors was performed through mapping the links and regulatory data. In order to give a further insight to these regulators, the co-expression gene modules were identified using a threshold-based hierarchical clustering method. Incidentally, the construction of the regulatory network was generated using the computer software. A total of 137,284 differentially co-expressed links and 780 differential co-expressed genes were identified. These co-expressed genes were significantly enriched inflammatory response. The results of regulatory impact factors revealed several crucial regulators related to hepatocellular carcinoma and other high-rank regulators. Meanwhile, more than one hundred co-expression gene modules were identified using clustering method. In our study, some important transcriptional regulators were identified using a computational method, which may enhance the understanding of disease mechanisms and lead to an improved treatment of hepatitis B. However, further experimental studies are required to confirm these findings. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Network-based analysis of oligodendrogliomas predicts novel cancer gene candidates within the region of the 1p/19q co-deletion.

PubMed

Gladitz, Josef; Klink, Barbara; Seifert, Michael

2018-06-11

Oligodendrogliomas are primary human brain tumors with a characteristic 1p/19q co-deletion of important prognostic relevance, but little is known about the pathology of this chromosomal mutation. We developed a network-based approach to identify novel cancer gene candidates in the region of the 1p/19q co-deletion. Gene regulatory networks were learned from gene expression and copy number data of 178 oligodendrogliomas and further used to quantify putative impacts of differentially expressed genes of the 1p/19q region on cancer-relevant pathways. We predicted 8 genes with strong impact on signaling pathways and 14 genes with strong impact on metabolic pathways widespread across the region of the 1p/19 co-deletion. Many of these candidates (e.g. ELTD1, SDHB, SEPW1, SLC17A7, SZRD1, THAP3, ZBTB17) are likely to push, whereas others (e.g. CAP1, HBXIP, KLK6, PARK7, PTAFR) might counteract oligodendroglioma development. For example, ELTD1, a functionally validated glioblastoma oncogene located on 1p, was overexpressed. Further, the known glioblastoma tumor suppressor SLC17A7 located on 19q was underexpressed. Moreover, known epigenetic alterations triggered by mutated SDHB in paragangliomas suggest that underexpressed SDHB in oligodendrogliomas may support and possibly enhance the epigenetic reprogramming induced by the IDH-mutation. We further analyzed rarely observed deletions and duplications of chromosomal arms within oligodendroglioma subcohorts identifying putative oncogenes and tumor suppressors that possibly influence the development of oligodendroglioma subgroups. Our in-depth computational study contributes to a better understanding of the pathology of the 1p/19q co-deletion and other chromosomal arm mutations. This might open opportunities for functional validations and new therapeutic strategies.

Coexpression network based on natural variation in human gene expression reveals gene interactions and functions

PubMed Central

Nayak, Renuka R.; Kearns, Michael; Spielman, Richard S.; Cheung, Vivian G.

2009-01-01

Genes interact in networks to orchestrate cellular processes. Analysis of these networks provides insights into gene interactions and functions. Here, we took advantage of normal variation in human gene expression to infer gene networks, which we constructed using correlations in expression levels of more than 8.5 million gene pairs in immortalized B cells from three independent samples. The resulting networks allowed us to identify biological processes and gene functions. Among the biological pathways, we found processes such as translation and glycolysis that co-occur in the same subnetworks. We predicted the functions of poorly characterized genes, including CHCHD2 and TMEM111, and provided experimental evidence that TMEM111 is part of the endoplasmic reticulum-associated secretory pathway. We also found that IFIH1, a susceptibility gene of type 1 diabetes, interacts with YES1, which plays a role in glucose transport. Furthermore, genes that predispose to the same diseases are clustered nonrandomly in the coexpression network, suggesting that networks can provide candidate genes that influence disease susceptibility. Therefore, our analysis of gene coexpression networks offers information on the role of human genes in normal and disease processes. PMID:19797678
Active subnetwork recovery with a mechanism-dependent scoring function; with application to angiogenesis and organogenesis studies

PubMed Central

2013-01-01

Background The learning active subnetworks problem involves finding subnetworks of a bio-molecular network that are active in a particular condition. Many approaches integrate observation data (e.g., gene expression) with the network topology to find candidate subnetworks. Increasingly, pathway databases contain additional annotation information that can be mined to improve prediction accuracy, e.g., interaction mechanism (e.g., transcription, microRNA, cleavage) annotations. We introduce a mechanism-based approach to active subnetwork recovery which exploits such annotations. We suggest that neighboring interactions in a network tend to be co-activated in a way that depends on the “correlation” of their mechanism annotations. e.g., neighboring phosphorylation and de-phosphorylation interactions may be more likely to be co-activated than neighboring phosphorylation and covalent bonding interactions. Results Our method iteratively learns the mechanism correlations and finds the most likely active subnetwork. We use a probabilistic graphical model with a Markov Random Field component which creates dependencies between the states (active or non-active) of neighboring interactions, that incorporates a mechanism-based component to the function. We apply a heuristic-based EM-based algorithm suitable for the problem. We validated our method’s performance using simulated data in networks downloaded from GeneGO against the same approach without the mechanism-based component, and two other existing methods. We validated our methods performance in correctly recovering (1) the true interaction states, and (2) global network properties of the original network against these other methods. We applied our method to networks generated from time-course gene expression studies in angiogenesis and lung organogenesis and validated the findings from a biological perspective against current literature. Conclusions The advantage of our mechanism-based approach is best seen in networks composed of connected regions with a large number of interactions annotated with a subset of mechanisms, e.g., a regulatory region of transcription interactions, or a cleavage cascade region. When applied to real datasets, our method recovered novel and biologically meaningful putative interactions, e.g., interactions from an integrin signaling pathway using the angiogenesis dataset, and a group of regulatory microRNA interactions in an organogenesis network. PMID:23432934
Reconstruction of an integrated genome-scale co-expression network reveals key modules involved in lung adenocarcinoma.

PubMed

Bidkhori, Gholamreza; Narimani, Zahra; Hosseini Ashtiani, Saman; Moeini, Ali; Nowzari-Dalini, Abbas; Masoudi-Nejad, Ali

2013-01-01

Our goal of this study was to reconstruct a "genome-scale co-expression network" and find important modules in lung adenocarcinoma so that we could identify the genes involved in lung adenocarcinoma. We integrated gene mutation, GWAS, CGH, array-CGH and SNP array data in order to identify important genes and loci in genome-scale. Afterwards, on the basis of the identified genes a co-expression network was reconstructed from the co-expression data. The reconstructed network was named "genome-scale co-expression network". As the next step, 23 key modules were disclosed through clustering. In this study a number of genes have been identified for the first time to be implicated in lung adenocarcinoma by analyzing the modules. The genes EGFR, PIK3CA, TAF15, XIAP, VAPB, Appl1, Rab5a, ARF4, CLPTM1L, SP4, ZNF124, LPP, FOXP1, SOX18, MSX2, NFE2L2, SMARCC1, TRA2B, CBX3, PRPF6, ATP6V1C1, MYBBP1A, MACF1, GRM2, TBXA2R, PRKAR2A, PTK2, PGF and MYO10 are among the genes that belong to modules 1 and 22. All these genes, being implicated in at least one of the phenomena, namely cell survival, proliferation and metastasis, have an over-expression pattern similar to that of EGFR. In few modules, the genes such as CCNA2 (Cyclin A2), CCNB2 (Cyclin B2), CDK1, CDK5, CDC27, CDCA5, CDCA8, ASPM, BUB1, KIF15, KIF2C, NEK2, NUSAP1, PRC1, SMC4, SYCE2, TFDP1, CDC42 and ARHGEF9 are present that play a crucial role in cell cycle progression. In addition to the mentioned genes, there are some other genes (i.e. DLGAP5, BIRC5, PSMD2, Src, TTK, SENP2, PSMD2, DOK2, FUS and etc.) in the modules.
Co-expression networks reveal the tissue-specific regulation of transcription and splicing.

PubMed

Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D H; Jo, Brian; Gao, Chuan; McDowell, Ian C; Engelhardt, Barbara E; Battle, Alexis

2017-11-01

Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. © 2017 Saha et al.; Published by Cold Spring Harbor Laboratory Press.
Using genetic markers to orient the edges in quantitative trait networks: the NEO software.

PubMed

Aten, Jason E; Fuller, Tova F; Lusis, Aldons J; Horvath, Steve

2008-04-15

Systems genetic studies have been used to identify genetic loci that affect transcript abundances and clinical traits such as body weight. The pairwise correlations between gene expression traits and/or clinical traits can be used to define undirected trait networks. Several authors have argued that genetic markers (e.g expression quantitative trait loci, eQTLs) can serve as causal anchors for orienting the edges of a trait network. The availability of hundreds of thousands of genetic markers poses new challenges: how to relate (anchor) traits to multiple genetic markers, how to score the genetic evidence in favor of an edge orientation, and how to weigh the information from multiple markers. We develop and implement Network Edge Orienting (NEO) methods and software that address the challenges of inferring unconfounded and directed gene networks from microarray-derived gene expression data by integrating mRNA levels with genetic marker data and Structural Equation Model (SEM) comparisons. The NEO software implements several manual and automatic methods for incorporating genetic information to anchor traits. The networks are oriented by considering each edge separately, thus reducing error propagation. To summarize the genetic evidence in favor of a given edge orientation, we propose Local SEM-based Edge Orienting (LEO) scores that compare the fit of several competing causal graphs. SEM fitting indices allow the user to assess local and overall model fit. The NEO software allows the user to carry out a robustness analysis with regard to genetic marker selection. We demonstrate the utility of NEO by recovering known causal relationships in the sterol homeostasis pathway using liver gene expression data from an F2 mouse cross. Further, we use NEO to study the relationship between a disease gene and a biologically important gene co-expression module in liver tissue. The NEO software can be used to orient the edges of gene co-expression networks or quantitative trait networks if the edges can be anchored to genetic marker data. R software tutorials, data, and supplementary material can be downloaded from: http://www.genetics.ucla.edu/labs/horvath/aten/NEO.
Inference of gene regulatory networks from time series by Tsallis entropy

PubMed Central

2011-01-01

Background The inference of gene regulatory networks (GRNs) from large-scale expression profiles is one of the most challenging problems of Systems Biology nowadays. Many techniques and models have been proposed for this task. However, it is not generally possible to recover the original topology with great accuracy, mainly due to the short time series data in face of the high complexity of the networks and the intrinsic noise of the expression measurements. In order to improve the accuracy of GRNs inference methods based on entropy (mutual information), a new criterion function is here proposed. Results In this paper we introduce the use of generalized entropy proposed by Tsallis, for the inference of GRNs from time series expression profiles. The inference process is based on a feature selection approach and the conditional entropy is applied as criterion function. In order to assess the proposed methodology, the algorithm is applied to recover the network topology from temporal expressions generated by an artificial gene network (AGN) model as well as from the DREAM challenge. The adopted AGN is based on theoretical models of complex networks and its gene transference function is obtained from random drawing on the set of possible Boolean functions, thus creating its dynamics. On the other hand, DREAM time series data presents variation of network size and its topologies are based on real networks. The dynamics are generated by continuous differential equations with noise and perturbation. By adopting both data sources, it is possible to estimate the average quality of the inference with respect to different network topologies, transfer functions and network sizes. Conclusions A remarkable improvement of accuracy was observed in the experimental results by reducing the number of false connections in the inferred topology by the non-Shannon entropy. The obtained best free parameter of the Tsallis entropy was on average in the range 2.5 ≤ q ≤ 3.5 (hence, subextensive entropy), which opens new perspectives for GRNs inference methods based on information theory and for investigation of the nonextensivity of such networks. The inference algorithm and criterion function proposed here were implemented and included in the DimReduction software, which is freely available at http://sourceforge.net/projects/dimreduction and http://code.google.com/p/dimreduction/. PMID:21545720
A Systems Approach Identifies Networks and Genes Linking Sleep and Stress: Implications for Neuropsychiatric Disorders

PubMed Central

Jiang, Peng; Scarpa, Joseph R.; Fitzpatrick, Karrie; Losic, Bojan; Gao, Vance D.; Hao, Ke; Summa, Keith C.; Yang, He S.; Zhang, Bin; Allada, Ravi; Vitaterna, Martha H.; Turek, Fred W.; Kasarskis, Andrew

2016-01-01

SUMMARY Sleep dysfunction and stress susceptibility are co-morbid complex traits, which often precede and predispose patients to a variety of neuropsychiatric diseases. Here, we demonstrate multi-level organizations of genetic landscape, candidate genes, and molecular networks associated with 328 stress and sleep traits in a chronically stressed population of 338 (C57BL/6J×A/J) F2 mice. We constructed striatal gene co-expression networks, revealing functionally and cell-type specific gene co-regulations important for stress and sleep. Using a composite ranking system, we identified network modules most relevant for 15 independent phenotypic categories, highlighting a mitochondria/synaptic module that links sleep and stress. The key network regulators of this module are overrepresented with genes implicated in neuropsychiatric diseases. Our work suggests the interplay between sleep, stress, and neuropathology emerge from genetic influences on gene expression and their collective organization through complex molecular networks, providing a framework to interrogate the mechanisms underlying sleep, stress susceptibility, and related neuropsychiatric disorders. PMID:25921536
Analysis of Bos taurus and Sus scrofa X and Y chromosome transcriptome highlights reproductive driver genes

PubMed Central

Khan, Faheem Ahmed; Liu, Hui; Zhou, Hao; Wang, Kai; Qamar, Muhammad Tahir Ul; Pandupuspitasari, Nuruliarizki Shinta; Shujun, Zhang

2017-01-01

The biology of sperm, its capability of fertilizing an egg and its role in sex ratio are the major biological questions in reproductive biology. To answer these question we integrated X and Y chromosome transcriptome across different species: Bos taurus and Sus scrofa and identified reproductive driver genes based on Weighted Gene Co-Expression Network Analysis (WGCNA) algorithm. Our strategy resulted in 11007 and 10445 unique genes consisting of 9 and 11 reproductive modules in Bos taurus and Sus scrofa, respectively. The consensus module calculation yields an overall 167 overlapped genes which were mapped to 846 DEGs in Bos taurus to finally get a list of 67 dual feature genes. We develop gene co-expression network of selected 67 genes that consists of 58 nodes (27 down-regulated and 31 up-regulated genes) enriched to 66 GO biological process (BP) including 6 GO annotations related to reproduction and two KEGG pathways. Moreover, we searched significantly related TF (ISRE, AP1FJ, RP58, CREL) and miRNAs (bta-miR-181a, bta-miR-17-5p, bta-miR-146b, bta-miR-146a) which targeted the genes in co-expression network. In addition we performed genetic analysis including phylogenetic, functional domain identification, epigenetic modifications, mutation analysis of the most important reproductive driver genes PRM1, PPP2R2B and PAFAH1B1 and finally performed a protein docking analysis to visualize their therapeutic and gene expression regulation ability. PMID:28903352
A Dual-Intein Autoprocessing Domain that Directs Synchronized Protein Co-Expression in Both Prokaryotes and Eukaryotes

PubMed Central

Zhang, Bei; Rapolu, Madhusudhan; Liang, Zhibin; Han, Zhenlin; Williams, Philip G.; Su, Wei Wen

2015-01-01

Being able to coordinate co-expression of multiple proteins is necessary for a variety of important applications such as assembly of protein complexes, trait stacking, and metabolic engineering. Currently only few options are available for multiple recombinant protein co-expression, and most of them are not applicable to both prokaryotic and eukaryotic hosts. Here, we report a new polyprotein vector system that is based on a pair of self-excising mini-inteins fused in tandem, termed the dual-intein (DI) domain, to achieve synchronized co-expression of multiple proteins. The DI domain comprises an Ssp DnaE mini-intein N159A mutant and an Ssp DnaB mini-intein C1A mutant connected in tandem by a peptide linker to mediate efficient release of the flanking proteins via autocatalytic cleavage. Essentially complete release of constituent proteins, GFP and RFP (mCherry), from a polyprotein precursor, in bacterial, mammalian, and plant hosts was demonstrated. In addition, successful co-expression of GFP with chloramphenicol acetyltransferase, and thioredoxin with RFP, respectively, further substantiates the general applicability of the DI polyprotein system. Collectively, our results demonstrate the DI-based polyprotein technology as a highly valuable addition to the molecular toolbox for multi-protein co-expression which finds vast applications in biotechnology, biosciences, and biomedicine. PMID:25712612
Statistical identification of gene association by CID in application of constructing ER regulatory network

PubMed Central

Liu, Li-Yu D; Chen, Chien-Yu; Chen, Mei-Ju M; Tsai, Ming-Shian; Lee, Cho-Han S; Phang, Tzu L; Chang, Li-Yun; Kuo, Wen-Hung; Hwa, Hsiao-Lin; Lien, Huang-Chun; Jung, Shih-Ming; Lin, Yi-Shing; Chang, King-Jen; Hsieh, Fon-Jou

2009-01-01

Background A variety of high-throughput techniques are now available for constructing comprehensive gene regulatory networks in systems biology. In this study, we report a new statistical approach for facilitating in silico inference of regulatory network structure. The new measure of association, coefficient of intrinsic dependence (CID), is model-free and can be applied to both continuous and categorical distributions. When given two variables X and Y, CID answers whether Y is dependent on X by examining the conditional distribution of Y given X. In this paper, we apply CID to analyze the regulatory relationships between transcription factors (TFs) (X) and their downstream genes (Y) based on clinical data. More specifically, we use estrogen receptor α (ERα) as the variable X, and the analyses are based on 48 clinical breast cancer gene expression arrays (48A). Results The analytical utility of CID was evaluated in comparison with four commonly used statistical methods, Galton-Pearson's correlation coefficient (GPCC), Student's t-test (STT), coefficient of determination (CoD), and mutual information (MI). When being compared to GPCC, CoD, and MI, CID reveals its preferential ability to discover the regulatory association where distribution of the mRNA expression levels on X and Y does not fit linear models. On the other hand, when CID is used to measure the association of a continuous variable (Y) against a discrete variable (X), it shows similar performance as compared to STT, and appears to outperform CoD and MI. In addition, this study established a two-layer transcriptional regulatory network to exemplify the usage of CID, in combination with GPCC, in deciphering gene networks based on gene expression profiles from patient arrays. Conclusion CID is shown to provide useful information for identifying associations between genes and transcription factors of interest in patient arrays. When coupled with the relationships detected by GPCC, the association predicted by CID are applicable to the construction of transcriptional regulatory networks. This study shows how information from different data sources and learning algorithms can be integrated to investigate whether relevant regulatory mechanisms identified in cell models can also be partially re-identified in clinical samples of breast cancers. Availability the implementation of CID in R codes can be freely downloaded from . PMID:19292896
Co-LncRNA: investigating the lncRNA combinatorial effects in GO annotations and KEGG pathways based on human RNA-Seq data.

PubMed

Zhao, Zheng; Bai, Jing; Wu, Aiwei; Wang, Yuan; Zhang, Jinwen; Wang, Zishan; Li, Yongsheng; Xu, Juan; Li, Xia

2015-01-01

Long non-coding RNAs (lncRNAs) are emerging as key regulators of diverse biological processes and diseases. However, the combinatorial effects of these molecules in a specific biological function are poorly understood. Identifying co-expressed protein-coding genes of lncRNAs would provide ample insight into lncRNA functions. To facilitate such an effort, we have developed Co-LncRNA, which is a web-based computational tool that allows users to identify GO annotations and KEGG pathways that may be affected by co-expressed protein-coding genes of a single or multiple lncRNAs. LncRNA co-expressed protein-coding genes were first identified in publicly available human RNA-Seq datasets, including 241 datasets across 6560 total individuals representing 28 tissue types/cell lines. Then, the lncRNA combinatorial effects in a given GO annotations or KEGG pathways are taken into account by the simultaneous analysis of multiple lncRNAs in user-selected individual or multiple datasets, which is realized by enrichment analysis. In addition, this software provides a graphical overview of pathways that are modulated by lncRNAs, as well as a specific tool to display the relevant networks between lncRNAs and their co-expressed protein-coding genes. Co-LncRNA also supports users in uploading their own lncRNA and protein-coding gene expression profiles to investigate the lncRNA combinatorial effects. It will be continuously updated with more human RNA-Seq datasets on an annual basis. Taken together, Co-LncRNA provides a web-based application for investigating lncRNA combinatorial effects, which could shed light on their biological roles and could be a valuable resource for this community. Database URL: http://www.bio-bigdata.com/Co-LncRNA/. © The Author(s) 2015. Published by Oxford University Press.
Identification of aberrantly expressed long non-coding RNAs in stomach adenocarcinoma.

PubMed

Gu, Jianbin; Li, Yong; Fan, Liqiao; Zhao, Qun; Tan, Bibo; Hua, Kelei; Wu, Guobin

2017-07-25

Stomach adenocarcinoma (STAD) is a common malignancy worldwide. This study aimed to identify the aberrantly expressed long non-coding RNAs (lncRNAs) in STAD. Total of 74 DElncRNAs and 449 DEmRNAs were identified in STAD compared with paired non-tumor tissues. The DElncRNA/DEmRNA co-expression network was constructed, which covered 519 nodes and 2993 edges. The qRT-PCR validation results of DElncRNAs were consistent with our bioinformatics analysis based on RNA-sequencing. The DEmRNAs co-expressed with DElncRNAs were significantly enriched in gastric acid secretion, complement and coagulation cascades, pancreatic secretion, cytokine-cytokine receptor interaction and Jak-STAT signaling pathway. The expression levels of the nine candidate DElncRNAs in TCGA database were compatible with our RNA-sequencing. FEZF1-AS1, HOTAIR and LINC01234 had the potential diagnosis value for STAD. The lncRNA and mRNA expression profile of 3 STAD tissues and 3 matched adjacent non-tumor tissues was obtained through high-throughput RNA-sequencing. Differentially expressed lncRNAs/mRNAs (DElncRNAs/DEmRNAs) were identified in STAD. DElncRNA/DEmRNA co-expression network construction, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were conducted to predict the biological functions of DElncRNAs. Quantitative real-time polymerase chain reaction (qRT-PCR) was subjected to validate the expression levels of DEmRNAs and DElncRNAs. Moreover, the expression of DElncRNAs was validated through The Cancer Genome Atlas (TCGA) database. The diagnosis value of candidate DElncRNAs was accessed by receiver operating characteristic (ROC) analysis. Our work might provide useful information for exploring the tumorigenesis mechanism of STAD and pave the road for identification of diagnostic biomarkers in STAD.
Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model

PubMed Central

Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O.; Eijlander, Robyn T.; Kuipers, Oscar P.

2018-01-01

Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes. PMID:29424683
Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model.

PubMed

Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O; Eijlander, Robyn T; Kuipers, Oscar P

2018-02-09

Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes.
Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

PubMed

Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

2018-05-09

Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.
Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice.

PubMed

Smita, Shuchi; Katiyar, Amit; Chinnusamy, Viswanathan; Pandey, Dev M; Bansal, Kailash C

2015-01-01

MYB transcription factor (TF) is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by "top-down" and "guide-gene" approaches. More than 50% of OsMYBs were strongly correlated under 50 experimental conditions with 51 hub genes via "top-down" approach. Further, clusters were identified using Markov Clustering (MCL). To maximize the clustering performance, parameter evaluation of the MCL inflation score (I) was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by "guide-gene" approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. Specially, enrichment of MYB binding regions involved in drought-inducibility implying their regulatory role in drought response in rice. Thus, the co-regulatory network analysis facilitated the identification of complex OsMYB regulatory networks, and candidate target regulon genes of selected guide MYB genes. The results contribute to the candidate gene screening, and experimentally testable hypotheses for potential regulatory MYB TFs, and their targets under stress conditions.
Interplay of Noisy Gene Expression and Dynamics Explains Patterns of Bacterial Operon Organization

NASA Astrophysics Data System (ADS)

Igoshin, Oleg

2011-03-01

Bacterial chromosomes are organized into operons -- sets of genes co-transcribed into polycistronic messenger RNA. Hypotheses explaining the emergence and maintenance of operons include proportional co-regulation, horizontal transfer of intact ``selfish'' operons, emergence via gene duplication, and co-production of physically interacting proteins to speed their association. We hypothesized an alternative: operons can reduce or increase intrinsic gene expression noise in a manner dependent on the post-translational interactions, thereby resulting in selection for or against operons in depending on the network architecture. We devised five classes of two-gene network modules and show that the effects of operons on intrinsic noise depend on class membership. Two classes exhibit decreased noise with co-transcription, two others reveal increased noise, and the remaining one does not show a significant difference. To test our modeling predictions we employed bioinformatic analysis to determine the relationship gene expression noise and operon organization. The results confirm the overrepresentation of noise-minimizing operon architectures and provide evidence against other hypotheses. Our results thereby suggest a central role for gene expression noise in selecting for or maintaining operons in bacterial chromosomes. This demonstrates how post-translational network dynamics may provide selective pressure for organizing bacterial chromosomes, and has practical consequences for designing synthetic gene networks. This work is supported by National Institutes of Health grant 1R01GM096189-01.
Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks.

PubMed

Muetze, Tanja; Goenawan, Ivan H; Wiencko, Heather L; Bernal-Llinares, Manuel; Bryan, Kenneth; Lynn, David J

2016-01-01

Highly connected nodes (hubs) in biological networks are topologically important to the structure of the network and have also been shown to be preferentially associated with a range of phenotypes of interest. The relative importance of a hub node, however, can change depending on the biological context. Here, we report a Cytoscape app, the Contextual Hub Analysis Tool (CHAT), which enables users to easily construct and visualize a network of interactions from a gene or protein list of interest, integrate contextual information, such as gene expression or mass spectrometry data, and identify hub nodes that are more highly connected to contextual nodes (e.g. genes or proteins that are differentially expressed) than expected by chance. In a case study, we use CHAT to construct a network of genes that are differentially expressed in Dengue fever, a viral infection. CHAT was used to identify and compare contextual and degree-based hubs in this network. The top 20 degree-based hubs were enriched in pathways related to the cell cycle and cancer, which is likely due to the fact that proteins involved in these processes tend to be highly connected in general. In comparison, the top 20 contextual hubs were enriched in pathways commonly observed in a viral infection including pathways related to the immune response to viral infection. This analysis shows that such contextual hubs are considerably more biologically relevant than degree-based hubs and that analyses which rely on the identification of hubs solely based on their connectivity may be biased towards nodes that are highly connected in general rather than in the specific context of interest. CHAT is available for Cytoscape 3.0+ and can be installed via the Cytoscape App Store ( http://apps.cytoscape.org/apps/chat).
Gene networks specific for innate immunity define post-traumatic stress disorder.

PubMed

Breen, M S; Maihofer, A X; Glatt, S J; Tylee, D S; Chandler, S D; Tsuang, M T; Risbrough, V B; Baker, D G; O'Connor, D T; Nievergelt, C M; Woelk, C H

2015-12-01

The molecular factors involved in the development of Post-Traumatic Stress Disorder (PTSD) remain poorly understood. Previous transcriptomic studies investigating the mechanisms of PTSD apply targeted approaches to identify individual genes under a cross-sectional framework lack a holistic view of the behaviours and properties of these genes at the system-level. Here we sought to apply an unsupervised gene-network based approach to a prospective experimental design using whole-transcriptome RNA-Seq gene expression from peripheral blood leukocytes of U.S. Marines (N=188), obtained both pre- and post-deployment to conflict zones. We identified discrete groups of co-regulated genes (i.e., co-expression modules) and tested them for association to PTSD. We identified one module at both pre- and post-deployment containing putative causal signatures for PTSD development displaying an over-expression of genes enriched for functions of innate-immune response and interferon signalling (Type-I and Type-II). Importantly, these results were replicated in a second non-overlapping independent dataset of U.S. Marines (N=96), further outlining the role of innate immune and interferon signalling genes within co-expression modules to explain at least part of the causal pathophysiology for PTSD development. A second module, consequential of trauma exposure, contained PTSD resiliency signatures and an over-expression of genes involved in hemostasis and wound responsiveness suggesting that chronic levels of stress impair proper wound healing during/after exposure to the battlefield while highlighting the role of the hemostatic system as a clinical indicator of chronic-based stress. These findings provide novel insights for early preventative measures and advanced PTSD detection, which may lead to interventions that delay or perhaps abrogate the development of PTSD.
Detecting complexes from edge-weighted PPI networks via genes expression analysis.

PubMed

Zhang, Zehua; Song, Jian; Tang, Jijun; Xu, Xinying; Guo, Fei

2018-04-24

Identifying complexes from PPI networks has become a key problem to elucidate protein functions and identify signal and biological processes in a cell. Proteins binding as complexes are important roles of life activity. Accurate determination of complexes in PPI networks is crucial for understanding principles of cellular organization. We propose a novel method to identify complexes on PPI networks, based on different co-expression information. First, we use Markov Cluster Algorithm with an edge-weighting scheme to calculate complexes on PPI networks. Then, we propose some significant features, such as graph information and gene expression analysis, to filter and modify complexes predicted by Markov Cluster Algorithm. To evaluate our method, we test on two experimental yeast PPI networks. On DIP network, our method has Precision and F-Measure values of 0.6004 and 0.5528. On MIPS network, our method has F-Measure and S n values of 0.3774 and 0.3453. Comparing to existing methods, our method improves Precision value by at least 0.1752, F-Measure value by at least 0.0448, S n value by at least 0.0771. Experiments show that our method achieves better results than some state-of-the-art methods for identifying complexes on PPI networks, with the prediction quality improved in terms of evaluation criteria.

Genome-Wide Analyses of Calcium Sensors Reveal Their Involvement in Drought Stress Response and Storage Roots Deterioration after Harvest in Cassava.

PubMed

Hu, Wei; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wu, Chunlai; Ding, Xupo; Wang, Wenquan; Xia, Zhiqiang; Guo, Jianchun; Peng, Ming

2018-04-19

Calcium (Ca 2+ ) plays a crucial role in plant development and responses to environmental stimuli. Currently, calmodulins (CaMs), calmodulin-like proteins (CMLs), and calcineurin B-like proteins (CBLs), such as Ca 2+ sensors, are not well understood in cassava ( Manihot esculenta Crantz), an important tropical crop. In the present study, 8 CaMs, 48 CMLs, and 9 CBLs were genome-wide identified in cassava, which were divided into two, four, and four groups, respectively, based on evolutionary relationship, protein motif, and gene structure analyses. Transcriptomic analysis revealed the expression diversity of cassava CaMs-CMLs-CBLs in distinct tissues and in response to drought stress in different genotypes. Generally, cassava CaMs-CMLs-CBLs showed different expression profiles between cultivated varieties (Arg7 and SC124) and wild ancestor (W14) after drought treatment. In addition, numerous CaMs-CMLs-CBLs were significantly upregulated at 6 h, 12 h, and 48 h after harvest, suggesting their possible role during storage roots (SR) deterioration. Further interaction network and co-expression analyses suggested that a CBL-mediated interaction network was widely involved in SR deterioration. Taken together, this study provides new insights into CaMs-CMLs-CBLs-mediated drought adaption and SR deterioration at the transcription level in cassava, and identifies some candidates for the genetic improvement of cassava.
Co-expression network analysis identified six hub genes in association with metastasis risk and prognosis in hepatocellular carcinoma

PubMed Central

Feng, Juerong; Zhou, Rui; Chang, Ying; Liu, Jing; Zhao, Qiu

2017-01-01

Hepatocellular carcinoma (HCC) has a high incidence and mortality worldwide, and its carcinogenesis and progression are influenced by a complex network of gene interactions. A weighted gene co-expression network was constructed to identify gene modules associated with the clinical traits in HCC (n = 214). Among the 13 modules, high correlation was only found between the red module and metastasis risk (classified by the HCC metastasis gene signature) (R2 = −0.74). Moreover, in the red module, 34 network hub genes for metastasis risk were identified, six of which (ABAT, AGXT, ALDH6A1, CYP4A11, DAO and EHHADH) were also hub nodes in the protein-protein interaction network of the module genes. Thus, a total of six hub genes were identified. In validation, all hub genes showed a negative correlation with the four-stage HCC progression (P for trend < 0.05) in the test set. Furthermore, in the training set, HCC samples with any hub gene lowly expressed demonstrated a higher recurrence rate and poorer survival rate (hazard ratios with 95% confidence intervals > 1). RNA-sequencing data of 142 HCC samples showed consistent results in the prognosis. Gene set enrichment analysis (GSEA) demonstrated that in the samples with any hub gene highly expressed, a total of 24 functional gene sets were enriched, most of which focused on amino acid metabolism and oxidation. In conclusion, co-expression network analysis identified six hub genes in association with HCC metastasis risk and prognosis, which might improve the prognosis by influencing amino acid metabolism and oxidation. PMID:28430663
Potential Regulators Driving the Transition in Nonalcoholic Fatty Liver Disease: a Stage-Based View.

PubMed

Lou, Yi; Chen, Yi-Dan; Sun, Fu-Rong; Shi, Jun-Ping; Song, Yu; Yang, Jin

2017-01-01

The incidence of nonalcoholic fatty liver disease (NAFLD), ranging from mild steatosis to hepatocellular injury and inflammation, increases with the rise of obesity. However, the implications of transcription factors network in progressive NAFLD remain to be determined. A co-regulatory network approach by combining gene expression and transcription influence was utilized to dissect transcriptional regulators in different NAFLD stages. In vivo, mice models of NAFLD were used to investigate whether dysregulated expression be undertaken by transcriptional regulators. Through constructing a large-scale co-regulatory network, sample-specific regulator activity was estimated. The combinations of active regulators that drive the progression of NAFLD were identified. Next, top regulators in each stage of NAFLD were determined, and the results were validated using the different experiments and bariatric surgical samples. In particular, Adipocyte enhancer-binding protein 1 (AEBP1) showed increased transcription activity in nonalcoholic steatohepatitis (NASH). Further characterization of the AEBP1 related transcription program defined its co-regulators, targeted genes, and functional organization. The dynamics of AEBP1 and its potential targets were verified in an animal model of NAFLD. This study identifies putative functions for several transcription factors in the pathogenesis of NAFLD and may thus point to potential targets for therapeutic interventions. © 2017 The Author(s) Published by S. Karger AG, Basel.
Progress in Open-World, Integrative, Collaborative Science Data Platforms (Invited)

NASA Astrophysics Data System (ADS)

Fox, P. A.

2013-12-01

As collaborative, or network science spreads into more Earth and space science fields, both the participants and their funders have expressed a very strong desire for highly functional data and information capabilities that are a) easy to use, b) integrated in a variety of ways, c) leverage prior investments and keep pace with rapid technical change, and d) are not expensive or time-consuming to build or maintain. In response, and based on our accumulated experience over the last decade and a maturing of several key technical approaches, we have adapted, extended, and integrated several open source applications and frameworks that handle major portions of functionality for these platforms. At minimum, these functions include: an object-type repository, collaboration tools, an ability to identify and manage all key entities in the platform, and an integrated portal to manage diverse content and applications, with varied access levels and privacy options. At a conceptual level, science networks (even small ones) deal with people, and many intellectual artifacts produced or consumed in research, organizational and/our outreach activities, as well as the relations among them. Increasingly these networks are modeled as knowledge networks, i.e. graphs with named and typed relations among the 'nodes'. Nodes can be people, organizations, datasets, events, presentations, publications, videos, meetings, reports, groups, and more. In this heterogeneous ecosystem, it is also important to use a set of common informatics approaches to co-design and co-evolve the needed science data platforms based on what real people want to use them for. In this contribution, we present our methods and results for information modeling, adapting, integrating and evolving a networked data science and information architecture based on several open source technologies (Drupal, VIVO, the Comprehensive Knowledge Archive Network; CKAN, and the Global Handle System; GHS). In particular we present both the instantiation of this data platform for the Deep Carbon Observatory, including key functional and non-functional attributes, how the smart mediation among the components is modeled and managed, and discuss its general applicability.
Inhibitory effect of Lactobacillus salivarius on Streptococcus mutans biofilm formation.

PubMed

Wu, C-C; Lin, C-T; Wu, C-Y; Peng, W-S; Lee, M-J; Tsai, Y-C

2015-02-01

Dental caries arises from an imbalance of metabolic activities in dental biofilms developed primarily by Streptococcus mutans. This study was conducted to isolate potential oral probiotics with antagonistic activities against S. mutans biofilm formation from Lactobacillus salivarius, frequently found in human saliva. We analysed 64 L. salivarius strains and found that two, K35 and K43, significantly inhibited S. mutans biofilm formation with inhibitory activities more pronounced than those of Lactobacillus rhamnosus GG (LGG), a prototypical probiotic that shows anti-caries activity. Scanning electron microscopy showed that co-culture of S. mutans with K35 or K43 resulted in significantly reduced amounts of attached bacteria and network-like structures, typically comprising exopolysaccharides. Spot assay for S. mutans indicated that K35 and K43 strains possessed a stronger bactericidal activity against S. mutans than LGG. Moreover, quantitative real-time polymerase chain reaction showed that the expression of genes encoding glucosyltransferases, gtfB, gtfC, and gtfD was reduced when S. mutans were co-cultured with K35 or K43. However, LGG activated the expression of gtfB and gtfC, but did not influence the expression of gtfD in the co-culture. A transwell-based biofilm assay indicated that these lactobacilli inhibited S. mutans biofilm formation in a contact-independent manner. In conclusion, we identified two L. salivarius strains with inhibitory activities on the growth and expression of S. mutans virulence genes to reduce its biofilm formation. This is not a general characteristic of the species, so presents a potential strategy for in vivo alteration of plaque biofilm and caries. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Protein Network Signatures Associated with Exogenous Biofuels Treatments in Cyanobacterium Synechocystis sp. PCC 6803

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pei, Guangsheng; Chen, Lei; Wang, Jiangxin

2014-11-03

Although recognized as a promising microbial cell factory for producing biofuels, current productivity in cyanobacterial systems is low. To make the processes economically feasible, one of the hurdles, which need to be overcome is the low tolerance of hosts to toxic biofuels. Meanwhile, little information is available regarding the cellular responses to biofuels stress in cyanobacteria, which makes it challenging for tolerance engineering. Using large proteomic datasets of Synechocystis under various biofuels stress and environmental perturbation, a protein co-expression network was first constructed and then combined with the experimentally determined protein–protein interaction network. Proteins with statistically higher topological overlap inmore » the integrated network were identified as common responsive proteins to both biofuels stress and environmental perturbations. In addition, a weighted gene co-expression network analysis was performed to distinguish unique responses to biofuels from those to environmental perturbations and to uncover metabolic modules and proteins uniquely associated with biofuels stress. The results showed that biofuel-specific proteins and modules were enriched in several functional categories, including photosynthesis, carbon fixation, and amino acid metabolism, which may represent potential key signatures for biofuels stress responses in Synechocystis. Network-based analysis allowed determination of the responses specifically related to biofuels stress, and the results constituted an important knowledge foundation for tolerance engineering against biofuels in Synechocystis.« less
Autism and psychosis expressions diametrically modulate the right temporoparietal junction.

PubMed

Abu-Akel, Ahmad M; Apperly, Ian A; Wood, Stephen J; Hansen, Peter C

2017-10-01

The mentalizing network is atypically activated in autism and schizophrenia spectrum disorders. While these disorders are considered diagnostically independent, expressions of both can co-occur in the same individual. We examined the concurrent effect of autism traits and psychosis proneness on the activity of the mentalizing network in 24 neurotypical adults while performing a social competitive game. Activations were observed in the paracingulate cortex and the right temporoparietal junction (rTPJ). Autism traits and psychosis proneness did not modulate activity within the paracingulate or the dorsal component of the rTPJ. However, diametric modulations of autism traits and psychosis proneness were observed in the posterior (rvpTPJ) and anterior (rvaTPJ) subdivisions of the ventral rTPJ, which respectively constitute core regions within the mentalizing and attention-reorienting networks. Within the rvpTPJ, increasing autism tendencies decreased activity, and increasing psychosis proneness increased activity. This effect was reversed within the rvaTPJ. We suggest that this results from an interaction between regions responsible for higher level social cognitive processing (rvpTPJ) and regions responsible for domain-general attentional processes (rvaTPJ). The observed diametric modulation of autism tendencies and psychosis proneness of neuronal activity within the mentalizing network highlights the importance of assessing both autism and psychosis expressions within the individual.
Development of the Verona coding definitions of emotional sequences to code health providers' responses (VR-CoDES-P) to patient cues and concerns.

PubMed

Del Piccolo, Lidia; de Haes, Hanneke; Heaven, Cathy; Jansen, Jesse; Verheul, William; Bensing, Jozien; Bergvik, Svein; Deveugele, Myriam; Eide, Hilde; Fletcher, Ian; Goss, Claudia; Humphris, Gerry; Kim, Young-Mi; Langewitz, Wolf; Mazzi, Maria Angela; Mjaaland, Trond; Moretti, Francesca; Nübling, Matthias; Rimondini, Michela; Salmon, Peter; Sibbern, Tonje; Skre, Ingunn; van Dulmen, Sandra; Wissow, Larry; Young, Bridget; Zandbelt, Linda; Zimmermann, Christa; Finset, Arnstein

2011-02-01

To present a method to classify health provider responses to patient cues and concerns according to the VR-CoDES-CC (Del Piccolo et al. (2009) [2] and Zimmermann et al. (submitted for publication) [3]). The system permits sequence analysis and a detailed description of how providers handle patient's expressions of emotion. The Verona-CoDES-P system has been developed based on consensus views within the "Verona Network of Sequence Analysis". The different phases of the creation process are described in detail. A reliability study has been conducted on 20 interviews from a convenience sample of 104 psychiatric consultations. The VR-CoDES-P has two main classes of provider responses, corresponding to the degree of explicitness (yes/no) and space (yes/no) that is given by the health provider to each cue/concern expressed by the patient. The system can be further subdivided into 17 individual categories. Statistical analyses showed that the VR-CoDES-P is reliable (agreement 92.86%, Cohen's kappa 0.90 (±0.04) p<0.0001). Once validity and reliability are tested in different settings, the system should be applied to investigate the relationship between provider responses to patients' expression of emotions and outcome variables. Research employing the VR-CoDES-P should be applied to develop research-based approaches to maximize appropriate responses to patients' indirect and overt expressions of emotional needs. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
A model for the multiplex dynamics of two-mode and one-mode networks, with an application to employment preference, friendship, and advice

PubMed Central

Snijders, Tom A.B.; Lomi, Alessandro; Torló, Vanina Jasmine

2012-01-01

We propose a new stochastic actor-oriented model for the co-evolution of two-mode and one-mode networks. The model posits that activities of a set of actors, represented in the two-mode network, co-evolve with exchanges and interactions between the actors, as represented in the one-mode network. The model assumes that the actors, not the activities, have agency. The empirical value of the model is demonstrated by examining how employment preferences co-evolve with friendship and advice relations in a group of seventy-five MBA students. The analysis shows that activity in the two-mode network, as expressed by number of employment preferences, is related to activity in the friendship network, as expressed by outdegrees. Further, advice ties between students lead to agreement with respect to employment preferences. In addition, considering the multiplexity of advice and friendship ties yields a better understanding of the dynamics of the advice relation: tendencies to reciprocation and homophily in advice relations are mediated to an important extent by friendship relations. The discussion pays attention to the implications of this study in the broader context of current efforts to model the co-evolutionary dynamics of social networks and individual behavior. PMID:23690653
Neuroendocrine and immune network re-modeling in chronic fatigue syndrome: an exploratory analysis.

PubMed

Fuite, Jim; Vernon, Suzanne D; Broderick, Gordon

2008-12-01

This work investigates the significance of changes in association patterns linking indicators of neuroendocrine and immune activity in patients with chronic fatigue syndrome (CFS). Gene sets preferentially expressed in specific immune cell isolates were integrated with neuroendocrine data from a large population-based study. Co-expression patterns linking immune cell activity with hypothalamic-pituitary-adrenal (HPA), thyroidal (HPT) and gonadal (HPG) axis status were computed using mutual information criteria. Networks in control and CFS subjects were compared globally in terms of a weighted graph edit distance. Local re-modeling of node connectivity was quantified by node degree and eigenvector centrality measures. Results indicate statistically significant differences between CFS and control networks determined mainly by re-modeling around pituitary and thyroid nodes as well as an emergent immune sub-network. Findings align with known mechanisms of chronic inflammation and support possible immune-mediated loss of thyroid function in CFS exacerbated by blunted HPA axis responsiveness.
Graphical modeling of gene expression in monocytes suggests molecular mechanisms explaining increased atherosclerosis in smokers.

PubMed

Verdugo, Ricardo A; Zeller, Tanja; Rotival, Maxime; Wild, Philipp S; Münzel, Thomas; Lackner, Karl J; Weidmann, Henri; Ninio, Ewa; Trégouët, David-Alexandre; Cambien, François; Blankenberg, Stefan; Tiret, Laurence

2013-01-01

Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path "smoking→gene expression→plaques". Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the "smoking→gene expression→plaques" causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts. Inspection of correlation structure revealed candidates that would be missed by expression-phenotype association analysis alone.
Graphical Modeling of Gene Expression in Monocytes Suggests Molecular Mechanisms Explaining Increased Atherosclerosis in Smokers

PubMed Central

Verdugo, Ricardo A.; Zeller, Tanja; Rotival, Maxime; Wild, Philipp S.; Münzel, Thomas; Lackner, Karl J.; Weidmann, Henri; Ninio, Ewa; Trégouët, David-Alexandre; Cambien, François; Blankenberg, Stefan; Tiret, Laurence

2013-01-01

Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path “smoking→gene expression→plaques”. Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the “smoking→gene expression→plaques” causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts. Inspection of correlation structure revealed candidates that would be missed by expression-phenotype association analysis alone. PMID:23372645
Complex Network Analysis of CA3 Transcriptome Reveals Pathogenic and Compensatory Pathways in Refractory Temporal Lobe Epilepsy

PubMed Central

Bando, Silvia Yumi; Silva, Filipi Nascimento; Costa, Luciano da Fontoura; Silva, Alexandre V.; Pimentel-Silva, Luciana R.; Castro, Luiz HM.; Wen, Hung-Tzu; Amaro, Edson; Moreira-Filho, Carlos Alberto

2013-01-01

We previously described – studying transcriptional signatures of hippocampal CA3 explants – that febrile (FS) and afebrile (NFS) forms of refractory mesial temporal lobe epilepsy constitute two distinct genomic phenotypes. That network analysis was based on a limited number (hundreds) of differentially expressed genes (DE networks) among a large set of valid transcripts (close to two tens of thousands). Here we developed a methodology for complex network visualization (3D) and analysis that allows the categorization of network nodes according to distinct hierarchical levels of gene-gene connections (node degree) and of interconnection between node neighbors (concentric node degree). Hubs are highly connected nodes, VIPs have low node degree but connect only with hubs, and high-hubs have VIP status and high overall number of connections. Studying the whole set of CA3 valid transcripts we: i) obtained complete transcriptional networks (CO) for FS and NFS phenotypic groups; ii) examined how CO and DE networks are related; iii) characterized genomic and molecular mechanisms underlying FS and NFS phenotypes, identifying potential novel targets for therapeutic interventions. We found that: i) DE hubs and VIPs are evenly distributed inside the CO networks; ii) most DE hubs and VIPs are related to synaptic transmission and neuronal excitability whereas most CO hubs, VIPs and high hubs are related to neuronal differentiation, homeostasis and neuroprotection, indicating compensatory mechanisms. Complex network visualization and analysis is a useful tool for systems biology approaches to multifactorial diseases. Network centrality observed for hubs, VIPs and high hubs of CO networks, is consistent with the network disease model, where a group of nodes whose perturbation leads to a disease phenotype occupies a central position in the network. Conceivably, the chance for exerting therapeutic effects through the modulation of particular genes will be higher if these genes are highly interconnected in transcriptional networks. PMID:24278214
Co-option of the polarity gene network shapes filament morphology in angiosperms

PubMed Central

de Almeida, Ana Maria Rocha; Yockteng, Roxana; Schnable, James; Alvarez-Buylla, Elena R.; Freeling, Michael; Specht, Chelsea D.

2014-01-01

The molecular genetic mechanisms underlying abaxial-adaxial polarity in plants have been studied as a property of lateral and flattened organs, such as leaves. In leaves, laminar expansion occurs as a result of balanced abaxial-adaxial gene expression. Over- or under- expression of either abaxializing or adaxializing genes inhibits laminar growth, resulting in a mutant radialized phenotype. Here, we show that co-option of the abaxial-adaxial polarity gene network plays a role in the evolution of stamen filament morphology in angiosperms. RNA-Seq data from species bearing laminar (flattened) or radial (cylindrical) filaments demonstrates that species with laminar filaments exhibit balanced expression of abaxial-adaxial (ab-ad) genes, while overexpression of a YABBY gene is found in species with radial filaments. This result suggests that unbalanced expression of ab-ad genes results in inhibition of laminar outgrowth, leading to a radially symmetric structure as found in many angiosperm filaments. We anticipate that co-option of the polarity gene network is a fundamental mechanism shaping many aspects of plant morphology during angiosperm evolution. PMID:25168962
Co-option of the polarity gene network shapes filament morphology in angiosperms.

PubMed

de Almeida, Ana Maria Rocha; Yockteng, Roxana; Schnable, James; Alvarez-Buylla, Elena R; Freeling, Michael; Specht, Chelsea D

2014-08-29

The molecular genetic mechanisms underlying abaxial-adaxial polarity in plants have been studied as a property of lateral and flattened organs, such as leaves. In leaves, laminar expansion occurs as a result of balanced abaxial-adaxial gene expression. Over- or under- expression of either abaxializing or adaxializing genes inhibits laminar growth, resulting in a mutant radialized phenotype. Here, we show that co-option of the abaxial-adaxial polarity gene network plays a role in the evolution of stamen filament morphology in angiosperms. RNA-Seq data from species bearing laminar (flattened) or radial (cylindrical) filaments demonstrates that species with laminar filaments exhibit balanced expression of abaxial-adaxial (ab-ad) genes, while overexpression of a YABBY gene is found in species with radial filaments. This result suggests that unbalanced expression of ab-ad genes results in inhibition of laminar outgrowth, leading to a radially symmetric structure as found in many angiosperm filaments. We anticipate that co-option of the polarity gene network is a fundamental mechanism shaping many aspects of plant morphology during angiosperm evolution.
Regulatory network rewiring for secondary metabolism in Arabidopsis thaliana under various conditions

PubMed Central

2014-01-01

Background Plant secondary metabolites are critical to various biological processes. However, the regulations of these metabolites are complex because of regulatory rewiring or crosstalk. To unveil how regulatory behaviors on secondary metabolism reshape biological processes, we constructed and analyzed a dynamic regulatory network of secondary metabolic pathways in Arabidopsis. Results The dynamic regulatory network was constructed through integrating co-expressed gene pairs and regulatory interactions. Regulatory interactions were either predicted by conserved transcription factor binding sites (TFBSs) or proved by experiments. We found that integrating two data (co-expression and predicted regulatory interactions) enhanced the number of highly confident regulatory interactions by over 10% compared with using single data. The dynamic changes of regulatory network systematically manifested regulatory rewiring to explain the mechanism of regulation, such as in terpenoids metabolism, the regulatory crosstalk of RAV1 (AT1G13260) and ATHB1 (AT3G01470) on HMG1 (hydroxymethylglutaryl-CoA reductase, AT1G76490); and regulation of RAV1 on epoxysqualene biosynthesis and sterol biosynthesis. Besides, we investigated regulatory rewiring with expression, network topology and upstream signaling pathways. Regulatory rewiring was revealed by the variability of genes’ expression: pathway genes and transcription factors (TFs) were significantly differentially expressed under different conditions (such as terpenoids biosynthetic genes in tissue experiments and E2F/DP family members in genotype experiments). Both network topology and signaling pathways supported regulatory rewiring. For example, we discovered correlation among the numbers of pathway genes, TFs and network topology: one-gene pathways (such as δ-carotene biosynthesis) were regulated by a fewer TFs, and were not critical to metabolic network because of their low degrees in topology. Upstream signaling pathways of 50 TFs were identified to comprehend the underlying mechanism of TFs’ regulatory rewiring. Conclusion Overall, this dynamic regulatory network largely improves the understanding of perplexed regulatory rewiring in secondary metabolism in Arabidopsis. PMID:24993737
svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification.

PubMed

Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Zhu, Dongxiao; Zhang, Kun

2010-06-22

Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biological themes. Grouping genes with a similar expression pattern or exhibiting co-expression together is a starting point in understanding and analyzing gene expression data. In recent literature, gene module level analysis is advocated in order to understand biological network design and system behaviors in disease and life processes; however, practical difficulties often lie in the implementation of existing methods. Using the singular value decomposition (SVD) technique, we developed a new computational tool, named svdPPCS (SVD-based Pattern Pairing and Chart Splitting), to identify conserved and divergent co-expression modules of two sets of microarray experiments. In the proposed methods, gene modules are identified by splitting the two-way chart coordinated with a pair of left singular vectors factorized from the gene expression matrices of the two biological categories. Importantly, the cutoffs are determined by a data-driven algorithm using the well-defined statistic, SVD-p. The implementation was illustrated on two time series microarray data sets generated from the samples of accessory gland (ACG) and malpighian tubule (MT) tissues of the line W118 of M. drosophila. Two conserved modules and six divergent modules, each of which has a unique characteristic profile across tissue kinds and aging processes, were identified. The number of genes contained in these models ranged from five to a few hundred. Three to over a hundred GO terms were over-represented in individual modules with FDR < 0.1. One divergent module suggested the tissue-specific relationship between the expressions of mitochondrion-related genes and the aging process. This finding, together with others, may be of biological significance. The validity of the proposed SVD-based method was further verified by a simulation study, as well as the comparisons with regression analysis and cubic spline regression analysis plus PAM based clustering. svdPPCS is a novel computational tool for the comparative analysis of transcriptional profiling. It especially fits the comparison of time series data of related organisms or different tissues of the same organism under equivalent or similar experimental conditions. The general scheme can be directly extended to the comparisons of multiple data sets. It also can be applied to the integration of data sets from different platforms and of different sources.
Integrating Genetic and Gene Co-expression Analysis Identifies Gene Networks Involved in Alcohol and Stress Responses

PubMed Central

Luo, Jie; Xu, Pei; Cao, Peijian; Wan, Hongjian; Lv, Xiaonan; Xu, Shengchun; Wang, Gangjun; Cook, Melloni N.; Jones, Byron C.; Lu, Lu; Wang, Xusheng

2018-01-01

Although the link between stress and alcohol is well recognized, the underlying mechanisms of how they interplay at the molecular level remain unclear. The purpose of this study is to identify molecular networks underlying the effects of alcohol and stress responses, as well as their interaction on anxiety behaviors in the hippocampus of mice using a systems genetics approach. Here, we applied a gene co-expression network approach to transcriptomes of 41 BXD mouse strains under four conditions: stress, alcohol, stress-induced alcohol and control. The co-expression analysis identified 14 modules and characterized four expression patterns across the four conditions. The four expression patterns include up-regulation in no restraint stress and given an ethanol injection (NOE) but restoration in restraint stress followed by an ethanol injection (RSE; pattern 1), down-regulation in NOE but rescue in RSE (pattern 2), up-regulation in both restraint stress followed by a saline injection (RSS) and NOE, and further amplification in RSE (pattern 3), and up-regulation in RSS but reduction in both NOE and RSE (pattern 4). We further identified four functional subnetworks by superimposing protein-protein interactions (PPIs) to the 14 co-expression modules, including γ-aminobutyric acid receptor (GABA) signaling, glutamate signaling, neuropeptide signaling, cAMP-dependent signaling. We further performed module specificity analysis to identify modules that are specific to stress, alcohol, or stress-induced alcohol responses. Finally, we conducted causality analysis to link genetic variation to these identified modules, and anxiety behaviors after stress and alcohol treatments. This study underscores the importance of integrative analysis and offers new insights into the molecular networks underlying stress and alcohol responses. PMID:29674951
Synchronization of generalized reaction-diffusion neural networks with time-varying delays based on general integral inequalities and sampled-data control approach.

PubMed

Dharani, S; Rakkiyappan, R; Cao, Jinde; Alsaedi, Ahmed

2017-08-01

This paper explores the problem of synchronization of a class of generalized reaction-diffusion neural networks with mixed time-varying delays. The mixed time-varying delays under consideration comprise of both discrete and distributed delays. Due to the development and merits of digital controllers, sampled-data control is a natural choice to establish synchronization in continuous-time systems. Using a newly introduced integral inequality, less conservative synchronization criteria that assure the global asymptotic synchronization of the considered generalized reaction-diffusion neural network and mixed delays are established in terms of linear matrix inequalities (LMIs). The obtained easy-to-test LMI-based synchronization criteria depends on the delay bounds in addition to the reaction-diffusion terms, which is more practicable. Upon solving these LMIs by using Matlab LMI control toolbox, a desired sampled-data controller gain can be acuqired without any difficulty. Finally, numerical examples are exploited to express the validity of the derived LMI-based synchronization criteria.
A constitutive rheological model for agglomerating blood derived from nonequilibrium thermodynamics

NASA Astrophysics Data System (ADS)

Tsimouri, Ioanna Ch.; Stephanou, Pavlos S.; Mavrantzas, Vlasis G.

2018-03-01

Red blood cells tend to aggregate in the presence of plasma proteins, forming structures known as rouleaux. Here, we derive a constitutive rheological model for human blood which accounts for the formation and dissociation of rouleaux using the generalized bracket formulation of nonequilibrium thermodynamics. Similar to the model derived by Owens and co-workers ["A non-homogeneous constitutive model for human blood. Part 1. Model derivation and steady flow," J. Fluid Mech. 617, 327-354 (2008)] through polymer network theory, each rouleau in our model is represented as a dumbbell; the corresponding structural variable is the conformation tensor of the dumbbell. The kinetics of rouleau formation and dissociation is treated as in the work of Germann et al. ["Nonequilibrium thermodynamic modeling of the structure and rheology of concentrated wormlike micellar solutions," J. Non-Newton. Fluid Mech. 196, 51-57 (2013)] by assuming a set of reversible reactions, each characterized by a forward and a reverse rate constant. The final set of evolution equations for the microstructure of each rouleau and the expression for the stress tensor turn out to be very similar to those of Owens and co-workers. However, by explicitly considering a mechanism for the formation and breakage of rouleaux, our model further provides expressions for the aggregation and disaggregation rates appearing in the final transport equations, which in the kinetic theory-based network model of Owens were absent and had to be specified separately. Despite this, the two models are found to provide similar descriptions of experimental data on the size distribution of rouleaux.

Perceived Conventionality in Co-speech Gestures Involves the Fronto-Temporal Language Network.

PubMed

Wolf, Dhana; Rekittke, Linn-Marlen; Mittelberg, Irene; Klasen, Martin; Mathiak, Klaus

2017-01-01

Face-to-face communication is multimodal; it encompasses spoken words, facial expressions, gaze, and co-speech gestures. In contrast to linguistic symbols (e.g., spoken words or signs in sign language) relying on mostly explicit conventions, gestures vary in their degree of conventionality. Bodily signs may have a general accepted or conventionalized meaning (e.g., a head shake) or less so (e.g., self-grooming). We hypothesized that subjective perception of conventionality in co-speech gestures relies on the classical language network, i.e., the left hemispheric inferior frontal gyrus (IFG, Broca's area) and the posterior superior temporal gyrus (pSTG, Wernicke's area) and studied 36 subjects watching video-recorded story retellings during a behavioral and an functional magnetic resonance imaging (fMRI) experiment. It is well documented that neural correlates of such naturalistic videos emerge as intersubject covariance (ISC) in fMRI even without involving a stimulus (model-free analysis). The subjects attended either to perceived conventionality or to a control condition (any hand movements or gesture-speech relations). Such tasks modulate ISC in contributing neural structures and thus we studied ISC changes to task demands in language networks. Indeed, the conventionality task significantly increased covariance of the button press time series and neuronal synchronization in the left IFG over the comparison with other tasks. In the left IFG, synchronous activity was observed during the conventionality task only. In contrast, the left pSTG exhibited correlated activation patterns during all conditions with an increase in the conventionality task at the trend level only. Conceivably, the left IFG can be considered a core region for the processing of perceived conventionality in co-speech gestures similar to spoken language. In general, the interpretation of conventionalized signs may rely on neural mechanisms that engage during language comprehension.
Perceived Conventionality in Co-speech Gestures Involves the Fronto-Temporal Language Network

PubMed Central

Wolf, Dhana; Rekittke, Linn-Marlen; Mittelberg, Irene; Klasen, Martin; Mathiak, Klaus

2017-01-01

Face-to-face communication is multimodal; it encompasses spoken words, facial expressions, gaze, and co-speech gestures. In contrast to linguistic symbols (e.g., spoken words or signs in sign language) relying on mostly explicit conventions, gestures vary in their degree of conventionality. Bodily signs may have a general accepted or conventionalized meaning (e.g., a head shake) or less so (e.g., self-grooming). We hypothesized that subjective perception of conventionality in co-speech gestures relies on the classical language network, i.e., the left hemispheric inferior frontal gyrus (IFG, Broca's area) and the posterior superior temporal gyrus (pSTG, Wernicke's area) and studied 36 subjects watching video-recorded story retellings during a behavioral and an functional magnetic resonance imaging (fMRI) experiment. It is well documented that neural correlates of such naturalistic videos emerge as intersubject covariance (ISC) in fMRI even without involving a stimulus (model-free analysis). The subjects attended either to perceived conventionality or to a control condition (any hand movements or gesture-speech relations). Such tasks modulate ISC in contributing neural structures and thus we studied ISC changes to task demands in language networks. Indeed, the conventionality task significantly increased covariance of the button press time series and neuronal synchronization in the left IFG over the comparison with other tasks. In the left IFG, synchronous activity was observed during the conventionality task only. In contrast, the left pSTG exhibited correlated activation patterns during all conditions with an increase in the conventionality task at the trend level only. Conceivably, the left IFG can be considered a core region for the processing of perceived conventionality in co-speech gestures similar to spoken language. In general, the interpretation of conventionalized signs may rely on neural mechanisms that engage during language comprehension. PMID:29249945
Genetic network inference as a series of discrimination tasks.

PubMed

Kimura, Shuhei; Nakayama, Satoshi; Hatakeyama, Mariko

2009-04-01

Genetic network inference methods based on sets of differential equations generally require a great deal of time, as the equations must be solved many times. To reduce the computational cost, researchers have proposed other methods for inferring genetic networks by solving sets of differential equations only a few times, or even without solving them at all. When we try to obtain reasonable network models using these methods, however, we must estimate the time derivatives of the gene expression levels with great precision. In this study, we propose a new method to overcome the drawbacks of inference methods based on sets of differential equations. Our method infers genetic networks by obtaining classifiers capable of predicting the signs of the derivatives of the gene expression levels. For this purpose, we defined a genetic network inference problem as a series of discrimination tasks, then solved the defined series of discrimination tasks with a linear programming machine. Our experimental results demonstrated that the proposed method is capable of correctly inferring genetic networks, and doing so more than 500 times faster than the other inference methods based on sets of differential equations. Next, we applied our method to actual expression data of the bacterial SOS DNA repair system. And finally, we demonstrated that our approach relates to the inference method based on the S-system model. Though our method provides no estimation of the kinetic parameters, it should be useful for researchers interested only in the network structure of a target system. Supplementary data are available at Bioinformatics online.
Inferring nonlinear gene regulatory networks from gene expression data based on distance correlation.

PubMed

Guo, Xiaobo; Zhang, Ye; Hu, Wenhao; Tan, Haizhu; Wang, Xueqin

2014-01-01

Nonlinear dependence is general in regulation mechanism of gene regulatory networks (GRNs). It is vital to properly measure or test nonlinear dependence from real data for reconstructing GRNs and understanding the complex regulatory mechanisms within the cellular system. A recently developed measurement called the distance correlation (DC) has been shown powerful and computationally effective in nonlinear dependence for many situations. In this work, we incorporate the DC into inferring GRNs from the gene expression data without any underling distribution assumptions. We propose three DC-based GRNs inference algorithms: CLR-DC, MRNET-DC and REL-DC, and then compare them with the mutual information (MI)-based algorithms by analyzing two simulated data: benchmark GRNs from the DREAM challenge and GRNs generated by SynTReN network generator, and an experimentally determined SOS DNA repair network in Escherichia coli. According to both the receiver operator characteristic (ROC) curve and the precision-recall (PR) curve, our proposed algorithms significantly outperform the MI-based algorithms in GRNs inference.
Inferring Nonlinear Gene Regulatory Networks from Gene Expression Data Based on Distance Correlation

PubMed Central

Guo, Xiaobo; Zhang, Ye; Hu, Wenhao; Tan, Haizhu; Wang, Xueqin

2014-01-01

Nonlinear dependence is general in regulation mechanism of gene regulatory networks (GRNs). It is vital to properly measure or test nonlinear dependence from real data for reconstructing GRNs and understanding the complex regulatory mechanisms within the cellular system. A recently developed measurement called the distance correlation (DC) has been shown powerful and computationally effective in nonlinear dependence for many situations. In this work, we incorporate the DC into inferring GRNs from the gene expression data without any underling distribution assumptions. We propose three DC-based GRNs inference algorithms: CLR-DC, MRNET-DC and REL-DC, and then compare them with the mutual information (MI)-based algorithms by analyzing two simulated data: benchmark GRNs from the DREAM challenge and GRNs generated by SynTReN network generator, and an experimentally determined SOS DNA repair network in Escherichia coli. According to both the receiver operator characteristic (ROC) curve and the precision-recall (PR) curve, our proposed algorithms significantly outperform the MI-based algorithms in GRNs inference. PMID:24551058
The complex networks approach for authorship attribution of books

NASA Astrophysics Data System (ADS)

Mehri, Ali; Darooneh, Amir H.; Shariati, Ashrafalsadat

2012-04-01

Authorship analysis by means of textual features is an important task in linguistic studies. We employ complex networks theory to tackle this disputed problem. In this work, we focus on some measurable quantities of word co-occurrence network of each book for authorship characterization. Based on the network features, attribution probability is defined for authorship identification. Furthermore, two scaling exponents, q-parameter and α-exponent, are combined to classify personal writing style with acceptable high resolution power. The q-parameter, generally known as the nonextensivity measure, is calculated for degree distribution and the α-exponent comes from a power law relationship between number of links and number of nodes in the co-occurrence network constructed for different books written by each author. The applicability of the presented method is evaluated in an experiment with thirty six books of five Persian litterateurs. Our results show high accuracy rate in authorship attribution.
Identification of candidate genes in Populus cell wall biosynthesis using text-mining, co-expression network and comparative genomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yang, Xiaohan; Ye, Chuyu; Bisaria, Anjali

2011-01-01

Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical in reducing biomass recalcitrance, a major hindrance in efficient generation of ethanol from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes that had the experimental evidences supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database and additionalmore » genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and re-construct a network that captured additional network neighbors, expanding to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high likelihood candidates for functional genomics in relation to cell wall biosynthesis.« less
Gene network interconnectedness and the generalized topological overlap measure

PubMed Central

Yip, Andy M; Horvath, Steve

2007-01-01

Background Network methods are increasingly used to represent the interactions of genes and/or proteins. Genes or proteins that are directly linked may have a similar biological function or may be part of the same biological pathway. Since the information on the connection (adjacency) between 2 nodes may be noisy or incomplete, it can be desirable to consider alternative measures of pairwise interconnectedness. Here we study a class of measures that are proportional to the number of neighbors that a pair of nodes share in common. For example, the topological overlap measure by Ravasz et al. [1] can be interpreted as a measure of agreement between the m = 1 step neighborhoods of 2 nodes. Several studies have shown that two proteins having a higher topological overlap are more likely to belong to the same functional class than proteins having a lower topological overlap. Here we address the question whether a measure of topological overlap based on higher-order neighborhoods could give rise to a more robust and sensitive measure of interconnectedness. Results We generalize the topological overlap measure from m = 1 step neighborhoods to m ≥ 2 step neighborhoods. This allows us to define the m-th order generalized topological overlap measure (GTOM) by (i) counting the number of m-step neighbors that a pair of nodes share and (ii) normalizing it to take a value between 0 and 1. Using theoretical arguments, a yeast co-expression network application, and a fly protein network application, we illustrate the usefulness of the proposed measure for module detection and gene neighborhood analysis. Conclusion Topological overlap can serve as an important filter to counter the effects of spurious or missing connections between network nodes. The m-th order topological overlap measure allows one to trade-off sensitivity versus specificity when it comes to defining pairwise interconnectedness and network modules. PMID:17250769
Deciphering microbial interactions and detecting keystone species with co-occurrence networks.

PubMed

Berry, David; Widder, Stefanie

2014-01-01

Co-occurrence networks produced from microbial survey sequencing data are frequently used to identify interactions between community members. While this approach has potential to reveal ecological processes, it has been insufficiently validated due to the technical limitations inherent in studying complex microbial ecosystems. Here, we simulate multi-species microbial communities with known interaction patterns using generalized Lotka-Volterra dynamics. We then construct co-occurrence networks and evaluate how well networks reveal the underlying interactions and how experimental and ecological parameters can affect network inference and interpretation. We find that co-occurrence networks can recapitulate interaction networks under certain conditions, but that they lose interpretability when the effects of habitat filtering become significant. We demonstrate that networks suffer from local hot spots of spurious correlation in the neighborhood of hub species that engage in many interactions. We also identify topological features associated with keystone species in co-occurrence networks. This study provides a substantiated framework to guide environmental microbiologists in the construction and interpretation of co-occurrence networks from microbial survey datasets.
The COPD Knowledge Base: enabling data analysis and computational simulation in translational COPD research.

PubMed

Cano, Isaac; Tényi, Ákos; Schueller, Christine; Wolff, Martin; Huertas Migueláñez, M Mercedes; Gomez-Cabrero, David; Antczak, Philipp; Roca, Josep; Cascante, Marta; Falciani, Francesco; Maier, Dieter

2014-11-28

Previously we generated a chronic obstructive pulmonary disease (COPD) specific knowledge base (http://www.copdknowledgebase.eu) from clinical and experimental data, text-mining results and public databases. This knowledge base allowed the retrieval of specific molecular networks together with integrated clinical and experimental data. The COPDKB has now been extended to integrate over 40 public data sources on functional interaction (e.g. signal transduction, transcriptional regulation, protein-protein interaction, gene-disease association). In addition we integrated COPD-specific expression and co-morbidity networks connecting over 6 000 genes/proteins with physiological parameters and disease states. Three mathematical models describing different aspects of systemic effects of COPD were connected to clinical and experimental data. We have completely redesigned the technical architecture of the user interface and now provide html and web browser-based access and form-based searches. A network search enables the use of interconnecting information and the generation of disease-specific sub-networks from general knowledge. Integration with the Synergy-COPD Simulation Environment enables multi-scale integrated simulation of individual computational models while integration with a Clinical Decision Support System allows delivery into clinical practice. The COPD Knowledge Base is the only publicly available knowledge resource dedicated to COPD and combining genetic information with molecular, physiological and clinical data as well as mathematical modelling. Its integrated analysis functions provide overviews about clinical trends and connections while its semantically mapped content enables complex analysis approaches. We plan to further extend the COPDKB by offering it as a repository to publish and semantically integrate data from relevant clinical trials. The COPDKB is freely available after registration at http://www.copdknowledgebase.eu.
CoNNeCT Antenna Positioning System Dynamic Simulator Modal Model Correlation

NASA Technical Reports Server (NTRS)

Jones, Tevor M.; McNelis, Mark E.; Staab, Lucas D.; Akers, James C.; Suarez, Vicente

2012-01-01

The National Aeronautics and Space Administration (NASA) developed an on-orbit, adaptable, Software Defined Radios (SDR)/Space Telecommunications Radio System (STRS)-based testbed facility to conduct a suite of experiments to advance technologies, reduce risk, and enable future mission capabilities on the International Space Station (ISS). The Communications, Navigation, and Networking reConfigurable Testbed (CoNNeCT) Project will provide NASA, industry, other Government agencies, and academic partners the opportunity to develop and field communications, navigation, and networking technologies in both the laboratory and space environment based on reconfigurable, software-defined radio platforms and the STRS Architecture. The CoNNeCT Payload Operations Nomenclature is "SCAN Testbed," and this nomenclature will be used in all ISS integration, safety, verification, and operations documentation. The SCAN Testbed (payload) is a Flight Releasable Attachment Mechanism (FRAM) based payload that will launch aboard the Japanese H-II Transfer Vehicle (HTV) Multipurpose Exposed Pallet (EP-MP) to the International Space Station (ISS), and will be transferred to the Express Logistics Carrier 3 (ELC3) via Extravehicular Robotics (EVR). The SCAN Testbed will operate on-orbit for a minimum of two years.
CoNNeCT Antenna Positioning System Dynamic Simulator Modal Model Correlation

NASA Technical Reports Server (NTRS)

Jones, Trevor M.; McNelis, Mark E.; Staab, Lucas D.; Akers, James C.; Suarez, Vicente J.

2012-01-01

The National Aeronautics and Space Administration (NASA) developed an on-orbit, adaptable, Software Defined Radios (SDR)/Space Telecommunications Radio System (STRS)-based testbed facility to conduct a suite of experiments to advance technologies, reduce risk, and enable future mission capabilities on the International Space Station (ISS). The Communications, Navigation, and Networking reConfigurable Testbed (CoNNeCT) Project will provide NASA, industry, other Government agencies, and academic partners the opportunity to develop and field communications, navigation, and networking technologies in both the laboratory and space environment based on reconfigurable, software-defined radio platforms and the STRS Architecture. The CoNNeCT Payload Operations Nomenclature is SCAN Testbed, and this nomenclature will be used in all ISS integration, safety, verification, and operations documentation. The SCAN Testbed (payload) is a Flight Releasable Attachment Mechanism (FRAM) based payload that will launch aboard the Japanese H-II Transfer Vehicle (HTV) Multipurpose Exposed Pallet (EP-MP) to the International Space Station (ISS), and will be transferred to the Express Logistics Carrier 3 (ELC3) via Extravehicular Robotics (EVR). The SCAN Testbed will operate on-orbit for a minimum of two years.
Co-Attention Based Neural Network for Source-Dependent Essay Scoring

ERIC Educational Resources Information Center

Zhang, Haoran; Litman, Diane

2018-01-01

This paper presents an investigation of using a co-attention based neural network for source-dependent essay scoring. We use a co-attention mechanism to help the model learn the importance of each part of the essay more accurately. Also, this paper shows that the co-attention based neural network model provides reliable score prediction of…
Network module detection: Affinity search technique with the multi-node topological overlap measure

PubMed Central

Li, Ai; Horvath, Steve

2009-01-01

Background Many clustering procedures only allow the user to input a pairwise dissimilarity or distance measure between objects. We propose a clustering method that can input a multi-point dissimilarity measure d(i1, i2, ..., iP) where the number of points P can be larger than 2. The work is motivated by gene network analysis where clusters correspond to modules of highly interconnected nodes. Here, we define modules as clusters of network nodes with high multi-node topological overlap. The topological overlap measure is a robust measure of interconnectedness which is based on shared network neighbors. In previous work, we have shown that the multi-node topological overlap measure yields biologically meaningful results when used as input of network neighborhood analysis. Findings We adapt network neighborhood analysis for the use of module detection. We propose the Module Affinity Search Technique (MAST), which is a generalized version of the Cluster Affinity Search Technique (CAST). MAST can accommodate a multi-node dissimilarity measure. Clusters grow around user-defined or automatically chosen seeds (e.g. hub nodes). We propose both local and global cluster growth stopping rules. We use several simulations and a gene co-expression network application to argue that the MAST approach leads to biologically meaningful results. We compare MAST with hierarchical clustering and partitioning around medoid clustering. Conclusion Our flexible module detection method is implemented in the MTOM software which can be downloaded from the following webpage: PMID:19619323
Network module detection: Affinity search technique with the multi-node topological overlap measure.

PubMed

Li, Ai; Horvath, Steve

2009-07-20

Many clustering procedures only allow the user to input a pairwise dissimilarity or distance measure between objects. We propose a clustering method that can input a multi-point dissimilarity measure d(i1, i2, ..., iP) where the number of points P can be larger than 2. The work is motivated by gene network analysis where clusters correspond to modules of highly interconnected nodes. Here, we define modules as clusters of network nodes with high multi-node topological overlap. The topological overlap measure is a robust measure of interconnectedness which is based on shared network neighbors. In previous work, we have shown that the multi-node topological overlap measure yields biologically meaningful results when used as input of network neighborhood analysis. We adapt network neighborhood analysis for the use of module detection. We propose the Module Affinity Search Technique (MAST), which is a generalized version of the Cluster Affinity Search Technique (CAST). MAST can accommodate a multi-node dissimilarity measure. Clusters grow around user-defined or automatically chosen seeds (e.g. hub nodes). We propose both local and global cluster growth stopping rules. We use several simulations and a gene co-expression network application to argue that the MAST approach leads to biologically meaningful results. We compare MAST with hierarchical clustering and partitioning around medoid clustering. Our flexible module detection method is implemented in the MTOM software which can be downloaded from the following webpage: http://www.genetics.ucla.edu/labs/horvath/MTOM/
Community Structure Analysis of Transcriptional Networks Reveals Distinct Molecular Pathways for Early- and Late-Onset Temporal Lobe Epilepsy with Childhood Febrile Seizures

PubMed Central

Moreira-Filho, Carlos Alberto; Bando, Silvia Yumi; Bertonha, Fernanda Bernardi; Iamashita, Priscila; Silva, Filipi Nascimento; Costa, Luciano da Fontoura; Silva, Alexandre Valotta; Castro, Luiz Henrique Martins; Wen, Hung-Tzu

2015-01-01

Age at epilepsy onset has a broad impact on brain plasticity and epilepsy pathomechanisms. Prolonged febrile seizures in early childhood (FS) constitute an initial precipitating insult (IPI) commonly associated with mesial temporal lobe epilepsy (MTLE). FS-MTLE patients may have early disease onset, i.e. just after the IPI, in early childhood, or late-onset, ranging from mid-adolescence to early adult life. The mechanisms governing early (E) or late (L) disease onset are largely unknown. In order to unveil the molecular pathways underlying E and L subtypes of FS-MTLE we investigated global gene expression in hippocampal CA3 explants of FS-MTLE patients submitted to hippocampectomy. Gene coexpression networks (GCNs) were obtained for the E and L patient groups. A network-based approach for GCN analysis was employed allowing: i) the visualization and analysis of differentially expressed (DE) and complete (CO) - all valid GO annotated transcripts - GCNs for the E and L groups; ii) the study of interactions between all the system’s constituents based on community detection and coarse-grained community structure methods. We found that the E-DE communities with strongest connection weights harbor highly connected genes mainly related to neural excitability and febrile seizures, whereas in L-DE communities these genes are not only involved in network excitability but also playing roles in other epilepsy-related processes. Inversely, in E-CO the strongly connected communities are related to compensatory pathways (seizure inhibition, neuronal survival and responses to stress conditions) while in L-CO these communities harbor several genes related to pro-epileptic effects, seizure-related mechanisms and vulnerability to epilepsy. These results fit the concept, based on fMRI and behavioral studies, that early onset epilepsies, although impacting more severely the hippocampus, are associated to compensatory mechanisms, while in late MTLE development the brain is less able to generate adaptive mechanisms, what has implications for epilepsy management and drug discovery. PMID:26011637
Physiological and molecular alterations in plants exposed to high [CO2] under phosphorus stress.

PubMed

Pandey, Renu; Zinta, Gaurav; AbdElgawad, Hamada; Ahmad, Altaf; Jain, Vanita; Janssens, Ivan A

2015-01-01

Atmospheric [CO2] has increased substantially in recent decades and will continue to do so, whereas the availability of phosphorus (P) is limited and unlikely to increase in the future. P is a non-renewable resource, and it is essential to every form of life. P is a key plant nutrient controlling the responsiveness of photosynthesis to [CO2]. Increases in [CO2] typically results in increased biomass through stimulation of net photosynthesis, and hence enhance the demand for P uptake. However, most soils contain low concentrations of available P. Therefore, low P is one of the major growth-limiting factors for plants in many agricultural and natural ecosystems. The adaptive responses of plants to [CO2] and P availability encompass alterations at morphological, physiological, biochemical and molecular levels. In general low P reduces growth, whereas high [CO2] enhances it particularly in C3 plants. Photosynthetic capacity is often enhanced under high [CO2] with sufficient P supply through modulation of enzyme activities involved in carbon fixation such as ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco). However, high [CO2] with low P availability results in enhanced dry matter partitioning towards roots. Alterations in below-ground processes including root morphology, exudation and mycorrhizal association are influenced by [CO2] and P availability. Under high P availability, elevated [CO2] improves the uptake of P from soil. In contrast, under low P availability, high [CO2] mainly improves the efficiency with which plants produce biomass per unit P. At molecular level, the spatio-temporal regulation of genes involved in plant adaptation to low P and high [CO2] has been studied individually in various plant species. Genome-wide expression profiling of high [CO2] grown plants revealed hormonal regulation of biomass accumulation through complex transcriptional networks. Similarly, differential transcriptional regulatory networks are involved in P-limitation responses in plants. Analysis of expression patterns of some typical P-limitation induced genes under high [CO2] suggests that long-term exposure of plants to high [CO2] would have a tendency to stimulate similar transcriptional responses as observed under P-limitation. However, studies on the combined effect of high [CO2] and low P on gene expression are scarce. Such studies would provide insights into the development of P efficient crops in the context of anticipated increases in atmospheric [CO2]. Copyright © 2015 Elsevier Inc. All rights reserved.
Identification of rice genes associated with cosmic-ray response via co-expression gene network analysis.

PubMed

Hwang, Sun-Goo; Kim, Dong Sub; Hwang, Jung Eun; Han, A-Reum; Jang, Cheol Seong

2014-05-15

In order to better understand the biological systems that are affected in response to cosmic ray (CR), we conducted weighted gene co-expression network analysis using the module detection method. By using the Pearson's correlation coefficient (PCC) value, we evaluated complex gene-gene functional interactions between 680 CR-responsive probes from integrated microarray data sets, which included large-scale transcriptional profiling of 1000 microarray samples. These probes were divided into 6 distinct modules that contained 20 enriched gene ontology (GO) functions, such as oxidoreductase activity, hydrolase activity, and response to stimulus and stress. In particular, modules 1 and 2 commonly showed enriched annotation categories such as oxidoreductase activity, including enriched cis-regulatory elements known as ROS-specific regulators. These results suggest that the ROS-mediated irradiation response pathway is affected by CR in modules 1 and 2. We found 243 ionizing radiation (IR)-responsive probes that exhibited similarities in expression patterns in various irradiation microarray data sets. The expression patterns of 6 randomly selected IR-responsive genes were evaluated by quantitative reverse transcription polymerase chain reaction following treatment with CR, gamma rays (GR), and ion beam (IB); similar patterns were observed among these genes under these 3 treatments. Moreover, we constructed subnetworks of IR-responsive genes and evaluated the expression levels of their neighboring genes following GR treatment; similar patterns were observed among them. These results of network-based analyses might provide a clue to understanding the complex biological system related to the CR response in plants. Copyright © 2014 Elsevier B.V. All rights reserved.
Integrated analysis of long noncoding RNA and mRNA expression profile in children with obesity by microarray analysis.

PubMed

Liu, Yuesheng; Ji, Yuqiang; Li, Min; Wang, Min; Yi, Xiaoqing; Yin, Chunyan; Wang, Sisi; Zhang, Meizhen; Zhao, Zhao; Xiao, Yanfeng

2018-06-08

Long noncoding RNAs (lncRNAs) have an important role in adipose tissue function and energy metabolism homeostasis, and abnormalities may lead to obesity. To investigate whether lncRNAs are involved in childhood obesity, we investigated the differential expression profile of lncRNAs in obese children compared with non-obese children. A total number of 1268 differentially expressed lncRNAs and 1085 differentially expressed mRNAs were identified. Gene Ontology (GO) and pathway analysis revealed that these lncRNAs were involved in varied biological processes, including the inflammatory response, lipid metabolic process, osteoclast differentiation and fatty acid metabolism. In addition, the lncRNA-mRNA co-expression network and the protein-protein interaction (PPI) network were constructed to identify hub regulatory lncRNAs and genes based on the microarray expression profiles. This study for the first time identifies an expression profile of differentially expressed lncRNAs in obese children and indicated hub lncRNA RP11-20G13.3 attenuated adipogenesis of preadipocytes, which is conducive to the search for new diagnostic and therapeutic strategies of childhood obesity.
A systems-genetics approach and data mining tool to assist in the discovery of genes underlying complex traits in Oryza sativa.

PubMed

Ficklin, Stephen P; Feltus, Frank Alex

2013-01-01

Many traits of biological and agronomic significance in plants are controlled in a complex manner where multiple genes and environmental signals affect the expression of the phenotype. In Oryza sativa (rice), thousands of quantitative genetic signals have been mapped to the rice genome. In parallel, thousands of gene expression profiles have been generated across many experimental conditions. Through the discovery of networks with real gene co-expression relationships, it is possible to identify co-localized genetic and gene expression signals that implicate complex genotype-phenotype relationships. In this work, we used a knowledge-independent, systems genetics approach, to discover a high-quality set of co-expression networks, termed Gene Interaction Layers (GILs). Twenty-two GILs were constructed from 1,306 Affymetrix microarray rice expression profiles that were pre-clustered to allow for improved capture of gene co-expression relationships. Functional genomic and genetic data, including over 8,000 QTLs and 766 phenotype-tagged SNPs (p-value < = 0.001) from genome-wide association studies, both covering over 230 different rice traits were integrated with the GILs. An online systems genetics data-mining resource, the GeneNet Engine, was constructed to enable dynamic discovery of gene sets (i.e. network modules) that overlap with genetic traits. GeneNet Engine does not provide the exact set of genes underlying a given complex trait, but through the evidence of gene-marker correspondence, co-expression, and functional enrichment, site visitors can identify genes with potential shared causality for a trait which could then be used for experimental validation. A set of 2 million SNPs was incorporated into the database and serve as a potential set of testable biomarkers for genes in modules that overlap with genetic traits. Herein, we describe two modules found using GeneNet Engine, one with significant overlap with the trait amylose content and another with significant overlap with blast disease resistance.

A Systems-Genetics Approach and Data Mining Tool to Assist in the Discovery of Genes Underlying Complex Traits in Oryza sativa

PubMed Central

Ficklin, Stephen P.; Feltus, Frank Alex

2013-01-01

Many traits of biological and agronomic significance in plants are controlled in a complex manner where multiple genes and environmental signals affect the expression of the phenotype. In Oryza sativa (rice), thousands of quantitative genetic signals have been mapped to the rice genome. In parallel, thousands of gene expression profiles have been generated across many experimental conditions. Through the discovery of networks with real gene co-expression relationships, it is possible to identify co-localized genetic and gene expression signals that implicate complex genotype-phenotype relationships. In this work, we used a knowledge-independent, systems genetics approach, to discover a high-quality set of co-expression networks, termed Gene Interaction Layers (GILs). Twenty-two GILs were constructed from 1,306 Affymetrix microarray rice expression profiles that were pre-clustered to allow for improved capture of gene co-expression relationships. Functional genomic and genetic data, including over 8,000 QTLs and 766 phenotype-tagged SNPs (p-value < = 0.001) from genome-wide association studies, both covering over 230 different rice traits were integrated with the GILs. An online systems genetics data-mining resource, the GeneNet Engine, was constructed to enable dynamic discovery of gene sets (i.e. network modules) that overlap with genetic traits. GeneNet Engine does not provide the exact set of genes underlying a given complex trait, but through the evidence of gene-marker correspondence, co-expression, and functional enrichment, site visitors can identify genes with potential shared causality for a trait which could then be used for experimental validation. A set of 2 million SNPs was incorporated into the database and serve as a potential set of testable biomarkers for genes in modules that overlap with genetic traits. Herein, we describe two modules found using GeneNet Engine, one with significant overlap with the trait amylose content and another with significant overlap with blast disease resistance. PMID:23874666
NFκB pathway analysis: An approach to analyze gene co-expression networks employing feedback cycles.

PubMed

Dillenburg, Fabiane Cristine; Zanotto-Filho, Alfeu; Fonseca Moreira, José Cláudio; Ribeiro, Leila; Carro, Luigi

2018-02-01

The genes of the NFκB pathway are involved in the control of a plethora of biological processes ranking from inhibition of apoptosis to metastasis in cancer. It has been described that Gliobastoma multiforme (GBM) patients carry aberrant NFκB activation, but the molecular mechanisms are not completely understood. Here, we present a NFκB pathway analysis in tumor specimens of GBM compared to non-neoplasic brain tissues, based on the different kind of cycles found among genes of a gene co-expression network constructed using quantized data obtained from the microarrays. A cycle is a closed walk with all vertices distinct (except the first and last). Thanks to this way of finding relations among genes, a more robust interpretation of gene correlations is possible, because the cycles are associated with feedback mechanisms that are very common in biological networks. In GBM samples, we could conclude that the stoichiometric relationship between genes involved in NFκB pathway regulation is unbalanced. This can be measured and explained by the identification of a cycle. This conclusion helps to understand more about the biology of this type of tumor. Copyright © 2017 Elsevier Ltd. All rights reserved.
Network-based differential gene expression analysis suggests cell cycle related genes regulated by E2F1 underlie the molecular difference between smoker and non-smoker lung adenocarcinoma

PubMed Central

2013-01-01

Background Differential gene expression (DGE) analysis is commonly used to reveal the deregulated molecular mechanisms of complex diseases. However, traditional DGE analysis (e.g., the t test or the rank sum test) tests each gene independently without considering interactions between them. Top-ranked differentially regulated genes prioritized by the analysis may not directly relate to the coherent molecular changes underlying complex diseases. Joint analyses of co-expression and DGE have been applied to reveal the deregulated molecular modules underlying complex diseases. Most of these methods consist of separate steps: first to identify gene-gene relationships under the studied phenotype then to integrate them with gene expression changes for prioritizing signature genes, or vice versa. It is warrant a method that can simultaneously consider gene-gene co-expression strength and corresponding expression level changes so that both types of information can be leveraged optimally. Results In this paper, we develop a gene module based method for differential gene expression analysis, named network-based differential gene expression (nDGE) analysis, a one-step integrative process for prioritizing deregulated genes and grouping them into gene modules. We demonstrate that nDGE outperforms existing methods in prioritizing deregulated genes and discovering deregulated gene modules using simulated data sets. When tested on a series of smoker and non-smoker lung adenocarcinoma data sets, we show that top differentially regulated genes identified by the rank sum test in different sets are not consistent while top ranked genes defined by nDGE in different data sets significantly overlap. nDGE results suggest that a differentially regulated gene module, which is enriched for cell cycle related genes and E2F1 targeted genes, plays a role in the molecular differences between smoker and non-smoker lung adenocarcinoma. Conclusions In this paper, we develop nDGE to prioritize deregulated genes and group them into gene modules by simultaneously considering gene expression level changes and gene-gene co-regulations. When applied to both simulated and empirical data, nDGE outperforms the traditional DGE method. More specifically, when applied to smoker and non-smoker lung cancer sets, nDGE results illustrate the molecular differences between smoker and non-smoker lung cancer. PMID:24341432
Genome-wide analysis of aberrantly expressed lncRNAs and miRNAs with associated co-expression and ceRNA networks in β-thalassemia and hereditary persistence of fetal hemoglobin.

PubMed

Lai, Ketong; Jia, Siyuan; Yu, Shanjuan; Luo, Jianming; He, Yunyan

2017-07-25

The implications of lncRNAs regarding fetal hemoglobin (HbF) induction in hemoglobin disorders remain poorly understood. In this study, microarray analysis was performed to profile lncRNAs, miRNAs and mRNAs in individuals with hereditary persistence of fetal hemoglobin (HPFH), β-thalassemia carriers with high HbF levels and healthy controls. The results show aberrant expression of 862 lncRNAs, 568 mRNAs and 63 miRNAs in the high-HbF group compared with the control group. Altered NR_001589, NR_120526, T315543, miR-486-3p, miR-19b-1-5p and miR-20a-3p expression was confirmed by quantitative reverse transcription-polymerase chain reaction, and Spearman correlation coefficients revealed significant positive correlations with HbF. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses showed the hematopoietic cell lineage and apoptosis to be most significantly dysregulated in HbF induction. We analyzed coding genes near the lncRNAs and constructed a coding-noncoding co-expression network. Based on the results, lncRNAs likely contribute to increased HbF levels by activating expression of HBE1 and hematopoietic cell lineage-inducible molecules and by inhibiting that of apoptosis-inducible molecules. Finally, through construction of a competing endogenous RNA network, we found that 6 lncRNAs could bind competitively with miR-486-3p, resulting in increased HbF levels. Taken together, our findings provide new insights into the mechanisms of HbF induction and potentially provide new targets for the treatment of β-thalassemia major.
Genome-wide analysis of aberrantly expressed lncRNAs and miRNAs with associated co-expression and ceRNA networks in β-thalassemia and hereditary persistence of fetal hemoglobin

PubMed Central

Yu, Shanjuan; Luo, Jianming; He, Yunyan

2017-01-01

The implications of lncRNAs regarding fetal hemoglobin (HbF) induction in hemoglobin disorders remain poorly understood. In this study, microarray analysis was performed to profile lncRNAs, miRNAs and mRNAs in individuals with hereditary persistence of fetal hemoglobin (HPFH), β-thalassemia carriers with high HbF levels and healthy controls. The results show aberrant expression of 862 lncRNAs, 568 mRNAs and 63 miRNAs in the high-HbF group compared with the control group. Altered NR_001589, NR_120526, T315543, miR-486-3p, miR-19b-1-5p and miR-20a-3p expression was confirmed by quantitative reverse transcription-polymerase chain reaction, and Spearman correlation coefficients revealed significant positive correlations with HbF. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses showed the hematopoietic cell lineage and apoptosis to be most significantly dysregulated in HbF induction. We analyzed coding genes near the lncRNAs and constructed a coding-noncoding co-expression network. Based on the results, lncRNAs likely contribute to increased HbF levels by activating expression of HBE1 and hematopoietic cell lineage-inducible molecules and by inhibiting that of apoptosis-inducible molecules. Finally, through construction of a competing endogenous RNA network, we found that 6 lncRNAs could bind competitively with miR-486-3p, resulting in increased HbF levels. Taken together, our findings provide new insights into the mechanisms of HbF induction and potentially provide new targets for the treatment of β-thalassemia major. PMID:28624809
Blood gene expression profiles suggest altered immune function associated with symptoms of generalized anxiety disorder.

PubMed

Wingo, Aliza P; Gibson, Greg

2015-01-01

Prospective epidemiological studies found that generalized anxiety disorder (GAD) can impair immune function and increase risk for cardiovascular disease or events. Mechanisms underlying the physiological reverberations of anxiety, however, are still elusive. Hence, we aimed to investigate molecular processes mediating effects of anxiety on physical health using blood gene expression profiles of 336 community participants (157 anxious and 179 control). We examined genome-wide differential gene expression in anxiety, as well as associations between nine major modules of co-regulated transcripts in blood gene expression and anxiety. No significant differential expression was observed in women, but 631 genes were differentially expressed between anxious and control men at the false discovery rate of 0.1 after controlling for age, body mass index, race, and batch effect. Gene set enrichment analysis (GSEA) revealed that genes with altered expression levels in anxious men were involved in response of various immune cells to vaccination and to acute viral and bacterial infection, and in a metabolic network affecting traits of metabolic syndrome. Further, we found one set of 260 co-regulated genes to be significantly associated with anxiety in men after controlling for the relevant covariates, and demonstrate its equivalence to a component of the stress-related conserved transcriptional response to adversity profile. Taken together, our results suggest potential molecular pathways that can explain negative effects of GAD observed in epidemiological studies. Remarkably, even mild anxiety, which most of our participants had, was associated with observable changes in immune-related gene expression levels. Our findings generate hypotheses and provide incremental insights into molecular mechanisms mediating negative physiological effects of GAD. Published by Elsevier Inc.
[Weighted gene co-expression network analysis in biomedicine research].

PubMed

Liu, Wei; Li, Li; Ye, Hua; Tu, Wei

2017-11-25

High-throughput biological technologies are now widely applied in biology and medicine, allowing scientists to monitor thousands of parameters simultaneously in a specific sample. However, it is still an enormous challenge to mine useful information from high-throughput data. The emergence of network biology provides deeper insights into complex bio-system and reveals the modularity in tissue/cellular networks. Correlation networks are increasingly used in bioinformatics applications. Weighted gene co-expression network analysis (WGCNA) tool can detect clusters of highly correlated genes. Therefore, we systematically reviewed the application of WGCNA in the study of disease diagnosis, pathogenesis and other related fields. First, we introduced principle, workflow, advantages and disadvantages of WGCNA. Second, we presented the application of WGCNA in disease, physiology, drug, evolution and genome annotation. Then, we indicated the application of WGCNA in newly developed high-throughput methods. We hope this review will help to promote the application of WGCNA in biomedicine research.
NorWood: a gene expression resource for evo-devo studies of conifer wood development.

PubMed

Jokipii-Lukkari, Soile; Sundell, David; Nilsson, Ove; Hvidsten, Torgeir R; Street, Nathaniel R; Tuominen, Hannele

2017-10-01

The secondary xylem of conifers is composed mainly of tracheids that differ anatomically and chemically from angiosperm xylem cells. There is currently no high-spatial-resolution data available profiling gene expression during wood formation for any coniferous species, which limits insight into tracheid development. RNA-sequencing data from replicated, high-spatial-resolution section series throughout the cambial and woody tissues of Picea abies were used to generate the NorWood.conGenIE.org web resource, which facilitates exploration of the associated gene expression profiles and co-expression networks. Integration within PlantGenIE.org enabled a comparative regulomics analysis, revealing divergent co-expression networks between P. abies and the two angiosperm species Arabidopsis thaliana and Populus tremula for the secondary cell wall (SCW) master regulator NAC Class IIB transcription factors. The SCW cellulose synthase genes (CesAs) were located in the neighbourhoods of the NAC factors in A. thaliana and P. tremula, but not in P. abies. The NorWood co-expression network enabled identification of potential SCW CesA regulators in P. abies. The NorWood web resource represents a powerful community tool for generating evo-devo insights into the divergence of wood formation between angiosperms and gymnosperms and for advancing understanding of the regulation of wood development in P. abies. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Genetic Network Inference: From Co-Expression Clustering to Reverse Engineering

NASA Technical Reports Server (NTRS)

Dhaeseleer, Patrik; Liang, Shoudan; Somogyi, Roland

2000-01-01

Advances in molecular biological, analytical, and computational technologies are enabling us to systematically investigate the complex molecular processes underlying biological systems. In particular, using high-throughput gene expression assays, we are able to measure the output of the gene regulatory network. We aim here to review datamining and modeling approaches for conceptualizing and unraveling the functional relationships implicit in these datasets. Clustering of co-expression profiles allows us to infer shared regulatory inputs and functional pathways. We discuss various aspects of clustering, ranging from distance measures to clustering algorithms and multiple-duster memberships. More advanced analysis aims to infer causal connections between genes directly, i.e., who is regulating whom and how. We discuss several approaches to the problem of reverse engineering of genetic networks, from discrete Boolean networks, to continuous linear and non-linear models. We conclude that the combination of predictive modeling with systematic experimental verification will be required to gain a deeper insight into living organisms, therapeutic targeting, and bioengineering.
Deciphering microbial interactions and detecting keystone species with co-occurrence networks

PubMed Central

Berry, David; Widder, Stefanie

2014-01-01

Co-occurrence networks produced from microbial survey sequencing data are frequently used to identify interactions between community members. While this approach has potential to reveal ecological processes, it has been insufficiently validated due to the technical limitations inherent in studying complex microbial ecosystems. Here, we simulate multi-species microbial communities with known interaction patterns using generalized Lotka-Volterra dynamics. We then construct co-occurrence networks and evaluate how well networks reveal the underlying interactions and how experimental and ecological parameters can affect network inference and interpretation. We find that co-occurrence networks can recapitulate interaction networks under certain conditions, but that they lose interpretability when the effects of habitat filtering become significant. We demonstrate that networks suffer from local hot spots of spurious correlation in the neighborhood of hub species that engage in many interactions. We also identify topological features associated with keystone species in co-occurrence networks. This study provides a substantiated framework to guide environmental microbiologists in the construction and interpretation of co-occurrence networks from microbial survey datasets. PMID:24904535
An Integrated Cell Purification and Genomics Strategy Reveals Multiple Regulators of Pancreas Development

PubMed Central

Benitez, Cecil M.; Qu, Kun; Sugiyama, Takuya; Pauerstein, Philip T.; Liu, Yinghua; Tsai, Jennifer; Gu, Xueying; Ghodasara, Amar; Arda, H. Efsun; Zhang, Jiajing; Dekker, Joseph D.; Tucker, Haley O.; Chang, Howard Y.; Kim, Seung K.

2014-01-01

The regulatory logic underlying global transcriptional programs controlling development of visceral organs like the pancreas remains undiscovered. Here, we profiled gene expression in 12 purified populations of fetal and adult pancreatic epithelial cells representing crucial progenitor cell subsets, and their endocrine or exocrine progeny. Using probabilistic models to decode the general programs organizing gene expression, we identified co-expressed gene sets in cell subsets that revealed patterns and processes governing progenitor cell development, lineage specification, and endocrine cell maturation. Purification of Neurog3 mutant cells and module network analysis linked established regulators such as Neurog3 to unrecognized gene targets and roles in pancreas development. Iterative module network analysis nominated and prioritized transcriptional regulators, including diabetes risk genes. Functional validation of a subset of candidate regulators with corresponding mutant mice revealed that the transcription factors Etv1, Prdm16, Runx1t1 and Bcl11a are essential for pancreas development. Our integrated approach provides a unique framework for identifying regulatory genes and functional gene sets underlying pancreas development and associated diseases such as diabetes mellitus. PMID:25330008
Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns.

PubMed

Gruel, Jérémy; LeBorgne, Michel; LeMeur, Nolwenn; Théret, Nathalie

2011-09-12

Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks.
Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns

PubMed Central

2011-01-01

Background Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Results Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Conclusions Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks. PMID:21910886
Deregulation of an imprinted gene network in prostate cancer

PubMed Central

Ribarska, Teodora; Goering, Wolfgang; Droop, Johanna; Bastian, Klaus-Marius; Ingenwerth, Marc; Schulz, Wolfgang A

2014-01-01

Multiple epigenetic alterations contribute to prostate cancer progression by deregulating gene expression. Epigenetic mechanisms, especially differential DNA methylation at imprinting control regions (termed DMRs), normally ensure the exclusive expression of imprinted genes from one specific parental allele. We therefore wondered to which extent imprinted genes become deregulated in prostate cancer and, if so, whether deregulation is due to altered DNA methylation at DMRs. Therefore, we selected presumptive deregulated imprinted genes from a previously conducted in silico analysis and from the literature and analyzed their expression in prostate cancer tissues by qRT-PCR. We found significantly diminished expression of PLAGL1/ZAC1, MEG3, NDN, CDKN1C, IGF2, and H19, while LIT1 was significantly overexpressed. The PPP1R9A gene, which is imprinted in selected tissues only, was strongly overexpressed, but was expressed biallelically in benign and cancerous prostatic tissues. Expression of many of these genes was strongly correlated, suggesting co-regulation, as in an imprinted gene network (IGN) reported in mice. Deregulation of the network genes also correlated with EZH2 and HOXC6 overexpression. Pyrosequencing analysis of all relevant DMRs revealed generally stable DNA methylation between benign and cancerous prostatic tissues, but frequent hypo- and hyper-methylation was observed at the H19 DMR in both benign and cancerous tissues. Re-expression of the ZAC1 transcription factor induced H19, CDKN1C and IGF2, supporting its function as a nodal regulator of the IGN. Our results indicate that a group of imprinted genes are coordinately deregulated in prostate cancers, independently of DNA methylation changes. PMID:24513574
Deregulation of an imprinted gene network in prostate cancer.

PubMed

Ribarska, Teodora; Goering, Wolfgang; Droop, Johanna; Bastian, Klaus-Marius; Ingenwerth, Marc; Schulz, Wolfgang A

2014-05-01

Multiple epigenetic alterations contribute to prostate cancer progression by deregulating gene expression. Epigenetic mechanisms, especially differential DNA methylation at imprinting control regions (termed DMRs), normally ensure the exclusive expression of imprinted genes from one specific parental allele. We therefore wondered to which extent imprinted genes become deregulated in prostate cancer and, if so, whether deregulation is due to altered DNA methylation at DMRs. Therefore, we selected presumptive deregulated imprinted genes from a previously conducted in silico analysis and from the literature and analyzed their expression in prostate cancer tissues by qRT-PCR. We found significantly diminished expression of PLAGL1/ZAC1, MEG3, NDN, CDKN1C, IGF2, and H19, while LIT1 was significantly overexpressed. The PPP1R9A gene, which is imprinted in selected tissues only, was strongly overexpressed, but was expressed biallelically in benign and cancerous prostatic tissues. Expression of many of these genes was strongly correlated, suggesting co-regulation, as in an imprinted gene network (IGN) reported in mice. Deregulation of the network genes also correlated with EZH2 and HOXC6 overexpression. Pyrosequencing analysis of all relevant DMRs revealed generally stable DNA methylation between benign and cancerous prostatic tissues, but frequent hypo- and hyper-methylation was observed at the H19 DMR in both benign and cancerous tissues. Re-expression of the ZAC1 transcription factor induced H19, CDKN1C and IGF2, supporting its function as a nodal regulator of the IGN. Our results indicate that a group of imprinted genes are coordinately deregulated in prostate cancers, independently of DNA methylation changes.
Using scale and feather traits for module construction provides a functional approach to chicken epidermal development.

PubMed

Bao, Weier; Greenwold, Matthew J; Sawyer, Roger H

2017-11-01

Gene co-expression network analysis has been a research method widely used in systematically exploring gene function and interaction. Using the Weighted Gene Co-expression Network Analysis (WGCNA) approach to construct a gene co-expression network using data from a customized 44K microarray transcriptome of chicken epidermal embryogenesis, we have identified two distinct modules that are highly correlated with scale or feather development traits. Signaling pathways related to feather development were enriched in the traditional KEGG pathway analysis and functional terms relating specifically to embryonic epidermal development were also enriched in the Gene Ontology analysis. Significant enrichment annotations were discovered from customized enrichment tools such as Modular Single-Set Enrichment Test (MSET) and Medical Subject Headings (MeSH). Hub genes in both trait-correlated modules showed strong specific functional enrichment toward epidermal development. Also, regulatory elements, such as transcription factors and miRNAs, were targeted in the significant enrichment result. This work highlights the advantage of this methodology for functional prediction of genes not previously associated with scale- and feather trait-related modules.
Integrative Transcriptomic Analysis Uncovers Novel Gene Modules That Underlie the Sulfate Response in Arabidopsis thaliana

PubMed Central

Henríquez-Valencia, Carlos; Arenas-M, Anita; Medina, Joaquín; Canales, Javier

2018-01-01

Sulfur is an essential nutrient for plant growth and development. Sulfur is a constituent of proteins, the plasma membrane and cell walls, among other important cellular components. To obtain new insights into the gene regulatory networks underlying the sulfate response, we performed an integrative meta-analysis of transcriptomic data from five different sulfate experiments available in public databases. This bioinformatic approach allowed us to identify a robust set of genes whose expression depends only on sulfate availability, indicating that those genes play an important role in the sulfate response. In relation to sulfate metabolism, the biological function of approximately 45% of these genes is currently unknown. Moreover, we found several consistent Gene Ontology terms related to biological processes that have not been extensively studied in the context of the sulfate response; these processes include cell wall organization, carbohydrate metabolism, nitrogen compound transport, and the regulation of proteolysis. Gene co-expression network analyses revealed relationships between the sulfate-responsive genes that were distributed among seven function-specific co-expression modules. The most connected genes in the sulfate co-expression network belong to a module related to the carbon response, suggesting that this biological function plays an important role in the control of the sulfate response. Temporal analyses of the network suggest that sulfate starvation generates a biphasic response, which involves that major changes in gene expression occur during both the early and late responses. Network analyses predicted that the sulfate response is regulated by a limited number of transcription factors, including MYBs, bZIPs, and NF-YAs. In conclusion, our analysis identified new candidate genes and provided new hypotheses to advance our understanding of the transcriptional regulation of sulfate metabolism in plants. PMID:29692794
Integrative Transcriptomic Analysis Uncovers Novel Gene Modules That Underlie the Sulfate Response in Arabidopsis thaliana.

PubMed

Henríquez-Valencia, Carlos; Arenas-M, Anita; Medina, Joaquín; Canales, Javier

2018-01-01

Sulfur is an essential nutrient for plant growth and development. Sulfur is a constituent of proteins, the plasma membrane and cell walls, among other important cellular components. To obtain new insights into the gene regulatory networks underlying the sulfate response, we performed an integrative meta-analysis of transcriptomic data from five different sulfate experiments available in public databases. This bioinformatic approach allowed us to identify a robust set of genes whose expression depends only on sulfate availability, indicating that those genes play an important role in the sulfate response. In relation to sulfate metabolism, the biological function of approximately 45% of these genes is currently unknown. Moreover, we found several consistent Gene Ontology terms related to biological processes that have not been extensively studied in the context of the sulfate response; these processes include cell wall organization, carbohydrate metabolism, nitrogen compound transport, and the regulation of proteolysis. Gene co-expression network analyses revealed relationships between the sulfate-responsive genes that were distributed among seven function-specific co-expression modules. The most connected genes in the sulfate co-expression network belong to a module related to the carbon response, suggesting that this biological function plays an important role in the control of the sulfate response. Temporal analyses of the network suggest that sulfate starvation generates a biphasic response, which involves that major changes in gene expression occur during both the early and late responses. Network analyses predicted that the sulfate response is regulated by a limited number of transcription factors, including MYBs, bZIPs, and NF-YAs. In conclusion, our analysis identified new candidate genes and provided new hypotheses to advance our understanding of the transcriptional regulation of sulfate metabolism in plants.
NETWORK ASSISTED ANALYSIS TO REVEAL THE GENETIC BASIS OF AUTISM1

PubMed Central

Liu, Li; Lei, Jing; Roeder, Kathryn

2016-01-01

While studies show that autism is highly heritable, the nature of the genetic basis of this disorder remains illusive. Based on the idea that highly correlated genes are functionally interrelated and more likely to affect risk, we develop a novel statistical tool to find more potentially autism risk genes by combining the genetic association scores with gene co-expression in specific brain regions and periods of development. The gene dependence network is estimated using a novel partial neighborhood selection (PNS) algorithm, where node specific properties are incorporated into network estimation for improved statistical and computational efficiency. Then we adopt a hidden Markov random field (HMRF) model to combine the estimated network and the genetic association scores in a systematic manner. The proposed modeling framework can be naturally extended to incorporate additional structural information concerning the dependence between genes. Using currently available genetic association data from whole exome sequencing studies and brain gene expression levels, the proposed algorithm successfully identified 333 genes that plausibly affect autism risk. PMID:27134692
Use of transcriptome sequencing to understand the pistillate flowering in hickory (Carya cathayensis Sarg.).

PubMed

Huang, You-Jun; Liu, Li-Li; Huang, Jian-Qin; Wang, Zheng-Jia; Chen, Fang-Fang; Zhang, Qi-Xiang; Zheng, Bing-Song; Chen, Ming

2013-10-10

Different from herbaceous plants, the woody plants undergo a long-period vegetative stage to achieve floral transition. They then turn into seasonal plants, flowering annually. In this study, a preliminary model of gene regulations for seasonal pistillate flowering in hickory (Carya cathayensis) was proposed. The genome-wide dynamic transcriptome was characterized via the joint-approach of RNA sequencing and microarray analysis. Differential transcript abundance analysis uncovered the dynamic transcript abundance patterns of flowering correlated genes and their major functions based on Gene Ontology (GO) analysis. To explore pistillate flowering mechanism in hickory, a comprehensive flowering gene regulatory network based on Arabidopsis thaliana was constructed by additional literature mining. A total of 114 putative flowering or floral genes including 31 with differential transcript abundance were identified in hickory. The locations, functions and dynamic transcript abundances were analyzed in the gene regulatory networks. A genome-wide co-expression network for the putative flowering or floral genes shows three flowering regulatory modules corresponding to response to light abiotic stimulus, cold stress, and reproductive development process, respectively. Totally 27 potential flowering or floral genes were recruited which are meaningful to understand the hickory specific seasonal flowering mechanism better. Flowering event of pistillate flower bud in hickory is triggered by several pathways synchronously including the photoperiod, autonomous, vernalization, gibberellin, and sucrose pathway. Totally 27 potential flowering or floral genes were recruited from the genome-wide co-expression network function module analysis. Moreover, the analysis provides a potential FLC-like gene based vernalization pathway and an 'AC' model for pistillate flower development in hickory. This work provides an available framework for pistillate flower development in hickory, which is significant for insight into regulation of flowering and floral development of woody plants.

Use of transcriptome sequencing to understand the pistillate flowering in hickory (Carya cathayensis Sarg.)

PubMed Central

2013-01-01

Background Different from herbaceous plants, the woody plants undergo a long-period vegetative stage to achieve floral transition. They then turn into seasonal plants, flowering annually. In this study, a preliminary model of gene regulations for seasonal pistillate flowering in hickory (Carya cathayensis) was proposed. The genome-wide dynamic transcriptome was characterized via the joint-approach of RNA sequencing and microarray analysis. Results Differential transcript abundance analysis uncovered the dynamic transcript abundance patterns of flowering correlated genes and their major functions based on Gene Ontology (GO) analysis. To explore pistillate flowering mechanism in hickory, a comprehensive flowering gene regulatory network based on Arabidopsis thaliana was constructed by additional literature mining. A total of 114 putative flowering or floral genes including 31 with differential transcript abundance were identified in hickory. The locations, functions and dynamic transcript abundances were analyzed in the gene regulatory networks. A genome-wide co-expression network for the putative flowering or floral genes shows three flowering regulatory modules corresponding to response to light abiotic stimulus, cold stress, and reproductive development process, respectively. Totally 27 potential flowering or floral genes were recruited which are meaningful to understand the hickory specific seasonal flowering mechanism better. Conclusions Flowering event of pistillate flower bud in hickory is triggered by several pathways synchronously including the photoperiod, autonomous, vernalization, gibberellin, and sucrose pathway. Totally 27 potential flowering or floral genes were recruited from the genome-wide co-expression network function module analysis. Moreover, the analysis provides a potential FLC-like gene based vernalization pathway and an 'AC’ model for pistillate flower development in hickory. This work provides an available framework for pistillate flower development in hickory, which is significant for insight into regulation of flowering and floral development of woody plants. PMID:24106755
DEFINING THE PLAYERS IN HIGHER-ORDER NETWORKS: PREDICTIVE MODELING FOR REVERSE ENGINEERING FUNCTIONAL INFLUENCE NETWORKS

DOE Office of Scientific and Technical Information (OSTI.GOV)

McDermott, Jason E.; Costa, Michelle N.; Stevens, S.L.

A difficult problem that is currently growing rapidly due to the sharp increase in the amount of high-throughput data available for many systems is that of determining useful and informative causative influence networks. These networks can be used to predict behavior given observation of a small number of components, predict behavior at a future time point, or identify components that are critical to the functioning of the system under particular conditions. In these endeavors incorporating observations of systems from a wide variety of viewpoints can be particularly beneficial, but has often been undertaken with the objective of inferring networks thatmore » are generally applicable. The focus of the current work is to integrate both general observations and measurements taken for a particular pathology, that of ischemic stroke, to provide improved ability to produce useful predictions of systems behavior. A number of hybrid approaches have recently been proposed for network generation in which the Gene Ontology is used to filter or enrich network links inferred from gene expression data through reverse engineering methods. These approaches have been shown to improve the biological plausibility of the inferred relationships determined, but still treat knowledge-based and machine-learning inferences as incommensurable inputs. In this paper, we explore how further improvements may be achieved through a full integration of network inference insights achieved through application of the Gene Ontology and reverse engineering methods with specific reference to the construction of dynamic models of transcriptional regulatory networks. We show that integrating two approaches to network construction, one based on reverse-engineering from conditional transcriptional data, one based on reverse-engineering from in situ hybridization data, and another based on functional associations derived from Gene Ontology, using probabilities can improve results of clustering as evaluated by a predictive model of transcriptional expression levels.« less
Hierarchical cortical transcriptome disorganization in autism.

PubMed

Lombardo, Michael V; Courchesne, Eric; Lewis, Nathan E; Pramparo, Tiziano

2017-01-01

Autism spectrum disorders (ASD) are etiologically heterogeneous and complex. Functional genomics work has begun to identify a diverse array of dysregulated transcriptomic programs (e.g., synaptic, immune, cell cycle, DNA damage, WNT signaling, cortical patterning and differentiation) potentially involved in ASD brain abnormalities during childhood and adulthood. However, it remains unclear whether such diverse dysregulated pathways are independent of each other or instead reflect coordinated hierarchical systems-level pathology. Two ASD cortical transcriptome datasets were re-analyzed using consensus weighted gene co-expression network analysis (WGCNA) to identify common co-expression modules across datasets. Linear mixed-effect models and Bayesian replication statistics were used to identify replicable differentially expressed modules. Eigengene network analysis was then utilized to identify between-group differences in how co-expression modules interact and cluster into hierarchical meta-modular organization. Protein-protein interaction analyses were also used to determine whether dysregulated co-expression modules show enhanced interactions. We find replicable evidence for 10 gene co-expression modules that are differentially expressed in ASD cortex. Rather than being independent non-interacting sources of pathology, these dysregulated co-expression modules work in synergy and physically interact at the protein level. These systems-level transcriptional signals are characterized by downregulation of synaptic processes coordinated with upregulation of immune/inflammation, response to other organism, catabolism, viral processes, translation, protein targeting and localization, cell proliferation, and vasculature development. Hierarchical organization of meta-modules (clusters of highly correlated modules) is also highly affected in ASD. These findings highlight that dysregulation of the ASD cortical transcriptome is characterized by the dysregulation of multiple coordinated transcriptional programs producing synergistic systems-level effects that cannot be fully appreciated by studying the individual component biological processes in isolation.
Model-based design of RNA hybridization networks implemented in living cells

PubMed Central

Rodrigo, Guillermo; Prakash, Satya; Shen, Shensi; Majer, Eszter

2017-01-01

Abstract Synthetic gene circuits allow the behavior of living cells to be reprogrammed, and non-coding small RNAs (sRNAs) are increasingly being used as programmable regulators of gene expression. However, sRNAs (natural or synthetic) are generally used to regulate single target genes, while complex dynamic behaviors would require networks of sRNAs regulating each other. Here, we report a strategy for implementing such networks that exploits hybridization reactions carried out exclusively by multifaceted sRNAs that are both targets of and triggers for other sRNAs. These networks are ultimately coupled to the control of gene expression. We relied on a thermodynamic model of the different stable conformational states underlying this system at the nucleotide level. To test our model, we designed five different RNA hybridization networks with a linear architecture, and we implemented them in Escherichia coli. We validated the network architecture at the molecular level by native polyacrylamide gel electrophoresis, as well as the network function at the bacterial population and single-cell levels with a fluorescent reporter. Our results suggest that it is possible to engineer complex cellular programs based on RNA from first principles. Because these networks are mainly based on physical interactions, our designs could be expanded to other organisms as portable regulatory resources or to implement biological computations. PMID:28934501
Gene co-expression networks shed light into diseases of brain iron accumulation

PubMed Central

Bettencourt, Conceição; Forabosco, Paola; Wiethoff, Sarah; Heidari, Moones; Johnstone, Daniel M.; Botía, Juan A.; Collingwood, Joanna F.; Hardy, John; Milward, Elizabeth A.; Ryten, Mina; Houlden, Henry

2016-01-01

Aberrant brain iron deposition is observed in both common and rare neurodegenerative disorders, including those categorized as Neurodegeneration with Brain Iron Accumulation (NBIA), which are characterized by focal iron accumulation in the basal ganglia. Two NBIA genes are directly involved in iron metabolism, but whether other NBIA-related genes also regulate iron homeostasis in the human brain, and whether aberrant iron deposition contributes to neurodegenerative processes remains largely unknown. This study aims to expand our understanding of these iron overload diseases and identify relationships between known NBIA genes and their main interacting partners by using a systems biology approach. We used whole-transcriptome gene expression data from human brain samples originating from 101 neuropathologically normal individuals (10 brain regions) to generate weighted gene co-expression networks and cluster the 10 known NBIA genes in an unsupervised manner. We investigated NBIA-enriched networks for relevant cell types and pathways, and whether they are disrupted by iron loading in NBIA diseased tissue and in an in vivo mouse model. We identified two basal ganglia gene co-expression modules significantly enriched for NBIA genes, which resemble neuronal and oligodendrocytic signatures. These NBIA gene networks are enriched for iron-related genes, and implicate synapse and lipid metabolism related pathways. Our data also indicates that these networks are disrupted by excessive brain iron loading. We identified multiple cell types in the origin of NBIA disorders. We also found unforeseen links between NBIA networks and iron-related processes, and demonstrate convergent pathways connecting NBIAs and phenotypically overlapping diseases. Our results are of further relevance for these diseases by providing candidates for new causative genes and possible points for therapeutic intervention. PMID:26707700
Gene co-expression networks shed light into diseases of brain iron accumulation.

PubMed

Bettencourt, Conceição; Forabosco, Paola; Wiethoff, Sarah; Heidari, Moones; Johnstone, Daniel M; Botía, Juan A; Collingwood, Joanna F; Hardy, John; Milward, Elizabeth A; Ryten, Mina; Houlden, Henry

2016-03-01

Aberrant brain iron deposition is observed in both common and rare neurodegenerative disorders, including those categorized as Neurodegeneration with Brain Iron Accumulation (NBIA), which are characterized by focal iron accumulation in the basal ganglia. Two NBIA genes are directly involved in iron metabolism, but whether other NBIA-related genes also regulate iron homeostasis in the human brain, and whether aberrant iron deposition contributes to neurodegenerative processes remains largely unknown. This study aims to expand our understanding of these iron overload diseases and identify relationships between known NBIA genes and their main interacting partners by using a systems biology approach. We used whole-transcriptome gene expression data from human brain samples originating from 101 neuropathologically normal individuals (10 brain regions) to generate weighted gene co-expression networks and cluster the 10 known NBIA genes in an unsupervised manner. We investigated NBIA-enriched networks for relevant cell types and pathways, and whether they are disrupted by iron loading in NBIA diseased tissue and in an in vivo mouse model. We identified two basal ganglia gene co-expression modules significantly enriched for NBIA genes, which resemble neuronal and oligodendrocytic signatures. These NBIA gene networks are enriched for iron-related genes, and implicate synapse and lipid metabolism related pathways. Our data also indicates that these networks are disrupted by excessive brain iron loading. We identified multiple cell types in the origin of NBIA disorders. We also found unforeseen links between NBIA networks and iron-related processes, and demonstrate convergent pathways connecting NBIAs and phenotypically overlapping diseases. Our results are of further relevance for these diseases by providing candidates for new causative genes and possible points for therapeutic intervention. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Mimosa: Mixture Model of Co-expression to Detect Modulators of Regulatory Interaction

NASA Astrophysics Data System (ADS)

Hansen, Matthew; Everett, Logan; Singh, Larry; Hannenhalli, Sridhar

Functionally related genes tend to be correlated in their expression patterns across multiple conditions and/or tissue-types. Thus co-expression networks are often used to investigate functional groups of genes. In particular, when one of the genes is a transcription factor (TF), the co-expression-based interaction is interpreted, with caution, as a direct regulatory interaction. However, any particular TF, and more importantly, any particular regulatory interaction, is likely to be active only in a subset of experimental conditions. Moreover, the subset of expression samples where the regulatory interaction holds may be marked by presence or absence of a modifier gene, such as an enzyme that post-translationally modifies the TF. Such subtlety of regulatory interactions is overlooked when one computes an overall expression correlation. Here we present a novel mixture modeling approach where a TF-Gene pair is presumed to be significantly correlated (with unknown coefficient) in a (unknown) subset of expression samples. The parameters of the model are estimated using a Maximum Likelihood approach. The estimated mixture of expression samples is then mined to identify genes potentially modulating the TF-Gene interaction. We have validated our approach using synthetic data and on three biological cases in cow and in yeast. While limited in some ways, as discussed, the work represents a novel approach to mine expression data and detect potential modulators of regulatory interactions.
RNA Sequencing and Bioinformatics Analysis Implicate the Regulatory Role of a Long Noncoding RNA-mRNA Network in Hepatic Stellate Cell Activation.

PubMed

Guo, Can-Jie; Xiao, Xiao; Sheng, Li; Chen, Lili; Zhong, Wei; Li, Hai; Hua, Jing; Ma, Xiong

2017-01-01

To analyze the long noncoding (lncRNA)-mRNA expression network and potential roles in rat hepatic stellate cells (HSCs) during activation. LncRNA expression was analyzed in quiescent and culture-activated HSCs by RNA sequencing, and differentially expressed lncRNAs verified by quantitative reverse transcription polymerase chain reaction (qRT-PCR) were subjected to bioinformatics analysis. In vivo analyses of differential lncRNA-mRNA expression were performed on a rat model of liver fibrosis. We identified upregulation of 12 lncRNAs and 155 mRNAs and downregulation of 12 lncRNAs and 374 mRNAs in activated HSCs. Additionally, we identified the differential expression of upregulated lncRNAs (NONRATT012636.2, NONRATT016788.2, and NONRATT021402.2) and downregulated lncRNAs (NONRATT007863.2, NONRATT019720.2, and NONRATT024061.2) in activated HSCs relative to levels observed in quiescent HSCs, and Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses showed that changes in lncRNAs associated with HSC activation revealed 11 significantly enriched pathways according to their predicted targets. Moreover, based on the predicted co-expression network, the relative dynamic levels of NONRATT013819.2 and lysyl oxidase (Lox) were compared during HSC activation both in vitro and in vivo. Our results confirmed the upregulation of lncRNA NONRATT013819.2 and Lox mRNA associated with the extracellular matrix (ECM)-related signaling pathway in HSCs and fibrotic livers. Our results detailing a dysregulated lncRNA-mRNA network might provide new treatment strategies for hepatic fibrosis based on findings indicating potentially critical roles for NONRATT013819.2 and Lox in ECM remodeling during HSC activation. © 2017 The Author(s). Published by S. Karger AG, Basel.
When is hub gene selection better than standard meta-analysis?

PubMed

Langfelder, Peter; Mischel, Paul S; Horvath, Steve

2013-01-01

Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to gene expression data and presents novel R functions for carrying out consensus network analysis, network based screening, and meta analysis.
Profiling and Co-expression Network Analysis of Learned Helplessness Regulated mRNAs and lncRNAs in the Mouse Hippocampus

PubMed Central

Li, Chaoqun; Cao, Feifei; Li, Shengli; Huang, Shenglin; Li, Wei; Abumaria, Nashat

2018-01-01

Although studies provide insights into the neurobiology of stress and depression, the exact molecular mechanisms underlying their pathologies remain largely unknown. Long non-coding RNA (lncRNA) has been implicated in brain functions and behavior. A potential link between lncRNA and psychiatric disorders has been proposed. However, it remains undetermined whether IncRNA regulation, in the brain, contributes to stress or depression pathologies. In this study, we used a valid animal model of depression-like symptoms; namely learned helplessness, RNA-seq, Gene Ontology and co-expression network analyses to profile the expression pattern of lncRNA and mRNA in the hippocampus of mice. We identified 6346 differentially expressed transcripts. Among them, 340 lncRNAs and 3559 protein coding mRNAs were differentially expressed in helpless mice in comparison with control and/or non-helpless mice (inescapable stress resilient mice). Gene Ontology and pathway enrichment analyses indicated that induction of helplessness altered expression of mRNAs enriched in fundamental biological functions implicated in stress/depression neurobiology such as synaptic, metabolic, cell survival and proliferation, developmental and chromatin modification functions. To explore the possible regulatory roles of the altered lncRNAs, we constructed co-expression networks composed of the lncRNAs and mRNAs. Among our differentially expressed lncRNAs, 17% showed significant correlation with genes. Functional co-expression analysis linked the identified lncRNAs to several cellular mechanisms implicated in stress/depression neurobiology. Importantly, 57% of the identified regulatory lncRNAs significantly correlated with 18 different synapse-related functions. Thus, the current study identifies for the first time distinct groups of lncRNAs regulated by induction of learned helplessness in the mouse brain. Our results suggest that lncRNA-directed regulatory mechanisms might contribute to stress-induced pathologies; in particular, to inescapable stress-induced synaptic modifications. PMID:29375311
Profiling and Co-expression Network Analysis of Learned Helplessness Regulated mRNAs and lncRNAs in the Mouse Hippocampus.

PubMed

Li, Chaoqun; Cao, Feifei; Li, Shengli; Huang, Shenglin; Li, Wei; Abumaria, Nashat

2017-01-01

Although studies provide insights into the neurobiology of stress and depression, the exact molecular mechanisms underlying their pathologies remain largely unknown. Long non-coding RNA (lncRNA) has been implicated in brain functions and behavior. A potential link between lncRNA and psychiatric disorders has been proposed. However, it remains undetermined whether IncRNA regulation, in the brain, contributes to stress or depression pathologies. In this study, we used a valid animal model of depression-like symptoms; namely learned helplessness, RNA-seq, Gene Ontology and co-expression network analyses to profile the expression pattern of lncRNA and mRNA in the hippocampus of mice. We identified 6346 differentially expressed transcripts. Among them, 340 lncRNAs and 3559 protein coding mRNAs were differentially expressed in helpless mice in comparison with control and/or non-helpless mice (inescapable stress resilient mice). Gene Ontology and pathway enrichment analyses indicated that induction of helplessness altered expression of mRNAs enriched in fundamental biological functions implicated in stress/depression neurobiology such as synaptic, metabolic, cell survival and proliferation, developmental and chromatin modification functions. To explore the possible regulatory roles of the altered lncRNAs, we constructed co-expression networks composed of the lncRNAs and mRNAs. Among our differentially expressed lncRNAs, 17% showed significant correlation with genes. Functional co-expression analysis linked the identified lncRNAs to several cellular mechanisms implicated in stress/depression neurobiology. Importantly, 57% of the identified regulatory lncRNAs significantly correlated with 18 different synapse-related functions. Thus, the current study identifies for the first time distinct groups of lncRNAs regulated by induction of learned helplessness in the mouse brain. Our results suggest that lncRNA-directed regulatory mechanisms might contribute to stress-induced pathologies; in particular, to inescapable stress-induced synaptic modifications.
Evolutionary features of academic articles co-keyword network and keywords co-occurrence network: Based on two-mode affiliation network

NASA Astrophysics Data System (ADS)

Li, Huajiao; An, Haizhong; Wang, Yue; Huang, Jiachen; Gao, Xiangyun

2016-05-01

Keeping abreast of trends in the articles and rapidly grasping a body of article's key points and relationship from a holistic perspective is a new challenge in both literature research and text mining. As the important component, keywords can present the core idea of the academic article. Usually, articles on a single theme or area could share one or some same keywords, and we can analyze topological features and evolution of the articles co-keyword networks and keywords co-occurrence networks to realize the in-depth analysis of the articles. This paper seeks to integrate statistics, text mining, complex networks and visualization to analyze all of the academic articles on one given theme, complex network(s). All 5944 ;complex networks; articles that were published between 1990 and 2013 and are available on the Web of Science are extracted. Based on the two-mode affiliation network theory, a new frontier of complex networks, we constructed two different networks, one taking the articles as nodes, the co-keyword relationships as edges and the quantity of co-keywords as the weight to construct articles co-keyword network, and another taking the articles' keywords as nodes, the co-occurrence relationships as edges and the quantity of simultaneous co-occurrences as the weight to construct keyword co-occurrence network. An integrated method for analyzing the topological features and evolution of the articles co-keyword network and keywords co-occurrence networks is proposed, and we also defined a new function to measure the innovation coefficient of the articles in annual level. This paper provides a useful tool and process for successfully achieving in-depth analysis and rapid understanding of the trends and relationships of articles in a holistic perspective.
[Exploration of common biological pathways for attention deficit hyperactivity disorder and low birth weight].

PubMed

Xiang, Bo; Yu, Minglan; Liang, Xuemei; Lei, Wei; Huang, Chaohua; Chen, Jing; He, Wenying; Zhang, Tao; Li, Tao; Liu, Kezhi

2017-12-10

To explore common biological pathways for attention deficit hyperactivity disorder (ADHD) and low birth weight (LBW). Thei-Gsea4GwasV2 software was used to analyze the result of genome-wide association analysis (GWAS) for LBW (pathways were derived from Reactome), and nominally significant (P< 0.05, FDR< 0.25) pathways were tested for replication in ADHD.Significant pathways were analyzed with DAPPLE and Reatome FI software to identify genes involved in such pathways, with each cluster enriched with the gene ontology (GO). The Centiscape2.0 software was used to calculate the degree of genetic networks and the betweenness value to explore the core node (gene). Weighed gene co-expression network analysis (WGCNA) was then used to explore the co-expression of genes in these pathways.With gene expression data derived from BrainSpan, GO enrichment was carried out for each gene module. Eleven significant biological pathways was identified in association with LBW, among which two (Selenoamino acid metabolism and Diseases associated with glycosaminoglycan metabolism) were replicated during subsequent ADHD analysis. Network analysis of 130 genes in these pathways revealed that some of the sub-networksare related with morphology of cerebellum, development of hippocampus, and plasticity of synaptic structure. Upon co-expression network analysis, 120 genes passed the quality control and were found to express in 3 gene modules. These modules are mainly related to the regulation of synaptic structure and activity regulation. ADHD and LBW share some biological regulation processes. Anomalies of such proces sesmay predispose to ADHD.
An interactive network of elastase, secretases, and PAR-2 protein regulates CXCR1 receptor surface expression on neutrophils.

PubMed

Bakele, Martina; Lotz-Havla, Amelie S; Jakowetz, Anja; Carevic, Melanie; Marcos, Veronica; Muntau, Ania C; Gersting, Soeren W; Hartl, Dominik

2014-07-25

CXCL8 (IL-8) recruits and activates neutrophils through the G protein-coupled chemokine receptor CXCR1. We showed previously that elastase cleaves CXCR1 and thereby impairs antibacterial host defense. However, the molecular intracellular machinery involved in this process remained undefined. Here we demonstrate by using flow cytometry, confocal microscopy, subcellular fractionation, co-immunoprecipitation, and bioluminescence resonance energy transfer that combined α- and γ-secretase activities are functionally involved in elastase-mediated regulation of CXCR1 surface expression on human neutrophils, whereas matrix metalloproteases are dispensable. We further demonstrate that PAR-2 is stored in mobilizable compartments in neutrophils. Bioluminescence resonance energy transfer and co-immunoprecipitation studies showed that secretases, PAR-2, and CXCR1 colocalize and physically interact in a novel protease/secretase-chemokine receptor network. PAR-2 blocking experiments provided evidence that elastase increased intracellular presenilin-1 expression through PAR-2 signaling. When viewed in combination, these studies establish a novel functional network of elastase, secretases, and PAR-2 that regulate CXCR1 expression on neutrophils. Interfering with this network could lead to novel therapeutic approaches in neutrophilic diseases, such as cystic fibrosis or rheumatoid arthritis.
An Interactive Network of Elastase, Secretases, and PAR-2 Protein Regulates CXCR1 Receptor Surface Expression on Neutrophils*

PubMed Central

Bakele, Martina; Lotz-Havla, Amelie S.; Jakowetz, Anja; Carevic, Melanie; Marcos, Veronica; Muntau, Ania C.; Gersting, Soeren W.; Hartl, Dominik

2014-01-01

CXCL8 (IL-8) recruits and activates neutrophils through the G protein-coupled chemokine receptor CXCR1. We showed previously that elastase cleaves CXCR1 and thereby impairs antibacterial host defense. However, the molecular intracellular machinery involved in this process remained undefined. Here we demonstrate by using flow cytometry, confocal microscopy, subcellular fractionation, co-immunoprecipitation, and bioluminescence resonance energy transfer that combined α- and γ-secretase activities are functionally involved in elastase-mediated regulation of CXCR1 surface expression on human neutrophils, whereas matrix metalloproteases are dispensable. We further demonstrate that PAR-2 is stored in mobilizable compartments in neutrophils. Bioluminescence resonance energy transfer and co-immunoprecipitation studies showed that secretases, PAR-2, and CXCR1 colocalize and physically interact in a novel protease/secretase-chemokine receptor network. PAR-2 blocking experiments provided evidence that elastase increased intracellular presenilin-1 expression through PAR-2 signaling. When viewed in combination, these studies establish a novel functional network of elastase, secretases, and PAR-2 that regulate CXCR1 expression on neutrophils. Interfering with this network could lead to novel therapeutic approaches in neutrophilic diseases, such as cystic fibrosis or rheumatoid arthritis. PMID:24914212
Network-Induced Classification Kernels for Gene Expression Profile Analysis

PubMed Central

Dror, Gideon; Shamir, Ron

2012-01-01

Abstract Computational classification of gene expression profiles into distinct disease phenotypes has been highly successful to date. Still, robustness, accuracy, and biological interpretation of the results have been limited, and it was suggested that use of protein interaction information jointly with the expression profiles can improve the results. Here, we study three aspects of this problem. First, we show that interactions are indeed relevant by showing that co-expressed genes tend to be closer in the network of interactions. Second, we show that the improved performance of one extant method utilizing expression and interactions is not really due to the biological information in the network, while in another method this is not the case. Finally, we develop a new kernel method—called NICK—that integrates network and expression data for SVM classification, and demonstrate that overall it achieves better results than extant methods while running two orders of magnitude faster. PMID:22697242
Identification of potential transcriptomic markers in developing pediatric sepsis: a weighted gene co-expression network analysis and a case-control validation study.

PubMed

Li, Yiping; Li, Yanhong; Bai, Zhenjiang; Pan, Jian; Wang, Jian; Fang, Fang

2017-12-13

Sepsis represents a complex disease with the dysregulated inflammatory response and high mortality rate. The goal of this study was to identify potential transcriptomic markers in developing pediatric sepsis by a co-expression module analysis of the transcriptomic dataset. Using the R software and Bioconductor packages, we performed a weighted gene co-expression network analysis to identify co-expression modules significantly associated with pediatric sepsis. Functional interpretation (gene ontology and pathway analysis) and enrichment analysis with known transcription factors and microRNAs of the identified candidate modules were then performed. In modules significantly associated with sepsis, the intramodular analysis was further performed and "hub genes" were identified and validated by quantitative real-time PCR (qPCR) in this study. 15 co-expression modules in total were detected, and four modules ("midnight blue", "cyan", "brown", and "tan") were most significantly associated with pediatric sepsis and suggested as potential sepsis-associated modules. Gene ontology analysis and pathway analysis revealed that these four modules strongly associated with immune response. Three of the four sepsis-associated modules were also enriched with known transcription factors (false discovery rate-adjusted P < 0.05). Hub genes were identified in each of the four modules. Four of the identified hub genes (MYB proto-oncogene like 1, killer cell lectin like receptor G1, stomatin, and membrane spanning 4-domains A4A) were further validated to be differentially expressed between septic children and controls by qPCR. Four pediatric sepsis-associated co-expression modules were identified in this study. qPCR results suggest that hub genes in these modules are potential transcriptomic markers for pediatric sepsis diagnosis. These results provide novel insights into the pathogenesis of pediatric sepsis and promote the generation of diagnostic gene sets.
Genes associated with thermosensitive genic male sterility in rice identified by comparative expression profiling.

PubMed

Pan, Yufang; Li, Qiaofeng; Wang, Zhizheng; Wang, Yang; Ma, Rui; Zhu, Lili; He, Guangcun; Chen, Rongzhi

2014-12-16

Thermosensitive genic male sterile (TGMS) lines and photoperiod-sensitive genic male sterile (PGMS) lines have been successfully used in hybridization to improve rice yields. However, the molecular mechanisms underlying male sterility transitions in most PGMS/TGMS rice lines are unclear. In the recently developed TGMS-Co27 line, the male sterility is based on co-suppression of a UDP-glucose pyrophosphorylase gene (Ugp1), but further study is needed to fully elucidate the molecular mechanisms involved. Microarray-based transcriptome profiling of TGMS-Co27 and wild-type Hejiang 19 (H1493) plants grown at high and low temperatures revealed that 15462 probe sets representing 8303 genes were differentially expressed in the two lines, under the two conditions, or both. Environmental factors strongly affected global gene expression. Some genes important for pollen development were strongly repressed in TGMS-Co27 at high temperature. More significantly, series-cluster analysis of differentially expressed genes (DEGs) between TGMS-Co27 plants grown under the two conditions showed that low temperature induced the expression of a gene cluster. This cluster was found to be essential for sterility transition. It includes many meiosis stage-related genes that are probably important for thermosensitive male sterility in TGMS-Co27, inter alia: Arg/Ser-rich domain (RS)-containing zinc finger proteins, polypyrimidine tract-binding proteins (PTBs), DEAD/DEAH box RNA helicases, ZOS (C2H2 zinc finger proteins of Oryza sativa), at least one polyadenylate-binding protein and some other RNA recognition motif (RRM) domain-containing proteins involved in post-transcriptional processes, eukaryotic initiation factor 5B (eIF5B), ribosomal proteins (L37, L1p/L10e, L27 and L24), aminoacyl-tRNA synthetases (ARSs), eukaryotic elongation factor Tu (eEF-Tu) and a peptide chain release factor protein involved in translation. The differential expression of 12 DEGs that are important for pollen development, low temperature responses or TGMS was validated by quantitative RT-PCR (qRT-PCR). Temperature strongly affects global gene expression and may be the common regulator of fertility in PGMS/TGMS rice lines. The identified expression changes reflect perturbations in the transcriptomic regulation of pollen development networks in TGMS-Co27. Findings from this and previous studies indicate that sets of genes involved in post-transcriptional and translation processes are involved in thermosensitive male sterility transitions in TGMS-Co27.
Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

PubMed

Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

2017-11-13

The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly correlated with BMI (r = 0.56, P = 0.04), and hub genes of KCNN1 and AQP10 were differentially expressed. We identified significant genes and specific modules potentially related to BMI based on the gene expression profile data of monozygotic twins. The findings may help further elucidate the underlying mechanisms of obesity development and provide novel insights to research potential gene biomarkers and signaling pathways for obesity treatment. Further analysis and validation of the findings reported here are important and necessary when more sample size is acquired.
On the stochastic dissemination of faults in an admissible network

NASA Technical Reports Server (NTRS)

Kyrala, A.

1987-01-01

The dynamic distribution of faults in a general type network is discussed. The starting point is a uniquely branched network in which each pair of nodes is connected by a single branch. Mathematical expressions for the uniquely branched network transition matrix are derived to show that sufficient stationarity exists to ensure the validity of the use of the Markov Chain model to analyze networks. In addition the conditions for the use of Semi-Markov models are discussed. General mathematical expressions are derived in an examination of branch redundancy techniques commonly used to increase reliability.

High-Density, High-Resolution, Low-Cost Air Quality Sensor Networks for Urban Air Monitoring

NASA Astrophysics Data System (ADS)

Mead, M. I.; Popoola, O. A.; Stewart, G.; Bright, V.; Kaye, P.; Saffell, J.

2012-12-01

Monitoring air quality in highly granular environments such as urban areas which are spatially heterogeneous with variable emission sources, measurements need to be made at appropriate spatial and temporal scales. Current routine air quality monitoring networks generally are either composed of sparse expensive installations (incorporating e.g. chemiluminescence instruments) or higher density low time resolution systems (e.g. NO2 diffusion tubes). Either approach may not accurately capture important effects such as pollutant "hot spots" or adequately capture spatial (or temporal) variability. As a result, analysis based on data from traditional low spatial resolution networks, such as personal exposure, may be inaccurate. In this paper we present details of a sophisticated, low-cost, multi species (gas phase, speciated PM, meteorology) air quality measurement network methodology incorporating GPS and GPRS which has been developed for high resolution air quality measurements in urban areas. Sensor networks developed in the Centre for Atmospheric Science (University of Cambridge) incorporated electrochemical gas sensors configured for use in urban air quality studies operating at parts-per-billion (ppb) levels. It has been demonstrated that these sensors can be used to measure key air quality gases such as CO, NO and NO2 at the low ppb mixing ratios present in the urban environment (estimated detection limits <4ppb for CO and NO and <1ppb for NO2. Mead et al (submitted Aug., 2012)). Based on this work, a state of the art multi species instrument package for deployment in scalable sensor networks has been developed which has general applicability. This is currently being employed as part of a major 3 year UK program at London Heathrow airport (the Sensor Networks for Air Quality (SNAQ) Heathrow project). The main project outcome is the creation of a calibrated, high spatial and temporal resolution data set for O3, NO, NO2, SO2, CO, CO2, VOCstotal, size-speciated PM, temperature, relative humidity, wind speed and direction. The network incorporates existing GPRS infrastructures for real time sending of data with low overheads in terms of cost, effort and installation. In this paper we present data from the SNAQ Heathrow project as well as previous deployments showing measurement capability at the ppb level for NO, NO2 and CO. We show that variability can be observed and measured quantitatively using these sensor networks over widely differing time scales from individual emission events, diurnal variability associated with traffic and meteorological conditions, through to longer term synoptic weather conditions and seasonal behaviour. This work demonstrates a widely applicable generic capability to urban areas, airports as well as other complex emissions environments making this sensor system methodology valuable for scientific, policy and regulatory issues. We conclude that the low-cost high-density network philosophy has the potential to provide a more complete assessment of the high-granularity air quality structure generally observed in the environment. Further, when appropriately deployed, has the potential to offer a new paradigm in air quality quantification and monitoring.
Preliminary Evidence for the Emergence of a Health Care Online Community of Practice: Using a Netnographic Framework for Twitter Hashtag Analytics.

PubMed

Roland, Damian; Spurr, Jesse; Cabrera, Daniel

2017-07-14

Online communities of practice (oCoPs) may emerge from interactions on social media. These communities offer an open digital space and flat role hierarchy for information sharing and provide a strong group identity, rapid flow of information, content curation, and knowledge translation. To date, there is only a small body of evidence in medicine or health care to verify the existence of an oCoP. We aimed to examine the emergence of an oCoP through the study of social media interactions of the free open access medical education (FOAM) movement. We examined social media activity in Twitter by analyzing the network centrality metrics of tweets with the #FOAMed hashtag and compared them with previously validated criteria of a community of practice (CoP). The centrality analytics of the FOAM community showed concordance with aspects of a general CoP (in terms of community, domain, and practice), as well as some specific traits of a health care community, including social control, common purpose, flat hierarchy, and network-based and concrete achievement. This study demonstrated preliminary evidence of an oCoP focused on education and based on social media interactions. Further examination of the topology of the network is needed to definitely prove the existence of an oCoP. Given that these communities result in significant knowledge translation and practice change, further research in this area appears warranted. ©Damian Roland, Jesse Spurr, Daniel Cabrera. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 14.07.2017.
Unsupervised, statistically-based systems biology approach for unraveling the genetics of complex traits: A demonstration with ethanol metabolism.

PubMed

Lusk, Ryan; Saba, Laura M; Vanderlinden, Lauren A; Zidek, Vaclav; Silhavy, Jan; Pravenec, Michal; Hoffman, Paula L; Tabakoff, Boris

2018-04-24

A statistical pipeline was developed and used for determining candidate genes and candidate gene co-expression networks involved in two alcohol (i.e., ethanol) metabolism phenotypes, namely alcohol clearance and acetate area under the curve (AUC) in a recombinant inbred (HXB/BXH) rat panel. The approach was also used to provide an indication of how ethanol metabolism can impact the normal function of the identified networks. RNA was extracted from alcohol-naïve liver tissue of 30 strains of HXB/BXH recombinant inbred rats. The reconstructed transcripts were quantitated and data was used to construct gene co-expression modules and networks. A separate group of rats, comprising the same 30 strains, were injected with ethanol (2 gm/kg) for measurement of blood ethanol and acetate levels. These data were used for QTL analysis of the rate of ethanol disappearance and circulating acetate levels. The analysis pipeline required calculation of the module eigengene values, the correction of these values with ethanol metabolism rates and acetate levels across the rat strains and the determination of the eigengene QTLs. For a module to be considered a candidate for determining phenotype, the module eigengene values had to have significant correlation with the strain phenotypic values and the module eigengene QTLs had to overlap the phenotypic QTLs. Of the 658 transcript co-expression modules generated from liver RNA sequencing data, a single module satisfied all criteria for being a candidate for determining the alcohol clearance trait. This module contained two alcohol dehydrogenase genes, including the gene whose product was previously shown to be responsible for the majority of alcohol elimination in the rat. This module was also the only module identified as a candidate for influencing circulating acetate levels. This module was also linked to the process of generation and utilization of retinoic acid as related to the autonomous immune response. We propose that our analytical pipeline can successfully identify genetic regions and transcripts which predispose a particular phenotype and our analysis provides functional context for co-expression module components. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Reconstruction of an Integrated Genome-Scale Co-Expression Network Reveals Key Modules Involved in Lung Adenocarcinoma

PubMed Central

Hosseini Ashtiani, Saman; Moeini, Ali; Nowzari-Dalini, Abbas; Masoudi-Nejad, Ali

2013-01-01

Our goal of this study was to reconstruct a “genome-scale co-expression network” and find important modules in lung adenocarcinoma so that we could identify the genes involved in lung adenocarcinoma. We integrated gene mutation, GWAS, CGH, array-CGH and SNP array data in order to identify important genes and loci in genome-scale. Afterwards, on the basis of the identified genes a co-expression network was reconstructed from the co-expression data. The reconstructed network was named “genome-scale co-expression network”. As the next step, 23 key modules were disclosed through clustering. In this study a number of genes have been identified for the first time to be implicated in lung adenocarcinoma by analyzing the modules. The genes EGFR, PIK3CA, TAF15, XIAP, VAPB, Appl1, Rab5a, ARF4, CLPTM1L, SP4, ZNF124, LPP, FOXP1, SOX18, MSX2, NFE2L2, SMARCC1, TRA2B, CBX3, PRPF6, ATP6V1C1, MYBBP1A, MACF1, GRM2, TBXA2R, PRKAR2A, PTK2, PGF and MYO10 are among the genes that belong to modules 1 and 22. All these genes, being implicated in at least one of the phenomena, namely cell survival, proliferation and metastasis, have an over-expression pattern similar to that of EGFR. In few modules, the genes such as CCNA2 (Cyclin A2), CCNB2 (Cyclin B2), CDK1, CDK5, CDC27, CDCA5, CDCA8, ASPM, BUB1, KIF15, KIF2C, NEK2, NUSAP1, PRC1, SMC4, SYCE2, TFDP1, CDC42 and ARHGEF9 are present that play a crucial role in cell cycle progression. In addition to the mentioned genes, there are some other genes (i.e. DLGAP5, BIRC5, PSMD2, Src, TTK, SENP2, PSMD2, DOK2, FUS and etc.) in the modules. PMID:23874428
Global Genetic Variations Predict Brain Response to Faces

PubMed Central

Dickie, Erin W.; Tahmasebi, Amir; French, Leon; Kovacevic, Natasa; Banaschewski, Tobias; Barker, Gareth J.; Bokde, Arun; Büchel, Christian; Conrod, Patricia; Flor, Herta; Garavan, Hugh; Gallinat, Juergen; Gowland, Penny; Heinz, Andreas; Ittermann, Bernd; Lawrence, Claire; Mann, Karl; Martinot, Jean-Luc; Nees, Frauke; Nichols, Thomas; Lathrop, Mark; Loth, Eva; Pausova, Zdenka; Rietschel, Marcela; Smolka, Michal N.; Ströhle, Andreas; Toro, Roberto; Schumann, Gunter; Paus, Tomáš

2014-01-01

Face expressions are a rich source of social signals. Here we estimated the proportion of phenotypic variance in the brain response to facial expressions explained by common genetic variance captured by ∼500,000 single nucleotide polymorphisms. Using genomic-relationship-matrix restricted maximum likelihood (GREML), we related this global genetic variance to that in the brain response to facial expressions, as assessed with functional magnetic resonance imaging (fMRI) in a community-based sample of adolescents (n = 1,620). Brain response to facial expressions was measured in 25 regions constituting a face network, as defined previously. In 9 out of these 25 regions, common genetic variance explained a significant proportion of phenotypic variance (40–50%) in their response to ambiguous facial expressions; this was not the case for angry facial expressions. Across the network, the strength of the genotype-phenotype relationship varied as a function of the inter-individual variability in the number of functional connections possessed by a given region (R2 = 0.38, p<0.001). Furthermore, this variability showed an inverted U relationship with both the number of observed connections (R2 = 0.48, p<0.001) and the magnitude of brain response (R2 = 0.32, p<0.001). Thus, a significant proportion of the brain response to facial expressions is predicted by common genetic variance in a subset of regions constituting the face network. These regions show the highest inter-individual variability in the number of connections with other network nodes, suggesting that the genetic model captures variations across the adolescent brains in co-opting these regions into the face network. PMID:25122193
CoPub: a literature-based keyword enrichment tool for microarray data analysis.

PubMed

Frijters, Raoul; Heupers, Bart; van Beek, Pieter; Bouwhuis, Maurice; van Schaik, René; de Vlieg, Jacob; Polman, Jan; Alkema, Wynand

2008-07-01

Medline is a rich information source, from which links between genes and keywords describing biological processes, pathways, drugs, pathologies and diseases can be extracted. We developed a publicly available tool called CoPub that uses the information in the Medline database for the biological interpretation of microarray data. CoPub allows batch input of multiple human, mouse or rat genes and produces lists of keywords from several biomedical thesauri that are significantly correlated with the set of input genes. These lists link to Medline abstracts in which the co-occurring input genes and correlated keywords are highlighted. Furthermore, CoPub can graphically visualize differentially expressed genes and over-represented keywords in a network, providing detailed insight in the relationships between genes and keywords, and revealing the most influential genes as highly connected hubs. CoPub is freely accessible at http://services.nbic.nl/cgi-bin/copub/CoPub.pl.
PerSubs: A Graph-Based Algorithm for the Identification of Perturbed Subpathways Caused by Complex Diseases.

PubMed

Vrahatis, Aristidis G; Rapti, Angeliki; Sioutas, Spyros; Tsakalidis, Athanasios

2017-01-01

In the era of Systems Biology and growing flow of omics experimental data from high throughput techniques, experimentalists are in need of more precise pathway-based tools to unravel the inherent complexity of diseases and biological processes. Subpathway-based approaches are the emerging generation of pathway-based analysis elucidating the biological mechanisms under the perspective of local topologies onto a complex pathway network. Towards this orientation, we developed PerSub, a graph-based algorithm which detects subpathways perturbed by a complex disease. The perturbations are imprinted through differentially expressed and co-expressed subpathways as recorded by RNA-seq experiments. Our novel algorithm is applied on data obtained from a real experimental study and the identified subpathways provide biological evidence for the brain aging.
A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways.

PubMed

Musungu, Bryan M; Bhatnagar, Deepak; Brown, Robert L; Payne, Gary A; OBrian, Greg; Fakhoury, Ahmad M; Geisler, Matt

2016-01-01

A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flavus , a link between aflatoxin production and vesicular transport was identified within the network. There was significant interspecies correlation of expression between Z. mays and A. flavus for a subset of 104 Z. mays , and 1942 A. flavus genes. This resulted in an interspecies subnetwork enriched in multiple Z. mays genes involved in the production of ROS. In addition to the ROS from Z. mays , there was enrichment in the vesicular transport pathways and the aflatoxin pathway for A. flavus . Included in these genes, a key aflatoxin cluster regulator, AflS, was found to be co-regulated with multiple Z. mays ROS producing genes within the network, suggesting AflS may be monitoring host ROS levels. The entire GEN for both host and pathogen, and the subset of interspecies correlations, is presented as a tool for hypothesis generation and discovery for events in the early stages of fungal infection of Z. mays by A. flavus .
A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways

PubMed Central

Musungu, Bryan M.; Bhatnagar, Deepak; Brown, Robert L.; Payne, Gary A.; OBrian, Greg; Fakhoury, Ahmad M.; Geisler, Matt

2016-01-01

A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flavus, a link between aflatoxin production and vesicular transport was identified within the network. There was significant interspecies correlation of expression between Z. mays and A. flavus for a subset of 104 Z. mays, and 1942 A. flavus genes. This resulted in an interspecies subnetwork enriched in multiple Z. mays genes involved in the production of ROS. In addition to the ROS from Z. mays, there was enrichment in the vesicular transport pathways and the aflatoxin pathway for A. flavus. Included in these genes, a key aflatoxin cluster regulator, AflS, was found to be co-regulated with multiple Z. mays ROS producing genes within the network, suggesting AflS may be monitoring host ROS levels. The entire GEN for both host and pathogen, and the subset of interspecies correlations, is presented as a tool for hypothesis generation and discovery for events in the early stages of fungal infection of Z. mays by A. flavus. PMID:27917194
Functional differentiation and spatial-temporal co-expression networks of the NBS-encoding gene family in Jilin ginseng, Panax ginseng C.A. Meyer.

PubMed

Yin, Rui; Zhao, Mingzhu; Wang, Kangyu; Lin, Yanping; Wang, Yanfang; Sun, Chunyu; Wang, Yi; Zhang, Meiping

2017-01-01

Ginseng, Panax ginseng C.A. Meyer, is one of the most important medicinal plants for human health and medicine. It has been documented that over 80% of genes conferring resistance to bacteria, viruses, fungi and nematodes are contributed by the nucleotide binding site (NBS)-encoding gene family. Therefore, identification and characterization of NBS genes expressed in ginseng are paramount to its genetic improvement and breeding. However, little is known about the NBS-encoding genes in ginseng. Here we report genome-wide identification and systems analysis of the NBS genes actively expressed in ginseng (PgNBS genes). Four hundred twelve PgNBS gene transcripts, derived from 284 gene models, were identified from the transcriptomes of 14 ginseng tissues. These genes were classified into eight types, including TNL, TN, CNL, CN, NL, N, RPW8-NL and RPW8-N. Seven conserved motifs were identified in both the Toll/interleukine-1 receptor (TIR) and coiled-coil (CC) typed genes whereas six were identified in the RPW8 typed genes. Phylogenetic analysis showed that the PgNBS gene family is an ancient family, with a vast majority of its genes originated before ginseng originated. In spite of their belonging to a family, the PgNBS genes have functionally dramatically differentiated and been categorized into numerous functional categories. The expressions of the across tissues, different aged roots and the roots of different genotypes. However, they are coordinating in expression, forming a single co-expression network. These results provide a deeper understanding of the origin, evolution and functional differentiation and expression dynamics of the NBS-encoding gene family in plants in general and in ginseng particularly, and a NBS gene toolkit useful for isolation and characterization of disease resistance genes and for enhanced disease resistance breeding in ginseng and related species.
Functional differentiation and spatial-temporal co-expression networks of the NBS-encoding gene family in Jilin ginseng, Panax ginseng C.A. Meyer

PubMed Central

Wang, Kangyu; Lin, Yanping; Wang, Yanfang; Sun, Chunyu; Wang, Yi

2017-01-01

Ginseng, Panax ginseng C.A. Meyer, is one of the most important medicinal plants for human health and medicine. It has been documented that over 80% of genes conferring resistance to bacteria, viruses, fungi and nematodes are contributed by the nucleotide binding site (NBS)-encoding gene family. Therefore, identification and characterization of NBS genes expressed in ginseng are paramount to its genetic improvement and breeding. However, little is known about the NBS-encoding genes in ginseng. Here we report genome-wide identification and systems analysis of the NBS genes actively expressed in ginseng (PgNBS genes). Four hundred twelve PgNBS gene transcripts, derived from 284 gene models, were identified from the transcriptomes of 14 ginseng tissues. These genes were classified into eight types, including TNL, TN, CNL, CN, NL, N, RPW8-NL and RPW8-N. Seven conserved motifs were identified in both the Toll/interleukine-1 receptor (TIR) and coiled-coil (CC) typed genes whereas six were identified in the RPW8 typed genes. Phylogenetic analysis showed that the PgNBS gene family is an ancient family, with a vast majority of its genes originated before ginseng originated. In spite of their belonging to a family, the PgNBS genes have functionally dramatically differentiated and been categorized into numerous functional categories. The expressions of the across tissues, different aged roots and the roots of different genotypes. However, they are coordinating in expression, forming a single co-expression network. These results provide a deeper understanding of the origin, evolution and functional differentiation and expression dynamics of the NBS-encoding gene family in plants in general and in ginseng particularly, and a NBS gene toolkit useful for isolation and characterization of disease resistance genes and for enhanced disease resistance breeding in ginseng and related species. PMID:28727829
A prior-based integrative framework for functional transcriptional regulatory network inference

PubMed Central

Siahpirani, Alireza F.

2017-01-01

Abstract Transcriptional regulatory networks specify regulatory proteins controlling the context-specific expression levels of genes. Inference of genome-wide regulatory networks is central to understanding gene regulation, but remains an open challenge. Expression-based network inference is among the most popular methods to infer regulatory networks, however, networks inferred from such methods have low overlap with experimentally derived (e.g. ChIP-chip and transcription factor (TF) knockouts) networks. Currently we have a limited understanding of this discrepancy. To address this gap, we first develop a regulatory network inference algorithm, based on probabilistic graphical models, to integrate expression with auxiliary datasets supporting a regulatory edge. Second, we comprehensively analyze our and other state-of-the-art methods on different expression perturbation datasets. Networks inferred by integrating sequence-specific motifs with expression have substantially greater agreement with experimentally derived networks, while remaining more predictive of expression than motif-based networks. Our analysis suggests natural genetic variation as the most informative perturbation for network inference, and, identifies core TFs whose targets are predictable from expression. Multiple reasons make the identification of targets of other TFs difficult, including network architecture and insufficient variation of TF mRNA level. Finally, we demonstrate the utility of our inference algorithm to infer stress-specific regulatory networks and for regulator prioritization. PMID:27794550
Different approach to the modeling of nonfree particle diffusion

NASA Astrophysics Data System (ADS)

Buhl, Niels

2018-03-01

A new approach to the modeling of nonfree particle diffusion is presented. The approach uses a general setup based on geometric graphs (networks of curves), which means that particle diffusion in anything from arrays of barriers and pore networks to general geometric domains can be considered and that the (free random walk) central limit theorem can be generalized to cover also the nonfree case. The latter gives rise to a continuum-limit description of the diffusive motion where the effect of partially absorbing barriers is accounted for in a natural and non-Markovian way that, in contrast to the traditional approach, quantifies the absorptivity of a barrier in terms of a dimensionless parameter in the range 0 to 1. The generalized theorem gives two general analytic expressions for the continuum-limit propagator: an infinite sum of Gaussians and an infinite sum of plane waves. These expressions entail the known method-of-images and Laplace eigenfunction expansions as special cases and show how the presence of partially absorbing barriers can lead to phenomena such as line splitting and band gap formation in the plane wave wave-number spectrum.
In silico identification of miRNAs and their target genes and analysis of gene co-expression network in saffron (Crocus sativus L.) stigma

PubMed Central

Zinati, Zahra; Shamloo-Dashtpagerdi, Roohollah; Behpouri, Ali

2016-01-01

As an aromatic and colorful plant of substantive taste, saffron (Crocus sativus L.) owes such properties of matter to growing class of the secondary metabolites derived from the carotenoids, apocarotenoids. Regarding the critical role of microRNAs in secondary metabolic synthesis and the limited number of identified miRNAs in C. sativus, on the other hand, one may see the point how the characterization of miRNAs along with the corresponding target genes in C. sativus might expand our perspectives on the roles of miRNAs in carotenoid/apocarotenoid biosynthetic pathway. A computational analysis was used to identify miRNAs and their targets using EST (Expressed Sequence Tag) library from mature saffron stigmas. Then, a gene co- expression network was constructed to identify genes which are potentially involved in carotenoid/apocarotenoid biosynthetic pathways. EST analysis led to the identification of two putative miRNAs (miR414 and miR837-5p) along with the corresponding stem- looped precursors. To our knowledge, this is the first report on miR414 and miR837-5p in C. sativus. Co-expression network analysis indicated that miR414 and miR837-5p may play roles in C. sativus metabolic pathways and led to identification of candidate genes including six transcription factors and one protein kinase probably involved in carotenoid/apocarotenoid biosynthetic pathway. Presence of transcription factors, miRNAs and protein kinase in the network indicated multiple layers of regulation in saffron stigma. The candidate genes from this study may help unraveling regulatory networks underlying the carotenoid/apocarotenoid biosynthesis in saffron and designing metabolic engineering for enhanced secondary metabolites. PMID:28261627
An atlas of gene expression and gene co-regulation in the human retina.

PubMed

Pinelli, Michele; Carissimo, Annamaria; Cutillo, Luisa; Lai, Ching-Hung; Mutarelli, Margherita; Moretti, Maria Nicoletta; Singh, Marwah Veer; Karali, Marianthi; Carrella, Diego; Pizzo, Mariateresa; Russo, Francesco; Ferrari, Stefano; Ponzin, Diego; Angelini, Claudia; Banfi, Sandro; di Bernardo, Diego

2016-07-08

The human retina is a specialized tissue involved in light stimulus transduction. Despite its unique biology, an accurate reference transcriptome is still missing. Here, we performed gene expression analysis (RNA-seq) of 50 retinal samples from non-visually impaired post-mortem donors. We identified novel transcripts with high confidence (Observed Transcriptome (ObsT)) and quantified the expression level of known transcripts (Reference Transcriptome (RefT)). The ObsT included 77 623 transcripts (23 960 genes) covering 137 Mb (35 Mb new transcribed genome). Most of the transcripts (92%) were multi-exonic: 81% with known isoforms, 16% with new isoforms and 3% belonging to new genes. The RefT included 13 792 genes across 94 521 known transcripts. Mitochondrial genes were among the most highly expressed, accounting for about 10% of the reads. Of all the protein-coding genes in Gencode, 65% are expressed in the retina. We exploited inter-individual variability in gene expression to infer a gene co-expression network and to identify genes specifically expressed in photoreceptor cells. We experimentally validated the photoreceptors localization of three genes in human retina that had not been previously reported. RNA-seq data and the gene co-expression network are available online (http://retina.tigem.it). © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Model-based design of RNA hybridization networks implemented in living cells.

PubMed

Rodrigo, Guillermo; Prakash, Satya; Shen, Shensi; Majer, Eszter; Daròs, José-Antonio; Jaramillo, Alfonso

2017-09-19

Synthetic gene circuits allow the behavior of living cells to be reprogrammed, and non-coding small RNAs (sRNAs) are increasingly being used as programmable regulators of gene expression. However, sRNAs (natural or synthetic) are generally used to regulate single target genes, while complex dynamic behaviors would require networks of sRNAs regulating each other. Here, we report a strategy for implementing such networks that exploits hybridization reactions carried out exclusively by multifaceted sRNAs that are both targets of and triggers for other sRNAs. These networks are ultimately coupled to the control of gene expression. We relied on a thermodynamic model of the different stable conformational states underlying this system at the nucleotide level. To test our model, we designed five different RNA hybridization networks with a linear architecture, and we implemented them in Escherichia coli. We validated the network architecture at the molecular level by native polyacrylamide gel electrophoresis, as well as the network function at the bacterial population and single-cell levels with a fluorescent reporter. Our results suggest that it is possible to engineer complex cellular programs based on RNA from first principles. Because these networks are mainly based on physical interactions, our designs could be expanded to other organisms as portable regulatory resources or to implement biological computations. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data.

PubMed

Chen, Shuonan; Mar, Jessica C

2018-06-19

A fundamental fact in biology states that genes do not operate in isolation, and yet, methods that infer regulatory networks for single cell gene expression data have been slow to emerge. With single cell sequencing methods now becoming accessible, general network inference algorithms that were initially developed for data collected from bulk samples may not be suitable for single cells. Meanwhile, although methods that are specific for single cell data are now emerging, whether they have improved performance over general methods is unknown. In this study, we evaluate the applicability of five general methods and three single cell methods for inferring gene regulatory networks from both experimental single cell gene expression data and in silico simulated data. Standard evaluation metrics using ROC curves and Precision-Recall curves against reference sets sourced from the literature demonstrated that most of the methods performed poorly when they were applied to either experimental single cell data, or simulated single cell data, which demonstrates their lack of performance for this task. Using default settings, network methods were applied to the same datasets. Comparisons of the learned networks highlighted the uniqueness of some predicted edges for each method. The fact that different methods infer networks that vary substantially reflects the underlying mathematical rationale and assumptions that distinguish network methods from each other. This study provides a comprehensive evaluation of network modeling algorithms applied to experimental single cell gene expression data and in silico simulated datasets where the network structure is known. Comparisons demonstrate that most of these assessed network methods are not able to predict network structures from single cell expression data accurately, even if they are specifically developed for single cell methods. Also, single cell methods, which usually depend on more elaborative algorithms, in general have less similarity to each other in the sets of edges detected. The results from this study emphasize the importance for developing more accurate optimized network modeling methods that are compatible for single cell data. Newly-developed single cell methods may uniquely capture particular features of potential gene-gene relationships, and caution should be taken when we interpret these results.
Reconstruction of the genome-scale co-expression network for the Hippo signaling pathway in colorectal cancer.

PubMed

Dehghanian, Fariba; Hojati, Zohreh; Hosseinkhan, Nazanin; Mousavian, Zaynab; Masoudi-Nejad, Ali

2018-05-26

The Hippo signaling pathway (HSP) has been identified as an essential and complex signaling pathway for tumor suppression that coordinates proliferation, differentiation, cell death, cell growth and stemness. In the present study, we conducted a genome-scale co-expression analysis to reconstruct the HSP in colorectal cancer (CRC). Five key modules were detected through network clustering, and a detailed discussion of two modules containing respectively 18 and 13 over and down-regulated members of HSP was provided. Our results suggest new potential regulatory factors in the HSP. The detected modules also suggest novel genes contributing to CRC. Moreover, differential expression analysis confirmed the differential expression pattern of HSP members and new suggested regulatory factors between tumor and normal samples. These findings can further reveal the importance of HSP in CRC. Copyright © 2018 Elsevier Ltd. All rights reserved.
A systems approach identifies networks and genes linking sleep and stress: implications for neuropsychiatric disorders.

PubMed

Jiang, Peng; Scarpa, Joseph R; Fitzpatrick, Karrie; Losic, Bojan; Gao, Vance D; Hao, Ke; Summa, Keith C; Yang, He S; Zhang, Bin; Allada, Ravi; Vitaterna, Martha H; Turek, Fred W; Kasarskis, Andrew

2015-05-05

Sleep dysfunction and stress susceptibility are comorbid complex traits that often precede and predispose patients to a variety of neuropsychiatric diseases. Here, we demonstrate multilevel organizations of genetic landscape, candidate genes, and molecular networks associated with 328 stress and sleep traits in a chronically stressed population of 338 (C57BL/6J × A/J) F2 mice. We constructed striatal gene co-expression networks, revealing functionally and cell-type-specific gene co-regulations important for stress and sleep. Using a composite ranking system, we identified network modules most relevant for 15 independent phenotypic categories, highlighting a mitochondria/synaptic module that links sleep and stress. The key network regulators of this module are overrepresented with genes implicated in neuropsychiatric diseases. Our work suggests that the interplay among sleep, stress, and neuropathology emerges from genetic influences on gene expression and their collective organization through complex molecular networks, providing a framework for interrogating the mechanisms underlying sleep, stress susceptibility, and related neuropsychiatric disorders. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Genotet: An Interactive Web-based Visual Exploration Framework to Support Validation of Gene Regulatory Networks.

PubMed

Yu, Bowen; Doraiswamy, Harish; Chen, Xi; Miraldi, Emily; Arrieta-Ortiz, Mario Luis; Hafemeister, Christoph; Madar, Aviv; Bonneau, Richard; Silva, Cláudio T

2014-12-01

Elucidation of transcriptional regulatory networks (TRNs) is a fundamental goal in biology, and one of the most important components of TRNs are transcription factors (TFs), proteins that specifically bind to gene promoter and enhancer regions to alter target gene expression patterns. Advances in genomic technologies as well as advances in computational biology have led to multiple large regulatory network models (directed networks) each with a large corpus of supporting data and gene-annotation. There are multiple possible biological motivations for exploring large regulatory network models, including: validating TF-target gene relationships, figuring out co-regulation patterns, and exploring the coordination of cell processes in response to changes in cell state or environment. Here we focus on queries aimed at validating regulatory network models, and on coordinating visualization of primary data and directed weighted gene regulatory networks. The large size of both the network models and the primary data can make such coordinated queries cumbersome with existing tools and, in particular, inhibits the sharing of results between collaborators. In this work, we develop and demonstrate a web-based framework for coordinating visualization and exploration of expression data (RNA-seq, microarray), network models and gene-binding data (ChIP-seq). Using specialized data structures and multiple coordinated views, we design an efficient querying model to support interactive analysis of the data. Finally, we show the effectiveness of our framework through case studies for the mouse immune system (a dataset focused on a subset of key cellular functions) and a model bacteria (a small genome with high data-completeness).

Gene Discovery of Characteristic Metabolic Pathways in the Tea Plant (Camellia sinensis) Using ‘Omics’-Based Network Approaches: A Future Perspective

PubMed Central

Zhang, Shihua; Zhang, Liang; Tai, Yuling; Wang, Xuewen; Ho, Chi-Tang; Wan, Xiaochun

2018-01-01

Characteristic secondary metabolites, including flavonoids, theanine and caffeine, in the tea plant (Camellia sinensis) are the primary sources of the rich flavors, fresh taste, and health benefits of tea. The decoding of genes involved in these characteristic components is still significantly lagging, which lays an obstacle for applied genetic improvement and metabolic engineering. With the popularity of high-throughout transcriptomics and metabolomics, ‘omics’-based network approaches, such as gene co-expression network and gene-to-metabolite network, have emerged as powerful tools for gene discovery of plant-specialized (secondary) metabolism. Thus, it is pivotal to summarize and introduce such system-based strategies in facilitating gene identification of characteristic metabolic pathways in the tea plant (or other plants). In this review, we describe recent advances in transcriptomics and metabolomics for transcript and metabolite profiling, and highlight ‘omics’-based network strategies using successful examples in model and non-model plants. Further, we summarize recent progress in ‘omics’ analysis for gene identification of characteristic metabolites in the tea plant. Limitations of the current strategies are discussed by comparison with ‘omics’-based network approaches. Finally, we demonstrate the potential of introducing such network strategies in the tea plant, with a prospects ending for a promising network discovery of characteristic metabolite genes in the tea plant. PMID:29915604
Use of transcriptomics and co-expression networks to analyze the interconnections between nitrogen assimilation and photorespiratory metabolism

PubMed Central

Pérez-Delgado, Carmen M.; Moyano, Tomás C.; García-Calderón, Margarita; Canales, Javier; Gutiérrez, Rodrigo A.; Márquez, Antonio J.; Betti, Marco

2016-01-01

Nitrogen is one of the most important nutrients for plants and, in natural soils, its availability is often a major limiting factor for plant growth. Here we examine the effect of different forms of nitrogen nutrition and of photorespiration on gene expression in the model legume Lotus japonicus with the aim of identifying regulatory candidate genes co-ordinating primary nitrogen assimilation and photorespiration. The transcriptomic changes produced by the use of different nitrogen sources in leaves of L. japonicus plants combined with the transcriptomic changes produced in the same tissue by different photorespiratory conditions were examined. The results obtained provide novel information on the possible role of plastidic glutamine synthetase in the response to different nitrogen sources and in the C/N balance of L. japonicus plants. The use of gene co-expression networks establishes a clear relationship between photorespiration and primary nitrogen assimilation and identifies possible transcription factors connected to the genes of both routes. PMID:27117340
NetCoDer: A Retransmission Mechanism for WSNs Based on Cooperative Relays and Network Coding

PubMed Central

Valle, Odilson T.; Montez, Carlos; Medeiros de Araujo, Gustavo; Vasques, Francisco; Moraes, Ricardo

2016-01-01

Some of the most difficult problems to deal with when using Wireless Sensor Networks (WSNs) are related to the unreliable nature of communication channels. In this context, the use of cooperative diversity techniques and the application of network coding concepts may be promising solutions to improve the communication reliability. In this paper, we propose the NetCoDer scheme to address this problem. Its design is based on merging cooperative diversity techniques and network coding concepts. We evaluate the effectiveness of the NetCoDer scheme through both an experimental setup with real WSN nodes and a simulation assessment, comparing NetCoDer performance against state-of-the-art TDMA-based (Time Division Multiple Access) retransmission techniques: BlockACK, Master/Slave and Redundant TDMA. The obtained results highlight that the proposed NetCoDer scheme clearly improves the network performance when compared with other retransmission techniques. PMID:27258280
Analyzing the interactions of mRNAs, miRNAs, lncRNAs and circRNAs to predict competing endogenous RNA networks in glioblastoma.

PubMed

Yuan, Yang; Jiaoming, Li; Xiang, Wang; Yanhui, Liu; Shu, Jiang; Maling, Gou; Qing, Mao

2018-05-01

Cross-talk between competitive endogenous RNAs (ceRNAs) may play a critical role in revealing potential mechanisms of tumor development and physiology. Glioblastoma is the most common type of malignant primary brain tumor, and the mechanisms of tumor genesis and development in glioblastoma are unclear. Here, to investigate the role of non-coding RNAs and the ceRNA network in glioblastoma, we performed paired-end RNA sequencing and microarray analyses to obtain the expression profiles of mRNAs, lncRNAs, circRNAs and miRNAs. We identified that the expression of 501 lncRNAs, 1999 mRNAs, 2038 circRNAs and 143 miRNAs were often altered between glioblastoma and matched normal brain tissue. Gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses were performed on these differentially expressed mRNAs and miRNA-mediated target genes of lncRNAs and circRNAs. Furthermore, we used a multi-step computational framework and several bioinformatics methods to construct a ceRNA network combining mRNAs, miRNAs, lncRNAs and circRNA, based on co-expression analysis between the differentially expressed RNAs. We identified that plenty of lncRNAs, CircRNAs and their downstream target genes in the ceRNA network are related to glutamatergic synapse, suggesting that glutamate metabolism is involved in glioma biological functions. Our results will accelerate the understanding of tumorigenesis, cancer progression and even therapeutic targeting in glioblastoma.
Geo-Distinctive Comorbidity Networks of Pediatric Asthma.

PubMed

Shin, Eun Kyong; Shaban-Nejad, Arash

2018-01-01

Most pediatric asthma cases occur in complex interdependencies, exhibiting complex manifestation of multiple symptoms. Studying asthma comorbidities can help to better understand the etiology pathway of the disease. Albeit such relations of co-expressed symptoms and their interactions have been highlighted recently, empirical investigation has not been rigorously applied to pediatric asthma cases. In this study, we use computational network modeling and analysis to reveal the links and associations between commonly co-observed diseases/conditions with asthma among children in Memphis, Tennessee. We present a novel method for geo-parsed comorbidity network analysis to show the distinctive patterns of comorbidity networks in urban and suburban areas in Memphis.
Mechanisms of Severe Acute Respiratory Syndrome Coronavirus-Induced Acute Lung Injury

PubMed Central

Gralinski, Lisa E.; Bankhead, Armand; Jeng, Sophia; Menachery, Vineet D.; Proll, Sean; Belisle, Sarah E.; Matzke, Melissa; Webb-Robertson, Bobbie-Jo M.; Luna, Maria L.; Shukla, Anil K.; Ferris, Martin T.; Bolles, Meagan; Chang, Jean; Aicher, Lauri; Waters, Katrina M.; Smith, Richard D.; Metz, Thomas O.; Law, G. Lynn; Katze, Michael G.; McWeeney, Shannon; Baric, Ralph S.

2013-01-01

ABSTRACT Systems biology offers considerable promise in uncovering novel pathways by which viruses and other microbial pathogens interact with host signaling and expression networks to mediate disease severity. In this study, we have developed an unbiased modeling approach to identify new pathways and network connections mediating acute lung injury, using severe acute respiratory syndrome coronavirus (SARS-CoV) as a model pathogen. We utilized a time course of matched virologic, pathological, and transcriptomic data within a novel methodological framework that can detect pathway enrichment among key highly connected network genes. This unbiased approach produced a high-priority list of 4 genes in one pathway out of over 3,500 genes that were differentially expressed following SARS-CoV infection. With these data, we predicted that the urokinase and other wound repair pathways would regulate lethal versus sublethal disease following SARS-CoV infection in mice. We validated the importance of the urokinase pathway for SARS-CoV disease severity using genetically defined knockout mice, proteomic correlates of pathway activation, and pathological disease severity. The results of these studies demonstrate that a fine balance exists between host coagulation and fibrinolysin pathways regulating pathological disease outcomes, including diffuse alveolar damage and acute lung injury, following infection with highly pathogenic respiratory viruses, such as SARS-CoV. PMID:23919993
Characterization of a photoacoustic system through neural networks to determine multicomponent samples

NASA Astrophysics Data System (ADS)

Zajarevich, N. M.; Peuriot, A. L.; Slezak, V. B.

2016-07-01

Photoacoustic spectroscopy for trace gases detection, based on a CO2 laser, can be used in a wide range of applications. The tunability of this laser in the mid-infrared (9.4-10.6 μm) allows the quantitative determination of different substances in multicomponent samples. In general, at traces level, the total photoacoustic amplitude at a certain wavelength may be approximated by a linear superposition of the amplitudes given by each of the species absorbing at that wavelength. However, in some cases, the sum of the individual signals is no longer valid. In particular, it is known the presence of CO2 delays the acoustic signal in relation to the laser excitation due to the exchange of vibrational energy between CO2 and N2. This phenomenon generates a slow V-T energy relaxation from a metastable N2 vibrational level and the sum of individual contributions may no longer be valid. Moreover, the resolution of a linear equation system has limitations, so the possibility to determine concentrations in photoacoustics based on neural network is proposed in this work. This procedure is tried in a particular case of a volatile organic compound, such as C2H4, and CO2 in air. The results are compared with the ones obtained with a model based on rate equations.
A proteomic network approach across the ALS-FTD disease spectrum resolves clinical phenotypes and genetic vulnerability in human brain.

PubMed

Umoh, Mfon E; Dammer, Eric B; Dai, Jingting; Duong, Duc M; Lah, James J; Levey, Allan I; Gearing, Marla; Glass, Jonathan D; Seyfried, Nicholas T

2018-01-01

Amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) are neurodegenerative diseases with overlap in clinical presentation, neuropathology, and genetic underpinnings. The molecular basis for the overlap of these disorders is not well established. We performed a comparative unbiased mass spectrometry-based proteomic analysis of frontal cortical tissues from postmortem cases clinically defined as ALS, FTD, ALS and FTD (ALS/FTD), and controls. We also included a subset of patients with the C9orf72 expansion mutation, the most common genetic cause of both ALS and FTD Our systems-level analysis of the brain proteome integrated both differential expression and co-expression approaches to assess the relationship of these differences to clinical and pathological phenotypes. Weighted co-expression network analysis revealed 15 modules of co-expressed proteins, eight of which were significantly different across the ALS-FTD disease spectrum. These included modules associated with RNA binding proteins, synaptic transmission, and inflammation with cell-type specificity that showed correlation with TDP-43 pathology and cognitive dysfunction. Modules were also examined for their overlap with TDP-43 protein-protein interactions, revealing one module enriched with RNA-binding proteins and other causal ALS genes that increased in FTD/ALS and FTD cases. A module enriched with astrocyte and microglia proteins was significantly increased in ALS cases carrying the C9orf72 mutation compared to sporadic ALS cases, suggesting that the genetic expansion is associated with inflammation in the brain even without clinical evidence of dementia. Together, these findings highlight the utility of integrative systems-level proteomic approaches to resolve clinical phenotypes and genetic mechanisms underlying the ALS-FTD disease spectrum in human brain. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.
Prediction of cassava protein interactome based on interolog method.

PubMed

Thanasomboon, Ratana; Kalapanulak, Saowalak; Netrphan, Supatcharee; Saithong, Treenut

2017-12-08

Cassava is a starchy root crop whose role in food security becomes more significant nowadays. Together with the industrial uses for versatile purposes, demand for cassava starch is continuously growing. However, in-depth study to uncover the mystery of cellular regulation, especially the interaction between proteins, is lacking. To reduce the knowledge gap in protein-protein interaction (PPI), genome-scale PPI network of cassava was constructed using interolog-based method (MePPI-In, available at http://bml.sbi.kmutt.ac.th/ppi ). The network was constructed from the information of seven template plants. The MePPI-In included 90,173 interactions from 7,209 proteins. At least, 39 percent of the total predictions were found with supports from gene/protein expression data, while further co-expression analysis yielded 16 highly promising PPIs. In addition, domain-domain interaction information was employed to increase reliability of the network and guide the search for more groups of promising PPIs. Moreover, the topology and functional content of MePPI-In was similar to the networks of Arabidopsis and rice. The potential contribution of MePPI-In for various applications, such as protein-complex formation and prediction of protein function, was discussed and exemplified. The insights provided by our MePPI-In would hopefully enable us to pursue precise trait improvement in cassava.
Multiple Assembly Rules Drive the Co-occurrence of Orthopteran and Plant Species in Grasslands: Combining Network, Functional and Phylogenetic Approaches

PubMed Central

Fournier, Bertrand; Mouly, Arnaud; Gillet, François

2016-01-01

Understanding the factors underlying the co-occurrence of multiple species remains a challenge in ecology. Biotic interactions, environmental filtering and neutral processes are among the main mechanisms evoked to explain species co-occurrence. However, they are most often studied separately or even considered as mutually exclusive. This likely hampers a more global understanding of species assembly. Here, we investigate the general hypothesis that the structure of co-occurrence networks results from multiple assembly rules and its potential implications for grassland ecosystems. We surveyed orthopteran and plant communities in 48 permanent grasslands of the French Jura Mountains and gathered functional and phylogenetic data for all species. We constructed a network of plant and orthopteran species co-occurrences and verified whether its structure was modular or nested. We investigated the role of all species in the structure of the network (modularity and nestedness). We also investigated the assembly rules driving the structure of the plant-orthopteran co-occurrence network by using null models on species functional traits, phylogenetic relatedness and environmental conditions. We finally compared our results to abundance-based approaches. We found that the plant-orthopteran co-occurrence network had a modular organization. Community assembly rules differed among modules for plants while interactions with plants best explained the distribution of orthopterans into modules. Few species had a disproportionately high positive contribution to this modular organization and are likely to have a key importance to modulate future changes. The impact of agricultural practices was restricted to some modules (3 out of 5) suggesting that shifts in agricultural practices might not impact the entire plant-orthopteran co-occurrence network. These findings support our hypothesis that multiple assembly rules drive the modular structure of the plant-orthopteran network. This modular structure is likely to play a key role in the response of grassland ecosystems to future changes by limiting the impact of changes in agricultural practices such as intensification to some modules leaving species from other modules poorly impacted. The next step is to understand the importance of this modular structure for the long-term maintenance of grassland ecosystem structure and functions as well as to develop tools to integrate network structure into models to improve their capacity to predict future changes. PMID:27582754
Network Analysis of Publications on Topological Indices from the Web of Science.

PubMed

Bodlaj, Jernej; Batagelj, Vladimir

2014-08-01

In this paper we analyze a collection of bibliographic networks, constructed from the data from the Web of Science on works (papers, books, etc.) on the topic of topological indices and on relating scientific fields. We present the general outlook and more specific findings about authors, works and journals, subtopics and keywords and also important relations between them based on scientometric approaches like the strongest and main citation paths, the main themes on citation path based on keywords, results of co-authorship analysis in form of the most prominent islands of citing authors, groups of collaborating authors, two-mode cores of authors and works. We investigate the nature of citing of authors, important journals and citing of works between them, journals preferred by authors and expose hierarchy of similar collaborating authors, based on keywords they use. We perform temporal analysis on one important journal as well. We give a comprehensive scientometric insight into the field of topological indices. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Signed weighted gene co-expression network analysis of transcriptional regulation in murine embryonic stem cells

PubMed Central

Mason, Mike J; Fan, Guoping; Plath, Kathrin; Zhou, Qing; Horvath, Steve

2009-01-01

Background Recent work has revealed that a core group of transcription factors (TFs) regulates the key characteristics of embryonic stem (ES) cells: pluripotency and self-renewal. Current efforts focus on identifying genes that play important roles in maintaining pluripotency and self-renewal in ES cells and aim to understand the interactions among these genes. To that end, we investigated the use of unsigned and signed network analysis to identify pluripotency and differentiation related genes. Results We show that signed networks provide a better systems level understanding of the regulatory mechanisms of ES cells than unsigned networks, using two independent murine ES cell expression data sets. Specifically, using signed weighted gene co-expression network analysis (WGCNA), we found a pluripotency module and a differentiation module, which are not identified in unsigned networks. We confirmed the importance of these modules by incorporating genome-wide TF binding data for key ES cell regulators. Interestingly, we find that the pluripotency module is enriched with genes related to DNA damage repair and mitochondrial function in addition to transcriptional regulation. Using a connectivity measure of module membership, we not only identify known regulators of ES cells but also show that Mrpl15, Msh6, Nrf1, Nup133, Ppif, Rbpj, Sh3gl2, and Zfp39, among other genes, have important roles in maintaining ES cell pluripotency and self-renewal. We also report highly significant relationships between module membership and epigenetic modifications (histone modifications and promoter CpG methylation status), which are known to play a role in controlling gene expression during ES cell self-renewal and differentiation. Conclusion Our systems biologic re-analysis of gene expression, transcription factor binding, epigenetic and gene ontology data provides a novel integrative view of ES cell biology. PMID:19619308
RedeR: R/Bioconductor package for representing modular structures, nested networks and multiple levels of hierarchical associations

PubMed Central

2012-01-01

Visualization and analysis of molecular networks are both central to systems biology. However, there still exists a large technological gap between them, especially when assessing multiple network levels or hierarchies. Here we present RedeR, an R/Bioconductor package combined with a Java core engine for representing modular networks. The functionality of RedeR is demonstrated in two different scenarios: hierarchical and modular organization in gene co-expression networks and nested structures in time-course gene expression subnetworks. Our results demonstrate RedeR as a new framework to deal with the multiple network levels that are inherent to complex biological systems. RedeR is available from http://bioconductor.org/packages/release/bioc/html/RedeR.html. PMID:22531049
Titanium biomaterials with complex surfaces induced aberrant peripheral circadian rhythms in bone marrow mesenchymal stromal cells.

PubMed

Hassan, Nathaniel; McCarville, Kirstin; Morinaga, Kenzo; Mengatto, Cristiane M; Langfelder, Peter; Hokugo, Akishige; Tahara, Yu; Colwell, Christopher S; Nishimura, Ichiro

2017-01-01

Circadian rhythms maintain a high level of homeostasis through internal feed-forward and -backward regulation by core molecules. In this study, we report the highly unusual peripheral circadian rhythm of bone marrow mesenchymal stromal cells (BMSCs) induced by titanium-based biomaterials with complex surface modifications (Ti biomaterial) commonly used for dental and orthopedic implants. When cultured on Ti biomaterials, human BMSCs suppressed circadian PER1 expression patterns, while NPAS2 was uniquely upregulated. The Ti biomaterials, which reduced Per1 expression and upregulated Npas2, were further examined with BMSCs harvested from Per1::luc transgenic rats. Next, we addressed the regulatory relationship between Per1 and Npas2 using BMSCs from Npas2 knockout mice. The Npas2 knockout mutation did not rescue the Ti biomaterial-induced Per1 suppression and did not affect Per2, Per3, Bmal1 and Clock expression, suggesting that the Ti biomaterial-induced Npas2 overexpression was likely an independent phenomenon. Previously, vitamin D deficiency was reported to interfere with Ti biomaterial osseointegration. The present study demonstrated that vitamin D supplementation significantly increased Per1::luc expression in BMSCs, though the presence of Ti biomaterials only moderately affected the suppressed Per1::luc expression. Available in vivo microarray data from femurs exposed to Ti biomaterials in vitamin D-deficient rats were evaluated by weighted gene co-expression network analysis. A large co-expression network containing Npas2, Bmal1, and Vdr was observed to form with the Ti biomaterials, which was disintegrated by vitamin D deficiency. Thus, the aberrant BMSC peripheral circadian rhythm may be essential for the integration of Ti biomaterials into bone.
High-yield, in vitro protein expression using a continuous-exchange, coupled transcription/ translation system.

PubMed

Martin, G A; Kawaguchi, R; Lam, Y; DeGiovanni, A; Fukushima, M; Mutter, W

2001-10-01

The Rapid Translation System (RTS 500) (Roche Molecular Biochemicals) is a high-yield protein expression system that utilizes an enhanced E. coli lysate for an in vitro transcription/translation reaction. In contrast to conventional transcription/translation, this system allows protein expression to continue for more than 24 h. We demonstrated the utility of the RTS 500 by expressing different soluble and active proteins that generally pose problems in cell-based expression systems. We first expressed GFP-lunasin, a fusion protein that, because of its toxicity, has been impossible to produce in whole cells. The second protein we expressed, human interleukin-2 (IL-2), is generally difficult to produce, either as the native molecule or as a GSTfusion protein, in a soluble form in bacteria. Finally, we demonstrated the capacity of the RTS 500 to co-express proteins, by the simultaneous production of GFP and CAT in a single reaction. This new technology appears to be particularly usefulfor the convenient production of preparative amounts (100-900 microg) of proteins that are toxic or insoluble in cell-based systems.
Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

PubMed Central

Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

2016-01-01

The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration. PMID:27241320
Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

NASA Astrophysics Data System (ADS)

Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

2016-05-01

The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration.
A long non-coding RNA expression profile can predict early recurrence in hepatocellular carcinoma after curative resection.

PubMed

Lv, Yufeng; Wei, Wenhao; Huang, Zhong; Chen, Zhichao; Fang, Yuan; Pan, Lili; Han, Xueqiong; Xu, Zihai

2018-06-20

The aim of this study was to develop a novel long non-coding RNA (lncRNA) expression signature to accurately predict early recurrence for patients with hepatocellular carcinoma (HCC) after curative resection. Using expression profiles downloaded from The Cancer Genome Atlas database, we identified multiple lncRNAs with differential expression between early recurrence (ER) group and non-early recurrence (non-ER) group of HCC. Least absolute shrinkage and selection operator (LASSO) for logistic regression models were used to develop a lncRNA-based classifier for predicting ER in the training set. An independent test set was used to validated the predictive value of this classifier. Futhermore, a co-expression network based on these lncRNAs and its highly related genes was constructed and Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of genes in the network were performed. We identified 10 differentially expressed lncRNAs, including 3 that were upregulated and 7 that were downregulated in ER group. The lncRNA-based classifier was constructed based on 7 lncRNAs (AL035661.1, PART1, AC011632.1, AC109588.1, AL365361.1, LINC00861 and LINC02084), and its accuracy was 0.83 in training set, 0.87 in test set and 0.84 in total set. And ROC curve analysis showed the AUROC was 0.741 in training set, 0.824 in the test set and 0.765 in total set. A functional enrichment analysis suggested that the genes of which is highly related to 4 lncRNAs were involved in immune system. This 7-lncRNA expression profile can effectively predict the early recurrence after surgical resection for HCC. This article is protected by copyright. All rights reserved.
In-Silico Integration Approach to Identify a Key miRNA Regulating a Gene Network in Aggressive Prostate Cancer

PubMed Central

Colaprico, Antonio; Bontempi, Gianluca; Castiglioni, Isabella

2018-01-01

Like other cancer diseases, prostate cancer (PC) is caused by the accumulation of genetic alterations in the cells that drives malignant growth. These alterations are revealed by gene profiling and copy number alteration (CNA) analysis. Moreover, recent evidence suggests that also microRNAs have an important role in PC development. Despite efforts to profile PC, the alterations (gene, CNA, and miRNA) and biological processes that correlate with disease development and progression remain partially elusive. Many gene signatures proposed as diagnostic or prognostic tools in cancer poorly overlap. The identification of co-expressed genes, that are functionally related, can identify a core network of genes associated with PC with a better reproducibility. By combining different approaches, including the integration of mRNA expression profiles, CNAs, and miRNA expression levels, we identified a gene signature of four genes overlapping with other published gene signatures and able to distinguish, in silico, high Gleason-scored PC from normal human tissue, which was further enriched to 19 genes by gene co-expression analysis. From the analysis of miRNAs possibly regulating this network, we found that hsa-miR-153 was highly connected to the genes in the network. Our results identify a four-gene signature with diagnostic and prognostic value in PC and suggest an interesting gene network that could play a key regulatory role in PC development and progression. Furthermore, hsa-miR-153, controlling this network, could be a potential biomarker for theranostics in high Gleason-scored PC. PMID:29562723
Understanding network concepts in modules

PubMed Central

2007-01-01

Background Network concepts are increasingly used in biology and genetics. For example, the clustering coefficient has been used to understand network architecture; the connectivity (also known as degree) has been used to screen for cancer targets; and the topological overlap matrix has been used to define modules and to annotate genes. Dozens of potentially useful network concepts are known from graph theory. Results Here we study network concepts in special types of networks, which we refer to as approximately factorizable networks. In these networks, the pairwise connection strength (adjacency) between 2 network nodes can be factored into node specific contributions, named node 'conformity'. The node conformity turns out to be highly related to the connectivity. To provide a formalism for relating network concepts to each other, we define three types of network concepts: fundamental-, conformity-based-, and approximate conformity-based concepts. Fundamental concepts include the standard definitions of connectivity, density, centralization, heterogeneity, clustering coefficient, and topological overlap. The approximate conformity-based analogs of fundamental network concepts have several theoretical advantages. First, they allow one to derive simple relationships between seemingly disparate networks concepts. For example, we derive simple relationships between the clustering coefficient, the heterogeneity, the density, the centralization, and the topological overlap. The second advantage of approximate conformity-based network concepts is that they allow one to show that fundamental network concepts can be approximated by simple functions of the connectivity in module networks. Conclusion Using protein-protein interaction, gene co-expression, and simulated data, we show that a) many networks comprised of module nodes are approximately factorizable and b) in these types of networks, simple relationships exist between seemingly disparate network concepts. Our results are implemented in freely available R software code, which can be downloaded from the following webpage: http://www.genetics.ucla.edu/labs/horvath/ModuleConformity/ModuleNetworks PMID:17547772

An HCG-rich microenvironment contributes to ovarian cancer cell differentiation into endothelioid cells in a three-dimensional culture system.

PubMed

Su, Min; Fan, Chao; Gao, Sainan; Shen, Aiguo; Wang, Xiaoying; Zhang, Yuquan

2015-11-01

We investigated the expression of human chorionic gonadotropin (HCG) and its effects on vasculogenic mimicry (VM) formation in ovarian cancer cells under normoxic and hypoxic conditions in three-dimensional matrices preconditioned by an endothelial-trophoblast cell co-culture system. The co-culture model was established using human umbilical vein endothelial cells (HUVECs) and HTR-8 trophoblast cells in a three-dimensional culture system. The co-cultured cells were removed with NH4OH, and ovarian cancer cells were implanted into the preconditioned matrix. VM was identified morphologically and by detecting vascular markers expressed by cancer cells. The specificity of the effects of exogenous HCG in the microenvironment was assessed by inhibition with a neutralizing anti-HCG antibody. HCG siRNA was used to knock down endogenous HCG expression in OVCAR-3 ovarian cancer cells. HTR-8 cells 'fingerprinted' HUVECs to form capillary-like tube structures in co-cultures. In the preconditioned HCG-rich microenvironment, the number of vessel-like network structures formed by HCG receptor-positive OVCAR-3 cells and the expression levels of CD31, VEGF and factor VIII were significantly increased. The preconditioned HCG-rich microenvironment significantly increased the expression of hypoxia inducible factor-1α (HIF‑1α) and VM formation in OVCAR-3 cells under hypoxic conditions. Treatment with a neutralizing anti-HCG antibody but not HCG siRNA significantly inhibited the formation of vessel-like network structures. HCG in the microenvironment contributes to OVCAR-3 differentiation into endothelioid cells in three-dimensional matrices preconditioned with an endothelial-trophoblast cell co-culture system. HCG may synergistically enhance hypoxia-induced vascular markers and HIF-1α expression. These findings would provide perspectives on new therapeutic targets for ovarian cancer.
Elementary screening of lymph node metastatic-related genes in gastric cancer based on the co-expression network of messenger RNA, microRNA and long non-coding RNA.

PubMed

Song, Zhonghua; Zhao, Wenhua; Cao, Danfeng; Zhang, Jinqing; Chen, Shouhua

2018-01-01

Gastric cancer (GC) is the fifth most common cancer and the third leading cause of cancer-related deaths worldwide. The high mortality might be attributed to delay in detection and is closely related to lymph node metastasis. Therefore, it is of great importance to explore the mechanism of lymph node metastasis and find strategies to block GC metastasis. Messenger RNA (mRNA), microRNA (miRNA) and long non-coding RNA (lncRNA) expression data and clinical data were downloaded from The Cancer Genome Atlas (TCGA) database. A total of 908 differentially expressed factors with variance >0.5 including 542 genes, 42 miRNA, and 324 lncRNA were screened using significant analysis microarray algorithm, and interaction networks were constructed using these differentially expressed factors. Furthermore, we conducted functional modules analysis in the network, and found that yellow and turquoise modules could separate samples efficiently. The groups classified in the yellow and turquoise modules had a significant difference in survival time, which was verified in another independent GC mRNA dataset (GSE62254). The results suggested that differentially expressed factors in the yellow and turquoise modules may participate in lymph node metastasis of GC and could be applied as potential biomarkers or therapeutic targets for GC.
Elementary screening of lymph node metastatic-related genes in gastric cancer based on the co-expression network of messenger RNA, microRNA and long non-coding RNA

PubMed Central

Song, Zhonghua; Zhao, Wenhua; Cao, Danfeng; Zhang, Jinqing; Chen, Shouhua

2018-01-01

Gastric cancer (GC) is the fifth most common cancer and the third leading cause of cancer-related deaths worldwide. The high mortality might be attributed to delay in detection and is closely related to lymph node metastasis. Therefore, it is of great importance to explore the mechanism of lymph node metastasis and find strategies to block GC metastasis. Messenger RNA (mRNA), microRNA (miRNA) and long non-coding RNA (lncRNA) expression data and clinical data were downloaded from The Cancer Genome Atlas (TCGA) database. A total of 908 differentially expressed factors with variance >0.5 including 542 genes, 42 miRNA, and 324 lncRNA were screened using significant analysis microarray algorithm, and interaction networks were constructed using these differentially expressed factors. Furthermore, we conducted functional modules analysis in the network, and found that yellow and turquoise modules could separate samples efficiently. The groups classified in the yellow and turquoise modules had a significant difference in survival time, which was verified in another independent GC mRNA dataset (GSE62254). The results suggested that differentially expressed factors in the yellow and turquoise modules may participate in lymph node metastasis of GC and could be applied as potential biomarkers or therapeutic targets for GC. PMID:29489999
A co-expression gene network associated with developmental regulation of apple fruit acidity.

PubMed

Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Xu, Kenong

2015-08-01

Apple fruit acidity, which affects the fruit's overall taste and flavor to a large extent, is primarily determined by the concentration of malic acid. Previous studies demonstrated that the major QTL malic acid (Ma) on chromosome 16 is largely responsible for fruit acidity variations in apple. Recent advances suggested that a natural mutation that gives rise to a premature stop codon in one of the two aluminum-activated malate transporter (ALMT)-like genes (called Ma1) is the genetic causal element underlying Ma. However, the natural mutation does not explain the developmental changes of fruit malate levels in a given genotype. Using RNA-seq data from the fruit of 'Golden Delicious' taken at 14 developmental stages from 1 week after full-bloom (WAF01) to harvest (WAF20), we characterized their transcriptomes in groups of high (12.2 ± 1.6 mg/g fw, WAF03-WAF08), mid (7.4 ± 0.5 mg/g fw, WAF01-WAF02 and WAF10-WAF14) and low (5.4 ± 0.4 mg/g fw, WAF16-WAF20) malate concentrations. Detailed analyses showed that a set of 3,066 genes (including Ma1) were expressed not only differentially (P FDR < 0.05) between the high and low malate groups (or between the early and late developmental stages) but also in significant (P < 0.05) correlation with malate concentrations. The 3,066 genes fell in 648 MapMan (sub-) bins or functional classes, and 19 of them were significantly (P FDR < 0.05) co-enriched or co-suppressed in a malate dependent manner. Network inferring using the 363 genes encompassed in the 19 (sub-) bins, identified a major co-expression network of 239 genes. Since the 239 genes were also differentially expressed between the early (WAF03-WAF08) and late (WAF16-WAF20) developmental stages, the major network was considered to be associated with developmental regulation of apple fruit acidity in 'Golden Delicious'.
Polarization in the social sciences: Assortative mixing in social science collaboration networks is resilient to interventions

NASA Astrophysics Data System (ADS)

Leifeld, Philip

2018-10-01

Academic collaboration in the social sciences is characterized by a polarization between hermeneutic and nomological researchers. This polarization is expressed in different publication strategies. The present article analyzes the complete co-authorship networks in a social science discipline in two separate countries over five years using an exponential random graph model. It examines whether and how assortative mixing in publication strategies is present and leads to a polarization in scientific collaboration. In the empirical analysis, assortative mixing is found to play a role in shaping the topology of the network and significantly explains collaboration. Co-authorship edges are more prevalent within each of the groups, but this mixing pattern does not fully account for the extent of polarization. Instead, a thought experiment reveals that other components of the complex system dampen or amplify polarization in the data-generating process and that microscopic interventions targeting behavior change with regard to assortativity would be hindered by the resilience of the system. The resilience to interventions is quantified in a series of simulations on the effect of microscopic behavior on macroscopic polarization. The empirical study controls for geographic proximity, supervision, and topical similarity (using a vector space model), and the interplay of these factors is likely responsible for this resilience. The paper also predicts the co-authorship network in one country based on the model of collaborations in the other country.
Analysis of electric power industry restructuring

NASA Astrophysics Data System (ADS)

Al-Agtash, Salem Yahya

1998-10-01

This thesis evaluates alternative structures of the electric power industry in a competitive environment. One structure is based on the principle of creating a mandatory power pool to foster competition and manage system economics. The structure is PoolCo (pool coordination). A second structure is based on the principle of allowing independent multilateral trading and decentralized market coordination. The structure is DecCo (decentralized coordination). The criteria I use to evaluate these two structures are: economic efficiency, system reliability and freedom of choice. Economic efficiency evaluation considers strategic behavior of individual generators as well as behavioral variations of different classes of consumers. A supply-function equilibria model is characterized for deriving bidding strategies of competing generators under PoolCo. It is shown that asymmetric equilibria can exist within the capacities of generators. An augmented Lagrangian approach is introduced to solve iteratively for global optimal operations schedules. Under DecCo, the process involves solving iteratively for system operations schedules. The schedules reflect generators strategic behavior and brokers' interactions for arranging profitable trades, allocating losses and managing network congestion. In the determination of PoolCo and DecCo operations schedules, overall costs of power generation (start-up and shut-down costs and availability of hydro electric power) as well as losses and costs of transmission network are considered. For system reliability evaluation, I examine the effect of PoolCo and DecCo operating conditions on the system security. Random component failure perturbations are generated to simulate the actual system behavior. This is done using Monte Carlo simulation. Freedom of choice evaluation accounts for schemes' beneficial opportunities and capabilities to respond to consumers expressed preferences. An IEEE 24-bus test system is used to illustrate the concepts developed for economic efficiency evaluation. The system was tested over two years time period. The results indicate 2.6684 and 2.7269 percent of efficiency loss on average for PoolCo and DecCo, respectively. These values, however, do not represent forecasts of efficiency losses of PoolCo- and DecCo-based competitive industries. Rather, they are illustrations of the efficiency losses for the given IEEE test system and based on the modeling assumptions underlying framework development.
Predicting gene regulatory networks of soybean nodulation from RNA-Seq transcriptome data.

PubMed

Zhu, Mingzhu; Dahmen, Jeremy L; Stacey, Gary; Cheng, Jianlin

2013-09-22

High-throughput RNA sequencing (RNA-Seq) is a revolutionary technique to study the transcriptome of a cell under various conditions at a systems level. Despite the wide application of RNA-Seq techniques to generate experimental data in the last few years, few computational methods are available to analyze this huge amount of transcription data. The computational methods for constructing gene regulatory networks from RNA-Seq expression data of hundreds or even thousands of genes are particularly lacking and urgently needed. We developed an automated bioinformatics method to predict gene regulatory networks from the quantitative expression values of differentially expressed genes based on RNA-Seq transcriptome data of a cell in different stages and conditions, integrating transcriptional, genomic and gene function data. We applied the method to the RNA-Seq transcriptome data generated for soybean root hair cells in three different development stages of nodulation after rhizobium infection. The method predicted a soybean nodulation-related gene regulatory network consisting of 10 regulatory modules common for all three stages, and 24, 49 and 70 modules separately for the first, second and third stage, each containing both a group of co-expressed genes and several transcription factors collaboratively controlling their expression under different conditions. 8 of 10 common regulatory modules were validated by at least two kinds of validations, such as independent DNA binding motif analysis, gene function enrichment test, and previous experimental data in the literature. We developed a computational method to reliably reconstruct gene regulatory networks from RNA-Seq transcriptome data. The method can generate valuable hypotheses for interpreting biological data and designing biological experiments such as ChIP-Seq, RNA interference, and yeast two hybrid experiments.
A powerful nonparametric method for detecting differentially co-expressed genes: distance correlation screening and edge-count test.

PubMed

Zhang, Qingyang

2018-05-16

Differential co-expression analysis, as a complement of differential expression analysis, offers significant insights into the changes in molecular mechanism of different phenotypes. A prevailing approach to detecting differentially co-expressed genes is to compare Pearson's correlation coefficients in two phenotypes. However, due to the limitations of Pearson's correlation measure, this approach lacks the power to detect nonlinear changes in gene co-expression which is common in gene regulatory networks. In this work, a new nonparametric procedure is proposed to search differentially co-expressed gene pairs in different phenotypes from large-scale data. Our computational pipeline consisted of two main steps, a screening step and a testing step. The screening step is to reduce the search space by filtering out all the independent gene pairs using distance correlation measure. In the testing step, we compare the gene co-expression patterns in different phenotypes by a recently developed edge-count test. Both steps are distribution-free and targeting nonlinear relations. We illustrate the promise of the new approach by analyzing the Cancer Genome Atlas data and the METABRIC data for breast cancer subtypes. Compared with some existing methods, the new method is more powerful in detecting nonlinear type of differential co-expressions. The distance correlation screening can greatly improve computational efficiency, facilitating its application to large data sets.
Estimating the spatial distribution of soil moisture based on Bayesian maximum entropy method with auxiliary data from remote sensing

NASA Astrophysics Data System (ADS)

Gao, Shengguo; Zhu, Zhongli; Liu, Shaomin; Jin, Rui; Yang, Guangchao; Tan, Lei

2014-10-01

Soil moisture (SM) plays a fundamental role in the land-atmosphere exchange process. Spatial estimation based on multi in situ (network) data is a critical way to understand the spatial structure and variation of land surface soil moisture. Theoretically, integrating densely sampled auxiliary data spatially correlated with soil moisture into the procedure of spatial estimation can improve its accuracy. In this study, we present a novel approach to estimate the spatial pattern of soil moisture by using the BME method based on wireless sensor network data and auxiliary information from ASTER (Terra) land surface temperature measurements. For comparison, three traditional geostatistic methods were also applied: ordinary kriging (OK), which used the wireless sensor network data only, regression kriging (RK) and ordinary co-kriging (Co-OK) which both integrated the ASTER land surface temperature as a covariate. In Co-OK, LST was linearly contained in the estimator, in RK, estimator is expressed as the sum of the regression estimate and the kriged estimate of the spatially correlated residual, but in BME, the ASTER land surface temperature was first retrieved as soil moisture based on the linear regression, then, the t-distributed prediction interval (PI) of soil moisture was estimated and used as soft data in probability form. The results indicate that all three methods provide reasonable estimations. Co-OK, RK and BME can provide a more accurate spatial estimation by integrating the auxiliary information Compared to OK. RK and BME shows more obvious improvement compared to Co-OK, and even BME can perform slightly better than RK. The inherent issue of spatial estimation (overestimation in the range of low values and underestimation in the range of high values) can also be further improved in both RK and BME. We can conclude that integrating auxiliary data into spatial estimation can indeed improve the accuracy, BME and RK take better advantage of the auxiliary information compared to Co-OK, and BME outperforms RK by integrating the auxiliary data in a probability form.
Internal evaluation of the European network for health technology assessment project.

PubMed

Håheim, Lise Lund; Imaz, Iñaki; Loud, Marlène Läubli; Gasparetto, Teresa; González-Enriquez, Jesús; Dahlgren, Helena; Trofimovs, Igor; Berti, Elena; Mørland, Berit

2009-12-01

The internal evaluation studied the development of the European network for Health Technology Assessment (EUnetHTA) Project in achieving the general objective of establishing an effective and a sustainable network of health technology assessment (HTA) in Europe. The Work Package 3 group was dedicated to this task and performed the work. Information on activities during the project was collected from three sources. First, three yearly cross-sectional studies surveyed the participants' opinions. Responses were by individuals or by institutions. The last round included surveys to the Steering Committee, the Stakeholder Forum, and the Secretariat. Second, the Work Package Lead Partners were interviewed bi-annually, five times in total, to update the information on the Project's progress. Third, additional information was sought in available documents. The organizational structure remained stable. The Project succeeded in developing tools aimed at providing common methodology with intent to establish a standard of conducting and reporting HTA and to facilitate greater collaboration among agencies. The participants/agencies expressed their belief in a network and in maintaining local/national autonomy. The Work Package Leaders expressed a strong belief in the solid base of the Project for a future network on which to build, but were aware of the need for funding and governmental support. Participants and Work Package Leaders have expressed support for a future network that will improve national and international collaboration in HTA based on the experience from the EUnetHTA project.
Structure and transcriptional regulation of the major intrinsic protein gene family in grapevine.

PubMed

Wong, Darren Chern Jan; Zhang, Li; Merlin, Isabelle; Castellarin, Simone D; Gambetta, Gregory A

2018-04-11

The major intrinsic protein (MIP) family is a family of proteins, including aquaporins, which facilitate water and small molecule transport across plasma membranes. In plants, MIPs function in a huge variety of processes including water transport, growth, stress response, and fruit development. In this study, we characterize the structure and transcriptional regulation of the MIP family in grapevine, describing the putative genome duplication events leading to the family structure and characterizing the family's tissue and developmental specific expression patterns across numerous preexisting microarray and RNAseq datasets. Gene co-expression network (GCN) analyses were carried out across these datasets and the promoters of each family member were analyzed for cis-regulatory element structure in order to provide insight into their transcriptional regulation. A total of 29 Vitis vinifera MIP family members (excluding putative pseudogenes) were identified of which all but two were mapped onto Vitis vinifera chromosomes. In this study, segmental duplication events were identified for five plasma membrane intrinsic protein (PIP) and four tonoplast intrinsic protein (TIP) genes, contributing to the expansion of PIPs and TIPs in grapevine. Grapevine MIP family members have distinct tissue and developmental expression patterns and hierarchical clustering revealed two primary groups regardless of the datasets analyzed. Composite microarray and RNA-seq gene co-expression networks (GCNs) highlighted the relationships between MIP genes and functional categories involved in cell wall modification and transport, as well as with other MIPs revealing a strong co-regulation within the family itself. Some duplicated MIP family members have undergone sub-functionalization and exhibit distinct expression patterns and GCNs. Cis-regulatory element (CRE) analyses of the MIP promoters and their associated GCN members revealed enrichment for numerous CREs including AP2/ERFs and NACs. Combining phylogenetic analyses, gene expression profiling, gene co-expression network analyses, and cis-regulatory element enrichment, this study provides a comprehensive overview of the structure and transcriptional regulation of the grapevine MIP family. The study highlights the duplication and sub-functionalization of the family, its strong coordinated expression with genes involved in growth and transport, and the putative classes of TFs responsible for its regulation.
Understanding developmental and adaptive cues in pine through metabolite profiling and co-expression network analysis

PubMed Central

Cañas, Rafael A.; Canales, Javier; Muñoz-Hernández, Carmen; Granados, Jose M.; Ávila, Concepción; García-Martín, María L.; Cánovas, Francisco M.

2015-01-01

Conifers include long-lived evergreen trees of great economic and ecological importance, including pines and spruces. During their long lives conifers must respond to seasonal environmental changes, adapt to unpredictable environmental stresses, and co-ordinate their adaptive adjustments with internal developmental programmes. To gain insights into these responses, we examined metabolite and transcriptomic profiles of needles from naturally growing 25-year-old maritime pine (Pinus pinaster L. Aiton) trees over a year. The effect of environmental parameters such as temperature and rain on needle development were studied. Our results show that seasonal changes in the metabolite profiles were mainly affected by the needles’ age and acclimation for winter, but changes in transcript profiles were mainly dependent on climatic factors. The relative abundance of most transcripts correlated well with temperature, particularly for genes involved in photosynthesis or winter acclimation. Gene network analysis revealed relationships between 14 co-expressed gene modules and development and adaptation to environmental stimuli. Novel Myb transcription factors were identified as candidate regulators during needle development. Our systems-based analysis provides integrated data of the seasonal regulation of maritime pine growth, opening new perspectives for understanding the complex regulatory mechanisms underlying conifers’ adaptive responses. Taken together, our results suggest that the environment regulates the transcriptome for fine tuning of the metabolome during development. PMID:25873654
Identification and functional analysis of long non-coding RNAs in human and mouse early embryos based on single-cell transcriptome data

PubMed Central

Qiu, Jia-jun; Ren, Zhao-rui; Yan, Jing-bin

2016-01-01

Epigenetics regulations have an important role in fertilization and proper embryonic development, and several human diseases are associated with epigenetic modification disorders, such as Rett syndrome, Beckwith-Wiedemann syndrome and Angelman syndrome. However, the dynamics and functions of long non-coding RNAs (lncRNAs), one type of epigenetic regulators, in human pre-implantation development have not yet been demonstrated. In this study, a comprehensive analysis of human and mouse early-stage embryonic lncRNAs was performed based on public single-cell RNA sequencing data. Expression profile analysis revealed that lncRNAs are expressed in a developmental stage–specific manner during human early-stage embryonic development, whereas a more temporal-specific expression pattern was identified in mouse embryos. Weighted gene co-expression network analysis suggested that lncRNAs involved in human early-stage embryonic development are associated with several important functions and processes, such as oocyte maturation, zygotic genome activation and mitochondrial functions. We also found that the network of lncRNAs involved in zygotic genome activation was highly preservative between human and mouse embryos, whereas in other stages no strong correlation between human and mouse embryo was observed. This study provides insight into the molecular mechanism underlying lncRNA involvement in human pre-implantation embryonic development. PMID:27542205
Discovering Condition-Specific Gene Co-Expression Patterns Using Gaussian Mixture Models: A Cancer Case Study.

PubMed

Ficklin, Stephen P; Dunwoodie, Leland J; Poehlman, William L; Watson, Christopher; Roche, Kimberly E; Feltus, F Alex

2017-08-17

A gene co-expression network (GCN) describes associations between genes and points to genetic coordination of biochemical pathways. However, genetic correlations in a GCN are only detectable if they are present in the sampled conditions. With the increasing quantity of gene expression samples available in public repositories, there is greater potential for discovery of genetic correlations from a variety of biologically interesting conditions. However, even if gene correlations are present, their discovery can be masked by noise. Noise is introduced from natural variation (intrinsic and extrinsic), systematic variation (caused by sample measurement protocols and instruments), and algorithmic and statistical variation created by selection of data processing tools. A variety of published studies, approaches and methods attempt to address each of these contributions of variation to reduce noise. Here we describe an approach using Gaussian Mixture Models (GMMs) to address natural extrinsic (condition-specific) variation during network construction from mixed input conditions. To demonstrate utility, we build and analyze a condition-annotated GCN from a compendium of 2,016 mixed gene expression data sets from five tumor subtypes obtained from The Cancer Genome Atlas. Our results show that GMMs help discover tumor subtype specific gene co-expression patterns (modules) that are significantly enriched for clinical attributes.
Characterizing mutation-expression network relationships in multiple cancers.

PubMed

Ghazanfar, Shila; Yang, Jean Yee Hwa

2016-08-01

Data made available through large cancer consortia like The Cancer Genome Atlas make for a rich source of information to be studied across and between cancers. In recent years, network approaches have been applied to such data in uncovering the complex interrelationships between mutational and expression profiles, but lack direct testing for expression changes via mutation. In this pan-cancer study we analyze mutation and gene expression information in an integrative manner by considering the networks generated by testing for differences in expression in direct association with specific mutations. We relate our findings among the 19 cancers examined to identify commonalities and differences as well as their characteristics. Using somatic mutation and gene expression information across 19 cancers, we generated mutation-expression networks per cancer. On evaluation we found that our generated networks were significantly enriched for known cancer-related genes, such as skin cutaneous melanoma (p<0.01 using Network of Cancer Genes 4.0). Our framework identified that while different cancers contained commonly mutated genes, there was little concordance between associated gene expression changes among cancers. Comparison between cancers showed a greater overlap of network nodes for cancers with higher overall non-silent mutation load, compared to those with a lower overall non-silent mutation load. This study offers a framework that explores network information through co-analysis of somatic mutations and gene expression profiles. Our pan-cancer application of this approach suggests that while mutations are frequently common among cancer types, the impact they have on the surrounding networks via gene expression changes varies. Despite this finding, there are some cancers for which mutation-associated network behaviour appears to be similar: suggesting a potential framework for uncovering related cancers for which similar therapeutic strategies may be applicable. Our framework for understanding relationships among cancers has been integrated into an interactive R Shiny application, PAn Cancer Mutation Expression Networks (PACMEN), containing dynamic and static network visualization of the mutation-expression networks. PACMEN also features tools for further examination of network topology characteristics among cancers. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genomic survey, expression profile and co-expression network analysis of OsWD40 family in rice

PubMed Central

2012-01-01

Background WD40 proteins represent a large family in eukaryotes, which have been involved in a broad spectrum of crucial functions. Systematic characterization and co-expression analysis of OsWD40 genes enable us to understand the networks of the WD40 proteins and their biological processes and gene functions in rice. Results In this study, we identify and analyze 200 potential OsWD40 genes in rice, describing their gene structures, genome localizations, and evolutionary relationship of each member. Expression profiles covering the whole life cycle in rice has revealed that transcripts of OsWD40 were accumulated differentially during vegetative and reproductive development and preferentially up or down-regulated in different tissues. Under phytohormone treatments, 25 OsWD40 genes were differentially expressed with treatments of one or more of the phytohormone NAA, KT, or GA3 in rice seedlings. We also used a combined analysis of expression correlation and Gene Ontology annotation to infer the biological role of the OsWD40 genes in rice. The results suggested that OsWD40 genes may perform their diverse functions by complex network, thus were predictive for understanding their biological pathways. The analysis also revealed that OsWD40 genes might interact with each other to take part in metabolic pathways, suggesting a more complex feedback network. Conclusions All of these analyses suggest that the functions of OsWD40 genes are diversified, which provide useful references for selecting candidate genes for further functional studies. PMID:22429805
Engineering intracellular active transport systems as in vivo biomolecular tools.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bachand, George David; Carroll-Portillo, Amanda

2006-11-01

Active transport systems provide essential functions in terms of cell physiology and metastasis. These systems, however, are also co-opted by invading viruses, enabling directed transport of the virus to and from the cell's nucleus (i.e., the site of virus replication). Based on this concept, fundamentally new approaches for interrogating and manipulating the inner workings of living cells may be achievable by co-opting Nature's active transport systems as an in vivo biomolecular tool. The overall goal of this project was to investigate the ability to engineer kinesin-based transport systems for in vivo applications, specifically the collection of effector proteins (e.g., transcriptionalmore » regulators) within single cells. In the first part of this project, a chimeric fusion protein consisting of kinesin and a single chain variable fragment (scFv) of an antibody was successfully produced through a recombinant expression system. The kinesin-scFv retained both catalytic and antigenic functionality, enabling selective capture and transport of target antigens. The incorporation of a rabbit IgG-specific scFv into the kinesin established a generalized system for functionalizing kinesin with a wide range of target-selective antibodies raised in rabbits. The second objective was to develop methods of isolating the intact microtubule network from live cells as a platform for evaluating kinesin-based transport within the cytoskeletal architecture of a cell. Successful isolation of intact microtubule networks from two distinct cell types was demonstrated using glutaraldehyde and methanol fixation methods. This work provides a platform for inferring the ability of kinesin-scFv to function in vivo, and may also serve as a three-dimensional scaffold for evaluating and exploiting kinesin-based transport for nanotechnological applications. Overall, the technology developed in this project represents a first-step in engineering active transport system for in vivo applications. Further development could potentially enable selective capture of intracellular antigens, targeted delivery of therapeutic agents, or disruption of the transport systems and consequently the infection and pathogenesis cycle of biothreat agents.« less
Co-expression analysis identifies CRC and AP1 the regulator of Arabidopsis fatty acid biosynthesis.

PubMed

Han, Xinxin; Yin, Linlin; Xue, Hongwei

2012-07-01

Fatty acids (FAs) play crucial rules in signal transduction and plant development, however, the regulation of FA metabolism is still poorly understood. To study the relevant regulatory network, fifty-eight FA biosynthesis genes including de novo synthases, desaturases and elongases were selected as "guide genes" to construct the co-expression network. Calculation of the correlation between all Arabidopsis thaliana (L.) genes with each guide gene by Arabidopsis co-expression dating mining tools (ACT) identifies 797 candidate FA-correlated genes. Gene ontology (GO) analysis of these co-expressed genes showed they are tightly correlated to photosynthesis and carbohydrate metabolism, and function in many processes. Interestingly, 63 transcription factors (TFs) were identified as candidate FA biosynthesis regulators and 8 TF families are enriched. Two TF genes, CRC and AP1, both correlating with 8 FA guide genes, were further characterized. Analyses of the ap1 and crc mutant showed the altered total FA composition of mature seeds. The contents of palmitoleic acid, stearic acid, arachidic acid and eicosadienoic acid are decreased, whereas that of oleic acid is increased in ap1 and crc seeds, which is consistent with the qRT-PCR analysis revealing the suppressed expression of the corresponding guide genes. In addition, yeast one-hybrid analysis and electrophoretic mobility shift assay (EMSA) revealed that CRC can bind to the promoter regions of KCS7 and KCS15, indicating that CRC may directly regulate FA biosynthesis. © 2012 Institute of Botany, Chinese Academy of Sciences.
Comprehensive gene and microRNA expression profiling on cardiovascular system in zebrafish co-exposured of SiNPs and MeHg.

PubMed

Hu, Hejing; Shi, Yanfeng; Zhang, Yannan; Wu, Jing; Asweto, Collins Otieno; Feng, Lin; Yang, Xiaozhe; Duan, Junchao; Sun, Zhiwei

2017-12-31

Air pollution has been shown to increase cardiovascular diseases. However, little attention has been paid to the combined effects of PM and air pollutants on the cardiovascular system. To explore this, a high-throughput sequencing technology was used to determine combined effects of silica nanoparticles (SiNPs) and MeHg in zebrafish. Our study demonstrated that SiNPs and MeHg co-exposure could cause significant changes in mRNA and miRNA expression patterns in zebrafish. The differentially expressed (DE) genes in profiles 17 and 26 of STC analysis suggest that SiNPs and MeHg co-exposure had more proinflammatory and cardiovascular toxicity in zebrafish than single exposure. Major gene functions associated with cardiovascular system in the co-exposed zebrafish were discerned from the dynamic-gene-network, including stxbp1a, celf4, ahr1b and bai2. In addition, the prominently expressed pathway of cardiac muscle contraction was targeted by 3 DE miRNAs identified by the miRNA-pathway-network (dre-miR-7147, dre-miR-26a and dre-miR-375), which included 23 DE genes. This study presents a global view of the combined SiNPs and MeHg toxicity on the dynamic expression of both mRNAs and miRNAs in zebrafish, and could serve as fundamental research clues for future studies, especially on cardiovascular system toxicity. Copyright © 2017 Elsevier B.V. All rights reserved.
Machine Learning–Based Differential Network Analysis: A Study of Stress-Responsive Transcriptomes in Arabidopsis[W

PubMed Central

Ma, Chuang; Xin, Mingming; Feldmann, Kenneth A.; Wang, Xiangfeng

2014-01-01

Machine learning (ML) is an intelligent data mining technique that builds a prediction model based on the learning of prior knowledge to recognize patterns in large-scale data sets. We present an ML-based methodology for transcriptome analysis via comparison of gene coexpression networks, implemented as an R package called machine learning–based differential network analysis (mlDNA) and apply this method to reanalyze a set of abiotic stress expression data in Arabidopsis thaliana. The mlDNA first used a ML-based filtering process to remove nonexpressed, constitutively expressed, or non-stress-responsive “noninformative” genes prior to network construction, through learning the patterns of 32 expression characteristics of known stress-related genes. The retained “informative” genes were subsequently analyzed by ML-based network comparison to predict candidate stress-related genes showing expression and network differences between control and stress networks, based on 33 network topological characteristics. Comparative evaluation of the network-centric and gene-centric analytic methods showed that mlDNA substantially outperformed traditional statistical testing–based differential expression analysis at identifying stress-related genes, with markedly improved prediction accuracy. To experimentally validate the mlDNA predictions, we selected 89 candidates out of the 1784 predicted salt stress–related genes with available SALK T-DNA mutagenesis lines for phenotypic screening and identified two previously unreported genes, mutants of which showed salt-sensitive phenotypes. PMID:24520154

Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

PubMed

Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

2009-02-01

Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.
Cannabis, cigarettes, and their co-occurring use: disentangling differences in default mode network functional connectivity

PubMed Central

Wetherill, Reagan R.; Fang, Zhuo; Jagannathan, Kanchana; Childress, Anna Rose; Rao, Hengyi; Franklin, Teresa R.

2015-01-01

Background Resting-state functional connectivity is a noninvasive, neuroimaging method for assessing neural network function. Altered functional connectivity among regions of the default-mode network have been associated with both nicotine and cannabis use; however, less is known about co-occurring cannabis and tobacco use. Methods We used posterior cingulate cortex (PCC) seed-based resting-state functional connectivity analyses to examine default mode network (DMN) connectivity strength differences between four groups: 1) individuals diagnosed with cannabis dependence who do not smoke tobacco (n=19; ages 20–50), 2) cannabis-dependent individuals who smoke tobacco (n=23, ages 21–52), 3) cannabis-naïve, nicotine-dependent individuals who smoke tobacco (n=24, ages 21–57), and 4) cannabis- and tobacco-naïve healthy controls (n=21, ages 21–50), controlling for age, sex, and alcohol use. We also explored associations between connectivity strength and measures of cannabis and tobacco use. Results PCC seed-based analyses identified the core nodes of the DMN (i.e., PCC, medial prefrontal cortex, inferior parietal cortex, and temporal cortex). In general, the cannabis-dependent, nicotine-dependent, and co-occurring use groups showed lower DMN connectivity strengths than controls, with unique group differences in connectivity strength between the PCC and the cerebellum, medial prefrontal cortex, parahippocampus, and anterior insula. In cannabis-dependent individuals, PCC-right anterior insula connectivity strength correlated with duration of cannabis use. Conclusions This study extends previous research that independently examined the differences in resting-state functional connectivity among individuals who smoke cannabis and tobacco by including an examination of co-occurring cannabis and tobacco use and provides further evidence that cannabis and tobacco exposure is associated with alterations in DMN connectivity. PMID:26094186
Systematic identification and comparison of expressed profiles of lncRNAs and circRNAs with associated co-expression and ceRNA networks in mouse germline stem cells

PubMed Central

Wu, Ji

2017-01-01

Accumulating evidence indicates that long noncoding RNAs (lncRNAs) and circular RNAs (circRNAs) involve in germ cell development. However, little is known about the functions and mechanisms of lncRNAs and circRNAs in self-renewal and differentiation of germline stem cells. Therefore, we explored the expression profiles of mRNAs, lncRNAs, and circRNAs in male and female mouse germline stem cells by high-throughput sequencing. We identified 18573 novel lncRNAs and 18822 circRNAs in the germline stem cells and further confirmed the existence of these lncRNAs and circRNAs by RT-PCR. The results showed that male and female germline stem cells had similar GDNF signaling mechanism. Subsequently, 8115 mRNAs, 3996 lncRNAs, and 921 circRNAs exhibited sex-biased expression that may be associated with germline stem cell acquisition of the sex-specific properties required for differentiation into gametes. Gene Ontology (GO) and KEGG pathway enrichment analyses revealed different functions for these sex-biased lncRNAs and circRNAs. We further constructed correlated expression networks including coding–noncoding co-expression and competing endogenous RNAs with bioinformatics. Co-expression analysis showed hundreds of lncRNAs were correlated with sex differences in mouse germline stem cells, including lncRNA Gm11851, lncRNA Gm12840, lncRNA 4930405O22Rik, and lncRNA Atp10d. CeRNA network inferred that lncRNA Meg3 and cirRNA Igf1r could bind competitively with miRNA-15a-5p increasing target gene Inha, Acsl3, Kif21b, and Igfbp2 expressions. These findings provide novel perspectives on lncRNAs and circRNAs and lay a foundation for future research into the regulating mechanisms of lncRNAs and circRNAs in germline stem cells. PMID:28404936
A generalized locomotion CPG architecture based on oscillatory building blocks.

PubMed

Yang, Zhijun; França, Felipe M G

2003-07-01

Neural oscillation is one of the most extensively investigated topics of artificial neural networks. Scientific approaches to the functionalities of both natural and artificial intelligences are strongly related to mechanisms underlying oscillatory activities. This paper concerns itself with the assumption of the existence of central pattern generators (CPGs), which are the plausible neural architectures with oscillatory capabilities, and presents a discrete and generalized approach to the functionality of locomotor CPGs of legged animals. Based on scheduling by multiple edge reversal (SMER), a primitive and deterministic distributed algorithm, it is shown how oscillatory building block (OBB) modules can be created and, hence, how OBB-based networks can be formulated as asymmetric Hopfield-like neural networks for the generation of complex coordinated rhythmic patterns observed among pairs of biological motor neurons working during different gait patterns. It is also shown that the resulting Hopfield-like network possesses the property of reproducing the whole spectrum of different gaits intrinsic to the target locomotor CPGs. Although the new approach is not restricted to the understanding of the neurolocomotor system of any particular animal, hexapodal and quadrupedal gait patterns are chosen as illustrations given the wide interest expressed by the ongoing research in the area.
Applying thermosettable zwitterionic copolymers as general fouling-resistant and thermal-tolerant biomaterial interfaces.

PubMed

Chou, Ying-Nien; Chang, Yung; Wen, Ten-Chin

2015-05-20

We introduced a thermosettable zwitterionic copolymer to design a high temperature tolerance biomaterial as a general antifouling polymer interface. The original synthetic fouling-resistant copolymer, poly(vinylpyrrolidone)-co-poly(sulfobetaine methacrylate) (poly(VP-co-SBMA)), is both thermal-tolerant and fouling-resistant, and the antifouling stability of copolymer coated interfaces can be effectively controlled by regulating the VP/SBMA composition ratio. We studied poly(VP-co-SBMA) copolymer gels and networks with a focus on their general resistance to protein, cell, and bacterial bioadhesion, as influenced by the thermosetting process. Interestingly, we found that the shape of the poly(VP-co-SBMA) copolymer material can be set at a high annealing temperature of 200 °C while maintaining good antifouling properties. However, while the zwitterionic PSBMA polymer gels were bioinert as expected, control of the fouling resistance of the PSBMA polymer networks was lost in the high temperature annealing process. A poly(VP-co-SBMA) copolymer network composed of PSBMA segments at 32 mol % showed reduced fibrinogen adsorption, tissue cell adhesion, and bacterial attachment, but a relatively higher PSBMA content of 61 mol % was required to optimize resistance to platelet adhesion and erythrocyte attachment to confer hemocompatibility to human blood. We suggest that poly(VP-co-SBMA) copolymers capable of retaining stable fouling resistance after high temperature shaping have a potential application as thermosettable materials in a bioinert interface for medical devices, such as the thermosettable coating on a stainless steel blood-compatible metal stent investigated in this study.
Co-expression of mitosis-regulating genes contributes to malignant progression and prognosis in oligodendrogliomas

PubMed Central

Liu, Yanwei; Hu, Huimin; Zhang, Chuanbao; Wang, Haoyuan; Zhang, Wenlong; Wang, Zheng; Li, Mingyang; Zhang, Wei; Zhou, Dabiao; Jiang, Tao

2015-01-01

The clinical prognosis of patients with glioma is determined by tumor grades, but tumors of different subtypes with equal malignancy grade usually have different prognosis that is largely determined by genetic abnormalities. Oligodendrogliomas (ODs) are the second most common type of gliomas. In this study, integrative analyses found that distribution of TCGA transcriptomic subtypes was associated with grade progression in ODs. To identify critical gene(s) associated with tumor grades and TCGA subtypes, we analyzed 34 normal brain tissue (NBT), 146 WHO grade II and 130 grade III ODs by microarray and RNA sequencing, and identified a co-expression network of six genes (AURKA, NDC80,CENPK, KIAA0101, TIMELESS and MELK) that was associated with tumor grades and TCGA subtypes as well as Ki-67 expression. Validation of the six genes was performed by qPCR in additional 28 ODs. Importantly, these genes also were validated in four high-grade recurrent gliomas and the initial lower-grade gliomas resected from the same patients. Finally, the RNA data on two genes with the highest discrimination potential (AURKA and NDC80) and Ki-67 were validated on an independent cohort (5 NBTs and 86 ODs) by immunohistochemistry. Knockdown of AURKA and NDC80 by siRNAs suppressed Ki-67 expression and proliferation of gliomas cells. Survival analysis showed that high expression of the six genes corporately indicated a poor survival outcome. Correlation and protein interaction analysis provided further evidence for this co-expression network. These data suggest that the co-expression of the six mitosis-regulating genes was associated with malignant progression and prognosis in ODs. PMID:26468983
circlncRNAnet: an integrated web-based resource for mapping functional networks of long or circular forms of noncoding RNAs.

PubMed

Wu, Shao-Min; Liu, Hsuan; Huang, Po-Jung; Chang, Ian Yi-Feng; Lee, Chi-Ching; Yang, Chia-Yu; Tsai, Wen-Sy; Tan, Bertrand Chin-Ming

2018-01-01

Despite their lack of protein-coding potential, long noncoding RNAs (lncRNAs) and circular RNAs (circRNAs) have emerged as key determinants in gene regulation, acting to fine-tune transcriptional and signaling output. These noncoding RNA transcripts are known to affect expression of messenger RNAs (mRNAs) via epigenetic and post-transcriptional regulation. Given their widespread target spectrum, as well as extensive modes of action, a complete understanding of their biological relevance will depend on integrative analyses of systems data at various levels. While a handful of publicly available databases have been reported, existing tools do not fully capture, from a network perspective, the functional implications of lncRNAs or circRNAs of interest. Through an integrated and streamlined design, circlncRNAnet aims to broaden the understanding of ncRNA candidates by testing in silico several hypotheses of ncRNA-based functions, on the basis of large-scale RNA-seq data. This web server is implemented with several features that represent advances in the bioinformatics of ncRNAs: (1) a flexible framework that accepts and processes user-defined next-generation sequencing-based expression data; (2) multiple analytic modules that assign and productively assess the regulatory networks of user-selected ncRNAs by cross-referencing extensively curated databases; (3) an all-purpose, information-rich workflow design that is tailored to all types of ncRNAs. Outputs on expression profiles, co-expression networks and pathways, and molecular interactomes, are dynamically and interactively displayed according to user-defined criteria. In short, users may apply circlncRNAnet to obtain, in real time, multiple lines of functionally relevant information on circRNAs/lncRNAs of their interest. In summary, circlncRNAnet provides a "one-stop" resource for in-depth analyses of ncRNA biology. circlncRNAnet is freely available at http://app.cgu.edu.tw/circlnc/. © The Authors 2017. Published by Oxford University Press.
Construction of differential mRNA-lncRNA crosstalk networks based on ceRNA hypothesis uncover key roles of lncRNAs implicated in esophageal squamous cell carcinoma

PubMed Central

Li, Yixue

2016-01-01

Increasing evidence has indicated that lncRNAs acting as competing endogenous RNAs (ceRNAs) play crucial roles in tumorigenesis, metastasis and diagnosis of cancer. However, the function of lncRNAs as ceRNAs involved in esophageal squamous cell carcinoma (ESCC) is still largely unknown. In this study, clinical implications of two intrinsic subtypes of ESCC were identified based on expression profiles of lncRNA and mRNA. ESCC subtype-specific differential co-expression networks between mRNAs and lncRNAs were constructed to reveal dynamic changes of their crosstalks mediated by miRNAs during tumorigenesis. Several well-known cancer-associated lncRNAs as the hubs of the two networks were firstly proposed in ESCC. Based on the ceRNA mechanism, we illustrated that the“loss” of miR-186-mediated PVT1-mRNA and miR-26b-mediated LINC00240-mRNA crosstalks were related to the two ESCC subtypes respectively. In addition, crosstalks between LINC00152 and EGFR, LINC00240 and LOX gene family were identified, which were associated with the function of “response to wounding” and “extracellular matrix-receptor interaction”. Furthermore, functional cooperation of multiple lncRNAs was discovered in the two differential mRNA-lncRNA crosstalk networks. These together systematically uncovered the roles of lncRNAs as ceRNAs implicated in ESCC. PMID:27966444
Integrated in silico analyses of regulatory and metabolic networks of Synechococcus sp. PCC 7002 reveal relationships between gene centrality and essentiality

DOE PAGES

Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.; ...

2015-03-27

Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termedmore » as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.« less
Integrated in silico analyses of regulatory and metabolic networks of Synechococcus sp. PCC 7002 reveal relationships between gene centrality and essentiality

DOE Office of Scientific and Technical Information (OSTI.GOV)

Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.

Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termedmore » as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.« less
MiRNA and TF co-regulatory network analysis for the pathology and recurrence of myocardial infarction.

PubMed

Lin, Ying; Sibanda, Vusumuzi Leroy; Zhang, Hong-Mei; Hu, Hui; Liu, Hui; Guo, An-Yuan

2015-04-13

Myocardial infarction (MI) is a leading cause of death in the world and many genes are involved in it. Transcription factor (TFs) and microRNAs (miRNAs) are key regulators of gene expression. We hypothesized that miRNAs and TFs might play combinatory regulatory roles in MI. After collecting MI candidate genes and miRNAs from various resources, we constructed a comprehensive MI-specific miRNA-TF co-regulatory network by integrating predicted and experimentally validated TF and miRNA targets. We found some hub nodes (e.g. miR-16 and miR-26) in this network are important regulators, and the network can be severed as a bridge to interpret the associations of previous results, which is shown by the case of miR-29 in this study. We also constructed a regulatory network for MI recurrence and found several important genes (e.g. DAB2, BMP6, miR-320 and miR-103), the abnormal expressions of which may be potential regulatory mechanisms and markers of MI recurrence. At last we proposed a cellular model to discuss major TF and miRNA regulators with signaling pathways in MI. This study provides more details on gene expression regulation and regulators involved in MI progression and recurrence. It also linked up and interpreted many previous results.
Synergistic Modification Induced Specific Recognition between Histone and TRIM24 via Fluctuation Correlation Network Analysis

NASA Astrophysics Data System (ADS)

Zhang, Jinmai; Luo, Huajie; Liu, Hao; Ye, Wei; Luo, Ray; Chen, Hai-Feng

2016-04-01

Histone modification plays a key role in gene regulation and gene expression. TRIM24 as a histone reader can recognize histone modification. However the specific recognition mechanism between TRIM24 and histone modification is unsolved. Here, systems biology method of dynamics correlation network based on molecular dynamics simulation was used to answer the question. Our network analysis shows that the dynamics correlation network of H3K23ac is distinctly different from that of wild type and other modifications. A hypothesis of “synergistic modification induced recognition” is then proposed to link histone modification and TRIM24 binding. These observations were further confirmed from community analysis of networks with mutation and network perturbation. Finally, a possible recognition pathway is also identified based on the shortest path search for H3K23ac. Significant difference of recognition pathway was found among different systems due to methylation and acetylation modifications. The analysis presented here and other studies show that the dynamic network-based analysis might be a useful general strategy to study the biology of protein post-translational modification and associated recognition.
Facial expression recognition based on improved deep belief networks

NASA Astrophysics Data System (ADS)

Wu, Yao; Qiu, Weigen

2017-08-01

In order to improve the robustness of facial expression recognition, a method of face expression recognition based on Local Binary Pattern (LBP) combined with improved deep belief networks (DBNs) is proposed. This method uses LBP to extract the feature, and then uses the improved deep belief networks as the detector and classifier to extract the LBP feature. The combination of LBP and improved deep belief networks is realized in facial expression recognition. In the JAFFE (Japanese Female Facial Expression) database on the recognition rate has improved significantly.
Linking Proteomic and Transcriptional Data through the Interactome and Epigenome Reveals a Map of Oncogene-induced Signaling

PubMed Central

Huang, Shao-shan Carol; Clarke, David C.; Gosline, Sara J. C.; Labadorf, Adam; Chouinard, Candace R.; Gordon, William; Lauffenburger, Douglas A.; Fraenkel, Ernest

2013-01-01

Cellular signal transduction generally involves cascades of post-translational protein modifications that rapidly catalyze changes in protein-DNA interactions and gene expression. High-throughput measurements are improving our ability to study each of these stages individually, but do not capture the connections between them. Here we present an approach for building a network of physical links among these data that can be used to prioritize targets for pharmacological intervention. Our method recovers the critical missing links between proteomic and transcriptional data by relating changes in chromatin accessibility to changes in expression and then uses these links to connect proteomic and transcriptome data. We applied our approach to integrate epigenomic, phosphoproteomic and transcriptome changes induced by the variant III mutation of the epidermal growth factor receptor (EGFRvIII) in a cell line model of glioblastoma multiforme (GBM). To test the relevance of the network, we used small molecules to target highly connected nodes implicated by the network model that were not detected by the experimental data in isolation and we found that a large fraction of these agents alter cell viability. Among these are two compounds, ICG-001, targeting CREB binding protein (CREBBP), and PKF118–310, targeting β-catenin (CTNNB1), which have not been tested previously for effectiveness against GBM. At the level of transcriptional regulation, we used chromatin immunoprecipitation sequencing (ChIP-Seq) to experimentally determine the genome-wide binding locations of p300, a transcriptional co-regulator highly connected in the network. Analysis of p300 target genes suggested its role in tumorigenesis. We propose that this general method, in which experimental measurements are used as constraints for building regulatory networks from the interactome while taking into account noise and missing data, should be applicable to a wide range of high-throughput datasets. PMID:23408876
Frontotemporal dementia: insights into the biological underpinnings of disease through gene co-expression network analysis.

PubMed

Ferrari, Raffaele; Forabosco, Paola; Vandrovcova, Jana; Botía, Juan A; Guelfi, Sebastian; Warren, Jason D; Momeni, Parastoo; Weale, Michael E; Ryten, Mina; Hardy, John

2016-02-24

In frontotemporal dementia (FTD) there is a critical lack in the understanding of biological and molecular mechanisms involved in disease pathogenesis. The heterogeneous genetic features associated with FTD suggest that multiple disease-mechanisms are likely to contribute to the development of this neurodegenerative condition. We here present a systems biology approach with the scope of i) shedding light on the biological processes potentially implicated in the pathogenesis of FTD and ii) identifying novel potential risk factors for FTD. We performed a gene co-expression network analysis of microarray expression data from 101 individuals without neurodegenerative diseases to explore regional-specific co-expression patterns in the frontal and temporal cortices for 12 genes (MAPT, GRN, CHMP2B, CTSC, HLA-DRA, TMEM106B, C9orf72, VCP, UBQLN2, OPTN, TARDBP and FUS) associated with FTD and we then carried out gene set enrichment and pathway analyses, and investigated known protein-protein interactors (PPIs) of FTD-genes products. Gene co-expression networks revealed that several FTD-genes (such as MAPT and GRN, CTSC and HLA-DRA, TMEM106B, and C9orf72, VCP, UBQLN2 and OPTN) were clustering in modules of relevance in the frontal and temporal cortices. Functional annotation and pathway analyses of such modules indicated enrichment for: i) DNA metabolism, i.e. transcription regulation, DNA protection and chromatin remodelling (MAPT and GRN modules); ii) immune and lysosomal processes (CTSC and HLA-DRA modules), and; iii) protein meta/catabolism (C9orf72, VCP, UBQLN2 and OPTN, and TMEM106B modules). PPI analysis supported the results of the functional annotation and pathway analyses. This work further characterizes known FTD-genes and elaborates on their biological relevance to disease: not only do we indicate likely impacted regional-specific biological processes driven by FTD-genes containing modules, but also do we suggest novel potential risk factors among the FTD-genes interactors as targets for further mechanistic characterization in hypothesis driven cell biology work.
Nuclear Receptor Rev-erb Alpha (Nr1d1) Functions in Concert with Nr2e3 to Regulate Transcriptional Networks in the Retina

PubMed Central

Mollema, Nissa J.; Yuan, Yang; Jelcick, Austin S.; Sachs, Andrew J.; von Alpen, Désirée; Schorderet, Daniel; Escher, Pascal; Haider, Neena B.

2011-01-01

The majority of diseases in the retina are caused by genetic mutations affecting the development and function of photoreceptor cells. The transcriptional networks directing these processes are regulated by genes such as nuclear hormone receptors. The nuclear hormone receptor gene Rev-erb alpha/Nr1d1 has been widely studied for its role in the circadian cycle and cell metabolism, however its role in the retina is unknown. In order to understand the role of Rev-erb alpha/Nr1d1 in the retina, we evaluated the effects of loss of Nr1d1 to the developing retina and its co-regulation with the photoreceptor-specific nuclear receptor gene Nr2e3 in the developing and mature retina. Knock-down of Nr1d1 expression in the developing retina results in pan-retinal spotting and reduced retinal function by electroretinogram. Our studies show that NR1D1 protein is co-expressed with NR2E3 in the outer neuroblastic layer of the developing mouse retina. In the adult retina, NR1D1 is expressed in the ganglion cell layer and is co-expressed with NR2E3 in the outer nuclear layer, within rods and cones. Several genes co-targeted by NR2E3 and NR1D1 were identified that include: Nr2c1, Recoverin, Rgr, Rarres2, Pde8a, and Nupr1. We examined the cyclic expression of Nr1d1 and Nr2e3 over a twenty-four hour period and observed that both nuclear receptors cycle in a similar manner. Taken together, these studies reveal a novel role for Nr1d1, in conjunction with its cofactor Nr2e3, in regulating transcriptional networks critical for photoreceptor development and function. PMID:21408158
Directed evolution to re-adapt a co-evolved network within an enzyme.

PubMed

Strafford, John; Payongsri, Panwajee; Hibbert, Edward G; Morris, Phattaraporn; Batth, Sukhjeet S; Steadman, David; Smith, Mark E B; Ward, John M; Hailes, Helen C; Dalby, Paul A

2012-01-01

We have previously used targeted active-site saturation mutagenesis to identify a number of transketolase single mutants that improved activity towards either glycolaldehyde (GA), or the non-natural substrate propionaldehyde (PA). Here, all attempts to recombine the singles into double mutants led to unexpected losses of specific activity towards both substrates. A typical trade-off occurred between soluble expression levels and specific activity for all single mutants, but many double mutants decreased both properties more severely suggesting a critical loss of protein stability or native folding. Statistical coupling analysis (SCA) of a large multiple sequence alignment revealed a network of nine co-evolved residues that affected all but one double mutant. Such networks maintain important functional properties such as activity, specificity, folding, stability, and solubility and may be rapidly disrupted by introducing one or more non-naturally occurring mutations. To identify variants of this network that would accept and improve upon our best D469 mutants for activity towards PA, we created a library of random single, double and triple mutants across seven of the co-evolved residues, combining our D469 variants with only naturally occurring mutations at the remaining sites. A triple mutant cluster at D469, E498 and R520 was found to behave synergistically for the specific activity towards PA. Protein expression was severely reduced by E498D and improved by R520Q, yet variants containing both mutations led to improved specific activity and enzyme expression, but with loss of solubility and the formation of inclusion bodies. D469S and R520Q combined synergistically to improve k(cat) 20-fold for PA, more than for any previous transketolase mutant. R520Q also doubled the specific activity of the previously identified D469T to create our most active transketolase mutant to date. Our results show that recombining active-site mutants obtained by saturation mutagenesis can rapidly destabilise critical networks of co-evolved residues, whereas beneficial single mutants can be retained and improved upon by randomly recombining them with natural variants at other positions in the network. Copyright © 2011 Elsevier B.V. All rights reserved.
Prediction of miRNA-mRNA associations in Alzheimer's disease mice using network topology.

PubMed

Noh, Haneul; Park, Charny; Park, Soojun; Lee, Young Seek; Cho, Soo Young; Seo, Hyemyung

2014-08-03

Little is known about the relationship between miRNA and mRNA expression in Alzheimer's disease (AD) at early- or late-symptomatic stages. Sequence-based target prediction algorithms and anti-correlation profiles have been applied to predict miRNA targets using omics data, but this approach often leads to false positive predictions. Here, we applied the joint profiling analysis of mRNA and miRNA expression levels to Tg6799 AD model mice at 4 and 8 months of age using a network topology-based method. We constructed gene regulatory networks and used the PageRank algorithm to predict significant interactions between miRNA and mRNA. In total, 8 cluster modules were predicted by the transcriptome data for co-expression networks of AD pathology. In total, 54 miRNAs were identified as being differentially expressed in AD. Among these, 50 significant miRNA-mRNA interactions were predicted by integrating sequence target prediction, expression analysis, and the PageRank algorithm. We identified a set of miRNA-mRNA interactions that were changed in the hippocampus of Tg6799 AD model mice. We determined the expression levels of several candidate genes and miRNA. For functional validation in primary cultured neurons from Tg6799 mice (MT) and littermate (LM) controls, the overexpression of ARRDC3 enhanced PPP1R3C expression. ARRDC3 overexpression showed the tendency to decrease the expression of miR139-5p and miR3470a in both LM and MT primary cells. Pathological environment created by Aβ treatment increased the gene expression of PPP1R3C and Sfpq but did not significantly alter the expression of miR139-5p or miR3470a. Aβ treatment increased the promoter activity of ARRDC3 gene in LM primary cells but not in MT primary cells. Our results demonstrate AD-specific changes in the miRNA regulatory system as well as the relationship between the expression levels of miRNAs and their targets in the hippocampus of Tg6799 mice. These data help further our understanding of the function and mechanism of various miRNAs and their target genes in the molecular pathology of AD.
Human umbilical vein endothelial cells synergize osteo/odontogenic differentiation of periodontal ligament stem cells in 3D cell sheets.

PubMed

Pandula, P K C Prgeeth; Samaranayake, L P; Jin, L J; Zhang, C F

2014-06-01

To investigate the expression of osteo/odontogenic differentiation markers and vascular network formation in a 3D cell sheet with varying cell ratios of periodontal ligament stem cells (PDLSCs) and human umbilical vein endothelial cells (HUVECs). Human PDLSCs were isolated and characterized by flow cytometry, and co-cultured with HUVECs for the construction of cell sheets. Both types of cells were seeded on temperature-responsive culture dishes with PDLSCs alone, HUVECs alone and various ratios of the latter cells (1 : 1, 2 : 1, 5 : 1 and 1 : 5) to obtain confluent cell sheets. The expressions of osteo/odontogenic pathway markers, including alkaline phosphatase (ALP), bone sialoprotein (BSP) and runt-related transcription factor 2 (RUNX2), were analyzed at 3 and 7 d using RT-PCR. Further ALP protein quantification was performed at 7 and 14 d using ALP assay. The calcium nodule formation was assessed qualitatively and quantitatively by alizarin red assay. Histological evaluations of three cell sheet constructs treated with different combinations (PDLSC-PDLSC-PDLSC/PDLSC-HUVEC-PDLSC/co-culture-co-culture-co-culture) were performed with hematoxylin and eosin and immunofluorescence staining. Statistical analysis was performed using t-test (p < 0.05). Significantly higher ALP gene expression was observed at 3 d in 1 : 1 (PDLSC-HUVEC) (2.52 ± 0.67) and 5 : 1 (4.05 ± 1.07) co-culture groups compared with other groups (p < 0.05); this was consistent with ALP protein quantification. However, the expression of BSP and RUNX2 genes was higher at 7 d compared to 3 d. Significant calcium mineralization was detected as quantified by alizarin red assay at 14 d in 1 : 1 (1323.55 ± 6.54 μm) and 5 : 1 (994.67 ± 4.15 μm) co-cultures as compared with monoculture cell sheets (p < 0.05). Hematoxylin and eosin and CD31 immunostaining clearly exemplified the development of a layered cell sheet structure with endothelial cell islands within the constructed PDLSC-HUVEC-PDLSC and co-culture groups. Furthermore, HUVECs invaded the layered cell sheet, suggestive of rudimentary vascular network initiation. This study suggests that the PDLSC-HUVEC co-culture, cell sheet, model exhibits significantly high levels of osteo/odontogenic markers with signs of initial vascular formation. This novel 3D cell sheet-based approach may be potentially beneficial for periodontal regenerative therapy. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
A Formal Analysis of Cytokine Networks in Chronic Fatigue Syndrome

PubMed Central

Broderick, Gordon; Fuite, Jim; Kreitz, Andrea; Vernon, Suzanne D; Klimas, Nancy; Fletcher, Mary Ann

2010-01-01

Chronic Fatigue Syndrome (CFS) is a complex illness affecting 4 million Americans for which no characteristic lesion has been identified. Instead of searching for a deficiency in any single marker, we propose that CFS is associated with a profound imbalance in the regulation of immune function forcing a departure from standard preprogrammed responses. To identify these imbalances we apply network analysis to the co-expression of 16 cytokines in CFS subjects and healthy controls. Concentrations of IL-1a, 1b, 2, 4, 5, 6, 8, 10, 12, 13, 15, 17 and 23, IFN-γ, lymphotoxin-α (LT-α) and TNF-α were measured in the plasma of 40 female CFS and 59 case-matched controls. Cytokine co-expression networks were constructed from the pair-wise mutual information (MI) patterns found within each subject group. These networks differed in topology significantly more than expected by chance with the CFS network being more hub-like in design. Analysis of local modularity isolated statistically distinct cytokine communities recognizable as pre-programmed immune functional components. These showed highly attenuated Th1 and Th17 immune responses in CFS. High Th2 marker expression but weak interaction patterns pointed to an established Th2 inflammatory milieu. Similarly, altered associations in CFS provided indirect evidence of diminished NK cell responsiveness to IL-12 and LTα stimulus. These observations are consistent with several processes active in latent viral infection and would not have been uncovered by assessing marker expression alone. Furthermore this analysis identifies key subnetworks such as IL-2:IFNγ:TNFα that might be targeted in restoring normal immune function. PMID:20447453

Improving the measurement of semantic similarity by combining gene ontology and co-functional network: a random walk based approach.

PubMed

Peng, Jiajie; Zhang, Xuanshuo; Hui, Weiwei; Lu, Junya; Li, Qianqian; Liu, Shuhui; Shang, Xuequn

2018-03-19

Gene Ontology (GO) is one of the most popular bioinformatics resources. In the past decade, Gene Ontology-based gene semantic similarity has been effectively used to model gene-to-gene interactions in multiple research areas. However, most existing semantic similarity approaches rely only on GO annotations and structure, or incorporate only local interactions in the co-functional network. This may lead to inaccurate GO-based similarity resulting from the incomplete GO topology structure and gene annotations. We present NETSIM2, a new network-based method that allows researchers to measure GO-based gene functional similarities by considering the global structure of the co-functional network with a random walk with restart (RWR)-based method, and by selecting the significant term pairs to decrease the noise information. Based on the EC number (Enzyme Commission)-based groups of yeast and Arabidopsis, evaluation test shows that NETSIM2 can enhance the accuracy of Gene Ontology-based gene functional similarity. Using NETSIM2 as an example, we found that the accuracy of semantic similarities can be significantly improved after effectively incorporating the global gene-to-gene interactions in the co-functional network, especially on the species that gene annotations in GO are far from complete.
Inferring gene regression networks with model trees

PubMed Central

2010-01-01

Background Novel strategies are required in order to handle the huge amount of data produced by microarray technologies. To infer gene regulatory networks, the first step is to find direct regulatory relationships between genes building the so-called gene co-expression networks. They are typically generated using correlation statistics as pairwise similarity measures. Correlation-based methods are very useful in order to determine whether two genes have a strong global similarity but do not detect local similarities. Results We propose model trees as a method to identify gene interaction networks. While correlation-based methods analyze each pair of genes, in our approach we generate a single regression tree for each gene from the remaining genes. Finally, a graph from all the relationships among output and input genes is built taking into account whether the pair of genes is statistically significant. For this reason we apply a statistical procedure to control the false discovery rate. The performance of our approach, named REGNET, is experimentally tested on two well-known data sets: Saccharomyces Cerevisiae and E.coli data set. First, the biological coherence of the results are tested. Second the E.coli transcriptional network (in the Regulon database) is used as control to compare the results to that of a correlation-based method. This experiment shows that REGNET performs more accurately at detecting true gene associations than the Pearson and Spearman zeroth and first-order correlation-based methods. Conclusions REGNET generates gene association networks from gene expression data, and differs from correlation-based methods in that the relationship between one gene and others is calculated simultaneously. Model trees are very useful techniques to estimate the numerical values for the target genes by linear regression functions. They are very often more precise than linear regression models because they can add just different linear regressions to separate areas of the search space favoring to infer localized similarities over a more global similarity. Furthermore, experimental results show the good performance of REGNET. PMID:20950452
The development of a network for community-based obesity prevention: the CO-OPS Collaboration

PubMed Central

2011-01-01

Background Community-based interventions are a promising approach and an important component of a comprehensive response to obesity. In this paper we describe the Collaboration of COmmunity-based Obesity Prevention Sites (CO-OPS Collaboration) in Australia as an example of a collaborative network to enhance the quality and quantity of obesity prevention action at the community level. The core aims of the CO-OPS Collaboration are to: identify and analyse the lessons learned from a range of community-based initiatives aimed at tackling obesity, and; to identify the elements that make community-based obesity prevention initiatives successful and share the knowledge gained with other communities. Methods Key activities of the collaboration to date have included the development of a set of Best Practice Principles and knowledge translation and exchange activities to promote the application (or use) of evidence, evaluation and analysis in practice. Results The establishment of the CO-OPS Collaboration is a significant step toward strengthening action in this area, by bringing together research, practice and policy expertise to promote best practice, high quality evaluation and knowledge translation and exchange. Future development of the network should include facilitation of further evidence generation and translation drawing from process, impact and outcome evaluation of existing community-based interventions. Conclusions The lessons presented in this paper may help other networks like CO-OPS as they emerge around the globe. It is important that networks integrate with each other and share the experience of creating these networks. PMID:21349185
Elastic theory of origami-based metamaterials

NASA Astrophysics Data System (ADS)

Brunck, V.; Lechenault, F.; Reid, A.; Adda-Bedia, M.

2016-03-01

Origami offers the possibility for new metamaterials whose overall mechanical properties can be programed by acting locally on each crease. Starting from a thin plate and having knowledge about the properties of the material and the folding procedure, one would like to determine the shape taken by the structure at rest and its mechanical response. In this article, we introduce a vector deformation field acting on the imprinted network of creases that allows us to express the geometrical constraints of rigid origami structures in a simple and systematic way. This formalism is then used to write a general covariant expression of the elastic energy of n -creases meeting at a single vertex. Computations of the equilibrium states are then carried out explicitly in two special cases: the generalized waterbomb base and the Miura-Ori. For the waterbomb, we show a generic bistability for any number of creases. For the Miura folding, however, we uncover a phase transition from monostable to bistable states that explains the efficient deployability of this structure for a given range of geometrical and mechanical parameters. Moreover, the analysis shows that geometric frustration induces residual stresses in origami structures that should be taken into account in determining their mechanical response. This formalism can be extended to a general crease network, ordered or otherwise, and so opens new perspectives for the mechanics and the physics of origami-based metamaterials.
The Role of Retinal Determination Gene Network (RDGN) in Hormone Signaling Transduction and Prostate Tumorigenesis

DTIC Science & Technology

2012-10-01

support with our hypothesis, expressions of AR co-repressors (48-50), HDAC1, HDAC3 or SirT1 inhibit the ligand-induced AR activation at different...signaling and androgen-dependent growth. We hypothesis that DACH1/Six1/Eya pathway is an endogenous regulator of AR trans- activation and contributes to...mechanism. Inhibitory function of Eya1 on AR transactivation required a phosphates activity and could be enhanced by ectopic expression of co-repressors
The WRKY transcription factor family and senescence in switchgrass.

PubMed

Rinerson, Charles I; Scully, Erin D; Palmer, Nathan A; Donze-Reiner, Teresa; Rabara, Roel C; Tripathi, Prateek; Shen, Qingxi J; Sattler, Scott E; Rohila, Jai S; Sarath, Gautam; Rushton, Paul J

2015-11-09

Early aerial senescence in switchgrass (Panicum virgatum) can significantly limit biomass yields. WRKY transcription factors that can regulate senescence could be used to reprogram senescence and enhance biomass yields. All potential WRKY genes present in the version 1.0 of the switchgrass genome were identified and curated using manual and bioinformatic methods. Expression profiles of WRKY genes in switchgrass flag leaf RNA-Seq datasets were analyzed using clustering and network analyses tools to identify both WRKY and WRKY-associated gene co-expression networks during leaf development and senescence onset. We identified 240 switchgrass WRKY genes including members of the RW5 and RW6 families of resistance proteins. Weighted gene co-expression network analysis of the flag leaf transcriptomes across development readily separated clusters of co-expressed genes into thirteen modules. A visualization highlighted separation of modules associated with the early and senescence-onset phases of flag leaf growth. The senescence-associated module contained 3000 genes including 23 WRKYs. Putative promoter regions of senescence-associated WRKY genes contained several cis-element-like sequences suggestive of responsiveness to both senescence and stress signaling pathways. A phylogenetic comparison of senescence-associated WRKY genes from switchgrass flag leaf with senescence-associated WRKY genes from other plants revealed notable hotspots in Group I, IIb, and IIe of the phylogenetic tree. We have identified and named 240 WRKY genes in the switchgrass genome. Twenty three of these genes show elevated mRNA levels during the onset of flag leaf senescence. Eleven of the WRKY genes were found in hotspots of related senescence-associated genes from multiple species and thus represent promising targets for future switchgrass genetic improvement. Overall, individual WRKY gene expression profiles could be readily linked to developmental stages of flag leaves.
SpidermiR: An R/Bioconductor Package for Integrative Analysis with miRNA Data.

PubMed

Cava, Claudia; Colaprico, Antonio; Bertoli, Gloria; Graudenzi, Alex; Silva, Tiago C; Olsen, Catharina; Noushmehr, Houtan; Bontempi, Gianluca; Mauri, Giancarlo; Castiglioni, Isabella

2017-01-27

Gene Regulatory Networks (GRNs) control many biological systems, but how such network coordination is shaped is still unknown. GRNs can be subdivided into basic connections that describe how the network members interact e.g., co-expression, physical interaction, co-localization, genetic influence, pathways, and shared protein domains. The important regulatory mechanisms of these networks involve miRNAs. We developed an R/Bioconductor package, namely SpidermiR, which offers an easy access to both GRNs and miRNAs to the end user, and integrates this information with differentially expressed genes obtained from The Cancer Genome Atlas. Specifically, SpidermiR allows the users to: (i) query and download GRNs and miRNAs from validated and predicted repositories; (ii) integrate miRNAs with GRNs in order to obtain miRNA-gene-gene and miRNA-protein-protein interactions, and to analyze miRNA GRNs in order to identify miRNA-gene communities; and (iii) graphically visualize the results of the analyses. These analyses can be performed through a single interface and without the need for any downloads. The full data sets are then rapidly integrated and processed locally.
Finding gene regulatory network candidates using the gene expression knowledge base.

PubMed

Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

2014-12-10

Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.
Circular RNAs are long-lived and display only minimal early alterations in response to a growth factor

PubMed Central

Enuka, Yehoshua; Lauriola, Mattia; Feldman, Morris E.; Sas-Chen, Aldema; Ulitsky, Igor; Yarden, Yosef

2016-01-01

Circular RNAs (circRNAs) are widespread circles of non-coding RNAs with largely unknown function. Because stimulation of mammary cells with the epidermal growth factor (EGF) leads to dynamic changes in the abundance of coding and non-coding RNA molecules, and culminates in the acquisition of a robust migratory phenotype, this cellular model might disclose functions of circRNAs. Here we show that circRNAs of EGF-stimulated mammary cells are stably expressed, while mRNAs and microRNAs change within minutes. In general, the circRNAs we detected are relatively long-lived and weakly expressed. Interestingly, they are almost ubiquitously co-expressed with the corresponding linear transcripts, and the respective, shared promoter regions are more active compared to genes producing linear isoforms with no detectable circRNAs. These findings imply that altered abundance of circRNAs, unlike changes in the levels of other RNAs, might not play critical roles in signaling cascades and downstream transcriptional networks that rapidly commit cells to specific outcomes. PMID:26657629
Chilling Affects Phytohormone and Post-Embryonic Development Pathways during Bud Break and Fruit Set in Apple (Malus domestica Borkh.)

PubMed Central

Kumar, Gulshan; Gupta, Khushboo; Pathania, Shivalika; Swarnkar, Mohit Kumar; Rattan, Usha Kumari; Singh, Gagandeep; Sharma, Ram Kumar; Singh, Anil Kumar

2017-01-01

The availability of sufficient chilling during bud dormancy plays an important role in the subsequent yield and quality of apple fruit, whereas, insufficient chilling availability negatively impacts the apple production. The transcriptome profiling during bud dormancy release and initial fruit set under low and high chill conditions was performed using RNA-seq. The comparative high number of differentially expressed genes during bud break and fruit set under high chill condition indicates that chilling availability was associated with transcriptional reorganization. The comparative analysis reveals the differential expression of genes involved in phytohormone metabolism, particularly for Abscisic acid, gibberellic acid, ethylene, auxin and cytokinin. The expression of Dormancy Associated MADS-box, Flowering Locus C-like, Flowering Locus T-like and Terminal Flower 1-like genes was found to be modulated under differential chilling. The co-expression network analysis indentified two high chill specific modules that were found to be enriched for “post-embryonic development” GO terms. The network analysis also identified hub genes including Early flowering 7, RAF10, ZEP4 and F-box, which may be involved in regulating chilling-mediated dormancy release and fruit set. The results of transcriptome and co-expression network analysis indicate that chilling availability majorly regulates phytohormone-related pathways and post-embryonic development during bud break. PMID:28198417
Mathematical Model of a Telomerase Transcriptional Regulatory Network Developed by Cell-Based Screening: Analysis of Inhibitor Effects and Telomerase Expression Mechanisms

PubMed Central

Bilsland, Alan E.; Stevenson, Katrina; Liu, Yu; Hoare, Stacey; Cairney, Claire J.; Roffey, Jon; Keith, W. Nicol

2014-01-01

Cancer cells depend on transcription of telomerase reverse transcriptase (TERT). Many transcription factors affect TERT, though regulation occurs in context of a broader network. Network effects on telomerase regulation have not been investigated, though deeper understanding of TERT transcription requires a systems view. However, control over individual interactions in complex networks is not easily achievable. Mathematical modelling provides an attractive approach for analysis of complex systems and some models may prove useful in systems pharmacology approaches to drug discovery. In this report, we used transfection screening to test interactions among 14 TERT regulatory transcription factors and their respective promoters in ovarian cancer cells. The results were used to generate a network model of TERT transcription and to implement a dynamic Boolean model whose steady states were analysed. Modelled effects of signal transduction inhibitors successfully predicted TERT repression by Src-family inhibitor SU6656 and lack of repression by ERK inhibitor FR180204, results confirmed by RT-QPCR analysis of endogenous TERT expression in treated cells. Modelled effects of GSK3 inhibitor 6-bromoindirubin-3′-oxime (BIO) predicted unstable TERT repression dependent on noise and expression of JUN, corresponding with observations from a previous study. MYC expression is critical in TERT activation in the model, consistent with its well known function in endogenous TERT regulation. Loss of MYC caused complete TERT suppression in our model, substantially rescued only by co-suppression of AR. Interestingly expression was easily rescued under modelled Ets-factor gain of function, as occurs in TERT promoter mutation. RNAi targeting AR, JUN, MXD1, SP3, or TP53, showed that AR suppression does rescue endogenous TERT expression following MYC knockdown in these cells and SP3 or TP53 siRNA also cause partial recovery. The model therefore successfully predicted several aspects of TERT regulation including previously unknown mechanisms. An extrapolation suggests that a dominant stimulatory system may programme TERT for transcriptional stability. PMID:24550717
Genome-Wide Analysis of Long Noncoding RNA (lncRNA) Expression in Hepatoblastoma Tissues

PubMed Central

Xue, Ping; Cui, Ximao; Li, Kai; Zheng, Shan; He, Xianghuo; Dong, Kuiran

2014-01-01

Long noncoding RNAs (lncRNAs) have crucial roles in cancer biology. We performed a genome-wide analysis of lncRNA expression in hepatoblastoma tissues to identify novel targets for further study of hepatoblastoma. Hepatoblastoma and normal liver tissue samples were obtained from hepatoblastoma patients. The genome-wide analysis of lncRNA expression in these tissues was performed using a 4×180 K lncRNA microarray and Sureprint G3 Human lncRNA Chips. Quantitative RT-PCR (qRT-PCR) was performed to confirm these results. The differential expressions of lncRNAs and mRNAs were identified through fold-change filtering. Gene Ontology (GO) and pathway analyses were performed using the standard enrichment computation method. Associations between lncRNAs and adjacent protein-coding genes were determined through complex transcriptional loci analysis. We found that 2736 lncRNAs were differentially expressed in hepatoblastoma tissues. Among these, 1757 lncRNAs were upregulated more than two-fold relative to normal tissues and 979 lncRNAs were downregulated. Moreover, in hepatoblastoma there were 420 matched lncRNA-mRNA pairs for 120 differentially expressed lncRNAs, and 167 differentially expressed mRNAs. The co-expression network analysis predicted 252 network nodes and 420 connections between 120 lncRNAs and 132 coding genes. Within this co-expression network, 369 pairs were positive, and 51 pairs were negative. Lastly, qRT-PCR data verified six upregulated and downregulated lncRNAs in hepatoblastoma, plus endothelial cell-specific molecule 1 (ESM1) mRNA. Our results demonstrated that expression of these aberrant lncRNAs could respond to hepatoblastoma development. Further study of these lncRNAs could provide useful insight into hepatoblastoma biology. PMID:24465615
The evolution analysis of listed companies co-holding non-listed financial companies based on two-mode heterogeneous networks

NASA Astrophysics Data System (ADS)

An, Pengli; Li, Huajiao; Zhou, Jinsheng; Chen, Fan

2017-10-01

Complex network theory is a widely used tool in the empirical research of financial markets. Two-mode and multi-mode networks are new trends and represent new directions in that they can more accurately simulate relationships between entities. In this paper, we use data for Chinese listed companies holding non-listed financial companies over a ten-year period to construct two networks: a two-mode primitive network in which listed companies and non-listed financial companies are considered actors and events, respectively, and a one-mode network that is constructed based on the decreasing-mode method in which listed companies are considered nodes. We analyze the evolution of the listed company co-holding network from several perspectives, including that of the whole network, of information control ability, of implicit relationships, of community division and of small-world characteristics. The results of the analysis indicate that (1) China's developing stock market affects the share-holding condition of listed companies holding non-listed financial companies; (2) the information control ability of co-holding networks is focused on a few listed companies and the implicit relationship of investment preference between listed companies is determined by the co-holding behavior; (3) the community division of the co-holding network is increasingly obvious, as determined by the investment preferences among listed companies; and (4) the small-world characteristics of the co-holding network are increasingly obvious, resulting in reduced communication costs. In this paper, we conduct an evolution analysis and develop an understanding of the factors that influence the listed companies co-holding network. This study will help illuminate research on evolution analysis.
High-resolution gene expression data from blastoderm embryos of the scuttle fly Megaselia abdita

PubMed Central

Wotton, Karl R; Jiménez-Guri, Eva; Crombach, Anton; Cicin-Sain, Damjan; Jaeger, Johannes

2015-01-01

Gap genes are involved in segment determination during early development in dipteran insects (flies, midges, and mosquitoes). We carried out a systematic quantitative comparative analysis of the gap gene network across different dipteran species. Our work provides mechanistic insights into the evolution of this pattern-forming network. As a central component of our project, we created a high-resolution quantitative spatio-temporal data set of gap and maternal co-ordinate gene expression in the blastoderm embryo of the non-drosophilid scuttle fly, Megaselia abdita. Our data include expression patterns in both wild-type and RNAi-treated embryos. The data—covering 10 genes, 10 time points, and over 1,000 individual embryos—consist of original embryo images, quantified expression profiles, extracted positions of expression boundaries, and integrated expression patterns, plus metadata and intermediate processing steps. These data provide a valuable resource for researchers interested in the comparative study of gene regulatory networks and pattern formation, an essential step towards a more quantitative and mechanistic understanding of developmental evolution. PMID:25977812
Preserved Network Metrics across Translated Texts

NASA Astrophysics Data System (ADS)

Cabatbat, Josephine Jill T.; Monsanto, Jica P.; Tapang, Giovanni A.

2014-09-01

Co-occurrence language networks based on Bible translations and the Universal Declaration of Human Rights (UDHR) translations in different languages were constructed and compared with random text networks. Among the considered network metrics, the network size, N, the normalized betweenness centrality (BC), and the average k-nearest neighbors, knn, were found to be the most preserved across translations. Moreover, similar frequency distributions of co-occurring network motifs were observed for translated texts networks.
DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.

PubMed

Mistry, Divya; Wise, Roger P; Dickerson, Julie A

2017-01-01

Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. The python package can be obtained from git.io/diffslcpy. The R implementation and code to reproduce the analysis is available via git.io/diffslc.
Nonsynaptic glycine release is involved in the early KCC2 expression.

PubMed

Allain, Anne-Emilie; Cazenave, William; Delpy, Alain; Exertier, Prisca; Barthe, Christophe; Meyrand, Pierre; Cattaert, Daniel; Branchereau, Pascal

2016-07-01

The cation-chloride co-transporters are important regulators of the cellular Cl(-) homeostasis. Among them the Na(+) -K(+) -2Cl(-) co-transporter (NKCC1) is responsible for intracellular chloride accumulation in most immature brain structures, whereas the K(+) -Cl(-) co-transporter (KCC2) extrudes chloride from mature neurons, ensuring chloride-mediated inhibitory effects of GABA/glycine. We have shown that both KCC2 and NKCC1 are expressed at early embryonic stages (E11.5) in the ventral spinal cord (SC). The mechanisms by which KCC2 is prematurely expressed are unknown. In this study, we found that chronically blocking glycine receptors (GlyR) by strychnine led to a loss of KCC2 expression, without affecting NKCC1 level. This effect was not dependent on the firing of Na(+) action potentials but was mimicked by a Ca(2+) -dependent PKC blocker. Blocking the vesicular release of neurotransmitters did not impinge on strychnine effect whereas blocking volume-sensitive outwardly rectifying (VSOR) chloride channels reproduced the GlyR blockade, suggesting that KCC2 is controlled by a glycine release from progenitor radial cells in immature ventral spinal networks. Finally, we showed that the strychnine treatment prevented the maturation of rhythmic spontaneous activity. Thereby, the GlyR-activation is a necessary developmental process for the expression of functional spinal motor networks. © 2015 Wiley Periodicals, Inc. Develop Neurobiol 76: 764-779, 2016. © 2015 Wiley Periodicals, Inc.
An integrative systems genetics approach reveals potential causal genes and pathways related to obesity.

PubMed

Kogelman, Lisette J A; Zhernakova, Daria V; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N

2015-10-20

Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach. Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data. Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. Furthermore, we detected several obesity candidate genes, for example, ENPP1, CTSL, and ABHD12B. To our knowledge, this is the first study to perform an integrated genomics and transcriptomics (eQTL) study using, and modeling, genomic and subcutaneous adipose tissue RNA sequencing data on obesity in a porcine model. We detected several pathways and potential causal genes for obesity. Further validation and investigation may reveal their exact function and association with obesity.
Estimation of Dynamic Systems for Gene Regulatory Networks from Dependent Time-Course Data.

PubMed

Kim, Yoonji; Kim, Jaejik

2018-06-15

Dynamic system consisting of ordinary differential equations (ODEs) is a well-known tool for describing dynamic nature of gene regulatory networks (GRNs), and the dynamic features of GRNs are usually captured through time-course gene expression data. Owing to high-throughput technologies, time-course gene expression data have complex structures such as heteroscedasticity, correlations between genes, and time dependence. Since gene experiments typically yield highly noisy data with small sample size, for a more accurate prediction of the dynamics, the complex structures should be taken into account in ODE models. Hence, this study proposes an ODE model considering such data structures and a fast and stable estimation method for the ODE parameters based on the generalized profiling approach with data smoothing techniques. The proposed method also provides statistical inference for the ODE estimator and it is applied to a zebrafish retina cell network.
Tumor SHB gene expression affects disease characteristics in human acute myeloid leukemia.

PubMed

Jamalpour, Maria; Li, Xiujuan; Cavelier, Lucia; Gustafsson, Karin; Mostoslavsky, Gustavo; Höglund, Martin; Welsh, Michael

2017-10-01

The mouse Shb gene coding for the Src Homology 2-domain containing adapter protein B has recently been placed in context of BCRABL1-induced myeloid leukemia in mice and the current study was performed in order to relate SHB to human acute myeloid leukemia (AML). Publicly available AML databases were mined for SHB gene expression and patient survival. SHB gene expression was determined in the Uppsala cohort of AML patients by qPCR. Cell proliferation was determined after SHB gene knockdown in leukemic cell lines. Despite a low frequency of SHB gene mutations, many tumors overexpressed SHB mRNA compared with normal myeloid blood cells. AML patients with tumors expressing low SHB mRNA displayed longer survival times. A subgroup of AML exhibiting a favorable prognosis, acute promyelocytic leukemia (APL) with a PMLRARA translocation, expressed less SHB mRNA than AML tumors in general. When examining genes co-expressed with SHB in AML tumors, four other genes ( PAX5, HDAC7, BCORL1, TET1) related to leukemia were identified. A network consisting of these genes plus SHB was identified that relates to certain phenotypic characteristics, such as immune cell, vascular and apoptotic features. SHB knockdown in the APL PMLRARA cell line NB4 and the monocyte/macrophage cell line MM6 adversely affected proliferation, linking SHB gene expression to tumor cell expansion and consequently to patient survival. It is concluded that tumor SHB gene expression relates to AML survival and its subgroup APL. Moreover, this gene is included in a network of genes that plays a role for an AML phenotype exhibiting certain immune cell, vascular and apoptotic characteristics.

Molecular determinants of caste differentiation in the highly eusocial honeybee Apis mellifera.

PubMed

Barchuk, Angel R; Cristino, Alexandre S; Kucharski, Robert; Costa, Luciano F; Simões, Zilá L P; Maleszka, Ryszard

2007-06-18

In honeybees, differential feeding of female larvae promotes the occurrence of two different phenotypes, a queen and a worker, from identical genotypes, through incremental alterations, which affect general growth, and character state alterations that result in the presence or absence of specific structures. Although previous studies revealed a link between incremental alterations and differential expression of physiometabolic genes, the molecular changes accompanying character state alterations remain unknown. By using cDNA microarray analyses of >6,000 Apis mellifera ESTs, we found 240 differentially expressed genes (DEGs) between developing queens and workers. Many genes recorded as up-regulated in prospective workers appear to be unique to A. mellifera, suggesting that the workers' developmental pathway involves the participation of novel genes. Workers up-regulate more developmental genes than queens, whereas queens up-regulate a greater proportion of physiometabolic genes, including genes coding for metabolic enzymes and genes whose products are known to regulate the rate of mass-transforming processes and the general growth of the organism (e.g., tor). Many DEGs are likely to be involved in processes favoring the development of caste-biased structures, like brain, legs and ovaries, as well as genes that code for cytoskeleton constituents. Treatment of developing worker larvae with juvenile hormone (JH) revealed 52 JH responsive genes, specifically during the critical period of caste development. Using Gibbs sampling and Expectation Maximization algorithms, we discovered eight overrepresented cis-elements from four gene groups. Graph theory and complex networks concepts were adopted to attain powerful graphical representations of the interrelation between cis-elements and genes and objectively quantify the degree of relationship between these entities. We suggest that clusters of functionally related DEGs are co-regulated during caste development in honeybees. This network of interactions is activated by nutrition-driven stimuli in early larval stages. Our data are consistent with the hypothesis that JH is a key component of the developmental determination of queen-like characters. Finally, we propose a conceptual model of caste differentiation in A. mellifera based on gene-regulatory networks.
Molecular determinants of caste differentiation in the highly eusocial honeybee Apis mellifera

PubMed Central

Barchuk, Angel R; Cristino, Alexandre S; Kucharski, Robert; Costa, Luciano F; Simões, Zilá LP; Maleszka, Ryszard

2007-01-01

Background In honeybees, differential feeding of female larvae promotes the occurrence of two different phenotypes, a queen and a worker, from identical genotypes, through incremental alterations, which affect general growth, and character state alterations that result in the presence or absence of specific structures. Although previous studies revealed a link between incremental alterations and differential expression of physiometabolic genes, the molecular changes accompanying character state alterations remain unknown. Results By using cDNA microarray analyses of >6,000 Apis mellifera ESTs, we found 240 differentially expressed genes (DEGs) between developing queens and workers. Many genes recorded as up-regulated in prospective workers appear to be unique to A. mellifera, suggesting that the workers' developmental pathway involves the participation of novel genes. Workers up-regulate more developmental genes than queens, whereas queens up-regulate a greater proportion of physiometabolic genes, including genes coding for metabolic enzymes and genes whose products are known to regulate the rate of mass-transforming processes and the general growth of the organism (e.g., tor). Many DEGs are likely to be involved in processes favoring the development of caste-biased structures, like brain, legs and ovaries, as well as genes that code for cytoskeleton constituents. Treatment of developing worker larvae with juvenile hormone (JH) revealed 52 JH responsive genes, specifically during the critical period of caste development. Using Gibbs sampling and Expectation Maximization algorithms, we discovered eight overrepresented cis-elements from four gene groups. Graph theory and complex networks concepts were adopted to attain powerful graphical representations of the interrelation between cis-elements and genes and objectively quantify the degree of relationship between these entities. Conclusion We suggest that clusters of functionally related DEGs are co-regulated during caste development in honeybees. This network of interactions is activated by nutrition-driven stimuli in early larval stages. Our data are consistent with the hypothesis that JH is a key component of the developmental determination of queen-like characters. Finally, we propose a conceptual model of caste differentiation in A. mellifera based on gene-regulatory networks. PMID:17577409
Regulatory complexity revealed by integrated cytological and RNA-seq analyses of meiotic substages in mouse spermatocytes.

PubMed

Ball, Robyn L; Fujiwara, Yasuhiro; Sun, Fengyun; Hu, Jianjun; Hibbs, Matthew A; Handel, Mary Ann; Carter, Gregory W

2016-08-12

The continuous and non-synchronous nature of postnatal male germ-cell development has impeded stage-specific resolution of molecular events of mammalian meiotic prophase in the testis. Here the juvenile onset of spermatogenesis in mice is analyzed by combining cytological and transcriptomic data in a novel computational analysis that allows decomposition of the transcriptional programs of spermatogonia and meiotic prophase substages. Germ cells from testes of individual mice were obtained at two-day intervals from 8 to 18 days post-partum (dpp), prepared as surface-spread chromatin and immunolabeled for meiotic stage-specific protein markers (STRA8, SYCP3, phosphorylated H2AFX, and HISTH1T). Eight stages were discriminated cytologically by combinatorial antibody labeling, and RNA-seq was performed on the same samples. Independent principal component analyses of cytological and transcriptomic data yielded similar patterns for both data types, providing strong evidence for substage-specific gene expression signatures. A novel permutation-based maximum covariance analysis (PMCA) was developed to map co-expressed transcripts to one or more of the eight meiotic prophase substages, thereby linking distinct molecular programs to cytologically defined cell states. Expression of meiosis-specific genes is not substage-limited, suggesting regulation of substage transitions at other levels. This integrated analysis provides a general method for resolving complex cell populations. Here it revealed not only features of meiotic substage-specific gene expression, but also a network of substage-specific transcription factors and relationships to potential target genes.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Bonachea, Dan; Hargrove, P.

GASNet is a language-independent, low-level networking layer that provides network-independent, high-performance communication primitives tailored for implementing parallel global address space SPMD languages and libraries such as UPC, UPC++, Co-Array Fortran, Legion, Chapel, and many others. The interface is primarily intended as a compilation target and for use by runtime library writers (as opposed to end users), and the primary goals are high performance, interface portability, and expressiveness. GASNet stands for "Global-Address Space Networking".
The total carbon column observing network.

PubMed

Wunch, Debra; Toon, Geoffrey C; Blavier, Jean-François L; Washenfelder, Rebecca A; Notholt, Justus; Connor, Brian J; Griffith, David W T; Sherlock, Vanessa; Wennberg, Paul O

2011-05-28

A global network of ground-based Fourier transform spectrometers has been founded to remotely measure column abundances of CO(2), CO, CH(4), N(2)O and other molecules that absorb in the near-infrared. These measurements are directly comparable with the near-infrared total column measurements from space-based instruments. With stringent requirements on the instrumentation, acquisition procedures, data processing and calibration, the Total Carbon Column Observing Network (TCCON) achieves an accuracy and precision in total column measurements that is unprecedented for remote-sensing observations (better than 0.25% for CO(2)). This has enabled carbon-cycle science investigations using the TCCON dataset, and allows the TCCON to provide a link between satellite measurements and the extensive ground-based in situ network. © 2011 The Royal Society
Systems Genetic Analysis of Osteoblast-Lineage Cells

PubMed Central

Calabrese, Gina; Bennett, Brian J.; Orozco, Luz; Kang, Hyun M.; Eskin, Eleazar; Dombret, Carlos; De Backer, Olivier; Lusis, Aldons J.; Farber, Charles R.

2012-01-01

The osteoblast-lineage consists of cells at various stages of maturation that are essential for skeletal development, growth, and maintenance. Over the past decade, many of the signaling cascades that regulate this lineage have been elucidated; however, little is known of the networks that coordinate, modulate, and transmit these signals. Here, we identify a gene network specific to the osteoblast-lineage through the reconstruction of a bone co-expression network using microarray profiles collected on 96 Hybrid Mouse Diversity Panel (HMDP) inbred strains. Of the 21 modules that comprised the bone network, module 9 (M9) contained genes that were highly correlated with prototypical osteoblast maker genes and were more highly expressed in osteoblasts relative to other bone cells. In addition, the M9 contained many of the key genes that define the osteoblast-lineage, which together suggested that it was specific to this lineage. To use the M9 to identify novel osteoblast genes and highlight its biological relevance, we knocked-down the expression of its two most connected “hub” genes, Maged1 and Pard6g. Their perturbation altered both osteoblast proliferation and differentiation. Furthermore, we demonstrated the mice deficient in Maged1 had decreased bone mineral density (BMD). It was also discovered that a local expression quantitative trait locus (eQTL) regulating the Wnt signaling antagonist Sfrp1 was a key driver of the M9. We also show that the M9 is associated with BMD in the HMDP and is enriched for genes implicated in the regulation of human BMD through genome-wide association studies. In conclusion, we have identified a physiologically relevant gene network and used it to discover novel genes and regulatory mechanisms involved in the function of osteoblast-lineage cells. Our results highlight the power of harnessing natural genetic variation to generate co-expression networks that can be used to gain insight into the function of specific cell-types. PMID:23300464
Evidence of inflammatory immune signaling in chronic fatigue syndrome: A pilot study of gene expression in peripheral blood.

PubMed

Aspler, Anne L; Bolshin, Carly; Vernon, Suzanne D; Broderick, Gordon

2008-09-26

Genomic profiling of peripheral blood reveals altered immunity in chronic fatigue syndrome (CFS) however interpretation remains challenging without immune demographic context. The object of this work is to identify modulation of specific immune functional components and restructuring of co-expression networks characteristic of CFS using the quantitative genomics of peripheral blood. Gene sets were constructed a priori for CD4+ T cells, CD8+ T cells, CD19+ B cells, CD14+ monocytes and CD16+ neutrophils from published data. A group of 111 women were classified using empiric case definition (U.S. Centers for Disease Control and Prevention) and unsupervised latent cluster analysis (LCA). Microarray profiles of peripheral blood were analyzed for expression of leukocyte-specific gene sets and characteristic changes in co-expression identified from topological evaluation of linear correlation networks. Median expression for a set of 6 genes preferentially up-regulated in CD19+ B cells was significantly lower in CFS (p = 0.01) due mainly to PTPRK and TSPAN3 expression. Although no other gene set was differentially expressed at p < 0.05, patterns of co-expression in each group differed markedly. Significant co-expression of CD14+ monocyte with CD16+ neutrophil (p = 0.01) and CD19+ B cell sets (p = 0.00) characterized CFS and fatigue phenotype groups. Also in CFS was a significant negative correlation between CD8+ and both CD19+ up-regulated (p = 0.02) and NK gene sets (p = 0.08). These patterns were absent in controls. Dissection of blood microarray profiles points to B cell dysfunction with coordinated immune activation supporting persistent inflammation and antibody-mediated NK cell modulation of T cell activity. This has clinical implications as the CD19+ genes identified could provide robust and biologically meaningful basis for the early detection and unambiguous phenotyping of CFS.
Changing expression of vertebrate immunity genes in an anthropogenic environment: a controlled experiment.

PubMed

Hablützel, Pascal I; Brown, Martha; Friberg, Ida M; Jackson, Joseph A

2016-09-01

The effect of anthropogenic environments on the function of the vertebrate immune system is a problem of general importance. For example, it relates to the increasing rates of immunologically-based disease in modern human populations and to the desirability of identifying optimal immune function in domesticated animals. Despite this importance, our present understanding is compromised by a deficit of experimental studies that make adequately matched comparisons between wild and captive vertebrates. We transferred post-larval fishes (three-spined sticklebacks), collected in the wild, to an anthropogenic (captive) environment. We then monitored, over 11 months, how the systemic expression of immunity genes changed in comparison to cohort-matched wild individuals in the originator population (total n = 299). We found that a range of innate (lyz, defbl2, il1r-like, tbk1) and adaptive (cd8a, igmh) immunity genes were up-regulated in captivity, accompanied by an increase in expression of the antioxidant enzyme, gpx4a. For some genes previously known to show seasonality in the wild, this appeared to be reduced in captive fishes. Captive fishes tended to express immunity genes, including igzh, foxp3b, lyz, defbl2, and il1r-like, more variably. Furthermore, although gene co-expression patterns (analyzed through gene-by-gene correlations and mutual information theory based networks) shared common structure in wild and captive fishes, there was also significant divergence. For one gene in particular, defbl2, high expression was associated with adverse health outcomes in captive fishes. Taken together, these results demonstrate widespread regulatory changes in the immune system in captive populations, and that the expression of immunity genes is more constrained in the wild. An increase in constitutive systemic immune activity, such as we observed here, may alter the risk of immunopathology and contribute to variance in health in vertebrate populations exposed to anthropogenic environments.
Communication security in open health care networks.

PubMed

Blobel, B; Pharow, P; Engel, K; Spiegel, V; Krohn, R

1999-01-01

Fulfilling the shared care paradigm, health care networks providing open systems' interoperability in health care are needed. Such communicating and co-operating health information systems, dealing with sensitive personal medical information across organisational, regional, national or even international boundaries, require appropriate security solutions. Based on the generic security model, within the European MEDSEC project an open approach for secure EDI like HL7, EDIFACT, XDT or XML has been developed. The consideration includes both securing the message in an unsecure network and the transport of the unprotected information via secure channels (SSL, TLS etc.). Regarding EDI, an open and widely usable security solution has been specified and practically implemented for the examples of secure mailing and secure file transfer (FTP) via wrapping the sensitive information expressed by the corresponding protocols. The results are currently prepared for standardisation.
CoCoFolio: A Web-Based Electronic Portfolio for Enriching Students' Learning by Collaboration.

ERIC Educational Resources Information Center

Sugiyama, Takeshi; Kakehi, Naoyuki; Kura, Tsuneko; Takahashi Tokiichiro

A Web-based electronic portfolio, CoCoFolio, was developed for enriching students' learning by collaboration. CoCoFolio consists of two collaboration tools: a multi-layer drawing tool, CoCoBoard, and a small bulletin board, Discussion Board, for each student's submission. These tools support a series of expression activities: expression, sharing,…
Sustained synchronized neuronal network activity in a human astrocyte co-culture system

PubMed Central

Kuijlaars, Jacobine; Oyelami, Tutu; Diels, Annick; Rohrbacher, Jutta; Versweyveld, Sofie; Meneghello, Giulia; Tuefferd, Marianne; Verstraelen, Peter; Detrez, Jan R.; Verschuuren, Marlies; De Vos, Winnok H.; Meert, Theo; Peeters, Pieter J.; Cik, Miroslav; Nuydens, Rony; Brône, Bert; Verheyen, An

2016-01-01

Impaired neuronal network function is a hallmark of neurodevelopmental and neurodegenerative disorders such as autism, schizophrenia, and Alzheimer’s disease and is typically studied using genetically modified cellular and animal models. Weak predictive capacity and poor translational value of these models urge for better human derived in vitro models. The implementation of human induced pluripotent stem cells (hiPSCs) allows studying pathologies in differentiated disease-relevant and patient-derived neuronal cells. However, the differentiation process and growth conditions of hiPSC-derived neurons are non-trivial. In order to study neuronal network formation and (mal)function in a fully humanized system, we have established an in vitro co-culture model of hiPSC-derived cortical neurons and human primary astrocytes that recapitulates neuronal network synchronization and connectivity within three to four weeks after final plating. Live cell calcium imaging, electrophysiology and high content image analyses revealed an increased maturation of network functionality and synchronicity over time for co-cultures compared to neuronal monocultures. The cells express GABAergic and glutamatergic markers and respond to inhibitors of both neurotransmitter pathways in a functional assay. The combination of this co-culture model with quantitative imaging of network morphofunction is amenable to high throughput screening for lead discovery and drug optimization for neurological diseases. PMID:27819315
Construction of diagnosis system and gene regulatory networks based on microarray analysis.

PubMed

Hong, Chun-Fu; Chen, Ying-Chen; Chen, Wei-Chun; Tu, Keng-Chang; Tsai, Meng-Hsiun; Chan, Yung-Kuan; Yu, Shyr Shen

2018-05-01

A microarray analysis generally contains expression data of thousands of genes, but most of them are irrelevant to the disease of interest, making analyzing the genes concerning specific diseases complicated. Therefore, filtering out a few essential genes as well as their regulatory networks is critical, and a disease can be easily diagnosed just depending on the expression profiles of a few critical genes. In this study, a target gene screening (TGS) system, which is a microarray-based information system that integrates F-statistics, pattern recognition matching, a two-layer K-means classifier, a Parameter Detection Genetic Algorithm (PDGA), a genetic-based gene selector (GBG selector) and the association rule, was developed to screen out a small subset of genes that can discriminate malignant stages of cancers. During the first stage, F-statistic, pattern recognition matching, and a two-layer K-means classifier were applied in the system to filter out the 20 critical genes most relevant to ovarian cancer from 9600 genes, and the PDGA was used to decide the fittest values of the parameters for these critical genes. Among the 20 critical genes, 15 are associated with cancer progression. In the second stage, we further employed a GBG selector and the association rule to screen out seven target gene sets, each with only four to six genes, and each of which can precisely identify the malignancy stage of ovarian cancer based on their expression profiles. We further deduced the gene regulatory networks of the 20 critical genes by applying the Pearson correlation coefficient to evaluate the correlationship between the expression of each gene at the same stages and at different stages. Correlationships between gene pairs were calculated, and then, three regulatory networks were deduced. Their correlationships were further confirmed by the Ingenuity pathway analysis. The prognostic significances of the genes identified via regulatory networks were examined using online tools, and most represented biomarker candidates. In summary, our proposed system provides a new strategy to identify critical genes or biomarkers, as well as their regulatory networks, from microarray data. Copyright © 2018. Published by Elsevier Inc.
Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae.

PubMed

Mirzarezaee, Mitra; Araabi, Babak N; Sadeghi, Mehdi

2010-12-19

It has been understood that biological networks have modular organizations which are the sources of their observed complexity. Analysis of networks and motifs has shown that two types of hubs, party hubs and date hubs, are responsible for this complexity. Party hubs are local coordinators because of their high co-expressions with their partners, whereas date hubs display low co-expressions and are assumed as global connectors. However there is no mutual agreement on these concepts in related literature with different studies reporting their results on different data sets. We investigated whether there is a relation between the biological features of Saccharomyces Cerevisiae's proteins and their roles as non-hubs, intermediately connected, party hubs, and date hubs. We propose a classifier that separates these four classes. We extracted different biological characteristics including amino acid sequences, domain contents, repeated domains, functional categories, biological processes, cellular compartments, disordered regions, and position specific scoring matrix from various sources. Several classifiers are examined and the best feature-sets based on average correct classification rate and correlation coefficients of the results are selected. We show that fusion of five feature-sets including domains, Position Specific Scoring Matrix-400, cellular compartments level one, and composition pairs with two and one gaps provide the best discrimination with an average correct classification rate of 77%. We study a variety of known biological feature-sets of the proteins and show that there is a relation between domains, Position Specific Scoring Matrix-400, cellular compartments level one, composition pairs with two and one gaps of Saccharomyces Cerevisiae's proteins, and their roles in the protein interaction network as non-hubs, intermediately connected, party hubs and date hubs. This study also confirms the possibility of predicting non-hubs, party hubs and date hubs based on their biological features with acceptable accuracy. If such a hypothesis is correct for other species as well, similar methods can be applied to predict the roles of proteins in those species.
Integrated pathway-based transcription regulation network mining and visualization based on gene expression profiles.

PubMed

Kibinge, Nelson; Ono, Naoaki; Horie, Masafumi; Sato, Tetsuo; Sugiura, Tadao; Altaf-Ul-Amin, Md; Saito, Akira; Kanaya, Shigehiko

2016-06-01

Conventionally, workflows examining transcription regulation networks from gene expression data involve distinct analytical steps. There is a need for pipelines that unify data mining and inference deduction into a singular framework to enhance interpretation and hypotheses generation. We propose a workflow that merges network construction with gene expression data mining focusing on regulation processes in the context of transcription factor driven gene regulation. The pipeline implements pathway-based modularization of expression profiles into functional units to improve biological interpretation. The integrated workflow was implemented as a web application software (TransReguloNet) with functions that enable pathway visualization and comparison of transcription factor activity between sample conditions defined in the experimental design. The pipeline merges differential expression, network construction, pathway-based abstraction, clustering and visualization. The framework was applied in analysis of actual expression datasets related to lung, breast and prostrate cancer. Copyright © 2016 Elsevier Inc. All rights reserved.
Genome wide predictions of miRNA regulation by transcription factors.

PubMed

Ruffalo, Matthew; Bar-Joseph, Ziv

2016-09-01

Reconstructing regulatory networks from expression and interaction data is a major goal of systems biology. While much work has focused on trying to experimentally and computationally determine the set of transcription-factors (TFs) and microRNAs (miRNAs) that regulate genes in these networks, relatively little work has focused on inferring the regulation of miRNAs by TFs. Such regulation can play an important role in several biological processes including development and disease. The main challenge for predicting such interactions is the very small positive training set currently available. Another challenge is the fact that a large fraction of miRNAs are encoded within genes making it hard to determine the specific way in which they are regulated. To enable genome wide predictions of TF-miRNA interactions, we extended semi-supervised machine-learning approaches to integrate a large set of different types of data including sequence, expression, ChIP-seq and epigenetic data. As we show, the methods we develop achieve good performance on both a labeled test set, and when analyzing general co-expression networks. We next analyze mRNA and miRNA cancer expression data, demonstrating the advantage of using the predicted set of interactions for identifying more coherent and relevant modules, genes, and miRNAs. The complete set of predictions is available on the supporting website and can be used by any method that combines miRNAs, genes, and TFs. Code and full set of predictions are available from the supporting website: http://cs.cmu.edu/~mruffalo/tf-mirna/ zivbj@cs.cmu.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Genome-wide screen identifies a novel prognostic signature for breast cancer survival

DOE PAGES

Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey; ...

2017-01-21

Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Genome-wide screen identifies a novel prognostic signature for breast cancer survival

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey

Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Characteristics of patients seeking health information online via social health networks versus general Internet sites: a comparative study.

PubMed

Magnezi, Racheli; Grosberg, Dafna; Novikov, Ilya; Ziv, Arnona; Shani, Mordechai; Freedman, Laurence S

2015-03-01

Camoni.co.il, a Hebrew-language social health network offers advice, consultation, and connection to others with chronic illness. This study compared characteristics and objectives of Camoni.co.il users and individuals seeking medical information through general Internet sites. Similar questionnaires were sent to 1009 Internet and 900 Camoni users. Cluster analysis defined four modes of online social health network use: "acquiring information and support", "communicating", "networking" and "browsing". Six hundred and five Internet and 125 Camoni users responded. Diabetes, hypertension, obesity and lung diseases were found more often among general Internet users than Camoni users. Among Camoni users, "acquiring information and support" was the main motivation for individuals over age 55 years, women, those with lower income, chronic pain, obesity and depression. "Communicating" was the main incentive of men, those 20-34 years old, those with less education, or an eating disorder. "Networking" was the most significant motivation for those with multiple sclerosis or depression. Browsing was most frequent among individuals with multiple sclerosis. Identifying needs of social health network surfers will allow planning unique contents and enhancing social health sites. Physicians might advise patients to use them to obtain support and information regarding their conditions, possibly leading to improved compliance and self-management.
Separating the Role of Protein Restraints and Local Metal-Site Interaction Chemistry in the Thermodynamics of a Zinc Finger Protein

PubMed Central

Dixit, Purushottam D.; Asthagiri, D.

2011-01-01

We express the effective Hamiltonian of an ion-binding site in a protein as a combination of the Hamiltonian of the ion-bound site in vacuum and the restraints of the protein on the site. The protein restraints are described by the quadratic elastic network model. The Hamiltonian of the ion-bound site in vacuum is approximated as a generalized Hessian around the minimum energy configuration. The resultant of the two quadratic Hamiltonians is cast into a pure quadratic form. In the canonical ensemble, the quadratic nature of the resultant Hamiltonian allows us to express analytically the excess free energy, enthalpy, and entropy of ion binding to the protein. The analytical expressions allow us to separate the roles of the dynamic restraints imposed by the protein on the binding site and the temperature-independent chemical effects in metal-ligand coordination. For the consensus zinc-finger peptide, relative to the aqueous phase, the calculated free energy of exchanging Zn2+ with Fe2+, Co2+, Ni2+, and Cd2+ are in agreement with experiments. The predicted excess enthalpy of ion exchange between Zn2+ and Co2+ also agrees with the available experimental estimate. The free energy of applying the protein restraints reveals that relative to Zn2+, the Co2+, and Cd2+-site clusters are more destabilized by the protein restraints. This leads to an experimentally testable hypothesis that a tetrahedral metal binding site with minimal protein restraints will be less selective for Zn2+ over Co2+ and Cd2+ compared to a zinc finger peptide. No appreciable change is expected for Fe2+ and Ni2+. The framework presented here may prove useful in protein engineering to tune metal selectivity. PMID:21943427
Cell cycle gene expression networks discovered using systems biology: Significance in carcinogenesis

PubMed Central

Scott, RE; Ghule, PN; Stein, JL; Stein, GS

2015-01-01

The early stages of carcinogenesis are linked to defects in the cell cycle. A series of cell cycle checkpoints are involved in this process. The G1/S checkpoint that serves to integrate the control of cell proliferation and differentiation is linked to carcinogenesis and the mitotic spindle checkpoint with the development of chromosomal instability. This paper presents the outcome of systems biology studies designed to evaluate if networks of covariate cell cycle gene transcripts exist in proliferative mammalian tissues including mice, rats and humans. The GeneNetwork website that contains numerous gene expression datasets from different species, sexes and tissues represents the foundational resource for these studies (www.genenetwork.org). In addition, WebGestalt, a gene ontology tool, facilitated the identification of expression networks of genes that co-vary with key cell cycle targets, especially Cdc20 and Plk1 (www.bioinfo.vanderbilt.edu/webgestalt). Cell cycle expression networks of such covariate mRNAs exist in multiple proliferative tissues including liver, lung, pituitary, adipose and lymphoid tissues among others but not in brain or retina that have low proliferative potential. Sixty-three covariate cell cycle gene transcripts (mRNAs) compose the average cell cycle network with p = e−13 to e−36. Cell cycle expression networks show species, sex and tissue variability and they are enriched in mRNA transcripts associated with mitosis many of which are associated with chromosomal instability. PMID:25808367

The Influence of Academic Tracking on Adolescent Social Networks

ERIC Educational Resources Information Center

Fisher, Kim W.; Shogren, Karrie A.

2016-01-01

This study examined adolescents' social capital, through social network analyses (i.e., ego network analyses), in two high schools where students were placed into academic tracks adopted by the schools and shaped by disability status (i.e., general education, co-taught, segregated special education classrooms). The impact of academic tracks, as…
Photo-cross-linked poly(thioether-co-carbonate) networks derived from the natural product quinic acid.

PubMed

Link, Lauren A; Lonnecker, Alexander T; Hearon, Keith; Maher, Cameron A; Raymond, Jeffery E; Wooley, Karen L

2014-10-22

Polycarbonate networks derived from the natural product quinic acid that can potentially return to their natural building blocks upon hydrolytic degradation are described herein. Solvent-free thiol-ene chemistry was utilized in the copolymerization of tris(alloc)quinic acid and a variety of multifunctional thiol monomers to obtain poly(thioether-co-carbonate) networks with a wide range of achievable thermomechanical properties including glass transition temperatures from -18 to +65 °C and rubbery moduli from 3.8 to 20 MPa. The network containing 1,2-ethanedithiol expressed an average toughness at 25 and 63 °C of 1.08 and 2.35 MJ/m(3), respectively, and an order-of-magnitude increase in the average toughness at 37 °C of 15.56 MJ/m(3).
The autophagy interaction network of the aging model Podospora anserina.

PubMed

Philipp, Oliver; Hamann, Andrea; Osiewacz, Heinz D; Koch, Ina

2017-03-27

Autophagy is a conserved molecular pathway involved in the degradation and recycling of cellular components. It is active either as response to starvation or molecular damage. Evidence is emerging that autophagy plays a key role in the degradation of damaged cellular components and thereby affects aging and lifespan control. In earlier studies, it was found that autophagy in the aging model Podospora anserina acts as a longevity assurance mechanism. However, only little is known about the individual components controlling autophagy in this aging model. Here, we report a biochemical and bioinformatics study to detect the protein-protein interaction (PPI) network of P. anserina combining experimental and theoretical methods. We constructed the PPI network of autophagy in P. anserina based on the corresponding networks of yeast and human. We integrated PaATG8 interaction partners identified in an own yeast two-hybrid analysis using ATG8 of P. anserina as bait. Additionally, we included age-dependent transcriptome data. The resulting network consists of 89 proteins involved in 186 interactions. We applied bioinformatics approaches to analyze the network topology and to prove that the network is not random, but exhibits biologically meaningful properties. We identified hub proteins which play an essential role in the network as well as seven putative sub-pathways, and interactions which are likely to be evolutionary conserved amongst species. We confirmed that autophagy-associated genes are significantly often up-regulated and co-expressed during aging of P. anserina. With the present study, we provide a comprehensive biological network of the autophagy pathway in P. anserina comprising PPI and gene expression data. It is based on computational prediction as well as experimental data. We identified sub-pathways, important hub proteins, and evolutionary conserved interactions. The network clearly illustrates the relation of autophagy to aging processes and enables further specific studies to understand autophagy and aging in P. anserina as well as in other systems.
Integrative analyses of leprosy susceptibility genes indicate a common autoimmune profile.

PubMed

Zhang, Deng-Feng; Wang, Dong; Li, Yu-Ye; Yao, Yong-Gang

2016-04-01

Leprosy is an ancient chronic infection in the skin and peripheral nerves caused by Mycobacterium leprae. The development of leprosy depends on genetic background and the immune status of the host. However, there is no systematic view focusing on the biological pathways, interaction networks and overall expression pattern of leprosy-related immune and genetic factors. To identify the hub genes in the center of leprosy genetic network and to provide an insight into immune and genetic factors contributing to leprosy. We retrieved all reported leprosy-related genes and performed integrative analyses covering gene expression profiling, pathway analysis, protein-protein interaction network, and evolutionary analyses. A list of 123 differentially expressed leprosy related genes, which were enriched in activation and regulation of immune response, was obtained in our analyses. Cross-disorder analysis showed that the list of leprosy susceptibility genes was largely shared by typical autoimmune diseases such as lupus erythematosus and arthritis, suggesting that similar pathways might be affected in leprosy and autoimmune diseases. Protein-protein interaction (PPI) and positive selection analyses revealed a co-evolution network of leprosy risk genes. Our analyses showed that leprosy associated genes constituted a co-evolution network and might undergo positive selection driven by M. leprae. We suggested that leprosy may be a kind of autoimmune disease and the development of leprosy is a matter of defect or over-activation of body immunity. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
A Proxy Design to Leverage the Interconnection of CoAP Wireless Sensor Networks with Web Applications

PubMed Central

Ludovici, Alessandro; Calveras, Anna

2015-01-01

In this paper, we present the design of a Constrained Application Protocol (CoAP) proxy able to interconnect Web applications based on Hypertext Transfer Protocol (HTTP) and WebSocket with CoAP based Wireless Sensor Networks. Sensor networks are commonly used to monitor and control physical objects or environments. Smart Cities represent applications of such a nature. Wireless Sensor Networks gather data from their surroundings and send them to a remote application. This data flow may be short or long lived. The traditional HTTP long-polling used by Web applications may not be adequate in long-term communications. To overcome this problem, we include the WebSocket protocol in the design of the CoAP proxy. We evaluate the performance of the CoAP proxy in terms of latency and memory consumption. The tests consider long and short-lived communications. In both cases, we evaluate the performance obtained by the CoAP proxy according to the use of WebSocket and HTTP long-polling. PMID:25585107
Applications of Social Network Analysis

NASA Astrophysics Data System (ADS)

Thilagam, P. Santhi

A social network [2] is a description of the social structure between actors, mostly persons, groups or organizations. It indicates the ways in which they are connected with each other by some relationship such as friendship, kinship, finance exchange etc. In a nutshell, when the person uses already known/unknown people to create new contacts, it forms social networking. The social network is not a new concept rather it can be formed when similar people interact with each other directly or indirectly to perform particular task. Examples of social networks include a friendship networks, collaboration networks, co-authorship networks, and co-employees networks which depict the direct interaction among the people. There are also other forms of social networks, such as entertainment networks, business Networks, citation networks, and hyperlink networks, in which interaction among the people is indirect. Generally, social networks operate on many levels, from families up to the level of nations and assists in improving interactive knowledge sharing, interoperability and collaboration.
Investigating cellular network heterogeneity and modularity in cancer: a network entropy and unbalanced motif approach.

PubMed

Cheng, Feixiong; Liu, Chuang; Shen, Bairong; Zhao, Zhongming

2016-08-26

Cancer is increasingly recognized as a cellular system phenomenon that is attributed to the accumulation of genetic or epigenetic alterations leading to the perturbation of the molecular network architecture. Elucidation of network properties that can characterize tumor initiation and progression, or pinpoint the molecular targets related to the drug sensitivity or resistance, is therefore of critical importance for providing systems-level insights into tumorigenesis and clinical outcome in the molecularly targeted cancer therapy. In this study, we developed a network-based framework to quantitatively examine cellular network heterogeneity and modularity in cancer. Specifically, we constructed gene co-expressed protein interaction networks derived from large-scale RNA-Seq data across 8 cancer types generated in The Cancer Genome Atlas (TCGA) project. We performed gene network entropy and balanced versus unbalanced motif analysis to investigate cellular network heterogeneity and modularity in tumor versus normal tissues, different stages of progression, and drug resistant versus sensitive cancer cell lines. We found that tumorigenesis could be characterized by a significant increase of gene network entropy in all of the 8 cancer types. The ratio of the balanced motifs in normal tissues is higher than that of tumors, while the ratio of unbalanced motifs in tumors is higher than that of normal tissues in all of the 8 cancer types. Furthermore, we showed that network entropy could be used to characterize tumor progression and anticancer drug responses. For example, we found that kinase inhibitor resistant cancer cell lines had higher entropy compared to that of sensitive cell lines using the integrative analysis of microarray gene expression and drug pharmacological data collected from the Genomics of Drug Sensitivity in Cancer database. In addition, we provided potential network-level evidence that smoking might increase cancer cellular network heterogeneity and further contribute to tyrosine kinase inhibitor (e.g., gefitinib) resistance. In summary, we demonstrated that network properties such as network entropy and unbalanced motifs associated with tumor initiation, progression, and anticancer drug responses, suggesting new potential network-based prognostic and predictive measure in cancer.
Analysis of global gene expression in Brachypodium distachyon reveals extensive network plasticity in response to abiotic stress.

PubMed

Priest, Henry D; Fox, Samuel E; Rowley, Erik R; Murray, Jessica R; Michael, Todd P; Mockler, Todd C

2014-01-01

Brachypodium distachyon is a close relative of many important cereal crops. Abiotic stress tolerance has a significant impact on productivity of agriculturally important food and feedstock crops. Analysis of the transcriptome of Brachypodium after chilling, high-salinity, drought, and heat stresses revealed diverse differential expression of many transcripts. Weighted Gene Co-Expression Network Analysis revealed 22 distinct gene modules with specific profiles of expression under each stress. Promoter analysis implicated short DNA sequences directly upstream of module members in the regulation of 21 of 22 modules. Functional analysis of module members revealed enrichment in functional terms for 10 of 22 network modules. Analysis of condition-specific correlations between differentially expressed gene pairs revealed extensive plasticity in the expression relationships of gene pairs. Photosynthesis, cell cycle, and cell wall expression modules were down-regulated by all abiotic stresses. Modules which were up-regulated by each abiotic stress fell into diverse and unique gene ontology GO categories. This study provides genomics resources and improves our understanding of abiotic stress responses of Brachypodium.
Identification of AUXIN RESPONSE FACTOR gene family from Prunus sibirica and its expression analysis during mesocarp and kernel development.

PubMed

Niu, Jun; Bi, Quanxin; Deng, Shuya; Chen, Huiping; Yu, Haiyan; Wang, Libing; Lin, Shanzhi

2018-01-24

Auxin response factors (ARFs) in auxin signaling pathway are an important component that can regulate the transcription of auxin-responsive genes involved in almost all aspects of plant growth and development. To our knowledge, the comprehensive and systematic characterization of ARF genes has never been reported in Prunus sibirica, a novel woody biodiesel feedstock in China. In this study, we identified 14 PsARF genes with a perfect open reading frame (ORF) in P. sibirica by using its previous transcriptomic data. Conserved motif analysis showed that all identified PsARF proteins had typical DNA-binding and ARF domain, but 5 members (PsARF3, 8 10, 16 and 17) lacked the dimerization domain. Phylogenetic analysis of the ARF proteins generated from various plant species indicated that ARFs could be categorized into 4 major groups (Class I, II, III and IV), in which all identified ARFs from P. sibirica showed a closest relationship with those from P. mume. Comparison of the expression profiles of 14 PsARF genes in different developmental stages of Siberian apricot mesocarp (SAM) and kernel (SAK) reflected distinct temporal or spatial expression patterns for PsARF genes. Additionally, based on the expressed data from fruit and seed development of multiple plant species, we identified 1514 ARF-correlated genes using weighted gene co-expression network analysis (WGCNA). And the major portion of ARF-correlated gene was characterized to be involved in protein, nucleic acid and carbohydrate metabolic, transport and regulatory processes. In summary, we systematically and comprehensively analyzed the structure, expression pattern and co-expression network of ARF gene family in P. sibirica. All our findings provide theoretical foundation for the PsARF gene family and will pave the way for elucidating the precise role of PsARF genes in SAM and SAK development.
UDP-arabinopyranose mutase 3 is required for pollen wall morphogenesis in rice (Oryza sativa).

PubMed

Sumiyoshi, Minako; Inamura, Takuya; Nakamura, Atsuko; Aohara, Tsutomu; Ishii, Tadashi; Satoh, Shinobu; Iwai, Hiroaki

2015-02-01

l-Arabinose is one of the main constituents of cell wall polysaccharides such as pectic rhamnogalacturonan I (RG-I), glucuronoarabinoxylans and other glycoproteins. It is found predominantly in the furanose form rather than in the thermodynamically more stable pyranose form. UDP-L-arabinofuranose (UDP-Araf), rather than UDP-L-arabinopyranose (UDP-Arap), is a sugar donor for the biosynthesis of arabinofuranosyl (Araf) residues. UDP-arabinopyranose mutases (UAMs) have been shown to interconvert UDP-Araf and UDP-Arap and are involved in the biosynthesis of polysaccharides including Araf. The UAM gene family has three members in Oryza sativa. Co-expression network in silico analysis showed that OsUAM3 expression was independent from OsUAM1 and OsUAM2 co-expression networks. OsUAM1 and OsUAM2 were expressed ubiquitously throughout plant development, but OsUAM3 was expressed primarily in reproductive tissue, particularly at the pollen cell wall formation developmental stage. OsUAM3 co-expression networks include pectin catabolic enzymes. To determine the function of OsUAMs in reproductive tissues, we analyzed RNA interference (RNAi)-knockdown transformants (OsUAM3-KD) specific for OsUAM3. OsUAM3-KD plants grew normally and showed abnormal phenotypes in reproductive tissues, especially in terms of the pollen cell wall and exine. In addition, we examined modifications of cell wall polysaccharides at the cellular level using antibodies against polysaccharides including Araf. Immunolocalization of arabinan using the LM6 antibody showed low levels of arabinan in OsUAM3-KD pollen grains. Our results suggest that the function of OsUAM3 is important for synthesis of arabinan side chains of RG-I and is required for reproductive developmental processes, especially the formation of the cell wall in pollen. © The Author 2014. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Topological analysis of metabolic networks integrating co-segregating transcriptomes and metabolomes in type 2 diabetic rat congenic series.

PubMed

Dumas, Marc-Emmanuel; Domange, Céline; Calderari, Sophie; Martínez, Andrea Rodríguez; Ayala, Rafael; Wilder, Steven P; Suárez-Zamorano, Nicolas; Collins, Stephan C; Wallis, Robert H; Gu, Quan; Wang, Yulan; Hue, Christophe; Otto, Georg W; Argoud, Karène; Navratil, Vincent; Mitchell, Steve C; Lindon, John C; Holmes, Elaine; Cazier, Jean-Baptiste; Nicholson, Jeremy K; Gauguier, Dominique

2016-09-30

The genetic regulation of metabolic phenotypes (i.e., metabotypes) in type 2 diabetes mellitus occurs through complex organ-specific cellular mechanisms and networks contributing to impaired insulin secretion and insulin resistance. Genome-wide gene expression profiling systems can dissect the genetic contributions to metabolome and transcriptome regulations. The integrative analysis of multiple gene expression traits and metabolic phenotypes (i.e., metabotypes) together with their underlying genetic regulation remains a challenge. Here, we introduce a systems genetics approach based on the topological analysis of a combined molecular network made of genes and metabolites identified through expression and metabotype quantitative trait locus mapping (i.e., eQTL and mQTL) to prioritise biological characterisation of candidate genes and traits. We used systematic metabotyping by 1 H NMR spectroscopy and genome-wide gene expression in white adipose tissue to map molecular phenotypes to genomic blocks associated with obesity and insulin secretion in a series of rat congenic strains derived from spontaneously diabetic Goto-Kakizaki (GK) and normoglycemic Brown-Norway (BN) rats. We implemented a network biology strategy approach to visualize the shortest paths between metabolites and genes significantly associated with each genomic block. Despite strong genomic similarities (95-99 %) among congenics, each strain exhibited specific patterns of gene expression and metabotypes, reflecting the metabolic consequences of series of linked genetic polymorphisms in the congenic intervals. We subsequently used the congenic panel to map quantitative trait loci underlying specific mQTLs and genome-wide eQTLs. Variation in key metabolites like glucose, succinate, lactate, or 3-hydroxybutyrate and second messenger precursors like inositol was associated with several independent genomic intervals, indicating functional redundancy in these regions. To navigate through the complexity of these association networks we mapped candidate genes and metabolites onto metabolic pathways and implemented a shortest path strategy to highlight potential mechanistic links between metabolites and transcripts at colocalized mQTLs and eQTLs. Minimizing the shortest path length drove prioritization of biological validations by gene silencing. These results underline the importance of network-based integration of multilevel systems genetics datasets to improve understanding of the genetic architecture of metabotype and transcriptomic regulation and to characterize novel functional roles for genes determining tissue-specific metabolism.
A Co-Citation Network of Young Children's Learning with Technology

ERIC Educational Resources Information Center

Tang, Kai-Yu; Li, Ming-Chaun; Hsin, Ching-Ting; Tsai, Chin-Chung

2016-01-01

This paper used a novel literature review approach--co-citation network analysis--to illuminate the latent structure of 87 empirical papers in the field of young children's learning with technology (YCLT). Based on the document co-citation analysis, a total of 206 co-citation relationships among the 87 papers were identified and then graphically…
Development and pilot trial of a web-based job placement information network.

PubMed

Chan, Eliza W C; Tam, S F

2005-01-01

The purpose of this project was to develop and pilot a web-based job placement information network aiming at enhancing the work trial and job placement opportunities of people with disabilities (PWD). Efficient uses of information technology in vocational rehabilitation were suggested to help improve PWD employment opportunities and thus enable them to contribute as responsible citizens to the society. In this preliminary study, a web-based employer network was so developed to explore Hong Kong employers' needs and intentions in employing PWD. The results indicated that Hong Kong employers generally agreed to arrange work trials for PWD whose work abilities match job requirements. They also expressed that they would offer permanent job placements to those PWD who showed satisfactory performance in work trials. The present study evidenced that using an information network could expedite communications between employers and job placement services, and thus job placement service outcomes. It is hoped that a job placement databank could thus be developed through accumulating responses from potential employers.
A bioinformatics prediction approach towards analyzing the glycosylation, co-expression and interaction patterns of epithelial membrane antigen (EMA/MUC1)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kalra, Rajkumar S., E-mail: renu-wadhwa@aist.go.jp; Wadhwa, Renu, E-mail: renu-wadhwa@aist.go.jp

2015-02-27

Epithelial membrane antigen (EMA or MUC1) is a heavily glycosylated, type I transmembrane glycoprotein commonly expressed by epithelial cells of duct organs. It has been shown to be aberrantly glycosylated in several diseases including cancer. Protein sequence based annotation and analysis of glycosylation profile of glycoproteins by robust computational and comprehensive algorithms provides possible insights to the mechanism(s) of anomalous glycosylation. In present report, by using a number of bioinformatics applications we studied EMA/MUC1 and explored its trans-membrane structural domain sequence that is widely subjected to glycosylation. Exploration of different extracellular motifs led to prediction of N and O-linked glycosylationmore » target sites. Based on the putative O-linked target sites, glycosylated moieties and pathways were envisaged. Furthermore, Protein network analysis demonstrated physical interaction of EMA with a number of proteins and confirmed its functional involvement in cell growth and proliferation pathways. Gene Ontology analysis suggested an involvement of EMA in a number of functions including signal transduction, protein binding, processing and transport along with glycosylation. Thus, present study explored potential of bioinformatics prediction approach in analyzing glycosylation, co-expression and interaction patterns of EMA/MUC1 glycoprotein.« less
Evolutionary history of the recruitment of conserved developmental genes in association to the formation and diversification of a novel trait.

PubMed

Shirai, Leila T; Saenko, Suzanne V; Keller, Roberto A; Jerónimo, Maria A; Brakefield, Paul M; Descimon, Henri; Wahlberg, Niklas; Beldade, Patrícia

2012-02-15

The origin and modification of novel traits are important aspects of biological diversification. Studies combining concepts and approaches of developmental genetics and evolutionary biology have uncovered many examples of the recruitment, or co-option, of genes conserved across lineages for the formation of novel, lineage-restricted traits. However, little is known about the evolutionary history of the recruitment of those genes, and of the relationship between them -for example, whether the co-option involves whole or parts of existing networks, or whether it occurs by redeployment of individual genes with de novo rewiring. We use a model novel trait, color pattern elements on butterfly wings called eyespots, to explore these questions. Eyespots have greatly diversified under natural and sexual selection, and their formation involves genetic circuitries shared across insects. We investigated the evolutionary history of the recruitment and co-recruitment of four conserved transcription regulators to the larval wing disc region where circular pattern elements develop. The co-localization of Antennapedia, Notch, Distal-less, and Spalt with presumptive (eye)spot organizers was examined in 13 butterfly species, providing the largest comparative dataset available for the system. We found variation between families, between subfamilies, and between tribes. Phylogenetic reconstructions by parsimony and maximum likelihood methods revealed an unambiguous evolutionary history only for Antennapedia, with a resolved single origin of eyespot-associated expression, and many homoplastic events for Notch, Distal-less, and Spalt. The flexibility in the (co-)recruitment of the targeted genes includes cases where different gene combinations are associated with morphologically similar eyespots, as well as cases where identical protein combinations are associated with very different phenotypes. The evolutionary history of gene (co-)recruitment is consistent with both divergence from a recruited putative ancestral network, and with independent co-option of individual genes. The diversity in the combinations of genes expressed in association with eyespot formation does not parallel diversity in characteristics of the adult phenotype. We discuss these results in the context of inferring homology. Our study underscores the importance of widening the representation of phylogenetic, morphological, and genetic diversity in order to establish general principles about the mechanisms behind the evolution of novel traits.
EIGENVECTOR-BASED CENTRALITY MEASURES FOR TEMPORAL NETWORKS*

PubMed Central

TAYLOR, DANE; MYERS, SEAN A.; CLAUSET, AARON; PORTER, MASON A.; MUCHA, PETER J.

2017-01-01

Numerous centrality measures have been developed to quantify the importances of nodes in time-independent networks, and many of them can be expressed as the leading eigenvector of some matrix. With the increasing availability of network data that changes in time, it is important to extend such eigenvector-based centrality measures to time-dependent networks. In this paper, we introduce a principled generalization of network centrality measures that is valid for any eigenvector-based centrality. We consider a temporal network with N nodes as a sequence of T layers that describe the network during different time windows, and we couple centrality matrices for the layers into a supra-centrality matrix of size NT × NT whose dominant eigenvector gives the centrality of each node i at each time t. We refer to this eigenvector and its components as a joint centrality, as it reflects the importances of both the node i and the time layer t. We also introduce the concepts of marginal and conditional centralities, which facilitate the study of centrality trajectories over time. We find that the strength of coupling between layers is important for determining multiscale properties of centrality, such as localization phenomena and the time scale of centrality changes. In the strong-coupling regime, we derive expressions for time-averaged centralities, which are given by the zeroth-order terms of a singular perturbation expansion. We also study first-order terms to obtain first-order-mover scores, which concisely describe the magnitude of nodes’ centrality changes over time. As examples, we apply our method to three empirical temporal networks: the United States Ph.D. exchange in mathematics, costarring relationships among top-billed actors during the Golden Age of Hollywood, and citations of decisions from the United States Supreme Court. PMID:29046619
The condition-dependent transcriptional network in Escherichia coli.

PubMed

Lemmens, Karen; De Bie, Tijl; Dhollander, Thomas; Monsieurs, Pieter; De Moor, Bart; Collado-Vides, Julio; Engelen, Kristof; Marchal, Kathleen

2009-03-01

Thanks to the availability of high-throughput omics data, bioinformatics approaches are able to hypothesize thus-far undocumented genetic interactions. However, due to the amount of noise in these data, inferences based on a single data source are often unreliable. A popular approach to overcome this problem is to integrate different data sources. In this study, we describe DISTILLER, a novel framework for data integration that simultaneously analyzes microarray and motif information to find modules that consist of genes that are co-expressed in a subset of conditions, and their corresponding regulators. By applying our method on publicly available data, we evaluated the condition-specific transcriptional network of Escherichia coli. DISTILLER confirmed 62% of 736 interactions described in RegulonDB, and 278 novel interactions were predicted.
Lotus Base: An integrated information portal for the model legume Lotus japonicus

PubMed Central

Mun, Terry; Bachmann, Asger; Gupta, Vikas; Stougaard, Jens; Andersen, Stig U.

2016-01-01

Lotus japonicus is a well-characterized model legume widely used in the study of plant-microbe interactions. However, datasets from various Lotus studies are poorly integrated and lack interoperability. We recognize the need for a comprehensive repository that allows comprehensive and dynamic exploration of Lotus genomic and transcriptomic data. Equally important are user-friendly in-browser tools designed for data visualization and interpretation. Here, we present Lotus Base, which opens to the research community a large, established LORE1 insertion mutant population containing an excess of 120,000 lines, and serves the end-user tightly integrated data from Lotus, such as the reference genome, annotated proteins, and expression profiling data. We report the integration of expression data from the L. japonicus gene expression atlas project, and the development of tools to cluster and export such data, allowing users to construct, visualize, and annotate co-expression gene networks. Lotus Base takes advantage of modern advances in browser technology to deliver powerful data interpretation for biologists. Its modular construction and publicly available application programming interface enable developers to tap into the wealth of integrated Lotus data. Lotus Base is freely accessible at: https://lotus.au.dk. PMID:28008948
NIMEFI: gene regulatory network inference using multiple ensemble feature importance algorithms.

PubMed

Ruyssinck, Joeri; Huynh-Thu, Vân Anh; Geurts, Pierre; Dhaene, Tom; Demeester, Piet; Saeys, Yvan

2014-01-01

One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available.
Comparing Pearson, Spearman and Hoeffding's D measure for gene expression association analysis.

PubMed

Fujita, André; Sato, João Ricardo; Demasi, Marcos Angelo Almeida; Sogayar, Mari Cleide; Ferreira, Carlos Eduardo; Miyano, Satoru

2009-08-01

DNA microarrays have become a powerful tool to describe gene expression profiles associated with different cellular states, various phenotypes and responses to drugs and other extra- or intra-cellular perturbations. In order to cluster co-expressed genes and/or to construct regulatory networks, definition of distance or similarity between measured gene expression data is usually required, the most common choices being Pearson's and Spearman's correlations. Here, we evaluate these two methods and also compare them with a third one, namely Hoeffding's D measure, which is used to infer nonlinear and non-monotonic associations, i.e. independence in a general sense. By comparing three different variable association approaches, namely Pearson's correlation, Spearman's correlation and Hoeffding's D measure, we aimed at assessing the most appropriate one for each purpose. Using simulations, we demonstrate that the Hoeffding's D measure outperforms Pearson's and Spearman's approaches in identifying nonlinear associations. Our results demonstrate that Hoeffding's D measure is less sensitive to outliers and is a more powerful tool to identify nonlinear and non-monotonic associations. We have also applied Hoeffding's D measure in order to identify new putative genes associated with tp53. Therefore, we propose the Hoeffding's D measure to identify nonlinear associations between gene expression profiles.

Who joins the network? Physicians' resistance to take budgetary co-responsibility.

PubMed

Rischatsch, Maurus

2015-03-01

Managed Care (MC) is expected to provide health care at a lower cost than conventional provision. Therefore, Switzerland intends to promote MC by forcing health insurers to write MC contracts and introducing budgetary co-responsibility for ambulatory care physicians. A discrete choice experiment conducted in 2011 including 872 physicians reveals a strong preference heterogeneity with respect to network participation and alternative remuneration schemes. The number of physicians working in networks is unlikely to rise on a voluntary basis, while general practitioners are more likely to join networks than specialists with surgical activities. For physicians considering joining networks, cost savings are predicted to be higher than the estimated willingness-to-accept payments. Copyright © 2014 Elsevier B.V. All rights reserved.
Differential co-expression analysis of rheumatoid arthritis with microarray data.

PubMed

Wang, Kunpeng; Zhao, Liqiang; Liu, Xuefeng; Hao, Zhenyong; Zhou, Yong; Yang, Chuandong; Li, Hongqiang

2014-11-01

The aim of the present study was to investigate the underlying molecular mechanisms of rheumatoid arthritis (RA) using microarray expression profiles from osteoarthritis and RA patients, to improve diagnosis and treatment strategies for the condition. The gene expression profile of GSE27390 was downloaded from Gene Expression Omnibus, including 19 samples from patients with RA (n=9) or osteoarthritis (n=10). Firstly, the differentially expressed genes (DEGs) were obtained with the thresholds of |logFC|>1.0 and P<0.05, using the t‑test method in LIMMA package. Then, differentially co-expressed genes (DCGs) and differentially co-expressed links (DCLs) were screened with q<0.25 by the differential coexpression analysis and differential regulation analysis of gene expression microarray data package. Secondly, pathway enrichment analysis for DCGs was performed by the Database for Annotation, Visualization and Integrated Discovery and the DCLs associated with RA were selected by comparing the obtained DCLs with known transcription factor (TF)-targets in the TRANSFAC database. Finally, the obtained TFs were mapped to the known TF-targets to construct the network using cytoscape software. A total of 1755 DEGs, 457 DCGs and 101988 DCLs were achieved and there were 20 TFs in the obtained six TF-target relations (STAT3-TNF, PBX1‑PLAU, SOCS3-STAT3, GATA1-ETS2, ETS1-ICAM4 and CEBPE‑GATA1) and 457 DCGs. A number of TF-target relations in the constructed network were not within DCLs when the TF and target gene were DCGs. The identified TFs may have an important role in the pathogenesis of RA and have the potential to be used as biomarkers for the development of novel diagnostic and therapeutic strategies for RA.
The Private Lives of Minerals: Social Network Analysis Applied to Mineralogy and Petrology

NASA Astrophysics Data System (ADS)

Hazen, R. M.; Morrison, S. M.; Fox, P. A.; Golden, J. J.; Downs, R. T.; Eleish, A.; Prabhu, A.; Li, C.; Liu, C.

2016-12-01

Comprehensive databases of mineral species (rruff.info/ima) and their geographic localities and co-existing mineral assemblages (mindat.org) reveal patterns of mineral association and distribution that mimic social networks, as commonly applied to such varied topics as social media interactions, the spread of disease, terrorism networks, and research collaborations. Applying social network analysis (SNA) to common assemblages of rock-forming igneous and regional metamorphic mineral species, we find patterns of cohesion, segregation, density, and cliques that are similar to those of human social networks. These patterns highlight classic trends in lithologic evolution and are illustrated with sociograms, in which mineral species are the "nodes" and co-existing species form "links." Filters based on chemistry, age, structural group, and other parameters highlight visually both familiar and new aspects of mineralogy and petrology. We quantify sociograms with SNA metrics, including connectivity (based on the frequency of co-occurrence of mineral pairs), homophily (the extent to which co-existing mineral species share compositional and other characteristics), network closure (based on the degree of network interconnectivity), and segmentation (as revealed by isolated "cliques" of mineral species). Exploitation of large and growing mineral data resources with SNA offers promising avenues for discovering previously hidden trends in mineral diversity-distribution systematics, as well as providing new pedagogical approaches to teaching mineralogy and petrology.
DNAJC17 is localized in nuclear speckles and interacts with splicing machinery components.

PubMed

Pascarella, A; Ferrandino, G; Credendino, S C; Moccia, C; D'Angelo, F; Miranda, B; D'Ambrosio, C; Bielli, P; Spadaro, O; Ceccarelli, M; Scaloni, A; Sette, C; De Felice, M; De Vita, G; Amendola, E

2018-05-17

DNAJC17 is a heat shock protein (HSP40) family member, identified in mouse as susceptibility gene for congenital hypothyroidism. DNAJC17 knockout mouse embryos die prior to implantation. In humans, germline homozygous mutations in DNAJC17 have been found in syndromic retinal dystrophy patients, while heterozygous mutations represent candidate pathogenic events for myeloproliferative disorders. Despite widespread expression and involvement in human diseases, DNAJC17 function is still poorly understood. Herein, we have investigated its function through high-throughput transcriptomic and proteomic approaches. DNAJC17-depleted cells transcriptome highlighted genes involved in general functional categories, mainly related to gene expression. Conversely, DNAJC17 interactome can be classified in very specific functional networks, with the most enriched one including proteins involved in splicing. Furthermore, several splicing-related interactors, were independently validated by co-immunoprecipitation and in vivo co-localization. Accordingly, co-localization of DNAJC17 with SC35, a marker of nuclear speckles, further supported its interaction with spliceosomal components. Lastly, DNAJC17 up-regulation enhanced splicing efficiency of minigene reporter in live cells, while its knockdown induced perturbations of splicing efficiency at whole genome level, as demonstrated by specific analysis of RNAseq data. In conclusion, our study strongly suggests a role of DNAJC17 in splicing-related processes and provides support to its recognized essential function in early development.
Systematic Evaluation of Molecular Networks for Discovery of Disease Genes.

PubMed

Huang, Justin K; Carlin, Daniel E; Yu, Michael Ku; Zhang, Wei; Kreisberg, Jason F; Tamayo, Pablo; Ideker, Trey

2018-04-25

Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall. A general tendency is that performance scales with network size, suggesting that new interaction discovery currently outweighs the detrimental effects of false positives. Correcting for size, we find that the DIP network provides the highest efficiency (value per interaction). Based on these results, we create a parsimonious composite network with both high efficiency and performance. This work provides a benchmark for selection of molecular networks in human disease research. Copyright © 2018 Elsevier Inc. All rights reserved.
Implications of miR166 and miR159 induction to the basal response mechanisms of an andigena potato (Solanum tuberosum subsp. andigena) to salinity stress, predicted from network models in Arabidopsis.

PubMed

Kitazumi, Ai; Kawahara, Yoshihiro; Onda, Ty S; De Koeyer, David; de los Reyes, Benildo G

2015-01-01

MicroRNA (miRNA) mediated changes in gene expression by post-transcriptional modulation of major regulatory transcription factors is a potent mechanism for integrating growth and stress-related responses. Exotic plants including many traditional varieties of Andean potatoes (Solanum tuberosum subsp. andigena) are known for better adaptation to marginal environments. Stress physiological studies confirmed earlier reports on the salinity tolerance potentials of certain andigena cultivars. Guided by the hypothesis that certain miRNAs play important roles in growth modulation under suboptimal conditions, we identified and characterized salinity stress-responsive miRNA-target gene pairs in the andigena cultivar Sullu by parallel analysis of noncoding and coding RNA transcriptomes. Inverse relationships were established by the reverse co-expression between two salinity stress-regulated miRNAs (miR166, miR159) and their target transcriptional regulators HD-ZIP-Phabulosa/Phavulota and Myb101, respectively. Based on heterologous models in Arabidopsis, the miR166-HD-ZIP-Phabulosa/Phavulota network appears to be involved in modulating growth perhaps by mediating vegetative dormancy, with linkages to defense-related pathways. The miR159-Myb101 network may be important for the modulation of vegetative growth while also controlling stress-induced premature transition to reproductive phase. We postulate that the induction of miR166 and miR159 under salinity stress represents important network hubs for balancing gene expression required for basal growth adjustments.
Adding biological meaning to human protein-protein interactions identified by yeast two-hybrid screenings: A guide through bioinformatics tools.

PubMed

Felgueiras, Juliana; Silva, Joana Vieira; Fardilha, Margarida

2018-01-16

"A man is known by the company he keeps" is a popular expression that perfectly fits proteins. A common approach to characterize the function of a target protein is to identify its interacting partners and thus infer its roles based on the known functions of the interactors. Protein-protein interaction networks (PPINs) have been created for several organisms, including humans, primarily as results of high-throughput screenings, such as yeast two-hybrid (Y2H). Their unequivocal use to understand events underlying human pathophysiology is promising in identifying genes and proteins associated with diseases. Therefore, numerous opportunities have emerged for PPINs as tools for clinical management of diseases: network-based disease classification systems, discovery of biomarkers and identification of therapeutic targets. Despite the great advantages of PPINs, their use is still unrecognised by several researchers who generate high-throughput data to generally characterize interactions in a certain model or to select an interaction to study in detail. We strongly believe that both approaches are not exclusive and that we can use PPINs as a complementary methodology and rich-source of information to the initial study proposal. Here, we suggest a pipeline to deal with Y2H results using bioinformatics tools freely available for academics. Yeast two-hybrid is widely-used to identify protein-protein interactions. Conventionally, the positive clones that result from a yeast two-hybrid screening are sequenced to identify the interactors of the protein of interest (also known as bait protein), and few interactions, thought as potentially relevant for the model in study, are selected for further validation using biochemical methods (e.g. co-immunoprecipitation and co-localization). The huge amount of data that is potentially lost during this conservative approach motivated us to write this tutorial-like review, so that researchers feel encouraged to take advantage of bioinformatics tools to their full potential to analyse protein-protein interactions as a comprehensive network. Copyright © 2017 Elsevier B.V. All rights reserved.
LncRNAs expression in adjuvant-induced arthritis rats reveals the potential role of LncRNAs contributing to rheumatoid arthritis pathogenesis.

PubMed

Jiang, Hui; Qin, Xiu-Juan; Li, Wei-Ping; Ma, Rong; Wang, Ting; Li, Zhu-Qing

2016-11-15

Long non-coding RNAs (LncRNAs) are an important class of widespread molecules involved in diverse biological functions, which are exceptionally expressed in numerous types of diseases. Currently, limited study on LncRNA in rheumatoid arthritis (RA) is available. In this study, we aimed to identify the specifically expressed LncRNA that are relevant to adjuvant-induced arthritis (AA) in rats, and to explore the possible molecular mechanisms of RA pathogenesis. To identify LncRNAs specifically expressed in rheumatoid arthritis, the expression of LncRNAs in synoviums of rats from the model group (n=3) was compared with that in the control group (n=3) using Arraystar Rat LncRNA/mRNA microarray and real-time polymerase chain reaction (RT-PCR). Up to 260 LncRNAs were found to be differentially expressed (≥1.5-fold-change) in the synoviums between AA model and the normal rats (170 up-regulated and 90 down-regulated LncRNAs in AA rats compared with normal rats). Coding-non-coding gene co-expression networks (CNC network) were drawn based on the correlation analysis between the differentially expressed LncRNAs and mRNAs. Six LncRNAs, XR_008357, U75927, MRAK046251, XR_006457, DQ266363 and MRAK003448, were selected to analyze the relationship between LncRNAs and RA via the CNC network and GO analysis. Real-time PCR result confirmed that the six LncRNAs were specifically expressed in the AA rats. These results revealed that clusters of LncRNAs were uniquely expressed in AA rats compared with controls, which manifests that these differentially expressed LncRNAs in AA rats might play a vital role in RA development. Up-regulation or down-regulation of the six LncRNAs might contribute to the molecular mechanism underlying RA. To sum up, our study provides potential targets for treatment of RA and novel profound understanding of the pathogenesis of RA. Copyright © 2016. Published by Elsevier B.V.
Relationships Among Tweets Related to Radiation: Visualization Using Co-Occurring Networks.

PubMed

Yagahara, Ayako; Hanai, Keiri; Hasegawa, Shin; Ogasawara, Katsuhiko

2018-03-15

After the Fukushima Daiichi nuclear accident on March 11, 2011, interest in, and fear of, radiation increased among citizens. When such accidents occur, appropriate risk communication must provided by the government. It is therefore necessary to understand the fears of citizens in the days after such accidents. This study aimed to identify the progression of people's concerns, specifically fear, from a study of radiation-related tweets in the days after the Fukushima Daiichi nuclear accident. From approximately 1.5 million tweets in Japanese including any of the phrases "radiation" (), "radioactivity" (), and "radioactive substance" () sent March 11-17, 2011, we extracted tweets that expressed fear. We then performed a morphological analysis on the extracted tweets. Citizens' fears were visualized by creating co-occurrence networks using co-occurrence degrees showing relationship strength. Moreover, we calculated the Jaccard coefficient, which is one of the co-occurrence indices for expressing the strength of the relationship between morphemes when creating networks. From the visualization of the co-occurrence networks, we found high citizen interest in "nuclear power plant" on March 11 and 12, "health" on March 12 and 13, "medium" on March 13 and 14, and "economy" on March 15. On March 16 and 17, citizens' interest changed to "lack of goods in the afflicted area." In each co-occurrence network, trending topics, citizens' fears, and opinions to the government were extracted. This study used Twitter to understand changes in the concerns of Japanese citizens during the week after the Fukushima Daiichi nuclear accident, with a focus specifically on citizens' fears. We found that immediately after the accident, the interest in the accident itself was high, and then interest shifted to concerns affecting life, such as health and economy, as the week progressed. Clarifying citizens' fears and the dissemination of information through mass media and social media can add to improved risk communication in the future. ©Ayako Yagahara, Keiri Hanai, Shin Hasegawa, Katsuhiko Ogasawara. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 15.03.2018.
Gene network reconstruction from transcriptional dynamics under kinetic model uncertainty: a case for the second derivative

PubMed Central

Bickel, David R.; Montazeri, Zahra; Hsieh, Pei-Chun; Beatty, Mary; Lawit, Shai J.; Bate, Nicholas J.

2009-01-01

Motivation: Measurements of gene expression over time enable the reconstruction of transcriptional networks. However, Bayesian networks and many other current reconstruction methods rely on assumptions that conflict with the differential equations that describe transcriptional kinetics. Practical approximations of kinetic models would enable inferring causal relationships between genes from expression data of microarray, tag-based and conventional platforms, but conclusions are sensitive to the assumptions made. Results: The representation of a sufficiently large portion of genome enables computation of an upper bound on how much confidence one may place in influences between genes on the basis of expression data. Information about which genes encode transcription factors is not necessary but may be incorporated if available. The methodology is generalized to cover cases in which expression measurements are missing for many of the genes that might control the transcription of the genes of interest. The assumption that the gene expression level is roughly proportional to the rate of translation led to better empirical performance than did either the assumption that the gene expression level is roughly proportional to the protein level or the Bayesian model average of both assumptions. Availability: http://www.oisb.ca points to R code implementing the methods (R Development Core Team 2004). Contact: dbickel@uottawa.ca Supplementary information: http://www.davidbickel.com PMID:19218351
A comprehensive analysis on preservation patterns of gene co-expression networks during Alzheimer's disease progression.

PubMed

Ray, Sumanta; Hossain, Sk Md Mosaddek; Khatun, Lutfunnesa; Mukhopadhyay, Anirban

2017-12-20

Alzheimer's disease (AD) is a chronic neuro-degenerative disruption of the brain which involves in large scale transcriptomic variation. The disease does not impact every regions of the brain at the same time, instead it progresses slowly involving somewhat sequential interaction with different regions. Analysis of the expression patterns of the genes in different regions of the brain influenced in AD surely contribute for a enhanced comprehension of AD pathogenesis and shed light on the early characterization of the disease. Here, we have proposed a framework to identify perturbation and preservation characteristics of gene expression patterns across six distinct regions of the brain ("EC", "HIP", "PC", "MTG", "SFG", and "VCX") affected in AD. Co-expression modules were discovered considering a couple of regions at once. These are then analyzed to know the preservation and perturbation characteristics. Different module preservation statistics and a rank aggregation mechanism have been adopted to detect the changes of expression patterns across brain regions. Gene ontology (GO) and pathway based analysis were also carried out to know the biological meaning of preserved and perturbed modules. In this article, we have extensively studied the preservation patterns of co-expressed modules in six distinct brain regions affected in AD. Some modules are emerged as the most preserved while some others are detected as perturbed between a pair of brain regions. Further investigation on the topological properties of preserved and non-preserved modules reveals a substantial association amongst "betweenness centrality" and "degree" of the involved genes. Our findings may render a deeper realization of the preservation characteristics of gene expression patterns in discrete brain regions affected by AD.
SpoT-Mediated Regulation and Amino Acid Prototrophy Are Essential for Pyocyanin Production During Parasitic Growth of Pseudomonas aeruginosa in a Co-culture Model System With Aeromonas hydrophila

PubMed Central

Jagmann, Nina; Philipp, Bodo

2018-01-01

The opportunistic pathogen Pseudomonas aeruginosa employs its complex quorum sensing (QS) network to regulate the expression of virulence factors such as pyocyanin. Besides cell density, QS in this bacterium is co-regulated by environmental cues. In this study, we employed a previously established co-culture model system to identify metabolic influences that are involved in the regulation of pyocyanin production in P. aeruginosa. In this co-culture consisting of P. aeruginosa and the chitinolytic bacterium Aeromonas hydrophila, parasitic growth of P. aeruginosa is strictly dependent on the production of pyocyanin. We could show that in this co-culture, pyocyanin production is likely induced by the stringent response mediated by SpoT in response to nutrient limitation. Pyocyanin production by stringent response mutants in the co-culture could not be complemented by overexpression of PqsE. Via transposon mutagenesis, several amino acid auxotrophic mutants were identified that were also unable to produce pyocyanin when PqsE was overexpressed or when complementing amino acids were present. The inability to produce pyocyanin even though PqsE was overexpressed was likely a general effect of amino acid auxotrophy. These results show the value of the co-culture approach to identify both extra- and intracellular metabolic influences on QS that might be important in infection processes as well. PMID:29720972
Co-Option and De Novo Gene Evolution Underlie Molluscan Shell Diversity

PubMed Central

Aguilera, Felipe; McDougall, Carmel

2017-01-01

Abstract Molluscs fabricate shells of incredible diversity and complexity by localized secretions from the dorsal epithelium of the mantle. Although distantly related molluscs express remarkably different secreted gene products, it remains unclear if the evolution of shell structure and pattern is underpinned by the differential co-option of conserved genes or the integration of lineage-specific genes into the mantle regulatory program. To address this, we compare the mantle transcriptomes of 11 bivalves and gastropods of varying relatedness. We find that each species, including four Pinctada (pearl oyster) species that diverged within the last 20 Ma, expresses a unique mantle secretome. Lineage- or species-specific genes comprise a large proportion of each species’ mantle secretome. A majority of these secreted proteins have unique domain architectures that include repetitive, low complexity domains (RLCDs), which evolve rapidly, and have a proclivity to expand, contract and rearrange in the genome. There are also a large number of secretome genes expressed in the mantle that arose before the origin of gastropods and bivalves. Each species expresses a unique set of these more ancient genes consistent with their independent co-option into these mantle gene regulatory networks. From this analysis, we infer lineage-specific secretomes underlie shell diversity, and include both rapidly evolving RLCD-containing proteins, and the continual recruitment and loss of both ancient and recently evolved genes into the periphery of the regulatory network controlling gene expression in the mantle epithelium. PMID:28053006
Whole-genome transcriptomic insights into protective molecular mechanisms in metabolically healthy obese African Americans.

PubMed

Gaye, Amadou; Doumatey, Ayo P; Davis, Sharon K; Rotimi, Charles N; Gibbons, Gary H

2018-01-01

Several clinical guidelines have been proposed to distinguish metabolically healthy obesity (MHO) from other subgroups of obesity but the molecular mechanisms by which MHO individuals remain metabolically healthy despite having a high fat mass are yet to be elucidated. We conducted the first whole blood transcriptomic study designed to identify specific sets of genes that might shed novel insights into the molecular mechanisms that protect or delay the occurrence of obesity-related co-morbidities in MHO. The study included 29 African-American obese individuals, 8 MHO and 21 metabolically abnormal obese (MAO). Unbiased transcriptome-wide network analysis was carried out to identify molecular modules of co-expressed genes that are collectively associated with MHO. Network analysis identified a group of 23 co-expressed genes, including ribosomal protein genes (RPs), which were significantly downregulated in MHO subjects. The three pathways enriched in the group of co-expressed genes are EIF2 signaling, regulation of eIF4 and p70S6K signaling, and mTOR signaling. The expression of ten of the RPs collectively predicted MHO status with an area under the curve of 0.81. Triglycerides/HDL (TG/HDL) ratio, an index of insulin resistance, was the best predictor of the expression of genes in the MHO group. The higher TG/HDL values observed in the MAO subjects may underlie the activation of endoplasmic reticulum (ER) and related-stress pathways that lead to a chronic inflammatory state. In summary, these findings suggest that controlling ER stress and/or ribosomal stress by downregulating RPs or controlling TG/HDL ratio may represent effective strategies to prevent or delay the occurrence of metabolic disorders in obese individuals.
Protein interaction networks at the host-microbe interface in Diaphorina citri, the insect vector of the citrus greening pathogen.

PubMed

Ramsey, J S; Chavez, J D; Johnson, R; Hosseinzadeh, S; Mahoney, J E; Mohr, J P; Robison, F; Zhong, X; Hall, D G; MacCoss, M; Bruce, J; Cilia, M

2017-02-01

The Asian citrus psyllid ( Diaphorina citri) is the insect vector responsible for the worldwide spread of ' Candidatus Liberibacter asiaticus' (CLas), the bacterial pathogen associated with citrus greening disease. Developmental changes in the insect vector impact pathogen transmission, such that D. citri transmission of CLas is more efficient when bacteria are acquired by nymphs when compared with adults. We hypothesize that expression changes in the D. citri immune system and commensal microbiota occur during development and regulate vector competency. In support of this hypothesis, more proteins, with greater fold changes, were differentially expressed in response to CLas in adults when compared with nymphs, including insect proteins involved in bacterial adhesion and immunity. Compared with nymphs, adult insects had a higher titre of CLas and the bacterial endosymbionts Wolbachia, Profftella and Carsonella. All Wolbachia and Profftella proteins differentially expressed between nymphs and adults are upregulated in adults, while most differentially expressed Carsonella proteins are upregulated in nymphs. Discovery of protein interaction networks has broad applicability to the study of host-microbe relationships. Using protein interaction reporter technology, a D. citri haemocyanin protein highly upregulated in response to CLas was found to physically interact with the CLas coenzyme A (CoA) biosynthesis enzyme phosphopantothenoylcysteine synthetase/decarboxylase. CLas pantothenate kinase, which catalyses the rate-limiting step of CoA biosynthesis, was found to interact with a D. citri myosin protein. Two Carsonella enzymes involved in histidine and tryptophan biosynthesis were found to physically interact with D. citri proteins. These co-evolved protein interaction networks at the host-microbe interface are highly specific targets for controlling the insect vector responsible for the spread of citrus greening.
Protein interaction networks at the host–microbe interface in Diaphorina citri, the insect vector of the citrus greening pathogen

PubMed Central

Chavez, J. D.; Johnson, R.; Hosseinzadeh, S.; Mahoney, J. E.; Mohr, J. P.; Robison, F.; Zhong, X.; Hall, D. G.; MacCoss, M.; Bruce, J.; Cilia, M.

2017-01-01

The Asian citrus psyllid (Diaphorina citri) is the insect vector responsible for the worldwide spread of ‘Candidatus Liberibacter asiaticus’ (CLas), the bacterial pathogen associated with citrus greening disease. Developmental changes in the insect vector impact pathogen transmission, such that D. citri transmission of CLas is more efficient when bacteria are acquired by nymphs when compared with adults. We hypothesize that expression changes in the D. citri immune system and commensal microbiota occur during development and regulate vector competency. In support of this hypothesis, more proteins, with greater fold changes, were differentially expressed in response to CLas in adults when compared with nymphs, including insect proteins involved in bacterial adhesion and immunity. Compared with nymphs, adult insects had a higher titre of CLas and the bacterial endosymbionts Wolbachia, Profftella and Carsonella. All Wolbachia and Profftella proteins differentially expressed between nymphs and adults are upregulated in adults, while most differentially expressed Carsonella proteins are upregulated in nymphs. Discovery of protein interaction networks has broad applicability to the study of host–microbe relationships. Using protein interaction reporter technology, a D. citri haemocyanin protein highly upregulated in response to CLas was found to physically interact with the CLas coenzyme A (CoA) biosynthesis enzyme phosphopantothenoylcysteine synthetase/decarboxylase. CLas pantothenate kinase, which catalyses the rate-limiting step of CoA biosynthesis, was found to interact with a D. citri myosin protein. Two Carsonella enzymes involved in histidine and tryptophan biosynthesis were found to physically interact with D. citri proteins. These co-evolved protein interaction networks at the host–microbe interface are highly specific targets for controlling the insect vector responsible for the spread of citrus greening. PMID:28386418
Study of co-authorship network of papers in the Journal of Research in Medical Sciences using social network analysis

PubMed Central

Zare-Farashbandi, Firoozeh; Geraei, Ehsan; Siamaki, Saba

2014-01-01

Background: Co-authorship is one of the most tangible forms of research collaboration. A co-authorship network is a social network in which the authors through participation in one or more publication through an indirect path have linked to each other. The present research using the social network analysis studied co-authorship network of 681 articles published in Journal of Research in Medical Sciences (JRMS) during 2008-2012. Materials and Methods: The study was carried out with the scientometrics approach and using co-authorship network analysis of authors. The topology of the co-authorship network of 681 published articles in JRMS between 2008 and 2012 was analyzed using macro-level metrics indicators of network analysis such as density, clustering coefficient, components and mean distance. In addition, in order to evaluate the performance of each authors and countries in the network, the micro-level indicators such as degree centrality, closeness centrality and betweenness centrality as well as productivity index were used. The UCINET and NetDraw softwares were used to draw and analyze the co-authorship network of the papers. Results: The assessment of the authors productivity in this journal showed that the first ranks were belonged to only five authors, respectively. Furthermore, analysis of the co-authorship of the authors in the network demonstrated that in the betweenness centrality index, three authors of them had the good position in the network. They can be considered as the network leaders able to control the flow of information in the network compared with the other members based on the shortest paths. On the other hand, the key role of the network according to the productivity and centrality indexes was belonged to Iran, Malaysia and United States of America. Conclusion: Co-authorship network of JRMS has the characteristics of a small world network. In addition, the theory of 6° separation is valid in this network was also true. PMID:24672564
Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling

PubMed Central

Shin, Junha; Lee, Insuk

2015-01-01

Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life—Archaea, Bacteria, and Eukaryota—suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co-inheritance analysis within the domains of life will greatly potentiate the use of the expected onslaught of sequenced genomes in the study of molecular pathways in higher eukaryotes. PMID:26394049
Measuring science-technology interactions using patent citations and author-inventor links: an exploration analysis from Chinese nanotechnology

NASA Astrophysics Data System (ADS)

Wang, Gangbo; Guan, Jiancheng

2011-12-01

This article contributes to the growing study on the interactions between science and technology with China's evidence in the field of nanotechnology, based on the database of United States Patent and Trademark Office. The analysis is focused during the period of 1991-2008, a rapid increasing period for the development of nanotechnology. Using the non-patent references cited by patents, we first investigate the science-technology connections in the context of Chinese nanotechnology, especially in institutional sectors and its application fields. Those patents, produced by academic researchers and directed towards basic scientific knowledge, generally cite more scientific references with a higher proportion of self-citations. It is interesting to find that patents contributed by collaborations between public organizations and corporations seldom contain scientific references. Following an interesting path on matching the data of publications and patents, we establish the author-inventor links in this emerging field. Author-inventors, who are co-active in publishing and patenting, are at the very top of the most prolific and highly cited researchers. Finally, we employ social network analysis to explore the characteristics of scientific and technological networks generated by co-authorship and co-invention data, to investigate the position and the role of patenting-publishing scientists in these research networks.
Formation of the spinal network in zebrafish determined by domain-specific Pax genes

PubMed Central

Ikenaga, Takanori; Urban, Jason M.; Gebhart, Nichole; Hatta, Kohei; Kawakami, Koichi; Ono, Fumihito

2012-01-01

In the formation of the spinal network, various transcription factors interact to develop specific cell types. Using a gene trap technique, we established a stable line of zebrafish in which the red fluorescent protein (RFP) was inserted in the pax8 gene. RFP insertion marked putative pax8-lineage cells with fluorescence and inhibited pax8 expression in homozygous embryos. Pax8 homozygous embryos displayed defects in the otic vesicle, as previously reported in studies using morpholinos. The pax8 homozygous embryos survived to adulthood in contrast to mammalian counterparts that die prematurely. RFP is expressed in the dorsal spinal cord. Examination of the axon morphology revealed that RFP (+) neurons include Commissural Bifurcating Longitudinal (CoBL) interneurons, but other inhibitory neurons such as Commissural Local (CoLo) interneurons and Circumferential Ascending (CiA) interneurons do not express RFP. We examined the effect of inhibiting pax2a/pax8 expression on interneuron development. In pax8 homozygous fish, the RFP (+) cells undergo differentiation similar to that of pax8 heterozygous fish, and the swimming behavior remained intact. In contrast, the RFP (+) cells of pax2a/pax8 double mutants displayed altered cell fates. CoBLs were not observed. Instead, RFP (+) cells exhibited axons descending ipsilaterally: a morphology resembling that of V2a/V2b interneurons. PMID:21452218

Formation of the spinal network in zebrafish determined by domain-specific pax genes.

PubMed

Ikenaga, Takanori; Urban, Jason M; Gebhart, Nichole; Hatta, Kohei; Kawakami, Koichi; Ono, Fumihito

2011-06-01

In the formation of the spinal network, various transcription factors interact to develop specific cell types. By using a gene trap technique, we established a stable line of zebrafish in which the red fluorescent protein (RFP) was inserted into the pax8 gene. RFP insertion marked putative pax8-lineage cells with fluorescence and inhibited pax8 expression in homozygous embryos. Pax8 homozygous embryos displayed defects in the otic vesicle, as previously reported in studies with morpholinos. The pax8 homozygous embryos survived to adulthood, in contrast to mammalian counterparts that die prematurely. RFP is expressed in the dorsal spinal cord. Examination of the axon morphology revealed that RFP(+) neurons include commissural bifurcating longitudinal (CoBL) interneurons, but other inhibitory neurons such as commissural local (CoLo) interneurons and circumferential ascending (CiA) interneurons do not express RFP. We examined the effect of inhibiting pax2a/pax8 expression on interneuron development. In pax8 homozygous fish, the RFP(+) cells underwent differentiation similar to that of pax8 heterozygous fish, and the swimming behavior remained intact. In contrast, the RFP(+) cells of pax2a/pax8 double mutants displayed altered cell fates. CoBLs were not observed. Instead, RFP(+) cells exhibited axons descending ipsilaterally, a morphology resembling that of V2a/V2b interneurons. Copyright © 2010 Wiley-Liss, Inc.
A tripartite clustering analysis on microRNA, gene and disease model.

PubMed

Shen, Chengcheng; Liu, Ying

2012-02-01

Alteration of gene expression in response to regulatory molecules or mutations could lead to different diseases. MicroRNAs (miRNAs) have been discovered to be involved in regulation of gene expression and a wide variety of diseases. In a tripartite biological network of human miRNAs, their predicted target genes and the diseases caused by altered expressions of these genes, valuable knowledge about the pathogenicity of miRNAs, involved genes and related disease classes can be revealed by co-clustering miRNAs, target genes and diseases simultaneously. Tripartite co-clustering can lead to more informative results than traditional co-clustering with only two kinds of members and pass the hidden relational information along the relation chain by considering multi-type members. Here we report a spectral co-clustering algorithm for k-partite graph to find clusters with heterogeneous members. We use the method to explore the potential relationships among miRNAs, genes and diseases. The clusters obtained from the algorithm have significantly higher density than randomly selected clusters, which means members in the same cluster are more likely to have common connections. Results also show that miRNAs in the same family based on the hairpin sequences tend to belong to the same cluster. We also validate the clustering results by checking the correlation of enriched gene functions and disease classes in the same cluster. Finally, widely studied miR-17-92 and its paralogs are analyzed as a case study to reveal that genes and diseases co-clustered with the miRNAs are in accordance with current research findings.
Exploring the Genomic Roadmap and Molecular Phylogenetics Associated with MODY Cascades Using Computational Biology.

PubMed

Chakraborty, Chiranjib; Bandyopadhyay, Sanghamitra; Doss, C George Priya; Agoramoorthy, Govindasamy

2015-04-01

Maturity onset diabetes of the young (MODY) is a metabolic and genetic disorder. It is different from type 1 and type 2 diabetes with low occurrence level (1-2%) among all diabetes. This disorder is a consequence of β-cell dysfunction. Till date, 11 subtypes of MODY have been identified, and all of them can cause gene mutations. However, very little is known about the gene mapping, molecular phylogenetics, and co-expression among MODY genes and networking between cascades. This study has used latest servers and software such as VarioWatch, ClustalW, MUSCLE, G Blocks, Phylogeny.fr, iTOL, WebLogo, STRING, and KEGG PATHWAY to perform comprehensive analyses of gene mapping, multiple sequences alignment, molecular phylogenetics, protein-protein network design, co-expression analysis of MODY genes, and pathway development. The MODY genes are located in chromosomes-2, 7, 8, 9, 11, 12, 13, 17, and 20. Highly aligned block shows Pro, Gly, Leu, Arg, and Pro residues are highly aligned in the positions of 296, 386, 437, 455, 456 and 598, respectively. Alignment scores inform us that HNF1A and HNF1B proteins have shown high sequence similarity among MODY proteins. Protein-protein network design shows that HNF1A, HNF1B, HNF4A, NEUROD1, PDX1, PAX4, INS, and GCK are strongly connected, and the co-expression analyses between MODY genes also show distinct association between HNF1A and HNF4A genes. This study has used latest tools of bioinformatics to develop a rapid method to assess the evolutionary relationship, the network development, and the associations among eleven MODY genes and cascades. The prediction of sequence conservation, molecular phylogenetics, protein-protein network and the association between the MODY cascades enhances opportunities to get more insights into the less-known MODY disease.
Computational, Integrative, and Comparative Methods for the Elucidation of Genetic Coexpression Networks

DOE PAGES

Baldwin, Nicole E.; Chesler, Elissa J.; Kirov, Stefan; ...

2005-01-01

Gene expression microarray data can be used for the assembly of genetic coexpression network graphs. Using mRNA samples obtained from recombinant inbred Mus musculus strains, it is possible to integrate allelic variation with molecular and higher-order phenotypes. The depth of quantitative genetic analysis of microarray data can be vastly enhanced utilizing this mouse resource in combination with powerful computational algorithms, platforms, and data repositories. The resulting network graphs transect many levels of biological scale. This approach is illustrated with the extraction of cliques of putatively co-regulated genes and their annotation using gene ontology analysis and cis -regulatory element discovery. Themore » causal basis for co-regulation is detected through the use of quantitative trait locus mapping.« less
Genome-wide patterns of promoter sharing and co-expression in bovine skeletal muscle.

PubMed

Gu, Quan; Nagaraj, Shivashankar H; Hudson, Nicholas J; Dalrymple, Brian P; Reverter, Antonio

2011-01-12

Gene regulation by transcription factors (TF) is species, tissue and time specific. To better understand how the genetic code controls gene expression in bovine muscle we associated gene expression data from developing Longissimus thoracis et lumborum skeletal muscle with bovine promoter sequence information. We created a highly conserved genome-wide promoter landscape comprising 87,408 interactions relating 333 TFs with their 9,242 predicted target genes (TGs). We discovered that the complete set of predicted TGs share an average of 2.75 predicted TF binding sites (TFBSs) and that the average co-expression between a TF and its predicted TGs is higher than the average co-expression between the same TF and all genes. Conversely, pairs of TFs sharing predicted TGs showed a co-expression correlation higher that pairs of TFs not sharing TGs. Finally, we exploited the co-occurrence of predicted TFBS in the context of muscle-derived functionally-coherent modules including cell cycle, mitochondria, immune system, fat metabolism, muscle/glycolysis, and ribosome. Our findings enabled us to reverse engineer a regulatory network of core processes, and correctly identified the involvement of E2F1, GATA2 and NFKB1 in the regulation of cell cycle, fat, and muscle/glycolysis, respectively. The pivotal implication of our research is two-fold: (1) there exists a robust genome-wide expression signal between TFs and their predicted TGs in cattle muscle consistent with the extent of promoter sharing; and (2) this signal can be exploited to recover the cellular mechanisms underpinning transcription regulation of muscle structure and development in bovine. Our study represents the first genome-wide report linking tissue specific co-expression to co-regulation in a non-model vertebrate.
Histological analysis and identification of spermatogenesis-related genes in 2-, 6-, and 12-month-old sheep testes

NASA Astrophysics Data System (ADS)

Bai, Man; Sun, Limin; Zhao, Jia; Xiang, Lujie; Cheng, Xiaoyin; Li, Jiarong; Jia, Chao; Jiang, Huaizhi

2017-10-01

Testis development and spermatogenesis are vital factors that influence male animal fertility. In order to identify spermatogenesis-related genes and further provide a theory basis for finding biomarkers related to male sheep fertility, 2-, 6-, and 12-month-old Small Tail Han Sheep testes were selected to investigate the dynamic changes of sheep testis development. Hematoxylin-eosin routine staining and RNA-Seq technique were used to perform histological and transcriptome analysis for these testes. The results showed that 630, 102, and 322 differentially expressed genes (DEGs) were identified in 2- vs 6-month-old, 6- vs 12-month-old, and 2- vs 12-month-old testes, respectively. GO and KEGG analysis showed the following: DEGs in 2- vs 6-month-old testes were mainly related to the GO terms of sexual maturation and the pathways of multiple metabolism and biosynthesis; in 6- vs 12-month-old testes, most of the GO terms that DEGs involved in were related to metabolism and translation processes; the most significantly enriched pathway is the ribosome pathway. The union of DEGs in 2- vs 6-month-old, 6- vs 12-month-old, and 2- vs 12-month-old testes was categorized into eight profiles by series cluster. Subsequently, the eight profiles were classified into four model profiles and four co-expression networks were constructed based on the DEGs in these model profiles. Finally, 29 key regulatory genes related to spermatogenesis were identified in the four co-expression networks. The expression of 13 DEGs (CA3, APOH, MYOC, CATSPER4, SYT6, SERPINA10, DAZL, ADIPOR2, RAB13, CEP41, SPAG4, ODF1, and FRG1) was validated by RT-PCR.
Stimuli sensitive super-macroporous cryogels based on photo-crosslinked 2-hydroxyethylcellulose and chitosan.

PubMed

Stoyneva, Veselina; Momekova, Denitsa; Kostova, Bistra; Petrov, Petar

2014-01-01

Original pH sensitive cryogels, based on two biodegradable natural polymers chitosan (CS) and 2-hydroxyethylcellulose (HEC), were obtained via cryogenic treatment of semi-dilute aqueous solutions and UV induced crosslinking in frozen state. H₂O₂ and N,N'-methylenebisacrylamide (BisAAm) were used as photoinitiator and crosslinking agent, respectively. BisAAm facilitated the formation of polymer co-network and increased both the gel fraction yield and mechanical strength of cryogels. The influence of chitosan content on the physico-mechanical properties of HEC-CS cryogels was investigated. In general, the increase of CS fraction in the polymer co-network increased the degree of swelling and enhanced significantly the storage modulus of materials. All HEC-CS cryogels obtained were opalescent sponge-like materials, which quickly release/uptake water due to their open porous structure. The incorporation of CS provided pH dependent swelling and good bioadhesive properties of cryogels. HEC-CS cryogels were further exploited as drug delivery systems of the highly water soluble drug metronidazole belonging to BCS Class l. Copyright © 2013 Elsevier Ltd. All rights reserved.
Interfacial spin-filter assisted spin transfer torque effect in Co/BeO/Co magnetic tunnel junction

NASA Astrophysics Data System (ADS)

Tang, Y.-H.; Chu, F.-C.

2015-03-01

The first-principles calculation is employed to demonstrate the spin-selective transport properties and the non-collinear spin-transfer torque (STT) effect in the newly proposed Co/BeO/Co magnetic tunnel junction. The subtle spin-polarized charge transfer solely at O/Co interface gives rise to the interfacial spin-filter (ISF) effect, which can be simulated within the tight binding model to verify the general expression of STT. This allows us to predict the asymmetric bias behavior of non-collinear STT directly via the interplay between the first-principles calculated spin current densities in collinear magnetic configurations. We believe that the ISF effect, introduced by the combination between wurtzite-BeO barrier and the fcc-Co electrode, may open a new and promising route in semiconductor-based spintronics applications.
Supporting Emerging Disciplines with e-Communities: Needs and Benefits

PubMed Central

Butler, Brian S; Schleyer, Titus K; Weiss, Patricia M; Wang, Xiaoqing; Thyvalikakath, Thankam P; Hatala, Courtney L; Naderi, Reza A

2008-01-01

Background Science has developed from a solitary pursuit into a team-based collaborative activity and, more recently, into a multidisciplinary research enterprise. The increasingly collaborative character of science, mandated by complex research questions and problems that require many competencies, requires that researchers lower the barriers to the creation of collaborative networks of experts, such as communities of practice (CoPs). Objectives The aim was to assess the information needs of prospective members of a CoP in an emerging field, dental informatics, and to evaluate their expectations of an e-community in order to design a suitable electronic infrastructure. Methods A Web-based survey instrument was designed and administered to 2768 members of the target audience. Benefit expectations were analyzed for their relationship to (1) the respondents’ willingness to participate in the CoP and (2) their involvement in funded research. Two raters coded the respondents’ answers regarding expected benefits using a 14-category coding scheme (Kappa = 0.834). Results The 256 respondents (11.1% response rate) preferred electronic resources over traditional print material to satisfy their information needs. The most frequently expected benefits from participation in the CoP were general information (85% of respondents), peer networking (31.1%), and identification of potential collaborators and/or research opportunities (23.2%). Conclusions The competitive social-information environment in which CoPs are embedded presents both threats to sustainability and opportunities for greater integration and impact. CoP planners seeking to support the development of emerging biomedical science disciplines should blend information resources, social search and filtering, and visibility mechanisms to provide a portfolio of social and information benefits. Assessing benefit expectations and alternatives provides useful information for CoP planners seeking to prioritize community infrastructure development and encourage participation. PMID:18653443
DCMDN: Deep Convolutional Mixture Density Network

NASA Astrophysics Data System (ADS)

D'Isanto, Antonio; Polsterer, Kai Lars

2017-09-01

Deep Convolutional Mixture Density Network (DCMDN) estimates probabilistic photometric redshift directly from multi-band imaging data by combining a version of a deep convolutional network with a mixture density network. The estimates are expressed as Gaussian mixture models representing the probability density functions (PDFs) in the redshift space. In addition to the traditional scores, the continuous ranked probability score (CRPS) and the probability integral transform (PIT) are applied as performance criteria. DCMDN is able to predict redshift PDFs independently from the type of source, e.g. galaxies, quasars or stars and renders pre-classification of objects and feature extraction unnecessary; the method is extremely general and allows the solving of any kind of probabilistic regression problems based on imaging data, such as estimating metallicity or star formation rate in galaxies.
Robust optimal control of material flows in demand-driven supply networks

NASA Astrophysics Data System (ADS)

Laumanns, Marco; Lefeber, Erjen

2006-04-01

We develop a model based on stochastic discrete-time controlled dynamical systems in order to derive optimal policies for controlling the material flow in supply networks. Each node in the network is described as a transducer such that the dynamics of the material and information flows within the entire network can be expressed by a system of first-order difference equations, where some inputs to the system act as external disturbances. We apply methods from constrained robust optimal control to compute the explicit control law as a function of the current state. For the numerical examples considered, these control laws correspond to certain classes of optimal ordering policies from inventory management while avoiding, however, any a priori assumptions about the general form of the policy.
From Coexpression to Coregulation: An Approach to Inferring Transcriptional Regulation Among Gene Classes from Large-Scale Expression Data

NASA Technical Reports Server (NTRS)

Mjolsness, Eric; Castano, Rebecca; Mann, Tobias; Wold, Barbara

2000-01-01

We provide preliminary evidence that existing algorithms for inferring small-scale gene regulation networks from gene expression data can be adapted to large-scale gene expression data coming from hybridization microarrays. The essential steps are (I) clustering many genes by their expression time-course data into a minimal set of clusters of co-expressed genes, (2) theoretically modeling the various conditions under which the time-courses are measured using a continuous-time analog recurrent neural network for the cluster mean time-courses, (3) fitting such a regulatory model to the cluster mean time courses by simulated annealing with weight decay, and (4) analysing several such fits for commonalities in the circuit parameter sets including the connection matrices. This procedure can be used to assess the adequacy of existing and future gene expression time-course data sets for determining transcriptional regulatory relationships such as coregulation.
Social influence on 5-year survival in a longitudinal chemotherapy ward co-presence network.

PubMed

Lienert, Jeffrey; Marcum, Christopher Steven; Finney, John; Reed-Tsochas, Felix; Koehly, Laura

2017-09-01

Chemotherapy is often administered in openly designed hospital wards, where the possibility of patient-patient social influence on health exists. Previous research found that social relationships influence cancer patient's health; however, we have yet to understand social influence among patients receiving chemotherapy in the hospital. We investigate the influence of co-presence in a chemotherapy ward. We use data on 4,691 cancer patients undergoing chemotherapy in Oxfordshire, United Kingdom who average 59.8 years of age, and 44% are Male. We construct a network of patients where edges exist when patients are co-present in the ward, weighted by both patients' time in the ward. Social influence is based on total weighted co-presence with focal patients' immediate neighbors, considering neighbors' 5-year mortality. Generalized estimating equations evaluated the effect of neighbors' 5-year mortality on focal patient's 5-year mortality. Each 1,000-unit increase in weighted co-presence with a patient who dies within 5 years increases a patient's mortality odds by 42% ( β = 0.357, CI:0.204,0.510). Each 1,000-unit increase in co-presence with a patient surviving 5 years reduces a patient's odds of dying by 30% ( β = -0.344, CI:-0.538,0.149). Our results suggest that social influence occurs in chemotherapy wards, and thus may need to be considered in chemotherapy delivery.
Social influence on 5-year survival in a longitudinal chemotherapy ward co-presence network

PubMed Central

LIENERT, JEFFREY; MARCUM, CHRISTOPHER STEVEN; FINNEY, JOHN; REED-TSOCHAS, FELIX; KOEHLY, LAURA

2018-01-01

Chemotherapy is often administered in openly designed hospital wards, where the possibility of patient–patient social influence on health exists. Previous research found that social relationships influence cancer patient’s health; however, we have yet to understand social influence among patients receiving chemotherapy in the hospital. We investigate the influence of co-presence in a chemotherapy ward. We use data on 4,691 cancer patients undergoing chemotherapy in Oxfordshire, United Kingdom who average 59.8 years of age, and 44% are Male. We construct a network of patients where edges exist when patients are co-present in the ward, weighted by both patients’ time in the ward. Social influence is based on total weighted co-presence with focal patients’ immediate neighbors, considering neighbors’ 5-year mortality. Generalized estimating equations evaluated the effect of neighbors’ 5-year mortality on focal patient’s 5-year mortality. Each 1,000-unit increase in weighted co-presence with a patient who dies within 5 years increases a patient’s mortality odds by 42% (β = 0.357, CI:0.204,0.510). Each 1,000-unit increase in co-presence with a patient surviving 5 years reduces a patient’s odds of dying by 30% (β = −0.344, CI:−0.538,0.149). Our results suggest that social influence occurs in chemotherapy wards, and thus may need to be considered in chemotherapy delivery. PMID:29503731
A Machine Learning Approach to Predict Gene Regulatory Networks in Seed Development in Arabidopsis

PubMed Central

Ni, Ying; Aghamirzaie, Delasa; Elmarakeby, Haitham; Collakova, Eva; Li, Song; Grene, Ruth; Heath, Lenwood S.

2016-01-01

Gene regulatory networks (GRNs) provide a representation of relationships between regulators and their target genes. Several methods for GRN inference, both unsupervised and supervised, have been developed to date. Because regulatory relationships consistently reprogram in diverse tissues or under different conditions, GRNs inferred without specific biological contexts are of limited applicability. In this report, a machine learning approach is presented to predict GRNs specific to developing Arabidopsis thaliana embryos. We developed the Beacon GRN inference tool to predict GRNs occurring during seed development in Arabidopsis based on a support vector machine (SVM) model. We developed both global and local inference models and compared their performance, demonstrating that local models are generally superior for our application. Using both the expression levels of the genes expressed in developing embryos and prior known regulatory relationships, GRNs were predicted for specific embryonic developmental stages. The targets that are strongly positively correlated with their regulators are mostly expressed at the beginning of seed development. Potential direct targets were identified based on a match between the promoter regions of these inferred targets and the cis elements recognized by specific regulators. Our analysis also provides evidence for previously unknown inhibitory effects of three positive regulators of gene expression. The Beacon GRN inference tool provides a valuable model system for context-specific GRN inference and is freely available at https://github.com/BeaconProjectAtVirginiaTech/beacon_network_inference.git. PMID:28066488
Two-Way Gene Interaction From Microarray Data Based on Correlation Methods.

PubMed

Alavi Majd, Hamid; Talebi, Atefeh; Gilany, Kambiz; Khayyer, Nasibeh

2016-06-01

Gene networks have generated a massive explosion in the development of high-throughput techniques for monitoring various aspects of gene activity. Networks offer a natural way to model interactions between genes, and extracting gene network information from high-throughput genomic data is an important and difficult task. The purpose of this study is to construct a two-way gene network based on parametric and nonparametric correlation coefficients. The first step in constructing a Gene Co-expression Network is to score all pairs of gene vectors. The second step is to select a score threshold and connect all gene pairs whose scores exceed this value. In the foundation-application study, we constructed two-way gene networks using nonparametric methods, such as Spearman's rank correlation coefficient and Blomqvist's measure, and compared them with Pearson's correlation coefficient. We surveyed six genes of venous thrombosis disease, made a matrix entry representing the score for the corresponding gene pair, and obtained two-way interactions using Pearson's correlation, Spearman's rank correlation, and Blomqvist's coefficient. Finally, these methods were compared with Cytoscape, based on BIND, and Gene Ontology, based on molecular function visual methods; R software version 3.2 and Bioconductor were used to perform these methods. Based on the Pearson and Spearman correlations, the results were the same and were confirmed by Cytoscape and GO visual methods; however, Blomqvist's coefficient was not confirmed by visual methods. Some results of the correlation coefficients are not the same with visualization. The reason may be due to the small number of data.
Dentate granule cell recruitment of feedforward inhibition governs engram maintenance and remote memory generalization.

PubMed

Guo, Nannan; Soden, Marta E; Herber, Charlotte; Kim, Michael TaeWoo; Besnard, Antoine; Lin, Paoyan; Ma, Xiang; Cepko, Constance L; Zweifel, Larry S; Sahay, Amar

2018-05-01

Memories become less precise and generalized over time as memory traces reorganize in hippocampal-cortical networks. Increased time-dependent loss of memory precision is characterized by an overgeneralization of fear in individuals with post-traumatic stress disorder (PTSD) or age-related cognitive impairments. In the hippocampal dentate gyrus (DG), memories are thought to be encoded by so-called 'engram-bearing' dentate granule cells (eDGCs). Here we show, using rodents, that contextual fear conditioning increases connectivity between eDGCs and inhibitory interneurons (INs) in the downstream hippocampal CA3 region. We identify actin-binding LIM protein 3 (ABLIM3) as a mossy-fiber-terminal-localized cytoskeletal factor whose levels decrease after learning. Downregulation of ABLIM3 expression in DGCs was sufficient to increase connectivity with CA3 stratum lucidum INs (SLINs), promote parvalbumin (PV)-expressing SLIN activation, enhance feedforward inhibition onto CA3 and maintain a fear memory engram in the DG over time. Furthermore, downregulation of ABLIM3 expression in DGCs conferred conditioned context-specific reactivation of memory traces in hippocampal-cortical and amygdalar networks and decreased fear memory generalization at remote (i.e., distal) time points. Consistent with the observation of age-related hyperactivity of CA3, learning failed to increase DGC-SLIN connectivity in 17-month-old mice, whereas downregulation of ABLIM3 expression was sufficient to restore DGC-SLIN connectivity, increase PV+ SLIN activation and improve the precision of remote memories. These studies exemplify a connectivity-based strategy that targets a molecular brake of feedforward inhibition in DG-CA3 and may be harnessed to decrease time-dependent memory generalization in individuals with PTSD and improve memory precision in aging individuals.
The Intellectual Structure of Research on Educational Technology in Science Education (ETiSE): A Co-citation Network Analysis of Publications in Selected Journals (2008-2013)

NASA Astrophysics Data System (ADS)

Tang, Kai-Yu; Tsai, Chin-Chung

2016-01-01

The main purpose of this paper is to investigate the intellectual structure of the research on educational technology in science education (ETiSE) within the most recent years (2008-2013). Based on the criteria for educational technology research and the citation threshold for educational co-citation analysis, a total of 137 relevant ETiSE papers were identified from the International Journal of Science Education, the Journal of Research in Science Teaching, Science Education, and the Journal of Science Education and Technology. Then, a series of methodologies were performed to analyze all 137 source documents, including document co-citation analysis, social network analysis, and exploratory factor analysis. As a result, 454 co-citation ties were obtained and then graphically visualized with an undirected network, presenting a global structure of the current ETiSE research network. In addition, four major underlying intellectual subfields within the main component of the ETiSE network were extracted and named as: (1) technology-enhanced science inquiry, (2) simulation and visualization for understanding, (3) technology-enhanced chemistry learning, and (4) game-based science learning. The most influential co-citation pairs and cross-boundary phenomena were then analyzed and visualized in a co-citation network. This is the very first attempt to illuminate the core ideas underlying ETiSE research by integrating the co-citation method, factor analysis, and the networking visualization technique. The findings of this study provide a platform for scholarly discussion of the dissemination and research trends within the current ETiSE literature.
miR-638 regulates gene expression networks associated with emphysematous lung destruction

PubMed Central

2013-01-01

Background Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease characterized by varying degrees of emphysematous lung destruction and small airway disease, each with distinct effects on clinical outcomes. There is little known about how microRNAs contribute specifically to the emphysema phenotype. We examined how genome-wide microRNA expression is altered with regional emphysema severity and how these microRNAs regulate disease-associated gene expression networks. Methods We profiled microRNAs in different regions of the lung with varying degrees of emphysema from 6 smokers with COPD and 2 controls (8 regions × 8 lungs = 64 samples). Regional emphysema severity was quantified by mean linear intercept. Whole genome microRNA and gene expression data were integrated in the same samples to build co-expression networks. Candidate microRNAs were perturbed in human lung fibroblasts in order to validate these networks. Results The expression levels of 63 microRNAs (P < 0.05) were altered with regional emphysema. A subset, including miR-638, miR-30c, and miR-181d, had expression levels that were associated with those of their predicted mRNA targets. Genes correlated with these microRNAs were enriched in pathways associated with emphysema pathophysiology (for example, oxidative stress and accelerated aging). Inhibition of miR-638 expression in lung fibroblasts led to modulation of these same emphysema-related pathways. Gene targets of miR-638 in these pathways were amongst those negatively correlated with miR-638 expression in emphysema. Conclusions Our findings demonstrate that microRNAs are altered with regional emphysema severity and modulate disease-associated gene expression networks. Furthermore, miR-638 may regulate gene expression pathways related to the oxidative stress response and aging in emphysematous lung tissue and lung fibroblasts. PMID:24380442
An integrated systems genetics screen reveals the transcriptional structure of inherited predisposition to metastatic disease

PubMed Central

Faraji, Farhoud; Hu, Ying; Wu, Gang; Goldberger, Natalie E.; Walker, Renard C.; Zhang, Jinghui; Hunter, Kent W.

2014-01-01

Metastasis is the result of stochastic genomic and epigenetic events leading to gene expression profiles that drive tumor dissemination. Here we exploit the principle that metastatic propensity is modified by the genetic background to generate prognostic gene expression signatures that illuminate regulators of metastasis. We also identify multiple microRNAs whose germline variation is causally linked to tumor progression and metastasis. We employ network analysis of global gene expression profiles in tumors derived from a panel of recombinant inbred mice to identify a network of co-expressed genes centered on Cnot2 that predicts metastasis-free survival. Modulating Cnot2 expression changes tumor cell metastatic potential in vivo, supporting a functional role for Cnot2 in metastasis. Small RNA sequencing of the same tumor set revealed a negative correlation between expression of the Mir216/217 cluster and tumor progression. Expression quantitative trait locus analysis (eQTL) identified cis-eQTLs at the Mir216/217 locus, indicating that differences in expression may be inherited. Ectopic expression of Mir216/217 in tumor cells suppressed metastasis in vivo. Finally, small RNA sequencing and mRNA expression profiling data were integrated to reveal that miR-3470a/b target a high proportion of network transcripts. In vivo analysis of Mir3470a/b demonstrated that both promote metastasis. Moreover, Mir3470b is a likely regulator of the Cnot2 network as its overexpression down-regulated expression of network hub genes and enhanced metastasis in vivo, phenocopying Cnot2 knockdown. The resulting data from this strategy identify Cnot2 as a novel regulator of metastasis and demonstrate the power of our systems-level approach in identifying modifiers of metastasis. PMID:24322557

A model of gene expression based on random dynamical systems reveals modularity properties of gene regulatory networks.

PubMed

Antoneli, Fernando; Ferreira, Renata C; Briones, Marcelo R S

2016-06-01

Here we propose a new approach to modeling gene expression based on the theory of random dynamical systems (RDS) that provides a general coupling prescription between the nodes of any given regulatory network given the dynamics of each node is modeled by a RDS. The main virtues of this approach are the following: (i) it provides a natural way to obtain arbitrarily large networks by coupling together simple basic pieces, thus revealing the modularity of regulatory networks; (ii) the assumptions about the stochastic processes used in the modeling are fairly general, in the sense that the only requirement is stationarity; (iii) there is a well developed mathematical theory, which is a blend of smooth dynamical systems theory, ergodic theory and stochastic analysis that allows one to extract relevant dynamical and statistical information without solving the system; (iv) one may obtain the classical rate equations form the corresponding stochastic version by averaging the dynamic random variables (small noise limit). It is important to emphasize that unlike the deterministic case, where coupling two equations is a trivial matter, coupling two RDS is non-trivial, specially in our case, where the coupling is performed between a state variable of one gene and the switching stochastic process of another gene and, hence, it is not a priori true that the resulting coupled system will satisfy the definition of a random dynamical system. We shall provide the necessary arguments that ensure that our coupling prescription does indeed furnish a coupled regulatory network of random dynamical systems. Finally, the fact that classical rate equations are the small noise limit of our stochastic model ensures that any validation or prediction made on the basis of the classical theory is also a validation or prediction of our model. We illustrate our framework with some simple examples of single-gene system and network motifs. Copyright © 2016 Elsevier Inc. All rights reserved.
A Gene Co-Expression Network in Whole Blood of Schizophrenia Patients Is Independent of Antipsychotic-Use and Enriched for Brain-Expressed Genes

PubMed Central

de Jong, Simone; Boks, Marco P. M.; Fuller, Tova F.; Strengman, Eric; Janson, Esther; de Kovel, Carolien G. F.; Ori, Anil P. S.; Vi, Nancy; Mulder, Flip; Blom, Jan Dirk; Glenthøj, Birte; Schubart, Chris D.; Cahn, Wiepke; Kahn, René S.; Horvath, Steve; Ophoff, Roel A.

2012-01-01

Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1), is located in, and regulated by the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network. PMID:22761806
Uncovering co-expression gene network modules regulating fruit acidity in diverse apples.

PubMed

Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Zhong, Gan-Yuan; Xu, Kenong

2015-08-16

Acidity is a major contributor to fruit quality. Several organic acids are present in apple fruit, but malic acid is predominant and determines fruit acidity. The trait is largely controlled by the Malic acid (Ma) locus, underpinning which Ma1 that putatively encodes a vacuolar aluminum-activated malate transporter1 (ALMT1)-like protein is a strong candidate gene. We hypothesize that fruit acidity is governed by a gene network in which Ma1 is key member. The goal of this study is to identify the gene network and the potential mechanisms through which the network operates. Guided by Ma1, we analyzed the transcriptomes of mature fruit of contrasting acidity from six apple accessions of genotype Ma_ (MaMa or Mama) and four of mama using RNA-seq and identified 1301 fruit acidity associated genes, among which 18 were most significant acidity genes (MSAGs). Network inferring using weighted gene co-expression network analysis (WGCNA) revealed five co-expression gene network modules of significant (P < 0.001) correlation with malate. Of these, the Ma1 containing module (Turquoise) of 336 genes showed the highest correlation (0.79). We also identified 12 intramodular hub genes from each of the five modules and 18 enriched gene ontology (GO) terms and MapMan sub-bines, including two GO terms (GO:0015979 and GO:0009765) and two MapMap sub-bins (1.3.4 and 1.1.1.1) related to photosynthesis in module Turquoise. Using Lemon-Tree algorithms, we identified 12 regulator genes of probabilistic scores 35.5-81.0, including MDP0000525602 (a LLR receptor kinase), MDP0000319170 (an IQD2-like CaM binding protein) and MDP0000190273 (an EIN3-like transcription factor) of greater interest for being one of the 18 MSAGs or one of the 12 intramodular hub genes in Turquoise, and/or a regulator to the cluster containing Ma1. The most relevant finding of this study is the identification of the MSAGs, intramodular hub genes, enriched photosynthesis related processes, and regulator genes in a WGCNA module Turquoise that not only encompasses Ma1 but also shows the highest modular correlation with acidity. Overall, this study provides important insight into the Ma1-mediated gene network controlling acidity in mature apple fruit of diverse genetic background.
Magneto-transport and microstructure of Co{sub 2}Fe(Ga{sub 0.5}Ge{sub 0.5})/Cu lateral spin valves prepared by top-down microfabrication process

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ikhtiar,; Mitani, S.; Hono, K., E-mail: kazuhiro.hono@nims.go.jp

Heusler alloy-based lateral spin valves with ohmic contacts are prepared for the Co{sub 2}Fe(Ga{sub 0.5}Ge{sub 0.5})/Cu system by means of the top-down microfabrication process. The magneto-transport and microstructure are characterized to investigate the influence of the microfabrication route on the spin dependent transport of lateral spin valves systematically. A large non-local spin signal (△R{sub S}) of 17.3 mΩ is observed at room temperature, which is attributed to the highly spin-polarized Co{sub 2}Fe(Ga{sub 0.5}Ge{sub 0.5}) ferromagnet and the clean Co{sub 2}Fe(Ga{sub 0.5}Ge{sub 0.5})/Cu interfaces confirmed by transmission electron microscopy. Based on the general expression of one-dimensional spin diffusion model, we discuss themore » importance of interfacial spin polarization in Heusler alloy-based lateral spin valves.« less
CorSig: a general framework for estimating statistical significance of correlation and its application to gene co-expression analysis.

PubMed

Wang, Hong-Qiang; Tsai, Chung-Jui

2013-01-01

With the rapid increase of omics data, correlation analysis has become an indispensable tool for inferring meaningful associations from a large number of observations. Pearson correlation coefficient (PCC) and its variants are widely used for such purposes. However, it remains challenging to test whether an observed association is reliable both statistically and biologically. We present here a new method, CorSig, for statistical inference of correlation significance. CorSig is based on a biology-informed null hypothesis, i.e., testing whether the true PCC (ρ) between two variables is statistically larger than a user-specified PCC cutoff (τ), as opposed to the simple null hypothesis of ρ = 0 in existing methods, i.e., testing whether an association can be declared without a threshold. CorSig incorporates Fisher's Z transformation of the observed PCC (r), which facilitates use of standard techniques for p-value computation and multiple testing corrections. We compared CorSig against two methods: one uses a minimum PCC cutoff while the other (Zhu's procedure) controls correlation strength and statistical significance in two discrete steps. CorSig consistently outperformed these methods in various simulation data scenarios by balancing between false positives and false negatives. When tested on real-world Populus microarray data, CorSig effectively identified co-expressed genes in the flavonoid pathway, and discriminated between closely related gene family members for their differential association with flavonoid and lignin pathways. The p-values obtained by CorSig can be used as a stand-alone parameter for stratification of co-expressed genes according to their correlation strength in lieu of an arbitrary cutoff. CorSig requires one single tunable parameter, and can be readily extended to other correlation measures. Thus, CorSig should be useful for a wide range of applications, particularly for network analysis of high-dimensional genomic data. A web server for CorSig is provided at http://202.127.200.1:8080/probeWeb. R code for CorSig is freely available for non-commercial use at http://aspendb.uga.edu/downloads.
Screening of miRNA profiles and construction of regulation networks in early and late lactation of dairy goat mammary glands.

PubMed

Ji, Zhibin; Liu, Zhaohua; Chao, Tianle; Hou, Lei; Fan, Rui; He, Rongyan; Wang, Guizhi; Wang, Jianmin

2017-09-20

In recent years, studies related to the expression profiles of miRNAs in the dairy goat mammary gland were performed, but regulatory mechanisms in the physiological environment and the dynamic homeostasis of mammary gland development and lactation are not clear. In the present study, sequencing data analysis of early and late lactation uncovered a total of 1,487 unique miRNAs, including 45 novel miRNA candidates and 1,442 known and conserved miRNAs, of which 758 miRNAs were co-expressed and 378 differentially expressed with P < 0.05. Moreover, 76 non-redundant target genes were annotated in 347 GO consortiums, with 3,143 candidate target genes grouped into 33 pathways. Additionally, 18 predicted target genes of 214 miRNAs were directly annotated in mammary gland development and used to construct regulatory networks based on GO annotation and the KEGG pathway. The expression levels of seven known miRNAs and three novel miRNAs were examined using quantitative real-time PCR. The results showed that miRNAs might play important roles in early and late lactation during dairy goat mammary gland development, which will be helpful to obtain a better understanding of the genetic control of mammary gland lactation and development.
The mature anther-preferentially expressed genes are associated with pollen fertility, pollen germination and anther dehiscence in rice.

PubMed

Ling, Sheng; Chen, Caisheng; Wang, Yang; Sun, Xiaocong; Lu, Zhanhua; Ouyang, Yidan; Yao, Jialing

2015-02-19

The anthers and pollen grains are critical for male fertility and hybrid rice breeding. The development of rice mature anther and pollen consists of multiple continuous stages. However, molecular mechanisms regulating mature anther development were poorly understood. In this study, we have identified 291 mature anther-preferentially expressed genes (OsSTA) in rice based on Affymetrix microarray data. Gene Ontology (GO) analysis indicated that OsSTA genes mainly participated in metabolic and cellular processes that are likely important for rice anther and pollen development. The expression patterns of OsSTA genes were validated using real-time PCR and mRNA in situ hybridizations. Cis-element identification showed that most of the OsSTA genes had the cis-elements responsive to phytohormone regulation. Co-expression analysis of OsSTA genes showed that genes annotated with pectinesterase and calcium ion binding activities were rich in the network, suggesting that OsSTA genes could be involved in pollen germination and anther dehiscence. Furthermore, OsSTA RNAi transgenic lines showed male-sterility and pollen germination defects. The results suggested that OsSTA genes function in rice male fertility, pollen germination and anther dehiscence and established molecular regulating networks that lay the foundation for further functional studies.
Genome-Wide Identification of Regulatory Elements and Reconstruction of Gene Regulatory Networks of the Green Alga Chlamydomonas reinhardtii under Carbon Deprivation

PubMed Central

Vischi Winck, Flavia; Arvidsson, Samuel; Riaño-Pachón, Diego Mauricio; Hempel, Sabrina; Koseska, Aneta; Nikoloski, Zoran; Urbina Gomez, David Alejandro; Rupprecht, Jens; Mueller-Roeber, Bernd

2013-01-01

The unicellular green alga Chlamydomonas reinhardtii is a long-established model organism for studies on photosynthesis and carbon metabolism-related physiology. Under conditions of air-level carbon dioxide concentration [CO2], a carbon concentrating mechanism (CCM) is induced to facilitate cellular carbon uptake. CCM increases the availability of carbon dioxide at the site of cellular carbon fixation. To improve our understanding of the transcriptional control of the CCM, we employed FAIRE-seq (formaldehyde-assisted Isolation of Regulatory Elements, followed by deep sequencing) to determine nucleosome-depleted chromatin regions of algal cells subjected to carbon deprivation. Our FAIRE data recapitulated the positions of known regulatory elements in the promoter of the periplasmic carbonic anhydrase (Cah1) gene, which is upregulated during CCM induction, and revealed new candidate regulatory elements at a genome-wide scale. In addition, time series expression patterns of 130 transcription factor (TF) and transcription regulator (TR) genes were obtained for cells cultured under photoautotrophic condition and subjected to a shift from high to low [CO2]. Groups of co-expressed genes were identified and a putative directed gene-regulatory network underlying the CCM was reconstructed from the gene expression data using the recently developed IOTA (inner composition alignment) method. Among the candidate regulatory genes, two members of the MYB-related TF family, Lcr1 (Low-CO 2 response regulator 1) and Lcr2 (Low-CO 2 response regulator 2), may play an important role in down-regulating the expression of a particular set of TF and TR genes in response to low [CO2]. The results obtained provide new insights into the transcriptional control of the CCM and revealed more than 60 new candidate regulatory genes. Deep sequencing of nucleosome-depleted genomic regions indicated the presence of new, previously unknown regulatory elements in the C. reinhardtii genome. Our work can serve as a basis for future functional studies of transcriptional regulator genes and genomic regulatory elements in Chlamydomonas. PMID:24224019
Application of Weighted Gene Co-expression Network Analysis for Data from Paired Design.

PubMed

Li, Jianqiang; Zhou, Doudou; Qiu, Weiliang; Shi, Yuliang; Yang, Ji-Jiang; Chen, Shi; Wang, Qing; Pan, Hui

2018-01-12

Investigating how genes jointly affect complex human diseases is important, yet challenging. The network approach (e.g., weighted gene co-expression network analysis (WGCNA)) is a powerful tool. However, genomic data usually contain substantial batch effects, which could mask true genomic signals. Paired design is a powerful tool that can reduce batch effects. However, it is currently unclear how to appropriately apply WGCNA to genomic data from paired design. In this paper, we modified the current WGCNA pipeline to analyse high-throughput genomic data from paired design. We illustrated the modified WGCNA pipeline by analysing the miRNA dataset provided by Shiah et al. (2014), which contains forty oral squamous cell carcinoma (OSCC) specimens and their matched non-tumourous epithelial counterparts. OSCC is the sixth most common cancer worldwide. The modified WGCNA pipeline identified two sets of novel miRNAs associated with OSCC, in addition to the existing miRNAs reported by Shiah et al. (2014). Thus, this work will be of great interest to readers of various scientific disciplines, in particular, genetic and genomic scientists as well as medical scientists working on cancer.
A powerful score-based test statistic for detecting gene-gene co-association.

PubMed

Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

2016-01-29

The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.
Large Scale Proteomic Data and Network-Based Systems Biology Approaches to Explore the Plant World.

PubMed

Di Silvestre, Dario; Bergamaschi, Andrea; Bellini, Edoardo; Mauri, PierLuigi

2018-06-03

The investigation of plant organisms by means of data-derived systems biology approaches based on network modeling is mainly characterized by genomic data, while the potential of proteomics is largely unexplored. This delay is mainly caused by the paucity of plant genomic/proteomic sequences and annotations which are fundamental to perform mass-spectrometry (MS) data interpretation. However, Next Generation Sequencing (NGS) techniques are contributing to filling this gap and an increasing number of studies are focusing on plant proteome profiling and protein-protein interactions (PPIs) identification. Interesting results were obtained by evaluating the topology of PPI networks in the context of organ-associated biological processes as well as plant-pathogen relationships. These examples foreshadow well the benefits that these approaches may provide to plant research. Thus, in addition to providing an overview of the main-omic technologies recently used on plant organisms, we will focus on studies that rely on concepts of module, hub and shortest path, and how they can contribute to the plant discovery processes. In this scenario, we will also consider gene co-expression networks, and some examples of integration with metabolomic data and genome-wide association studies (GWAS) to select candidate genes will be mentioned.
Discovery of novel xylosides in co-culture of basidiomycetes Trametes versicolor and Ganoderma applanatum by integrated metabolomics and bioinformatics

NASA Astrophysics Data System (ADS)

Yao, Lu; Zhu, Li-Ping; Xu, Xiao-Yan; Tan, Ling-Ling; Sadilek, Martin; Fan, Huan; Hu, Bo; Shen, Xiao-Ting; Yang, Jie; Qiao, Bin; Yang, Song

2016-09-01

Transcriptomic analysis of cultured fungi suggests that many genes for secondary metabolite synthesis are presumably silent under standard laboratory condition. In order to investigate the expression of silent genes in symbiotic systems, 136 fungi-fungi symbiotic systems were built up by co-culturing seventeen basidiomycetes, among which the co-culture of Trametes versicolor and Ganoderma applanatum demonstrated the strongest coloration of confrontation zones. Metabolomics study of this co-culture discovered that sixty-two features were either newly synthesized or highly produced in the co-culture compared with individual cultures. Molecular network analysis highlighted a subnetwork including two novel xylosides (compounds 2 and 3). Compound 2 was further identified as N-(4-methoxyphenyl)formamide 2-O-β-D-xyloside and was revealed to have the potential to enhance the cell viability of human immortalized bronchial epithelial cell line of Beas-2B. Moreover, bioinformatics and transcriptional analysis of T. versicolor revealed a potential candidate gene (GI: 636605689) encoding xylosyltransferases for xylosylation. Additionally, 3-phenyllactic acid and orsellinic acid were detected for the first time in G. applanatum, which may be ascribed to response against T.versicolor stress. In general, the described co-culture platform provides a powerful tool to discover novel metabolites and help gain insights into the mechanism of silent gene activation in fungal defense.
Discovery of novel xylosides in co-culture of basidiomycetes Trametes versicolor and Ganoderma applanatum by integrated metabolomics and bioinformatics

PubMed Central

Yao, Lu; Zhu, Li-Ping; Xu, Xiao-Yan; Tan, Ling-Ling; Sadilek, Martin; Fan, Huan; Hu, Bo; Shen, Xiao-Ting; Yang, Jie; Qiao, Bin; Yang, Song

2016-01-01

Transcriptomic analysis of cultured fungi suggests that many genes for secondary metabolite synthesis are presumably silent under standard laboratory condition. In order to investigate the expression of silent genes in symbiotic systems, 136 fungi-fungi symbiotic systems were built up by co-culturing seventeen basidiomycetes, among which the co-culture of Trametes versicolor and Ganoderma applanatum demonstrated the strongest coloration of confrontation zones. Metabolomics study of this co-culture discovered that sixty-two features were either newly synthesized or highly produced in the co-culture compared with individual cultures. Molecular network analysis highlighted a subnetwork including two novel xylosides (compounds 2 and 3). Compound 2 was further identified as N-(4-methoxyphenyl)formamide 2-O-β-D-xyloside and was revealed to have the potential to enhance the cell viability of human immortalized bronchial epithelial cell line of Beas-2B. Moreover, bioinformatics and transcriptional analysis of T. versicolor revealed a potential candidate gene (GI: 636605689) encoding xylosyltransferases for xylosylation. Additionally, 3-phenyllactic acid and orsellinic acid were detected for the first time in G. applanatum, which may be ascribed to response against T.versicolor stress. In general, the described co-culture platform provides a powerful tool to discover novel metabolites and help gain insights into the mechanism of silent gene activation in fungal defense. PMID:27616058
An Integrative Analysis of Preeclampsia Based on the Construction of an Extended Composite Network Featuring Protein-Protein Physical Interactions and Transcriptional Relationships

PubMed Central

Vaiman, Daniel; Miralles, Francisco

2016-01-01

Preeclampsia (PE) is a pregnancy disorder defined by hypertension and proteinuria. This disease remains a major cause of maternal and fetal morbidity and mortality. Defective placentation is generally described as being at the root of the disease. The characterization of the transcriptome signature of the preeclamptic placenta has allowed to identify differentially expressed genes (DEGs). However, we still lack a detailed knowledge on how these DEGs impact the function of the placenta. The tools of network biology offer a methodology to explore complex diseases at a systems level. In this study we performed a cross-platform meta-analysis of seven publically available gene expression datasets comparing non-pathological and preeclamptic placentas. Using the rank product algorithm we identified a total of 369 DEGs consistently modified in PE. The DEGs were used as seeds to build both an extended physical protein-protein interactions network and a transcription factors regulatory network. Topological and clustering analysis was conducted to analyze the connectivity properties of the networks. Finally both networks were merged into a composite network which presents an integrated view of the regulatory pathways involved in preeclampsia and the crosstalk between them. This network is a useful tool to explore the relationship between the DEGs and enable hypothesis generation for functional experimentation. PMID:27802351
Improving ecophysiological simulation models to predict the impact of elevated atmospheric CO2 concentration on crop productivity

PubMed Central

Yin, Xinyou

2013-01-01

Background Process-based ecophysiological crop models are pivotal in assessing responses of crop productivity and designing strategies of adaptation to climate change. Most existing crop models generally over-estimate the effect of elevated atmospheric [CO2], despite decades of experimental research on crop growth response to [CO2]. Analysis A review of the literature indicates that the quantitative relationships for a number of traits, once expressed as a function of internal plant nitrogen status, are altered little by the elevated [CO2]. A model incorporating these nitrogen-based functional relationships and mechanisms simulated photosynthetic acclimation to elevated [CO2], thereby reducing the chance of over-estimating crop response to [CO2]. Robust crop models to have small parameterization requirements and yet generate phenotypic plasticity under changing environmental conditions need to capture the carbon–nitrogen interactions during crop growth. Conclusions The performance of the improved models depends little on the type of the experimental facilities used to obtain data for parameterization, and allows accurate projections of the impact of elevated [CO2] and other climatic variables on crop productivity. PMID:23388883
Meta-connectomics: human brain network and connectivity meta-analyses.

PubMed

Crossley, N A; Fox, P T; Bullmore, E T

2016-04-01

Abnormal brain connectivity or network dysfunction has been suggested as a paradigm to understand several psychiatric disorders. We here review the use of novel meta-analytic approaches in neuroscience that go beyond a summary description of existing results by applying network analysis methods to previously published studies and/or publicly accessible databases. We define this strategy of combining connectivity with other brain characteristics as 'meta-connectomics'. For example, we show how network analysis of task-based neuroimaging studies has been used to infer functional co-activation from primary data on regional activations. This approach has been able to relate cognition to functional network topology, demonstrating that the brain is composed of cognitively specialized functional subnetworks or modules, linked by a rich club of cognitively generalized regions that mediate many inter-modular connections. Another major application of meta-connectomics has been efforts to link meta-analytic maps of disorder-related abnormalities or MRI 'lesions' to the complex topology of the normative connectome. This work has highlighted the general importance of network hubs as hotspots for concentration of cortical grey-matter deficits in schizophrenia, Alzheimer's disease and other disorders. Finally, we show how by incorporating cellular and transcriptional data on individual nodes with network models of the connectome, studies have begun to elucidate the microscopic mechanisms underpinning the macroscopic organization of whole-brain networks. We argue that meta-connectomics is an exciting field, providing robust and integrative insights into brain organization that will likely play an important future role in consolidating network models of psychiatric disorders.
Exploring candidate biomarkers for lung and prostate cancers using gene expression and flux variability analysis.

PubMed

Asgari, Yazdan; Khosravi, Pegah; Zabihinpour, Zahra; Habibi, Mahnaz

2018-02-19

Genome-scale metabolic models have provided valuable resources for exploring changes in metabolism under normal and cancer conditions. However, metabolism itself is strongly linked to gene expression, so integration of gene expression data into metabolic models might improve the detection of genes involved in the control of tumor progression. Herein, we considered gene expression data as extra constraints to enhance the predictive powers of metabolic models. We reconstructed genome-scale metabolic models for lung and prostate, under normal and cancer conditions to detect the major genes associated with critical subsystems during tumor development. Furthermore, we utilized gene expression data in combination with an information theory-based approach to reconstruct co-expression networks of the human lung and prostate in both cohorts. Our results revealed 19 genes as candidate biomarkers for lung and prostate cancer cells. This study also revealed that the development of a complementary approach (integration of gene expression and metabolic profiles) could lead to proposing novel biomarkers and suggesting renovated cancer treatment strategies which have not been possible to detect using either of the methods alone.
Spatially generalizable representations of facial expressions: Decoding across partial face samples.

PubMed

Greening, Steven G; Mitchell, Derek G V; Smith, Fraser W

2018-04-01

A network of cortical and sub-cortical regions is known to be important in the processing of facial expression. However, to date no study has investigated whether representations of facial expressions present in this network permit generalization across independent samples of face information (e.g., eye region vs mouth region). We presented participants with partial face samples of five expression categories in a rapid event-related fMRI experiment. We reveal a network of face-sensitive regions that contain information about facial expression categories regardless of which part of the face is presented. We further reveal that the neural information present in a subset of these regions: dorsal prefrontal cortex (dPFC), superior temporal sulcus (STS), lateral occipital and ventral temporal cortex, and even early visual cortex, enables reliable generalization across independent visual inputs (faces depicting the 'eyes only' vs 'eyes removed'). Furthermore, classification performance was correlated to behavioral performance in STS and dPFC. Our results demonstrate that both higher (e.g., STS, dPFC) and lower level cortical regions contain information useful for facial expression decoding that go beyond the visual information presented, and implicate a key role for contextual mechanisms such as cortical feedback in facial expression perception under challenging conditions of visual occlusion. Copyright © 2017 Elsevier Ltd. All rights reserved.
A Transcriptome Meta-Analysis Proposes Novel Biological Roles for the Antifungal Protein AnAFP in Aspergillus niger

PubMed Central

Schäpe, Paul; Müller-Hagen, Dirk; Ouedraogo, Jean-Paul; Heiderich, Caroline; Jedamzick, Johanna; van den Hondel, Cees A.; Ram, Arthur F.; Meyer, Vera

2016-01-01

Understanding the genetic, molecular and evolutionary basis of cysteine-stabilized antifungal proteins (AFPs) from fungi is important for understanding whether their function is mainly defensive or associated with fungal growth and development. In the current study, a transcriptome meta-analysis of the Aspergillus niger γ-core protein AnAFP was performed to explore co-expressed genes and pathways, based on independent expression profiling microarrays covering 155 distinct cultivation conditions. This analysis uncovered that anafp displays a highly coordinated temporal and spatial transcriptional profile which is concomitant with key nutritional and developmental processes. Its expression profile coincides with early starvation response and parallels with genes involved in nutrient mobilization and autophagy. Using fluorescence- and luciferase reporter strains we demonstrated that the anafp promoter is active in highly vacuolated compartments and foraging hyphal cells during carbon starvation with CreA and FlbA, but not BrlA, as most likely regulators of anafp. A co-expression network analysis supported by luciferase-based reporter assays uncovered that anafp expression is embedded in several cellular processes including allorecognition, osmotic and oxidative stress survival, development, secondary metabolism and autophagy, and predicted StuA and VelC as additional regulators. The transcriptomic resources available for A. niger provide unparalleled resources to investigate the function of proteins. Our work illustrates how transcriptomic meta-analyses can lead to hypotheses regarding protein function and predict a role for AnAFP during slow growth, allorecognition, asexual development and nutrient recycling of A. niger and propose that it interacts with the autophagic machinery to enable these processes. PMID:27835655
A Transcriptome Meta-Analysis Proposes Novel Biological Roles for the Antifungal Protein AnAFP in Aspergillus niger.

PubMed

Paege, Norman; Jung, Sascha; Schäpe, Paul; Müller-Hagen, Dirk; Ouedraogo, Jean-Paul; Heiderich, Caroline; Jedamzick, Johanna; Nitsche, Benjamin M; van den Hondel, Cees A; Ram, Arthur F; Meyer, Vera

2016-01-01

Understanding the genetic, molecular and evolutionary basis of cysteine-stabilized antifungal proteins (AFPs) from fungi is important for understanding whether their function is mainly defensive or associated with fungal growth and development. In the current study, a transcriptome meta-analysis of the Aspergillus niger γ-core protein AnAFP was performed to explore co-expressed genes and pathways, based on independent expression profiling microarrays covering 155 distinct cultivation conditions. This analysis uncovered that anafp displays a highly coordinated temporal and spatial transcriptional profile which is concomitant with key nutritional and developmental processes. Its expression profile coincides with early starvation response and parallels with genes involved in nutrient mobilization and autophagy. Using fluorescence- and luciferase reporter strains we demonstrated that the anafp promoter is active in highly vacuolated compartments and foraging hyphal cells during carbon starvation with CreA and FlbA, but not BrlA, as most likely regulators of anafp. A co-expression network analysis supported by luciferase-based reporter assays uncovered that anafp expression is embedded in several cellular processes including allorecognition, osmotic and oxidative stress survival, development, secondary metabolism and autophagy, and predicted StuA and VelC as additional regulators. The transcriptomic resources available for A. niger provide unparalleled resources to investigate the function of proteins. Our work illustrates how transcriptomic meta-analyses can lead to hypotheses regarding protein function and predict a role for AnAFP during slow growth, allorecognition, asexual development and nutrient recycling of A. niger and propose that it interacts with the autophagic machinery to enable these processes.

A social network analysis approach to alcohol use and co-occurring addictive behavior in young adults.

PubMed

Meisel, Matthew K; Clifton, Allan D; MacKillop, James; Goodie, Adam S

2015-12-01

The current study applied egocentric social network analysis (SNA) to investigate the prevalence of addictive behavior and co-occurring substance use in college students' networks. Specifically, we examined individuals' perceptions of the frequency of network members' co-occurring addictive behavior and investigated whether co-occurring addictive behavior is spread evenly throughout networks or is more localized in clusters. We also examined differences in network composition between individuals with varying levels of alcohol use. The study utilized an egocentric SNA approach in which respondents ("egos") enumerated 30 of their closest friends, family members, co-workers, and significant others ("alters") and the relations among alters listed. Participants were 281 undergraduates at a large university in the Southeastern United States. Robust associations were observed among the frequencies of gambling, smoking, drinking, and using marijuana by network members. We also found that alters tended to cluster together into two distinct groups: one cluster moderate-to-high on co-occurring addictive behavior and the other low on co-occurring addictive behavior. Lastly, significant differences were present when examining egos' perceptions of alters' substance use between the networks of at-risk, light, and nondrinkers. These findings provide empirical evidence of distinct clustering of addictive behavior among young adults and suggest the promise of social network-based interventions for this cohort. Copyright © 2015. Published by Elsevier Ltd.
ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.

PubMed

Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

2017-01-04

The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) is contributed to various biological processes and linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represent about 20 times expansion when comparing to the previous released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulations of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Co-expression modules construction by WGCNA and identify potential prognostic markers of uveal melanoma.

PubMed

Wan, Qi; Tang, Jing; Han, Yu; Wang, Dan

2018-01-01

Uveal melanoma is an aggressive cancer which has a high percentage recurrence and with a worse prognosis. Identify the potential prognostic markers of uveal melanoma may provide information for early detection of recurrence and treatment. RNA sequence data of uveal melanoma and patient clinic traits were obtained from The Cancer Genome Atlas (TCGA) database. Co-expression modules were built by weighted gene co -expression network analysis (WGCNA) and applied to investigate the relationship underlying modules and clinic traits. Besides, functional enrichment analysis was performed on these co-expression genes from interested modules. First, using WGCNA, identified 21 co-expression modules were constructed by the 10975 genes from the 80 human uveal melanoma samples. The number of genes in these modules ranged from 42 to 5091. Found four co -expression modules significantly correlated with three clinic traits (status, recurrence and recurrence Time). Module red, and purple positively correlated with patient's life status and recurrence Time. Module green positively correlates with recurrence. The result of functional enrichment analysis showed that the module magenta was mainly enriched genetic material assemble processes, the purple module was mainly enriched in tissue homeostasis and melanosome membrane and the module red was mainly enriched metastasis of cell, suggesting its critical role in the recurrence and development of the disease. Additionally, identified the hug gene (top connectivity with other genes) in each module. The hub gene SLC17A7, NTRK2, ABTB1 and ADPRHL1 might play a vital role in recurrence of uveal melanoma. Our findings provided the framework of co-expression gene modules of uveal melanoma and identified some prognostic markers might be detection of recurrence and treatment for uveal melanoma. Copyright © 2017 Elsevier Ltd. All rights reserved.
Systems Level Analysis of Systemic Sclerosis Shows a Network of Immune and Profibrotic Pathways Connected with Genetic Polymorphisms

PubMed Central

Mahoney, J. Matthew; Taroni, Jaclyn; Martyanov, Viktor; Wood, Tammara A.; Greene, Casey S.; Pioli, Patricia A.; Hinchcliff, Monique E.; Whitfield, Michael L.

2015-01-01

Systemic sclerosis (SSc) is a rare systemic autoimmune disease characterized by skin and organ fibrosis. The pathogenesis of SSc and its progression are poorly understood. The SSc intrinsic gene expression subsets (inflammatory, fibroproliferative, normal-like, and limited) are observed in multiple clinical cohorts of patients with SSc. Analysis of longitudinal skin biopsies suggests that a patient's subset assignment is stable over 6–12 months. Genetically, SSc is multi-factorial with many genetic risk loci for SSc generally and for specific clinical manifestations. Here we identify the genes consistently associated with the intrinsic subsets across three independent cohorts, show the relationship between these genes using a gene-gene interaction network, and place the genetic risk loci in the context of the intrinsic subsets. To identify gene expression modules common to three independent datasets from three different clinical centers, we developed a consensus clustering procedure based on mutual information of partitions, an information theory concept, and performed a meta-analysis of these genome-wide gene expression datasets. We created a gene-gene interaction network of the conserved molecular features across the intrinsic subsets and analyzed their connections with SSc-associated genetic polymorphisms. The network is composed of distinct, but interconnected, components related to interferon activation, M2 macrophages, adaptive immunity, extracellular matrix remodeling, and cell proliferation. The network shows extensive connections between the inflammatory- and fibroproliferative-specific genes. The network also shows connections between these subset-specific genes and 30 SSc-associated polymorphic genes including STAT4, BLK, IRF7, NOTCH4, PLAUR, CSK, IRAK1, and several human leukocyte antigen (HLA) genes. Our analyses suggest that the gene expression changes underlying the SSc subsets may be long-lived, but mechanistically interconnected and related to a patients underlying genetic risk. PMID:25569146
A novel strategy of integrated microarray analysis identifies CENPA, CDK1 and CDC20 as a cluster of diagnostic biomarkers in lung adenocarcinoma.

PubMed

Liu, Wan-Ting; Wang, Yang; Zhang, Jing; Ye, Fei; Huang, Xiao-Hui; Li, Bin; He, Qing-Yu

2018-07-01

Lung adenocarcinoma (LAC) is the most lethal cancer and the leading cause of cancer-related death worldwide. The identification of meaningful clusters of co-expressed genes or representative biomarkers may help improve the accuracy of LAC diagnoses. Public databases, such as the Gene Expression Omnibus (GEO), provide rich resources of valuable information for clinics, however, the integration of multiple microarray datasets from various platforms and institutes remained a challenge. To determine potential indicators of LAC, we performed genome-wide relative significance (GWRS), genome-wide global significance (GWGS) and support vector machine (SVM) analyses progressively to identify robust gene biomarker signatures from 5 different microarray datasets that included 330 samples. The top 200 genes with robust signatures were selected for integrative analysis according to "guilt-by-association" methods, including protein-protein interaction (PPI) analysis and gene co-expression analysis. Of these 200 genes, only 10 genes showed both intensive PPI network and high gene co-expression correlation (r > 0.8). IPA analysis of this regulatory networks suggested that the cell cycle process is a crucial determinant of LAC. CENPA, as well as two linked hub genes CDK1 and CDC20, are determined to be potential indicators of LAC. Immunohistochemical staining showed that CENPA, CDK1 and CDC20 were highly expressed in LAC cancer tissue with co-expression patterns. A Cox regression model indicated that LAC patients with CENPA + /CDK1 + and CENPA + /CDC20 + were high-risk groups in terms of overall survival. In conclusion, our integrated microarray analysis demonstrated that CENPA, CDK1 and CDC20 might serve as novel cluster of prognostic biomarkers for LAC, and the cooperative unit of three genes provides a technically simple approach for identification of LAC patients. Copyright © 2018 Elsevier B.V. All rights reserved.
A neutron spectrum unfolding code based on generalized regression artificial neural networks.

PubMed

Del Rosario Martinez-Blanco, Ma; Ornelas-Vargas, Gerardo; Castañeda-Miranda, Celina Lizeth; Solís-Sánchez, Luis Octavio; Castañeda-Miranada, Rodrigo; Vega-Carrillo, Héctor René; Celaya-Padilla, Jose M; Garza-Veloz, Idalia; Martínez-Fierro, Margarita; Ortiz-Rodríguez, José Manuel

2016-11-01

The most delicate part of neutron spectrometry, is the unfolding process. The derivation of the spectral information is not simple because the unknown is not given directly as a result of the measurements. Novel methods based on Artificial Neural Networks have been widely investigated. In prior works, back propagation neural networks (BPNN) have been used to solve the neutron spectrometry problem, however, some drawbacks still exist using this kind of neural nets, i.e. the optimum selection of the network topology and the long training time. Compared to BPNN, it's usually much faster to train a generalized regression neural network (GRNN). That's mainly because spread constant is the only parameter used in GRNN. Another feature is that the network will converge to a global minimum, provided that the optimal values of spread has been determined and that the dataset adequately represents the problem space. In addition, GRNN are often more accurate than BPNN in the prediction. These characteristics make GRNNs to be of great interest in the neutron spectrometry domain. This work presents a computational tool based on GRNN capable to solve the neutron spectrometry problem. This computational code, automates the pre-processing, training and testing stages using a k-fold cross validation of 3 folds, the statistical analysis and the post-processing of the information, using 7 Bonner spheres rate counts as only entrance data. The code was designed for a Bonner Spheres System based on a 6 LiI(Eu) neutron detector and a response matrix expressed in 60 energy bins taken from an International Atomic Energy Agency compilation. Copyright © 2016 Elsevier Ltd. All rights reserved.
Mapping the knowledge structure of research on patient adherence: knowledge domain visualization based co-word analysis and social network analysis.

PubMed

Zhang, Juan; Xie, Jun; Hou, Wanli; Tu, Xiaochen; Xu, Jing; Song, Fujian; Wang, Zhihong; Lu, Zuxun

2012-01-01

Patient adherence is an important issue for health service providers and health researchers. However, the knowledge structure of diverse research on treatment adherence is unclear. This study used co-word analysis and social network analysis techniques to analyze research literature on adherence, and to show their knowledge structure and evolution over time. Published scientific papers about treatment adherence were retrieved from Web of Science (2000 to May 2011). A total of 2308 relevant articles were included: 788 articles published in 2000-2005 and 1520 articles published in 2006-2011. The keywords of each article were extracted by using the software Biblexcel, and the synonym and isogenous words were merged manually. The frequency of keywords and their co-occurrence frequency were counted. High frequency keywords were selected to yield the co-words matrix. Finally the decomposition maps were used to comb the complex knowledge structures. Research themes were more general in the first period (2000 to 2005), and more extensive with many more new terms in the second period (2006 to 2011). Research on adherence has covered more and more diseases, populations and methods, but other diseases/conditions are not as hot as HIV/AIDS and have not become specialty themes/sub-directions. Most studies originated from the United States. The dynamic of this field is mainly divergent, with increasing number of new sub-directions of research. Future research is required to investigate specific directions and converge as well to construct a general paradigm in this field.
Mapping the Knowledge Structure of Research on Patient Adherence: Knowledge Domain Visualization Based Co-Word Analysis and Social Network Analysis

PubMed Central

Hou, Wanli; Tu, Xiaochen; Xu, Jing; Song, Fujian; Wang, Zhihong; Lu, Zuxun

2012-01-01

Background Patient adherence is an important issue for health service providers and health researchers. However, the knowledge structure of diverse research on treatment adherence is unclear. This study used co-word analysis and social network analysis techniques to analyze research literature on adherence, and to show their knowledge structure and evolution over time. Methods Published scientific papers about treatment adherence were retrieved from Web of Science (2000 to May 2011). A total of 2308 relevant articles were included: 788 articles published in 2000–2005 and 1520 articles published in 2006–2011. The keywords of each article were extracted by using the software Biblexcel, and the synonym and isogenous words were merged manually. The frequency of keywords and their co-occurrence frequency were counted. High frequency keywords were selected to yield the co-words matrix. Finally the decomposition maps were used to comb the complex knowledge structures. Results Research themes were more general in the first period (2000 to 2005), and more extensive with many more new terms in the second period (2006 to 2011). Research on adherence has covered more and more diseases, populations and methods, but other diseases/conditions are not as hot as HIV/AIDS and have not become specialty themes/sub-directions. Most studies originated from the United States. Conclusion The dynamic of this field is mainly divergent, with increasing number of new sub-directions of research. Future research is required to investigate specific directions and converge as well to construct a general paradigm in this field. PMID:22496819
Early activation of quorum sensing in Pseudomonas aeruginosa reveals the architecture of a complex regulon.

PubMed

Schuster, Martin; Greenberg, E Peter

2007-08-22

Quorum-sensing regulation of gene expression in Pseudomonas aeruginosa is complex. Two interconnected acyl-homoserine lactone (acyl-HSL) signal-receptor pairs, 3-oxo-dodecanoyl-HSL-LasR and butanoyl-HSL-RhlR, regulate more than 300 genes. The induction of most of the genes is delayed during growth of P. aeruginosa in complex medium, cannot be advanced by addition of exogenous signal, and requires additional regulatory components. Many of these late genes can be induced by addition of signals early by using specific media conditions. While several factors super-regulate the quorum receptors, others may co-regulate target promoters or may affect expression posttranscriptionally. To better understand the contributions of super-regulation and co-regulation to quorum-sensing gene expression, and to better understand the general structure of the quorum sensing network, we ectopically expressed the two receptors (in the presence of their cognate signals) and another component that affects quorum sensing, the stationary phase sigma factor RpoS, early in growth. We determined the effect on target gene expression by microarray and real-time PCR analysis. Our results show that many target genes (e.g. lasB and hcnABC) are directly responsive to receptor protein levels. Most genes (e.g. lasA, lecA, and phnAB), however, are not significantly affected, although at least some of these genes are directly regulated by quorum sensing. The majority of promoters advanced by RhlR appeared to be regulated directly, which allowed us to build a RhlR consensus sequence. The direct responsiveness of many quorum sensing target genes to receptor protein levels early in growth confirms the role of super-regulation in quorum sensing gene expression. The observation that the induction of most target genes is not affected by signal or receptor protein levels indicates that either target promoters are co-regulated by other transcription factors, or that expression is controlled posttranscriptionally. This architecture permits the integration of multiple signaling pathways resulting in quorum responses that require a "quorum" but are otherwise highly adaptable and receptive to environmental conditions.
Facile synthesis of a mesoporous Co3O4 network for Li-storage via thermal decomposition of an amorphous metal complex.

PubMed

Wen, Wei; Wu, Jin-Ming; Cao, Min-Hua

2014-11-07

A facile strategy is developed for mass fabrication of porous Co3O4 networks via the thermal decomposition of an amorphous cobalt-based complex. At a low mass loading, the achieved porous Co3O4 network exhibits excellent performance for lithium storage, which has a high capacity of 587 mA h g(-1) after 500 cycles at a current density of 1000 mA g(-1).
Coexistence or Operational Necessity: The Role of Formally Structured Organisation and Informal Networks during Deployments

DTIC Science & Technology

2011-06-01

informal communication and informal networks which provide the quickest means of communication in organisations . These informal groupings develop ...promote creativity through sensemaking and self- organisation will better enable the military to respond to environmental changes (McDaniel, 2007... need to be addressed to facilitate the co-existence of formal organisation and informal networks during deployments? The military in general and
Architecture of the human regulatory network derived from ENCODE data.

PubMed

Gerstein, Mark B; Kundaje, Anshul; Hariharan, Manoj; Landt, Stephen G; Yan, Koon-Kiu; Cheng, Chao; Mu, Xinmeng Jasmine; Khurana, Ekta; Rozowsky, Joel; Alexander, Roger; Min, Renqiang; Alves, Pedro; Abyzov, Alexej; Addleman, Nick; Bhardwaj, Nitin; Boyle, Alan P; Cayting, Philip; Charos, Alexandra; Chen, David Z; Cheng, Yong; Clarke, Declan; Eastman, Catharine; Euskirchen, Ghia; Frietze, Seth; Fu, Yao; Gertz, Jason; Grubert, Fabian; Harmanci, Arif; Jain, Preti; Kasowski, Maya; Lacroute, Phil; Leng, Jing Jane; Lian, Jin; Monahan, Hannah; O'Geen, Henriette; Ouyang, Zhengqing; Partridge, E Christopher; Patacsil, Dorrelyn; Pauli, Florencia; Raha, Debasish; Ramirez, Lucia; Reddy, Timothy E; Reed, Brian; Shi, Minyi; Slifer, Teri; Wang, Jing; Wu, Linfeng; Yang, Xinqiong; Yip, Kevin Y; Zilberman-Schapira, Gili; Batzoglou, Serafim; Sidow, Arend; Farnham, Peggy J; Myers, Richard M; Weissman, Sherman M; Snyder, Michael

2012-09-06

Transcription factors bind in a combinatorial fashion to specify the on-and-off states of genes; the ensemble of these binding events forms a regulatory network, constituting the wiring diagram for a cell. To examine the principles of the human transcriptional regulatory network, we determined the genomic binding information of 119 transcription-related factors in over 450 distinct experiments. We found the combinatorial, co-association of transcription factors to be highly context specific: distinct combinations of factors bind at specific genomic locations. In particular, there are significant differences in the binding proximal and distal to genes. We organized all the transcription factor binding into a hierarchy and integrated it with other genomic information (for example, microRNA regulation), forming a dense meta-network. Factors at different levels have different properties; for instance, top-level transcription factors more strongly influence expression and middle-level ones co-regulate targets to mitigate information-flow bottlenecks. Moreover, these co-regulations give rise to many enriched network motifs (for example, noise-buffering feed-forward loops). Finally, more connected network components are under stronger selection and exhibit a greater degree of allele-specific activity (that is, differential binding to the two parental alleles). The regulatory information obtained in this study will be crucial for interpreting personal genome sequences and understanding basic principles of human biology and disease.
Bottom-up view of water network-mediated CO2 reduction using cryogenic cluster ion spectroscopy and direct dynamics simulations.

PubMed

Breen, Kristin J; DeBlase, Andrew F; Guasco, Timothy L; Voora, Vamsee K; Jordan, Kenneth D; Nagata, Takashi; Johnson, Mark A

2012-01-26

The transition states of a chemical reaction in solution are generally accessed through exchange of thermal energy between the solvent and the reactants. As such, an ensemble of reacting systems approaches the transition state configuration of reactant and surrounding solvent in an incoherent manner that does not lend itself to direct experimental observation. Here we describe how gas-phase cluster chemistry can provide a detailed picture of the microscopic mechanics at play when a network of six water molecules mediates the trapping of a highly reactive "hydrated electron" onto a neutral CO(2) molecule to form a radical anion. The exothermic reaction is triggered from a metastable intermediate by selective excitation of either the reactant CO(2) or the water network, which is evidenced by the evaporative decomposition of the product cluster. Ab initio molecular dynamics simulations of energized CO(2)·(H(2)O)(6)(-) clusters are used to elucidate the nature of the network deformations that mediate intracluster electron capture, thus revealing the detailed solvent fluctuations implicit in the Marcus theory for electron-transfer kinetics in solution.
Expression quantitative trait loci and genetic regulatory network analysis reveals that Gabra2 is involved in stress responses in the mouse.

PubMed

Dai, Jiajuan; Wang, Xusheng; Chen, Ying; Wang, Xiaodong; Zhu, Jun; Lu, Lu

2009-11-01

Previous studies have revealed that the subunit alpha 2 (Gabra2) of the gamma-aminobutyric acid receptor plays a critical role in the stress response. However, little is known about the gentetic regulatory network for Gabra2 and the stress response. We combined gene expression microarray analysis and quantitative trait loci (QTL) mapping to characterize the genetic regulatory network for Gabra2 expression in the hippocampus of BXD recombinant inbred (RI) mice. Our analysis found that the expression level of Gabra2 exhibited much variation in the hippocampus across the BXD RI strains and between the parental strains, C57BL/6J, and DBA/2J. Expression QTL (eQTL) mapping showed three microarray probe sets of Gabra2 to have highly significant linkage likelihood ratio statistic (LRS) scores. Gene co-regulatory network analysis showed that 10 genes, including Gria3, Chka, Drd3, Homer1, Grik2, Odz4, Prkag2, Grm5, Gabrb1, and Nlgn1 are directly or indirectly associated with stress responses. Eleven genes were implicated as Gabra2 downstream genes through mapping joint modulation. The genetical genomics approach demonstrates the importance and the potential power of the eQTL studies in identifying genetic regulatory networks that contribute to complex traits, such as stress responses.
Relationships between probabilistic Boolean networks and dynamic Bayesian networks as models of gene regulatory networks

PubMed Central

Lähdesmäki, Harri; Hautaniemi, Sampsa; Shmulevich, Ilya; Yli-Harja, Olli

2006-01-01

A significant amount of attention has recently been focused on modeling of gene regulatory networks. Two frequently used large-scale modeling frameworks are Bayesian networks (BNs) and Boolean networks, the latter one being a special case of its recent stochastic extension, probabilistic Boolean networks (PBNs). PBN is a promising model class that generalizes the standard rule-based interactions of Boolean networks into the stochastic setting. Dynamic Bayesian networks (DBNs) is a general and versatile model class that is able to represent complex temporal stochastic processes and has also been proposed as a model for gene regulatory systems. In this paper, we concentrate on these two model classes and demonstrate that PBNs and a certain subclass of DBNs can represent the same joint probability distribution over their common variables. The major benefit of introducing the relationships between the models is that it opens up the possibility of applying the standard tools of DBNs to PBNs and vice versa. Hence, the standard learning tools of DBNs can be applied in the context of PBNs, and the inference methods give a natural way of handling the missing values in PBNs which are often present in gene expression measurements. Conversely, the tools for controlling the stationary behavior of the networks, tools for projecting networks onto sub-networks, and efficient learning schemes can be used for DBNs. In other words, the introduced relationships between the models extend the collection of analysis tools for both model classes. PMID:17415411
Retroviruses facilitate the rapid evolution of the mammalian placenta

PubMed Central

Chuong, Edward B.

2015-01-01

The mammalian placenta exhibits elevated expression of endogenous retroviruses (ERVs), but the evolutionary significance of this feature remains unclear. I propose that ERV-mediated regulatory evolution was, and continues to be, an important mechanism underlying the evolution of placenta development. Many recent studies have focused on the co-option of ERV-derived genes for specific functional adaptations in the placenta. However, the co-option of ERV-derived regulatory elements has the potential to co-opt entire gene regulatory networks, which, I argue, would facilitate relatively rapid developmental evolution of the placenta. I suggest a model in which an ancient retroviral infection led to the establishment of the ancestral placental developmental gene network through the co-option of ERV-derived regulatory elements. Consequently, placenta development would require elevated tolerance to ERV activity, which in turn would expose a continuous stream of novel ERV mutations that may have catalyzed the developmental diversification of the mammalian placenta. PMID:23873343
Microarray expression profiling and co-expression network analysis of circulating LncRNAs and mRNAs associated with neurotoxicity induced by BPA.

PubMed

Pang, Wei; Lian, Fu-Zhi; Leng, Xue; Wang, Shu-Min; Li, Yi-Bo; Wang, Zi-Yu; Li, Kai-Ren; Gao, Zhi-Xian; Jiang, Yu-Gang

2018-05-01

A growing body of evidence has shown bisphenol A (BPA), an estrogen-like industrial chemical, has adverse effects on the nervous system. In this study, we investigated the transcriptional behavior of long non-coding RNAs (lncRNAs) and mRNAs to provide the information to explore neurotoxic effects induced by BPA. By microarray expression profiling, we discovered 151 differentially expressed lncRNAs and 794 differentially expressed mRNAs in the BPA intervention group compared with the control group. Gene ontology analysis indicated the differentially expressed mRNAs were mainly involved in fundamental metabolic processes and physiological and pathological conditions, such as development, synaptic transmission, homeostasis, injury, and neuroinflammation responses. In the expression network of the BPA-induced group, a great number of nodes and connections were found in comparison to the control-derived network. We identified lncRNAs that were aberrantly expressed in the BPA group, among which, growth arrest specific 5 (GAS5) might participate in the BPA-induced neurotoxicity by regulating Jun, RAS, and other pathways indirectly through these differentially expressed genes. This study provides the first investigation of genome-wide lncRNA expression and correlation between lncRNA and mRNA expression in the BPA-induced neurotoxicity. Our results suggest that the elevated expression of lncRNAs is a major biomarker in the neurotoxicity induced by BPA.
SLC9A9 Co-expression modules in autism-associated brain regions.

PubMed

Patak, Jameson; Hess, Jonathan L; Zhang-James, Yanli; Glatt, Stephen J; Faraone, Stephen V

2017-03-01

SLC9A9 is a sodium hydrogen exchanger present in the recycling endosome and highly expressed in the brain. It is implicated in neuropsychiatric disorders, including autism spectrum disorders (ASDs). Little research concerning its gene expression patterns and biological pathways has been conducted. We sought to investigate its possible biological roles in autism-associated brain regions throughout development. We conducted a weighted gene co-expression network analysis on RNA-seq data downloaded from Brainspan. We compared prenatal and postnatal gene expression networks for three ASD-associated brain regions known to have high SLC9A9 gene expression. We also performed an ASD-associated single nucleotide polymorphism enrichment analysis and a cell signature enrichment analysis. The modules showed differences in gene constituents (membership), gene number, and connectivity throughout time. SLC9A9 was highly associated with immune system functions, metabolism, apoptosis, endocytosis, and signaling cascades. Gene list comparison with co-immunoprecipitation data was significant for multiple modules. We found a disproportionately high autism risk signal among genes constituting the prenatal hippocampal module. The modules were enriched with astrocyte and oligodendrocyte markers. SLC9A9 is potentially involved in the pathophysiology of ASDs. Our investigation confirmed proposed functions for SLC9A9, such as endocytosis and immune regulation, while also revealing potential roles in mTOR signaling and cell survival.. By providing a concise molecular map and interactions, evidence of cell type and implicated brain regions we hope this will guide future research on SLC9A9. Autism Res 2017, 10: 414-429. © 2016 International Society for Autism Research, Wiley Periodicals, Inc. © 2016 International Society for Autism Research, Wiley Periodicals, Inc.
Identification of host transcriptional networks showing concentration-dependent regulation by HPV16 E6 and E7 proteins in basal cervical squamous epithelial cells

PubMed Central

Smith, Stephen P.; Scarpini, Cinzia G.; Groves, Ian J.; Odle, Richard I.; Coleman, Nicholas

2016-01-01

Development of cervical squamous cell carcinoma requires increased expression of the major high-risk human-papillomavirus (HPV) oncogenes E6 and E7 in basal cervical epithelial cells. We used a systems biology approach to identify host transcriptional networks in such cells and study the concentration-dependent changes produced by HPV16-E6 and -E7 oncoproteins. We investigated sample sets derived from the W12 model of cervical neoplastic progression, for which high quality phenotype/genotype data were available. We defined a gene co-expression matrix containing a small number of highly-connected hub nodes that controlled large numbers of downstream genes (regulons), indicating the scale-free nature of host gene co-expression in W12. We identified a small number of ‘master regulators’ for which downstream effector genes were significantly associated with protein levels of HPV16 E6 (n = 7) or HPV16 E7 (n = 5). We validated our data by depleting E6/E7 in relevant cells and by functional analysis of selected genes in vitro. We conclude that the network of transcriptional interactions in HPV16-infected basal-type cervical epithelium is regulated in a concentration-dependent manner by E6/E7, via a limited number of central master-regulators. These effects are likely to be significant in cervical carcinogenesis, where there is competitive selection of cells with elevated expression of virus oncoproteins. PMID:27457222
Exploring metabolic engineering design principles for the photosynthetic production of lactic acid by Synechocystis sp. PCC6803

PubMed Central

2014-01-01

Background Molecular engineering of the intermediary physiology of cyanobacteria has become important for the sustainable production of biofuels and commodity compounds from CO2 and sunlight by “designer microbes.” The chemical commodity product L-lactic acid can be synthesized in one step from a key intermediary metabolite of these organisms, pyruvate, catalyzed by a lactate dehydrogenase. Synthetic biology engineering to make “designer microbes” includes the introduction and overexpression of the product-forming biochemical pathway. For further optimization of product formation, modifications in the surrounding biochemical network of intermediary metabolism have to be made. Results To improve light-driven L-lactic acid production from CO2, we explored several metabolic engineering design principles, using a previously engineered L-lactic acid producing mutant strain of Synechocystis sp. PCC6803 as the benchmark. These strategies included: (i) increasing the expression level of the relevant product-forming enzyme, lactate dehydrogenase (LDH), for example, via expression from a replicative plasmid; (ii) co-expression of a heterologous pyruvate kinase to increase the flux towards pyruvate; and (iii) knockdown of phosphoenolpyruvate carboxylase to decrease the flux through a competing pathway (from phosphoenolpyruvate to oxaloacetate). In addition, we tested selected lactate dehydrogenases, some of which were further optimized through site-directed mutagenesis to improve the enzyme’s affinity for the co-factor nicotinamide adenine dinucleotide phosphate (NADPH). The carbon partitioning between biomass and lactic acid was increased from about 5% to over 50% by strain optimization. Conclusion An efficient photosynthetic microbial cell factory will display a high rate and extent of conversion of substrate (CO2) into product (here: L-lactic acid). In the existing CO2-based cyanobacterial cell factories that have been described in the literature, by far most of the control over product formation resides in the genetically introduced fermentative pathway. Here we show that a strong promoter, in combination with increased gene expression, can take away a significant part of the control of this step in lactic acid production from CO2. Under these premises, modulation of the intracellular precursor, pyruvate, can significantly increase productivity. Additionally, production enhancement is achieved by protein engineering to increase co-factor specificity of the heterologously expressed LDH. PMID:24991233

Resilient protein co-expression network in male orbitofrontal cortex layer 2/3 during human aging.

PubMed

Pabba, Mohan; Scifo, Enzo; Kapadia, Fenika; Nikolova, Yuliya S; Ma, Tianzhou; Mechawar, Naguib; Tseng, George C; Sibille, Etienne

2017-10-01

The orbitofrontal cortex (OFC) is vulnerable to normal and pathologic aging. Currently, layer resolution large-scale proteomic studies describing "normal" age-related alterations at OFC are not available. Here, we performed a large-scale exploratory high-throughput mass spectrometry-based protein analysis on OFC layer 2/3 from 15 "young" (15-43 years) and 18 "old" (62-88 years) human male subjects. We detected 4193 proteins and identified 127 differentially expressed (DE) proteins (p-value ≤0.05; effect size >20%), including 65 up- and 62 downregulated proteins (e.g., GFAP, CALB1). Using a previously described categorization of biological aging based on somatic tissues, that is, peripheral "hallmarks of aging," and considering overlap in protein function, we show the highest representation of altered cell-cell communication (54%), deregulated nutrient sensing (39%), and loss of proteostasis (35%) in the set of OFC layer 2/3 DE proteins. DE proteins also showed a significant association with several neurologic disorders; for example, Alzheimer's disease and schizophrenia. Notably, despite age-related changes in individual protein levels, protein co-expression modules were remarkably conserved across age groups, suggesting robust functional homeostasis. Collectively, these results provide biological insight into aging and associated homeostatic mechanisms that maintain normal brain function with advancing age. Copyright © 2017 Elsevier Inc. All rights reserved.
Implications of Network Topology on Stability

PubMed Central

Kinkhabwala, Ali

2015-01-01

In analogy to chemical reaction networks, I demonstrate the utility of expressing the governing equations of an arbitrary dynamical system (interaction network) as sums of real functions (generalized reactions) multiplied by real scalars (generalized stoichiometries) for analysis of its stability. The reaction stoichiometries and first derivatives define the network’s “influence topology”, a signed directed bipartite graph. Parameter reduction of the influence topology permits simplified expression of the principal minors (sums of products of non-overlapping bipartite cycles) and Hurwitz determinants (sums of products of the principal minors or the bipartite cycles directly) for assessing the network’s steady state stability. Visualization of the Hurwitz determinants over the reduced parameters defines the network’s stability phase space, delimiting the range of its dynamics (specifically, the possible numbers of unstable roots at each steady state solution). Any further explicit algebraic specification of the network will project onto this stability phase space. Stability analysis via this hierarchical approach is demonstrated on classical networks from multiple fields. PMID:25826219
The Community Collaborative Rain, Hail and Snow Network (CoCoRaHS): Timely Volunteer Precipitation Measurements to Supplement Existing Hydrometeorological Networks

NASA Astrophysics Data System (ADS)

Reges, H. W.; Doesken, N. J.; Cifelli, R. C.; Turner, J. S.

2005-12-01

The Community Collaborative Rain, Hail and Snow Network (CoCoRaHS) is a community-based, education-focused high density network of individual and family volunteers of all ages and backgrounds, who take daily measurements of rain, hail and snow at their homes, schools and businesses. Precipitation is measured using low-cost high capacity 4" diameter plastic rain gauges and Styrofoam wrapped in aluminum foil "hail pads". Thanks to the "low-tech/low-cost" approach, thousands of volunteers can afford to participate, giving the end user a large collection of data points that fill in gaps in many existing networks and data sets. Where feasible, CoCoRaHS is striving to achieve a station density approaching one observation per km-squared providing exceptional detail on cumulative storm precipitation over populated areas. These observations are collected and made available on the CoCoRaHS website: www.cocorahs.org in map and table format. The data are already being used daily by federal, state and community organizations and businesses for many resource management and hydrologic monitoring and predication applications. CoCoRaHS "Intense Rain Reports" and "Hail Reports" are used in "real time" by the National Weather Service in the issuing of flash flood warnings and severe thunderstorm warnings. While only providing once-daily and occasional event reports, CoCoRaHS does provide excellent observational consistency and accuracy including snowfall, depth and water content measurements, as well as the only comprehensive hail data currently being gathered in the U.S. The CoCoRaHS network currently engages over 2,000 volunteer observers in communities across six states, and the network continues to grow.
A framework for the establishment of a cnidarian gene regulatory network for "endomesoderm" specification: the inputs of ß-catenin/TCF signaling.

PubMed

Röttinger, Eric; Dahlin, Paul; Martindale, Mark Q

2012-01-01

Understanding the functional relationship between intracellular factors and extracellular signals is required for reconstructing gene regulatory networks (GRN) involved in complex biological processes. One of the best-studied bilaterian GRNs describes endomesoderm specification and predicts that both mesoderm and endoderm arose from a common GRN early in animal evolution. Compelling molecular, genomic, developmental, and evolutionary evidence supports the hypothesis that the bifunctional gastrodermis of the cnidarian-bilaterian ancestor is derived from the same evolutionary precursor of both endodermal and mesodermal germ layers in all other triploblastic bilaterian animals. We have begun to establish the framework of a provisional cnidarian "endomesodermal" gene regulatory network in the sea anemone, Nematostella vectensis, by using a genome-wide microarray analysis on embryos in which the canonical Wnt/ß-catenin pathway was ectopically targeted for activation by two distinct pharmaceutical agents (lithium chloride and 1-azakenpaullone) to identify potential targets of endomesoderm specification. We characterized 51 endomesodermally expressed transcription factors and signaling molecule genes (including 18 newly identified) with fine-scale temporal (qPCR) and spatial (in situ) analysis to define distinct co-expression domains within the animal plate of the embryo and clustered genes based on their earliest zygotic expression. Finally, we determined the input of the canonical Wnt/ß-catenin pathway into the cnidarian endomesodermal GRN using morpholino and mRNA overexpression experiments to show that NvTcf/canonical Wnt signaling is required to pattern both the future endomesodermal and ectodermal domains prior to gastrulation, and that both BMP and FGF (but not Notch) pathways play important roles in germ layer specification in this animal. We show both evolutionary conserved as well as profound differences in endomesodermal GRN structure compared to bilaterians that may provide fundamental insight into how GRN subcircuits have been adopted, rewired, or co-opted in various animal lineages that give rise to specialized endomesodermal cell types.
An empirical Bayes approach to network recovery using external knowledge.

PubMed

Kpogbezan, Gino B; van der Vaart, Aad W; van Wieringen, Wessel N; Leday, Gwenaël G R; van de Wiel, Mark A

2017-09-01

Reconstruction of a high-dimensional network may benefit substantially from the inclusion of prior knowledge on the network topology. In the case of gene interaction networks such knowledge may come for instance from pathway repositories like KEGG, or be inferred from data of a pilot study. The Bayesian framework provides a natural means of including such prior knowledge. Based on a Bayesian Simultaneous Equation Model, we develop an appealing Empirical Bayes (EB) procedure that automatically assesses the agreement of the used prior knowledge with the data at hand. We use variational Bayes method for posterior densities approximation and compare its accuracy with that of Gibbs sampling strategy. Our method is computationally fast, and can outperform known competitors. In a simulation study, we show that accurate prior data can greatly improve the reconstruction of the network, but need not harm the reconstruction if wrong. We demonstrate the benefits of the method in an analysis of gene expression data from GEO. In particular, the edges of the recovered network have superior reproducibility (compared to that of competitors) over resampled versions of the data. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
ChlamyNET: a Chlamydomonas gene co-expression network reveals global properties of the transcriptome and the early setup of key co-expression patterns in the green lineage.

PubMed

Romero-Campero, Francisco J; Perez-Hurtado, Ignacio; Lucas-Reina, Eva; Romero, Jose M; Valverde, Federico

2016-03-12

Chlamydomonas reinhardtii is the model organism that serves as a reference for studies in algal genomics and physiology. It is of special interest in the study of the evolution of regulatory pathways from algae to higher plants. Additionally, it has recently gained attention as a potential source for bio-fuel and bio-hydrogen production. The genome of Chlamydomonas is available, facilitating the analysis of its transcriptome by RNA-seq data. This has produced a massive amount of data that remains fragmented making necessary the application of integrative approaches based on molecular systems biology. We constructed a gene co-expression network based on RNA-seq data and developed a web-based tool, ChlamyNET, for the exploration of the Chlamydomonas transcriptome. ChlamyNET exhibits a scale-free and small world topology. Applying clustering techniques, we identified nine gene clusters that capture the structure of the transcriptome under the analyzed conditions. One of the most central clusters was shown to be involved in carbon/nitrogen metabolism and signalling, whereas one of the most peripheral clusters was involved in DNA replication and cell cycle regulation. The transcription factors and regulators in the Chlamydomonas genome have been identified in ChlamyNET. The biological processes potentially regulated by them as well as their putative transcription factor binding sites were determined. The putative light regulated transcription factors and regulators in the Chlamydomonas genome were analyzed in order to provide a case study on the use of ChlamyNET. Finally, we used an independent data set to cross-validate the predictive power of ChlamyNET. The topological properties of ChlamyNET suggest that the Chlamydomonas transcriptome posseses important characteristics related to error tolerance, vulnerability and information propagation. The central part of ChlamyNET constitutes the core of the transcriptome where most authoritative hub genes are located interconnecting key biological processes such as light response with carbon and nitrogen metabolism. Our study reveals that key elements in the regulation of carbon and nitrogen metabolism, light response and cell cycle identified in higher plants were already established in Chlamydomonas. These conserved elements are not only limited to transcription factors, regulators and their targets, but also include the cis-regulatory elements recognized by them.
Identification of transcriptional regulatory nodes in soybean defense networks using transient co-transactivation assays

PubMed Central

Wang, Yongli; Wang, Hui; Ma, Yujie; Du, Haiping; Yang, Qing; Yu, Deyue

2015-01-01

Plant responses to major environmental stressors, such as insect feeding, not only occur via the functions of defense genes but also involve a series of regulatory factors. Our previous transcriptome studies proposed that, in addition to two defense-related genes, GmVSPβ and GmN:IFR, a high proportion of transcription factors (TFs) participate in the incompatible soybean-common cutworm interaction networks. However, the regulatory mechanisms and effects of these TFs on those induced defense-related genes remain unknown. In the present work, we isolated and identified 12 genes encoding MYB, WRKY, NAC, bZIP, and DREB TFs from a common cutworm-induced cDNA library of a resistant soybean line. Sequence analysis of the promoters of three co-expressed genes, including GmVSPα, GmVSPβ, and GmN:IFR, revealed the enrichment of various TF-binding sites for defense and stress responses. To further identify the regulatory nodes composed of these TFs and defense gene promoters, we performed extensive transient co-transactivation assays to directly test the transcriptional activity of the 12 TFs binding at different levels to the three co-expressed gene promoters. The results showed that all 12 TFs were able to transactivate the GmVSPβ and GmN:IFR promoters. GmbZIP110 and GmMYB75 functioned as distinct regulators of GmVSPα/β and GmN:IFR expression, respectively, while GmWRKY39 acted as a common central regulator of GmVSPα/β and GmN:IFR expression. These corresponding TFs play crucial roles in coordinated plant defense regulation, which provides valuable information for understanding the molecular mechanisms involved in insect-induced transcriptional regulation in soybean. More importantly, the identified TFs and suitable promoters can be used to engineer insect-resistant plants in molecular breeding studies. PMID:26579162
Identifying Corresponding Patches in SAR and Optical Images With a Pseudo-Siamese CNN

NASA Astrophysics Data System (ADS)

Hughes, Lloyd H.; Schmitt, Michael; Mou, Lichao; Wang, Yuanyuan; Zhu, Xiao Xiang

2018-05-01

In this letter, we propose a pseudo-siamese convolutional neural network (CNN) architecture that enables to solve the task of identifying corresponding patches in very-high-resolution (VHR) optical and synthetic aperture radar (SAR) remote sensing imagery. Using eight convolutional layers each in two parallel network streams, a fully connected layer for the fusion of the features learned in each stream, and a loss function based on binary cross-entropy, we achieve a one-hot indication if two patches correspond or not. The network is trained and tested on an automatically generated dataset that is based on a deterministic alignment of SAR and optical imagery via previously reconstructed and subsequently co-registered 3D point clouds. The satellite images, from which the patches comprising our dataset are extracted, show a complex urban scene containing many elevated objects (i.e. buildings), thus providing one of the most difficult experimental environments. The achieved results show that the network is able to predict corresponding patches with high accuracy, thus indicating great potential for further development towards a generalized multi-sensor key-point matching procedure. Index Terms-synthetic aperture radar (SAR), optical imagery, data fusion, deep learning, convolutional neural networks (CNN), image matching, deep matching
Gene networks associated with conditional fear in mice identified using a systems genetics approach

PubMed Central

2011-01-01

Background Our understanding of the genetic basis of learning and memory remains shrouded in mystery. To explore the genetic networks governing the biology of conditional fear, we used a systems genetics approach to analyze a hybrid mouse diversity panel (HMDP) with high mapping resolution. Results A total of 27 behavioral quantitative trait loci were mapped with a false discovery rate of 5%. By integrating fear phenotypes, transcript profiling data from hippocampus and striatum and also genotype information, two gene co-expression networks correlated with context-dependent immobility were identified. We prioritized the key markers and genes in these pathways using intramodular connectivity measures and structural equation modeling. Highly connected genes in the context fear modules included Psmd6, Ube2a and Usp33, suggesting an important role for ubiquitination in learning and memory. In addition, we surveyed the architecture of brain transcript regulation and demonstrated preservation of gene co-expression modules in hippocampus and striatum, while also highlighting important differences. Rps15a, Kif3a, Stard7, 6330503K22RIK, and Plvap were among the individual genes whose transcript abundance were strongly associated with fear phenotypes. Conclusion Application of our multi-faceted mapping strategy permits an increasingly detailed characterization of the genetic networks underlying behavior. PMID:21410935
Molecular signatures in Arabidopsis thaliana in response to insect attack and bacterial infection.

PubMed

Barah, Pankaj; Winge, Per; Kusnierczyk, Anna; Tran, Diem Hong; Bones, Atle M

2013-01-01

Under the threat of global climatic change and food shortages, it is essential to take the initiative to obtain a comprehensive understanding of common and specific defence mechanisms existing in plant systems for protection against different types of biotic invaders. We have implemented an integrated approach to analyse the overall transcriptomic reprogramming and systems-level defence responses in the model plant species Arabidopsis thaliana (A. thaliana henceforth) during insect Brevicoryne brassicae (B. brassicae henceforth) and bacterial Pseudomonas syringae pv. tomato strain DC3000 (P. syringae henceforth) attacks. The main aim of this study was to identify the attacker-specific and general defence response signatures in A. thaliana when attacked by phloem-feeding aphids or pathogenic bacteria. The obtained annotated networks of differentially expressed transcripts indicated that members of transcription factor families, such as WRKY, MYB, ERF, BHLH and bZIP, could be crucial for stress-specific defence regulation in Arabidopsis during aphid and P. syringae attack. The defence response pathways, signalling pathways and metabolic processes associated with aphid attack and P. syringae infection partially overlapped. Components of several important biosynthesis and signalling pathways, such as salicylic acid (SA), jasmonic acid (JA), ethylene (ET) and glucosinolates, were differentially affected during the two the treatments. Several stress-regulated transcription factors were known to be associated with stress-inducible microRNAs. The differentially regulated gene sets included many signature transcription factors, and our co-expression analysis showed that they were also strongly co-expressed during 69 other biotic stress experiments. Defence responses and functional networks that were unique and specific to aphid or P. syringae stresses were identified. Furthermore, our analysis revealed a probable link between biotic stress and microRNAs in Arabidopsis and, thus gives indicates a new direction for conducting large-scale targeted experiments to explore the detailed regulatory links between them. The presented results provide a comparative understanding of Arabidopsis - B. brassicae and Arabidopsis - P. syringae interactions at the transcriptomic level.
Gene expression patterns combined with bioinformatics analysis identify genes associated with cholangiocarcinoma.

PubMed

Li, Chen; Shen, Weixing; Shen, Sheng; Ai, Zhilong

2013-12-01

To explore the molecular mechanisms of cholangiocarcinoma (CC), microarray technology was used to find biomarkers for early detection and diagnosis. The gene expression profiles from 6 patients with CC and 5 normal controls were downloaded from Gene Expression Omnibus and compared. As a result, 204 differentially co-expressed genes (DCGs) in CC patients compared to normal controls were identified using a computational bioinformatics analysis. These genes were mainly involved in coenzyme metabolic process, peptidase activity and oxidation reduction. A regulatory network was constructed by mapping the DCGs to known regulation data. Four transcription factors, FOXC1, ZIC2, NKX2-2 and GCGR, were hub nodes in the network. In conclusion, this study provides a set of targets useful for future investigations into molecular biomarker studies. Copyright © 2013 Elsevier Ltd. All rights reserved.
The regulatory software of cellular metabolism.

PubMed

Segrè, Daniel

2004-06-01

Understanding the regulation of metabolic pathways in the cell is like unraveling the 'software' that is running on the 'hardware' of the metabolic network. Transcriptional regulation of enzymes is an important component of this software. A recent systematic analysis of metabolic gene-expression data in Saccharomyces cerevisiae reveals a complex modular organization of co-expressed genes, which could increase our ability to understand and engineer cellular metabolic functions.
Stochastic surface walking reaction sampling for resolving heterogeneous catalytic reaction network: A revisit to the mechanism of water-gas shift reaction on Cu

NASA Astrophysics Data System (ADS)

Zhang, Xiao-Jie; Shang, Cheng; Liu, Zhi-Pan

2017-10-01

Heterogeneous catalytic reactions on surface and interfaces are renowned for ample intermediate adsorbates and complex reaction networks. The common practice to reveal the reaction mechanism is via theoretical computation, which locates all likely transition states based on the pre-guessed reaction mechanism. Here we develop a new theoretical method, namely, stochastic surface walking (SSW)-Cat method, to resolve the lowest energy reaction pathway of heterogeneous catalytic reactions, which combines our recently developed SSW global structure optimization and SSW reaction sampling. The SSW-Cat is automated and massively parallel, taking a rough reaction pattern as input to guide reaction search. We present the detailed algorithm, discuss the key features, and demonstrate the efficiency in a model catalytic reaction, water-gas shift reaction on Cu(111) (CO + H2O → CO2 + H2). The SSW-Cat simulation shows that water dissociation is the rate-determining step and formic acid (HCOOH) is the kinetically favorable product, instead of the observed final products, CO2 and H2. It implies that CO2 and H2 are secondary products from further decomposition of HCOOH at high temperatures. Being a general purpose tool for reaction prediction, the SSW-Cat may be utilized for rational catalyst design via large-scale computations.
Integrating mRNA and miRNA Weighted Gene Co-Expression Networks with eQTLs in the Nucleus Accumbens of Subjects with Alcohol Dependence

PubMed Central

Blevins, Tana; Aliev, Fazil; Adkins, Amy; Hack, Laura; Bigdeli, Tim; D. van der Vaart, Andrew; Web, Bradley Todd; Bacanu, Silviu-Alin; Kalsi, Gursharan; Kendler, Kenneth S.; Miles, Michael F.; Dick, Danielle; Riley, Brien P.; Dumur, Catherine; Vladimirov, Vladimir I.

2015-01-01

Alcohol consumption is known to lead to gene expression changes in the brain. After performing weighted gene co-expression network analyses (WGCNA) on genome-wide mRNA and microRNA (miRNA) expression in Nucleus Accumbens (NAc) of subjects with alcohol dependence (AD; N = 18) and of matched controls (N = 18), six mRNA and three miRNA modules significantly correlated with AD were identified (Bonferoni-adj. p≤ 0.05). Cell-type-specific transcriptome analyses revealed two of the mRNA modules to be enriched for neuronal specific marker genes and downregulated in AD, whereas the remaining four mRNA modules were enriched for astrocyte and microglial specific marker genes and upregulated in AD. Gene set enrichment analysis demonstrated that neuronal specific modules were enriched for genes involved in oxidative phosphorylation, mitochondrial dysfunction and MAPK signaling. Glial-specific modules were predominantly enriched for genes involved in processes related to immune functions, i.e. cytokine signaling (all adj. p≤ 0.05). In mRNA and miRNA modules, 461 and 25 candidate hub genes were identified, respectively. In contrast to the expected biological functions of miRNAs, correlation analyses between mRNA and miRNA hub genes revealed a higher number of positive than negative correlations (χ2 test p≤ 0.0001). Integration of hub gene expression with genome-wide genotypic data resulted in 591 mRNA cis-eQTLs and 62 miRNA cis-eQTLs. mRNA cis-eQTLs were significantly enriched for AD diagnosis and AD symptom counts (adj. p = 0.014 and p = 0.024, respectively) in AD GWAS signals in a large, independent genetic sample from the Collaborative Study on Genetics of Alcohol (COGA). In conclusion, our study identified putative gene network hubs coordinating mRNA and miRNA co-expression changes in the NAc of AD subjects, and our genetic (cis-eQTL) analysis provides novel insights into the etiological mechanisms of AD. PMID:26381263
Genome-wide analysis of coordinated transcript abundance during seed development in different Brassica rapa morphotypes.

PubMed

Basnet, Ram Kumar; Moreno-Pachon, Natalia; Lin, Ke; Bucher, Johan; Visser, Richard G F; Maliepaard, Chris; Bonnema, Guusje

2013-12-01

Brassica seeds are important as basic units of plant growth and sources of vegetable oil. Seed development is regulated by many dynamic metabolic processes controlled by complex networks of spatially and temporally expressed genes. We conducted a global microarray gene co-expression analysis by measuring transcript abundance of developing seeds from two diverse B. rapa morphotypes: a pak choi (leafy-type) and a yellow sarson (oil-type), and two of their doubled haploid (DH) progenies, (1) to study the timing of metabolic processes in developing seeds, (2) to explore the major transcriptional differences in developing seeds of the two morphotypes, and (3) to identify the optimum stage for a genetical genomics study in B. rapa seed. Seed developmental stages were similar in developing seeds of pak choi and yellow sarson of B. rapa; however, the colour of embryo and seed coat differed among these two morphotypes. In this study, most transcriptional changes occurred between 25 and 35 DAP, which shows that the timing of seed developmental processes in B. rapa is at later developmental stages than in the related species B. napus. Using a Weighted Gene Co-expression Network Analysis (WGCNA), we identified 47 "gene modules", of which 27 showed a significant association with temporal and/or genotypic variation. An additional hierarchical cluster analysis identified broad spectra of gene expression patterns during seed development. The predominant variation in gene expression was according to developmental stages rather than morphotype differences. Since lipids are the major storage compounds of Brassica seeds, we investigated in more detail the regulation of lipid metabolism. Four co-regulated gene clusters were identified with 17 putative cis-regulatory elements predicted in their 1000 bp upstream region, either specific or common to different lipid metabolic pathways. This is the first study of genome-wide profiling of transcript abundance during seed development in B. rapa. The identification of key physiological events, major expression patterns, and putative cis-regulatory elements provides useful information to construct gene regulatory networks in B. rapa developing seeds and provides a starting point for a genetical genomics study of seed quality traits.
A copper-phosphonate network as a high-performance heterogeneous catalyst for the CO2 cycloaddition reactions and alcoholysis of epoxides.

PubMed

Ai, Jing; Min, Xue; Gao, Chao-Ying; Tian, Hong-Rui; Dang, Song; Sun, Zhong-Ming

2017-05-23

A novel 3D copper-phosphonate network, with the general formula Cu 7 (H 1 L) 2 (TPT) 3 (H 2 O) 6 , namely compound 1, has been synthesized using a rigid tetrahedral linker tetraphenylsilane tetrakis-4-phosphonic acid (H 8 L) and a nitrogen-containing ancillary ligand (TPT: [5-(4-(1H-1,2,4-triazol-1-yl)phenyl)-1H-tetrazole]) under hydrothermal conditions. The compound was fully characterized using PXRD, ICP, IR, TGA and elemental analysis. Compound 1 can be used as an efficient catalyst for the CO 2 coupling reaction that is greatly superior to many conventional MOF-based catalysts, where porosity is always mentioned and used. In addition, it shows excellent catalytic performance for ring-opening reactions with epoxides under ambient conditions. Additionally, compound 1 can be recycled at least three times without a significant compromise in the activity in the two catalytic reactions.
Dynamics of Bacterial Gene Regulatory Networks.

PubMed

Shis, David L; Bennett, Matthew R; Igoshin, Oleg A

2018-05-20

The ability of bacterial cells to adjust their gene expression program in response to environmental perturbation is often critical for their survival. Recent experimental advances allowing us to quantitatively record gene expression dynamics in single cells and in populations coupled with mathematical modeling enable mechanistic understanding on how these responses are shaped by the underlying regulatory networks. Here, we review how the combination of local and global factors affect dynamical responses of gene regulatory networks. Our goal is to discuss the general principles that allow extrapolation from a few model bacteria to less understood microbes. We emphasize that, in addition to well-studied effects of network architecture, network dynamics are shaped by global pleiotropic effects and cell physiology.
Hydrogen-assisted versus hydroxyl-assisted CO dissociation over Co-doped Cu(111): A DFT study

NASA Astrophysics Data System (ADS)

Zha, Hao; Dong, Xiuqin; Yu, Yingzhe; Zhang, Minhua

2018-03-01

First principle based density functional theory (DFT) was used to calculate the step-by-step hydrogenation and dissociation reaction network of carbon monoxide (CO) over Co-doped Cu(111) surface as a model for understanding the lateral interaction of surface hydroxyl species (OH) on these reactions. We discussed the Csbnd O bond length and the adsorption energy changes of reaction intermediates under different adsorption circumstances for purpose of making out the effect of surface hydroxyl on the reaction selectivity. Reaction intermediates co-adsorbed with H atom and hydroxyl could undergo H-assisted or OH-assisted routes. The calculations show that the OH-assisted route prefers with the formation of COH, CHOH and CH2OH while general H-assisted route prefers with the formation of HCO, CH2O and CH3O. Considering the rather low activation barrier of COH, CHOH and CH2OH to form CHX, the existence of hydroxyl on the surface is in favor of boosting the CHX and suppressing the methanol.
Integrating Genetic and Functional Genomic Data to Elucidate Common Disease Tra

NASA Astrophysics Data System (ADS)

Schadt, Eric

2005-03-01

The reconstruction of genetic networks in mammalian systems is one of the primary goals in biological research, especially as such reconstructions relate to elucidating not only common, polygenic human diseases, but living systems more generally. Here I present a statistical procedure for inferring causal relationships between gene expression traits and more classic clinical traits, including complex disease traits. This procedure has been generalized to the gene network reconstruction problem, where naturally occurring genetic variations in segregating mouse populations are used as a source of perturbations to elucidate tissue-specific gene networks. Differences in the extent of genetic control between genders and among four different tissues are highlighted. I also demonstrate that the networks derived from expression data in segregating mouse populations using the novel network reconstruction algorithm are able to capture causal associations between genes that result in increased predictive power, compared to more classically reconstructed networks derived from the same data. This approach to causal inference in large segregating mouse populations over multiple tissues not only elucidates fundamental aspects of transcriptional control, it also allows for the objective identification of key drivers of common human diseases.
Integrated Regulatory and Metabolic Networks of the Marine Diatom Phaeodactylum tricornutum Predict the Response to Rising CO 2 Levels

DOE PAGES

Levering, Jennifer; Dupont, Christopher L.; Allen, Andrew E.; ...

2017-02-14

Diatoms are eukaryotic microalgae that are responsible for up to 40% of the ocean’s primary productivity. How diatoms respond to environmental perturbations such as elevated carbon concentrations in the atmosphere is currently poorly understood. We developed a transcriptional regulatory network based on various transcriptome sequencing expression libraries for different environmental responses to gain insight into the marine diatom’s metabolic and regulatory interactions and provide a comprehensive framework of responses to increasing atmospheric carbon levels. This transcriptional regulatory network was integrated with a recently published genome-scale metabolic model of Phaeodactylum tricornutum to explore the connectivity of the regulatory network and sharedmore » metabolites. The integrated regulatory and metabolic model revealed highly connected modules within carbon and nitrogen metabolism. P. tricornutum’s response to rising carbon levels was analyzed by using the recent genome-scale metabolic model with cross comparison to experimental manipulations of carbon dioxide. Using a systems biology approach, we studied the response of the marine diatom Phaeodactylum tricornutum to changing atmospheric carbon concentrations on an ocean-wide scale. By integrating an available genome-scale metabolic model and a newly developed transcriptional regulatory network inferred from transcriptome sequencing expression data, we demonstrate that carbon metabolism and nitrogen metabolism are strongly connected and the genes involved are coregulated in this model diatom. These tight regulatory constraints could play a major role during the adaptation of P. tricornutum to increasing carbon levels. The transcriptional regulatory network developed can be further used to study the effects of different environmental perturbations on P. tricornutum’s metabolism.« less

Some links on this page may take you to non-federal websites. Their policies may differ from this site.