Sample records for core gene network

  1. Gene regulation is governed by a core network in hepatocellular carcinoma.

    PubMed

    Gu, Zuguang; Zhang, Chenyu; Wang, Jin

    2012-05-01

    Hepatocellular carcinoma (HCC) is one of the most lethal cancers worldwide, and the mechanisms that lead to the disease are still relatively unclear. However, with the development of high-throughput technologies it is possible to gain a systematic view of biological systems to enhance the understanding of the roles of genes associated with HCC. Thus, analysis of the mechanism of molecule interactions in the context of gene regulatory networks can reveal specific sub-networks that lead to the development of HCC. In this study, we aimed to identify the most important gene regulations that are dysfunctional in HCC generation. Our method for constructing gene regulatory network is based on predicted target interactions, experimentally-supported interactions, and co-expression model. Regulators in the network included both transcription factors and microRNAs to provide a complete view of gene regulation. Analysis of gene regulatory network revealed that gene regulation in HCC is highly modular, in which different sets of regulators take charge of specific biological processes. We found that microRNAs mainly control biological functions related to mitochondria and oxidative reduction, while transcription factors control immune responses, extracellular activity and the cell cycle. On the higher level of gene regulation, there exists a core network that organizes regulations between different modules and maintains the robustness of the whole network. There is direct experimental evidence for most of the regulators in the core gene regulatory network relating to HCC. We infer it is the central controller of gene regulation. Finally, we explored the influence of the core gene regulatory network on biological pathways. Our analysis provides insights into the mechanism of transcriptional and post-transcriptional control in HCC. In particular, we highlight the importance of the core gene regulatory network; we propose that it is highly related to HCC and we believe further experimental validation is worthwhile.

  2. Pan- and core- network analysis of co-expression genes in a model plant

    DOE PAGES

    He, Fei; Maslov, Sergei

    2016-12-16

    Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less

  3. Pan- and core- network analysis of co-expression genes in a model plant

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    He, Fei; Maslov, Sergei

    Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less

  4. Core Promoter Functions in the Regulation of Gene Expression of Drosophila Dorsal Target Genes*

    PubMed Central

    Zehavi, Yonathan; Kuznetsov, Olga; Ovadia-Shochat, Avital; Juven-Gershon, Tamar

    2014-01-01

    Developmental processes are highly dependent on transcriptional regulation by RNA polymerase II. The RNA polymerase II core promoter is the ultimate target of a multitude of transcription factors that control transcription initiation. Core promoters consist of core promoter motifs, e.g. the initiator, TATA box, and the downstream core promoter element (DPE), which confer specific properties to the core promoter. Here, we explored the importance of core promoter functions in the dorsal-ventral developmental gene regulatory network. This network includes multiple genes that are activated by different nuclear concentrations of Dorsal, an NFκB homolog transcription factor, along the dorsal-ventral axis. We show that over two-thirds of Dorsal target genes contain DPE sequence motifs, which is significantly higher than the proportion of DPE-containing promoters in Drosophila genes. We demonstrate that multiple Dorsal target genes are evolutionarily conserved and functionally dependent on the DPE. Furthermore, we have analyzed the activation of key Dorsal target genes by Dorsal, as well as by another Rel family transcription factor, Relish, and the dependence of their activation on the DPE motif. Using hybrid enhancer-promoter constructs in Drosophila cells and embryo extracts, we have demonstrated that the core promoter composition is an important determinant of transcriptional activity of Dorsal target genes. Taken together, our results provide evidence for the importance of core promoter composition in the regulation of Dorsal target genes. PMID:24634215

  5. s-core network decomposition: A generalization of k-core analysis to weighted networks

    NASA Astrophysics Data System (ADS)

    Eidsaa, Marius; Almaas, Eivind

    2013-12-01

    A broad range of systems spanning biology, technology, and social phenomena may be represented and analyzed as complex networks. Recent studies of such networks using k-core decomposition have uncovered groups of nodes that play important roles. Here, we present s-core analysis, a generalization of k-core (or k-shell) analysis to complex networks where the links have different strengths or weights. We demonstrate the s-core decomposition approach on two random networks (ER and configuration model with scale-free degree distribution) where the link weights are (i) random, (ii) correlated, and (iii) anticorrelated with the node degrees. Finally, we apply the s-core decomposition approach to the protein-interaction network of the yeast Saccharomyces cerevisiae in the context of two gene-expression experiments: oxidative stress in response to cumene hydroperoxide (CHP), and fermentation stress response (FSR). We find that the innermost s-cores are (i) different from innermost k-cores, (ii) different for the two stress conditions CHP and FSR, and (iii) enriched with proteins whose biological functions give insight into how yeast manages these specific stresses.

  6. Identifying core gene modules in glioblastoma based on multilayer factor-mediated dysfunctional regulatory networks through integrating multi-dimensional genomic data

    PubMed Central

    Ping, Yanyan; Deng, Yulan; Wang, Li; Zhang, Hongyi; Zhang, Yong; Xu, Chaohan; Zhao, Hongying; Fan, Huihui; Yu, Fulong; Xiao, Yun; Li, Xia

    2015-01-01

    The driver genetic aberrations collectively regulate core cellular processes underlying cancer development. However, identifying the modules of driver genetic alterations and characterizing their functional mechanisms are still major challenges for cancer studies. Here, we developed an integrative multi-omics method CMDD to identify the driver modules and their affecting dysregulated genes through characterizing genetic alteration-induced dysregulated networks. Applied to glioblastoma (GBM), the CMDD identified a core gene module of 17 genes, including seven known GBM drivers, and their dysregulated genes. The module showed significant association with shorter survival of GBM. When classifying driver genes in the module into two gene sets according to their genetic alteration patterns, we found that one gene set directly participated in the glioma pathway, while the other indirectly regulated the glioma pathway, mostly, via their dysregulated genes. Both of the two gene sets were significant contributors to survival and helpful for classifying GBM subtypes, suggesting their critical roles in GBM pathogenesis. Also, by applying the CMDD to other six cancers, we identified some novel core modules associated with overall survival of patients. Together, these results demonstrate integrative multi-omics data can identify driver modules and uncover their dysregulated genes, which is useful for interpreting cancer genome. PMID:25653168

  7. Topological and organizational properties of the products of house-keeping and tissue-specific genes in protein-protein interaction networks.

    PubMed

    Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing

    2009-03-11

    Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene-encoded proteins are attached to the core at more peripheral positions of the networks.

  8. Investigating the specific core genetic-and-epigenetic networks of cellular mechanisms involved in human aging in peripheral blood mononuclear cells

    PubMed Central

    Li, Cheng-Wei; Wang, Wen-Hsin; Chen, Bor-Sen

    2016-01-01

    Aging is an inevitable part of life for humans, and slowing down the aging process has become a main focus of human endeavor. Here, we applied a systems biology approach to construct protein-protein interaction networks, gene regulatory networks, and epigenetic networks, i.e. genetic and epigenetic networks (GENs), of elderly individuals and young controls. We then compared these GENs to extract aging mechanisms using microarray data in peripheral blood mononuclear cells, microRNA (miRNA) data, and database mining. The core GENs of elderly individuals and young controls were obtained by applying principal network projection to GENs based on Principal Component Analysis. By comparing the core networks, we identified that to overcome the accumulated mutation of genes in the aging process the transcription factor JUN can be activated by stress signals, including the MAPK signaling, T-cell receptor signaling, and neurotrophin signaling pathways through DNA methylation of BTG3, G0S2, and AP2B1 and the regulations of mir-223 let-7d, and mir-130a. We also address the aging mechanisms in old men and women. Furthermore, we proposed that drugs designed to target these DNA methylated genes or miRNAs may delay aging. A multiple drug combination comprising phenylalanine, cholesterol, and palbociclib was finally designed for delaying the aging process. PMID:26895224

  9. Evolutionary Dynamics of Small RNAs in 27 Escherichia coli and Shigella Genomes

    PubMed Central

    Skippington, Elizabeth; Ragan, Mark A.

    2012-01-01

    Small RNAs (sRNAs) are widespread in bacteria and play critical roles in regulating physiological processes. They are best characterized in Escherichia coli K-12 MG1655, where 83 sRNAs constitute nearly 2% of the gene complement. Most sRNAs act by base pairing with a target mRNA, modulating its translation and/or stability; many of these RNAs share only limited complementarity to their mRNA target, and require the chaperone Hfq to facilitate base pairing. Little is known about the evolutionary dynamics of bacterial sRNAs. Here, we apply phylogenetic and network analyses to investigate the evolutionary processes and principles that govern sRNA gene distribution in 27 E. coli and Shigella genomes. We identify core (encoded in all 27 genomes) and variable sRNAs; more than two-thirds of the E. coli K-12 MG1655 sRNAs are core, whereas the others show patterns of presence and absence that are principally due to genetic loss, not duplication or lateral genetic transfer. We present evidence that variable sRNAs are less tightly integrated into cellular genetic regulatory networks than are the core sRNAs, and that Hfq facilitates posttranscriptional cross talk between the E. coli–Shigella core and variable genomes. Finally, we present evidence that more than 80% of genes targeted by Hfq-associated core sRNAs have been transferred within the E. coli–Shigella clade, and that most of these genes have been transferred intact. These results suggest that Hfq and sRNAs help integrate laterally acquired genes into established regulatory networks. PMID:22223756

  10. Planting increases the abundance and structure complexity of soil core functional genes relevant to carbon and nitrogen cycling

    PubMed Central

    Wang, Feng; Liang, Yuting; Jiang, Yuji; Yang, Yunfeng; Xue, Kai; Xiong, Jinbo; Zhou, Jizhong; Sun, Bo

    2015-01-01

    Plants have an important impact on soil microbial communities and their functions. However, how plants determine the microbial composition and network interactions is still poorly understood. During a four-year field experiment, we investigated the functional gene composition of three types of soils (Phaeozem, Cambisols and Acrisol) under maize planting and bare fallow regimes located in cold temperate, warm temperate and subtropical regions, respectively. The core genes were identified using high-throughput functional gene microarray (GeoChip 3.0), and functional molecular ecological networks (fMENs) were subsequently developed with the random matrix theory (RMT)-based conceptual framework. Our results demonstrated that planting significantly (P < 0.05) increased the gene alpha-diversity in terms of richness and Shannon – Simpson’s indexes for all three types of soils and 83.5% of microbial alpha-diversity can be explained by the plant factor. Moreover, planting had significant impacts on the microbial community structure and the network interactions of the microbial communities. The calculated network complexity was higher under maize planting than under bare fallow regimes. The increase of the functional genes led to an increase in both soil respiration and nitrification potential with maize planting, indicating that changes in the soil microbial communities and network interactions influenced ecological functioning. PMID:26396042

  11. FastGCN: A GPU Accelerated Tool for Fast Gene Co-Expression Networks

    PubMed Central

    Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun

    2015-01-01

    Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out. PMID:25602758

  12. FastGCN: a GPU accelerated tool for fast gene co-expression networks.

    PubMed

    Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun

    2015-01-01

    Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.

  13. Past climate change on Sky Islands drives novelty in a core developmental gene network and its phenotype.

    PubMed

    Favé, Marie-Julie; Johnson, Robert A; Cover, Stefan; Handschuh, Stephan; Metscher, Brian D; Müller, Gerd B; Gopalan, Shyamalika; Abouheif, Ehab

    2015-09-04

    A fundamental and enduring problem in evolutionary biology is to understand how populations differentiate in the wild, yet little is known about what role organismal development plays in this process. Organismal development integrates environmental inputs with the action of gene regulatory networks to generate the phenotype. Core developmental gene networks have been highly conserved for millions of years across all animals, and therefore, organismal development may bias variation available for selection to work on. Biased variation may facilitate repeatable phenotypic responses when exposed to similar environmental inputs and ecological changes. To gain a more complete understanding of population differentiation in the wild, we integrated evolutionary developmental biology with population genetics, morphology, paleoecology and ecology. This integration was made possible by studying how populations of the ant species Monomorium emersoni respond to climatic and ecological changes across five 'Sky Islands' in Arizona, which are mountain ranges separated by vast 'seas' of desert. Sky Islands represent a replicated natural experiment allowing us to determine how repeatable is the response of M. emersoni populations to climate and ecological changes at the phenotypic, developmental, and gene network levels. We show that a core developmental gene network and its phenotype has kept pace with ecological and climate change on each Sky Island over the last ~90,000 years before present (BP). This response has produced two types of evolutionary change within an ant species: one type is unpredictable and contingent on the pattern of isolation of Sky lsland populations by climate warming, resulting in slight changes in gene expression, organ growth, and morphology. The other type is predictable and deterministic, resulting in the repeated evolution of a novel wingless queen phenotype and its underlying gene network in response to habitat changes induced by climate warming. Our findings reveal dynamics of developmental gene network evolution in wild populations. This holds important implications: (1) for understanding how phenotypic novelty is generated in the wild; (2) for providing a possible bridge between micro- and macroevolution; and (3) for understanding how development mediates the response of organisms to past, and potentially, future climate change.

  14. The computational core and fixed point organization in Boolean networks

    NASA Astrophysics Data System (ADS)

    Correale, L.; Leone, M.; Pagnani, A.; Weigt, M.; Zecchina, R.

    2006-03-01

    In this paper, we analyse large random Boolean networks in terms of a constraint satisfaction problem. We first develop an algorithmic scheme which allows us to prune simple logical cascades and underdetermined variables, returning thereby the computational core of the network. Second, we apply the cavity method to analyse the number and organization of fixed points. We find in particular a phase transition between an easy and a complex regulatory phase, the latter being characterized by the existence of an exponential number of macroscopically separated fixed point clusters. The different techniques developed are reinterpreted as algorithms for the analysis of single Boolean networks, and they are applied in the analysis of and in silico experiments on the gene regulatory networks of baker's yeast (Saccharomyces cerevisiae) and the segment-polarity genes of the fruitfly Drosophila melanogaster.

  15. A global interaction network maps a wiring diagram of cellular function

    PubMed Central

    Costanzo, Michael; VanderSluis, Benjamin; Koch, Elizabeth N.; Baryshnikova, Anastasia; Pons, Carles; Tan, Guihong; Wang, Wen; Usaj, Matej; Hanchard, Julia; Lee, Susan D.; Pelechano, Vicent; Styles, Erin B.; Billmann, Maximilian; van Leeuwen, Jolanda; van Dyk, Nydia; Lin, Zhen-Yuan; Kuzmin, Elena; Nelson, Justin; Piotrowski, Jeff S.; Srikumar, Tharan; Bahr, Sondra; Chen, Yiqun; Deshpande, Raamesh; Kurat, Christoph F.; Li, Sheena C.; Li, Zhijian; Usaj, Mojca Mattiazzi; Okada, Hiroki; Pascoe, Natasha; Luis, Bryan-Joseph San; Sharifpoor, Sara; Shuteriqi, Emira; Simpkins, Scott W.; Snider, Jamie; Suresh, Harsha Garadi; Tan, Yizhao; Zhu, Hongwei; Malod-Dognin, Noel; Janjic, Vuk; Przulj, Natasa; Troyanskaya, Olga G.; Stagljar, Igor; Xia, Tian; Ohya, Yoshikazu; Gingras, Anne-Claude; Raught, Brian; Boutros, Michael; Steinmetz, Lars M.; Moore, Claire L.; Rosebrock, Adam P.; Caudy, Amy A.; Myers, Chad L.; Andrews, Brenda; Boone, Charles

    2017-01-01

    We generated a global genetic interaction network for Saccharomyces cerevisiae, constructing over 23 million double mutants, identifying ~550,000 negative and ~350,000 positive genetic interactions. This comprehensive network maps genetic interactions for essential gene pairs, highlighting essential genes as densely connected hubs. Genetic interaction profiles enabled assembly of a hierarchical model of cell function, including modules corresponding to protein complexes and pathways, biological processes, and cellular compartments. Negative interactions connected functionally related genes, mapped core bioprocesses, and identified pleiotropic genes, whereas positive interactions often mapped general regulatory connections among gene pairs, rather than shared functionality. The global network illustrates how coherent sets of genetic interactions connect protein complex and pathway modules to map a functional wiring diagram of the cell. PMID:27708008

  16. Meta-review of protein network regulating obesity between validated obesity candidate genes in the white adipose tissue of high-fat diet-induced obese C57BL/6J mice.

    PubMed

    Kim, Eunjung; Kim, Eun Jung; Seo, Seung-Won; Hur, Cheol-Goo; McGregor, Robin A; Choi, Myung-Sook

    2014-01-01

    Worldwide obesity and related comorbidities are increasing, but identifying new therapeutic targets remains a challenge. A plethora of microarray studies in diet-induced obesity models has provided large datasets of obesity associated genes. In this review, we describe an approach to examine the underlying molecular network regulating obesity, and we discuss interactions between obesity candidate genes. We conducted network analysis on functional protein-protein interactions associated with 25 obesity candidate genes identified in a literature-driven approach based on published microarray studies of diet-induced obesity. The obesity candidate genes were closely associated with lipid metabolism and inflammation. Peroxisome proliferator activated receptor gamma (Pparg) appeared to be a core obesity gene, and obesity candidate genes were highly interconnected, suggesting a coordinately regulated molecular network in adipose tissue. In conclusion, the current network analysis approach may help elucidate the underlying molecular network regulating obesity and identify anti-obesity targets for therapeutic intervention.

  17. Extracting Fitness Relationships and Oncogenic Patterns among Driver Genes in Cancer.

    PubMed

    Zhang, Xindong; Gao, Lin; Jia, Songwei

    2017-12-25

    Driver mutation provides fitness advantage to cancer cells, the accumulation of which increases the fitness of cancer cells and accelerates cancer progression. This work seeks to extract patterns accumulated by driver genes ("fitness relationships") in tumorigenesis. We introduce a network-based method for extracting the fitness relationships of driver genes by modeling the network properties of the "fitness" of cancer cells. Colon adenocarcinoma (COAD) and skin cutaneous malignant melanoma (SKCM) are employed as case studies. Consistent results derived from different background networks suggest the reliability of the identified fitness relationships. Additionally co-occurrence analysis and pathway analysis reveal the functional significance of the fitness relationships with signaling transduction. In addition, a subset of driver genes called the "fitness core" is recognized for each case. Further analyses indicate the functional importance of the fitness core in carcinogenesis, and provide potential therapeutic opportunities in medicinal intervention. Fitness relationships characterize the functional continuity among driver genes in carcinogenesis, and suggest new insights in understanding the oncogenic mechanisms of cancers, as well as providing guiding information for medicinal intervention.

  18. Identifying Dynamic Protein Complexes Based on Gene Expression Profiles and PPI Networks

    PubMed Central

    Li, Min; Chen, Weijie; Wang, Jianxin; Pan, Yi

    2014-01-01

    Identification of protein complexes from protein-protein interaction networks has become a key problem for understanding cellular life in postgenomic era. Many computational methods have been proposed for identifying protein complexes. Up to now, the existing computational methods are mostly applied on static PPI networks. However, proteins and their interactions are dynamic in reality. Identifying dynamic protein complexes is more meaningful and challenging. In this paper, a novel algorithm, named DPC, is proposed to identify dynamic protein complexes by integrating PPI data and gene expression profiles. According to Core-Attachment assumption, these proteins which are always active in the molecular cycle are regarded as core proteins. The protein-complex cores are identified from these always active proteins by detecting dense subgraphs. Final protein complexes are extended from the protein-complex cores by adding attachments based on a topological character of “closeness” and dynamic meaning. The protein complexes produced by our algorithm DPC contain two parts: static core expressed in all the molecular cycle and dynamic attachments short-lived. The proposed algorithm DPC was applied on the data of Saccharomyces cerevisiae and the experimental results show that DPC outperforms CMC, MCL, SPICi, HC-PIN, COACH, and Core-Attachment based on the validation of matching with known complexes and hF-measures. PMID:24963481

  19. Cytoscape: a software environment for integrated models of biomolecular interaction networks.

    PubMed

    Shannon, Paul; Markiel, Andrew; Ozier, Owen; Baliga, Nitin S; Wang, Jonathan T; Ramage, Daniel; Amin, Nada; Schwikowski, Benno; Ideker, Trey

    2003-11-01

    Cytoscape is an open source software project for integrating biomolecular interaction networks with high-throughput expression data and other molecular states into a unified conceptual framework. Although applicable to any system of molecular components and interactions, Cytoscape is most powerful when used in conjunction with large databases of protein-protein, protein-DNA, and genetic interactions that are increasingly available for humans and model organisms. Cytoscape's software Core provides basic functionality to layout and query the network; to visually integrate the network with expression profiles, phenotypes, and other molecular states; and to link the network to databases of functional annotations. The Core is extensible through a straightforward plug-in architecture, allowing rapid development of additional computational analyses and features. Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.

  20. Generation of oscillating gene regulatory network motifs

    NASA Astrophysics Data System (ADS)

    van Dorp, M.; Lannoo, B.; Carlon, E.

    2013-07-01

    Using an improved version of an evolutionary algorithm originally proposed by François and Hakim [Proc. Natl. Acad. Sci. USAPNASA60027-842410.1073/pnas.0304532101 101, 580 (2004)], we generated small gene regulatory networks in which the concentration of a target protein oscillates in time. These networks may serve as candidates for oscillatory modules to be found in larger regulatory networks and protein interaction networks. The algorithm was run for 105 times to produce a large set of oscillating modules, which were systematically classified and analyzed. The robustness of the oscillations against variations of the kinetic rates was also determined, to filter out the least robust cases. Furthermore, we show that the set of evolved networks can serve as a database of models whose behavior can be compared to experimentally observed oscillations. The algorithm found three smallest (core) oscillators in which nonlinearities and number of components are minimal. Two of those are two-gene modules: the mixed feedback loop, already discussed in the literature, and an autorepressed gene coupled with a heterodimer. The third one is a single gene module which is competitively regulated by a monomer and a dimer. The evolutionary algorithm also generated larger oscillating networks, which are in part extensions of the three core modules and in part genuinely new modules. The latter includes oscillators which do not rely on feedback induced by transcription factors, but are purely of post-transcriptional type. Analysis of post-transcriptional mechanisms of oscillation may provide useful information for circadian clock research, as recent experiments showed that circadian rhythms are maintained even in the absence of transcription.

  1. Evolutionary trends and functional anatomy of the human expanded autophagy network

    PubMed Central

    Till, Andreas; Saito, Rintaro; Merkurjev, Daria; Liu, Jing-Jing; Syed, Gulam Hussain; Kolnik, Martin; Siddiqui, Aleem; Glas, Martin; Scheffler, Björn; Ideker, Trey; Subramani, Suresh

    2015-01-01

    All eukaryotic cells utilize autophagy for protein and organelle turnover, thus assuring subcellular quality control, homeostasis, and survival. In order to address recent advances in identification of human autophagy associated genes, and to describe autophagy on a system-wide level, we established an autophagy-centered gene interaction network by merging various primary data sets and by retrieving respective interaction data. The resulting network (‘AXAN’) was analyzed with respect to subnetworks, e.g. the prime gene subnetwork (including the core machinery, signaling pathways and autophagy receptors) and the transcription subnetwork. To describe aspects of evolution within this network, we assessed the presence of protein orthologs across 99 eukaryotic model organisms. We visualized evolutionary trends for prime gene categories and evolutionary tracks for selected AXAN genes. This analysis confirms the eukaryotic origin of autophagy core genes while it points to a diverse evolutionary history of autophagy receptors. Next, we used module identification to describe the functional anatomy of the network at the level of pathway modules. In addition to obvious pathways (e.g., lysosomal degradation, insulin signaling) our data unveil the existence of context-related modules such as Rho GTPase signaling. Last, we used a tripartite, image-based RNAi – screen to test candidate genes predicted to play a role in regulation of autophagy. We verified the Rho GTPase, CDC42, as a novel regulator of autophagy-related signaling. This study emphasizes the applicability of system-wide approaches to gain novel insights into a complex biological process and to describe the human autophagy pathway at a hitherto unprecedented level of detail. PMID:26103419

  2. Identifying Functional Mechanisms of Gene and Protein Regulatory Networks in Response to a Broader Range of Environmental Stresses

    PubMed Central

    Li, Cheng-Wei; Chen, Bor-Sen

    2010-01-01

    Cellular responses to sudden environmental stresses or physiological changes provide living organisms with the opportunity for final survival and further development. Therefore, it is an important topic to understand protective mechanisms against environmental stresses from the viewpoint of gene and protein networks. We propose two coupled nonlinear stochastic dynamic models to reconstruct stress-activated gene and protein regulatory networks via microarray data in response to environmental stresses. According to the reconstructed gene/protein networks, some possible mutual interactions, feedforward and feedback loops are found for accelerating response and filtering noises in these signaling pathways. A bow-tie core network is also identified to coordinate mutual interactions and feedforward loops, feedback inhibitions, feedback activations, and cross talks to cope efficiently with a broader range of environmental stresses with limited proteins and pathways. PMID:20454442

  3. Transcriptional responses in thyroid tissues from rats treated with a tumorigenic and a non-tumorigenic triazole conazole fungicide.

    PubMed

    Hester, Susan D; Nesnow, Stephen

    2008-03-15

    Conazoles are azole-containing fungicides that are used in agriculture and medicine. Conazoles can induce follicular cell adenomas of the thyroid in rats after chronic bioassay. The goal of this study was to identify pathways and networks of genes that were associated with thyroid tumorigenesis through transcriptional analyses. To this end, we compared transcriptional profiles from tissues of rats treated with a tumorigenic and a non-tumorigenic conazole. Triadimefon, a rat thyroid tumorigen, and myclobutanil, which was not tumorigenic in rats after a 2-year bioassay, were administered in the feed to male Wistar/Han rats for 30 or 90 days similar to the treatment conditions previously used in their chronic bioassays. Thyroid gene expression was determined using high density Affymetrix GeneChips (Rat 230_2). Gene expression was analyzed by the Gene Set Expression Analyses method which clearly separated the tumorigenic treatments (tumorigenic response group (TRG)) from the non-tumorigenic treatments (non-tumorigenic response group (NRG)). Core genes from these gene sets were mapped to canonical, metabolic, and GeneGo processes and these processes compared across group and treatment time. Extensive analyses were performed on the 30-day gene sets as they represented the major perturbations. Gene sets in the 30-day TRG group had over representation of fatty acid metabolism, oxidation, and degradation processes (including PPARgamma and CYP involvement), and of cell proliferation responses. Core genes from these gene sets were combined into networks and found to possess signaling interactions. In addition, the core genes in each gene set were compared with genes known to be associated with human thyroid cancer. Among the genes that appeared in both rat and human data sets were: Acaca, Asns, Cebpg, Crem, Ddit3, Gja1, Grn, Jun, Junb, and Vegf. These genes were major contributors in the previously developed network from triadimefon-treated rat thyroids. It is postulated that triadimefon induces oxidative response genes and activates the nuclear receptor, Ppargamma, initiating transcription of gene products and signaling to a series of genes involved in cell proliferation.

  4. Pluripotency gene network dynamics: System views from parametric analysis.

    PubMed

    Akberdin, Ilya R; Omelyanchuk, Nadezda A; Fadeev, Stanislav I; Leskova, Natalya E; Oschepkova, Evgeniya A; Kazantsev, Fedor V; Matushkin, Yury G; Afonnikov, Dmitry A; Kolchanov, Nikolay A

    2018-01-01

    Multiple experimental data demonstrated that the core gene network orchestrating self-renewal and differentiation of mouse embryonic stem cells involves activity of Oct4, Sox2 and Nanog genes by means of a number of positive feedback loops among them. However, recent studies indicated that the architecture of the core gene network should also incorporate negative Nanog autoregulation and might not include positive feedbacks from Nanog to Oct4 and Sox2. Thorough parametric analysis of the mathematical model based on this revisited core regulatory circuit identified that there are substantial changes in model dynamics occurred depending on the strength of Oct4 and Sox2 activation and molecular complexity of Nanog autorepression. The analysis showed the existence of four dynamical domains with different numbers of stable and unstable steady states. We hypothesize that these domains can constitute the checkpoints in a developmental progression from naïve to primed pluripotency and vice versa. During this transition, parametric conditions exist, which generate an oscillatory behavior of the system explaining heterogeneity in expression of pluripotent and differentiation factors in serum ESC cultures. Eventually, simulations showed that addition of positive feedbacks from Nanog to Oct4 and Sox2 leads mainly to increase of the parametric space for the naïve ESC state, in which pluripotency factors are strongly expressed while differentiation ones are repressed.

  5. Challenges of the information age: the impact of false discovery on pathway identification.

    PubMed

    Rog, Colin J; Chekuri, Srinivasa C; Edgerton, Mary E

    2012-11-21

    Pathways with members that have known relevance to a disease are used to support hypotheses generated from analyses of gene expression and proteomic studies. Using cancer as an example, the pitfalls of searching pathways databases as support for genes and proteins that could represent false discoveries are explored. The frequency with which networks could be generated from 100 instances each of randomly selected five and ten genes sets as input to MetaCore, a commercial pathways database, was measured. A PubMed search enumerated cancer-related literature published for any gene in the networks. Using three, two, and one maximum intervening step between input genes to populate the network, networks were generated with frequencies of 97%, 77%, and 7% using ten gene sets and 73%, 27%, and 1% using five gene sets. PubMed reported an average of 4225 cancer-related articles per network gene. This can be attributed to the richly populated pathways databases and the interest in the molecular basis of cancer. As information sources become enriched, they are more likely to generate plausible mechanisms for false discoveries.

  6. Reverse engineering and analysis of large genome-scale gene networks

    PubMed Central

    Aluru, Maneesha; Zola, Jaroslaw; Nettleton, Dan; Aluru, Srinivas

    2013-01-01

    Reverse engineering the whole-genome networks of complex multicellular organisms continues to remain a challenge. While simpler models easily scale to large number of genes and gene expression datasets, more accurate models are compute intensive limiting their scale of applicability. To enable fast and accurate reconstruction of large networks, we developed Tool for Inferring Network of Genes (TINGe), a parallel mutual information (MI)-based program. The novel features of our approach include: (i) B-spline-based formulation for linear-time computation of MI, (ii) a novel algorithm for direct permutation testing and (iii) development of parallel algorithms to reduce run-time and facilitate construction of large networks. We assess the quality of our method by comparison with ARACNe (Algorithm for the Reconstruction of Accurate Cellular Networks) and GeneNet and demonstrate its unique capability by reverse engineering the whole-genome network of Arabidopsis thaliana from 3137 Affymetrix ATH1 GeneChips in just 9 min on a 1024-core cluster. We further report on the development of a new software Gene Network Analyzer (GeNA) for extracting context-specific subnetworks from a given set of seed genes. Using TINGe and GeNA, we performed analysis of 241 Arabidopsis AraCyc 8.0 pathways, and the results are made available through the web. PMID:23042249

  7. Parallel Mutual Information Based Construction of Genome-Scale Networks on the Intel® Xeon Phi™ Coprocessor.

    PubMed

    Misra, Sanchit; Pamnany, Kiran; Aluru, Srinivas

    2015-01-01

    Construction of whole-genome networks from large-scale gene expression data is an important problem in systems biology. While several techniques have been developed, most cannot handle network reconstruction at the whole-genome scale, and the few that can, require large clusters. In this paper, we present a solution on the Intel Xeon Phi coprocessor, taking advantage of its multi-level parallelism including many x86-based cores, multiple threads per core, and vector processing units. We also present a solution on the Intel® Xeon® processor. Our solution is based on TINGe, a fast parallel network reconstruction technique that uses mutual information and permutation testing for assessing statistical significance. We demonstrate the first ever inference of a plant whole genome regulatory network on a single chip by constructing a 15,575 gene network of the plant Arabidopsis thaliana from 3,137 microarray experiments in only 22 minutes. In addition, our optimization for parallelizing mutual information computation on the Intel Xeon Phi coprocessor holds out lessons that are applicable to other domains.

  8. A pathway-based network analysis of hypertension-related genes

    NASA Astrophysics Data System (ADS)

    Wang, Huan; Hu, Jing-Bo; Xu, Chuan-Yun; Zhang, De-Hai; Yan, Qian; Xu, Ming; Cao, Ke-Fei; Zhang, Xu-Sheng

    2016-02-01

    Complex network approach has become an effective way to describe interrelationships among large amounts of biological data, which is especially useful in finding core functions and global behavior of biological systems. Hypertension is a complex disease caused by many reasons including genetic, physiological, psychological and even social factors. In this paper, based on the information of biological pathways, we construct a network model of hypertension-related genes of the salt-sensitive rat to explore the interrelationship between genes. Statistical and topological characteristics show that the network has the small-world but not scale-free property, and exhibits a modular structure, revealing compact and complex connections among these genes. By the threshold of integrated centrality larger than 0.71, seven key hub genes are found: Jun, Rps6kb1, Cycs, Creb312, Cdk4, Actg1 and RT1-Da. These genes should play an important role in hypertension, suggesting that the treatment of hypertension should focus on the combination of drugs on multiple genes.

  9. Genome-wide network of regulatory genes for construction of a chordate embryo.

    PubMed

    Shoguchi, Eiichi; Hamaguchi, Makoto; Satoh, Nori

    2008-04-15

    Animal development is controlled by gene regulation networks that are composed of sequence-specific transcription factors (TF) and cell signaling molecules (ST). Although housekeeping genes have been reported to show clustering in the animal genomes, whether the genes comprising a given regulatory network are physically clustered on a chromosome is uncertain. We examined this question in the present study. Ascidians are the closest living relatives of vertebrates, and their tadpole-type larva represents the basic body plan of chordates. The Ciona intestinalis genome contains 390 core TF genes and 119 major ST genes. Previous gene disruption assays led to the formulation of a basic chordate embryonic blueprint, based on over 3000 genetic interactions among 79 zygotic regulatory genes. Here, we mapped the regulatory genes, including all 79 regulatory genes, on the 14 pairs of Ciona chromosomes by fluorescent in situ hybridization (FISH). Chromosomal localization of upstream and downstream regulatory genes demonstrates that the components of coherent developmental gene networks are evenly distributed over the 14 chromosomes. Thus, this study provides the first comprehensive evidence that the physical clustering of regulatory genes, or their target genes, is not relevant for the genome-wide control of gene expression during development.

  10. The Comparative Toxicogenomics Database: update 2017.

    PubMed

    Davis, Allan Peter; Grondin, Cynthia J; Johnson, Robin J; Sciaky, Daniela; King, Benjamin L; McMorran, Roy; Wiegers, Jolene; Wiegers, Thomas C; Mattingly, Carolyn J

    2017-01-04

    The Comparative Toxicogenomics Database (CTD; http://ctdbase.org/) provides information about interactions between chemicals and gene products, and their relationships to diseases. Core CTD content (chemical-gene, chemical-disease and gene-disease interactions manually curated from the literature) are integrated with each other as well as with select external datasets to generate expanded networks and predict novel associations. Today, core CTD includes more than 30.5 million toxicogenomic connections relating chemicals/drugs, genes/proteins, diseases, taxa, Gene Ontology (GO) annotations, pathways, and gene interaction modules. In this update, we report a 33% increase in our core data content since 2015, describe our new exposure module (that harmonizes exposure science information with core toxicogenomic data) and introduce a novel dataset of GO-disease inferences (that identify common molecular underpinnings for seemingly unrelated pathologies). These advancements centralize and contextualize real-world chemical exposures with molecular pathways to help scientists generate testable hypotheses in an effort to understand the etiology and mechanisms underlying environmentally influenced diseases. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Genome plasticity and systems evolution in Streptomyces

    PubMed Central

    2012-01-01

    Background Streptomycetes are filamentous soil-dwelling bacteria. They are best known as the producers of a great variety of natural products such as antibiotics, antifungals, antiparasitics, and anticancer agents and the decomposers of organic substances for carbon recycling. They are also model organisms for the studies of gene regulatory networks, morphological differentiation, and stress response. The availability of sets of genomes from closely related Streptomyces strains makes it possible to assess the mechanisms underlying genome plasticity and systems adaptation. Results We present the results of a comprehensive analysis of the genomes of five Streptomyces species with distinct phenotypes. These streptomycetes have a pan-genome comprised of 17,362 orthologous families which includes 3,096 components in the core genome, 5,066 components in the dispensable genome, and 9,200 components that are uniquely present in only one species. The core genome makes up about 33%-45% of each genome repertoire. It contains important genes for Streptomyces biology including those involved in gene regulation, secretion, secondary metabolism and morphological differentiation. Abundant duplicate genes have been identified, with 4%-11% of the whole genomes composed of lineage-specific expansions (LSEs), suggesting that frequent gene duplication or lateral gene transfer events play a role in shaping the genome diversification within this genus. Two patterns of expansion, single gene expansion and chromosome block expansion are observed, representing different scales of duplication. Conclusions Our results provide a catalog of genome components and their potential functional roles in gene regulatory networks and metabolic networks. The core genome components reveal the minimum requirement for streptomycetes to sustain a successful lifecycle in the soil environment, reflecting the effects of both genome evolution and environmental stress acting upon the expressed phenotypes. A better understanding of the LSE gene families will, on the other hand, bring a wealth of new insights into the mechanisms underlying strain-specific phenotypes, such as the production of novel antibiotics, pathogenesis, and adaptive response to environmental challenges. PMID:22759432

  12. Evolution and Expression Patterns of TCP Genes in Asparagales

    PubMed Central

    Madrigal, Yesenia; Alzate, Juan F.; Pabón-Mora, Natalia

    2017-01-01

    CYCLOIDEA-like genes are involved in the symmetry gene network, limiting cell proliferation in the dorsal regions of bilateral flowers in core eudicots. CYC-like and closely related TCP genes (acronym for TEOSINTE BRANCHED1, CYCLOIDEA, and PROLIFERATION CELL FACTOR) have been poorly studied in Asparagales, the largest order of monocots that includes both bilateral flowers in Orchidaceae (ca. 25.000 spp) and radially symmetrical flowers in Hypoxidaceae (ca. 200 spp). With the aim of assessing TCP gene evolution in the Asparagales, we isolated TCP-like genes from publicly available databases and our own transcriptomes of Cattleya trianae (Orchidaceae) and Hypoxis decumbens (Hypoxidaceae). Our matrix contains 452 sequences representing the three major clades of TCP genes. Besides the previously identified CYC specific core eudicot duplications, our ML phylogenetic analyses recovered an early CIN-like duplication predating all angiosperms, two CIN-like Asparagales-specific duplications and a duplication prior to the diversification of Orchidoideae and Epidendroideae. In addition, we provide evidence of at least three duplications of PCF-like genes in Asparagales. While CIN-like and PCF-like genes have multiplied in Asparagales, likely enhancing the genetic network for cell proliferation, CYC-like genes remain as single, shorter copies with low expression. Homogeneous expression of CYC-like genes in the labellum as well as the lateral petals suggests little contribution to the bilateral perianth in C. trianae. CIN-like and PCF-like gene expression suggests conserved roles in cell proliferation in leaves, sepals and petals, carpels, ovules and fruits in Asparagales by comparison with previously reported functions in core eudicots and monocots. This is the first large scale analysis of TCP-like genes in Asparagales that will serve as a platform for in-depth functional studies in emerging model monocots. PMID:28144250

  13. A network-based, integrative study to identify core biological pathways that drive breast cancer clinical subtypes

    PubMed Central

    Dutta, B; Pusztai, L; Qi, Y; André, F; Lazar, V; Bianchini, G; Ueno, N; Agarwal, R; Wang, B; Shiang, C Y; Hortobagyi, G N; Mills, G B; Symmans, W F; Balázsi, G

    2012-01-01

    Background: The rapid collection of diverse genome-scale data raises the urgent need to integrate and utilise these resources for biological discovery or biomedical applications. For example, diverse transcriptomic and gene copy number variation data are currently collected for various cancers, but relatively few current methods are capable to utilise the emerging information. Methods: We developed and tested a data-integration method to identify gene networks that drive the biology of breast cancer clinical subtypes. The method simultaneously overlays gene expression and gene copy number data on protein–protein interaction, transcriptional-regulatory and signalling networks by identifying coincident genomic and transcriptional disturbances in local network neighborhoods. Results: We identified distinct driver-networks for each of the three common clinical breast cancer subtypes: oestrogen receptor (ER)+, human epidermal growth factor receptor 2 (HER2)+, and triple receptor-negative breast cancers (TNBC) from patient and cell line data sets. Driver-networks inferred from independent datasets were significantly reproducible. We also confirmed the functional relevance of a subset of randomly selected driver-network members for TNBC in gene knockdown experiments in vitro. We found that TNBC driver-network members genes have increased functional specificity to TNBC cell lines and higher functional sensitivity compared with genes selected by differential expression alone. Conclusion: Clinical subtype-specific driver-networks identified through data integration are reproducible and functionally important. PMID:22343619

  14. RedeR: R/Bioconductor package for representing modular structures, nested networks and multiple levels of hierarchical associations

    PubMed Central

    2012-01-01

    Visualization and analysis of molecular networks are both central to systems biology. However, there still exists a large technological gap between them, especially when assessing multiple network levels or hierarchies. Here we present RedeR, an R/Bioconductor package combined with a Java core engine for representing modular networks. The functionality of RedeR is demonstrated in two different scenarios: hierarchical and modular organization in gene co-expression networks and nested structures in time-course gene expression subnetworks. Our results demonstrate RedeR as a new framework to deal with the multiple network levels that are inherent to complex biological systems. RedeR is available from http://bioconductor.org/packages/release/bioc/html/RedeR.html. PMID:22531049

  15. An analysis of the gene interaction networks identifying the role of PARP1 in metastasis of non-small cell lung cancer.

    PubMed

    Chen, Kai; Li, Yajie; Xu, Hui; Zhang, Chunfeng; Li, Zhiqiang; Wang, Wei; Wang, Baofeng

    2017-10-20

    Though there were many researches about the effects of cancer cells on non-small cell lung cancer (NSCLC) currently, it has been rarely reported completed oncogene and its mechanism in tumors by far. Here, we used biological methods with known oncogene of NSCLC to find new oncogene and explore its functionary mechanism in NSCLC. The study firstly built NSCLC genetic interaction network based on bioinformatics methods and then combined shortest path algorithm with significance test to confirmed core genes that were closely involved with given genes; real-time qPCR was conducted to detect expression levels between patients with NSCLC and normal people; additionally, detection of PARP1's role in migration and invasion was performed by trans-well assays and wound-healing. Through gene interaction network, it was found that, core genes like PARP1, EGFR and ALK had a direct interaction. TCGA database showed that PARP1 presented strong expression in NSCLC and the expression level of metastatic NSCLC was significantly higher than that of non-metastatic NSCLC. Cell migration of NSCLC in accordance to the scratch test was suppressed by PARP1 silence but stimulated noticeably by PARP1 overexpression. According to Kaplan-meier survival curve, the higher PARP1 expression, the poorer patient survival rate and prognosis. Thus, PARP1 expression had a negative correction with patient survival rate and prognosis. New oncogene PARP1 was found from known NSCLC oncogene in terms of gene interaction network, demonstrating PARP1's impact on NSCLC cell migration.

  16. Identification of interactive gene networks: a novel approach in gene array profiling of myometrial events during guinea pig pregnancy.

    PubMed

    Mason, Clifford W; Swaan, Peter W; Weiner, Carl P

    2006-06-01

    The transition from myometrial quiescence to activation is poorly understood, and the analysis of array data is limited by the available data mining tools. We applied functional analysis and logical operations along regulatory gene networks to identify molecular processes and pathways underlying quiescence and activation. We analyzed some 18,400 transcripts and variants in guinea pig myometrium at stages corresponding to quiescence and activation, and compared them to the nonpregnant (control) counterpart using a functional mapping tool, MetaCore (GeneGo, St Joseph, MI) to identify novel gene networks composed of biological pathways during mid (MP) and late (LP) pregnancy. Genes altered during quiescence and or activation were identified following gene specific comparisons with myometrium from nonpregnant animals, and then linked to curated pathways and formulated networks. The MP and LP networks were subtracted from each other to identify unique genomic events during those periods. For example, changes 2-fold or greater in genes mediating protein biosynthesis, programmed cell death, microtubule polymerization, and microtubule based movement were noted during the transition to LP. We describe a novel approach combining microarrays and genetic data to identify networks associated with normal myometrial events. The resulting insights help identify potential biomarkers and permit future targeted investigations of these pathways or networks to confirm or refute their importance.

  17. ITEP: an integrated toolkit for exploration of microbial pan-genomes.

    PubMed

    Benedict, Matthew N; Henriksen, James R; Metcalf, William W; Whitaker, Rachel J; Price, Nathan D

    2014-01-03

    Comparative genomics is a powerful approach for studying variation in physiological traits as well as the evolution and ecology of microorganisms. Recent technological advances have enabled sequencing large numbers of related genomes in a single project, requiring computational tools for their integrated analysis. In particular, accurate annotations and identification of gene presence and absence are critical for understanding and modeling the cellular physiology of newly sequenced genomes. Although many tools are available to compare the gene contents of related genomes, new tools are necessary to enable close examination and curation of protein families from large numbers of closely related organisms, to integrate curation with the analysis of gain and loss, and to generate metabolic networks linking the annotations to observed phenotypes. We have developed ITEP, an Integrated Toolkit for Exploration of microbial Pan-genomes, to curate protein families, compute similarities to externally-defined domains, analyze gene gain and loss, and generate draft metabolic networks from one or more curated reference network reconstructions in groups of related microbial species among which the combination of core and variable genes constitute the their "pan-genomes". The ITEP toolkit consists of: (1) a series of modular command-line scripts for identification, comparison, curation, and analysis of protein families and their distribution across many genomes; (2) a set of Python libraries for programmatic access to the same data; and (3) pre-packaged scripts to perform common analysis workflows on a collection of genomes. ITEP's capabilities include de novo protein family prediction, ortholog detection, analysis of functional domains, identification of core and variable genes and gene regions, sequence alignments and tree generation, annotation curation, and the integration of cross-genome analysis and metabolic networks for study of metabolic network evolution. ITEP is a powerful, flexible toolkit for generation and curation of protein families. ITEP's modular design allows for straightforward extension as analysis methods and tools evolve. By integrating comparative genomics with the development of draft metabolic networks, ITEP harnesses the power of comparative genomics to build confidence in links between genotype and phenotype and helps disambiguate gene annotations when they are evaluated in both evolutionary and metabolic network contexts.

  18. Identifying protein complex by integrating characteristic of core-attachment into dynamic PPI network.

    PubMed

    Shen, Xianjun; Yi, Li; Jiang, Xingpeng; He, Tingting; Yang, Jincai; Xie, Wei; Hu, Po; Hu, Xiaohua

    2017-01-01

    How to identify protein complex is an important and challenging task in proteomics. It would make great contribution to our knowledge of molecular mechanism in cell life activities. However, the inherent organization and dynamic characteristic of cell system have rarely been incorporated into the existing algorithms for detecting protein complexes because of the limitation of protein-protein interaction (PPI) data produced by high throughput techniques. The availability of time course gene expression profile enables us to uncover the dynamics of molecular networks and improve the detection of protein complexes. In order to achieve this goal, this paper proposes a novel algorithm DCA (Dynamic Core-Attachment). It detects protein-complex core comprising of continually expressed and highly connected proteins in dynamic PPI network, and then the protein complex is formed by including the attachments with high adhesion into the core. The integration of core-attachment feature into the dynamic PPI network is responsible for the superiority of our algorithm. DCA has been applied on two different yeast dynamic PPI networks and the experimental results show that it performs significantly better than the state-of-the-art techniques in terms of prediction accuracy, hF-measure and statistical significance in biology. In addition, the identified complexes with strong biological significance provide potential candidate complexes for biologists to validate.

  19. Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion

    PubMed Central

    Žitnik, Marinka; Zupan, Blaž

    2015-01-01

    Abstract Epistatic miniarray profile (E-MAP) is a popular large-scale genetic interaction discovery platform. E-MAPs benefit from quantitative output, which makes it possible to detect subtle interactions with greater precision. However, due to the limits of biotechnology, E-MAP studies fail to measure genetic interactions for up to 40% of gene pairs in an assay. Missing measurements can be recovered by computational techniques for data imputation, in this way completing the interaction profiles and enabling downstream analysis algorithms that could otherwise be sensitive to missing data values. We introduce a new interaction data imputation method called network-guided matrix completion (NG-MC). The core part of NG-MC is low-rank probabilistic matrix completion that incorporates prior knowledge presented as a collection of gene networks. NG-MC assumes that interactions are transitive, such that latent gene interaction profiles inferred by NG-MC depend on the profiles of their direct neighbors in gene networks. As the NG-MC inference algorithm progresses, it propagates latent interaction profiles through each of the networks and updates gene network weights toward improved prediction. In a study with four different E-MAP data assays and considered protein–protein interaction and gene ontology similarity networks, NG-MC significantly surpassed existing alternative techniques. Inclusion of information from gene networks also allowed NG-MC to predict interactions for genes that were not included in original E-MAP assays, a task that could not be considered by current imputation approaches. PMID:25658751

  20. Gene Networks and Functional Features of Gravitropic response in Rice Shoot Bases

    NASA Astrophysics Data System (ADS)

    Hu, Liwei; Zang, Aiping; Ai, Qianru; Chen, Haiying; Li, Lin; Li, Rui; Su, Feng; Chen, Xijiang; Rong, Hui; Dou, Xianying; Reinhold-Hurek, Barbara; Li, Qi; Cai, Weiming

    To delineate key genes and the corresponding physiological functions as well as the coordina-tion of genes involved in the gravitropism of rice shoot bases, we used whole-genome microarray analysis of upper and lower parts of rice shoot bases at 0.5 h and 6 h after gravistimulation. And bio-information analysis was applied including GO-analysis, expression tendency and net-work analysis. In the lower shoot bases, auxin-mediated signaling pathway and glutathione transferase activity with the biggest enrichment were activated at 0.5 h, while cytokinin stimu-lus and photosynthesis were activated at 6 h. Meanwhile, several processes were suppressed in the lower shoot bases, including: xyloglucan:xyloglucosyl transferase activity, glucan metabolic processes, and ATPase activity at 0.5 h; and tRNA isopentenyltransferase activity, and chiti-nase activity, etc. at 6 h. Gene expression profile responding to gravistimulation suggested that the asymmetrically activation of several phytohormone signaling pathways including auxin, gib-berellin and cytokinin brassinolide ethylene and cytokinin-related genes were involved in the differentially growth between the upper and lower parts of rice shoot bases, and so do cell wall-related genes. Topological analysis of the coexpression networks revealed the core statue of AY177699.1(apetala3-like protein) and AK105103.1 at 0.5 h; AK062612.1 (ethylene response factor) and AK099932.1 (lectin-like receptor kinase 72) at 6 h. All the core factors have the function "response to endogenous stimulus". Additionally, AK108057.1(similar to germin-like protein precursor) was discovered as the most important core gene in the upper shoot bases in 6h after gravistimualtion while AK067424.1(cellulose synthase-like protein), AK120101.1 (Zinc finger, B-box domain containing protein) and CR278698 (ATPase associated with various cel-lular activities cellulose synthase-like protein) contribute equally to gravitropic response in the lower shoot bases.

  1. Engineering a Functional Small RNA Negative Autoregulation Network with Model-Guided Design.

    PubMed

    Hu, Chelsea Y; Takahashi, Melissa K; Zhang, Yan; Lucks, Julius B

    2018-05-22

    RNA regulators are powerful components of the synthetic biology toolbox. Here, we expand the repertoire of synthetic gene networks built from these regulators by constructing a transcriptional negative autoregulation (NAR) network out of small RNAs (sRNAs). NAR network motifs are core motifs of natural genetic networks, and are known for reducing network response time and steady state signal. Here we use cell-free transcription-translation (TX-TL) reactions and a computational model to design and prototype sRNA NAR constructs. Using parameter sensitivity analysis, we design a simple set of experiments that allow us to accurately predict NAR function in TX-TL. We transfer successful network designs into Escherichia coli and show that our sRNA transcriptional network reduces both network response time and steady-state gene expression. This work broadens our ability to construct increasingly sophisticated RNA genetic networks with predictable function.

  2. The core regulatory network of the abscisic acid pathway in banana: genome-wide identification and expression analyses during development, ripening, and abiotic stress.

    PubMed

    Hu, Wei; Yan, Yan; Shi, Haitao; Liu, Juhua; Miao, Hongxia; Tie, Weiwei; Ding, Zehong; Ding, XuPo; Wu, Chunlai; Liu, Yang; Wang, Jiashui; Xu, Biyu; Jin, Zhiqiang

    2017-08-29

    Abscisic acid (ABA) signaling plays a crucial role in developmental and environmental adaptation processes of plants. However, the PYL-PP2C-SnRK2 families that function as the core components of ABA signaling are not well understood in banana. In the present study, 24 PYL, 87 PP2C, and 11 SnRK2 genes were identified from banana, which was further supported by evolutionary relationships, conserved motif and gene structure analyses. The comprehensive transcriptomic analyses showed that banana PYL-PP2C-SnRK2 genes are involved in tissue development, fruit development and ripening, and response to abiotic stress in two cultivated varieties. Moreover, comparative expression analyses of PYL-PP2C-SnRK2 genes between BaXi Jiao (BX) and Fen Jiao (FJ) revealed that PYL-PP2C-SnRK2-mediated ABA signaling might positively regulate banana fruit ripening and tolerance to cold, salt, and osmotic stresses. Finally, interaction networks and co-expression assays demonstrated that the core components of ABA signaling were more active in FJ than in BX in response to abiotic stress, further supporting the crucial role of the genes in tolerance to abiotic stress in banana. This study provides new insights into the complicated transcriptional control of PYL-PP2C-SnRK2 genes, improves the understanding of PYL-PP2C-SnRK2-mediated ABA signaling in the regulation of fruit development, ripening, and response to abiotic stress, and identifies some candidate genes for genetic improvement of banana.

  3. In-Silico Integration Approach to Identify a Key miRNA Regulating a Gene Network in Aggressive Prostate Cancer

    PubMed Central

    Colaprico, Antonio; Bontempi, Gianluca; Castiglioni, Isabella

    2018-01-01

    Like other cancer diseases, prostate cancer (PC) is caused by the accumulation of genetic alterations in the cells that drives malignant growth. These alterations are revealed by gene profiling and copy number alteration (CNA) analysis. Moreover, recent evidence suggests that also microRNAs have an important role in PC development. Despite efforts to profile PC, the alterations (gene, CNA, and miRNA) and biological processes that correlate with disease development and progression remain partially elusive. Many gene signatures proposed as diagnostic or prognostic tools in cancer poorly overlap. The identification of co-expressed genes, that are functionally related, can identify a core network of genes associated with PC with a better reproducibility. By combining different approaches, including the integration of mRNA expression profiles, CNAs, and miRNA expression levels, we identified a gene signature of four genes overlapping with other published gene signatures and able to distinguish, in silico, high Gleason-scored PC from normal human tissue, which was further enriched to 19 genes by gene co-expression analysis. From the analysis of miRNAs possibly regulating this network, we found that hsa-miR-153 was highly connected to the genes in the network. Our results identify a four-gene signature with diagnostic and prognostic value in PC and suggest an interesting gene network that could play a key regulatory role in PC development and progression. Furthermore, hsa-miR-153, controlling this network, could be a potential biomarker for theranostics in high Gleason-scored PC. PMID:29562723

  4. Network pharmacology-based prediction of active compounds and molecular targets in Yijin-Tang acting on hyperlipidaemia and atherosclerosis.

    PubMed

    Lee, A Yeong; Park, Won; Kang, Tae-Wook; Cha, Min Ho; Chun, Jin Mi

    2018-07-15

    Yijin-Tang (YJT) is a traditional prescription for the treatment of hyperlipidaemia, atherosclerosis and other ailments related to dampness phlegm, a typical pathological symptom of abnormal body fluid metabolism in Traditional Korean Medicine. However, a holistic network pharmacology approach to understanding the therapeutic mechanisms underlying hyperlipidaemia and atherosclerosis has not been pursued. To examine the network pharmacological potential effects of YJT on hyperlipidaemia and atherosclerosis, we analysed components, performed target prediction and network analysis, and investigated interacting pathways using a network pharmacology approach. Information on compounds in herbal medicines was obtained from public databases, and oral bioavailability and drug-likeness was screened using absorption, distribution, metabolism, and excretion (ADME) criteria. Correlations between compounds and genes were linked using the STITCH database, and genes related to hyperlipidaemia and atherosclerosis were gathered using the GeneCards database. Human genes were identified and subjected to Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Network analysis identified 447 compounds in five herbal medicines that were subjected to ADME screening, and 21 compounds and 57 genes formed the main pathways linked to hyperlipidaemia and atherosclerosis. Among them, 10 compounds (naringenin, nobiletin, hesperidin, galangin, glycyrrhizin, homogentisic acid, stigmasterol, 6-gingerol, quercetin and glabridin) were linked to more than four genes, and are bioactive compounds and key chemicals. Core genes in this network were CASP3, CYP1A1, CYP1A2, MMP2 and MMP9. The compound-target gene network revealed close interactions between multiple components and multiple targets, and facilitates a better understanding of the potential therapeutic effects of YJT. Pharmacological network analysis can help to explain the potential effects of YJT for treating dampness phlegm-related diseases such as hyperlipidaemia and atherosclerosis. Copyright © 2018 Elsevier B.V. All rights reserved.

  5. Genomic analysis reveals a tight link between transcription factor dynamics and regulatory network architecture.

    PubMed

    Jothi, Raja; Balaji, S; Wuster, Arthur; Grochow, Joshua A; Gsponer, Jörg; Przytycka, Teresa M; Aravind, L; Babu, M Madan

    2009-01-01

    Although several studies have provided important insights into the general principles of biological networks, the link between network organization and the genome-scale dynamics of the underlying entities (genes, mRNAs, and proteins) and its role in systems behavior remain unclear. Here we show that transcription factor (TF) dynamics and regulatory network organization are tightly linked. By classifying TFs in the yeast regulatory network into three hierarchical layers (top, core, and bottom) and integrating diverse genome-scale datasets, we find that the TFs have static and dynamic properties that are similar within a layer and different across layers. At the protein level, the top-layer TFs are relatively abundant, long-lived, and noisy compared with the core- and bottom-layer TFs. Although variability in expression of top-layer TFs might confer a selective advantage, as this permits at least some members in a clonal cell population to initiate a response to changing conditions, tight regulation of the core- and bottom-layer TFs may minimize noise propagation and ensure fidelity in regulation. We propose that the interplay between network organization and TF dynamics could permit differential utilization of the same underlying network by distinct members of a clonal cell population.

  6. Complete Genome Viral Phylogenies Suggests the Concerted Evolution of Regulatory Cores and Accessory Satellites

    PubMed Central

    Zanotto, Paolo Marinho de Andrade; Krakauer, David C.

    2008-01-01

    We consider the concerted evolution of viral genomes in four families of DNA viruses. Given the high rate of horizontal gene transfer among viruses and their hosts, it is an open question as to how representative particular genes are of the evolutionary history of the complete genome. To address the concerted evolution of viral genes, we compared genomic evolution across four distinct, extant viral families. For all four viral families we constructed DNA-dependent DNA polymerase-based (DdDp) phylogenies and in addition, whole genome sequence, as quantitative descriptions of inter-genome relationships. We found that the history of the polymerase gene was highly predictive of the history of the genome as a whole, which we explain in terms of repeated, co-divergence events of the core DdDp gene accompanied by a number of satellite, accessory genetic loci. We also found that the rate of gene gain in baculovirus and poxviruses proceeds significantly more quickly than the rate of gene loss and that there is convergent acquisition of satellite functions promoting contextual adaptation when distinct viral families infect related hosts. The congruence of the genome and polymerase trees suggests that a large set of viral genes, including polymerase, derive from a phylogenetically conserved core of genes of host origin, secondarily reinforced by gene acquisition from common hosts or co-infecting viruses within the host. A single viral genome can be thought of as a mutualistic network, with the core genes acting as an effective host and the satellite genes as effective symbionts. Larger virus genomes show a greater departure from linkage equilibrium between core and satellites functions. PMID:18941535

  7. Identification of Major Signaling Pathways in Prion Disease Progression Using Network Analysis

    PubMed Central

    Newaz, Khalique; Sriram, K.; Bera, Debajyoti

    2015-01-01

    Prion diseases are transmissible neurodegenerative diseases that arise due to conformational change of normal, cellular prion protein (PrPC) to protease-resistant isofrom (rPrPSc). Deposition of misfolded PrpSc proteins leads to an alteration of many signaling pathways that includes immunological and apoptotic pathways. As a result, this culminates in the dysfunction and death of neuronal cells. Earlier works on transcriptomic studies have revealed some affected pathways, but it is not clear which is (are) the prime network pathway(s) that change during the disease progression and how these pathways are involved in crosstalks with each other from the time of incubation to clinical death. We perform network analysis on large-scale transcriptomic data of differentially expressed genes obtained from whole brain in six different mouse strain-prion strain combination models to determine the pathways involved in prion diseases, and to understand the role of crosstalks in disease propagation. We employ a notion of differential network centrality measures on protein interaction networks to identify the potential biological pathways involved. We also propose a crosstalk ranking method based on dynamic protein interaction networks to identify the core network elements involved in crosstalk with different pathways. We identify 148 DEGs (differentially expressed genes) potentially related to the prion disease progression. Functional association of the identified genes implicates a strong involvement of immunological pathways. We extract a bow-tie structure that is potentially dysregulated in prion disease. We also propose an ODE model for the bow-tie network. Predictions related to diseased condition suggests the downregulation of the core signaling elements (PI3Ks and AKTs) of the bow-tie network. In this work, we show using transcriptomic data that the neuronal dysfunction in prion disease is strongly related to the immunological pathways. We conclude that these immunological pathways occupy influential positions in the PFNs (protein functional networks) that are related to prion disease. Importantly, this functional network involvement is prevalent in all the five different mouse strain-prion strain combinations that we studied. We also conclude that the dysregulation of the core elements of the bow-tie structure, which belongs to PI3K-Akt signaling pathway, leads to dysregulation of the downstream components corresponding to other biological pathways. PMID:26646948

  8. Genetic adaptation of the antibacterial human innate immunity network.

    PubMed

    Casals, Ferran; Sikora, Martin; Laayouni, Hafid; Montanucci, Ludovica; Muntasell, Aura; Lazarus, Ross; Calafell, Francesc; Awadalla, Philip; Netea, Mihai G; Bertranpetit, Jaume

    2011-07-11

    Pathogens have represented an important selective force during the adaptation of modern human populations to changing social and other environmental conditions. The evolution of the immune system has therefore been influenced by these pressures. Genomic scans have revealed that immune system is one of the functions enriched with genes under adaptive selection. Here, we describe how the innate immune system has responded to these challenges, through the analysis of resequencing data for 132 innate immunity genes in two human populations. Results are interpreted in the context of the functional and interaction networks defined by these genes. Nucleotide diversity is lower in the adaptors and modulators functional classes, and is negatively correlated with the centrality of the proteins within the interaction network. We also produced a list of candidate genes under positive or balancing selection in each population detected by neutrality tests and showed that some functional classes are preferential targets for selection. We found evidence that the role of each gene in the network conditions the capacity to evolve or their evolvability: genes at the core of the network are more constrained, while adaptation mostly occurred at particular positions at the network edges. Interestingly, the functional classes containing most of the genes with signatures of balancing selection are involved in autoinflammatory and autoimmune diseases, suggesting a counterbalance between the beneficial and deleterious effects of the immune response.

  9. Genetic adaptation of the antibacterial human innate immunity network

    PubMed Central

    2011-01-01

    Background Pathogens have represented an important selective force during the adaptation of modern human populations to changing social and other environmental conditions. The evolution of the immune system has therefore been influenced by these pressures. Genomic scans have revealed that immune system is one of the functions enriched with genes under adaptive selection. Results Here, we describe how the innate immune system has responded to these challenges, through the analysis of resequencing data for 132 innate immunity genes in two human populations. Results are interpreted in the context of the functional and interaction networks defined by these genes. Nucleotide diversity is lower in the adaptors and modulators functional classes, and is negatively correlated with the centrality of the proteins within the interaction network. We also produced a list of candidate genes under positive or balancing selection in each population detected by neutrality tests and showed that some functional classes are preferential targets for selection. Conclusions We found evidence that the role of each gene in the network conditions the capacity to evolve or their evolvability: genes at the core of the network are more constrained, while adaptation mostly occurred at particular positions at the network edges. Interestingly, the functional classes containing most of the genes with signatures of balancing selection are involved in autoinflammatory and autoimmune diseases, suggesting a counterbalance between the beneficial and deleterious effects of the immune response. PMID:21745391

  10. Systems Genetics Analysis of GWAS reveals Novel Associations between Key Biological Processes and Coronary Artery Disease

    PubMed Central

    Ghosh, Sujoy; Vivar, Juan; Nelson, Christopher P; Willenborg, Christina; Segrè, Ayellet V; Mäkinen, Ville-Petteri; Nikpay, Majid; Erdmann, Jeannette; Blankenberg, Stefan; O'Donnell, Christopher; März, Winfried; Laaksonen, Reijo; Stewart, Alexandre FR; Epstein, Stephen E; Shah, Svati H; Granger, Christopher B; Hazen, Stanley L; Kathiresan, Sekar; Reilly, Muredach P; Yang, Xia; Quertermous, Thomas; Samani, Nilesh J; Schunkert, Heribert; Assimes, Themistocles L; McPherson, Ruth

    2016-01-01

    Objective Genome-wide association (GWA) studies have identified multiple genetic variants affecting the risk of coronary artery disease (CAD). However, individually these explain only a small fraction of the heritability of CAD and for most, the causal biological mechanisms remain unclear. We sought to obtain further insights into potential causal processes of CAD by integrating large-scale GWA data with expertly curated databases of core human pathways and functional networks. Approaches and Results Employing pathways (gene sets) from Reactome, we carried out a two-stage gene set enrichment analysis strategy. From a meta-analyzed discovery cohort of 7 CADGWAS data sets (9,889 cases/11,089 controls), nominally significant gene-sets were tested for replication in a meta-analysis of 9 additional studies (15,502 cases/55,730 controls) from the CARDIoGRAM Consortium. A total of 32 of 639 Reactome pathways tested showed convincing association with CAD (replication p<0.05). These pathways resided in 9 of 21 core biological processes represented in Reactome, and included pathways relevant to extracellular matrix integrity, innate immunity, axon guidance, and signaling by PDRF, NOTCH, and the TGF-β/SMAD receptor complex. Many of these pathways had strengths of association comparable to those observed in lipid transport pathways. Network analysis of unique genes within the replicated pathways further revealed several interconnected functional and topologically interacting modules representing novel associations (e.g. semaphorin regulated axonal guidance pathway) besides confirming known processes (lipid metabolism). The connectivity in the observed networks was statistically significant compared to random networks (p<0.001). Network centrality analysis (‘degree’ and ‘betweenness’) further identified genes (e.g. NCAM1, FYN, FURIN etc.) likely to play critical roles in the maintenance and functioning of several of the replicated pathways. Conclusions These findings provide novel insights into how genetic variation, interpreted in the context of biological processes and functional interactions among genes, may help define the genetic architecture of CAD. PMID:25977570

  11. Determination of performance characteristics of scientific applications on IBM Blue Gene/Q

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Evangelinos, C.; Walkup, R. E.; Sachdeva, V.

    The IBM Blue Gene®/Q platform presents scientists and engineers with a rich set of hardware features such as 16 cores per chip sharing a Level 2 cache, a wide SIMD (single-instruction, multiple-data) unit, a five-dimensional torus network, and hardware support for collective operations. Especially important is the feature related to cores that have four “hardware threads,” which makes it possible to hide latencies and obtain a high fraction of the peak issue rate from each core. All of these hardware resources present unique performance-tuning opportunities on Blue Gene/Q. We provide an overview of several important applications and solvers and studymore » them on Blue Gene/Q using performance counters and Message Passing Interface profiles. We also discuss how Blue Gene/Q tools help us understand the interaction of the application with the hardware and software layers and provide guidance for optimization. Furthermore, on the basis of our analysis, we discuss code improvement strategies targeting Blue Gene/Q. Information about how these algorithms map to the Blue Gene® architecture is expected to have an impact on future system design as we move to the exascale era.« less

  12. Saliva Microbiota Carry Caries-Specific Functional Gene Signatures

    PubMed Central

    Chang, Xingzhi; Yuan, Xiao; Tu, Qichao; Yuan, Tong; Deng, Ye; Hemme, Christopher L.; Van Nostrand, Joy; Cui, Xinping; He, Zhili; Chen, Zhenggang; Guo, Dawei; Yu, Jiangbo; Zhang, Yue; Zhou, Jizhong; Xu, Jian

    2014-01-01

    Human saliva microbiota is phylogenetically divergent among host individuals yet their roles in health and disease are poorly appreciated. We employed a microbial functional gene microarray, HuMiChip 1.0, to reconstruct the global functional profiles of human saliva microbiota from ten healthy and ten caries-active adults. Saliva microbiota in the pilot population featured a vast diversity of functional genes. No significant distinction in gene number or diversity indices was observed between healthy and caries-active microbiota. However, co-presence network analysis of functional genes revealed that caries-active microbiota was more divergent in non-core genes than healthy microbiota, despite both groups exhibited a similar degree of conservation at their respective core genes. Furthermore, functional gene structure of saliva microbiota could potentially distinguish caries-active patients from healthy hosts. Microbial functions such as Diaminopimelate epimerase, Prephenate dehydrogenase, Pyruvate-formate lyase and N-acetylmuramoyl-L-alanine amidase were significantly linked to caries. Therefore, saliva microbiota carried disease-associated functional signatures, which could be potentially exploited for caries diagnosis. PMID:24533043

  13. Saliva microbiota carry caries-specific functional gene signatures.

    PubMed

    Yang, Fang; Ning, Kang; Chang, Xingzhi; Yuan, Xiao; Tu, Qichao; Yuan, Tong; Deng, Ye; Hemme, Christopher L; Van Nostrand, Joy; Cui, Xinping; He, Zhili; Chen, Zhenggang; Guo, Dawei; Yu, Jiangbo; Zhang, Yue; Zhou, Jizhong; Xu, Jian

    2014-01-01

    Human saliva microbiota is phylogenetically divergent among host individuals yet their roles in health and disease are poorly appreciated. We employed a microbial functional gene microarray, HuMiChip 1.0, to reconstruct the global functional profiles of human saliva microbiota from ten healthy and ten caries-active adults. Saliva microbiota in the pilot population featured a vast diversity of functional genes. No significant distinction in gene number or diversity indices was observed between healthy and caries-active microbiota. However, co-presence network analysis of functional genes revealed that caries-active microbiota was more divergent in non-core genes than healthy microbiota, despite both groups exhibited a similar degree of conservation at their respective core genes. Furthermore, functional gene structure of saliva microbiota could potentially distinguish caries-active patients from healthy hosts. Microbial functions such as Diaminopimelate epimerase, Prephenate dehydrogenase, Pyruvate-formate lyase and N-acetylmuramoyl-L-alanine amidase were significantly linked to caries. Therefore, saliva microbiota carried disease-associated functional signatures, which could be potentially exploited for caries diagnosis.

  14. A Survey of Statistical Models for Reverse Engineering Gene Regulatory Networks

    PubMed Central

    Huang, Yufei; Tienda-Luna, Isabel M.; Wang, Yufeng

    2009-01-01

    Statistical models for reverse engineering gene regulatory networks are surveyed in this article. To provide readers with a system-level view of the modeling issues in this research, a graphical modeling framework is proposed. This framework serves as the scaffolding on which the review of different models can be systematically assembled. Based on the framework, we review many existing models for many aspects of gene regulation; the pros and cons of each model are discussed. In addition, network inference algorithms are also surveyed under the graphical modeling framework by the categories of point solutions and probabilistic solutions and the connections and differences among the algorithms are provided. This survey has the potential to elucidate the development and future of reverse engineering GRNs and bring statistical signal processing closer to the core of this research. PMID:20046885

  15. Signed weighted gene co-expression network analysis of transcriptional regulation in murine embryonic stem cells

    PubMed Central

    Mason, Mike J; Fan, Guoping; Plath, Kathrin; Zhou, Qing; Horvath, Steve

    2009-01-01

    Background Recent work has revealed that a core group of transcription factors (TFs) regulates the key characteristics of embryonic stem (ES) cells: pluripotency and self-renewal. Current efforts focus on identifying genes that play important roles in maintaining pluripotency and self-renewal in ES cells and aim to understand the interactions among these genes. To that end, we investigated the use of unsigned and signed network analysis to identify pluripotency and differentiation related genes. Results We show that signed networks provide a better systems level understanding of the regulatory mechanisms of ES cells than unsigned networks, using two independent murine ES cell expression data sets. Specifically, using signed weighted gene co-expression network analysis (WGCNA), we found a pluripotency module and a differentiation module, which are not identified in unsigned networks. We confirmed the importance of these modules by incorporating genome-wide TF binding data for key ES cell regulators. Interestingly, we find that the pluripotency module is enriched with genes related to DNA damage repair and mitochondrial function in addition to transcriptional regulation. Using a connectivity measure of module membership, we not only identify known regulators of ES cells but also show that Mrpl15, Msh6, Nrf1, Nup133, Ppif, Rbpj, Sh3gl2, and Zfp39, among other genes, have important roles in maintaining ES cell pluripotency and self-renewal. We also report highly significant relationships between module membership and epigenetic modifications (histone modifications and promoter CpG methylation status), which are known to play a role in controlling gene expression during ES cell self-renewal and differentiation. Conclusion Our systems biologic re-analysis of gene expression, transcription factor binding, epigenetic and gene ontology data provides a novel integrative view of ES cell biology. PMID:19619308

  16. Modularity and evolutionary constraints in a baculovirus gene regulatory network

    PubMed Central

    2013-01-01

    Background The structure of regulatory networks remains an open question in our understanding of complex biological systems. Interactions during complete viral life cycles present unique opportunities to understand how host-parasite network take shape and behave. The Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is a large double-stranded DNA virus, whose genome may encode for 152 open reading frames (ORFs). Here we present the analysis of the ordered cascade of the AgMNPV gene expression. Results We observed an earlier onset of the expression than previously reported for other baculoviruses, especially for genes involved in DNA replication. Most ORFs were expressed at higher levels in a more permissive host cell line. Genes with more than one copy in the genome had distinct expression profiles, which could indicate the acquisition of new functionalities. The transcription gene regulatory network (GRN) for 149 ORFs had a modular topology comprising five communities of highly interconnected nodes that separated key genes that are functionally related on different communities, possibly maximizing redundancy and GRN robustness by compartmentalization of important functions. Core conserved functions showed expression synchronicity, distinct GRN features and significantly less genetic diversity, consistent with evolutionary constraints imposed in key elements of biological systems. This reduced genetic diversity also had a positive correlation with the importance of the gene in our estimated GRN, supporting a relationship between phylogenetic data of baculovirus genes and network features inferred from expression data. We also observed that gene arrangement in overlapping transcripts was conserved among related baculoviruses, suggesting a principle of genome organization. Conclusions Albeit with a reduced number of nodes (149), the AgMNPV GRN had a topology and key characteristics similar to those observed in complex cellular organisms, which indicates that modularity may be a general feature of biological gene regulatory networks. PMID:24006890

  17. MiR-3613-3p affects cell proliferation and cell cycle in hepatocellular carcinoma

    PubMed Central

    Zhang, Donghui; Liu, Enqin; Kang, Jian; Yang, Xin; Liu, Hong

    2017-01-01

    Hepatocellular carcinoma (HCC) is one of the most common types of malignant tumors with poor sensitivity to chemotherapy drugs and poor prognosis among patients. In the present study, we downloaded the original data from the Gene Expression Omnibus and compared gene expression profiles of liver cancer cells in patients with HCC with those of colon epithelial cells of healthy controls to identify differentially expressed genes (DEGs). After filtering target microRNAs (miRNA) from core DEGs, we cultured HepG2 cells in vitro, knocked down the miRNA and core mRNAs, and analyzed the effects. We found 228 differentially expressed genes between liver cancer tissue and healthy control tissue. We also integrated the protein-proteininteraction network and module analysis to screen 13 core genes, consisting of 12 up-regulated genes and 1 down-regulated gene. Five core genes were regulated hsa-miR-3613-3p, therefor we hypothesized that hsa-miR-3613-3p was a critical miRNA. After the transfection procedure, we found that changes in hsa-miR-3613-3p were the most obvious. Therefore, we speculated that hsa-miR-3613-3p was a main target miRNA. In addition, we transfected with si (BIRC5, CDK1, NUF2, ZWINT and SPC24), to target genes that can be targeted by miR-3613-3p. Our data shows that BIRC5, NUF2, and SPC24 may be promising liver cancer biomarkers that may not only predict disease occurrence but also potential personalized treatment options. PMID:29190974

  18. BIOLOGICAL NETWORK EXPLORATION WITH CYTOSCAPE 3

    PubMed Central

    Su, Gang; Morris, John H.; Demchak, Barry; Bader, Gary D.

    2014-01-01

    Cytoscape is one of the most popular open-source software tools for the visual exploration of biomedical networks composed of protein, gene and other types of interactions. It offers researchers a versatile and interactive visualization interface for exploring complex biological interconnections supported by diverse annotation and experimental data, thereby facilitating research tasks such as predicting gene function and pathway construction. Cytoscape provides core functionality to load, visualize, search, filter and save networks, and hundreds of Apps extend this functionality to address specific research needs. The latest generation of Cytoscape (version 3.0 and later) has substantial improvements in function, user interface and performance relative to previous versions. This protocol aims to jump-start new users with specific protocols for basic Cytoscape functions, such as installing Cytoscape and Cytoscape Apps, loading data, visualizing and navigating the network, visualizing network associated data (attributes) and identifying clusters. It also highlights new features that benefit experienced users. PMID:25199793

  19. LASSIM-A network inference toolbox for genome-wide mechanistic modeling.

    PubMed

    Magnusson, Rasmus; Mariotti, Guido Pio; Köpsén, Mattias; Lövfors, William; Gawel, Danuta R; Jörnsten, Rebecka; Linde, Jörg; Nordling, Torbjörn E M; Nyman, Elin; Schulze, Sylvie; Nestor, Colm E; Zhang, Huan; Cedersund, Gunnar; Benson, Mikael; Tjärnberg, Andreas; Gustafsson, Mika

    2017-06-01

    Recent technological advancements have made time-resolved, quantitative, multi-omics data available for many model systems, which could be integrated for systems pharmacokinetic use. Here, we present large-scale simulation modeling (LASSIM), which is a novel mathematical tool for performing large-scale inference using mechanistically defined ordinary differential equations (ODE) for gene regulatory networks (GRNs). LASSIM integrates structural knowledge about regulatory interactions and non-linear equations with multiple steady state and dynamic response expression datasets. The rationale behind LASSIM is that biological GRNs can be simplified using a limited subset of core genes that are assumed to regulate all other gene transcription events in the network. The LASSIM method is implemented as a general-purpose toolbox using the PyGMO Python package to make the most of multicore computers and high performance clusters, and is available at https://gitlab.com/Gustafsson-lab/lassim. As a method, LASSIM works in two steps, where it first infers a non-linear ODE system of the pre-specified core gene expression. Second, LASSIM in parallel optimizes the parameters that model the regulation of peripheral genes by core system genes. We showed the usefulness of this method by applying LASSIM to infer a large-scale non-linear model of naïve Th2 cell differentiation, made possible by integrating Th2 specific bindings, time-series together with six public and six novel siRNA-mediated knock-down experiments. ChIP-seq showed significant overlap for all tested transcription factors. Next, we performed novel time-series measurements of total T-cells during differentiation towards Th2 and verified that our LASSIM model could monitor those data significantly better than comparable models that used the same Th2 bindings. In summary, the LASSIM toolbox opens the door to a new type of model-based data analysis that combines the strengths of reliable mechanistic models with truly systems-level data. We demonstrate the power of this approach by inferring a mechanistically motivated, genome-wide model of the Th2 transcription regulatory system, which plays an important role in several immune related diseases.

  20. A Method for Predicting Protein Complexes from Dynamic Weighted Protein-Protein Interaction Networks.

    PubMed

    Liu, Lizhen; Sun, Xiaowu; Song, Wei; Du, Chao

    2018-06-01

    Predicting protein complexes from protein-protein interaction (PPI) network is of great significance to recognize the structure and function of cells. A protein may interact with different proteins under different time or conditions. Existing approaches only utilize static PPI network data that may lose much temporal biological information. First, this article proposed a novel method that combines gene expression data at different time points with traditional static PPI network to construct different dynamic subnetworks. Second, to further filter out the data noise, the semantic similarity based on gene ontology is regarded as the network weight together with the principal component analysis, which is introduced to deal with the weight computing by three traditional methods. Third, after building a dynamic PPI network, a predicting protein complexes algorithm based on "core-attachment" structural feature is applied to detect complexes from each dynamic subnetworks. Finally, it is revealed from the experimental results that our method proposed in this article performs well on detecting protein complexes from dynamic weighted PPI networks.

  1. Functional Analysis of OMICs Data and Small Molecule Compounds in an Integrated "Knowledge-Based" Platform.

    PubMed

    Dubovenko, Alexey; Nikolsky, Yuri; Rakhmatulin, Eugene; Nikolskaya, Tatiana

    2017-01-01

    Analysis of NGS and other sequencing data, gene variants, gene expression, proteomics, and other high-throughput (OMICs) data is challenging because of its biological complexity and high level of technical and biological noise. One way to deal with both problems is to perform analysis with a high fidelity annotated knowledgebase of protein interactions, pathways, and functional ontologies. This knowledgebase has to be structured in a computer-readable format and must include software tools for managing experimental data, analysis, and reporting. Here, we present MetaCore™ and Key Pathway Advisor (KPA), an integrated platform for functional data analysis. On the content side, MetaCore and KPA encompass a comprehensive database of molecular interactions of different types, pathways, network models, and ten functional ontologies covering human, mouse, and rat genes. The analytical toolkit includes tools for gene/protein list enrichment analysis, statistical "interactome" tool for the identification of over- and under-connected proteins in the dataset, and a biological network analysis module made up of network generation algorithms and filters. The suite also features Advanced Search, an application for combinatorial search of the database content, as well as a Java-based tool called Pathway Map Creator for drawing and editing custom pathway maps. Applications of MetaCore and KPA include molecular mode of action of disease research, identification of potential biomarkers and drug targets, pathway hypothesis generation, analysis of biological effects for novel small molecule compounds and clinical applications (analysis of large cohorts of patients, and translational and personalized medicine).

  2. Co-expression network with protein-protein interaction and transcription regulation in malaria parasite Plasmodium falciparum.

    PubMed

    Yu, Fu-Dong; Yang, Shao-You; Li, Yuan-Yuan; Hu, Wei

    2013-04-10

    Malaria continues to be one of the most severe global infectious diseases, as a major threat to human health and economic development. Network-based biological analysis is a promising approach to uncover key genes and biological processes from a network viewpoint, which could not be recognized from individual gene-based signatures. We integrated gene co-expression profile with protein-protein interaction and transcriptional regulation information to construct a comprehensive gene co-expression network of Plasmodium falciparum. Based on this network, we identified 10 core modules by using ICE (Iterative Clique Enumeration) algorithm, which were essential for malaria parasite development in intraerythrocytic developmental cycle (IDC) stages. In each module, all genes were highly correlated probably due to co-regulation or formation of a protein complex. Some of these genes were recognized to be differentially coexpressed among three close-by IDC stages. The gene of prpf8 (PFD0265w) encoding pre-mRNA processing splicing factor 8 product was identified as DCGs (differentially co-expressed genes) among IDC stages, although this gene function was seldom reported in previous researches. Integrating the species-specific gene prediction and differential co-expression gene detection, we found some modules could perform species-specific functions according to some of genes in these modules were species-specific genes, like the module 10. Furthermore, in order to reveal the underlying mechanisms of the erythrocyte invasion by P. falciparum, Steiner Tree algorithm was employed to identify the invasion subnetwork from our gene co-expression network. The subnetwork-based analysis indicated that some important Plasmodium parasite specific genes could corporate with each other and be co-regulated during the parasite invasion process, which including a head-to-head gene pair of PfRH2a (PF13_0198) and PfRH2b (MAL13P1.176). This study based on gene co-expression network could shed new insights on the mechanisms of pathogenesis, even virulence and P. falciparum development. Crown Copyright © 2012. Published by Elsevier B.V. All rights reserved.

  3. Systems Genetics Analysis of Genome-Wide Association Study Reveals Novel Associations Between Key Biological Processes and Coronary Artery Disease.

    PubMed

    Ghosh, Sujoy; Vivar, Juan; Nelson, Christopher P; Willenborg, Christina; Segrè, Ayellet V; Mäkinen, Ville-Petteri; Nikpay, Majid; Erdmann, Jeannette; Blankenberg, Stefan; O'Donnell, Christopher; März, Winfried; Laaksonen, Reijo; Stewart, Alexandre F R; Epstein, Stephen E; Shah, Svati H; Granger, Christopher B; Hazen, Stanley L; Kathiresan, Sekar; Reilly, Muredach P; Yang, Xia; Quertermous, Thomas; Samani, Nilesh J; Schunkert, Heribert; Assimes, Themistocles L; McPherson, Ruth

    2015-07-01

    Genome-wide association studies have identified multiple genetic variants affecting the risk of coronary artery disease (CAD). However, individually these explain only a small fraction of the heritability of CAD and for most, the causal biological mechanisms remain unclear. We sought to obtain further insights into potential causal processes of CAD by integrating large-scale GWA data with expertly curated databases of core human pathways and functional networks. Using pathways (gene sets) from Reactome, we carried out a 2-stage gene set enrichment analysis strategy. From a meta-analyzed discovery cohort of 7 CAD genome-wide association study data sets (9889 cases/11 089 controls), nominally significant gene sets were tested for replication in a meta-analysis of 9 additional studies (15 502 cases/55 730 controls) from the Coronary ARtery DIsease Genome wide Replication and Meta-analysis (CARDIoGRAM) Consortium. A total of 32 of 639 Reactome pathways tested showed convincing association with CAD (replication P<0.05). These pathways resided in 9 of 21 core biological processes represented in Reactome, and included pathways relevant to extracellular matrix (ECM) integrity, innate immunity, axon guidance, and signaling by PDRF (platelet-derived growth factor), NOTCH, and the transforming growth factor-β/SMAD receptor complex. Many of these pathways had strengths of association comparable to those observed in lipid transport pathways. Network analysis of unique genes within the replicated pathways further revealed several interconnected functional and topologically interacting modules representing novel associations (eg, semaphoring-regulated axonal guidance pathway) besides confirming known processes (lipid metabolism). The connectivity in the observed networks was statistically significant compared with random networks (P<0.001). Network centrality analysis (degree and betweenness) further identified genes (eg, NCAM1, FYN, FURIN, etc) likely to play critical roles in the maintenance and functioning of several of the replicated pathways. These findings provide novel insights into how genetic variation, interpreted in the context of biological processes and functional interactions among genes, may help define the genetic architecture of CAD. © 2015 American Heart Association, Inc.

  4. Network-based co-expression analysis for exploring the potential diagnostic biomarkers of metastatic melanoma.

    PubMed

    Wang, Li-Xin; Li, Yang; Chen, Guan-Zhi

    2018-01-01

    Metastatic melanoma is an aggressive skin cancer and is one of the global malignancies with high mortality and morbidity. It is essential to identify and verify diagnostic biomarkers of early metastatic melanoma. Previous studies have systematically assessed protein biomarkers and mRNA-based expression characteristics. However, molecular markers for the early diagnosis of metastatic melanoma have not been identified. To explore potential regulatory targets, we have analyzed the gene microarray expression profiles of malignant melanoma samples by co-expression analysis based on the network approach. The differentially expressed genes (DEGs) were screened by the EdgeR package of R software. A weighted gene co-expression network analysis (WGCNA) was used for the identification of DEGs in the special gene modules and hub genes. Subsequently, a protein-protein interaction network was constructed to extract hub genes associated with gene modules. Finally, twenty-four important hub genes (RASGRP2, IKZF1, CXCR5, LTB, BLK, LINGO3, CCR6, P2RY10, RHOH, JUP, KRT14, PLA2G3, SPRR1A, KRT78, SFN, CLDN4, IL1RN, PKP3, CBLC, KRT16, TMEM79, KLK8, LYPD3 and LYPD5) were treated as valuable factors involved in the immune response and tumor cell development in tumorigenesis. In addition, a transcriptional regulatory network was constructed for these specific modules or hub genes, and a few core transcriptional regulators were found to be mostly associated with our hub genes, including GATA1, STAT1, SP1, and PSG1. In summary, our findings enhance our understanding of the biological process of malignant melanoma metastasis, enabling us to identify specific genes to use for diagnostic and prognostic markers and possibly for targeted therapy.

  5. Coding and non-coding gene regulatory networks underlie the immune response in liver cirrhosis.

    PubMed

    Gao, Bo; Zhang, Xueming; Huang, Yongming; Yang, Zhengpeng; Zhang, Yuguo; Zhang, Weihui; Gao, Zu-Hua; Xue, Dongbo

    2017-01-01

    Liver cirrhosis is recognized as being the consequence of immune-mediated hepatocyte damage and repair processes. However, the regulation of these immune responses underlying liver cirrhosis has not been elucidated. In this study, we used GEO datasets and bioinformatics methods to established coding and non-coding gene regulatory networks including transcription factor-/lncRNA-microRNA-mRNA, and competing endogenous RNA interaction networks. Our results identified 2224 mRNAs, 70 lncRNAs and 46 microRNAs were differentially expressed in liver cirrhosis. The transcription factor -/lncRNA- microRNA-mRNA network we uncovered that results in immune-mediated liver cirrhosis is comprised of 5 core microRNAs (e.g., miR-203; miR-219-5p), 3 transcription factors (i.e., FOXP3, ETS1 and FOS) and 7 lncRNAs (e.g., ENTS00000671336, ENST00000575137). The competing endogenous RNA interaction network we identified includes a complex immune response regulatory subnetwork that controls the entire liver cirrhosis network. Additionally, we found 10 overlapping GO terms shared by both liver cirrhosis and hepatocellular carcinoma including "immune response" as well. Interestingly, the overlapping differentially expressed genes in liver cirrhosis and hepatocellular carcinoma were enriched in immune response-related functional terms. In summary, a complex gene regulatory network underlying immune response processes may play an important role in the development and progression of liver cirrhosis, and its development into hepatocellular carcinoma.

  6. Convergent roles of de novo mutations and common variants in schizophrenia in tissue-specific and spatiotemporal co-expression network.

    PubMed

    Jia, Peilin; Chen, Xiangning; Fanous, Ayman H; Zhao, Zhongming

    2018-05-24

    Genetic components susceptible to complex disease such as schizophrenia include a wide spectrum of variants, including common variants (CVs) and de novo mutations (DNMs). Although CVs and DNMs differ by origin, it remains elusive whether and how they interact at the gene, pathway, and network levels that leads to the disease. In this work, we characterized the genes harboring schizophrenia-associated CVs (CVgenes) and the genes harboring DNMs (DNMgenes) using measures from network, tissue-specific expression profile, and spatiotemporal brain expression profile. We developed an algorithm to link the DNMgenes and CVgenes in spatiotemporal brain co-expression networks. DNMgenes tended to have central roles in the human protein-protein interaction (PPI) network, evidenced in their high degree and high betweenness values. DNMgenes and CVgenes connected with each other significantly more often than with other genes in the networks. However, only CVgenes remained significantly connected after adjusting for their degree. In our gene co-expression PPI network, we found DNMgenes and CVgenes connected in a tissue-specific fashion, and such a pattern was similar to that in GTEx brain but not in other GTEx tissues. Importantly, DNMgene-CVgene subnetworks were enriched with pathways of chromatin remodeling, MHC protein complex binding, and neurotransmitter activities. In summary, our results unveiled that both DNMgenes and CVgenes contributed to a core set of biologically important pathways and networks, and their interactions may attribute to the risk for schizophrenia. Our results also suggested a stronger biological effect of DNMgenes than CVgenes in schizophrenia.

  7. Mapping the core of the Arabidopsis circadian clock defines the network structure of the oscillator.

    PubMed

    Huang, W; Pérez-García, P; Pokhilko, A; Millar, A J; Antoshechkin, I; Riechmann, J L; Mas, P

    2012-04-06

    In many organisms, the circadian clock is composed of functionally coupled morning and evening oscillators. In Arabidopsis, oscillator coupling relies on a core loop in which the evening oscillator component TIMING OF CAB EXPRESSION 1 (TOC1) was proposed to activate a subset of morning-expressed oscillator genes. Here, we show that TOC1 does not function as an activator but rather as a general repressor of oscillator gene expression. Repression occurs through TOC1 rhythmic association to the promoters of the oscillator genes. Hormone-dependent induction of TOC1 and analysis of RNA interference plants show that TOC1 prevents the activation of morning-expressed genes at night. Our study overturns the prevailing model of the Arabidopsis circadian clock, showing that the morning and evening oscillator loops are connected through the repressing activity of TOC1.

  8. Cross-species transcriptional network analysis reveals conservation and variation in response to metal stress in cyanobacteria

    PubMed Central

    2013-01-01

    Background As one of the most dominant bacterial groups on Earth, cyanobacteria play a pivotal role in the global carbon cycling and the Earth atmosphere composition. Understanding their molecular responses to environmental perturbations has important scientific and environmental values. Since important biological processes or networks are often evolutionarily conserved, the cross-species transcriptional network analysis offers a useful strategy to decipher conserved and species-specific transcriptional mechanisms that cells utilize to deal with various biotic and abiotic disturbances, and it will eventually lead to a better understanding of associated adaptation and regulatory networks. Results In this study, the Weighted Gene Co-expression Network Analysis (WGCNA) approach was used to establish transcriptional networks for four important cyanobacteria species under metal stress, including iron depletion and high copper conditions. Cross-species network comparison led to discovery of several core response modules and genes possibly essential to metal stress, as well as species-specific hub genes for metal stresses in different cyanobacteria species, shedding light on survival strategies of cyanobacteria responding to different environmental perturbations. Conclusions The WGCNA analysis demonstrated that the application of cross-species transcriptional network analysis will lead to novel insights to molecular response to environmental changes which will otherwise not be achieved by analyzing data from a single species. PMID:23421563

  9. Virophages, polintons, and transpovirons: a complex evolutionary network of diverse selfish genetic elements with different reproduction strategies.

    PubMed

    Yutin, Natalya; Raoult, Didier; Koonin, Eugene V

    2013-05-23

    Recent advances of genomics and metagenomics reveal remarkable diversity of viruses and other selfish genetic elements. In particular, giant viruses have been shown to possess their own mobilomes that include virophages, small viruses that parasitize on giant viruses of the Mimiviridae family, and transpovirons, distinct linear plasmids. One of the virophages known as the Mavirus, a parasite of the giant Cafeteria roenbergensis virus, shares several genes with large eukaryotic self-replicating transposon of the Polinton (Maverick) family, and it has been proposed that the polintons evolved from a Mavirus-like ancestor. We performed a comprehensive phylogenomic analysis of the available genomes of virophages and traced the evolutionary connections between the virophages and other selfish genetic elements. The comparison of the gene composition and genome organization of the virophages reveals 6 conserved, core genes that are organized in partially conserved arrays. Phylogenetic analysis of those core virophage genes, for which a sufficient diversity of homologs outside the virophages was detected, including the maturation protease and the packaging ATPase, supports the monophyly of the virophages. The results of this analysis appear incompatible with the origin of polintons from a Mavirus-like agent but rather suggest that Mavirus evolved through recombination between a polinton and an unknown virus. Altogether, virophages, polintons, a distinct Tetrahymena transposable element Tlr1, transpovirons, adenoviruses, and some bacteriophages form a network of evolutionary relationships that is held together by overlapping sets of shared genes and appears to represent a distinct module in the vast total network of viruses and mobile elements. The results of the phylogenomic analysis of the virophages and related genetic elements are compatible with the concept of network-like evolution of the virus world and emphasize multiple evolutionary connections between bona fide viruses and other classes of capsid-less mobile elements.

  10. Virophages, polintons, and transpovirons: a complex evolutionary network of diverse selfish genetic elements with different reproduction strategies

    PubMed Central

    2013-01-01

    Background Recent advances of genomics and metagenomics reveal remarkable diversity of viruses and other selfish genetic elements. In particular, giant viruses have been shown to possess their own mobilomes that include virophages, small viruses that parasitize on giant viruses of the Mimiviridae family, and transpovirons, distinct linear plasmids. One of the virophages known as the Mavirus, a parasite of the giant Cafeteria roenbergensis virus, shares several genes with large eukaryotic self-replicating transposon of the Polinton (Maverick) family, and it has been proposed that the polintons evolved from a Mavirus-like ancestor. Results We performed a comprehensive phylogenomic analysis of the available genomes of virophages and traced the evolutionary connections between the virophages and other selfish genetic elements. The comparison of the gene composition and genome organization of the virophages reveals 6 conserved, core genes that are organized in partially conserved arrays. Phylogenetic analysis of those core virophage genes, for which a sufficient diversity of homologs outside the virophages was detected, including the maturation protease and the packaging ATPase, supports the monophyly of the virophages. The results of this analysis appear incompatible with the origin of polintons from a Mavirus-like agent but rather suggest that Mavirus evolved through recombination between a polinton and an unknownvirus. Altogether, virophages, polintons, a distinct Tetrahymena transposable element Tlr1, transpovirons, adenoviruses, and some bacteriophages form a network of evolutionary relationships that is held together by overlapping sets of shared genes and appears to represent a distinct module in the vast total network of viruses and mobile elements. Conclusions The results of the phylogenomic analysis of the virophages and related genetic elements are compatible with the concept of network-like evolution of the virus world and emphasize multiple evolutionary connections between bona fide viruses and other classes of capsid-less mobile elements. PMID:23701946

  11. Initial deployment of the cardiogenic gene regulatory network in the basal chordate, Ciona intestinalis.

    PubMed

    Woznica, Arielle; Haeussler, Maximilian; Starobinska, Ella; Jemmett, Jessica; Li, Younan; Mount, David; Davidson, Brad

    2012-08-01

    The complex, partially redundant gene regulatory architecture underlying vertebrate heart formation has been difficult to characterize. Here, we dissect the primary cardiac gene regulatory network in the invertebrate chordate, Ciona intestinalis. The Ciona heart progenitor lineage is first specified by Fibroblast Growth Factor/Map Kinase (FGF/MapK) activation of the transcription factor Ets1/2 (Ets). Through microarray analysis of sorted heart progenitor cells, we identified the complete set of primary genes upregulated by FGF/Ets shortly after heart progenitor emergence. Combinatorial sequence analysis of these co-regulated genes generated a hypothetical regulatory code consisting of Ets binding sites associated with a specific co-motif, ATTA. Through extensive reporter analysis, we confirmed the functional importance of the ATTA co-motif in primary heart progenitor gene regulation. We then used the Ets/ATTA combination motif to successfully predict a number of additional heart progenitor gene regulatory elements, including an intronic element driving expression of the core conserved cardiac transcription factor, GATAa. This work significantly advances our understanding of the Ciona heart gene network. Furthermore, this work has begun to elucidate the precise regulatory architecture underlying the conserved, primary role of FGF/Ets in chordate heart lineage specification. Copyright © 2012 Elsevier Inc. All rights reserved.

  12. Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network.

    PubMed

    Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

    2016-05-05

    Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer.

  13. Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network

    PubMed Central

    Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

    2016-01-01

    Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer. PMID:27149165

  14. Systematical analysis of cutaneous squamous cell carcinoma network of microRNAs, transcription factors, and target and host genes.

    PubMed

    Wang, Ning; Xu, Zhi-Wen; Wang, Kun-Hao

    2014-01-01

    MicroRNAs (miRNAs) are small non-coding RNA molecules found in multicellular eukaryotes which are implicated in development of cancer, including cutaneous squamous cell carcinoma (cSCC). Expression is controlled by transcription factors (TFs) that bind to specific DNA sequences, thereby controlling the flow (or transcription) of genetic information from DNA to messenger RNA. Interactions result in biological signal control networks. Molecular components involved in cSCC were here assembled at abnormally expressed, related and global levels. Networks at these three levels were constructed with corresponding biological factors in term of interactions between miRNAs and target genes, TFs and miRNAs, and host genes and miRNAs. Up/down regulation or mutation of the factors were considered in the context of the regulation and significant patterns were extracted. Participants of the networks were evaluated based on their expression and regulation of other factors. Sub-networks with two core TFs, TP53 and EIF2C2, as the centers are identified. These share self-adapt feedback regulation in which a mutual restraint exists. Up or down regulation of certain genes and miRNAs are discussed. Some, for example the expression of MMP13, were in line with expectation while others, including FGFR3, need further investigation of their unexpected behavior. The present research suggests that dozens of components, miRNAs, TFs, target genes and host genes included, unite as networks through their regulation to function systematically in human cSCC. Networks built under the currently available sources provide critical signal controlling pathways and frequent patterns. Inappropriate controlling signal flow from abnormal expression of key TFs may push the system into an incontrollable situation and therefore contributes to cSCC development.

  15. Transcriptome comparison and gene coexpression network analysis provide a systems view of citrus response to ‘Candidatus Liberibacter asiaticus’ infection

    PubMed Central

    2013-01-01

    Background Huanglongbing (HLB) is arguably the most destructive disease for the citrus industry. HLB is caused by infection of the bacterium, Candidatus Liberibacter spp. Several citrus GeneChip studies have revealed thousands of genes that are up- or down-regulated by infection with Ca. Liberibacter asiaticus. However, whether and how these host genes act to protect against HLB remains poorly understood. Results As a first step towards a mechanistic view of citrus in response to the HLB bacterial infection, we performed a comparative transcriptome analysis and found that a total of 21 Probesets are commonly up-regulated by the HLB bacterial infection. In addition, a number of genes are likely regulated specifically at early, late or very late stages of the infection. Furthermore, using Pearson correlation coefficient-based gene coexpression analysis, we constructed a citrus HLB response network consisting of 3,507 Probesets and 56,287 interactions. Genes involved in carbohydrate and nitrogen metabolic processes, transport, defense, signaling and hormone response were overrepresented in the HLB response network and the subnetworks for these processes were constructed. Analysis of the defense and hormone response subnetworks indicates that hormone response is interconnected with defense response. In addition, mapping the commonly up-regulated HLB responsive genes into the HLB response network resulted in a core subnetwork where transport plays a key role in the citrus response to the HLB bacterial infection. Moreover, analysis of a phloem protein subnetwork indicates a role for this protein and zinc transporters or zinc-binding proteins in the citrus HLB defense response. Conclusion Through integrating transcriptome comparison and gene coexpression network analysis, we have provided for the first time a systems view of citrus in response to the Ca. Liberibacter spp. infection causing HLB. PMID:23324561

  16. Investigating core genetic-and-epigenetic cell cycle networks for stemness and carcinogenic mechanisms, and cancer drug design using big database mining and genome-wide next-generation sequencing data.

    PubMed

    Li, Cheng-Wei; Chen, Bor-Sen

    2016-10-01

    Recent studies have demonstrated that cell cycle plays a central role in development and carcinogenesis. Thus, the use of big databases and genome-wide high-throughput data to unravel the genetic and epigenetic mechanisms underlying cell cycle progression in stem cells and cancer cells is a matter of considerable interest. Real genetic-and-epigenetic cell cycle networks (GECNs) of embryonic stem cells (ESCs) and HeLa cancer cells were constructed by applying system modeling, system identification, and big database mining to genome-wide next-generation sequencing data. Real GECNs were then reduced to core GECNs of HeLa cells and ESCs by applying principal genome-wide network projection. In this study, we investigated potential carcinogenic and stemness mechanisms for systems cancer drug design by identifying common core and specific GECNs between HeLa cells and ESCs. Integrating drug database information with the specific GECNs of HeLa cells could lead to identification of multiple drugs for cervical cancer treatment with minimal side-effects on the genes in the common core. We found that dysregulation of miR-29C, miR-34A, miR-98, and miR-215; and methylation of ANKRD1, ARID5B, CDCA2, PIF1, STAMBPL1, TROAP, ZNF165, and HIST1H2AJ in HeLa cells could result in cell proliferation and anti-apoptosis through NFκB, TGF-β, and PI3K pathways. We also identified 3 drugs, methotrexate, quercetin, and mimosine, which repressed the activated cell cycle genes, ARID5B, STK17B, and CCL2, in HeLa cells with minimal side-effects.

  17. Identification of Causal Genes, Networks, and Transcriptional Regulators of REM Sleep and Wake

    PubMed Central

    Millstein, Joshua; Winrow, Christopher J.; Kasarskis, Andrew; Owens, Joseph R.; Zhou, Lili; Summa, Keith C.; Fitzpatrick, Karrie; Zhang, Bin; Vitaterna, Martha H.; Schadt, Eric E.; Renger, John J.; Turek, Fred W.

    2011-01-01

    Study Objective: Sleep-wake traits are well-known to be under substantial genetic control, but the specific genes and gene networks underlying primary sleep-wake traits have largely eluded identification using conventional approaches, especially in mammals. Thus, the aim of this study was to use systems genetics and statistical approaches to uncover the genetic networks underlying 2 primary sleep traits in the mouse: 24-h duration of REM sleep and wake. Design: Genome-wide RNA expression data from 3 tissues (anterior cortex, hypothalamus, thalamus/midbrain) were used in conjunction with high-density genotyping to identify candidate causal genes and networks mediating the effects of 2 QTL regulating the 24-h duration of REM sleep and one regulating the 24-h duration of wake. Setting: Basic sleep research laboratory. Patients or Participants: Male [C57BL/6J × (BALB/cByJ × C57BL/6J*) F1] N2 mice (n = 283). Interventions: None. Measurements and Results: The genetic variation of a mouse N2 mapping cross was leveraged against sleep-state phenotypic variation as well as quantitative gene expression measurement in key brain regions using integrative genomics approaches to uncover multiple causal sleep-state regulatory genes, including several surprising novel candidates, which interact as components of networks that modulate REM sleep and wake. In particular, it was discovered that a core network module, consisting of 20 genes, involved in the regulation of REM sleep duration is conserved across the cortex, hypothalamus, and thalamus. A novel application of a formal causal inference test was also used to identify those genes directly regulating sleep via control of expression. Conclusion: Systems genetics approaches reveal novel candidate genes, complex networks and specific transcriptional regulators of REM sleep and wake duration in mammals. Citation: Millstein J; Winrow CJ; Kasarskis A; Owens JR; Zhou L; Summa KC; Fitzpatrick K; Zhang B; Vitaterna MH; Schadt EE; Renger JJ; Turek FW. Identification of causal genes, networks, and transcriptional regulators of REM sleep and wake. SLEEP 2011;34(11):1469-1477. PMID:22043117

  18. Arabidopsis Ensemble Reverse-Engineered Gene Regulatory Network Discloses Interconnected Transcription Factors in Oxidative Stress[W

    PubMed Central

    Vermeirssen, Vanessa; De Clercq, Inge; Van Parys, Thomas; Van Breusegem, Frank; Van de Peer, Yves

    2014-01-01

    The abiotic stress response in plants is complex and tightly controlled by gene regulation. We present an abiotic stress gene regulatory network of 200,014 interactions for 11,938 target genes by integrating four complementary reverse-engineering solutions through average rank aggregation on an Arabidopsis thaliana microarray expression compendium. This ensemble performed the most robustly in benchmarking and greatly expands upon the availability of interactions currently reported. Besides recovering 1182 known regulatory interactions, cis-regulatory motifs and coherent functionalities of target genes corresponded with the predicted transcription factors. We provide a valuable resource of 572 abiotic stress modules of coregulated genes with functional and regulatory information, from which we deduced functional relationships for 1966 uncharacterized genes and many regulators. Using gain- and loss-of-function mutants of seven transcription factors grown under control and salt stress conditions, we experimentally validated 141 out of 271 predictions (52% precision) for 102 selected genes and mapped 148 additional transcription factor-gene regulatory interactions (49% recall). We identified an intricate core oxidative stress regulatory network where NAC13, NAC053, ERF6, WRKY6, and NAC032 transcription factors interconnect and function in detoxification. Our work shows that ensemble reverse-engineering can generate robust biological hypotheses of gene regulation in a multicellular eukaryote that can be tested by medium-throughput experimental validation. PMID:25549671

  19. Arabidopsis ensemble reverse-engineered gene regulatory network discloses interconnected transcription factors in oxidative stress.

    PubMed

    Vermeirssen, Vanessa; De Clercq, Inge; Van Parys, Thomas; Van Breusegem, Frank; Van de Peer, Yves

    2014-12-01

    The abiotic stress response in plants is complex and tightly controlled by gene regulation. We present an abiotic stress gene regulatory network of 200,014 interactions for 11,938 target genes by integrating four complementary reverse-engineering solutions through average rank aggregation on an Arabidopsis thaliana microarray expression compendium. This ensemble performed the most robustly in benchmarking and greatly expands upon the availability of interactions currently reported. Besides recovering 1182 known regulatory interactions, cis-regulatory motifs and coherent functionalities of target genes corresponded with the predicted transcription factors. We provide a valuable resource of 572 abiotic stress modules of coregulated genes with functional and regulatory information, from which we deduced functional relationships for 1966 uncharacterized genes and many regulators. Using gain- and loss-of-function mutants of seven transcription factors grown under control and salt stress conditions, we experimentally validated 141 out of 271 predictions (52% precision) for 102 selected genes and mapped 148 additional transcription factor-gene regulatory interactions (49% recall). We identified an intricate core oxidative stress regulatory network where NAC13, NAC053, ERF6, WRKY6, and NAC032 transcription factors interconnect and function in detoxification. Our work shows that ensemble reverse-engineering can generate robust biological hypotheses of gene regulation in a multicellular eukaryote that can be tested by medium-throughput experimental validation. © 2014 American Society of Plant Biologists. All rights reserved.

  20. [Exploration of common biological pathways for attention deficit hyperactivity disorder and low birth weight].

    PubMed

    Xiang, Bo; Yu, Minglan; Liang, Xuemei; Lei, Wei; Huang, Chaohua; Chen, Jing; He, Wenying; Zhang, Tao; Li, Tao; Liu, Kezhi

    2017-12-10

    To explore common biological pathways for attention deficit hyperactivity disorder (ADHD) and low birth weight (LBW). Thei-Gsea4GwasV2 software was used to analyze the result of genome-wide association analysis (GWAS) for LBW (pathways were derived from Reactome), and nominally significant (P< 0.05, FDR< 0.25) pathways were tested for replication in ADHD.Significant pathways were analyzed with DAPPLE and Reatome FI software to identify genes involved in such pathways, with each cluster enriched with the gene ontology (GO). The Centiscape2.0 software was used to calculate the degree of genetic networks and the betweenness value to explore the core node (gene). Weighed gene co-expression network analysis (WGCNA) was then used to explore the co-expression of genes in these pathways.With gene expression data derived from BrainSpan, GO enrichment was carried out for each gene module. Eleven significant biological pathways was identified in association with LBW, among which two (Selenoamino acid metabolism and Diseases associated with glycosaminoglycan metabolism) were replicated during subsequent ADHD analysis. Network analysis of 130 genes in these pathways revealed that some of the sub-networksare related with morphology of cerebellum, development of hippocampus, and plasticity of synaptic structure. Upon co-expression network analysis, 120 genes passed the quality control and were found to express in 3 gene modules. These modules are mainly related to the regulation of synaptic structure and activity regulation. ADHD and LBW share some biological regulation processes. Anomalies of such proces sesmay predispose to ADHD.

  1. The PluriNetWork: An Electronic Representation of the Network Underlying Pluripotency in Mouse, and Its Applications

    PubMed Central

    Greber, Boris; Siatkowski, Marcin; Paudel, Yogesh; Warsow, Gregor; Cap, Clemens; Schöler, Hans; Fuellen, Georg

    2010-01-01

    Background Analysis of the mechanisms underlying pluripotency and reprogramming would benefit substantially from easy access to an electronic network of genes, proteins and mechanisms. Moreover, interpreting gene expression data needs to move beyond just the identification of the up-/downregulation of key genes and of overrepresented processes and pathways, towards clarifying the essential effects of the experiment in molecular terms. Methodology/Principal Findings We have assembled a network of 574 molecular interactions, stimulations and inhibitions, based on a collection of research data from 177 publications until June 2010, involving 274 mouse genes/proteins, all in a standard electronic format, enabling analyses by readily available software such as Cytoscape and its plugins. The network includes the core circuit of Oct4 (Pou5f1), Sox2 and Nanog, its periphery (such as Stat3, Klf4, Esrrb, and c-Myc), connections to upstream signaling pathways (such as Activin, WNT, FGF, BMP, Insulin, Notch and LIF), and epigenetic regulators as well as some other relevant genes/proteins, such as proteins involved in nuclear import/export. We describe the general properties of the network, as well as a Gene Ontology analysis of the genes included. We use several expression data sets to condense the network to a set of network links that are affected in the course of an experiment, yielding hypotheses about the underlying mechanisms. Conclusions/Significance We have initiated an electronic data repository that will be useful to understand pluripotency and to facilitate the interpretation of high-throughput data. To keep up with the growth of knowledge on the fundamental processes of pluripotency and reprogramming, we suggest to combine Wiki and social networking software towards a community curation system that is easy to use and flexible, and tailored to provide a benefit for the scientist, and to improve communication and exchange of research results. A PluriNetWork tutorial is available at http://www.ibima.med.uni-rostock.de/IBIMA/PluriNetWork/. PMID:21179244

  2. fastBMA: scalable network inference and transitive reduction.

    PubMed

    Hung, Ling-Hong; Shi, Kaiyuan; Wu, Migao; Young, William Chad; Raftery, Adrian E; Yeung, Ka Yee

    2017-10-01

    Inferring genetic networks from genome-wide expression data is extremely demanding computationally. We have developed fastBMA, a distributed, parallel, and scalable implementation of Bayesian model averaging (BMA) for this purpose. fastBMA also includes a computationally efficient module for eliminating redundant indirect edges in the network by mapping the transitive reduction to an easily solved shortest-path problem. We evaluated the performance of fastBMA on synthetic data and experimental genome-wide time series yeast and human datasets. When using a single CPU core, fastBMA is up to 100 times faster than the next fastest method, LASSO, with increased accuracy. It is a memory-efficient, parallel, and distributed application that scales to human genome-wide expression data. A 10 000-gene regulation network can be obtained in a matter of hours using a 32-core cloud cluster (2 nodes of 16 cores). fastBMA is a significant improvement over its predecessor ScanBMA. It is more accurate and orders of magnitude faster than other fast network inference methods such as the 1 based on LASSO. The improved scalability allows it to calculate networks from genome scale data in a reasonable time frame. The transitive reduction method can improve accuracy in denser networks. fastBMA is available as code (M.I.T. license) from GitHub (https://github.com/lhhunghimself/fastBMA), as part of the updated networkBMA Bioconductor package (https://www.bioconductor.org/packages/release/bioc/html/networkBMA.html) and as ready-to-deploy Docker images (https://hub.docker.com/r/biodepot/fastbma/). © The Authors 2017. Published by Oxford University Press.

  3. Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks.

    PubMed

    Wang, Jian; Xie, Dong; Lin, Hongfei; Yang, Zhihao; Zhang, Yijia

    2012-06-21

    Many biological processes recognize in particular the importance of protein complexes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, high false-positive rate of PPIs leads to challenging identification. A protein semantic similarity measure is proposed in this study, based on the ontology structure of Gene Ontology (GO) terms and GO annotations to estimate the reliability of interactions in PPI networks. Interaction pairs with low GO semantic similarity are removed from the network as unreliable interactions. Then, a cluster-expanding algorithm is used to detect complexes with core-attachment structure on filtered network. Our method is applied to three different yeast PPI networks. The effectiveness of our method is examined on two benchmark complex datasets. Experimental results show that our method performed better than other state-of-the-art approaches in most evaluation metrics. The method detects protein complexes from large scale PPI networks by filtering GO semantic similarity. Removing interactions with low GO similarity significantly improves the performance of complex identification. The expanding strategy is also effective to identify attachment proteins of complexes.

  4. Coding and non-coding gene regulatory networks underlie the immune response in liver cirrhosis

    PubMed Central

    Zhang, Xueming; Huang, Yongming; Yang, Zhengpeng; Zhang, Yuguo; Zhang, Weihui; Gao, Zu-hua; Xue, Dongbo

    2017-01-01

    Liver cirrhosis is recognized as being the consequence of immune-mediated hepatocyte damage and repair processes. However, the regulation of these immune responses underlying liver cirrhosis has not been elucidated. In this study, we used GEO datasets and bioinformatics methods to established coding and non-coding gene regulatory networks including transcription factor-/lncRNA-microRNA-mRNA, and competing endogenous RNA interaction networks. Our results identified 2224 mRNAs, 70 lncRNAs and 46 microRNAs were differentially expressed in liver cirrhosis. The transcription factor -/lncRNA- microRNA-mRNA network we uncovered that results in immune-mediated liver cirrhosis is comprised of 5 core microRNAs (e.g., miR-203; miR-219-5p), 3 transcription factors (i.e., FOXP3, ETS1 and FOS) and 7 lncRNAs (e.g., ENTS00000671336, ENST00000575137). The competing endogenous RNA interaction network we identified includes a complex immune response regulatory subnetwork that controls the entire liver cirrhosis network. Additionally, we found 10 overlapping GO terms shared by both liver cirrhosis and hepatocellular carcinoma including “immune response” as well. Interestingly, the overlapping differentially expressed genes in liver cirrhosis and hepatocellular carcinoma were enriched in immune response-related functional terms. In summary, a complex gene regulatory network underlying immune response processes may play an important role in the development and progression of liver cirrhosis, and its development into hepatocellular carcinoma. PMID:28355233

  5. Gene network reconstruction from transcriptional dynamics under kinetic model uncertainty: a case for the second derivative

    PubMed Central

    Bickel, David R.; Montazeri, Zahra; Hsieh, Pei-Chun; Beatty, Mary; Lawit, Shai J.; Bate, Nicholas J.

    2009-01-01

    Motivation: Measurements of gene expression over time enable the reconstruction of transcriptional networks. However, Bayesian networks and many other current reconstruction methods rely on assumptions that conflict with the differential equations that describe transcriptional kinetics. Practical approximations of kinetic models would enable inferring causal relationships between genes from expression data of microarray, tag-based and conventional platforms, but conclusions are sensitive to the assumptions made. Results: The representation of a sufficiently large portion of genome enables computation of an upper bound on how much confidence one may place in influences between genes on the basis of expression data. Information about which genes encode transcription factors is not necessary but may be incorporated if available. The methodology is generalized to cover cases in which expression measurements are missing for many of the genes that might control the transcription of the genes of interest. The assumption that the gene expression level is roughly proportional to the rate of translation led to better empirical performance than did either the assumption that the gene expression level is roughly proportional to the protein level or the Bayesian model average of both assumptions. Availability: http://www.oisb.ca points to R code implementing the methods (R Development Core Team 2004). Contact: dbickel@uottawa.ca Supplementary information: http://www.davidbickel.com PMID:19218351

  6. A prior-based integrative framework for functional transcriptional regulatory network inference

    PubMed Central

    Siahpirani, Alireza F.

    2017-01-01

    Abstract Transcriptional regulatory networks specify regulatory proteins controlling the context-specific expression levels of genes. Inference of genome-wide regulatory networks is central to understanding gene regulation, but remains an open challenge. Expression-based network inference is among the most popular methods to infer regulatory networks, however, networks inferred from such methods have low overlap with experimentally derived (e.g. ChIP-chip and transcription factor (TF) knockouts) networks. Currently we have a limited understanding of this discrepancy. To address this gap, we first develop a regulatory network inference algorithm, based on probabilistic graphical models, to integrate expression with auxiliary datasets supporting a regulatory edge. Second, we comprehensively analyze our and other state-of-the-art methods on different expression perturbation datasets. Networks inferred by integrating sequence-specific motifs with expression have substantially greater agreement with experimentally derived networks, while remaining more predictive of expression than motif-based networks. Our analysis suggests natural genetic variation as the most informative perturbation for network inference, and, identifies core TFs whose targets are predictable from expression. Multiple reasons make the identification of targets of other TFs difficult, including network architecture and insufficient variation of TF mRNA level. Finally, we demonstrate the utility of our inference algorithm to infer stress-specific regulatory networks and for regulator prioritization. PMID:27794550

  7. Identification of gene expression profiles and key genes in subchondral bone of osteoarthritis using weighted gene coexpression network analysis.

    PubMed

    Guo, Sheng-Min; Wang, Jian-Xiong; Li, Jin; Xu, Fang-Yuan; Wei, Quan; Wang, Hai-Ming; Huang, Hou-Qiang; Zheng, Si-Lin; Xie, Yu-Jie; Zhang, Chi

    2018-06-15

    Osteoarthritis (OA) significantly influences the quality life of people around the world. It is urgent to find an effective way to understand the genetic etiology of OA. We used weighted gene coexpression network analysis (WGCNA) to explore the key genes involved in the subchondral bone pathological process of OA. Fifty gene expression profiles of GSE51588 were downloaded from the Gene Expression Omnibus database. The OA-associated genes and gene ontologies were acquired from JuniorDoc. Weighted gene coexpression network analysis was used to find disease-related networks based on 21756 gene expression correlation coefficients, hub-genes with the highest connectivity in each module were selected, and the correlation between module eigengene and clinical traits was calculated. The genes in the traits-related gene coexpression modules were subject to functional annotation and pathway enrichment analysis using ClusterProfiler. A total of 73 gene modules were identified, of which, 12 modules were found with high connectivity with clinical traits. Five modules were found with enriched OA-associated genes. Moreover, 310 OA-associated genes were found, and 34 of them were among hub-genes in each module. Consequently, enrichment results indicated some key metabolic pathways, such as extracellular matrix (ECM)-receptor interaction (hsa04512), focal adhesion (hsa04510), the phosphatidylinositol 3'-kinase (PI3K)-Akt signaling pathway (PI3K-AKT) (hsa04151), transforming growth factor beta pathway, and Wnt pathway. We intended to identify some core genes, collagen (COL)6A3, COL6A1, ITGA11, BAMBI, and HCK, which could influence downstream signaling pathways once they were activated. In this study, we identified important genes within key coexpression modules, which associate with a pathological process of subchondral bone in OA. Functional analysis results could provide important information to understand the mechanism of OA. © 2018 Wiley Periodicals, Inc.

  8. Functional networks inference from rule-based machine learning models.

    PubMed

    Lazzarini, Nicola; Widera, Paweł; Williamson, Stuart; Heer, Rakesh; Krasnogor, Natalio; Bacardit, Jaume

    2016-01-01

    Functional networks play an important role in the analysis of biological processes and systems. The inference of these networks from high-throughput (-omics) data is an area of intense research. So far, the similarity-based inference paradigm (e.g. gene co-expression) has been the most popular approach. It assumes a functional relationship between genes which are expressed at similar levels across different samples. An alternative to this paradigm is the inference of relationships from the structure of machine learning models. These models are able to capture complex relationships between variables, that often are different/complementary to the similarity-based methods. We propose a protocol to infer functional networks from machine learning models, called FuNeL. It assumes, that genes used together within a rule-based machine learning model to classify the samples, might also be functionally related at a biological level. The protocol is first tested on synthetic datasets and then evaluated on a test suite of 8 real-world datasets related to human cancer. The networks inferred from the real-world data are compared against gene co-expression networks of equal size, generated with 3 different methods. The comparison is performed from two different points of view. We analyse the enriched biological terms in the set of network nodes and the relationships between known disease-associated genes in a context of the network topology. The comparison confirms both the biological relevance and the complementary character of the knowledge captured by the FuNeL networks in relation to similarity-based methods and demonstrates its potential to identify known disease associations as core elements of the network. Finally, using a prostate cancer dataset as a case study, we confirm that the biological knowledge captured by our method is relevant to the disease and consistent with the specialised literature and with an independent dataset not used in the inference process. The implementation of our network inference protocol is available at: http://ico2s.org/software/funel.html.

  9. Heat shock instructs hESCs to exit from the self-renewal program through negative regulation of OCT4 by SAPK/JNK and HSF1 pathway.

    PubMed

    Byun, Kyunghee; Kim, Taek-Kyun; Oh, Jeehyun; Bayarsaikhan, Enkhjargal; Kim, Daesik; Lee, Min Young; Pack, Chan-Gi; Hwang, Daehee; Lee, Bonghee

    2013-11-01

    Environmental factors affect self-renewal of stem cells by modulating the components of self-renewal networks. Heat shock, an environmental factor, induces heat shock factors (HSFs), which up-regulate stress response-related genes. However, the link of heat shock to self-renewal of stem cells has not been elucidated yet. Here, we present the direct link of heat shock to a core stem cell regulator, OCT4, in the self-renewal network through SAPK/JNK and HSF1 pathway. We first showed that heat shock initiated differentiation of human embryonic stem cells (hESCs). Gene expression analysis revealed that heat shock increased the expression of many genes involved in cellular processes related to differentiation of stem cells. We then examined the effects of HSFs induced by heat shock on core self-renewal factors. Among HSFs, heat shock induced mainly HSF1 in hESCs. The HSF1 repressed the expression of OCT4, leading to the differentiation of hESCs and the above differentiation-related gene expression change. We further examined the effects of the upstream MAP (mitogen-activated protein) kinases of HSF1 on the repression of OCT4 expression by HSF1. Among the MAP kinases, SAPK/JNK controlled predominantly the repression of the OCT4 expression by HSF1. The direct link of heat shock to the core self-renewal regulator through SAPK/JNK and HSF1 provides a fundamental basis for understanding the effect of heat and other stresses involving activation of HSF1 on the self-renewal program and further controlling differentiation of hESCs in a broad spectrum of stem cell applications using these stresses. © 2013.

  10. Comparative Analysis of Tocopherol Biosynthesis Genes and Its Transcriptional Regulation in Soybean Seeds.

    PubMed

    T, Vinutha; Bansal, Navita; Kumari, Khushboo; Prashat G, Rama; Sreevathsa, Rohini; Krishnan, Veda; Kumari, Sweta; Dahuja, Anil; Lal, S K; Sachdev, Archana; Praveen, Shelly

    2017-12-20

    Tocopherols composed of four isoforms (α, β, γ, and δ) and its biosynthesis comprises of three pathways: methylerythritol 4-phosphate (MEP), shikimate (SK) and tocopherol-core pathways regulated by 25 enzymes. To understand pathway regulatory mechanism at transcriptional level, gene expression profile of tocopherol-biosynthesis genes in two soybean genotypes was carried out, the results showed significantly differential expression of 5 genes: 1-deoxy-d-xylulose-5-P-reductoisomerase (DXR), geranyl geranyl reductase (GGDR) from MEP, arogenate dehydrogenase (TyrA), tyrosine aminotransferase (TAT) from SK and γ-tocopherol methyl transferase 3 (γ-TMT3) from tocopherol-core pathways. Expression data were further analyzed for total tocopherol (T-toc) and α-tocopherol (α-toc) content by coregulation network and gene clustering approaches, the results showed least and strong association of γ-TMT3/tocopherol cyclase (TC) and DXR/DXS, respectively, with gene clusters of tocopherol biosynthesis suggested the specific role of γ-TMT3/TC in determining tocopherol accumulation and intricacy of DXR/DXS genes in coordinating precursor pathways toward tocopherol biosynthesis in soybean seeds. Thus, the present study provides insight into the major role of these genes regulating the tocopherol synthesis in soybean seeds.

  11. Evolution of fruit development genes in flowering plants

    PubMed Central

    Pabón-Mora, Natalia; Wong, Gane Ka-Shu; Ambrose, Barbara A.

    2014-01-01

    The genetic mechanisms regulating dry fruit development and opercular dehiscence have been identified in Arabidopsis thaliana. In the bicarpellate silique, valve elongation and differentiation is controlled by FRUITFULL (FUL) that antagonizes SHATTERPROOF1-2 (SHP1/SHP2) and INDEHISCENT (IND) at the dehiscence zone where they control normal lignification. SHP1/2 are also repressed by REPLUMLESS (RPL), responsible for replum formation. Similarly, FUL indirectly controls two other factors ALCATRAZ (ALC) and SPATULA (SPT) that function in the proper formation of the separation layer. FUL and SHP1/2 belong to the MADS-box family, IND and ALC belong to the bHLH family and RPL belongs to the homeodomain family, all of which are large transcription factor families. These families have undergone numerous duplications and losses in plants, likely accompanied by functional changes. Functional analyses of homologous genes suggest that this network is fairly conserved in Brassicaceae and less conserved in other core eudicots. Only the MADS box genes have been functionally characterized in basal eudicots and suggest partial conservation of the functions recorded for Brassicaceae. Here we do a comprehensive search of SHP, IND, ALC, SPT, and RPL homologs across core-eudicots, basal eudicots, monocots and basal angiosperms. Based on gene-tree analyses we hypothesize what parts of the network for fruit development in Brassicaceae, in particular regarding direct and indirect targets of FUL, might be conserved across angiosperms. PMID:25018763

  12. Fanconi Anemia Core Complex Gene Promoters Harbor Conserved Transcription Regulatory Elements

    PubMed Central

    Meier, Daniel; Schindler, Detlev

    2011-01-01

    The Fanconi anemia (FA) gene family is a recent addition to the complex network of proteins that respond to and repair certain types of DNA damage in the human genome. Since little is known about the regulation of this novel group of genes at the DNA level, we characterized the promoters of the eight genes (FANCA, B, C, E, F, G, L and M) that compose the FA core complex. The promoters of these genes show the characteristic attributes of housekeeping genes, such as a high GC content and CpG islands, a lack of TATA boxes and a low conservation. The promoters functioned in a monodirectional way and were, in their most active regions, comparable in strength to the SV40 promoter in our reporter plasmids. They were also marked by a distinctive transcriptional start site (TSS). In the 5′ region of each promoter, we identified a region that was able to negatively regulate the promoter activity in HeLa and HEK 293 cells in isolation. The central and 3′ regions of the promoter sequences harbor binding sites for several common and rare transcription factors, including STAT, SMAD, E2F, AP1 and YY1, which indicates that there may be cross-connections to several established regulatory pathways. Electrophoretic mobility shift assays and siRNA experiments confirmed the shared regulatory responses between the prominent members of the TGF-β and JAK/STAT pathways and members of the FA core complex. Although the promoters are not well conserved, they share region and sequence specific regulatory motifs and transcription factor binding sites (TBFs), and we identified a bi-partite nature to these promoters. These results support a hypothesis based on the co-evolution of the FA core complex genes that was expanded to include their promoters. PMID:21826217

  13. Fanconi anemia core complex gene promoters harbor conserved transcription regulatory elements.

    PubMed

    Meier, Daniel; Schindler, Detlev

    2011-01-01

    The Fanconi anemia (FA) gene family is a recent addition to the complex network of proteins that respond to and repair certain types of DNA damage in the human genome. Since little is known about the regulation of this novel group of genes at the DNA level, we characterized the promoters of the eight genes (FANCA, B, C, E, F, G, L and M) that compose the FA core complex. The promoters of these genes show the characteristic attributes of housekeeping genes, such as a high GC content and CpG islands, a lack of TATA boxes and a low conservation. The promoters functioned in a monodirectional way and were, in their most active regions, comparable in strength to the SV40 promoter in our reporter plasmids. They were also marked by a distinctive transcriptional start site (TSS). In the 5' region of each promoter, we identified a region that was able to negatively regulate the promoter activity in HeLa and HEK 293 cells in isolation. The central and 3' regions of the promoter sequences harbor binding sites for several common and rare transcription factors, including STAT, SMAD, E2F, AP1 and YY1, which indicates that there may be cross-connections to several established regulatory pathways. Electrophoretic mobility shift assays and siRNA experiments confirmed the shared regulatory responses between the prominent members of the TGF-β and JAK/STAT pathways and members of the FA core complex. Although the promoters are not well conserved, they share region and sequence specific regulatory motifs and transcription factor binding sites (TBFs), and we identified a bi-partite nature to these promoters. These results support a hypothesis based on the co-evolution of the FA core complex genes that was expanded to include their promoters.

  14. Gene network analysis shows immune-signaling and ERK1/2 as novel genetic markers for multiple addiction phenotypes: alcohol, smoking and opioid addiction.

    PubMed

    Reyes-Gibby, Cielito C; Yuan, Christine; Wang, Jian; Yeung, Sai-Ching J; Shete, Sanjay

    2015-06-05

    Addictions to alcohol and tobacco, known risk factors for cancer, are complex heritable disorders. Addictive behaviors have a bidirectional relationship with pain. We hypothesize that the associations between alcohol, smoking, and opioid addiction observed in cancer patients have a genetic basis. Therefore, using bioinformatics tools, we explored the underlying genetic basis and identified new candidate genes and common biological pathways for smoking, alcohol, and opioid addiction. Literature search showed 56 genes associated with alcohol, smoking and opioid addiction. Using Core Analysis function in Ingenuity Pathway Analysis software, we found that ERK1/2 was strongly interconnected across all three addiction networks. Genes involved in immune signaling pathways were shown across all three networks. Connect function from IPA My Pathway toolbox showed that DRD2 is the gene common to both the list of genetic variations associated with all three addiction phenotypes and the components of the brain neuronal signaling network involved in substance addiction. The top canonical pathways associated with the 56 genes were: 1) calcium signaling, 2) GPCR signaling, 3) cAMP-mediated signaling, 4) GABA receptor signaling, and 5) G-alpha i signaling. Cancer patients are often prescribed opioids for cancer pain thus increasing their risk for opioid abuse and addiction. Our findings provide candidate genes and biological pathways underlying addiction phenotypes, which may be future targets for treatment of addiction. Further study of the variations of the candidate genes could allow physicians to make more informed decisions when treating cancer pain with opioid analgesics.

  15. Inference of developmental gene regulatory networks beyond classical model systems: new approaches in the post-genomic era.

    PubMed

    Fernandez-Valverde, Selene L; Aguilera, Felipe; Ramos-Díaz, René Alexander

    2018-06-18

    The advent of high-throughput sequencing technologies has revolutionized the way we understand the transformation of genetic information into morphological traits. Elucidating the network of interactions between genes that govern cell differentiation through development is one of the core challenges in genome research. These networks are known as developmental gene regulatory networks (dGRNs) and consist largely of the functional linkage between developmental control genes, cis-regulatory modules and differentiation genes, which generate spatially and temporally refined patterns of gene expression. Over the last 20 years, great advances have been made in determining these gene interactions mainly in classical model systems, including human, mouse, sea urchin, fruit fly, and worm. This has brought about a radical transformation in the fields of developmental biology and evolutionary biology, allowing the generation of high-resolution gene regulatory maps to analyse cell differentiation during animal development. Such maps have enabled the identification of gene regulatory circuits and have led to the development of network inference methods that can recapitulate the differentiation of specific cell-types or developmental stages. In contrast, dGRN research in non-classical model systems has been limited to the identification of developmental control genes via the candidate gene approach and the characterization of their spatiotemporal expression patterns, as well as to the discovery of cis-regulatory modules via patterns of sequence conservation and/or predicted transcription-factor binding sites. However, thanks to the continuous advances in high-throughput sequencing technologies, this scenario is rapidly changing. Here, we give a historical overview on the architecture and elucidation of the dGRNs. Subsequently, we summarize the approaches available to unravel these regulatory networks, highlighting the vast range of possibilities of integrating multiple technical advances and theoretical approaches to expand our understanding on the global of gene regulation during animal development in non-classical model systems. Such new knowledge will not only lead to greater insights into the evolution of molecular mechanisms underlying cell identity and animal body plans, but also into the evolution of morphological key innovations in animals.

  16. Integrated Analysis of Long Noncoding RNA and mRNA Expression Profile in Advanced Laryngeal Squamous Cell Carcinoma.

    PubMed

    Feng, Ling; Wang, Ru; Lian, Meng; Ma, Hongzhi; He, Ning; Liu, Honggang; Wang, Haizhou; Fang, Jugao

    2016-01-01

    Long non-coding RNA (lncRNA) plays an important role in tumorigenesis. However, the expression pattern and function of lncRNAs in laryngeal squamous cell carcinoma (LSCC) are still unclear. To investigate the aberrantly expressed lncRNAs and mRNAs in advanced LSCC, we screened lncRNA and mRNA expression profiles in 9 pairs of primary Stage IVA LSCC tissues and adjacent non-neoplastic tissues by lncRNA and mRNA integrated microarrays. Gene Ontology and pathway analysis were performed to find out the significant function and pathway of the differentially expressed mRNAs, gene-gene functional interaction network and ceRNA network were constructed to select core mRNAs, and lncRNA-mRNA expression correlation network was built to identify the interactions between lncRNA and mRNA. qRT-PCR was performed to further validate the expressions of selected lncRNAs and mRNAs in advanced LSCC. We found 1459 differentially expressed lncRNAs and 2381 differentially expressed mRNAs, including 846 up-regulated lncRNAs and 613 down-regulated lncRNAs, 1542 up-regulated mRNAs and 839 down-regulated mRNAs. The mRNAs ITGB1, HIF1A, and DDIT4 were selected as core mRNAs, which are mainly involved in biological processes, such as matrix organization, cell cycle, adhesion, and metabolic pathway. LncRNA-mRNA expression correlation network showed LncRNA NR_027340, MIR31HG were positively correlated with ITGB1, HIF1A respectively. LncRNA SOX2-OT was negatively correlated with DDIT4. qRT-PCR further validated the expression of these lncRNAs and mRNAs. The work provides convincing evidence that the identified lncRNAs and mRNAs are potential biomarkers in advanced LSCC for further future studies.

  17. CaSPIAN: A Causal Compressive Sensing Algorithm for Discovering Directed Interactions in Gene Networks

    PubMed Central

    Emad, Amin; Milenkovic, Olgica

    2014-01-01

    We introduce a novel algorithm for inference of causal gene interactions, termed CaSPIAN (Causal Subspace Pursuit for Inference and Analysis of Networks), which is based on coupling compressive sensing and Granger causality techniques. The core of the approach is to discover sparse linear dependencies between shifted time series of gene expressions using a sequential list-version of the subspace pursuit reconstruction algorithm and to estimate the direction of gene interactions via Granger-type elimination. The method is conceptually simple and computationally efficient, and it allows for dealing with noisy measurements. Its performance as a stand-alone platform without biological side-information was tested on simulated networks, on the synthetic IRMA network in Saccharomyces cerevisiae, and on data pertaining to the human HeLa cell network and the SOS network in E. coli. The results produced by CaSPIAN are compared to the results of several related algorithms, demonstrating significant improvements in inference accuracy of documented interactions. These findings highlight the importance of Granger causality techniques for reducing the number of false-positives, as well as the influence of noise and sampling period on the accuracy of the estimates. In addition, the performance of the method was tested in conjunction with biological side information of the form of sparse “scaffold networks”, to which new edges were added using available RNA-seq or microarray data. These biological priors aid in increasing the sensitivity and precision of the algorithm in the small sample regime. PMID:24622336

  18. A high-resolution transcriptome map of cell cycle reveals novel connections between periodic genes and cancer

    PubMed Central

    Dominguez, Daniel; Tsai, Yi-Hsuan; Gomez, Nicholas; Jha, Deepak Kumar; Davis, Ian; Wang, Zefeng

    2016-01-01

    Progression through the cell cycle is largely dependent on waves of periodic gene expression, and the regulatory networks for these transcriptome dynamics have emerged as critical points of vulnerability in various aspects of tumor biology. Through RNA-sequencing of human cells during two continuous cell cycles (>2.3 billion paired reads), we identified over 1 000 mRNAs, non-coding RNAs and pseudogenes with periodic expression. Periodic transcripts are enriched in functions related to DNA metabolism, mitosis, and DNA damage response, indicating these genes likely represent putative cell cycle regulators. Using our set of periodic genes, we developed a new approach termed “mitotic trait” that can classify primary tumors and normal tissues by their transcriptome similarity to different cell cycle stages. By analyzing >4 000 tumor samples in The Cancer Genome Atlas (TCGA) and other expression data sets, we found that mitotic trait significantly correlates with genetic alterations, tumor subtype and, notably, patient survival. We further defined a core set of 67 genes with robust periodic expression in multiple cell types. Proteins encoded by these genes function as major hubs of protein-protein interaction and are mostly required for cell cycle progression. The core genes also have unique chromatin features including increased levels of CTCF/RAD21 binding and H3K36me3. Loss of these features in uterine and kidney cancers is associated with altered expression of the core 67 genes. Our study suggests new chromatin-associated mechanisms for periodic gene regulation and offers a predictor of cancer patient outcomes. PMID:27364684

  19. Core and region-enriched networks of behaviorally regulated genes and the singing genome

    PubMed Central

    Whitney, Osceola; Pfenning, Andreas R.; Howard, Jason T.; Blatti, Charles A; Liu, Fang; Ward, James M.; Wang, Rui; Audet, Jean-Nicolas; Kellis, Manolis; Mukherjee, Sayan; Sinha, Saurabh; Hartemink, Alexander J.; West, Anne E.; Jarvis, Erich D.

    2015-01-01

    Songbirds represent an important model organism for elucidating molecular mechanisms that link genes with complex behaviors, in part because they have discrete vocal learning circuits that have parallels with those that mediate human speech. We found that ~10% of the genes in the avian genome were regulated by singing, and we found a striking regional diversity of both basal and singing-induced programs in the four key song nuclei of the zebra finch, a vocal learning songbird. The region-enriched patterns were a result of distinct combinations of region-enriched transcription factors (TFs), their binding motifs, and presinging acetylation of histone 3 at lysine 27 (H3K27ac) enhancer activity in the regulatory regions of the associated genes. RNA interference manipulations validated the role of the calcium-response transcription factor (CaRF) in regulating genes preferentially expressed in specific song nuclei in response to singing. Thus, differential combinatorial binding of a small group of activity-regulated TFs and predefined epigenetic enhancer activity influences the anatomical diversity of behaviorally regulated gene networks. PMID:25504732

  20. MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters.

    PubMed

    Gonzalez-Dominguez, Jorge; Martin, Maria J

    2017-10-10

    In this work we present MPIGeneNet, a parallel tool that applies Pearson's correlation and Random Matrix Theory to construct gene co-expression networks. It is based on the state-of-the-art sequential tool RMTGeneNet, which provides networks with high robustness and sensitivity at the expenses of relatively long runtimes for large scale input datasets. MPIGeneNet returns the same results as RMTGeneNet but improves the memory management, reduces the I/O cost, and accelerates the two most computationally demanding steps of co-expression network construction by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on two different systems using three typical input datasets shows that MPIGeneNet is significantly faster than RMTGeneNet. As an example, our tool is up to 175.41 times faster on a cluster with eight nodes, each one containing two 12-core Intel Haswell processors. Source code of MPIGeneNet, as well as a reference manual, are available at https://sourceforge.net/projects/mpigenenet/.

  1. 78 FR 775 - Goodman Networks, Inc. Core Network Engineering (Deployment Engineering) Division Alpharetta, GA...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-01-04

    ...,846B; TA-W-81,846C; TA-W-81,846D] Goodman Networks, Inc. Core Network Engineering (Deployment Engineering) Division Alpharetta, GA; Goodman Networks, Inc. Core Network Engineering (Deployment Engineering) Division Hunt Valley, MD; Goodman Networks, Inc. Core Network Engineering (Deployment Engineering) Division...

  2. The Histone Acetyltransferase MOF is a Key Regulator of the Embryonic Stem Cell Core Transcriptional Network

    PubMed Central

    Li, Xiangzhi; Li, Li; Pandey, Ruchi; Byun, Jung S.; Gardner, Kevin; Qin, Zhaohui; Dou, Yali

    2012-01-01

    SUMMARY Pluripotent embryonic stem cells (ESCs) maintain self-renewal and the potential for rapid response to differentiation cues. Both ESC features are subject to epigenetic regulation. Here we show that histone acetyltransferase Mof plays an essential role in the maintenance of ESC self-renewal and pluripotency. ESCs with Mof deletion lose characteristic morphology, alkaline phosphatase (AP) staining and differentiation potential. They also have aberrant expression of core transcription factors Nanog, Oct4 and Sox2. Importantly, the phenotypes of Mof null ESCs can be partially suppressed by Nanog overexpression, supporting that Mof functions as an upstream regulator of Nanog in ESCs. Genome-wide ChIP sequencing and transcriptome analyses further demonstrate that Mof is an integral component of ESC core transcription network and Mof primes genes for diverse developmental programs. Mof is also required for Wdr5 recruitment and H3 K4 methylation at key regulatory loci, highlighting complexity and interconnectivity of various chromatin regulators in ESCs. PMID:22862943

  3. Characterizing core-periphery structure of complex network by h-core and fingerprint curve

    NASA Astrophysics Data System (ADS)

    Li, Simon S.; Ye, Adam Y.; Qi, Eric P.; Stanley, H. Eugene; Ye, Fred Y.

    2018-02-01

    It is proposed that the core-periphery structure of complex networks can be simulated by h-cores and fingerprint curves. While the features of core structure are characterized by h-core, the features of periphery structure are visualized by rose or spiral curve as the fingerprint curve linking to entire-network parameters. It is suggested that a complex network can be approached by h-core and rose curves as the first-order Fourier-approach, where the core-periphery structure is characterized by five parameters: network h-index, network radius, degree power, network density and average clustering coefficient. The simulation looks Fourier-like analysis.

  4. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

    PubMed

    Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

    2009-02-01

    Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

  5. redGEM: Systematic reduction and analysis of genome-scale metabolic reconstructions for development of consistent core metabolic models

    PubMed Central

    Ataman, Meric

    2017-01-01

    Genome-scale metabolic reconstructions have proven to be valuable resources in enhancing our understanding of metabolic networks as they encapsulate all known metabolic capabilities of the organisms from genes to proteins to their functions. However the complexity of these large metabolic networks often hinders their utility in various practical applications. Although reduced models are commonly used for modeling and in integrating experimental data, they are often inconsistent across different studies and laboratories due to different criteria and detail, which can compromise transferability of the findings and also integration of experimental data from different groups. In this study, we have developed a systematic semi-automatic approach to reduce genome-scale models into core models in a consistent and logical manner focusing on the central metabolism or subsystems of interest. The method minimizes the loss of information using an approach that combines graph-based search and optimization methods. The resulting core models are shown to be able to capture key properties of the genome-scale models and preserve consistency in terms of biomass and by-product yields, flux and concentration variability and gene essentiality. The development of these “consistently-reduced” models will help to clarify and facilitate integration of different experimental data to draw new understanding that can be directly extendable to genome-scale models. PMID:28727725

  6. 78 FR 12359 - Goodman Networks, Inc., Core Network Engineering (Deployment Engineering) Division Including...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-02-22

    ...., Core Network Engineering (Deployment Engineering) Division Including Workers in the Core Network Engineering (Deployment Engineering) Division in Alpharetta, GA, Hunt Valley, MD, Naperville, IL, and St... Reconsideration applicable to workers and former workers of Goodman Networks, Inc., Core Network Engineering...

  7. Performing a local reduction operation on a parallel computer

    DOEpatents

    Blocksome, Michael A; Faraj, Daniel A

    2013-06-04

    A parallel computer including compute nodes, each including two reduction processing cores, a network write processing core, and a network read processing core, each processing core assigned an input buffer. Copying, in interleaved chunks by the reduction processing cores, contents of the reduction processing cores' input buffers to an interleaved buffer in shared memory; copying, by one of the reduction processing cores, contents of the network write processing core's input buffer to shared memory; copying, by another of the reduction processing cores, contents of the network read processing core's input buffer to shared memory; and locally reducing in parallel by the reduction processing cores: the contents of the reduction processing core's input buffer; every other interleaved chunk of the interleaved buffer; the copied contents of the network write processing core's input buffer; and the copied contents of the network read processing core's input buffer.

  8. Performing a local reduction operation on a parallel computer

    DOEpatents

    Blocksome, Michael A.; Faraj, Daniel A.

    2012-12-11

    A parallel computer including compute nodes, each including two reduction processing cores, a network write processing core, and a network read processing core, each processing core assigned an input buffer. Copying, in interleaved chunks by the reduction processing cores, contents of the reduction processing cores' input buffers to an interleaved buffer in shared memory; copying, by one of the reduction processing cores, contents of the network write processing core's input buffer to shared memory; copying, by another of the reduction processing cores, contents of the network read processing core's input buffer to shared memory; and locally reducing in parallel by the reduction processing cores: the contents of the reduction processing core's input buffer; every other interleaved chunk of the interleaved buffer; the copied contents of the network write processing core's input buffer; and the copied contents of the network read processing core's input buffer.

  9. Robustness encoded across essential and accessory replicons of the ecologically versatile bacterium Sinorhizobium meliloti

    PubMed Central

    Walker, Graham C.; Finan, Turlough M.; Mengoni, Alessio; Griffitts, Joel S.

    2018-01-01

    Bacterial genome evolution is characterized by gains, losses, and rearrangements of functional genetic segments. The extent to which large-scale genomic alterations influence genotype-phenotype relationships has not been investigated in a high-throughput manner. In the symbiotic soil bacterium Sinorhizobium meliloti, the genome is composed of a chromosome and two large extrachromosomal replicons (pSymA and pSymB, which together constitute 45% of the genome). Massively parallel transposon insertion sequencing (Tn-seq) was employed to evaluate the contributions of chromosomal genes to growth fitness in both the presence and absence of these extrachromosomal replicons. Ten percent of chromosomal genes from diverse functional categories are shown to genetically interact with pSymA and pSymB. These results demonstrate the pervasive robustness provided by the extrachromosomal replicons, which is further supported by constraint-based metabolic modeling. A comprehensive picture of core S. meliloti metabolism was generated through a Tn-seq-guided in silico metabolic network reconstruction, producing a core network encompassing 726 genes. This integrated approach facilitated functional assignments for previously uncharacterized genes, while also revealing that Tn-seq alone missed over a quarter of wild-type metabolism. This work highlights the many functional dependencies and epistatic relationships that may arise between bacterial replicons and across a genome, while also demonstrating how Tn-seq and metabolic modeling can be used together to yield insights not obtainable by either method alone. PMID:29672509

  10. GeNeDA: An Open-Source Workflow for Design Automation of Gene Regulatory Networks Inspired from Microelectronics.

    PubMed

    Madec, Morgan; Pecheux, François; Gendrault, Yves; Rosati, Elise; Lallement, Christophe; Haiech, Jacques

    2016-10-01

    The topic of this article is the development of an open-source automated design framework for synthetic biology, specifically for the design of artificial gene regulatory networks based on a digital approach. In opposition to other tools, GeNeDA is an open-source online software based on existing tools used in microelectronics that have proven their efficiency over the last 30 years. The complete framework is composed of a computation core directly adapted from an Electronic Design Automation tool, input and output interfaces, a library of elementary parts that can be achieved with gene regulatory networks, and an interface with an electrical circuit simulator. Each of these modules is an extension of microelectronics tools and concepts: ODIN II, ABC, the Verilog language, SPICE simulator, and SystemC-AMS. GeNeDA is first validated on a benchmark of several combinatorial circuits. The results highlight the importance of the part library. Then, this framework is used for the design of a sequential circuit including a biological state machine.

  11. Integration of Genome-Wide Computation DRE Search, AhR ChIP-chip and Gene Expression Analyses of TCDD-Elicited Responses in the Mouse Liver

    PubMed Central

    2011-01-01

    Background The aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor (TF) that mediates responses to 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Integration of TCDD-induced genome-wide AhR enrichment, differential gene expression and computational dioxin response element (DRE) analyses further elucidate the hepatic AhR regulatory network. Results Global ChIP-chip and gene expression analyses were performed on hepatic tissue from immature ovariectomized mice orally gavaged with 30 μg/kg TCDD. ChIP-chip analysis identified 14,446 and 974 AhR enriched regions (1% false discovery rate) at 2 and 24 hrs, respectively. Enrichment density was greatest in the proximal promoter, and more specifically, within ± 1.5 kb of a transcriptional start site (TSS). AhR enrichment also occurred distal to a TSS (e.g. intergenic DNA and 3' UTR), extending the potential gene expression regulatory roles of the AhR. Although TF binding site analyses identified over-represented DRE sequences within enriched regions, approximately 50% of all AhR enriched regions lacked a DRE core (5'-GCGTG-3'). Microarray analysis identified 1,896 number of TCDD-responsive genes (|fold change| ≥ 1.5, P1(t) > 0.999). Integrating this gene expression data with our ChIP-chip and DRE analyses only identified 625 differentially expressed genes that involved an AhR interaction at a DRE. Functional annotation analysis of differentially regulated genes associated with AhR enrichment identified overrepresented processes related to fatty acid and lipid metabolism and transport, and xenobiotic metabolism, which are consistent with TCDD-elicited steatosis in the mouse liver. Conclusions Details of the AhR regulatory network have been expanded to include AhR-DNA interactions within intragenic and intergenic genomic regions. Moreover, the AhR can interact with DNA independent of a DRE core suggesting there are alternative mechanisms of AhR-mediated gene regulation. PMID:21762485

  12. Conserved regulatory mechanism controls the development of cells with rooting functions in land plants.

    PubMed

    Tam, Thomas Ho Yuen; Catarino, Bruno; Dolan, Liam

    2015-07-21

    Land plants develop filamentous cells-root hairs, rhizoids, and caulonemata-at the interface with the soil. Members of the group XI basic helix-loop-helix (bHLH) transcription factors encoded by LOTUS JAPONICUS ROOTHAIRLESS1-LIKE (LRL) genes positively regulate the development of root hairs in the angiosperms Lotus japonicus, Arabidopsis thaliana, and rice (Oryza sativa). Here we show that auxin promotes rhizoid and caulonema development by positively regulating the expression of PpLRL1 and PpLRL2, the two LRL genes in the Physcomitrella patens genome. Although the group VIII bHLH proteins, AtROOT HAIR DEFECTIVE6 and AtROOT HAIR DEFECTIVE SIX-LIKE1, promote root-hair development by positively regulating the expression of AtLRL3 in A. thaliana, LRL genes promote rhizoid development independently of PpROOT HAIR DEFECTIVE SIX-LIKE1 and PpROOT HAIR DEFECITVE SIX-LIKE2 (PpRSL1 and PpRSL2) gene function in P. patens. Together, these data demonstrate that both LRL and RSL genes are components of an ancient auxin-regulated gene network that controls the development of tip-growing cells with rooting functions among most extant land plants. Although this network has diverged in the moss and the angiosperm lineages, our data demonstrate that the core network acted in the last common ancestor of the mosses and angiosperms that existed sometime before 420 million years ago.

  13. Conserved regulatory mechanism controls the development of cells with rooting functions in land plants

    PubMed Central

    Tam, Thomas Ho Yuen; Catarino, Bruno; Dolan, Liam

    2015-01-01

    Land plants develop filamentous cells—root hairs, rhizoids, and caulonemata—at the interface with the soil. Members of the group XI basic helix–loop–helix (bHLH) transcription factors encoded by LOTUS JAPONICUS ROOTHAIRLESS1-LIKE (LRL) genes positively regulate the development of root hairs in the angiosperms Lotus japonicus, Arabidopsis thaliana, and rice (Oryza sativa). Here we show that auxin promotes rhizoid and caulonema development by positively regulating the expression of PpLRL1 and PpLRL2, the two LRL genes in the Physcomitrella patens genome. Although the group VIII bHLH proteins, AtROOT HAIR DEFECTIVE6 and AtROOT HAIR DEFECTIVE SIX-LIKE1, promote root-hair development by positively regulating the expression of AtLRL3 in A. thaliana, LRL genes promote rhizoid development independently of PpROOT HAIR DEFECTIVE SIX-LIKE1 and PpROOT HAIR DEFECITVE SIX-LIKE2 (PpRSL1 and PpRSL2) gene function in P. patens. Together, these data demonstrate that both LRL and RSL genes are components of an ancient auxin-regulated gene network that controls the development of tip-growing cells with rooting functions among most extant land plants. Although this network has diverged in the moss and the angiosperm lineages, our data demonstrate that the core network acted in the last common ancestor of the mosses and angiosperms that existed sometime before 420 million years ago. PMID:26150509

  14. Pan-histone deacetylase inhibitors regulate signaling pathways involved in proliferative and pro-inflammatory mechanisms in H9c2 cells

    PubMed Central

    2012-01-01

    Background We have shown previously that pan-HDAC inhibitors (HDACIs) m-carboxycinnamic acid bis-hydroxamide (CBHA) and trichostatin A (TSA) attenuated cardiac hypertrophy in BALB/c mice by inducing hyper-acetylation of cardiac chromatin that was accompanied by suppression of pro-inflammatory gene networks. However, it was not feasible to determine the precise contribution of the myocytes- and non-myocytes to HDACI-induced gene expression in the intact heart. Therefore, the current study was undertaken with a primary goal of elucidating temporal changes in the transcriptomes of cardiac myocytes exposed to CBHA and TSA. Results We incubated H9c2 cardiac myocytes in growth medium containing either of the two HDACIs for 6h and 24h and analyzed changes in gene expression using Illumina microarrays. H9c2 cells exposed to TSA for 6h and 24h led to differential expression of 468 and 231 genes, respectively. In contrast, cardiac myocytes incubated with CBHA for 6h and 24h elicited differential expression of 768 and 999 genes, respectively. We analyzed CBHA- and TSA-induced differentially expressed genes by Ingenuity Pathway (IPA), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Core_TF programs and discovered that CBHA and TSA impinged on several common gene networks. Thus, both HDACIs induced a repertoire of signaling kinases (PTEN-PI3K-AKT and MAPK) and transcription factors (Myc, p53, NFkB and HNF4A) representing canonical TGFβ, TNF-α, IFNγ and IL-6 specific networks. An overrepresentation of E2F, AP2, EGR1 and SP1 specific motifs was also found in the promoters of the differentially expressed genes. Apparently, TSA elicited predominantly TGFβ- and TNF-α-intensive gene networks regardless of the duration of treatment. In contrast, CBHA elicited TNF-α and IFNγ specific networks at 6 h, followed by elicitation of IL-6 and IFNγ-centered gene networks at 24h. Conclusions Our data show that both CBHA and TSA induced similar, but not identical, time-dependent, gene networks in H9c2 cardiac myocytes. Initially, both HDACIs impinged on numerous genes associated with adipokine signaling, intracellular metabolism and energetics, and cell cycle. A continued exposure to either CBHA or TSA led to the emergence of a number of apoptosis- and inflammation-specific gene networks that were apparently suppressed by both HDACIs. Based on these data we posit that the anti-inflammatory and anti-proliferative actions of HDACIs are myocyte-intrinsic. These findings advance our understanding of the mechanisms of actions of HDACIs on cardiac myocytes and reveal potential signaling pathways that may be targeted therapeutically. PMID:23249388

  15. Therapeutic effects of Euphorbia Pekinensis and Glycyrrhiza glabra on Hepatocellular Carcinoma Ascites Partially Via Regulating the Frk-Arhgdib-Inpp5d-Avpr2-Aqp4 Signal Axis

    NASA Astrophysics Data System (ADS)

    Zhang, Yanqiong; Yan, Chen; Li, Yuting; Mao, Xia; Tao, Weiwei; Tang, Yuping; Lin, Ya; Guo, Qiuyan; Duan, Jingao; Lin, Na

    2017-02-01

    To clarify unknown rationalities of herbaceous compatibility of Euphorbia Pekinensis (DJ) and Glycyrrhiza glabra (GC) acting on hepatocellular carcinoma (HCC) ascites, peritoneum transcriptomics profiling of 15 subjects, including normal control (Con), HCC ascites mouse model (Mod), DJ-alone, DJ/GC-synergy and DJ/GC-antagonism treatment groups were performed on OneArray platform, followed by differentially expressed genes (DEGs) screening. DEGs between Mod and Con groups were considered as HCC ascites-related genes, and those among different drug treatment and Mod groups were identified as DJ/GC-combination-related genes. Then, an interaction network of HCC ascites-related gene-DJ/GC combination-related gene-known therapeutic target gene for ascites was constructed. Based on nodes’ degree, closeness, betweenness and k-coreness, the Frk-Arhgdib-Inpp5d-Avpr2-Aqp4 axis with highly network topological importance was demonstrated to be a candidate target of DJ/GC combination acting on HCC ascites. Importantly, both qPCR and western blot analyses verified this regulatory effects based on HCC ascites mice in vivo and M-1 collecting duct cells in vitro. Collectively, different combination designs of DJ and GC may lead to synergistic or antagonistic effects on HCC ascites partially via regulating the Frk-Arhgdib-Inpp5d-Avpr2-Aqp4 axis, implying that global gene expression profiling combined with network analysis can offer an effective way to understand pharmacological mechanisms of traditional Chinese medicine prescriptions.

  16. The BlueGene/L supercomputer

    NASA Astrophysics Data System (ADS)

    Bhanota, Gyan; Chen, Dong; Gara, Alan; Vranas, Pavlos

    2003-05-01

    The architecture of the BlueGene/L massively parallel supercomputer is described. Each computing node consists of a single compute ASIC plus 256 MB of external memory. The compute ASIC integrates two 700 MHz PowerPC 440 integer CPU cores, two 2.8 Gflops floating point units, 4 MB of embedded DRAM as cache, a memory controller for external memory, six 1.4 Gbit/s bi-directional ports for a 3-dimensional torus network connection, three 2.8 Gbit/s bi-directional ports for connecting to a global tree network and a Gigabit Ethernet for I/O. 65,536 of such nodes are connected into a 3-d torus with a geometry of 32×32×64. The total peak performance of the system is 360 Teraflops and the total amount of memory is 16 TeraBytes.

  17. Dynamic optimization of metabolic networks coupled with gene expression.

    PubMed

    Waldherr, Steffen; Oyarzún, Diego A; Bockmayr, Alexander

    2015-01-21

    The regulation of metabolic activity by tuning enzyme expression levels is crucial to sustain cellular growth in changing environments. Metabolic networks are often studied at steady state using constraint-based models and optimization techniques. However, metabolic adaptations driven by changes in gene expression cannot be analyzed by steady state models, as these do not account for temporal changes in biomass composition. Here we present a dynamic optimization framework that integrates the metabolic network with the dynamics of biomass production and composition. An approximation by a timescale separation leads to a coupled model of quasi-steady state constraints on the metabolic reactions, and differential equations for the substrate concentrations and biomass composition. We propose a dynamic optimization approach to determine reaction fluxes for this model, explicitly taking into account enzyme production costs and enzymatic capacity. In contrast to the established dynamic flux balance analysis, our approach allows predicting dynamic changes in both the metabolic fluxes and the biomass composition during metabolic adaptations. Discretization of the optimization problems leads to a linear program that can be efficiently solved. We applied our algorithm in two case studies: a minimal nutrient uptake network, and an abstraction of core metabolic processes in bacteria. In the minimal model, we show that the optimized uptake rates reproduce the empirical Monod growth for bacterial cultures. For the network of core metabolic processes, the dynamic optimization algorithm predicted commonly observed metabolic adaptations, such as a diauxic switch with a preference ranking for different nutrients, re-utilization of waste products after depletion of the original substrate, and metabolic adaptation to an impending nutrient depletion. These examples illustrate how dynamic adaptations of enzyme expression can be predicted solely from an optimization principle. Copyright © 2014 Elsevier Ltd. All rights reserved.

  18. Dissecting the Calcium-Induced Differentiation of Human Primary Keratinocytes Stem Cells by Integrative and Structural Network Analyses

    PubMed Central

    Toufighi, Kiana; Yang, Jae-Seong; Luis, Nuno Miguel; Aznar Benitah, Salvador; Lehner, Ben; Serrano, Luis; Kiel, Christina

    2015-01-01

    The molecular details underlying the time-dependent assembly of protein complexes in cellular networks, such as those that occur during differentiation, are largely unexplored. Focusing on the calcium-induced differentiation of primary human keratinocytes as a model system for a major cellular reorganization process, we look at the expression of genes whose products are involved in manually-annotated protein complexes. Clustering analyses revealed only moderate co-expression of functionally related proteins during differentiation. However, when we looked at protein complexes, we found that the majority (55%) are composed of non-dynamic and dynamic gene products (‘di-chromatic’), 19% are non-dynamic, and 26% only dynamic. Considering three-dimensional protein structures to predict steric interactions, we found that proteins encoded by dynamic genes frequently interact with a common non-dynamic protein in a mutually exclusive fashion. This suggests that during differentiation, complex assemblies may also change through variation in the abundance of proteins that compete for binding to common proteins as found in some cases for paralogous proteins. Considering the example of the TNF-α/NFκB signaling complex, we suggest that the same core complex can guide signals into diverse context-specific outputs by addition of time specific expressed subunits, while keeping other cellular functions constant. Thus, our analysis provides evidence that complex assembly with stable core components and competition could contribute to cell differentiation. PMID:25946651

  19. Characterization of the Porphyromonas gingivalis Type IX Secretion Trans-envelope PorKLMNP Core Complex*

    PubMed Central

    Vincent, Maxence S.; Canestrari, Mickaël J.; Leone, Philippe; Stathopulos, Julien; Ize, Bérengère; Zoued, Abdelrahim; Cambillau, Christian; Kellenberger, Christine; Roussel, Alain

    2017-01-01

    The transport of proteins at the cell surface of Bacteroidetes depends on a secretory apparatus known as type IX secretion system (T9SS). This machine is responsible for the cell surface exposition of various proteins, such as adhesins, required for gliding motility in Flavobacterium, S-layer components in Tannerella forsythia, and tooth tissue-degrading enzymes in the oral pathogen Porphyromonas gingivalis. Although a number of subunits of the T9SS have been identified, we lack details on the architecture of this secretion apparatus. Here we provide evidence that five of the genes encoding the core complex of the T9SS are co-transcribed and that the gene products are distributed in the cell envelope. Protein-protein interaction studies then revealed that these proteins oligomerize and interact through a dense network of contacts. PMID:28057754

  20. A highly efficient multi-core algorithm for clustering extremely large datasets

    PubMed Central

    2010-01-01

    Background In recent years, the demand for computational power in computational biology has increased due to rapidly growing data sets from microarray and other high-throughput technologies. This demand is likely to increase. Standard algorithms for analyzing data, such as cluster algorithms, need to be parallelized for fast processing. Unfortunately, most approaches for parallelizing algorithms largely rely on network communication protocols connecting and requiring multiple computers. One answer to this problem is to utilize the intrinsic capabilities in current multi-core hardware to distribute the tasks among the different cores of one computer. Results We introduce a multi-core parallelization of the k-means and k-modes cluster algorithms based on the design principles of transactional memory for clustering gene expression microarray type data and categorial SNP data. Our new shared memory parallel algorithms show to be highly efficient. We demonstrate their computational power and show their utility in cluster stability and sensitivity analysis employing repeated runs with slightly changed parameters. Computation speed of our Java based algorithm was increased by a factor of 10 for large data sets while preserving computational accuracy compared to single-core implementations and a recently published network based parallelization. Conclusions Most desktop computers and even notebooks provide at least dual-core processors. Our multi-core algorithms show that using modern algorithmic concepts, parallelization makes it possible to perform even such laborious tasks as cluster sensitivity and cluster number estimation on the laboratory computer. PMID:20370922

  1. Gene-Disease Network Analysis Reveals Functional Modules in Mendelian, Complex and Environmental Diseases

    PubMed Central

    Bauer-Mehren, Anna; Bundschus, Markus; Rautschka, Michael; Mayer, Miguel A.; Sanz, Ferran; Furlong, Laura I.

    2011-01-01

    Background Scientists have been trying to understand the molecular mechanisms of diseases to design preventive and therapeutic strategies for a long time. For some diseases, it has become evident that it is not enough to obtain a catalogue of the disease-related genes but to uncover how disruptions of molecular networks in the cell give rise to disease phenotypes. Moreover, with the unprecedented wealth of information available, even obtaining such catalogue is extremely difficult. Principal Findings We developed a comprehensive gene-disease association database by integrating associations from several sources that cover different biomedical aspects of diseases. In particular, we focus on the current knowledge of human genetic diseases including mendelian, complex and environmental diseases. To assess the concept of modularity of human diseases, we performed a systematic study of the emergent properties of human gene-disease networks by means of network topology and functional annotation analysis. The results indicate a highly shared genetic origin of human diseases and show that for most diseases, including mendelian, complex and environmental diseases, functional modules exist. Moreover, a core set of biological pathways is found to be associated with most human diseases. We obtained similar results when studying clusters of diseases, suggesting that related diseases might arise due to dysfunction of common biological processes in the cell. Conclusions For the first time, we include mendelian, complex and environmental diseases in an integrated gene-disease association database and show that the concept of modularity applies for all of them. We furthermore provide a functional analysis of disease-related modules providing important new biological insights, which might not be discovered when considering each of the gene-disease association repositories independently. Hence, we present a suitable framework for the study of how genetic and environmental factors, such as drugs, contribute to diseases. Availability The gene-disease networks used in this study and part of the analysis are available at http://ibi.imim.es/DisGeNET/DisGeNETweb.html#Download. PMID:21695124

  2. Gene-disease network analysis reveals functional modules in mendelian, complex and environmental diseases.

    PubMed

    Bauer-Mehren, Anna; Bundschus, Markus; Rautschka, Michael; Mayer, Miguel A; Sanz, Ferran; Furlong, Laura I

    2011-01-01

    Scientists have been trying to understand the molecular mechanisms of diseases to design preventive and therapeutic strategies for a long time. For some diseases, it has become evident that it is not enough to obtain a catalogue of the disease-related genes but to uncover how disruptions of molecular networks in the cell give rise to disease phenotypes. Moreover, with the unprecedented wealth of information available, even obtaining such catalogue is extremely difficult. We developed a comprehensive gene-disease association database by integrating associations from several sources that cover different biomedical aspects of diseases. In particular, we focus on the current knowledge of human genetic diseases including mendelian, complex and environmental diseases. To assess the concept of modularity of human diseases, we performed a systematic study of the emergent properties of human gene-disease networks by means of network topology and functional annotation analysis. The results indicate a highly shared genetic origin of human diseases and show that for most diseases, including mendelian, complex and environmental diseases, functional modules exist. Moreover, a core set of biological pathways is found to be associated with most human diseases. We obtained similar results when studying clusters of diseases, suggesting that related diseases might arise due to dysfunction of common biological processes in the cell. For the first time, we include mendelian, complex and environmental diseases in an integrated gene-disease association database and show that the concept of modularity applies for all of them. We furthermore provide a functional analysis of disease-related modules providing important new biological insights, which might not be discovered when considering each of the gene-disease association repositories independently. Hence, we present a suitable framework for the study of how genetic and environmental factors, such as drugs, contribute to diseases. The gene-disease networks used in this study and part of the analysis are available at http://ibi.imim.es/DisGeNET/DisGeNETweb.html#Download.

  3. Network Biomarkers of Bladder Cancer Based on a Genome-Wide Genetic and Epigenetic Network Derived from Next-Generation Sequencing Data.

    PubMed

    Li, Cheng-Wei; Chen, Bor-Sen

    2016-01-01

    Epigenetic and microRNA (miRNA) regulation are associated with carcinogenesis and the development of cancer. By using the available omics data, including those from next-generation sequencing (NGS), genome-wide methylation profiling, candidate integrated genetic and epigenetic network (IGEN) analysis, and drug response genome-wide microarray analysis, we constructed an IGEN system based on three coupling regression models that characterize protein-protein interaction networks (PPINs), gene regulatory networks (GRNs), miRNA regulatory networks (MRNs), and epigenetic regulatory networks (ERNs). By applying system identification method and principal genome-wide network projection (PGNP) to IGEN analysis, we identified the core network biomarkers to investigate bladder carcinogenic mechanisms and design multiple drug combinations for treating bladder cancer with minimal side-effects. The progression of DNA repair and cell proliferation in stage 1 bladder cancer ultimately results not only in the derepression of miR-200a and miR-200b but also in the regulation of the TNF pathway to metastasis-related genes or proteins, cell proliferation, and DNA repair in stage 4 bladder cancer. We designed a multiple drug combination comprising gefitinib, estradiol, yohimbine, and fulvestrant for treating stage 1 bladder cancer with minimal side-effects, and another multiple drug combination comprising gefitinib, estradiol, chlorpromazine, and LY294002 for treating stage 4 bladder cancer with minimal side-effects.

  4. Molecular mechanisms of system responses to novel stimuli are predictable from public data

    PubMed Central

    Danziger, Samuel A.; Ratushny, Alexander V.; Smith, Jennifer J.; Saleem, Ramsey A.; Wan, Yakun; Arens, Christina E.; Armstrong, Abraham M.; Sitko, Katherine; Chen, Wei-Ming; Chiang, Jung-Hsien; Reiss, David J.; Baliga, Nitin S.; Aitchison, John D.

    2014-01-01

    Systems scale models provide the foundation for an effective iterative cycle between hypothesis generation, experiment and model refinement. Such models also enable predictions facilitating the understanding of biological complexity and the control of biological systems. Here, we demonstrate the reconstruction of a globally predictive gene regulatory model from public data: a model that can drive rational experiment design and reveal new regulatory mechanisms underlying responses to novel environments. Specifically, using ∼1500 publically available genome-wide transcriptome data sets from Saccharomyces cerevisiae, we have reconstructed an environment and gene regulatory influence network that accurately predicts regulatory mechanisms and gene expression changes on exposure of cells to completely novel environments. Focusing on transcriptional networks that induce peroxisomes biogenesis, the model-guided experiments allow us to expand a core regulatory network to include novel transcriptional influences and linkage across signaling and transcription. Thus, the approach and model provides a multi-scalar picture of gene dynamics and are powerful resources for exploiting extant data to rationally guide experimentation. The techniques outlined here are generally applicable to any biological system, which is especially important when experimental systems are challenging and samples are difficult and expensive to obtain—a common problem in laboratory animal and human studies. PMID:24185701

  5. Of woods and webs: possible alternatives to the tree of life for studying genomic fluidity in E. coli.

    PubMed

    Beauregard-Racine, Julie; Bicep, Cédric; Schliep, Klaus; Lopez, Philippe; Lapointe, François-Joseph; Bapteste, Eric

    2011-07-20

    We introduce several forest-based and network-based methods for exploring microbial evolution, and apply them to the study of thousands of genes from 30 strains of E. coli. This case study illustrates how additional analyses could offer fast heuristic alternatives to standard tree of life (TOL) approaches. We use gene networks to identify genes with atypical modes of evolution, and genome networks to characterize the evolution of genetic partnerships between E. coli and mobile genetic elements. We develop a novel polychromatic quartet method to capture patterns of recombination within E. coli, to update the clanistic toolkit, and to search for the impact of lateral gene transfer and of pathogenicity on gene evolution in two large forests of trees bearing E. coli. We unravel high rates of lateral gene transfer involving E. coli (about 40% of the trees under study), and show that both core genes and shell genes of E. coli are affected by non-tree-like evolutionary processes. We show that pathogenic lifestyle impacted the structure of 30% of the gene trees, and that pathogenic strains are more likely to transfer genes with one another than with non-pathogenic strains. In addition, we propose five groups of genes as candidate mobile modules of pathogenicity. We also present strong evidence for recent lateral gene transfer between E. coli and mobile genetic elements. Depending on which evolutionary questions biologists want to address (i.e. the identification of modules, genetic partnerships, recombination, lateral gene transfer, or genes with atypical evolutionary modes, etc.), forest-based and network-based methods are preferable to the reconstruction of a single tree, because they provide insights and produce hypotheses about the dynamics of genome evolution, rather than the relative branching order of species and lineages. Such a methodological pluralism - the use of woods and webs - is to be encouraged to analyse the evolutionary processes at play in microbial evolution.This manuscript was reviewed by: Ford Doolittle, Tal Pupko, Richard Burian, James McInerney, Didier Raoult, and Yan Boucher.

  6. Comparative cell cycle transcriptomics reveals synchronization of developmental transcription factor networks in cancer cells

    PubMed Central

    Johard, Helena; Mahdessian, Diana; Fedr, Radek; Marks, Carolyn; Medalová, Jiřina; Souček, Karel; Lundberg, Emma; Linnarsson, Sten; Bryja, Vítězslav; Sekyrova, Petra; Altun, Mikael; Andäng, Michael

    2017-01-01

    The cell cycle coordinates core functions such as replication and cell division. However, cell-cycle-regulated transcription in the control of non-core functions, such as cell identity maintenance through specific transcription factors (TFs) and signalling pathways remains unclear. Here, we provide a resource consisting of mapped transcriptomes in unsynchronized HeLa and U2OS cancer cells sorted for cell cycle phase by Fucci reporter expression. We developed a novel algorithm for data analysis that enables efficient visualization and data comparisons and identified cell cycle synchronization of Notch signalling and TFs associated with development. Furthermore, the cell cycle synchronizes with the circadian clock, providing a possible link between developmental transcriptional networks and the cell cycle. In conclusion we find that cell cycle synchronized transcriptional patterns are temporally compartmentalized and more complex than previously anticipated, involving genes, which control cell identity and development. PMID:29228002

  7. Genome network medicine: innovation to overcome huge challenges in cancer therapy.

    PubMed

    Roukos, Dimitrios H

    2014-01-01

    The post-ENCODE era shapes now a new biomedical research direction for understanding transcriptional and signaling networks driving gene expression and core cellular processes such as cell fate, survival, and apoptosis. Over the past half century, the Francis Crick 'central dogma' of single n gene/protein-phenotype (trait/disease) has defined biology, human physiology, disease, diagnostics, and drugs discovery. However, the ENCODE project and several other genomic studies using high-throughput sequencing technologies, computational strategies, and imaging techniques to visualize regulatory networks, provide evidence that transcriptional process and gene expression are regulated by highly complex dynamic molecular and signaling networks. This Focus article describes the linear experimentation-based limitations of diagnostics and therapeutics to cure advanced cancer and the need to move on from reductionist to network-based approaches. With evident a wide genomic heterogeneity, the power and challenges of next-generation sequencing (NGS) technologies to identify a patient's personal mutational landscape for tailoring the best target drugs in the individual patient are discussed. However, the available drugs are not capable of targeting aberrant signaling networks and research on functional transcriptional heterogeneity and functional genome organization is poorly understood. Therefore, the future clinical genome network medicine aiming at overcoming multiple problems in the new fields of regulatory DNA mapping, noncoding RNA, enhancer RNAs, and dynamic complexity of transcriptional circuitry are also discussed expecting in new innovation technology and strong appreciation of clinical data and evidence-based medicine. The problematic and potential solutions in the discovery of next-generation, molecular, and signaling circuitry-based biomarkers and drugs are explored. © 2013 Wiley Periodicals, Inc.

  8. Differential Gene Expression (DEX) and Alternative Splicing Events (ASE) for Temporal Dynamic Processes Using HMMs and Hierarchical Bayesian Modeling Approaches.

    PubMed

    Oh, Sunghee; Song, Seongho

    2017-01-01

    In gene expression profile, data analysis pipeline is categorized into four levels, major downstream tasks, i.e., (1) identification of differential expression; (2) clustering co-expression patterns; (3) classification of subtypes of samples; and (4) detection of genetic regulatory networks, are performed posterior to preprocessing procedure such as normalization techniques. To be more specific, temporal dynamic gene expression data has its inherent feature, namely, two neighboring time points (previous and current state) are highly correlated with each other, compared to static expression data which samples are assumed as independent individuals. In this chapter, we demonstrate how HMMs and hierarchical Bayesian modeling methods capture the horizontal time dependency structures in time series expression profiles by focusing on the identification of differential expression. In addition, those differential expression genes and transcript variant isoforms over time detected in core prerequisite steps can be generally further applied in detection of genetic regulatory networks to comprehensively uncover dynamic repertoires in the aspects of system biology as the coupled framework.

  9. The influence of the depth of k-core layers on the robustness of interdependent networks against cascading failures

    NASA Astrophysics Data System (ADS)

    Dong, Zhengcheng; Fang, Yanjun; Tian, Meng; Kong, Zhengmin

    The hierarchical structure, k-core, is common in various complex networks, and the actual network always has successive layers from 1-core layer (the peripheral layer) to km-core layer (the core layer). The nodes within the core layer have been proved to be the most influential spreaders, but there is few work about how the depth of k-core layers (the value of km) can affect the robustness against cascading failures, rather than the interdependent networks. First, following the preferential attachment, a novel method is proposed to generate the scale-free network with successive k-core layers (KCBA network), and the KCBA network is validated more realistic than the traditional BA network. Then, with KCBA interdependent networks, the effect of the depth of k-core layers is investigated. Considering the load-based model, the loss of capacity on nodes is adopted to quantify the robustness instead of the number of functional nodes in the end. We conduct two attacking strategies, i.e. the RO-attack (Randomly remove only one node) and the RF-attack (Randomly remove a fraction of nodes). Results show that the robustness of KCBA networks not only depends on the depth of k-core layers, but also is slightly influenced by the initial load. With RO-attack, the networks with less k-core layers are more robust when the initial load is small. With RF-attack, the robustness improves with small km, but the improvement is getting weaker with the increment of the initial load. In a word, the lower the depth is, the more robust the networks will be.

  10. Therapeutic effects of Euphorbia Pekinensis and Glycyrrhiza glabra on Hepatocellular Carcinoma Ascites Partially Via Regulating the Frk-Arhgdib-Inpp5d-Avpr2-Aqp4 Signal Axis

    PubMed Central

    Zhang, Yanqiong; Yan, Chen; Li, Yuting; Mao, Xia; Tao, Weiwei; Tang, Yuping; Lin, Ya; Guo, Qiuyan; Duan, Jingao; Lin, Na

    2017-01-01

    To clarify unknown rationalities of herbaceous compatibility of Euphorbia Pekinensis (DJ) and Glycyrrhiza glabra (GC) acting on hepatocellular carcinoma (HCC) ascites, peritoneum transcriptomics profiling of 15 subjects, including normal control (Con), HCC ascites mouse model (Mod), DJ-alone, DJ/GC-synergy and DJ/GC-antagonism treatment groups were performed on OneArray platform, followed by differentially expressed genes (DEGs) screening. DEGs between Mod and Con groups were considered as HCC ascites-related genes, and those among different drug treatment and Mod groups were identified as DJ/GC-combination-related genes. Then, an interaction network of HCC ascites-related gene-DJ/GC combination-related gene-known therapeutic target gene for ascites was constructed. Based on nodes’ degree, closeness, betweenness and k-coreness, the Frk-Arhgdib-Inpp5d-Avpr2-Aqp4 axis with highly network topological importance was demonstrated to be a candidate target of DJ/GC combination acting on HCC ascites. Importantly, both qPCR and western blot analyses verified this regulatory effects based on HCC ascites mice in vivo and M-1 collecting duct cells in vitro. Collectively, different combination designs of DJ and GC may lead to synergistic or antagonistic effects on HCC ascites partially via regulating the Frk-Arhgdib-Inpp5d-Avpr2-Aqp4 axis, implying that global gene expression profiling combined with network analysis can offer an effective way to understand pharmacological mechanisms of traditional Chinese medicine prescriptions. PMID:28165501

  11. Cross-Study Comparison Reveals Common Genomic, Network, and Functional Signatures of Desiccation Resistance in Drosophila melanogaster

    PubMed Central

    Telonis-Scott, Marina; Sgrò, Carla M.; Hoffmann, Ary A.; Griffin, Philippa C.

    2016-01-01

    Repeated attempts to map the genomic basis of complex traits often yield different outcomes because of the influence of genetic background, gene-by-environment interactions, and/or statistical limitations. However, where repeatability is low at the level of individual genes, overlap often occurs in gene ontology categories, genetic pathways, and interaction networks. Here we report on the genomic overlap for natural desiccation resistance from a Pool-genome-wide association study experiment and a selection experiment in flies collected from the same region in southeastern Australia in different years. We identified over 600 single nucleotide polymorphisms associated with desiccation resistance in flies derived from almost 1,000 wild-caught genotypes, a similar number of loci to that observed in our previous genomic study of selected lines, demonstrating the genetic complexity of this ecologically important trait. By harnessing the power of cross-study comparison, we narrowed the candidates from almost 400 genes in each study to a core set of 45 genes, enriched for stimulus, stress, and defense responses. In addition to gene-level overlap, there was higher order congruence at the network and functional levels, suggesting genetic redundancy in key stress sensing, stress response, immunity, signaling, and gene expression pathways. We also identified variants linked to different molecular aspects of desiccation physiology previously verified from functional experiments. Our approach provides insight into the genomic basis of a complex and ecologically important trait and predicts candidate genetic pathways to explore in multiple genetic backgrounds and related species within a functional framework. PMID:26733490

  12. Sequencing-based gene network analysis provides a core set of gene resource for understanding thermal adaptation in Zhikong scallop Chlamys farreri.

    PubMed

    Fu, X; Sun, Y; Wang, J; Xing, Q; Zou, J; Li, R; Wang, Z; Wang, S; Hu, X; Zhang, L; Bao, Z

    2014-01-01

    Marine organisms are commonly exposed to variable environmental conditions, and many of them are under threat from increased sea temperatures caused by global climate change. Generating transcriptomic resources under different stress conditions are crucial for understanding molecular mechanisms underlying thermal adaptation. In this study, we conducted transcriptome-wide gene expression profiling of the scallop Chlamys farreri challenged by acute and chronic heat stress. Of the 13 953 unique tags, more than 850 were significantly differentially expressed at each time point after acute heat stress, which was more than the number of tags differentially expressed (320-350) under chronic heat stress. To obtain a systemic view of gene expression alterations during thermal stress, a weighted gene coexpression network was constructed. Six modules were identified as acute heat stress-responsive modules. Among them, four modules involved in apoptosis regulation, mRNA binding, mitochondrial envelope formation and oxidation reduction were downregulated. The remaining two modules were upregulated. One was enriched with chaperone and the other with microsatellite sequences, whose coexpression may originate from a transcription factor binding site. These results indicated that C. farreri triggered several cellular processes to acclimate to elevated temperature. No modules responded to chronic heat stress, suggesting that the scallops might have acclimated to elevated temperature within 3 days. This study represents the first sequencing-based gene network analysis in a nonmodel aquatic species and provides valuable gene resources for the study of thermal adaptation, which should assist in the development of heat-tolerant scallop lines for aquaculture. © 2013 John Wiley & Sons Ltd.

  13. How the evolution of multicellularity set the stage for cancer

    PubMed Central

    Trigos, Anna S; Pearson, Richard B; Papenfuss, Anthony T; Goode, David L

    2018-01-01

    Neoplastic growth and many of the hallmark properties of cancer are driven by the disruption of molecular networks established during the emergence of multicellularity. Regulatory pathways and molecules that evolved to impose regulatory constraints upon networks established in earlier unicellular organisms enabled greater communication and coordination between the diverse cell types required for multicellularity, but also created liabilities in the form of points of vulnerability in the network that when mutated or dysregulated facilitate the development of cancer. These factors are usually overlooked in genomic analyses of cancer, but understanding where vulnerabilities to cancer lie in the networks of multicellular species would provide important new insights into how core molecular processes and gene regulation change during tumourigenesis. We describe how the evolutionary origins of genes influence their roles in cancer, and how connections formed between unicellular and multicellular genes that act as key regulatory hubs for normal tissue homeostasis can also contribute to malignant transformation when disrupted. Tumours in general are characterised by increased dependence on unicellular processes for survival, and major dysregulation of the control structures imposed on these processes during the evolution of multicellularity. Mounting molecular evidence suggests altered interactions at the interface between unicellular and multicellular genes play key roles in the initiation and progression of cancer. Furthermore, unicellular network regions activated in cancer show high degrees of robustness and plasticity, conferring increased adaptability to tumour cells by supporting effective responses to environmental pressures such as drug exposure. Examining how the links between multicellular and unicellular regions get disrupted in tumours has great potential to identify novel drivers of cancer, and to guide improvements to cancer treatment by identifying more effective therapeutic strategies. Recent successes in targeting unicellular processes by novel compounds underscore the logic of such approaches. Further gains could come from identifying genes at the interface between unicellular and multicellular processes and manipulating the communication between network regions of different evolutionary ages. PMID:29337961

  14. Connecting Core Percolation and Controllability of Complex Networks

    PubMed Central

    Jia, Tao; Pósfai, Márton

    2014-01-01

    Core percolation is a fundamental structural transition in complex networks related to a wide range of important problems. Recent advances have provided us an analytical framework of core percolation in uncorrelated random networks with arbitrary degree distributions. Here we apply the tools in analysis of network controllability. We confirm analytically that the emergence of the bifurcation in control coincides with the formation of the core and the structure of the core determines the control mode of the network. We also derive the analytical expression related to the controllability robustness by extending the deduction in core percolation. These findings help us better understand the interesting interplay between the structural and dynamical properties of complex networks. PMID:24946797

  15. Identifying gene coexpression networks underlying the dynamic regulation of wood-forming tissues in Populus under diverse environmental conditions.

    PubMed

    Zinkgraf, Matthew; Liu, Lijun; Groover, Andrew; Filkov, Vladimir

    2017-06-01

    Trees modify wood formation through integration of environmental and developmental signals in complex but poorly defined transcriptional networks, allowing trees to produce woody tissues appropriate to diverse environmental conditions. In order to identify relationships among genes expressed during wood formation, we integrated data from new and publically available datasets in Populus. These datasets were generated from woody tissue and include transcriptome profiling, transcription factor binding, DNA accessibility and genome-wide association mapping experiments. Coexpression modules were calculated, each of which contains genes showing similar expression patterns across experimental conditions, genotypes and treatments. Conserved gene coexpression modules (four modules totaling 8398 genes) were identified that were highly preserved across diverse environmental conditions and genetic backgrounds. Functional annotations as well as correlations with specific experimental treatments associated individual conserved modules with distinct biological processes underlying wood formation, such as cell-wall biosynthesis, meristem development and epigenetic pathways. Module genes were also enriched for DNase I hypersensitivity footprints and binding from four transcription factors associated with wood formation. The conserved modules are excellent candidates for modeling core developmental pathways common to wood formation in diverse environments and genotypes, and serve as testbeds for hypothesis generation and testing for future studies. No claim to original US government works. New Phytologist © 2017 New Phytologist Trust.

  16. Integrative functional transcriptomic analyses implicate specific molecular pathways in pulmonary toxicity from exposure to aluminum oxide nanoparticles.

    PubMed

    Li, Xiaobo; Zhang, Chengcheng; Bian, Qian; Gao, Na; Zhang, Xin; Meng, Qingtao; Wu, Shenshen; Wang, Shizhi; Xia, Yankai; Chen, Rui

    2016-09-01

    Gene expression profiling has developed rapidly in recent years and it can predict and define mechanisms underlying chemical toxicity. Here, RNA microarray and computational technology were used to show that aluminum oxide nanoparticles (Al2O3 NPs) were capable of triggering up-regulation of genes related to the cell cycle and cell death in a human A549 lung adenocarcinoma cell line. Gene expression levels were validated in Al2O3 NPs exposed A549 cells and mice lung tissues, most of which showed consistent trends in regulation. Gene-transcription factor network analysis coupled with cell- and animal-based assays demonstrated that the genes encoding PTPN6, RTN4, BAX and IER play a role in the biological responses induced by the nanoparticle exposure, which caused cell death and cell cycle arrest in the G2/S phase. Further, down-regulated PTPN6 expression demonstrated a core role in the network, thus expression level of PTPN6 was rescued by plasmid transfection, which showed ameliorative effects of A549 cells against cell death and cell cycle arrest. These results demonstrate the feasibility of using gene expression profiling to predict cellular responses induced by nanomaterials, which could be used to develop a comprehensive knowledge of nanotoxicity.

  17. Who Can You Turn to? Tie Activation within Core Business Discussion Networks

    ERIC Educational Resources Information Center

    Renzulli, Linda A.; Aldrich, Howard

    2005-01-01

    We examine the connection between personal network characteristics and the activation of ties for access to resources during routine times. We focus on factors affecting business owners' use of their core network ties to obtain legal, loan, financial and expert advice. Owners rely more on core business ties when their core networks contain a high…

  18. Spatial gradients of protein-level time delays set the pace of the traveling segmentation clock waves

    PubMed Central

    Ay, Ahmet; Holland, Jack; Sperlea, Adriana; Devakanmalai, Gnanapackiam Sheela; Knierer, Stephan; Sangervasi, Sebastian; Stevenson, Angel; Özbudak, Ertuğrul M.

    2014-01-01

    The vertebrate segmentation clock is a gene expression oscillator controlling rhythmic segmentation of the vertebral column during embryonic development. The period of oscillations becomes longer as cells are displaced along the posterior to anterior axis, which results in traveling waves of clock gene expression sweeping in the unsegmented tissue. Although various hypotheses necessitating the inclusion of additional regulatory genes into the core clock network at different spatial locations have been proposed, the mechanism underlying traveling waves has remained elusive. Here, we combined molecular-level computational modeling and quantitative experimentation to solve this puzzle. Our model predicts the existence of an increasing gradient of gene expression time delays along the posterior to anterior direction to recapitulate spatiotemporal profiles of the traveling segmentation clock waves in different genetic backgrounds in zebrafish. We validated this prediction by measuring an increased time delay of oscillatory Her1 protein production along the unsegmented tissue. Our results refuted the need for spatial expansion of the core feedback loop to explain the occurrence of traveling waves. Spatial regulation of gene expression time delays is a novel way of creating dynamic patterns; this is the first report demonstrating such a control mechanism in any tissue and future investigations will explore the presence of analogous examples in other biological systems. PMID:25336742

  19. Developmental Self-Construction and -Configuration of Functional Neocortical Neuronal Networks

    PubMed Central

    Bauer, Roman; Zubler, Frédéric; Pfister, Sabina; Hauri, Andreas; Pfeiffer, Michael; Muir, Dylan R.; Douglas, Rodney J.

    2014-01-01

    The prenatal development of neural circuits must provide sufficient configuration to support at least a set of core postnatal behaviors. Although knowledge of various genetic and cellular aspects of development is accumulating rapidly, there is less systematic understanding of how these various processes play together in order to construct such functional networks. Here we make some steps toward such understanding by demonstrating through detailed simulations how a competitive co-operative (‘winner-take-all’, WTA) network architecture can arise by development from a single precursor cell. This precursor is granted a simplified gene regulatory network that directs cell mitosis, differentiation, migration, neurite outgrowth and synaptogenesis. Once initial axonal connection patterns are established, their synaptic weights undergo homeostatic unsupervised learning that is shaped by wave-like input patterns. We demonstrate how this autonomous genetically directed developmental sequence can give rise to self-calibrated WTA networks, and compare our simulation results with biological data. PMID:25474693

  20. Bipartite Community Structure of eQTLs.

    PubMed

    Platig, John; Castaldi, Peter J; DeMeo, Dawn; Quackenbush, John

    2016-09-01

    Genome Wide Association Studies (GWAS) and expression quantitative trait locus (eQTL) analyses have identified genetic associations with a wide range of human phenotypes. However, many of these variants have weak effects and understanding their combined effect remains a challenge. One hypothesis is that multiple SNPs interact in complex networks to influence functional processes that ultimately lead to complex phenotypes, including disease states. Here we present CONDOR, a method that represents both cis- and trans-acting SNPs and the genes with which they are associated as a bipartite graph and then uses the modular structure of that graph to place SNPs into a functional context. In applying CONDOR to eQTLs in chronic obstructive pulmonary disease (COPD), we found the global network "hub" SNPs were devoid of disease associations through GWAS. However, the network was organized into 52 communities of SNPs and genes, many of which were enriched for genes in specific functional classes. We identified local hubs within each community ("core SNPs") and these were enriched for GWAS SNPs for COPD and many other diseases. These results speak to our intuition: rather than single SNPs influencing single genes, we see groups of SNPs associated with the expression of families of functionally related genes and that disease SNPs are associated with the perturbation of those functions. These methods are not limited in their application to COPD and can be used in the analysis of a wide variety of disease processes and other phenotypic traits.

  1. Genome-scale cold stress response regulatory networks in ten Arabidopsis thaliana ecotypes

    PubMed Central

    2013-01-01

    Background Low temperature leads to major crop losses every year. Although several studies have been conducted focusing on diversity of cold tolerance level in multiple phenotypically divergent Arabidopsis thaliana (A. thaliana) ecotypes, genome-scale molecular understanding is still lacking. Results In this study, we report genome-scale transcript response diversity of 10 A. thaliana ecotypes originating from different geographical locations to non-freezing cold stress (10°C). To analyze the transcriptional response diversity, we initially compared transcriptome changes in all 10 ecotypes using Arabidopsis NimbleGen ATH6 microarrays. In total 6061 transcripts were significantly cold regulated (p < 0.01) in 10 ecotypes, including 498 transcription factors and 315 transposable elements. The majority of the transcripts (75%) showed ecotype specific expression pattern. By using sequence data available from Arabidopsis thaliana 1001 genome project, we further investigated sequence polymorphisms in the core cold stress regulon genes. Significant numbers of non-synonymous amino acid changes were observed in the coding region of the CBF regulon genes. Considering the limited knowledge about regulatory interactions between transcription factors and their target genes in the model plant A. thaliana, we have adopted a powerful systems genetics approach- Network Component Analysis (NCA) to construct an in-silico transcriptional regulatory network model during response to cold stress. The resulting regulatory network contained 1,275 nodes and 7,720 connections, with 178 transcription factors and 1,331 target genes. Conclusions A. thaliana ecotypes exhibit considerable variation in transcriptome level responses to non-freezing cold stress treatment. Ecotype specific transcripts and related gene ontology (GO) categories were identified to delineate natural variation of cold stress regulated differential gene expression in the model plant A. thaliana. The predicted regulatory network model was able to identify new ecotype specific transcription factors and their regulatory interactions, which might be crucial for their local geographic adaptation to cold temperature. Additionally, since the approach presented here is general, it could be adapted to study networks regulating biological process in any biological systems. PMID:24148294

  2. Core-periphery structure requires something else in the network

    NASA Astrophysics Data System (ADS)

    Kojaku, Sadamori; Masuda, Naoki

    2018-04-01

    A network with core-periphery structure consists of core nodes that are densely interconnected. In contrast to a community structure, which is a different meso-scale structure of networks, core nodes can be connected to peripheral nodes and peripheral nodes are not densely interconnected. Although core-periphery structure sounds reasonable, we argue that it is merely accounted for by heterogeneous degree distributions, if one partitions a network into a single core block and a single periphery block, which the famous Borgatti–Everett algorithm and many succeeding algorithms assume. In other words, there is a strong tendency that high-degree and low-degree nodes are judged to be core and peripheral nodes, respectively. To discuss core-periphery structure beyond the expectation of the node’s degree (as described by the configuration model), we propose that one needs to assume at least one block of nodes apart from the focal core-periphery structure, such as a different core-periphery pair, community or nodes not belonging to any meso-scale structure. We propose a scalable algorithm to detect pairs of core and periphery in networks, controlling for the effect of the node’s degree. We illustrate our algorithm using various empirical networks.

  3. Biological computational approaches: new hopes to improve (re)programming robustness, regenerative medicine and cancer therapeutics.

    PubMed

    Ebrahimi, Behnam

    2016-01-01

    Hundreds of transcription factors (TFs) are expressed and work in each cell type, but the identity of the cells is defined and maintained through the activity of a small number of core TFs. Existing reprogramming strategies predominantly focus on the ectopic expression of core TFs of an intended fate in a given cell type regardless of the state of native/somatic gene regulatory networks (GRNs) of the starting cells. Interestingly, an important point is that how much products of the reprogramming, transdifferentiation and differentiation (programming) are identical to their in vivo counterparts. There is evidence that shows that direct fate conversions of somatic cells are not complete, with target cell identity not fully achieved. Manipulation of core TFs provides a powerful tool for engineering cell fate in terms of extinguishment of native GRNs, the establishment of a new GRN, and preventing installation of aberrant GRNs. Conventionally, core TFs are selected to convert one cell type into another mostly based on literature and the experimental identification of genes that are differentially expressed in one cell type compared to the specific cell types. Currently, there is not a universal standard strategy for identifying candidate core TFs. Remarkably, several biological computational platforms are developed, which are capable of evaluating the fidelity of reprogramming methods and refining existing protocols. The current review discusses some deficiencies of reprogramming technologies in the production of a pure population of authentic target cells. Furthermore, it reviews the role of computational approaches (e.g. CellNet, KeyGenes, Mogrify, etc.) in improving (re)programming methods and consequently in regenerative medicine and cancer therapeutics. Copyright © 2016 International Society of Differentiation. Published by Elsevier B.V. All rights reserved.

  4. Landscape genetics for the empirical assessment of resistance surfaces: the European pine marten (Martes martes) as a target-species of a regional ecological network.

    PubMed

    Ruiz-González, Aritz; Gurrutxaga, Mikel; Cushman, Samuel A; Madeira, María José; Randi, Ettore; Gómez-Moliner, Benjamin J

    2014-01-01

    Coherent ecological networks (EN) composed of core areas linked by ecological corridors are being developed worldwide with the goal of promoting landscape connectivity and biodiversity conservation. However, empirical assessment of the performance of EN designs is critical to evaluate the utility of these networks to mitigate effects of habitat loss and fragmentation. Landscape genetics provides a particularly valuable framework to address the question of functional connectivity by providing a direct means to investigate the effects of landscape structure on gene flow. The goals of this study are (1) to evaluate the landscape features that drive gene flow of an EN target species (European pine marten), and (2) evaluate the optimality of a regional EN design in providing connectivity for this species within the Basque Country (North Spain). Using partial Mantel tests in a reciprocal causal modeling framework we competed 59 alternative models, including isolation by distance and the regional EN. Our analysis indicated that the regional EN was among the most supported resistance models for the pine marten, but was not the best supported model. Gene flow of pine marten in northern Spain is facilitated by natural vegetation, and is resisted by anthropogenic landcover types and roads. Our results suggest that the regional EN design being implemented in the Basque Country will effectively facilitate gene flow of forest dwelling species at regional scale.

  5. Structures and Boolean Dynamics in Gene Regulatory Networks

    NASA Astrophysics Data System (ADS)

    Szedlak, Anthony

    This dissertation discusses the topological and dynamical properties of GRNs in cancer, and is divided into four main chapters. First, the basic tools of modern complex network theory are introduced. These traditional tools as well as those developed by myself (set efficiency, interset efficiency, and nested communities) are crucial for understanding the intricate topological properties of GRNs, and later chapters recall these concepts. Second, the biology of gene regulation is discussed, and a method for disease-specific GRN reconstruction developed by our collaboration is presented. This complements the traditional exhaustive experimental approach of building GRNs edge-by-edge by quickly inferring the existence of as of yet undiscovered edges using correlations across sets of gene expression data. This method also provides insight into the distribution of common mutations across GRNs. Third, I demonstrate that the structures present in these reconstructed networks are strongly related to the evolutionary histories of their constituent genes. Investigation of how the forces of evolution shaped the topology of GRNs in multicellular organisms by growing outward from a core of ancient, conserved genes can shed light upon the ''reverse evolution'' of normal cells into unicellular-like cancer states. Next, I simulate the dynamics of the GRNs of cancer cells using the Hopfield model, an infinite range spin-glass model designed with the ability to encode Boolean data as attractor states. This attractor-driven approach facilitates the integration of gene expression data into predictive mathematical models. Perturbations representing therapeutic interventions are applied to sets of genes, and the resulting deviations from their attractor states are recorded, suggesting new potential drug targets for experimentation. Finally, I extend the Hopfield model to modular networks, cyclic attractors, and complex attractors, and apply these concepts to simulations of the cell cycle process. Futher development of these and other theoretical and computational tools is necessary to analyze the deluge of experimental data produced by modern and future biological high throughput methods. (Abstract shortened by ProQuest.).

  6. Gene Network Polymorphism Illuminates Loss and Retention of Novel RNAi Silencing Components in the Cryptococcus Pathogenic Species Complex.

    PubMed

    Feretzaki, Marianna; Billmyre, R Blake; Clancey, Shelly Applen; Wang, Xuying; Heitman, Joseph

    2016-03-01

    RNAi is a ubiquitous pathway that serves central functions throughout eukaryotes, including maintenance of genome stability and repression of transposon expression and movement. However, a number of organisms have lost their RNAi pathways, including the model yeast Saccharomyces cerevisiae, the maize pathogen Ustilago maydis, the human pathogen Cryptococcus deuterogattii, and some human parasite pathogens, suggesting there may be adaptive benefits associated with both retention and loss of RNAi. By comparing the RNAi-deficient genome of the Pacific Northwest Outbreak C. deuterogattii strain R265 with the RNAi-proficient genomes of the Cryptococcus pathogenic species complex, we identified a set of conserved genes that were lost in R265 and all other C. deuterogattii isolates examined. Genetic and molecular analyses reveal several of these lost genes play roles in RNAi pathways. Four novel components were examined further. Znf3 (a zinc finger protein) and Qip1 (a homolog of N. crassa Qip) were found to be essential for RNAi, while Cpr2 (a constitutive pheromone receptor) and Fzc28 (a transcription factor) are involved in sex-induced but not mitosis-induced silencing. Our results demonstrate that the mitotic and sex-induced RNAi pathways rely on the same core components, but sex-induced silencing may be a more specific, highly induced variant that involves additional specialized or regulatory components. Our studies further illustrate how gene network polymorphisms involving known components of key cellular pathways can inform identification of novel elements and suggest that RNAi loss may have been a core event in the speciation of C. deuterogattii and possibly contributed to its pathogenic trajectory.

  7. An Ancient Gene Network Is Co-opted for Teeth on Old and New Jaws

    PubMed Central

    Fraser, Gareth J; Hulsey, C. Darrin; Bloomquist, Ryan F; Uyesugi, Kristine; Manley, Nancy R; Streelman, J. Todd

    2009-01-01

    Vertebrate dentitions originated in the posterior pharynx of jawless fishes more than half a billion years ago. As gnathostomes (jawed vertebrates) evolved, teeth developed on oral jaws and helped to establish the dominance of this lineage on land and in the sea. The advent of oral jaws was facilitated, in part, by absence of hox gene expression in the first, most anterior, pharyngeal arch. Much later in evolutionary time, teleost fishes evolved a novel toothed jaw in the pharynx, the location of the first vertebrate teeth. To examine the evolutionary modularity of dentitions, we asked whether oral and pharyngeal teeth develop using common or independent gene regulatory pathways. First, we showed that tooth number is correlated on oral and pharyngeal jaws across species of cichlid fishes from Lake Malawi (East Africa), suggestive of common regulatory mechanisms for tooth initiation. Surprisingly, we found that cichlid pharyngeal dentitions develop in a region of dense hox gene expression. Thus, regulation of tooth number is conserved, despite distinct developmental environments of oral and pharyngeal jaws; pharyngeal jaws occupy hox-positive, endodermal sites, and oral jaws develop in hox-negative regions with ectodermal cell contributions. Next, we studied the expression of a dental gene network for tooth initiation, most genes of which are similarly deployed across the two disparate jaw sites. This collection of genes includes members of the ectodysplasin pathway, eda and edar, expressed identically during the patterning of oral and pharyngeal teeth. Taken together, these data suggest that pharyngeal teeth of jawless vertebrates utilized an ancient gene network before the origin of oral jaws, oral teeth, and ectodermal appendages. The first vertebrate dentition likely appeared in a hox-positive, endodermal environment and expressed a genetic program including ectodysplasin pathway genes. This ancient regulatory circuit was co-opted and modified for teeth in oral jaws of the first jawed vertebrate, and subsequently deployed as jaws enveloped teeth on novel pharyngeal jaws. Our data highlight an amazing modularity of jaws and teeth as they coevolved during the history of vertebrates. We exploit this diversity to infer a core dental gene network, common to the first tooth and all of its descendants. PMID:19215146

  8. Multiple genome alignment for identifying the core structure among moderately related microbial genomes.

    PubMed

    Uchiyama, Ikuo

    2008-10-31

    Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.

  9. Use of natural variation reveals core genes in the transcriptome of iron-deficient Arabidopsis thaliana roots

    PubMed Central

    Stein, Ricardo J.; Waters, Brian M.

    2012-01-01

    Iron (Fe) is an essential mineral micronutrient for plants and animals. Plants respond to Fe deficiency by increasing root uptake capacity. Identification of gene networks for Fe uptake and homeostasis could result in improved crop growth and nutritional value. Previous studies have used microarrays to identify a large number of genes regulated by Fe deficiency in roots of three Arabidopsis ecotypes. However, a large proportion of these genes may be involved in secondary or genotype-influenced responses rather than in a universal role in Fe uptake or homeostasis. Here we show that a small percentage of the Fe deficiency transcriptome of two contrasting ecotypes, Kas-1 and Tsu-1, was shared with other ecotypes. Kas-1 and Tsu-1 had different timing and magnitude of ferric reductase activity upon Fe withdrawal, and different categories of overrepresented Fe-regulated genes. To gain insights into universal responses of Arabidopsis to Fe deficiency, the Kas-1 and Tsu-1 transcriptomes were compared with those of Col-0, Ler, and C24. In early Fe deficiency (24–48 h), no Fe-downregulated genes and only 10 upregulated genes were found in all ecotypes, and only 20 Fe-downregulated and 58 upregulated genes were found in at least three of the five ecotypes. Supernode gene networks were constructed to visualize conserved Fe homeostasis responses. Contrasting gene expression highlighted different responses to Fe deficiency between ecotypes. This study demonstrates the use of natural variation to identify central Fe-deficiency-regulated genes in plants, and identified genes with potential new roles in signalling during Fe deficiency. PMID:22039296

  10. Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks

    PubMed Central

    Kim, Deokho; Park, Karam; Ro, Won W.

    2011-01-01

    While network coding is well known for its efficiency and usefulness in wireless sensor networks, the excessive costs associated with decoding computation and complexity still hinder its adoption into practical use. On the other hand, high-performance microprocessors with heterogeneous multi-cores would be used as processing nodes of the wireless sensor networks in the near future. To this end, this paper introduces an efficient network coding algorithm developed for the heterogenous multi-core processors. The proposed idea is fully tested on one of the currently available heterogeneous multi-core processors referred to as the Cell Broadband Engine. PMID:22164053

  11. Virtual optical network mapping and core allocation in elastic optical networks using multi-core fibers

    NASA Astrophysics Data System (ADS)

    Xuan, Hejun; Wang, Yuping; Xu, Zhanqi; Hao, Shanshan; Wang, Xiaoli

    2017-11-01

    Virtualization technology can greatly improve the efficiency of the networks by allowing the virtual optical networks to share the resources of the physical networks. However, it will face some challenges, such as finding the efficient strategies for virtual nodes mapping, virtual links mapping and spectrum assignment. It is even more complex and challenging when the physical elastic optical networks using multi-core fibers. To tackle these challenges, we establish a constrained optimization model to determine the optimal schemes of optical network mapping, core allocation and spectrum assignment. To solve the model efficiently, tailor-made encoding scheme, crossover and mutation operators are designed. Based on these, an efficient genetic algorithm is proposed to obtain the optimal schemes of the virtual nodes mapping, virtual links mapping, core allocation. The simulation experiments are conducted on three widely used networks, and the experimental results show the effectiveness of the proposed model and algorithm.

  12. Identification of a 5‑lncRNA signature‑based risk scoring system for survival prediction in colorectal cancer.

    PubMed

    Gu, Liqiang; Yu, Jun; Wang, Qing; Xu, Bin; Ji, Liechen; Yu, Lin; Zhang, Xipeng; Cai, Hui

    2018-05-03

    The present study aimed to investigate potential prognostic long noncoding RNAs (lncRNAs) associated with colorectal cancer (CRC). An mRNA‑seq dataset obtained from The Cancer Genome Atlas was employed to identify the differentially expressed lncRNAs (DELs) between CRC patients with good and poor prognoses. Subsequently, univariate and multivariate Cox regression analyses were conducted to analyze the prognosis‑associated lncRNAs among all DELs. In addition, a risk scoring system was developed according to the expression levels of the prognostic lncRNAs, which was then applied to a training set and an independent testing set. Furthermore, the co‑expressed genes of prognostic lncRNAs were screened using a Multi‑Experiment Matrix online tool for construction of lncRNA‑gene networks. Finally, Kyoto Encyclopedia of Genes and Genomes pathway and Gene Ontology (GO) function enrichment analyses were performed on genes in the lncRNA‑gene networks using KOBAS, GOATOOLS and ClusterProfiler. The present study identified 82 DELs, of which long intergenic nonprotein coding RNA 2159, RP11‑452L6.6, RP11‑894P9.1 and RP11‑69M1.6, and whey acidic protein four‑disulfide core domain 21 (WFDC21P) were reported to be independently associated with the prognosis of patients with CRC. A 5‑lncRNA signature‑based risk scoring system was developed, which may be used to classify patients into low‑ and high‑risk groups with significantly different recurrence‑free survival times in the training and testing sets (P<0.05). Co‑expressed genes of WFDC21P or RP11‑69M1.6 were utilized to construct the lncRNA‑gene networks. Genes in the networks were significantly enriched in 'tight junction', 'focal adhesion' and 'regulation of actin cytoskeleton' pathways, and numerous GO terms associated with 'reactive oxygen species metabolism' and 'nitric oxide metabolism'. The present study proposed a 5‑lncRNA signature‑based risk scoring system for predicting the prognosis of patients with CRC, and revealed the associated signaling pathways and biological processes. The results of the present study may help improve prognostic evaluation in clinical practice.

  13. Evolution of Drosophila ribosomal protein gene core promoters.

    PubMed

    Ma, Xiaotu; Zhang, Kangyu; Li, Xiaoman

    2009-03-01

    The coordinated expression of ribosomal protein genes (RPGs) has been well documented in many species. Previous analyses of RPG promoters focus only on Fungi and mammals. Recognizing this gap and using a comparative genomics approach, we utilize a motif-finding algorithm that incorporates cross-species conservation to identify several significant motifs in Drosophila RPG promoters. As a result, significant differences of the enriched motifs in RPG promoter are found among Drosophila, Fungi, and mammals, demonstrating the evolutionary dynamics of the ribosomal gene regulatory network. We also report a motif present in similar numbers of RPGs among Drosophila species which does not appear to be conserved at the individual RPG gene level. A module-wise stabilizing selection theory is proposed to explain this observation. Overall, our results provide significant insight into the fast-evolving nature of transcriptional regulation in the RPG module.

  14. Evolution of Drosophila ribosomal protein gene core promoters

    PubMed Central

    Ma, Xiaotu; Zhang, Kangyu; Li, Xiaoman

    2011-01-01

    The coordinated expression of ribosomal protein genes (RPGs) has been well documented in many species. Previous analyses of RPG promoters focus only on Fungi and mammals. Recognizing this gap and using a comparative genomics approach, we utilize a motif-finding algorithm that incorporates cross-species conservation to identify several significant motifs in Drosophila RPG promoters. As a result, significant differences of the enriched motifs in RPG promoter are found among Drosophila, Fungi, and mammals, demonstrating the evolutionary dynamics of the ribosomal gene regulatory network. We also report a motif present in similar numbers of RPGs among Drosophila species which does not appear to be conserved at the individual RPG gene level. A module-wise stabilizing selection theory is proposed to explain this observation. Overall, our results provide significant insight into the fast-evolving nature of transcriptional regulation in the RPG module. PMID:19059316

  15. A physical mechanism of cancer heterogeneity

    NASA Astrophysics Data System (ADS)

    Chen, Cong; Wang, Jin

    2016-02-01

    We studied a core cancer gene regulatory network motif to uncover possible source of cancer heterogeneity from epigenetic sources. When the time scale of the protein regulation to the gene is faster compared to the protein synthesis and degradation (adiabatic regime), normal state, cancer state and an intermediate premalignant state emerge. Due to the epigenetics such as DNA methylation and histone remodification, the time scale of the protein regulation to the gene can be slower or comparable to the protein synthesis and degradation (non-adiabatic regime). In this case, many more states emerge as possible phenotype alternations. This gives the origin of the heterogeneity. The cancer heterogeneity is reflected from the emergence of more phenotypic states, larger protein concentration fluctuations, wider kinetic distributions and multiplicity of kinetic paths from normal to cancer state, higher energy cost per gene switching, and weaker stability.

  16. Stability of ecological industry chain: an entropy model approach.

    PubMed

    Wang, Qingsong; Qiu, Shishou; Yuan, Xueliang; Zuo, Jian; Cao, Dayong; Hong, Jinglan; Zhang, Jian; Dong, Yong; Zheng, Ying

    2016-07-01

    A novel methodology is proposed in this study to examine the stability of ecological industry chain network based on entropy theory. This methodology is developed according to the associated dissipative structure characteristics, i.e., complexity, openness, and nonlinear. As defined in the methodology, network organization is the object while the main focus is the identification of core enterprises and core industry chains. It is proposed that the chain network should be established around the core enterprise while supplementation to the core industry chain helps to improve system stability, which is verified quantitatively. Relational entropy model can be used to identify core enterprise and core eco-industry chain. It could determine the core of the network organization and core eco-industry chain through the link form and direction of node enterprises. Similarly, the conductive mechanism of different node enterprises can be examined quantitatively despite the absence of key data. Structural entropy model can be employed to solve the problem of order degree for network organization. Results showed that the stability of the entire system could be enhanced by the supplemented chain around the core enterprise in eco-industry chain network organization. As a result, the sustainability of the entire system could be further improved.

  17. Neural networks within multi-core optic fibers

    PubMed Central

    Cohen, Eyal; Malka, Dror; Shemer, Amir; Shahmoon, Asaf; Zalevsky, Zeev; London, Michael

    2016-01-01

    Hardware implementation of artificial neural networks facilitates real-time parallel processing of massive data sets. Optical neural networks offer low-volume 3D connectivity together with large bandwidth and minimal heat production in contrast to electronic implementation. Here, we present a conceptual design for in-fiber optical neural networks. Neurons and synapses are realized as individual silica cores in a multi-core fiber. Optical signals are transferred transversely between cores by means of optical coupling. Pump driven amplification in erbium-doped cores mimics synaptic interactions. We simulated three-layered feed-forward neural networks and explored their capabilities. Simulations suggest that networks can differentiate between given inputs depending on specific configurations of amplification; this implies classification and learning capabilities. Finally, we tested experimentally our basic neuronal elements using fibers, couplers, and amplifiers, and demonstrated that this configuration implements a neuron-like function. Therefore, devices similar to our proposed multi-core fiber could potentially serve as building blocks for future large-scale small-volume optical artificial neural networks. PMID:27383911

  18. Neural networks within multi-core optic fibers.

    PubMed

    Cohen, Eyal; Malka, Dror; Shemer, Amir; Shahmoon, Asaf; Zalevsky, Zeev; London, Michael

    2016-07-07

    Hardware implementation of artificial neural networks facilitates real-time parallel processing of massive data sets. Optical neural networks offer low-volume 3D connectivity together with large bandwidth and minimal heat production in contrast to electronic implementation. Here, we present a conceptual design for in-fiber optical neural networks. Neurons and synapses are realized as individual silica cores in a multi-core fiber. Optical signals are transferred transversely between cores by means of optical coupling. Pump driven amplification in erbium-doped cores mimics synaptic interactions. We simulated three-layered feed-forward neural networks and explored their capabilities. Simulations suggest that networks can differentiate between given inputs depending on specific configurations of amplification; this implies classification and learning capabilities. Finally, we tested experimentally our basic neuronal elements using fibers, couplers, and amplifiers, and demonstrated that this configuration implements a neuron-like function. Therefore, devices similar to our proposed multi-core fiber could potentially serve as building blocks for future large-scale small-volume optical artificial neural networks.

  19. Dynamical analysis of cellular ageing by modeling of gene regulatory network based attractor landscape.

    PubMed

    Chong, Ket Hing; Zhang, Xiaomeng; Zheng, Jie

    2018-01-01

    Ageing is a natural phenomenon that is inherently complex and remains a mystery. Conceptual model of cellular ageing landscape was proposed for computational studies of ageing. However, there is a lack of quantitative model of cellular ageing landscape. This study aims to investigate the mechanism of cellular ageing in a theoretical model using the framework of Waddington's epigenetic landscape. We construct an ageing gene regulatory network (GRN) consisting of the core cell cycle regulatory genes (including p53). A model parameter (activation rate) is used as a measure of the accumulation of DNA damage. Using the bifurcation diagrams to estimate the parameter values that lead to multi-stability, we obtained a conceptual model for capturing three distinct stable steady states (or attractors) corresponding to homeostasis, cell cycle arrest, and senescence or apoptosis. In addition, we applied a Monte Carlo computational method to quantify the potential landscape, which displays: I) one homeostasis attractor for low accumulation of DNA damage; II) two attractors for cell cycle arrest and senescence (or apoptosis) in response to high accumulation of DNA damage. Using the Waddington's epigenetic landscape framework, the process of ageing can be characterized by state transitions from landscape I to II. By in silico perturbations, we identified the potential landscape of a perturbed network (inactivation of p53), and thereby demonstrated the emergence of a cancer attractor. The simulated dynamics of the perturbed network displays a landscape with four basins of attraction: homeostasis, cell cycle arrest, senescence (or apoptosis) and cancer. Our analysis also showed that for the same perturbed network with low DNA damage, the landscape displays only the homeostasis attractor. The mechanistic model offers theoretical insights that can facilitate discovery of potential strategies for network medicine of ageing-related diseases such as cancer.

  20. Modulation of dynamic modes by interplay between positive and negative feedback loops in gene regulatory networks

    NASA Astrophysics Data System (ADS)

    Wang, Liu-Suo; Li, Ning-Xi; Chen, Jing-Jia; Zhang, Xiao-Peng; Liu, Feng; Wang, Wei

    2018-04-01

    A positive and a negative feedback loop can induce bistability and oscillation, respectively, in biological networks. Nevertheless, they are frequently interlinked to perform more elaborate functions in many gene regulatory networks. Coupled positive and negative feedback loops may exhibit either oscillation or bistability depending on the intensity of the stimulus in some particular networks. It is less understood how the transition between the two dynamic modes is modulated by the positive and negative feedback loops. We developed an abstract model of such systems, largely based on the core p53 pathway, to explore the mechanism for the transformation of dynamic behaviors. Our results show that enhancing the positive feedback may promote or suppress oscillations depending on the strength of both feedback loops. We found that the system oscillates with low amplitudes in response to a moderate stimulus and switches to the on state upon a strong stimulus. When the positive feedback is activated much later than the negative one in response to a strong stimulus, the system exhibits long-term oscillations before switching to the on state. We explain this intriguing phenomenon using quasistatic approximation. Moreover, early switching to the on state may occur when the system starts from a steady state in the absence of stimuli. The interplay between the positive and negative feedback plays a key role in the transitions between oscillation and bistability. Of note, our conclusions should be applicable only to some specific gene regulatory networks, especially the p53 network, in which both oscillation and bistability exist in response to a certain type of stimulus. Our work also underscores the significance of transient dynamics in determining cellular outcome.

  1. Targeted interactomics reveals a complex core cell cycle machinery in Arabidopsis thaliana.

    PubMed

    Van Leene, Jelle; Hollunder, Jens; Eeckhout, Dominique; Persiau, Geert; Van De Slijke, Eveline; Stals, Hilde; Van Isterdael, Gert; Verkest, Aurine; Neirynck, Sandy; Buffel, Yelle; De Bodt, Stefanie; Maere, Steven; Laukens, Kris; Pharazyn, Anne; Ferreira, Paulo C G; Eloy, Nubia; Renne, Charlotte; Meyer, Christian; Faure, Jean-Denis; Steinbrenner, Jens; Beynon, Jim; Larkin, John C; Van de Peer, Yves; Hilson, Pierre; Kuiper, Martin; De Veylder, Lieven; Van Onckelen, Harry; Inzé, Dirk; Witters, Erwin; De Jaeger, Geert

    2010-08-10

    Cell proliferation is the main driving force for plant growth. Although genome sequence analysis revealed a high number of cell cycle genes in plants, little is known about the molecular complexes steering cell division. In a targeted proteomics approach, we mapped the core complex machinery at the heart of the Arabidopsis thaliana cell cycle control. Besides a central regulatory network of core complexes, we distinguished a peripheral network that links the core machinery to up- and downstream pathways. Over 100 new candidate cell cycle proteins were predicted and an in-depth biological interpretation demonstrated the hypothesis-generating power of the interaction data. The data set provided a comprehensive view on heterodimeric cyclin-dependent kinase (CDK)-cyclin complexes in plants. For the first time, inhibitory proteins of plant-specific B-type CDKs were discovered and the anaphase-promoting complex was characterized and extended. Important conclusions were that mitotic A- and B-type cyclins form complexes with the plant-specific B-type CDKs and not with CDKA;1, and that D-type cyclins and S-phase-specific A-type cyclins seem to be associated exclusively with CDKA;1. Furthermore, we could show that plants have evolved a combinatorial toolkit consisting of at least 92 different CDK-cyclin complex variants, which strongly underscores the functional diversification among the large family of cyclins and reflects the pivotal role of cell cycle regulation in the developmental plasticity of plants.

  2. Multiple network interface core apparatus and method

    DOEpatents

    Underwood, Keith D [Albuquerque, NM; Hemmert, Karl Scott [Albuquerque, NM

    2011-04-26

    A network interface controller and network interface control method comprising providing a single integrated circuit as a network interface controller and employing a plurality of network interface cores on the single integrated circuit.

  3. Genome-wide strategies identify downstream target genes of chick connective tissue-associated transcription factors.

    PubMed

    Orgeur, Mickael; Martens, Marvin; Leonte, Georgeta; Nassari, Sonya; Bonnin, Marie-Ange; Börno, Stefan T; Timmermann, Bernd; Hecht, Jochen; Duprez, Delphine; Stricker, Sigmar

    2018-03-29

    Connective tissues support organs and play crucial roles in development, homeostasis and fibrosis, yet our understanding of their formation is still limited. To gain insight into the molecular mechanisms of connective tissue specification, we selected five zinc-finger transcription factors - OSR1, OSR2, EGR1, KLF2 and KLF4 - based on their expression patterns and/or known involvement in connective tissue subtype differentiation. RNA-seq and ChIP-seq profiling of chick limb micromass cultures revealed a set of common genes regulated by all five transcription factors, which we describe as a connective tissue core expression set. This common core was enriched with genes associated with axon guidance and myofibroblast signature, including fibrosis-related genes. In addition, each transcription factor regulated a specific set of signalling molecules and extracellular matrix components. This suggests a concept whereby local molecular niches can be created by the expression of specific transcription factors impinging on the specification of local microenvironments. The regulatory network established here identifies common and distinct molecular signatures of limb connective tissue subtypes, provides novel insight into the signalling pathways governing connective tissue specification, and serves as a resource for connective tissue development. © 2018. Published by The Company of Biologists Ltd.

  4. Identification and functional analysis of a core gene module associated with hepatitis C virus-induced human hepatocellular carcinoma progression.

    PubMed

    Bai, Gaobo; Zheng, Wenling; Ma, Wenli

    2018-05-01

    Hepatitis C virus (HCV)-induced human hepatocellular carcinoma (HCC) progression may be due to a complex multi-step processes. The developmental mechanism of these processes is worth investigating for the prevention, diagnosis and therapy of HCC. The aim of the present study was to investigate the molecular mechanism underlying the progression of HCV-induced hepatocarcinogenesis. First, the dynamic gene module, consisting of key genes associated with progression between the normal stage and HCC, was identified using the Weighted Gene Co-expression Network Analysis tool from R language. By defining those genes in the module as seeds, the change of co-expression in differentially expressed gene sets in two consecutive stages of pathological progression was examined. Finally, interaction pairs of HCV viral proteins and their directly targeted proteins in the identified module were extracted from the literature and a comprehensive interaction dataset from yeast two-hybrid experiments. By combining the interactions between HCV and their targets, and protein-protein interactions in the Search Tool for the Retrieval of Interacting Genes database (STRING), the HCV-key genes interaction network was constructed and visualized using Cytoscape software 3.2. As a result, a module containing 44 key genes was identified to be associated with HCC progression, due to the dynamic features and functions of those genes in the module. Several important differentially co-expressed gene pairs were identified between non-HCC and HCC stages. In the key genes, cyclin dependent kinase 1 (CDK1), NDC80, cyclin A2 (CCNA2) and rac GTPase activating protein 1 (RACGAP1) were shown to be targeted by the HCV nonstructural proteins NS5A, NS3 and NS5B, respectively. The four genes perform an intermediary role between the HCV viral proteins and the dysfunctional module in the HCV key genes interaction network. These findings provided valuable information for understanding the mechanism of HCV-induced HCC progression and for seeking drug targets for the therapy and prevention of HCC.

  5. Role of extracytoplasmic function sigma factor PG1660 (RpoE) in the oxidative stress resistance regulatory network of Porphyromonas gingivalis

    PubMed Central

    Dou, Y.; Rutanhira, H.; Chen, X.; Mishra, A.; Wang, C.; Fletcher, H.M.

    2018-01-01

    Summary In Porphyromonas gingivalis, the protein PG1660, composed of 174 amino acids, is annotated as an extracytoplasmic function (ECF) sigma factor (RpoE homologue-σ24). Because PG1660 can modulate several virulence factors and responds to environmental signals in P. gingivalis, its genetic properties were evaluated. PG1660 is co-transcribed with its downstream gene PG1659, and the transcription start site was identified as adenine residue 54-nucleotides upstream of the ATG translation start codon. In addition to binding its own promoter, using the purified rPG1660 and RNAP core enzyme from Escherichia coli with the PG1660 promoter DNA as template, the function of PG1660 as a sigma factor was demonstrated in an in vitro transcription assay. Transcriptome analyses of a P. gingivalis PG1660-defective isogenic mutant revealed that under oxidative stress conditions 176 genes including genes involved in secondary metabolism were downregulated more than two-fold compared with the parental strain. The rPG1660 protein also showed the ability to bind to the promoters of the highly downregulated genes in the PG1660-deficient mutant. As the ECF sigma factor PG0162 has a 29% identity with PG1660 and can modulate its expression, the cross-talk between their regulatory networks was explored. The expression profile of the PG0162PG1660-deficient mutant (P. gingivalis FLL356) revealed that the type IX secretion system genes and several virulence genes were downregulated under hydrogen peroxide stress conditions. Taken together, we have confirmed that PG1660 can function as a sigma factor, and plays an important regulatory role in the oxidative stress and virulence regulatory network of P. gingivalis. PMID:29059500

  6. Social capital and core network ties: a validation study of individual-level social capital measures and their association with extra- and intra-neighborhood ties, and self-rated health.

    PubMed

    Moore, Spencer; Bockenholt, Ulf; Daniel, Mark; Frohlich, Katherine; Kestens, Yan; Richard, Lucie

    2011-03-01

    Research on social capital and health has assumed that measures of trust, participation, and perceived cohesion capture aspects of people's neighborhood social connections. This study uses data on the personal networks of 2707 Montreal adults in 300 different neighborhoods to examine the association of socio-demographic and social capital variables with the likelihood of having core ties, core neighborhood ties, and high self-rated health (SRH). Persons with higher household income were more likely to have core ties, but less likely to have core neighborhood ties. Persons with greater diversity in extra-neighborhood network capital were more likely to have core ties, and persons with greater diversity in intra-neighborhood network capital were more likely to have core neighborhood ties. Generalized trust, perceived neighborhood cohesion, and extra-neighborhood network diversity were shown associated with high SRH. Conventional measures of social capital may not capture network mechanisms. Findings suggest a critical appraisal of the mechanisms linking social capital and health, and the further delineation of network and psychosocial mechanisms in understanding these links. Copyright © 2010 Elsevier Ltd. All rights reserved.

  7. Interrogating the topological robustness of gene regulatory circuits by randomization

    PubMed Central

    Levine, Herbert; Onuchic, Jose N.

    2017-01-01

    One of the most important roles of cells is performing their cellular tasks properly for survival. Cells usually achieve robust functionality, for example, cell-fate decision-making and signal transduction, through multiple layers of regulation involving many genes. Despite the combinatorial complexity of gene regulation, its quantitative behavior has been typically studied on the basis of experimentally verified core gene regulatory circuitry, composed of a small set of important elements. It is still unclear how such a core circuit operates in the presence of many other regulatory molecules and in a crowded and noisy cellular environment. Here we report a new computational method, named random circuit perturbation (RACIPE), for interrogating the robust dynamical behavior of a gene regulatory circuit even without accurate measurements of circuit kinetic parameters. RACIPE generates an ensemble of random kinetic models corresponding to a fixed circuit topology, and utilizes statistical tools to identify generic properties of the circuit. By applying RACIPE to simple toggle-switch-like motifs, we observed that the stable states of all models converge to experimentally observed gene state clusters even when the parameters are strongly perturbed. RACIPE was further applied to a proposed 22-gene network of the Epithelial-to-Mesenchymal Transition (EMT), from which we identified four experimentally observed gene states, including the states that are associated with two different types of hybrid Epithelial/Mesenchymal phenotypes. Our results suggest that dynamics of a gene circuit is mainly determined by its topology, not by detailed circuit parameters. Our work provides a theoretical foundation for circuit-based systems biology modeling. We anticipate RACIPE to be a powerful tool to predict and decode circuit design principles in an unbiased manner, and to quantitatively evaluate the robustness and heterogeneity of gene expression. PMID:28362798

  8. In silico evolution of the Drosophila gap gene regulatory sequence under elevated mutational pressure.

    PubMed

    Chertkova, Aleksandra A; Schiffman, Joshua S; Nuzhdin, Sergey V; Kozlov, Konstantin N; Samsonova, Maria G; Gursky, Vitaly V

    2017-02-07

    Cis-regulatory sequences are often composed of many low-affinity transcription factor binding sites (TFBSs). Determining the evolutionary and functional importance of regulatory sequence composition is impeded without a detailed knowledge of the genotype-phenotype map. We simulate the evolution of regulatory sequences involved in Drosophila melanogaster embryo segmentation during early development. Natural selection evaluates gene expression dynamics produced by a computational model of the developmental network. We observe a dramatic decrease in the total number of transcription factor binding sites through the course of evolution. Despite a decrease in average sequence binding energies through time, the regulatory sequences tend towards organisations containing increased high affinity transcription factor binding sites. Additionally, the binding energies of separate sequence segments demonstrate ubiquitous mutual correlations through time. Fewer than 10% of initial TFBSs are maintained throughout the entire simulation, deemed 'core' sites. These sites have increased functional importance as assessed under wild-type conditions and their binding energy distributions are highly conserved. Furthermore, TFBSs within close proximity of core sites exhibit increased longevity, reflecting functional regulatory interactions with core sites. In response to elevated mutational pressure, evolution tends to sample regulatory sequence organisations with fewer, albeit on average, stronger functional transcription factor binding sites. These organisations are also shaped by the regulatory interactions among core binding sites with sites in their local vicinity.

  9. Landscape Genetics for the Empirical Assessment of Resistance Surfaces: The European Pine Marten (Martes martes) as a Target-Species of a Regional Ecological Network

    PubMed Central

    Ruiz-González, Aritz; Gurrutxaga, Mikel; Cushman, Samuel A.; Madeira, María José; Randi, Ettore; Gómez-Moliner, Benjamin J.

    2014-01-01

    Coherent ecological networks (EN) composed of core areas linked by ecological corridors are being developed worldwide with the goal of promoting landscape connectivity and biodiversity conservation. However, empirical assessment of the performance of EN designs is critical to evaluate the utility of these networks to mitigate effects of habitat loss and fragmentation. Landscape genetics provides a particularly valuable framework to address the question of functional connectivity by providing a direct means to investigate the effects of landscape structure on gene flow. The goals of this study are (1) to evaluate the landscape features that drive gene flow of an EN target species (European pine marten), and (2) evaluate the optimality of a regional EN design in providing connectivity for this species within the Basque Country (North Spain). Using partial Mantel tests in a reciprocal causal modeling framework we competed 59 alternative models, including isolation by distance and the regional EN. Our analysis indicated that the regional EN was among the most supported resistance models for the pine marten, but was not the best supported model. Gene flow of pine marten in northern Spain is facilitated by natural vegetation, and is resisted by anthropogenic landcover types and roads. Our results suggest that the regional EN design being implemented in the Basque Country will effectively facilitate gene flow of forest dwelling species at regional scale. PMID:25329047

  10. Core psychopathology in anorexia nervosa and bulimia nervosa: A network analysis.

    PubMed

    Forrest, Lauren N; Jones, Payton J; Ortiz, Shelby N; Smith, April R

    2018-04-25

    The cognitive-behavioral theory of eating disorders (EDs) proposes that shape and weight overvaluation are the core ED psychopathology. Core symptoms can be statistically identified using network analysis. Existing ED network studies support that shape and weight overvaluation are the core ED psychopathology, yet no studies have estimated AN core psychopathology and concerns exist about the replicability of network analysis findings. The current study estimated ED symptom networks among people with anorexia nervosa (AN) and bulimia nervosa (BN) and among a combined group of people with AN and BN. Participants were girls and women with AN (n = 604) and BN (n = 477) seeking residential ED treatment. ED symptoms were assessed with the Eating Disorder Examination-Questionnaire (EDE-Q); 27 of the EDE-Q items were included as nodes in symptom networks. Core symptoms were determined by expected influence and strength values. In all networks, desiring weight loss, restraint, shape and weight preoccupation, and shape overvaluation emerged as the most important symptoms. In addition, in the AN and combined networks, fearing weight gain emerged as an important symptom. In the BN network, weight overvaluation emerged as another important symptom. Findings support the cognitive-behavioral premise that shape and weight overvaluation are at the core of AN psychopathology. Our BN and combined network findings provide a high degree of replication of previous findings. Clinically, findings highlight the importance of considering shape and weight overvaluation as a severity specifier and primary treatment target for people with EDs. © 2018 Wiley Periodicals, Inc.

  11. Dissection of combinatorial control by the Met4 transcriptional complex.

    PubMed

    Lee, Traci A; Jorgensen, Paul; Bognar, Andrew L; Peyraud, Caroline; Thomas, Dominique; Tyers, Mike

    2010-02-01

    Met4 is the transcriptional activator of the sulfur metabolic network in Saccharomyces cerevisiae. Lacking DNA-binding ability, Met4 must interact with proteins called Met4 cofactors to target promoters for transcription. Two types of DNA-binding cofactors (Cbf1 and Met31/Met32) recruit Met4 to promoters and one cofactor (Met28) stabilizes the DNA-bound Met4 complexes. To dissect this combinatorial system, we systematically deleted each category of cofactor(s) and analyzed Met4-activated transcription on a genome-wide scale. We defined a core regulon for Met4, consisting of 45 target genes. Deletion of both Met31 and Met32 eliminated activation of the core regulon, whereas loss of Met28 or Cbf1 interfered with only a subset of targets that map to distinct sectors of the sulfur metabolic network. These transcriptional dependencies roughly correlated with the presence of Cbf1 promoter motifs. Quantitative analysis of in vivo promoter binding properties indicated varying levels of cooperativity and interdependency exists between members of this combinatorial system. Cbf1 was the only cofactor to remain fully bound to target promoters under all conditions, whereas other factors exhibited different degrees of regulated binding in a promoter-specific fashion. Taken together, Met4 cofactors use a variety of mechanisms to allow differential transcription of target genes in response to various cues.

  12. Non-Ischemic Heart Failure With Reduced Ejection Fraction Is Associated With Altered Intestinal Microbiota.

    PubMed

    Katsimichas, Themistoklis; Ohtani, Tomohito; Motooka, Daisuke; Tsukamoto, Yasumasa; Kioka, Hidetaka; Nakamoto, Kei; Konishi, Shozo; Chimura, Misato; Sengoku, Kaoruko; Miyawaki, Hiroshi; Sakaguchi, Taiki; Okumura, Ryu; Theofilis, Konstantinos; Iida, Tetsuya; Takeda, Kiyoshi; Nakamura, Shota; Sakata, Yasushi

    2018-05-25

    Research suggests that heart failure with reduced ejection fraction (HFrEF) is a state of systemic inflammation that may be triggered by microbial products passing into the bloodstream through a compromised intestinal barrier. However, whether the intestinal microbiota exhibits dysbiosis in HFrEF patients is largely unknown.Methods and Results:Twenty eight non-ischemic HFrEF patients and 19 healthy controls were assessed by 16S rRNA analysis of bacterial DNA extracted from stool samples. After processing of sequencing data, bacteria were taxonomically classified, diversity indices were used to examine microbial ecology, and relative abundances of common core genera were compared between groups. Furthermore, we predicted gene carriage for bacterial metabolic pathways and inferred microbial interaction networks on multiple taxonomic levels.Bacterial communities of both groups were dominated by the Firmicutes and Bacteroidetes phyla. The most abundant genus in both groups wasBacteroides. Although α diversity did not differ between groups, ordination by β diversity metrics revealed a separation of the groups across components of variation.StreptococcusandVeillonellawere enriched in the common core microbiota of patients, whileSMB53was depleted. Gene families in amino acid, carbohydrate, vitamin, and xenobiotic metabolism showed significant differences between groups. Interaction networks revealed a higher degree of correlations between bacteria in patients. Non-ischemic HFrEF patients exhibited multidimensional differences in intestinal microbial communities compared with healthy subjects.

  13. RNA-Seq and Gene Network Analysis Uncover Activation of an ABA-Dependent Signalosome During the Cork Oak Root Response to Drought

    PubMed Central

    Magalhães, Alexandre P.; Verde, Nuno; Reis, Francisca; Martins, Inês; Costa, Daniela; Lino-Neto, Teresa; Castro, Pedro H.; Tavares, Rui M.; Azevedo, Herlânder

    2016-01-01

    Quercus suber (cork oak) is a West Mediterranean species of key economic interest, being extensively explored for its ability to generate cork. Like other Mediterranean plants, Q. suber is significantly threatened by climatic changes, imposing the need to quickly understand its physiological and molecular adaptability to drought stress imposition. In the present report, we uncovered the differential transcriptome of Q. suber roots exposed to long-term drought, using an RNA-Seq approach. 454-sequencing reads were used to de novo assemble a reference transcriptome, and mapping of reads allowed the identification of 546 differentially expressed unigenes. These were enriched in both effector genes (e.g., LEA, chaperones, transporters) as well as regulatory genes, including transcription factors (TFs) belonging to various different classes, and genes associated with protein turnover. To further extend functional characterization, we identified the orthologs of differentially expressed unigenes in the model species Arabidopsis thaliana, which then allowed us to perform in silico functional inference, including gene network analysis for protein function, protein subcellular localization and gene co-expression, and in silico enrichment analysis for TFs and cis-elements. Results indicated the existence of extensive transcriptional regulatory events, including activation of ABA-responsive genes and ABF-dependent signaling. We were then able to establish that a core ABA-signaling pathway involving PP2C-SnRK2-ABF components was induced in stressed Q. suber roots, identifying a key mechanism in this species’ response to drought. PMID:26793200

  14. The common transcriptional subnetworks of the grape berry skin in the late stages of ripening.

    PubMed

    Ghan, Ryan; Petereit, Juli; Tillett, Richard L; Schlauch, Karen A; Toubiana, David; Fait, Aaron; Cramer, Grant R

    2017-05-30

    Wine grapes are important economically in many countries around the world. Defining the optimum time for grape harvest is a major challenge to the grower and winemaker. Berry skins are an important source of flavor, color and other quality traits in the ripening stage. Senescent-like processes such as chloroplast disorganization and cell death characterize the late ripening stage. To better understand the molecular and physiological processes involved in the late stages of berry ripening, RNA-seq analysis of the skins of seven wine grape cultivars (Cabernet Franc, Cabernet Sauvignon, Merlot, Pinot Noir, Chardonnay, Sauvignon Blanc and Semillon) was performed. RNA-seq analysis identified approximately 2000 common differentially expressed genes for all seven cultivars across four different berry sugar levels (20 to 26 °Brix). Network analyses, both a posteriori (standard) and a priori (gene co-expression network analysis), were used to elucidate transcriptional subnetworks and hub genes associated with traits in the berry skins of the late stages of berry ripening. These independent approaches revealed genes involved in photosynthesis, catabolism, and nucleotide metabolism. The transcript abundance of most photosynthetic genes declined with increasing sugar levels in the berries. The transcript abundance of other processes increased such as nucleic acid metabolism, chromosome organization and lipid catabolism. Weighted gene co-expression network analysis (WGCNA) identified 64 gene modules that were organized into 12 subnetworks of three modules or more and six higher order gene subnetworks. Some gene subnetworks were highly correlated with sugar levels and some subnetworks were highly enriched in the chloroplast and nucleus. The petal R package was utilized independently to construct a true small-world and scale-free complex gene co-expression network model. A subnetwork of 216 genes with the highest connectivity was elucidated, consistent with the module results from WGCNA. Hub genes in these subnetworks were identified including numerous members of the core circadian clock, RNA splicing, proteolysis and chromosome organization. An integrated model was constructed linking light sensing with alternative splicing, chromosome remodeling and the circadian clock. A common set of differentially expressed genes and gene subnetworks from seven different cultivars were examined in the skin of the late stages of grapevine berry ripening. A densely connected gene subnetwork was elucidated involving a complex interaction of berry senescent processes (autophagy), catabolism, the circadian clock, RNA splicing, proteolysis and epigenetic regulation. Hypotheses were induced from these data sets involving sugar accumulation, light, autophagy, epigenetic regulation, and fruit development. This work provides a better understanding of berry development and the transcriptional processes involved in the late stages of ripening.

  15. Placental gene-expression profiles of intrahepatic cholestasis of pregnancy reveal involvement of multiple molecular pathways in blood vessel formation and inflammation.

    PubMed

    Du, QiaoLing; Pan, YouDong; Zhang, YouHua; Zhang, HaiLong; Zheng, YaJuan; Lu, Ling; Wang, JunLei; Duan, Tao; Chen, JianFeng

    2014-07-07

    Intrahepatic cholestasis of pregnancy (ICP) is a pregnancy-associated liver disease with potentially deleterious consequences for the fetus, particularly when maternal serum bile-acid concentration >40 μM. However, the etiology and pathogenesis of ICP remain elusive. To reveal the underlying molecular mechanisms for the association of maternal serum bile-acid level and fetal outcome in ICP patients, DNA microarray was applied to characterize the whole-genome expression profiles of placentas from healthy women and women diagnosed with ICP. Thirty pregnant women recruited in this study were categorized evenly into three groups: healthy group; mild ICP, with serum bile-acid concentration ranging from 10-40 μM; and severe ICP, with bile-acid concentration >40 μM. Gene Ontology analysis in combination with construction of gene-interaction and gene co-expression networks were applied to identify the core regulatory genes associated with ICP pathogenesis, which were further validated by quantitative real-time PCR and histological staining. The core regulatory genes were mainly involved in immune response, VEGF signaling pathway and G-protein-coupled receptor signaling, implying essential roles of immune response, vasculogenesis and angiogenesis in ICP pathogenesis. This implication was supported by the observed aggregated immune-cell infiltration and deficient blood vessel formation in ICP placentas. Our study provides a system-level insight into the placental gene-expression profiles of women with mild or severe ICP, and reveals multiple molecular pathways in immune response and blood vessel formation that might contribute to ICP pathogenesis.

  16. Analysis of core-periphery organization in protein contact networks reveals groups of structurally and functionally critical residues.

    PubMed

    Isaac, Arnold Emerson; Sinha, Sitabhra

    2015-10-01

    The representation of proteins as networks of interacting amino acids, referred to as protein contact networks (PCN), and their subsequent analyses using graph theoretic tools, can provide novel insights into the key functional roles of specific groups of residues. We have characterized the networks corresponding to the native states of 66 proteins (belonging to different families) in terms of their core-periphery organization. The resulting hierarchical classification of the amino acid constituents of a protein arranges the residues into successive layers - having higher core order - with increasing connection density, ranging from a sparsely linked periphery to a densely intra-connected core (distinct from the earlier concept of protein core defined in terms of the three-dimensional geometry of the native state, which has least solvent accessibility). Our results show that residues in the inner cores are more conserved than those at the periphery. Underlining the functional importance of the network core, we see that the receptor sites for known ligand molecules of most proteins occur in the innermost core. Furthermore, the association of residues with structural pockets and cavities in binding or active sites increases with the core order. From mutation sensitivity analysis, we show that the probability of deleterious or intolerant mutations also increases with the core order. We also show that stabilization centre residues are in the innermost cores, suggesting that the network core is critically important in maintaining the structural stability of the protein. A publicly available Web resource for performing core-periphery analysis of any protein whose native state is known has been made available by us at http://www.imsc.res.in/ ~sitabhra/proteinKcore/index.html.

  17. Systems biology definition of the core proteome of metabolism and expression is consistent with high-throughput data.

    PubMed

    Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O

    2015-08-25

    Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.

  18. Intratumoral and Intertumoral Genomic Heterogeneity of Multifocal Localized Prostate Cancer Impacts Molecular Classifications and Genomic Prognosticators

    PubMed Central

    Wei, Lei; Wang, Jianmin; Lampert, Erika; Schlanger, Simon; DePriest, Adam D.; Hu, Qiang; Gomez, Eduardo Cortes; Murakam, Mitsuko; Glenn, Sean T.; Conroy, Jeffrey; Morrison, Carl; Azabdaftari, Gissou; Mohler, James L.; Liu, Song; Heemers, Hannelore V.

    2018-01-01

    Background Next-generation sequencing is revealing genomic heterogeneity in localized prostate cancer (CaP). Incomplete sampling of CaP multiclonality has limited the implications for molecular subtyping, stratification, and systemic treatment. Objective To determine the impact of genomic and transcriptomic diversity within and among intraprostatic CaP foci on CaP molecular taxonomy, predictors of progression, and actionable therapeutic targets. Design, setting, and participants Four consecutive patients with clinically localized National Comprehensive Cancer Network intermediate- or high-risk CaP who did not receive neoadjuvant therapy underwent radical prostatectomy at Roswell Park Cancer Institute in June–July 2014. Presurgical information on CaP content and a customized tissue procurement procedure were used to isolate nonmicroscopic and noncontiguous CaP foci in radical prostatectomy specimens. Three cores were obtained from the index lesion and one core from smaller lesions. RNA and DNA were extracted simultaneously from 26 cores with ≥90% CaP content and analyzed using whole-exome sequencing, single-nucleotide polymorphism arrays, and RNA sequencing. Outcome measurements and statistical analysis Somatic mutations, copy number alternations, gene expression, gene fusions, and phylogeny were defined. The impact of genomic alterations on CaP molecular classification, gene sets measured in Oncotype DX, Prolaris, and Decipher assays, and androgen receptor activity among CaP cores was determined. Results and limitations There was considerable variability in genomic alterations among CaP cores, and between RNA- and DNA-based platforms. Heterogeneity was found in molecular grouping of individual CaP foci and the activity of gene sets underlying the assays for risk stratification and androgen receptor activity, and was validated in independent genomic data sets. Determination of the implications for clinical decision-making requires follow-up studies. Conclusions Genomic make-up varies widely among CaP foci, so care should be taken when making treatment decisions based on a single biopsy or index lesions. Patient summary We examined the molecular composition of individual cancers in a patient’s prostate. We found a lot of genetic diversity among these cancers, and concluded that information from a single cancer biopsy is not sufficient to guide treatment decisions. PMID:27451135

  19. Core networks and their reconfiguration patterns across cognitive loads.

    PubMed

    Zuo, Nianming; Yang, Zhengyi; Liu, Yong; Li, Jin; Jiang, Tianzi

    2018-04-20

    Different cognitively demanding tasks recruit globally distributed but functionally specific networks. However, the configuration of core networks and their reconfiguration patterns across cognitive loads remain unclear, as does whether these patterns are indicators for the performance of cognitive tasks. In this study, we analyzed functional magnetic resonance imaging data of a large cohort of 448 subjects, acquired with the brain at resting state and executing N-back working memory (WM) tasks. We discriminated core networks by functional interaction strength and connection flexibility. Results demonstrated that the frontoparietal network (FPN) and default mode network (DMN) were core networks, but each exhibited different patterns across cognitive loads. The FPN and DMN both showed strengthened internal connections at the low demand state (0-back) compared with the resting state (control level); whereas, from the low (0-back) to high demand state (2-back), some connections to the FPN weakened and were rewired to the DMN (whose connections all remained strong). Of note, more intensive reconfiguration of both the whole brain and core networks (but no other networks) across load levels indicated relatively poor cognitive performance. Collectively these findings indicate that the FPN and DMN have distinct roles and reconfiguration patterns across cognitively demanding loads. This study advances our understanding of the core networks and their reconfiguration patterns across cognitive loads and provides a new feature to evaluate and predict cognitive capability (e.g., WM performance) based on brain networks. © 2018 Wiley Periodicals, Inc.

  20. Shared effects of the clusterin gene on the default mode network among individuals at risk for Alzheimer's disease.

    PubMed

    Ye, Qing; Su, Fan; Shu, Hao; Gong, Liang; Xie, Chun-Ming; Zhou, Hong; Zhang, Zhi-Jun; Bai, Feng

    2017-05-01

    To explore the common effects of the clusterin (CLU) rs11136000 variant on the default mode network (DMN) in amnestic mild cognitive impairment (aMCI) subjects and remitted geriatric depression (RGD) subjects. Fifty-one aMCI subjects, 38 RGD subjects, and 64 cognitively normal elderly subjects underwent resting-state fMRI scans and neuropsychological tests at both baseline and a 35-month follow-up. Posterior cingulate cortex seed-based functional connectivity (FC) analysis was used to obtain the DMN patterns. A CLU gene×disease×time interaction for aMCI subjects was mainly detected in the core cortical midline structures of the DMN, and the interaction for RGD subjects was mainly detected in the limbic system. However, they overlapped in two frontal regions, where consistent effects of the CLU gene on FC alterations were found between aMCI and RGD groups. Furthermore, the alterations of FC with frontal, parietal, and limbic regions compensated for episodic memory impairments in CLU-CT/TT carriers, while no such compensation was found in CLU-CC carriers. The CLU gene could consistently affect the DMN FC with frontal regions among individuals at risk for Alzheimer's disease, and the CLU-T allele was associated with more compensatory neural processes in DMN changes. © 2017 John Wiley & Sons Ltd.

  1. Responses of Pathogenic and Nonpathogenic Yeast Species to Steroids Reveal the Functioning and Evolution of Multidrug Resistance Transcriptional Networks▿ †

    PubMed Central

    Banerjee, Dibyendu; Lelandais, Gaelle; Shukla, Sudhanshu; Mukhopadhyay, Gauranga; Jacq, Claude; Devaux, Frederic; Prasad, Rajendra

    2008-01-01

    Steroids are known to induce pleiotropic drug resistance states in hemiascomycetes, with tremendous potential consequences for human fungal infections. Our analysis of gene expression in Saccharomyces cerevisiae and Candida albicans cells subjected to three different concentrations of progesterone revealed that their pleiotropic drug resistance (PDR) networks were strikingly sensitive to steroids. In S. cerevisiae, 20 of the Pdr1p/Pdr3p target genes, including PDR3 itself, were rapidly induced by progesterone, which mimics the effects of PDR1 gain-of-function alleles. This unique property allowed us to decipher the respective roles of Pdr1p and Pdr3p in PDR induction and to define functional modules among their target genes. Although the expression profiles of the major PDR transporters encoding genes ScPDR5 and CaCDR1 were similar, the S. cerevisiae global PDR response to progesterone was only partly conserved in C. albicans. In particular, the role of Tac1p, the main C. albicans PDR regulator, in the progesterone response was apparently restricted to five genes. These results suggest that the C. albicans and S. cerevisiae PDR networks, although sharing a conserved core regarding the regulation of membrane properties, have different structures and properties. Additionally, our data indicate that other as yet undiscovered regulators may second Tac1p in the C. albicans drug response. PMID:17993571

  2. Comparison of Long Noncoding RNA and mRNA Expression Profiles in Mesenchymal Stem Cells Derived from Human Periodontal Ligament and Bone Marrow

    PubMed Central

    Dong, Rui; Du, Juan; Wang, Liping; Wang, Jinsong; Ding, Gang; Wang, Songlin; Fan, Zhipeng

    2014-01-01

    Mesenchymal stem cells (MSCs) in different anatomic locations possess diverse biological activities. Maintaining the pluripotent state and differentiation depend on the expression and regulation of thousands of genes, but it remains unclear which molecular mechanisms underlie MSC diversity. Thus, potential MSC applications are restricted. Long noncoding RNAs (lncRNAs) are implicated in the complex molecular circuitry of cellular processes. We investigated differences in lncRNA and mRNA expression profiles between bone marrow stem cells (BMSCs) and periodontal ligament stem cells (PDLSCs) with lncRNA microarray assays and bioinformatics analysis. In PDLSCs, numerous lncRNAs were significantly upregulated (n = 457) or downregulated (n = 513) compared to BMSCs. Furthermore, 1,578 mRNAs were differentially expressed. These genes implicated cellular pathways that may be associated with MSC characteristics, including apoptosis, MAPK, cell cycle, and Wnt signaling pathway. Signal-net analysis indicated that phospholipase C beta 4, filamin B beta, calcium/calmodulin-dependent protein kinase II gamma, and the ionotropic glutamate receptor, AMPA 1, had the highest betweenness centrality among significant genes in the differential gene profile network. A comparison between the coding-noncoding gene coexpression networks of PDLSCs and BMSCs identified chemokine (C-X-C motif) ligand 12 as a core regulatory factor in MSC biology. These results provided insight into the mechanisms underlying MSC biology. PMID:24790996

  3. Protecting core networks with dual-homing: A study on enhanced network availability, resource efficiency, and energy-savings

    NASA Astrophysics Data System (ADS)

    Abeywickrama, Sandu; Furdek, Marija; Monti, Paolo; Wosinska, Lena; Wong, Elaine

    2016-12-01

    Core network survivability affects the reliability performance of telecommunication networks and remains one of the most important network design considerations. This paper critically examines the benefits arising from utilizing dual-homing in the optical access networks to provide resource-efficient protection against link and node failures in the optical core segment. Four novel, heuristic-based RWA algorithms that provide dedicated path protection in networks with dual-homing are proposed and studied. These algorithms protect against different failure scenarios (i.e. single link or node failures) and are implemented with different optimization objectives (i.e., minimization of wavelength usage and path length). Results obtained through simulations and comparison with baseline architectures indicate that exploiting dual-homed architecture in the access segment can bring significant improvements in terms of core network resource usage, connection availability, and power consumption.

  4. Evolution of the metabolic and regulatory networks associated with oxygen availability in two phytopathogenic enterobacteria

    PubMed Central

    2012-01-01

    Background Dickeya dadantii and Pectobacterium atrosepticum are phytopathogenic enterobacteria capable of facultative anaerobic growth in a wide range of O2 concentrations found in plant and natural environments. The transcriptional response to O2 remains under-explored for these and other phytopathogenic enterobacteria although it has been well characterized for animal-associated genera including Escherichia coli and Salmonella enterica. Knowledge of the extent of conservation of the transcriptional response across orthologous genes in more distantly related species is useful to identify rates and patterns of regulon evolution. Evolutionary events such as loss and acquisition of genes by lateral transfer events along each evolutionary branch results in lineage-specific genes, some of which may have been subsequently incorporated into the O2-responsive stimulon. Here we present a comparison of transcriptional profiles measured using densely tiled oligonucleotide arrays for two phytopathogens, Dickeya dadantii 3937 and Pectobacterium atrosepticum SCRI1043, grown to mid-log phase in MOPS minimal medium (0.1% glucose) with and without O2. Results More than 7% of the genes of each phytopathogen are differentially expressed with greater than 3-fold changes under anaerobic conditions. In addition to anaerobic metabolism genes, the O2 responsive stimulon includes a variety of virulence and pathogenicity-genes. Few of these genes overlap with orthologous genes in the anaerobic stimulon of E. coli. We define these as the conserved core, in which the transcriptional pattern as well as genetic architecture are well preserved. This conserved core includes previously described anaerobic metabolic pathways such as fermentation. Other components of the anaerobic stimulon show variation in genetic content, genome architecture and regulation. Notably formate metabolism, nitrate/nitrite metabolism, and fermentative butanediol production, differ between E. coli and the phytopathogens. Surprisingly, the overlap of the anaerobic stimulon between the phytopathogens is also relatively small considering that they are closely related, occupy similar niches and employ similar strategies to cause disease. There are cases of interesting divergences in the pattern of transcription of genes between Dickeya and Pectobacterium for virulence-associated subsystems including the type VI secretion system (T6SS), suggesting that fine-tuning of the stimulon impacts interaction with plants or competing microbes. Conclusions The small number of genes (an even smaller number if we consider operons) comprising the conserved core transcriptional response to O2 limitation demonstrates the extent of regulatory divergence prevalent in the Enterobacteriaceae. Our orthology-driven comparative transcriptomics approach indicates that the adaptive response in the eneterobacteria is a result of interaction of core (regulators) and lineage-specific (structural and regulatory) genes. Our subsystems based approach reveals that similar phenotypic outcomes are sometimes achieved by each organism using different genes and regulatory strategies. PMID:22439737

  5. Core signaling pathways in ovarian cancer stem cell revealed by integrative analysis of multi-marker genomics data.

    PubMed

    Zhang, Tianyu; Xu, Jielin; Deng, Siyuan; Zhou, Fengqi; Li, Jin; Zhang, Liwei; Li, Lang; Wang, Qi-En; Li, Fuhai

    2018-01-01

    Tumor recurrence occurs in more than 70% of ovarian cancer patients, and the majority eventually becomes refractory to treatments. Ovarian Cancer Stem Cells (OCSCs) are believed to be responsible for the tumor relapse and drug resistance. Therefore, eliminating ovarian CSCs is important to improve the prognosis of ovarian cancer patients. However, there is a lack of effective drugs to eliminate OCSCs because the core signaling pathways regulating OCSCs remain unclear. Also it is often hard for biologists to identify a few testable targets and infer driver signaling pathways regulating CSCs from a large number of differentially expression genes in an unbiased manner. In this study, we propose a straightforward and integrative analysis to identify potential core signaling pathways of OCSCs by integrating transcriptome data of OCSCs isolated based on two distinctive markers, ALDH and side population, with regulatory network (Transcription Factor (TF) and Target Interactome) and signaling pathways. We first identify the common activated TFs in two OCSC populations integrating the gene expression and TF-target Interactome; and then uncover up-stream signaling cascades regulating the activated TFs. In specific, 22 activated TFs are identified. Through literature search validation, 15 of them have been reported in association with cancer stem cells. Additionally, 10 TFs are found in the KEGG signaling pathways, and their up-stream signaling cascades are extracted, which also provide potential treatment targets. Moreover, 40 FDA approved drugs are identified to target on the up-stream signaling cascades, and 15 of them have been reported in literatures in cancer stem cell treatment. In conclusion, the proposed approach can uncover the activated up-stream signaling, activated TFs and up-regulated target genes that constitute the potential core signaling pathways of ovarian CSC. Also drugs and drug combinations targeting on the core signaling pathways might be able to eliminate OCSCs. The proposed approach can also be applied for identifying potential activated signaling pathways of other types of cancers.

  6. Targeted interactomics reveals a complex core cell cycle machinery in Arabidopsis thaliana

    PubMed Central

    Van Leene, Jelle; Hollunder, Jens; Eeckhout, Dominique; Persiau, Geert; Van De Slijke, Eveline; Stals, Hilde; Van Isterdael, Gert; Verkest, Aurine; Neirynck, Sandy; Buffel, Yelle; De Bodt, Stefanie; Maere, Steven; Laukens, Kris; Pharazyn, Anne; Ferreira, Paulo C G; Eloy, Nubia; Renne, Charlotte; Meyer, Christian; Faure, Jean-Denis; Steinbrenner, Jens; Beynon, Jim; Larkin, John C; Van de Peer, Yves; Hilson, Pierre; Kuiper, Martin; De Veylder, Lieven; Van Onckelen, Harry; Inzé, Dirk; Witters, Erwin; De Jaeger, Geert

    2010-01-01

    Cell proliferation is the main driving force for plant growth. Although genome sequence analysis revealed a high number of cell cycle genes in plants, little is known about the molecular complexes steering cell division. In a targeted proteomics approach, we mapped the core complex machinery at the heart of the Arabidopsis thaliana cell cycle control. Besides a central regulatory network of core complexes, we distinguished a peripheral network that links the core machinery to up- and downstream pathways. Over 100 new candidate cell cycle proteins were predicted and an in-depth biological interpretation demonstrated the hypothesis-generating power of the interaction data. The data set provided a comprehensive view on heterodimeric cyclin-dependent kinase (CDK)–cyclin complexes in plants. For the first time, inhibitory proteins of plant-specific B-type CDKs were discovered and the anaphase-promoting complex was characterized and extended. Important conclusions were that mitotic A- and B-type cyclins form complexes with the plant-specific B-type CDKs and not with CDKA;1, and that D-type cyclins and S-phase-specific A-type cyclins seem to be associated exclusively with CDKA;1. Furthermore, we could show that plants have evolved a combinatorial toolkit consisting of at least 92 different CDK–cyclin complex variants, which strongly underscores the functional diversification among the large family of cyclins and reflects the pivotal role of cell cycle regulation in the developmental plasticity of plants. PMID:20706207

  7. Two-component signal transduction systems of Xanthomonas spp.: a lesson from genomics.

    PubMed

    Qian, Wei; Han, Zhong-Ji; He, Chaozu

    2008-02-01

    The two-component signal transduction systems (TCSTSs), consisting of a histidine kinase sensor (HK) and a response regulator (RR), are the dominant molecular mechanisms by which prokaryotes sense and respond to environmental stimuli. Genomes of Xanthomonas generally contain a large repertoire of TCSTS genes (approximately 92 to 121 for each genome), which encode diverse structural groups of HKs and RRs. Among them, although a core set of 70 TCSTS genes (about two-thirds in total) which accumulates point mutations with a slow rate are shared by these genomes, the other genes, especially hybrid HKs, experienced extensive genetic recombination, including genomic rearrangement, gene duplication, addition or deletion, and fusion or fission. The recombinations potentially promote the efficiency and complexity of TCSTSs in regulating gene expression. In addition, our analysis suggests that a co-evolutionary model, rather than a selfish operon model, is the major mechanism for the maintenance and microevolution of TCSTS genes in the genomes of Xanthomonas. Genomic annotation, secondary protein structure prediction, and comparative genomic analyses of TCSTS genes reviewed here provide insights into our understanding of signal networks in these important phytopathogenic bacteria.

  8. Behavior-specific changes in transcriptional modules lead to distinct and predictable neurogenomic states

    PubMed Central

    Chandrasekaran, Sriram; Ament, Seth A.; Eddy, James A.; Rodriguez-Zas, Sandra L.; Schatz, Bruce R.; Price, Nathan D.; Robinson, Gene E.

    2011-01-01

    Using brain transcriptomic profiles from 853 individual honey bees exhibiting 48 distinct behavioral phenotypes in naturalistic contexts, we report that behavior-specific neurogenomic states can be inferred from the coordinated action of transcription factors (TFs) and their predicted target genes. Unsupervised hierarchical clustering of these transcriptomic profiles showed three clusters that correspond to three ecologically important behavioral categories: aggression, maturation, and foraging. To explore the genetic influences potentially regulating these behavior-specific neurogenomic states, we reconstructed a brain transcriptional regulatory network (TRN) model. This brain TRN quantitatively predicts with high accuracy gene expression changes of more than 2,000 genes involved in behavior, even for behavioral phenotypes on which it was not trained, suggesting that there is a core set of TFs that regulates behavior-specific gene expression in the bee brain, and other TFs more specific to particular categories. TFs playing key roles in the TRN include well-known regulators of neural and behavioral plasticity, e.g., Creb, as well as TFs better known in other biological contexts, e.g., NF-κB (immunity). Our results reveal three insights concerning the relationship between genes and behavior. First, distinct behaviors are subserved by distinct neurogenomic states in the brain. Second, the neurogenomic states underlying different behaviors rely upon both shared and distinct transcriptional modules. Third, despite the complexity of the brain, simple linear relationships between TFs and their putative target genes are a surprisingly prominent feature of the networks underlying behavior. PMID:21960440

  9. Low-abundant bacteria drive compositional changes in the gut microbiota after dietary alteration.

    PubMed

    Benjamino, Jacquelynn; Lincoln, Stephen; Srivastava, Ranjan; Graf, Joerg

    2018-05-10

    As the importance of beneficial bacteria is better recognized, understanding the dynamics of symbioses becomes increasingly crucial. In many gut symbioses, it is essential to understand whether changes in host diet play a role in the persistence of the bacterial gut community. In this study, termites were fed six dietary sources and the microbial community was monitored over a 49-day period using 16S rRNA gene sequencing. A deep backpropagation artificial neural network (ANN) was used to learn how the six different lignocellulose food sources affected the temporal composition of the hindgut microbiota of the termite as well as taxon-taxon and taxon-substrate interactions. Shifts in the termite gut microbiota after diet change in each colony were observed using 16S rRNA gene sequencing and beta diversity analyses. The artificial neural network accurately predicted the relative abundances of taxa at random points in the temporal study and showed that low-abundant taxa maintain community driving correlations in the hindgut. This combinatorial approach utilizing 16S rRNA gene sequencing and deep learning revealed that low-abundant bacteria that often do not belong to the core community are drivers of the termite hindgut bacterial community composition.

  10. Control, responses and modularity of cellular regulatory networks: a control analysis perspective.

    PubMed

    Bruggeman, F J; Snoep, J L; Westerhoff, H V

    2008-11-01

    Cells adapt to changes in environmental conditions through the concerted action of signalling, gene expression and metabolic subsystems. The authors will discuss a theoretical framework addressing such integrated systems. This 'hierarchical analysis' was first developed as an extension to a metabolic control analysis. It builds on the phenomenon that often the communication between signalling, gene expression and metabolic subsystems is almost exclusively via regulatory interactions and not via mass flow interactions. This allows for the treatment of the said subsystems as 'levels' in a hierarchical view of the organisation of the molecular reaction network of cells. Such a hierarchical approach has as a major advantage that levels can be analysed conceptually in isolation of each other (from a local intra-level perspective) and at a later stage integrated via their interactions (from a global inter-level perspective). Hereby, it allows for a modular approach with variable scope. A number of different approaches have been developed for the analysis of hierarchical systems, for example hierarchical control analysis and modular response analysis. The authors, here, review these methods and illustrate the strength of these types of analyses using a core model of a system with gene expression, metabolic and signal transduction levels.

  11. The pangenome of the genus Clostridium.

    PubMed

    Udaondo, Zulema; Duque, Estrella; Ramos, Juan-Luis

    2017-07-01

    The pangenome for the genus Clostridium sensu stricto, which was obtained using highly curated and annotated genomes from 16 species is presented; some of these cause disease, while others are used for the production of added-value chemicals. Multilocus sequencing analysis revealed that species of this genus group into at least two clades that include non-pathogenic and pathogenic strains, suggesting that pathogenicity is dispersed across the phylogenetic tree. The core genome of the genus includes 546 protein families, which mainly comprise those involved in protein translation and DNA repair. The GS-GOGAT may represent the central pathway for generating organic nitrogen from inorganic nitrogen sources. Glycerol and glucose metabolism genes are well represented in the core genome together with a set of energy conservation systems. A metabolic network comprising proteins/enzymes, RNAs and metabolites, whose topological structure is a non-random and scale-free network with hierarchically structured modules was built. These modules shed light on the interactions between RNAs, proteins and metabolites, revealing biological features of transcription and translation, cell wall biosynthesis, C1 metabolism and N metabolism. Network analysis identified four nodes that function as hubs and bottlenecks, namely, coenzyme A, HPr kinases, S-adenosylmethionine and the ribonuclease P-protein, suggesting pivotal roles for them in Clostridium. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  12. Mapping, Awareness, and Virtualization Network Administrator Training Tool (MAVNATT) Architecture and Framework

    DTIC Science & Technology

    2015-06-01

    unit may setup and teardown the entire tactical infrastructure multiple times per day. This tactical network administrator training is a critical...language and runs on Linux and Unix based systems. All provisioning is based around the Nagios Core application, a powerful backend solution for network...start up a large number of virtual machines quickly. CORE supports the simulation of fixed and mobile networks. CORE is open-source, written in Python

  13. Research on NGN network control technology

    NASA Astrophysics Data System (ADS)

    Li, WenYao; Zhou, Fang; Wu, JianXue; Li, ZhiGuang

    2004-04-01

    Nowadays NGN (Next Generation Network) is the hotspot for discussion and research in IT section. The NGN core technology is the network control technology. The key goal of NGN is to realize the network convergence and evolution. Referring to overlay network model core on Softswitch technology, circuit switch network and IP network convergence realized. Referring to the optical transmission network core on ASTN/ASON, service layer (i.e. IP layer) and optical transmission convergence realized. Together with the distributing feature of NGN network control technology, on NGN platform, overview of combining Softswitch and ASTN/ASON control technology, the solution whether IP should be the NGN core carrier platform attracts general attention, and this is also a QoS problem on NGN end to end. This solution produces the significant practical meaning on equipment development, network deployment, network design and optimization, especially on realizing present network smooth evolving to the NGN. This is why this paper puts forward the research topic on the NGN network control technology. This paper introduces basics on NGN network control technology, then proposes NGN network control reference model, at the same time describes a realizable network structure of NGN. Based on above, from the view of function realization, NGN network control technology is discussed and its work mechanism is analyzed.

  14. Comparative Analysis of Vertebrate Diurnal/Circadian Transcriptomes

    PubMed Central

    Boyle, Greg; Richter, Kerstin; Priest, Henry D.; Traver, David; Mockler, Todd C.; Chang, Jeffrey T.; Kay, Steve A.

    2017-01-01

    From photosynthetic bacteria to mammals, the circadian clock evolved to track diurnal rhythms and enable organisms to anticipate daily recurring changes such as temperature and light. It orchestrates a broad spectrum of physiology such as the sleep/wake and eating/fasting cycles. While we have made tremendous advances in our understanding of the molecular details of the circadian clock mechanism and how it is synchronized with the environment, we still have rudimentary knowledge regarding its connection to help regulate diurnal physiology. One potential reason is the sheer size of the output network. Diurnal/circadian transcriptomic studies are reporting that around 10% of the expressed genome is rhythmically controlled. Zebrafish is an important model system for the study of the core circadian mechanism in vertebrate. As Zebrafish share more than 70% of its genes with human, it could also be an additional model in addition to rodent for exploring the diurnal/circadian output with potential for translational relevance. Here we performed comparative diurnal/circadian transcriptome analysis with established mouse liver and other tissue datasets. First, by combining liver tissue sampling in a 48h time series, transcription profiling using oligonucleotide arrays and bioinformatics analysis, we profiled rhythmic transcripts and identified 2609 rhythmic genes. The comparative analysis revealed interesting features of the output network regarding number of rhythmic genes, proportion of tissue specific genes and the extent of transcription factor family expression. Undoubtedly, the Zebrafish model system will help identify new vertebrate outputs and their regulators and provides leads for further characterization of the diurnal cis-regulatory network. PMID:28076377

  15. A Network of Genes Antagonistic to the LIN-35 Retinoblastoma Protein of Caenorhabditis elegans

    PubMed Central

    Polley, Stanley R. G.; Fay, David S.

    2012-01-01

    The Caenorhabditis elegans pRb ortholog, LIN-35, functions in a wide range of cellular and developmental processes. This includes a role of LIN-35 in nutrient utilization by the intestine, which it carries out redundantly with SLR-2, a zinc-finger protein. This and other redundant functions of LIN-35 were identified in genetic screens for mutations that display synthetic phenotypes in conjunction with loss of lin-35. To explore the intestinal role of LIN-35, we conducted a genome-wide RNA-interference-feeding screen for suppressors of lin-35; slr-2 early larval arrest. Of the 26 suppressors identified, 17 fall into three functional classes: (1) ribosome biogenesis genes, (2) mitochondrial prohibitins, and (3) chromatin regulators. Further characterization indicates that different categories of suppressors act through distinct molecular mechanisms. We also tested lin-35; slr-2 suppressors, as well as suppressors of the synthetic multivulval phenotype, to determine the spectrum of lin-35-synthetic phenotypes that could be suppressed following inhibition of these genes. We identified 19 genes, most of which are evolutionarily conserved, that can suppress multiple unrelated lin-35-synthetic phenotypes. Our study reveals a network of genes broadly antagonistic to LIN-35 as well as genes specific to the role of LIN-35 in intestinal and vulval development. Suppressors of multiple lin-35 phenotypes may be candidate targets for anticancer therapies. Moreover, screening for suppressors of phenotypically distinct synthetic interactions, which share a common altered gene, may prove to be a novel and effective approach for identifying genes whose activities are most directly relevant to the core functions of the shared gene. PMID:22542970

  16. Optical interconnection network for parallel access to multi-rank memory in future computing systems.

    PubMed

    Wang, Kang; Gu, Huaxi; Yang, Yintang; Wang, Kun

    2015-08-10

    With the number of cores increasing, there is an emerging need for a high-bandwidth low-latency interconnection network, serving core-to-memory communication. In this paper, aiming at the goal of simultaneous access to multi-rank memory, we propose an optical interconnection network for core-to-memory communication. In the proposed network, the wavelength usage is delicately arranged so that cores can communicate with different ranks at the same time and broadcast for flow control can be achieved. A distributed memory controller architecture that works in a pipeline mode is also designed for efficient optical communication and transaction address processes. The scaling method and wavelength assignment for the proposed network are investigated. Compared with traditional electronic bus-based core-to-memory communication, the simulation results based on the PARSEC benchmark show that the bandwidth enhancement and latency reduction are apparent.

  17. Transcriptome Analysis Reveals the Mechanism Underlying the Production of a High Quantity of Chlorogenic Acid in Young Leaves of Lonicera macranthoides Hand.-Mazz

    PubMed Central

    Chen, Zexiong; Tang, Ning; You, Yuming; Lan, Jianbin; Liu, Yiqing; Li, Zhengguo

    2015-01-01

    Lonicera macranthoides Hand.-Mazz (L. macranthoides) is a medicinal herb that is widely distributed in southern China. The biosynthetic and metabolic pathways for a core secondary metabolite in L. macranthoides, chlorogenic acid (CGA), have been elucidated in many species. However, the mechanisms of CGA biosynthesis and the related gene regulatory network in L. macranthoides are still not well understood. In this study, CGA content was quantified by high performance liquid chromatography (HPLC), and CGA levels differed significantly among three tissues; specifically, the CGA content in young leaves (YL) was greater than that in young stems (YS), which was greater than that in mature flowers (MF). Transcriptome analysis of L. macranthoides yielded a total of 53,533,014 clean reads (average length 90 bp) and 76,453 unigenes (average length 703 bp). A total of 3,767 unigenes were involved in biosynthesis pathways of secondary metabolites. Of these unigenes, 80 were possibly related to CGA biosynthesis. Furthermore, differentially expressed genes (DEGs) were screened in different tissues including YL, MF and YS. In these tissues, 24 DEGs were found to be associated with CGA biosynthesis, including six phenylalanine ammonia lyase (PAL) genes, six 4-coumarate coenzyme A ligase (4CL) genes, four cinnamate 4-Hydroxylase (C4H) genes, seven hydroxycinnamoyl transferase/hydroxycinnamoyl-CoA quinate transferase HCT/HQT genes and one coumarate 3-hydroxylase (C3H) gene.These results further the understanding of CGA biosynthesis and the related regulatory network in L. macranthoides. PMID:26381882

  18. Irf8-Regulated Genomic Responses Drive Pathological Inflammation during Cerebral Malaria

    PubMed Central

    Radovanovic, Irena; Tam, Mifong; MacMicking, John D.; Stevenson, Mary M.; Gros, Philippe

    2013-01-01

    Interferon Regulatory Factor 8 (IRF8) is required for development, maturation and expression of anti-microbial defenses of myeloid cells. BXH2 mice harbor a severely hypomorphic allele at Irf8 (Irf8R294C) that causes susceptibility to infection with intracellular pathogens including Mycobacterium tuberculosis. We report that BXH2 are completely resistant to the development of cerebral malaria (ECM) following Plasmodium berghei ANKA infection. Comparative transcriptional profiling of brain RNA as well as chromatin immunoprecipitation and high-throughput sequencing (ChIP-seq) was used to identify IRF8-regulated genes whose expression is associated with pathological acute neuroinflammation. Genes increased by infection were strongly enriched for IRF8 binding sites, suggesting that IRF8 acts as a transcriptional activator in inflammatory programs. These lists were enriched for myeloid-specific pathways, including interferon responses, antigen presentation and Th1 polarizing cytokines. We show that inactivation of several of these downstream target genes (including the Irf8 transcription partner Irf1) confers protection against ECM. ECM-resistance in Irf8 and Irf1 mutants is associated with impaired myeloid and lymphoid cells function, including production of IL12p40 and IFNγ. We note strong overlap between genes bound and regulated by IRF8 during ECM and genes regulated in the lungs of M. tuberculosis infected mice. This IRF8-dependent network contains several genes recently identified as risk factors in acute and chronic human inflammatory conditions. We report a common core of IRF8-bound genes forming a critical inflammatory host-response network. PMID:23853600

  19. Unique core genomes of the bacterial family vibrionaceae: insights into niche adaptation and speciation.

    PubMed

    Kahlke, Tim; Goesmann, Alexander; Hjerde, Erik; Willassen, Nils Peder; Haugen, Peik

    2012-05-10

    The criteria for defining bacterial species and even the concept of bacterial species itself are under debate, and the discussion is apparently intensifying as more genome sequence data is becoming available. However, it is still unclear how the new advances in genomics should be used most efficiently to address this question. In this study we identify genes that are common to any group of genomes in our dataset, to determine whether genes specific to a particular taxon exist and to investigate their potential role in adaptation of bacteria to their specific niche. These genes were named unique core genes. Additionally, we investigate the existence and importance of unique core genes that are found in isolates of phylogenetically non-coherent groups. These groups of isolates, that share a genetic feature without sharing a closest common ancestor, are termed genophyletic groups. The bacterial family Vibrionaceae was used as the model, and we compiled and compared genome sequences of 64 different isolates. Using the software orthoMCL we determined clusters of homologous genes among the investigated genome sequences. We used multilocus sequence analysis to build a host phylogeny and mapped the numbers of unique core genes of all distinct groups of isolates onto the tree. The results show that unique core genes are more likely to be found in monophyletic groups of isolates. Genophyletic groups of isolates, in contrast, are less common especially for large groups of isolate. The subsequent annotation of unique core genes that are present in genophyletic groups indicate a high degree of horizontally transferred genes. Finally, the annotation of the unique core genes of Vibrio cholerae revealed genes involved in aerotaxis and biosynthesis of the iron-chelator vibriobactin. The presented work indicates that genes specific for any taxon inside the bacterial family Vibrionaceae exist. These unique core genes encode conserved metabolic functions that can shed light on the adaptation of a species to its ecological niche. Additionally, our study suggests that unique core genes can be used to aid classification of bacteria and contribute to a bacterial species definition on a genomic level. Furthermore, these genes may be of importance in clinical diagnostics and drug development.

  20. Diel pattern of circadian clock and storage protein gene expression in leaves and during seed filling in cowpea (Vigna unguiculata).

    PubMed

    Weiss, Julia; Terry, Marta I; Martos-Fuentes, Marina; Letourneux, Lisa; Ruiz-Hernández, Victoria; Fernández, Juan A; Egea-Cortines, Marcos

    2018-02-14

    Cowpea (Vigna unguiculata) is an important source of protein supply for animal and human nutrition. The major storage globulins VICILIN and LEGUMIN (LEG) are synthesized from several genes including LEGA, LEGB, LEGJ and CVC (CONVICILIN). The current hypothesis is that the plant circadian core clock genes are conserved in a wide array of species and that primary metabolism is to a large extent controlled by the plant circadian clock. Our aim was to investigate a possible link between gene expression of storage proteins and the circadian clock. We identified cowpea orthologues of the core clock genes VunLHY, VunTOC1, VunGI and VunELF3, the protein storage genes VunLEG, VunLEGJ, and VunCVC as well as nine candidate reference genes used in RT-PCR. ELONGATION FACTOR 1-A (ELF1A) resulted the most suitable reference gene. The clock genes VunELF3, VunGI, VunTOC1 and VunLHY showed a rhythmic expression profile in leaves with a typical evening/night and morning/midday phased expression. The diel patterns were not completely robust and only VungGI and VungELF3 retained a rhythmic pattern under free running conditions of darkness. Under field conditions, rhythmicity and phasing apparently faded during early pod and seed development and was regained in ripening pods for VunTOC1 and VunLHY. Mature seeds showed a rhythmic expression of VunGI resembling leaf tissue under controlled growth chamber conditions. Comparing time windows during developmental stages we found that VunCVC and VunLEG were significantly down regulated during the night in mature pods as compared to intermediate ripe pods, while changes in seeds were non-significant due to high variance. The rhythmic expression under field conditions was lost under growth chamber conditions. The core clock gene network is conserved in cowpea leaves showing a robust diel expression pattern except VunELF3 under growth chamber conditions. There appears to be a clock transcriptional reprogramming in pods and seeds compared to leaves. Storage protein deposition may be circadian regulated under field conditions but the strong environmental signals are not met under artificial growth conditions. Diel expression pattern in field conditions may result in better usage of energy for protein storage.

  1. Organization of complex networks

    NASA Astrophysics Data System (ADS)

    Kitsak, Maksim

    Many large complex systems can be successfully analyzed using the language of graphs and networks. Interactions between the objects in a network are treated as links connecting nodes. This approach to understanding the structure of networks is an important step toward understanding the way corresponding complex systems function. Using the tools of statistical physics, we analyze the structure of networks as they are found in complex systems such as the Internet, the World Wide Web, and numerous industrial and social networks. In the first chapter we apply the concept of self-similarity to the study of transport properties in complex networks. Self-similar or fractal networks, unlike non-fractal networks, exhibit similarity on a range of scales. We find that these fractal networks have transport properties that differ from those of non-fractal networks. In non-fractal networks, transport flows primarily through the hubs. In fractal networks, the self-similar structure requires any transport to also flow through nodes that have only a few connections. We also study, in models and in real networks, the crossover from fractal to non-fractal networks that occurs when a small number of random interactions are added by means of scaling techniques. In the second chapter we use k-core techniques to study dynamic processes in networks. The k-core of a network is the network's largest component that, within itself, exhibits all nodes with at least k connections. We use this k-core analysis to estimate the relative leadership positions of firms in the Life Science (LS) and Information and Communication Technology (ICT) sectors of industry. We study the differences in the k-core structure between the LS and the ICT sectors. We find that the lead segment (highest k-core) of the LS sector, unlike that of the ICT sector, is remarkably stable over time: once a particular firm enters the lead segment, it is likely to remain there for many years. In the third chapter we study how epidemics spread though networks. Our results indicate that a virus is more likely to infect a large area of a network if it originates at a node contained within k-core of high index k.

  2. EDdb: a web resource for eating disorder and its application to identify an extended adipocytokine signaling pathway related to eating disorder.

    PubMed

    Zhao, Min; Li, XiaoMo; Qu, Hong

    2013-12-01

    Eating disorder is a group of physiological and psychological disorders affecting approximately 1% of the female population worldwide. Although the genetic epidemiology of eating disorder is becoming increasingly clear with accumulated studies, the underlying molecular mechanisms are still unclear. Recently, integration of various high-throughput data expanded the range of candidate genes and started to generate hypotheses for understanding potential pathogenesis in complex diseases. This article presents EDdb (Eating Disorder database), the first evidence-based gene resource for eating disorder. Fifty-nine experimentally validated genes from the literature in relation to eating disorder were collected as the core dataset. Another four datasets with 2824 candidate genes across 601 genome regions were expanded based on the core dataset using different criteria (e.g., protein-protein interactions, shared cytobands, and related complex diseases). Based on human protein-protein interaction data, we reconstructed a potential molecular sub-network related to eating disorder. Furthermore, with an integrative pathway enrichment analysis of genes in EDdb, we identified an extended adipocytokine signaling pathway in eating disorder. Three genes in EDdb (ADIPO (adiponectin), TNF (tumor necrosis factor) and NR3C1 (nuclear receptor subfamily 3, group C, member 1)) link the KEGG (Kyoto Encyclopedia of Genes and Genomes) "adipocytokine signaling pathway" with the BioCarta "visceral fat deposits and the metabolic syndrome" pathway to form a joint pathway. In total, the joint pathway contains 43 genes, among which 39 genes are related to eating disorder. As the first comprehensive gene resource for eating disorder, EDdb ( http://eddb.cbi.pku.edu.cn ) enables the exploration of gene-disease relationships and cross-talk mechanisms between related disorders. Through pathway statistical studies, we revealed that abnormal body weight caused by eating disorder and obesity may both be related to dysregulation of the novel joint pathway of adipocytokine signaling. In addition, this joint pathway may be the common pathway for body weight regulation in complex human diseases related to unhealthy lifestyle.

  3. Exploring the key genes and pathways in enchondromas using a gene expression microarray.

    PubMed

    Shi, Zhongju; Zhou, Hengxing; Pan, Bin; Lu, Lu; Kang, Yi; Liu, Lu; Wei, Zhijian; Feng, Shiqing

    2017-07-04

    Enchondromas are the most common primary benign osseous neoplasms that occur in the medullary bone; they can undergo malignant transformation into chondrosarcoma. However, enchondromas are always undetected in patients, and the molecular mechanism is unclear. To identify key genes and pathways associated with the occurrence and development of enchondromas, we downloaded the gene expression dataset GSE22855 and obtained the differentially expressed genes (DEGs) by analyzing high-throughput gene expression in enchondromas. In total, 635 genes were identified as DEGs. Of these, 225 genes (35.43%) were up-regulated, and the remaining 410 genes (64.57%) were down-regulated. We identified the predominant gene ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways that were significantly over-represented in the enchondromas samples compared with the control samples. Subsequently the top 10 core genes were identified from the protein-protein interaction (PPI) network. The enrichment analyses of the genes mainly involved in two significant modules showed that the DEGs were principally related to ribosomes, protein digestion and absorption, ECM-receptor interaction, focal adhesion, amoebiasis and the PI3K-Akt signaling pathway.Together, these data elucidate the molecular mechanisms underlying the occurrence and development of enchondromas and provide promising candidates for therapeutic intervention and prognostic evaluation. However, further experimental studies are needed to confirm these results.

  4. Neural networks underlying trait aggression depend on MAOA gene alleles.

    PubMed

    Klasen, Martin; Wolf, Dhana; Eisner, Patrick D; Habel, Ute; Repple, Jonathan; Vernaleken, Ingo; Schlüter, Thorben; Eggermann, Thomas; Zerres, Klaus; Zepf, Florian D; Mathiak, Klaus

    2018-03-01

    Low expressing alleles of the MAOA gene (MAOA-L) have been associated with an increased risk for developing an aggressive personality. This suggests an MAOA-L-specific neurobiological vulnerability associated with trait aggression. The neural networks underlying this vulnerability are unknown. The present study investigated genotype-specific associations between resting state brain networks and trait aggression (Buss-Perry Aggression Questionnaire) in 82 healthy Caucasian males. Genotype influences on aggression-related networks were studied for intrinsic and seed-based brain connectivity. Intrinsic connectivity was higher in the ventromedial prefrontal cortex (VMPFC) of MAOA-L compared to high expressing allele (MAOA-H) carriers. Seed-based connectivity analyses revealed genotype differences in the functional involvement of this region. MAOA genotype modulated the relationship between trait aggression and VMPFC connectivity with supramarginal gyrus (SMG) and areas of the default mode network (DMN). Separate analyses for the two groups were performed to better understand how the genotype modulated the relationship between aggression and brain networks. They revealed a positive correlation between VMPFC connectivity and aggression in right angular gyrus (AG) and a negative correlation in right SMG in the MAOA-L group. No such effect emerged in the MAOA-H carriers. The results indicate a particular relevance of VMPFC for aggression in MAOA-L carriers; in specific, a detachment from the DMN along with a strengthened coupling to the AG seems to go along with lower trait aggression. MAOA-L carriers may thus depend on a synchronization of emotion regulation systems (VMPFC) with core areas of empathy (SMG) to prevent aggression.

  5. Protein complex prediction for large protein protein interaction networks with the Core&Peel method.

    PubMed

    Pellegrini, Marco; Baglioni, Miriam; Geraci, Filippo

    2016-11-08

    Biological networks play an increasingly important role in the exploration of functional modularity and cellular organization at a systemic level. Quite often the first tools used to analyze these networks are clustering algorithms. We concentrate here on the specific task of predicting protein complexes (PC) in large protein-protein interaction networks (PPIN). Currently, many state-of-the-art algorithms work well for networks of small or moderate size. However, their performance on much larger networks, which are becoming increasingly common in modern proteome-wise studies, needs to be re-assessed. We present a new fast algorithm for clustering large sparse networks: Core&Peel, which runs essentially in time and storage O(a(G)m+n) for a network G of n nodes and m arcs, where a(G) is the arboricity of G (which is roughly proportional to the maximum average degree of any induced subgraph in G). We evaluated Core&Peel on five PPI networks of large size and one of medium size from both yeast and homo sapiens, comparing its performance against those of ten state-of-the-art methods. We demonstrate that Core&Peel consistently outperforms the ten competitors in its ability to identify known protein complexes and in the functional coherence of its predictions. Our method is remarkably robust, being quite insensible to the injection of random interactions. Core&Peel is also empirically efficient attaining the second best running time over large networks among the tested algorithms. Our algorithm Core&Peel pushes forward the state-of the-art in PPIN clustering providing an algorithmic solution with polynomial running time that attains experimentally demonstrable good output quality and speed on challenging large real networks.

  6. Nuclear receptors and metabolism: from feast to famine.

    PubMed

    Hong, Suk-Hyun; Ahmadian, Maryam; Yu, Ruth T; Atkins, Annette R; Downes, Michael; Evans, Ronald M

    2014-05-01

    The ability to adapt to cycles of feast and famine is critical for survival. Communication between multiple metabolic organs must be integrated to properly metabolise nutrients. By controlling networks of genes in major metabolic organs, nuclear hormone receptors (NHRs) play central roles in regulating metabolism in a tissue-specific manner. NHRs also establish daily rhythmicity by controlling the expression of core clock genes both centrally and peripherally. Recent findings show that many of the metabolic effects of NHRs are mediated through certain members of the fibroblast growth factor (FGF) family. This review focuses on the roles of NHRs in critical metabolic organs, including adipose tissue, liver and muscle, during the fed and fasted states, as well as their roles in circadian metabolism and downstream regulation of FGFs.

  7. Core regulatory network motif underlies the ocellar complex patterning in Drosophila melanogaster

    NASA Astrophysics Data System (ADS)

    Aguilar-Hidalgo, D.; Lemos, M. C.; Córdoba, A.

    2015-03-01

    During organogenesis, developmental programs governed by Gene Regulatory Networks (GRN) define the functionality, size and shape of the different constituents of living organisms. Robustness, thus, is an essential characteristic that GRNs need to fulfill in order to maintain viability and reproducibility in a species. In the present work we analyze the robustness of the patterning for the ocellar complex formation in Drosophila melanogaster fly. We have systematically pruned the GRN that drives the development of this visual system to obtain the minimum pathway able to satisfy this pattern. We found that the mechanism underlying the patterning obeys to the dynamics of a 3-nodes network motif with a double negative feedback loop fed by a morphogenetic gradient that triggers the inhibition in a French flag problem fashion. A Boolean modeling of the GRN confirms robustness in the patterning mechanism showing the same result for different network complexity levels. Interestingly, the network provides a steady state solution in the interocellar part of the patterning and an oscillatory regime in the ocelli. This theoretical result predicts that the ocellar pattern may underlie oscillatory dynamics in its genetic regulation.

  8. Evolution of Associative Learning in Chemical Networks

    PubMed Central

    McGregor, Simon; Vasas, Vera; Husbands, Phil; Fernando, Chrisantha

    2012-01-01

    Organisms that can learn about their environment and modify their behaviour appropriately during their lifetime are more likely to survive and reproduce than organisms that do not. While associative learning – the ability to detect correlated features of the environment – has been studied extensively in nervous systems, where the underlying mechanisms are reasonably well understood, mechanisms within single cells that could allow associative learning have received little attention. Here, using in silico evolution of chemical networks, we show that there exists a diversity of remarkably simple and plausible chemical solutions to the associative learning problem, the simplest of which uses only one core chemical reaction. We then asked to what extent a linear combination of chemical concentrations in the network could approximate the ideal Bayesian posterior of an environment given the stimulus history so far? This Bayesian analysis revealed the ‘memory traces’ of the chemical network. The implication of this paper is that there is little reason to believe that a lack of suitable phenotypic variation would prevent associative learning from evolving in cell signalling, metabolic, gene regulatory, or a mixture of these networks in cells. PMID:23133353

  9. Immunomediated Pan-cancer Regulation Networks are Dominant Fingerprints After Treatment of Cell Lines with Demethylation.

    PubMed

    El Baroudi, Mariama; Cinti, Caterina; Capobianco, Enrico

    2016-01-01

    Pan-cancer studies are particularly relevant not only for addressing the complexity of the inherently observed heterogeneity but also for identifying clinically relevant features that may be common to the cancer types. Immune system regulations usually reveal synergistic modulation with other cancer mechanisms and in combination provide insights on possible advances in cancer immunotherapies. Network inference is a powerful approach to decipher pan-cancer systems dynamics. The methodology proposed in this study elucidates the impacts of epigenetic treatment on the drivers of complex pan-cancer regulation circuits involving cell lines of five cancer types. These patterns were observed from differential gene expression measurements following demethylation with 5-azacytidine. Networks were built to establish associations of phenotypes at molecular level with cancer hallmarks through both transcriptional and post-transcriptional regulation mechanisms. The most prominent feature that emerges from our integrative network maps, linking pathway landscapes to disease and drug-target associations, refers primarily to a mosaic of immune-system crosslinked influences. Therefore, characteristics initially evidenced in single cancer maps become motifs well summarized by network cores and fingerprints.

  10. UMA/GAN network architecture analysis

    NASA Astrophysics Data System (ADS)

    Yang, Liang; Li, Wensheng; Deng, Chunjian; Lv, Yi

    2009-07-01

    This paper is to critically analyze the architecture of UMA which is one of Fix Mobile Convergence (FMC) solutions, and also included by the third generation partnership project(3GPP). In UMA/GAN network architecture, UMA Network Controller (UNC) is the key equipment which connects with cellular core network and mobile station (MS). UMA network could be easily integrated into the existing cellular networks without influencing mobile core network, and could provides high-quality mobile services with preferentially priced indoor voice and data usage. This helps to improve subscriber's experience. On the other hand, UMA/GAN architecture helps to integrate other radio technique into cellular network which includes WiFi, Bluetooth, and WiMax and so on. This offers the traditional mobile operators an opportunity to integrate WiMax technique into cellular network. In the end of this article, we also give an analysis of potential influence on the cellular core networks ,which is pulled by UMA network.

  11. Computing prokaryotic gene ubiquity: rescuing the core from extinction.

    PubMed

    Charlebois, Robert L; Doolittle, W Ford

    2004-12-01

    The genomic core concept has found several uses in comparative and evolutionary genomics. Defined as the set of all genes common to (ubiquitous among) all genomes in a phylogenetically coherent group, core size decreases as the number and phylogenetic diversity of the relevant group increases. Here, we focus on methods for defining the size and composition of the core of all genes shared by sequenced genomes of prokaryotes (Bacteria and Archaea). There are few (almost certainly less than 50) genes shared by all of the 147 genomes compared, surely insufficient to conduct all essential functions. Sequencing and annotation errors are responsible for the apparent absence of some genes, while very limited but genuine disappearances (from just one or a few genomes) can account for several others. Core size will continue to decrease as more genome sequences appear, unless the requirement for ubiquity is relaxed. Such relaxation seems consistent with any reasonable biological purpose for seeking a core, but it renders the problem of definition more problematic. We propose an alternative approach (the phylogenetically balanced core), which preserves some of the biological utility of the core concept. Cores, however delimited, preferentially contain informational rather than operational genes; we present a new hypothesis for why this might be so.

  12. Transnational cocaine and heroin flow networks in western Europe: A comparison.

    PubMed

    Chandra, Siddharth; Joba, Johnathan

    2015-08-01

    A comparison of the properties of drug flow networks for cocaine and heroin in a group of 17 western European countries is provided with the aim of understanding the implications of their similarities and differences for drug policy. Drug flow data for the cocaine and heroin networks were analyzed using the UCINET software package. Country-level characteristics including hub and authority scores, core and periphery membership, and centrality, and network-level characteristics including network density, the results of a triad census, and the final fitness of the core-periphery structure of the network, were computed and compared between the two networks. The cocaine network contains fewer path redundancies and a smaller, more tightly knit core than the heroin network. Authorities, hubs and countries central to the cocaine network tend to have higher hub, authority, and centrality scores than those in the heroin network. The core-periphery and hub-authority structures of the cocaine and heroin networks reflect the west-to-east and east-to-west patterns of flow of cocaine and heroin respectively across Europe. The key nodes in the cocaine and heroin networks are generally distinct from one another. The analysis of drug flow networks can reveal important structural features of trafficking networks that can be useful for the allocation of scarce drug control resources. The identification of authorities, hubs, network cores, and network-central nodes can suggest foci for the allocation of these resources. In the case of Europe, while some countries are important to both cocaine and heroin networks, different sets of countries occupy positions of prominence in the two networks. The distinct nature of the cocaine and heroin networks also suggests that a one-size-fits-all supply- and interdiction-focused policy may not work as well as an approach that takes into account the particular characteristics of each network. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. Ranking the spreading ability of nodes in network core

    NASA Astrophysics Data System (ADS)

    Tong, Xiao-Lei; Liu, Jian-Guo; Wang, Jiang-Pan; Guo, Qiang; Ni, Jing

    2015-11-01

    Ranking nodes by their spreading ability in complex networks is of vital significance to better understand the network structure and more efficiently spread information. The k-shell decomposition method could identify the most influential nodes, namely network core, with the same ks values regardless to their different spreading influence. In this paper, we present an improved method based on the k-shell decomposition method and closeness centrality (CC) to rank the node spreading influence of the network core. Experiment results on the data from the scientific collaboration network and U.S. aviation network show that the accuracy of the presented method could be increased by 31% and 45% than the one obtained by the degree k, 32% and 31% than the one by the betweenness.

  14. Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.

    PubMed

    Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin

    2016-04-01

    Brucella spp. are facultative intracellular pathogens, that cause a contagious zoonotic disease, that can result in such outcomes as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. For deciphering the survival mechanism of Brucella spp. in vivo, 42 Brucella complete genomes from NCBI were analyzed for the pan-genome and core genome by identification of their composition and function of Brucella genomes. The results showed that the total 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44 % of the core genes were devoted to metabolism, which were mainly responsible for energy production and conversion (COG category C), and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35 % of the core genes were in positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conservation, and the energy and amino acid metabolism play a more important role in the process of growth and reproduction in Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of the intracellular survival in Brucella spp.

  15. Analysis of Structure and Composition of Bacterial Core Communities in Mature Drinking Water Biofilms and Bulk Water of a Citywide Network in Germany

    PubMed Central

    Henne, Karsten; Kahlisch, Leila; Brettar, Ingrid

    2012-01-01

    The bacterial core communities of bulk water and corresponding biofilms of a more than 20-year-old drinking water network were compared using 16S rRNA single-strand confirmation polymorphism (SSCP) fingerprints based on extracted DNA and RNA. The structure and composition of the bacterial core community in the bulk water was highly similar (>70%) across the city of Braunschweig, Germany, whereas all biofilm samples contained a unique community with no overlapping phylotypes from bulk water. Biofilm samples consisted mainly of Alphaproteobacteria (26% of all phylotypes), Gammaproteobacteria (11%), candidate division TM6 (11%), Chlamydiales (9%), and Betaproteobacteria (9%). The bulk water community consisted primarily of Bacteroidetes (25%), Betaproteobacteria (20%), Actinobacteria (16%), and Alphaproteobacteria (11%). All biofilm communities showed higher relative abundances of single phylotypes and a reduced richness compared to bulk water. Only biofilm communities sampled at nearby sampling points showed similar communities irrespective of support materials. In all of our bulk water studies, the community composition determined from 16S rRNA was completely different from the 16S rRNA gene-based community composition, whereas in biofilms both molecular fractions resulted in community compositions that were similar to each other. We hypothesize that a higher fraction of active bacterial phylotypes and a better protection from oxidative stress in drinking water biofilms are responsible for this higher similarity. PMID:22389373

  16. Analysis of structure and composition of bacterial core communities in mature drinking water biofilms and bulk water of a citywide network in Germany.

    PubMed

    Henne, Karsten; Kahlisch, Leila; Brettar, Ingrid; Höfle, Manfred G

    2012-05-01

    The bacterial core communities of bulk water and corresponding biofilms of a more than 20-year-old drinking water network were compared using 16S rRNA single-strand confirmation polymorphism (SSCP) fingerprints based on extracted DNA and RNA. The structure and composition of the bacterial core community in the bulk water was highly similar (>70%) across the city of Braunschweig, Germany, whereas all biofilm samples contained a unique community with no overlapping phylotypes from bulk water. Biofilm samples consisted mainly of Alphaproteobacteria (26% of all phylotypes), Gammaproteobacteria (11%), candidate division TM6 (11%), Chlamydiales (9%), and Betaproteobacteria (9%). The bulk water community consisted primarily of Bacteroidetes (25%), Betaproteobacteria (20%), Actinobacteria (16%), and Alphaproteobacteria (11%). All biofilm communities showed higher relative abundances of single phylotypes and a reduced richness compared to bulk water. Only biofilm communities sampled at nearby sampling points showed similar communities irrespective of support materials. In all of our bulk water studies, the community composition determined from 16S rRNA was completely different from the 16S rRNA gene-based community composition, whereas in biofilms both molecular fractions resulted in community compositions that were similar to each other. We hypothesize that a higher fraction of active bacterial phylotypes and a better protection from oxidative stress in drinking water biofilms are responsible for this higher similarity.

  17. Epigenetic regulation of female puberty.

    PubMed

    Lomniczi, Alejandro; Wright, Hollis; Ojeda, Sergio R

    2015-01-01

    Substantial progress has been made in recent years toward deciphering the molecular and genetic underpinnings of the pubertal process. The availability of powerful new methods to interrogate the human genome has led to the identification of genes that are essential for puberty to occur. Evidence has also emerged suggesting that the initiation of puberty requires the coordinated activity of gene sets organized into functional networks. At a cellular level, it is currently thought that loss of transsynaptic inhibition, accompanied by an increase in excitatory inputs, results in the pubertal activation of GnRH release. This concept notwithstanding, a mechanism of epigenetic repression targeting genes required for the pubertal activation of GnRH neurons was recently identified as a core component of the molecular machinery underlying the central restraint of puberty. In this chapter we will discuss the potential contribution of various mechanisms of epigenetic regulation to the hypothalamic control of female puberty. Copyright © 2014 Elsevier Inc. All rights reserved.

  18. Astrocyte deletion of Bmal1 alters daily locomotor activity and cognitive functions via GABA signalling

    PubMed Central

    Barca-Mayo, Olga; Pons-Espinal, Meritxell; Follert, Philipp; Armirotti, Andrea; Berdondini, Luca; De Pietri Tonelli, Davide

    2017-01-01

    Circadian rhythms are controlled by a network of clock neurons in the central pacemaker, the suprachiasmatic nucleus (SCN). Core clock genes, such as Bmal1, are expressed in SCN neurons and in other brain cells, such as astrocytes. However, the role of astrocytic clock genes in controlling rhythmic behaviour is unknown. Here we show that ablation of Bmal1 in GLAST-positive astrocytes alters circadian locomotor behaviour and cognition in mice. Specifically, deletion of astrocytic Bmal1 has an impact on the neuronal clock through GABA signalling. Importantly, pharmacological modulation of GABAA-receptor signalling completely rescues the behavioural phenotypes. Our results reveal a crucial role of astrocytic Bmal1 for the coordination of neuronal clocks and propose a new cellular target, astrocytes, for neuropharmacology of transient or chronic perturbation of circadian rhythms, where alteration of astrocytic clock genes might contribute to the impairment of the neurobehavioural outputs such as cognition. PMID:28186121

  19. Imagining the future: The core episodic simulation network dissociates as a function of timecourse and the amount of simulated information

    PubMed Central

    Thakral, Preston P.; Benoit, Roland G.; Schacter, Daniel L.

    2017-01-01

    Neuroimaging data indicate that episodic memory (i.e., remembering specific past experiences) and episodic simulation (i.e., imagining specific future experiences) are associated with enhanced activity in a common set of neural regions, often referred to as the core network. This network comprises the hippocampus, parahippocampal cortex, lateral and medial parietal cortex, lateral temporal cortex, and medial prefrontal cortex. Evidence for a core network has been taken as support for the idea that episodic memory and episodic simulation are supported by common processes. Much remains to be learned about how specific core network regions contribute to specific aspects of episodic simulation. Prior neuroimaging studies of episodic memory indicate that certain regions within the core network are differentially sensitive to the amount of information recollected (e.g., the left lateral parietal cortex). In addition, certain core network regions dissociate as a function of their timecourse of engagement during episodic memory (e.g., transient activity in the posterior hippocampus and sustained activity in the left lateral parietal cortex). In the current study, we assessed whether similar dissociations could be observed during episodic simulation. We found that the left lateral parietal cortex modulates as a function of the amount of simulated details. Of particular interest, while the hippocampus was insensitive to the amount of simulated details, we observed a temporal dissociation within the hippocampus: transient activity occurred in relatively posterior portions of the hippocampus and sustained activity occurred in anterior portions. Because the posterior hippocampal and lateral parietal findings parallel those observed previously during episodic memory, the present results add to the evidence that episodic memory and episodic simulation are supported by common processes. Critically, the present study also provides evidence that regions within the core network support dissociable processes. PMID:28324695

  20. Discovering Implicit Entity Relation with the Gene-Citation-Gene Network

    PubMed Central

    Song, Min; Han, Nam-Gi; Kim, Yong-Hwan; Ding, Ying; Chambers, Tamy

    2013-01-01

    In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG) network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveal that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner. PMID:24358368

  1. Genome-scale reconstruction of the sigma factor network in Escherichia coli: topology and functional states

    PubMed Central

    2014-01-01

    Background At the beginning of the transcription process, the RNA polymerase (RNAP) core enzyme requires a σ-factor to recognize the genomic location at which the process initiates. Although the crucial role of σ-factors has long been appreciated and characterized for many individual promoters, we do not yet have a genome-scale assessment of their function. Results Using multiple genome-scale measurements, we elucidated the network of σ-factor and promoter interactions in Escherichia coli. The reconstructed network includes 4,724 σ-factor-specific promoters corresponding to transcription units (TUs), representing an increase of more than 300% over what has been previously reported. The reconstructed network was used to investigate competition between alternative σ-factors (the σ70 and σ38 regulons), confirming the competition model of σ substitution and negative regulation by alternative σ-factors. Comparison with σ-factor binding in Klebsiella pneumoniae showed that transcriptional regulation of conserved genes in closely related species is unexpectedly divergent. Conclusions The reconstructed network reveals the regulatory complexity of the promoter architecture in prokaryotic genomes, and opens a path to the direct determination of the systems biology of their transcriptional regulatory networks. PMID:24461193

  2. Network-guided genomic and metagenomic analysis of the faecal microbiota of the critically endangered kakapo.

    PubMed

    Waite, David W; Dsouza, Melissa; Sekiguchi, Yuji; Hugenholtz, Philip; Taylor, Michael W

    2018-05-25

    The kakapo is a critically endangered, herbivorous parrot endemic to New Zealand. The kakapo hindgut hosts a dense microbial community of low taxonomic diversity, typically dominated by Escherichia fergusonii, and has proven to be a remarkably stable ecosystem, displaying little variation in core membership over years of study. To elucidate mechanisms underlying this robustness, we performed 16S rRNA gene-based co-occurrence network analysis to identify potential interactions between E. fergusonii and the wider bacterial community. Genomic and metagenomic sequencing were employed to facilitate interpretation of potential interactions observed in the network. E. fergusonii maintained very few correlations with other members of the microbiota, and isolates possessed genes for the generation of energy from a wide range of carbohydrate sources, including plant fibres such as cellulose. We surmise that this dominant microorganism is abundant not due to ecological interaction with other members of the microbiota, but its ability to metabolise a wide range of nutrients in the gut. This research represents the first concerted effort to understand the functional roles of the kakapo microbiota, and leverages metagenomic data to contextualise co-occurrence patterns. By combining these two techniques we provide a means for studying the diversity-stability hypothesis in the context of bacterial ecosystems.

  3. TFIIB-facilitated recruitment of preinitiation complexes by a TAF-independent mechanism.

    PubMed

    Hori, Roderick T; Xu, Shuping; Hu, Xianyuan; Pyo, Sung

    2004-01-01

    Gene activators contain activation domains that are thought to recruit limiting components of the transcription machinery to a core promoter. VP16, a viral gene activator, has served as a model for studying the mechanistic aspects of transcriptional activation from yeast to human. The VP16 activation domain can be divided into two modules--an N-terminal subdomain (VPN) and a C-terminal subdomain (VPC). This study demonstrates that VPC stimulates core promoters that are either independent or dependent on TAFs (TATA-box Binding Protein-Associated Factors). In contrast, VPN only activates the TAF-independent core promoter and this activity increases in a synergistic fashion when VPN is dimerized (VPN2). Compared to one copy of VPN (VPN1), VPN2 also displays a highly cooperative increase in binding hTFIIB. The increased TFIIB binding correlates with VPN2's increased ability to recruit a complex containing TFIID, TFIIA and TFIIB. However, VPN1 and VPN2 do not increase the assembly of a complex containing only TFIID and TFIIA. The VPN subdomain also facilitates assembly of a complex containing TBP:TFIIA:TFIIB, which lacks TAFs, and provides a mechanism that could function at TAF-independent promoters. Taken together, these results suggest the interaction between VPN and TFIIB potentially initiate a network of contacts allowing the activator to indirectly tether TFIID or TBP to DNA.

  4. TFIIB-facilitated recruitment of preinitiation complexes by a TAF-independent mechanism

    PubMed Central

    Hori, Roderick T.; Xu, Shuping; Hu, Xianyuan; Pyo, Sung

    2004-01-01

    Gene activators contain activation domains that are thought to recruit limiting components of the transcription machinery to a core promoter. VP16, a viral gene activator, has served as a model for studying the mechanistic aspects of transcriptional activation from yeast to human. The VP16 activation domain can be divided into two modules—an N-terminal subdomain (VPN) and a C-terminal subdomain (VPC). This study demonstrates that VPC stimulates core promoters that are either independent or dependent on TAFs (TATA-box Binding Protein-Associated Factors). In contrast, VPN only activates the TAF-independent core promoter and this activity increases in a synergistic fashion when VPN is dimerized (VPN2). Compared to one copy of VPN (VPN1), VPN2 also displays a highly cooperative increase in binding hTFIIB. The increased TFIIB binding correlates with VPN2's increased ability to recruit a complex containing TFIID, TFIIA and TFIIB. However, VPN1 and VPN2 do not increase the assembly of a complex containing only TFIID and TFIIA. The VPN subdomain also facilitates assembly of a complex containing TBP:TFIIA:TFIIB, which lacks TAFs, and provides a mechanism that could function at TAF-independent promoters. Taken together, these results suggest the interaction between VPN and TFIIB potentially initiate a network of contacts allowing the activator to indirectly tether TFIID or TBP to DNA. PMID:15272087

  5. Reverse engineering a gene network using an asynchronous parallel evolution strategy

    PubMed Central

    2010-01-01

    Background The use of reverse engineering methods to infer gene regulatory networks by fitting mathematical models to gene expression data is becoming increasingly popular and successful. However, increasing model complexity means that more powerful global optimisation techniques are required for model fitting. The parallel Lam Simulated Annealing (pLSA) algorithm has been used in such approaches, but recent research has shown that island Evolutionary Strategies can produce faster, more reliable results. However, no parallel island Evolutionary Strategy (piES) has yet been demonstrated to be effective for this task. Results Here, we present synchronous and asynchronous versions of the piES algorithm, and apply them to a real reverse engineering problem: inferring parameters in the gap gene network. We find that the asynchronous piES exhibits very little communication overhead, and shows significant speed-up for up to 50 nodes: the piES running on 50 nodes is nearly 10 times faster than the best serial algorithm. We compare the asynchronous piES to pLSA on the same test problem, measuring the time required to reach particular levels of residual error, and show that it shows much faster convergence than pLSA across all optimisation conditions tested. Conclusions Our results demonstrate that the piES is consistently faster and more reliable than the pLSA algorithm on this problem, and scales better with increasing numbers of nodes. In addition, the piES is especially well suited to further improvements and adaptations: Firstly, the algorithm's fast initial descent speed and high reliability make it a good candidate for being used as part of a global/local search hybrid algorithm. Secondly, it has the potential to be used as part of a hierarchical evolutionary algorithm, which takes advantage of modern multi-core computing architectures. PMID:20196855

  6. Identification of fever and vaccine-associated gene interaction networks using ontology-based literature mining

    PubMed Central

    2012-01-01

    Background Fever is one of the most common adverse events of vaccines. The detailed mechanisms of fever and vaccine-associated gene interaction networks are not fully understood. In the present study, we employed a genome-wide, Centrality and Ontology-based Network Discovery using Literature data (CONDL) approach to analyse the genes and gene interaction networks associated with fever or vaccine-related fever responses. Results Over 170,000 fever-related articles from PubMed abstracts and titles were retrieved and analysed at the sentence level using natural language processing techniques to identify genes and vaccines (including 186 Vaccine Ontology terms) as well as their interactions. This resulted in a generic fever network consisting of 403 genes and 577 gene interactions. A vaccine-specific fever sub-network consisting of 29 genes and 28 gene interactions was extracted from articles that are related to both fever and vaccines. In addition, gene-vaccine interactions were identified. Vaccines (including 4 specific vaccine names) were found to directly interact with 26 genes. Gene set enrichment analysis was performed using the genes in the generated interaction networks. Moreover, the genes in these networks were prioritized using network centrality metrics. Making scientific discoveries and generating new hypotheses were possible by using network centrality and gene set enrichment analyses. For example, our study found that the genes in the generic fever network were more enriched in cell death and responses to wounding, and the vaccine sub-network had more gene enrichment in leukocyte activation and phosphorylation regulation. The most central genes in the vaccine-specific fever network are predicted to be highly relevant to vaccine-induced fever, whereas genes that are central only in the generic fever network are likely to be highly relevant to generic fever responses. Interestingly, no Toll-like receptors (TLRs) were found in the gene-vaccine interaction network. Since multiple TLRs were found in the generic fever network, it is reasonable to hypothesize that vaccine-TLR interactions may play an important role in inducing fever response, which deserves a further investigation. Conclusions This study demonstrated that ontology-based literature mining is a powerful method for analyzing gene interaction networks and generating new scientific hypotheses. PMID:23256563

  7. Rich-Cores in Networks

    PubMed Central

    Ma, Athen; Mondragón, Raúl J.

    2015-01-01

    A core comprises of a group of central and densely connected nodes which governs the overall behaviour of a network. It is recognised as one of the key meso-scale structures in complex networks. Profiling this meso-scale structure currently relies on a limited number of methods which are often complex and parameter dependent or require a null model. As a result, scalability issues are likely to arise when dealing with very large networks together with the need for subjective adjustment of parameters. The notion of a rich-club describes nodes which are essentially the hub of a network, as they play a dominating role in structural and functional properties. The definition of a rich-club naturally emphasises high degree nodes and divides a network into two subgroups. Here, we develop a method to characterise a rich-core in networks by theoretically coupling the underlying principle of a rich-club with the escape time of a random walker. The method is fast, scalable to large networks and completely parameter free. In particular, we show that the evolution of the core in World Trade and C. elegans networks correspond to responses to historical events and key stages in their physical development, respectively. PMID:25799585

  8. Rich-cores in networks.

    PubMed

    Ma, Athen; Mondragón, Raúl J

    2015-01-01

    A core comprises of a group of central and densely connected nodes which governs the overall behaviour of a network. It is recognised as one of the key meso-scale structures in complex networks. Profiling this meso-scale structure currently relies on a limited number of methods which are often complex and parameter dependent or require a null model. As a result, scalability issues are likely to arise when dealing with very large networks together with the need for subjective adjustment of parameters. The notion of a rich-club describes nodes which are essentially the hub of a network, as they play a dominating role in structural and functional properties. The definition of a rich-club naturally emphasises high degree nodes and divides a network into two subgroups. Here, we develop a method to characterise a rich-core in networks by theoretically coupling the underlying principle of a rich-club with the escape time of a random walker. The method is fast, scalable to large networks and completely parameter free. In particular, we show that the evolution of the core in World Trade and C. elegans networks correspond to responses to historical events and key stages in their physical development, respectively.

  9. Stochastic loss and gain of symmetric divisions in the C. elegans epidermis perturbs robustness of stem cell number

    PubMed Central

    Katsanos, Dimitris; Koneru, Sneha L.; Mestek Boukhibar, Lamia; Gritti, Nicola; Ghose, Ritobrata; Appleford, Peter J.; Doitsidou, Maria; Woollard, Alison; van Zon, Jeroen S.; Poole, Richard J.

    2017-01-01

    Biological systems are subject to inherent stochasticity. Nevertheless, development is remarkably robust, ensuring the consistency of key phenotypic traits such as correct cell numbers in a certain tissue. It is currently unclear which genes modulate phenotypic variability, what their relationship is to core components of developmental gene networks, and what is the developmental basis of variable phenotypes. Here, we start addressing these questions using the robust number of Caenorhabditis elegans epidermal stem cells, known as seam cells, as a readout. We employ genetics, cell lineage tracing, and single molecule imaging to show that mutations in lin-22, a Hes-related basic helix-loop-helix (bHLH) transcription factor, increase seam cell number variability. We show that the increase in phenotypic variability is due to stochastic conversion of normally symmetric cell divisions to asymmetric and vice versa during development, which affect the terminal seam cell number in opposing directions. We demonstrate that LIN-22 acts within the epidermal gene network to antagonise the Wnt signalling pathway. However, lin-22 mutants exhibit cell-to-cell variability in Wnt pathway activation, which correlates with and may drive phenotypic variability. Our study demonstrates the feasibility to study phenotypic trait variance in tractable model organisms using unbiased mutagenesis screens. PMID:29108019

  10. Specifying the core network supporting episodic simulation and episodic memory by activation likelihood estimation.

    PubMed

    Benoit, Roland G; Schacter, Daniel L

    2015-08-01

    It has been suggested that the simulation of hypothetical episodes and the recollection of past episodes are supported by fundamentally the same set of brain regions. The present article specifies this core network via Activation Likelihood Estimation (ALE). Specifically, a first meta-analysis revealed joint engagement of expected core-network regions during episodic memory and episodic simulation. These include parts of the medial surface, the hippocampus and parahippocampal cortex within the medial temporal lobes, and the temporal and inferior posterior parietal cortices on the lateral surface. Both capacities also jointly recruited additional regions such as parts of the bilateral dorsolateral prefrontal cortex. All of these core regions overlapped with the default network. Moreover, it has further been suggested that episodic simulation may require a stronger engagement of some of the core network's nodes as well as the recruitment of additional brain regions supporting control functions. A second ALE meta-analysis indeed identified such regions that were consistently more strongly engaged during episodic simulation than episodic memory. These comprised the core-network clusters located in the left dorsolateral prefrontal cortex and posterior inferior parietal lobe and other structures distributed broadly across the default and fronto-parietal control networks. Together, the analyses determine the set of brain regions that allow us to experience past and hypothetical episodes, thus providing an important foundation for studying the regions' specialized contributions and interactions. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. A novel complex networks clustering algorithm based on the core influence of nodes.

    PubMed

    Tong, Chao; Niu, Jianwei; Dai, Bin; Xie, Zhongyu

    2014-01-01

    In complex networks, cluster structure, identified by the heterogeneity of nodes, has become a common and important topological property. Network clustering methods are thus significant for the study of complex networks. Currently, many typical clustering algorithms have some weakness like inaccuracy and slow convergence. In this paper, we propose a clustering algorithm by calculating the core influence of nodes. The clustering process is a simulation of the process of cluster formation in sociology. The algorithm detects the nodes with core influence through their betweenness centrality, and builds the cluster's core structure by discriminant functions. Next, the algorithm gets the final cluster structure after clustering the rest of the nodes in the network by optimizing method. Experiments on different datasets show that the clustering accuracy of this algorithm is superior to the classical clustering algorithm (Fast-Newman algorithm). It clusters faster and plays a positive role in revealing the real cluster structure of complex networks precisely.

  12. Free-space optics technology employed in an UMTS release 4 bearer independent core network access part

    NASA Astrophysics Data System (ADS)

    Bibac, Ionut

    2005-08-01

    The UMTS Bearer Independent Core Network program introduced the 3rd Generation Partnership Program Release 4 BICN architecture into the legacy UMTS TDM-switched network. BICN is the application of calI server archltecture for voice and circuit switched data, enabling the provisioning of traditional circuit-switched services using a packet-switched transport network. Today"s business climate has made it essential for service providers to develop a comprehensive networking strategy that means introduction of RCBICN networks. The R4-BICN solution to the evolution of the Core Network in UMTS will enable operators to significantly reduce the capital and operational costs of delivering both traditional voice sewices and new multimedia services. To build the optical backbone, which can support the third generation (3G) packetized infrastructure, the operators could choose a fibre connection, or they could retain the benefits of a wireless connectivity by using a FSO - Free Space Optical lmk, the only wireless technology available that is capable of achieving data rates up to 2.4 Gbit/s. FSO offers viable alternatives for both core transmission networks and for replacing microwaves links in NodeB - RNC access networks. The paper and presentation aim to demonstrate the manner in which FSO products and networks are employed into R4-BICN design solutions.

  13. Specifying the core network supporting episodic simulation and episodic memory by activation likelihood estimation

    PubMed Central

    Benoit, Roland G.; Schacter, Daniel L.

    2015-01-01

    It has been suggested that the simulation of hypothetical episodes and the recollection of past episodes are supported by fundamentally the same set of brain regions. The present article specifies this core network via Activation Likelihood Estimation (ALE). Specifically, a first meta-analysis revealed joint engagement of core network regions during episodic memory and episodic simulation. These include parts of the medial surface, the hippocampus and parahippocampal cortex within the medial temporal lobes, and the lateral temporal and inferior posterior parietal cortices on the lateral surface. Both capacities also jointly recruited additional regions such as parts of the bilateral dorsolateral prefrontal cortex. All of these core regions overlapped with the default network. Moreover, it has further been suggested that episodic simulation may require a stronger engagement of some of the core network’s nodes as wells as the recruitment of additional brain regions supporting control functions. A second ALE meta-analysis indeed identified such regions that were consistently more strongly engaged during episodic simulation than episodic memory. These comprised the core-network clusters located in the left dorsolateral prefrontal cortex and posterior inferior parietal lobe and other structures distributed broadly across the default and fronto-parietal control networks. Together, the analyses determine the set of brain regions that allow us to experience past and hypothetical episodes, thus providing an important foundation for studying the regions’ specialized contributions and interactions. PMID:26142352

  14. The Drosophila Translational Control Element (TCE) Is Required for High-Level Transcription of Many Genes That Are Specifically Expressed in Testes

    PubMed Central

    Anderson, Ashley K.; Ohler, Uwe; Wassarman, David A.

    2012-01-01

    To investigate the importance of core promoter elements for tissue-specific transcription of RNA polymerase II genes, we examined testis-specific transcription in Drosophila melanogaster. Bioinformatic analyses of core promoter sequences from 190 genes that are specifically expressed in testes identified a 10 bp A/T-rich motif that is identical to the translational control element (TCE). The TCE functions in the 5′ untranslated region of Mst(3)CGP mRNAs to repress translation, and it also functions in a heterologous gene to regulate transcription. We found that among genes with focused initiation patterns, the TCE is significantly enriched in core promoters of genes that are specifically expressed in testes but not in core promoters of genes that are specifically expressed in other tissues. The TCE is variably located in core promoters and is conserved in melanogaster subgroup species, but conservation dramatically drops in more distant species. In transgenic flies, short (300–400 bp) genomic regions containing a TCE directed testis-specific transcription of a reporter gene. Mutation of the TCE significantly reduced but did not abolish reporter gene transcription indicating that the TCE is important but not essential for transcription activation. Finally, mutation of testis-specific TFIID (tTFIID) subunits significantly reduced the transcription of a subset of endogenous TCE-containing but not TCE-lacking genes, suggesting that tTFIID activity is limited to TCE-containing genes but that tTFIID is not an obligatory regulator of TCE-containing genes. Thus, the TCE is a core promoter element in a subset of genes that are specifically expressed in testes. Furthermore, the TCE regulates transcription in the context of short genomic regions, from variable locations in the core promoter, and both dependently and independently of tTFIID. These findings set the stage for determining the mechanism by which the TCE regulates testis-specific transcription and understanding the dual role of the TCE in translational and transcriptional regulation. PMID:22984601

  15. The Drosophila Translational Control Element (TCE) is required for high-level transcription of many genes that are specifically expressed in testes.

    PubMed

    Katzenberger, Rebeccah J; Rach, Elizabeth A; Anderson, Ashley K; Ohler, Uwe; Wassarman, David A

    2012-01-01

    To investigate the importance of core promoter elements for tissue-specific transcription of RNA polymerase II genes, we examined testis-specific transcription in Drosophila melanogaster. Bioinformatic analyses of core promoter sequences from 190 genes that are specifically expressed in testes identified a 10 bp A/T-rich motif that is identical to the translational control element (TCE). The TCE functions in the 5' untranslated region of Mst(3)CGP mRNAs to repress translation, and it also functions in a heterologous gene to regulate transcription. We found that among genes with focused initiation patterns, the TCE is significantly enriched in core promoters of genes that are specifically expressed in testes but not in core promoters of genes that are specifically expressed in other tissues. The TCE is variably located in core promoters and is conserved in melanogaster subgroup species, but conservation dramatically drops in more distant species. In transgenic flies, short (300-400 bp) genomic regions containing a TCE directed testis-specific transcription of a reporter gene. Mutation of the TCE significantly reduced but did not abolish reporter gene transcription indicating that the TCE is important but not essential for transcription activation. Finally, mutation of testis-specific TFIID (tTFIID) subunits significantly reduced the transcription of a subset of endogenous TCE-containing but not TCE-lacking genes, suggesting that tTFIID activity is limited to TCE-containing genes but that tTFIID is not an obligatory regulator of TCE-containing genes. Thus, the TCE is a core promoter element in a subset of genes that are specifically expressed in testes. Furthermore, the TCE regulates transcription in the context of short genomic regions, from variable locations in the core promoter, and both dependently and independently of tTFIID. These findings set the stage for determining the mechanism by which the TCE regulates testis-specific transcription and understanding the dual role of the TCE in translational and transcriptional regulation.

  16. Expression of the core antigen gene of hepatitis B virus (HBV) in Acetobacter methanolicus using broad-host-range vectors.

    PubMed

    Schröder, R; Maassen, A; Lippoldt, A; Börner, T; von Baehr, R; Dobrowolski, P

    1991-08-01

    Using the broad-host-range promoter probe vector pRS201 for cloning of phage Acm1 promoters, we established a convenient vector system for expression of heterologous genes in different Gram-negative bacteria. The usefulness of this system was demonstrated by expression of the HBV core gene in Acetobacter methanolicus. Plasmids carrying the HBV core gene downstream of different Acm1-phage promoters were transferred to A. methanolicus, a new potential host for recombinant DNA expression. Using enzyme immunoassay and immunoblot techniques, the amount and composition of core antigen produced in A. methanolicus were compared with that derived from Escherichia coli. The expression of immunoreactive core antigen in A. methanolicus exceeds by sevenfold that in E. coli using an expression system with tandemly arranged promoters. Morphological observations by electron microscopy show that the HBV core gene products isolated from both hosts are assembled into regular spherical particles with a diameter of about 28 nm that are comparable to original viral nucleocapsids.

  17. Dynamic optical resource allocation for mobile core networks with software defined elastic optical networking.

    PubMed

    Zhao, Yongli; Chen, Zhendong; Zhang, Jie; Wang, Xinbo

    2016-07-25

    Driven by the forthcoming of 5G mobile communications, the all-IP architecture of mobile core networks, i.e. evolved packet core (EPC) proposed by 3GPP, has been greatly challenged by the users' demands for higher data rate and more reliable end-to-end connection, as well as operators' demands for low operational cost. These challenges can be potentially met by software defined optical networking (SDON), which enables dynamic resource allocation according to the users' requirement. In this article, a novel network architecture for mobile core network is proposed based on SDON. A software defined network (SDN) controller is designed to realize the coordinated control over different entities in EPC networks. We analyze the requirement of EPC-lightpath (EPCL) in data plane and propose an optical switch load balancing (OSLB) algorithm for resource allocation in optical layer. The procedure of establishment and adjustment of EPCLs is demonstrated on a SDON-based EPC testbed with extended OpenFlow protocol. We also evaluate the OSLB algorithm through simulation in terms of bandwidth blocking ratio, traffic load distribution, and resource utilization ratio compared with link-based load balancing (LLB) and MinHops algorithms.

  18. Best core stabilization exercise to facilitate subcortical neuroplasticity: A functional MRI neuroimaging study.

    PubMed

    Kim, Do Hyun; Lee, Jae Jin; You, Sung Joshua Hyun

    2018-03-23

    To investigate the effects of conscious (ADIM) and subconscious (DNS) core stabilization exercises on cortical changes in adults with core instability. Five non-symptomatic participants with core instability. A novel core stabilization task switching paradigm was designed to separate cortical or subcortical neural substrates during a series of DNS or ADIM core stabilization tasks. fMRI blood BOLD analysis revealed a distinctive subcortical activation pattern during the performance of the DNS, whereas the cortical motor network was primarily activated during an ADIM. Peak voxel volume values showed significantly greater DNS (11.08 ± 1.51) compared with the ADIM (8.81 ± 0.21) (p= 0.043). The ADIM exercise activated the cortical PMC-SMC-SMA motor network, whereas the DNS exercise activated both these same cortical areas and the subcortical cerebellum-BG-thalamus-cingulate cortex network.

  19. Construction and analysis of gene-gene dynamics influence networks based on a Boolean model.

    PubMed

    Mazaya, Maulida; Trinh, Hung-Cuong; Kwon, Yung-Keun

    2017-12-21

    Identification of novel gene-gene relations is a crucial issue to understand system-level biological phenomena. To this end, many methods based on a correlation analysis of gene expressions or structural analysis of molecular interaction networks have been proposed. They have a limitation in identifying more complicated gene-gene dynamical relations, though. To overcome this limitation, we proposed a measure to quantify a gene-gene dynamical influence (GDI) using a Boolean network model and constructed a GDI network to indicate existence of a dynamical influence for every ordered pair of genes. It represents how much a state trajectory of a target gene is changed by a knockout mutation subject to a source gene in a gene-gene molecular interaction (GMI) network. Through a topological comparison between GDI and GMI networks, we observed that the former network is denser than the latter network, which implies that there exist many gene pairs of dynamically influencing but molecularly non-interacting relations. In addition, a larger number of hub genes were generated in the GDI network. On the other hand, there was a correlation between these networks such that the degree value of a node was positively correlated to each other. We further investigated the relationships of the GDI value with structural properties and found that there are negative and positive correlations with the length of a shortest path and the number of paths, respectively. In addition, a GDI network could predict a set of genes whose steady-state expression is affected in E. coli gene-knockout experiments. More interestingly, we found that the drug-targets with side-effects have a larger number of outgoing links than the other genes in the GDI network, which implies that they are more likely to influence the dynamics of other genes. Finally, we found biological evidences showing that the gene pairs which are not molecularly interacting but dynamically influential can be considered for novel gene-gene relationships. Taken together, construction and analysis of the GDI network can be a useful approach to identify novel gene-gene relationships in terms of the dynamical influence.

  20. Global Profiling of Rice and Poplar Transcriptomes Highlights Key Conserved Circadian-Controlled Pathways and cis-Regulatory Modules

    PubMed Central

    Filichkin, Sergei A.; Breton, Ghislain; Priest, Henry D.; Dharmawardhana, Palitha; Jaiswal, Pankaj; Fox, Samuel E.; Michael, Todd P.; Chory, Joanne; Kay, Steve A.; Mockler, Todd C.

    2011-01-01

    Background Circadian clocks provide an adaptive advantage through anticipation of daily and seasonal environmental changes. In plants, the central clock oscillator is regulated by several interlocking feedback loops. It was shown that a substantial proportion of the Arabidopsis genome cycles with phases of peak expression covering the entire day. Synchronized transcriptome cycling is driven through an extensive network of diurnal and clock-regulated transcription factors and their target cis-regulatory elements. Study of the cycling transcriptome in other plant species could thus help elucidate the similarities and differences and identify hubs of regulation common to monocot and dicot plants. Methodology/Principal Findings Using a combination of oligonucleotide microarrays and data mining pipelines, we examined daily rhythms in gene expression in one monocotyledonous and one dicotyledonous plant, rice and poplar, respectively. Cycling transcriptomes were interrogated under different diurnal (driven) and circadian (free running) light and temperature conditions. Collectively, photocycles and thermocycles regulated about 60% of the expressed nuclear genes in rice and poplar. Depending on the condition tested, up to one third of oscillating Arabidopsis-poplar-rice orthologs were phased within three hours of each other suggesting a high degree of conservation in terms of rhythmic gene expression. We identified clusters of rhythmically co-expressed genes and searched their promoter sequences to identify phase-specific cis-elements, including elements that were conserved in the promoters of Arabidopsis, poplar, and rice. Conclusions/Significance Our results show that the cycling patterns of many circadian clock genes are highly conserved across poplar, rice, and Arabidopsis. The expression of many orthologous genes in key metabolic and regulatory pathways is diurnal and/or circadian regulated and phased to similar times of day. Our results confirm previous findings in Arabidopsis of three major classes of cis-regulatory modules within the plant circadian network: the morning (ME, GBOX), evening (EE, GATA), and midnight (PBX/TBX/SBX) modules. Identification of identical overrepresented motifs in the promoters of cycling genes from different species suggests that the core diurnal/circadian cis-regulatory network is deeply conserved between mono- and dicotyledonous species. PMID:21694767

  1. Hepatitis B core protein promotes liver cancer metastasis through miR-382-5p/DLC-1 axis.

    PubMed

    Du, Juan; Bai, Fuxiang; Zhao, Peiqing; Li, Xiaoyan; Li, Xueen; Gao, Lifen; Ma, Chunhong; Liang, Xiaohong

    2018-01-01

    The hepatitis B virus core protein (HBc), also named core antigen, is well-known for its key role in viral capsid formation and virus replication. Recently, studies showed that HBc has the potential to control cell biology activity by regulating host gene expression. Here, we utilized miRNA microarray to identify 24 upregulated miRNAs and 21 downregulated miRNAs in HBc-expressed HCC cells, which were involved in multiple biological processes, including cell motility. Consistently, the in vitro transwell assay and the in vivo tail-vein injection model showed HBc promotion on HCC metastasis. Further, the miRNA-target gene network analysis displayed that the deleted in liver cancer (DLC-1) gene, an important negative regulator for cell motility, was potentially targeted by several differentially expressed miRNAs in HBc-introduced cells. Introduction of miRNAs mimics or inhibitors and 3'UTR luciferase activity assay proved that miR-382-5p efficiently suppressed DLC-1 expression and its 3'-UTR luciferase reporter activity. Importantly, cotransfection of miR-382-5p mimics/inhibitors and the DLC-1 expression vector almost abrogated HBc promotion on cell motility, indicating that the miR-382-5p/DLC-1 axis is important for mediating HBc-enhanced HCC motility. Clinical HCC samples also showed a negative correlation between miR-382-5p and DLC-1 expression level. Furthermore, HBc-positive HCC tissues showed high miR-382-5p level and reduced DLC-1 expression. In conclusion, our findings revealed that HBc promoted HCC motility by regulating the miR-382-5p/DLC-1 axis, which might provide a novel target for clinical diagnosis and treatment. Copyright © 2017 Elsevier B.V. All rights reserved.

  2. Tailoring the dendrimer core for efficient gene delivery.

    PubMed

    Hu, Jingjing; Hu, Ke; Cheng, Yiyun

    2016-04-15

    Dendrimers have been widely used as non-viral gene vectors due to well-defined chemical structures, high density of cationic charges and ease of surface modification. Although a large number of studies have reported the important roles of dendrimer architecture, component, generation and surface functionality in gene delivery, the effect of dendrimer core on this issue still remains unclear. Recent literatures suggest that a slight alternation in dendrimer core has a profound effect in the transfection efficacy and biocompatibility. In this review, we will discuss the transfection mechanism of dendrimers with different types of cores in respect of flexibility, hydrophobicity and functionality. We hope to open a possibility of designing efficient dendrimers for gene delivery by choosing a proper dendrimer core. As a branch of researches on dendrimers and dendritic polymers, the design of biocompatible and high efficient polymeric gene carriers has attracted increasing attentions during these years. Although the effect of dendrimer generation, species, architecture and surface functionality on gene delivery have been widely reported, the effect of dendrimer core on this issue still remains unclear. Recent literatures suggest that a minor variation on the dendrimer core has a profound effect in the transfection efficacy and biocompatibility. This critical review summarized the dendrimers with different types of cores and discussed the transfection mechanism with particular focus on the flexibility, hydrophobicity, and functionality. It is hoped to provide a new insight to design efficient and safe dendrimer-based gene vectors by choosing a proper core. To the best of our knowledge, this is the first review on the effect of dendrimer core on gene delivery. The findings obtained in this filed are of central importance in the design of efficient polymeric gene vectors. This article will appeal a wide readership such as physical chemist, dendrimer chemist, biological chemist, pharmaceutical scientist, and biomaterial researchers. We hope that this review article can be published by Acta Biomaterialia, a top journal that publishes important reviews in the field of biomaterials science. Copyright © 2016 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.

  3. A study of structural properties of gene network graphs for mathematical modeling of integrated mosaic gene networks.

    PubMed

    Petrovskaya, Olga V; Petrovskiy, Evgeny D; Lavrik, Inna N; Ivanisenko, Vladimir A

    2017-04-01

    Gene network modeling is one of the widely used approaches in systems biology. It allows for the study of complex genetic systems function, including so-called mosaic gene networks, which consist of functionally interacting subnetworks. We conducted a study of a mosaic gene networks modeling method based on integration of models of gene subnetworks by linear control functionals. An automatic modeling of 10,000 synthetic mosaic gene regulatory networks was carried out using computer experiments on gene knockdowns/knockouts. Structural analysis of graphs of generated mosaic gene regulatory networks has revealed that the most important factor for building accurate integrated mathematical models, among those analyzed in the study, is data on expression of genes corresponding to the vertices with high properties of centrality.

  4. The impact of capacity growth in national telecommunications networks.

    PubMed

    Lord, Andrew; Soppera, Andrea; Jacquet, Arnaud

    2016-03-06

    This paper discusses both UK-based and global Internet data bandwidth growth, beginning with historical data for the BT network. We examine the time variations in consumer behaviour and how this is statistically aggregated into larger traffic loads on national core fibre communications networks. The random nature of consumer Internet behaviour, where very few consumers require maximum bandwidth simultaneously, provides the opportunity for a significant statistical gain. The paper looks at predictions for how this growth might continue over the next 10-20 years, giving estimates for the amount of bandwidth that networks should support in the future. The paper then explains how national networks are designed to accommodate these traffic levels, and the various network roles, including access, metro and core, are described. The physical layer network is put into the context of how the packet and service layers are designed and the applications and location of content are also included in an overall network overview. The specific role of content servers in alleviating core network traffic loads is highlighted. The status of the relevant transmission technologies in the access, metro and core is given, showing that these technologies, with adequate research, should be sufficient to provide bandwidth for consumers in the next 10-20 years. © 2016 The Author(s).

  5. Serotonin and the Brain's Rich Club-Association Between Molecular Genetic Variation on the TPH2 Gene and the Structural Connectome.

    PubMed

    Markett, Sebastian; de Reus, Marcel A; Reuter, Martin; Montag, Christian; Weber, Bernd; Schoene-Bake, Jan-Christoph; van den Heuvel, Martijn P

    2017-03-01

    The rich club comprises a densely mutually connected set of hub regions in the brain, thought to serve as a processing and integration core. We assessed the impact of normal variation of the tryptophane hydroxylase 2 gene's promotor region (TPH2 rs4570625) on structural connectivity of the rich club pathways by means of a candidate gene association design. Tryptophane hydroxylase 2 (TPH2) is a rate-limiting enzyme in the biosynthesis of serotonin and is known to inhibit, in addition to its role as a trans-synaptic messenger, axonal and dendritic growth. The TPH2 T-variant has been associated with reduced mRNA expression and reduced serotonin levels, which may particularly influence the development of macroscale anatomical connectivity. Here, we show larger mean connectivity in the rich club in carriers of the T-variant, suggesting potential effects of upregulation of neural connectivity growth in this central core system. In addition, by edge-removal statistics, we show that the TPH2-associated higher levels of rich club connectivity are of importance for the functioning of the total structural network. The observed association is speculated to result from an effect of serotonin levels on brain development, potentially leading to stronger structural connectivity in heavily interconnected hubs. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  6. Chemical-Gene Interactions from ToxCast Bioactivity Data ...

    EPA Pesticide Factsheets

    Characterizing the effects of chemicals in biological systems is often summarized by chemical-gene interactions, which have sparse coverage in the literature. The ToxCast chemical screening program has produced bioactivity data for nearly 2000 chemicals and over 450 gene targets. To evaluate the information gained from the ToxCast project, a ToxCast bioactivity network was created comprising ToxCast chemical-gene interactions based on assay data and compared to a chemical-gene association network from literature. The literature network was compiled from PubMed articles, excluding ToxCast publications, mapped to genes and chemicals. Genes were identified by curated associations available from NCBI while chemicals were identified by PubChem submissions. The frequencies of chemical-gene associations from the literature network were log-scaled and then compared to the ToxCast bioactivity network. In total, 140 times more chemical-gene associations were present in the ToxCast network in comparison to the literature-derived network highlighting the vast increase in chemical-gene interactions putatively elucidated by the ToxCast research program. There were 165 associations found in the literature network that were reproduced by ToxCast bioactivity data, and 336 associations in the literature network were not reproduced by the ToxCast bioactivity network. The literature network relies on the assumption that chemical-gene associations represent a true chemical-gene inte

  7. Defining the Estimated Core Genome of Bacterial Populations Using a Bayesian Decision Model

    PubMed Central

    van Tonder, Andries J.; Mistry, Shilan; Bray, James E.; Hill, Dorothea M. C.; Cody, Alison J.; Farmer, Chris L.; Klugman, Keith P.; von Gottberg, Anne; Bentley, Stephen D.; Parkhill, Julian; Jolley, Keith A.; Maiden, Martin C. J.; Brueggemann, Angela B.

    2014-01-01

    The bacterial core genome is of intense interest and the volume of whole genome sequence data in the public domain available to investigate it has increased dramatically. The aim of our study was to develop a model to estimate the bacterial core genome from next-generation whole genome sequencing data and use this model to identify novel genes associated with important biological functions. Five bacterial datasets were analysed, comprising 2096 genomes in total. We developed a Bayesian decision model to estimate the number of core genes, calculated pairwise evolutionary distances (p-distances) based on nucleotide sequence diversity, and plotted the median p-distance for each core gene relative to its genome location. We designed visually-informative genome diagrams to depict areas of interest in genomes. Case studies demonstrated how the model could identify areas for further study, e.g. 25% of the core genes with higher sequence diversity in the Campylobacter jejuni and Neisseria meningitidis genomes encoded hypothetical proteins. The core gene with the highest p-distance value in C. jejuni was annotated in the reference genome as a putative hydrolase, but further work revealed that it shared sequence homology with beta-lactamase/metallo-beta-lactamases (enzymes that provide resistance to a range of broad-spectrum antibiotics) and thioredoxin reductase genes (which reduce oxidative stress and are essential for DNA replication) in other C. jejuni genomes. Our Bayesian model of estimating the core genome is principled, easy to use and can be applied to large genome datasets. This study also highlighted the lack of knowledge currently available for many core genes in bacterial genomes of significant global public health importance. PMID:25144616

  8. Transmission in Optically Transparent Core Networks

    NASA Astrophysics Data System (ADS)

    Kilper, Dan; Jensen, Rich; Petermann, Klaus; Karasek, Miroslav

    2007-03-01

    Call for Papers: Transmission in Optically Transparent Core Networks

    Guest Feature Editors

    Dan Kilper and Rich Jensen, Coordinating Associate Editors Klaus Petermann and Miroslav Karasek, Guest Feature Editors

    Submission deadline: 15 June 2007
    Optically transparent networks in which optical transport signals are routed uninterrupted through multiple nodes have long been viewed as an important evolutionary step in fiber optic communications. More than a decade of research and development on transparent network technologies together with the requisite traffic growth has culminated in the recent deployment of commercial optically transparent systems. Although many of the traditional research goals of optical transmission remain important, optical transparency introduces new challenges. Greater emphasis is placed on system efficiency and control. The goal of minimizing signal terminations, which has been pursued through increasing reach and channel capacity, also can be realized through wavelength routing techniques. Rather than bounding system operation by rigid engineering rules, the physical layer is controlled and managed by automation tools. Many static signal impairments become dynamic due to network reconfiguration and transient fault events. Recently new directions in transmission research have emerged to address transparent networking problems. This special issue of the Journal of Optical Networking will examine the technologies and theory underpinning transmission in optically transparent core networks, including both metropolitan and long haul systems.

    Scope of Submission

    The special issue editors are soliciting high-quality original research papers related to transmission in optically transparent core networks. Although this does not include edge networks such as access or enterprise networks, core networks that have access capabilities will be considered in scope as will topics related to the interworking between core and edge networks. The core network topics suitable for inclusion in this feature issue are:
    • Optically transparent system design issues, transmission experiments, and field trials
    • Optically transparent network architectures and topologies
    • Dispersion management in reconfigurable and mesh systems
    • Optically transparent network device and sub-system performance, design, characterization and control, including: amplifiers, transmitters, receivers, switches, add/drop multiplexers
    • Transient and fault management
    • Physical layer system control
    • Monitoring and compensation to support transparency
    • Wavelength routing and planning as they relate to physical layer transmission
    • Hardware cost and configuration optimization for optically transparent networks
    To submit to this special issue, follow the normal procedure for submission to JON and select "TTCN" in the features indicator of the online submission form. For all other questions relating to this feature issue, please send an e-mail to jon@osa.org, subject line "TTCN." Additional information can be found on the JON website: . Submission Deadline: 15 June 2007

  9. Multisector Health Policy Networks in 15 Large US Cities.

    PubMed

    Harris, Jenine K; Leider, J P; Carothers, Bobbi J; Castrucci, Brian C; Hearne, Shelley

    2016-01-01

    Local health departments (LHDs) have historically not prioritized policy development, although it is one of the 3 core areas they address. One strategy that may influence policy in LHD jurisdictions is the formation of partnerships across sectors to work together on local public health policy. We used a network approach to examine LHD local health policy partnerships across 15 large cities from the Big Cities Health Coalition. We surveyed the health departments and their partners about their working relationships in 5 policy areas: core local funding, tobacco control, obesity and chronic disease, violence and injury prevention, and infant mortality. Drawing on prior literature linking network structures with performance, we examined network density, transitivity, centralization and centrality, member diversity, and assortativity of ties. Networks included an average of 21.8 organizations. Nonprofits and government agencies made up the largest proportions of the networks, with 28.8% and 21.7% of network members, whereas for-profits and foundations made up the smallest proportions in all of the networks, with just 1.2% and 2.4% on average. Mean values of density, transitivity, diversity, assortativity, centralization, and centrality showed similarity across policy areas and most LHDs. The tobacco control and obesity/chronic disease networks were densest and most diverse, whereas the infant mortality policy networks were the most centralized and had the highest assortativity. Core local funding policy networks had lower scores than other policy area networks by most network measures. Urban LHDs partner with organizations from diverse sectors to conduct local public health policy work. Network structures are similar across policy areas jurisdictions. Obesity and chronic disease, tobacco control, and infant mortality networks had structures consistent with higher performing networks, whereas core local funding networks had structures consistent with lower performing networks.

  10. Multisector Health Policy Networks in 15 Large US Cities

    PubMed Central

    Leider, J. P.; Carothers, Bobbi J.; Castrucci, Brian C.; Hearne, Shelley

    2016-01-01

    Context: Local health departments (LHDs) have historically not prioritized policy development, although it is one of the 3 core areas they address. One strategy that may influence policy in LHD jurisdictions is the formation of partnerships across sectors to work together on local public health policy. Design: We used a network approach to examine LHD local health policy partnerships across 15 large cities from the Big Cities Health Coalition. Setting/Participants: We surveyed the health departments and their partners about their working relationships in 5 policy areas: core local funding, tobacco control, obesity and chronic disease, violence and injury prevention, and infant mortality. Outcome Measures: Drawing on prior literature linking network structures with performance, we examined network density, transitivity, centralization and centrality, member diversity, and assortativity of ties. Results: Networks included an average of 21.8 organizations. Nonprofits and government agencies made up the largest proportions of the networks, with 28.8% and 21.7% of network members, whereas for-profits and foundations made up the smallest proportions in all of the networks, with just 1.2% and 2.4% on average. Mean values of density, transitivity, diversity, assortativity, centralization, and centrality showed similarity across policy areas and most LHDs. The tobacco control and obesity/chronic disease networks were densest and most diverse, whereas the infant mortality policy networks were the most centralized and had the highest assortativity. Core local funding policy networks had lower scores than other policy area networks by most network measures. Conclusion: Urban LHDs partner with organizations from diverse sectors to conduct local public health policy work. Network structures are similar across policy areas jurisdictions. Obesity and chronic disease, tobacco control, and infant mortality networks had structures consistent with higher performing networks, whereas core local funding networks had structures consistent with lower performing networks. PMID:26910868

  11. Inferring gene dependency network specific to phenotypic alteration based on gene expression data and clinical information of breast cancer.

    PubMed

    Zhou, Xionghui; Liu, Juan

    2014-01-01

    Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for phenotypic change.

  12. Diversity and interactions of microbial functional genes under differing environmental conditions: insights from a membrane bioreactor and an oxidation ditch.

    PubMed

    Xia, Yu; Hu, Man; Wen, Xianghua; Wang, Xiaohui; Yang, Yunfeng; Zhou, Jizhong

    2016-01-08

    The effect of environmental conditions on the diversity and interactions of microbial communities has caused tremendous interest in microbial ecology. Here, we found that with identical influents but differing operational parameters (mainly mixed liquor suspended solid (MLSS) concentrations, solid retention time (SRT) and dissolved oxygen (DO) concentrations), two full-scale municipal wastewater treatment systems applying oxidation ditch (OD) and membrane bioreactor (MBR) processes harbored a majority of shared genes (87.2%) but had different overall functional gene structures as revealed by two datasets of 12-day time-series generated by a functional gene array-GeoChip 4.2. Association networks of core carbon, nitrogen and phosphorus cycling genes in each system based on random matrix theory (RMT) showed different topological properties and the MBR nodes showed an indication of higher connectivity. MLSS and DO were shown to be effective in shaping functional gene structures of the systems by statistical analyses. Higher MLSS concentrations resulting in decreased resource availability of the MBR system were thought to promote positive interactions of important functional genes. Together, these findings show the differences of functional potentials of some bioprocesses caused by differing environmental conditions and suggest that higher stress of resource limitation increased positive gene interactions in the MBR system.

  13. Diversity and interactions of microbial functional genes under differing environmental conditions: insights from a membrane bioreactor and an oxidation ditch

    NASA Astrophysics Data System (ADS)

    Xia, Yu; Hu, Man; Wen, Xianghua; Wang, Xiaohui; Yang, Yunfeng; Zhou, Jizhong

    2016-01-01

    The effect of environmental conditions on the diversity and interactions of microbial communities has caused tremendous interest in microbial ecology. Here, we found that with identical influents but differing operational parameters (mainly mixed liquor suspended solid (MLSS) concentrations, solid retention time (SRT) and dissolved oxygen (DO) concentrations), two full-scale municipal wastewater treatment systems applying oxidation ditch (OD) and membrane bioreactor (MBR) processes harbored a majority of shared genes (87.2%) but had different overall functional gene structures as revealed by two datasets of 12-day time-series generated by a functional gene array-GeoChip 4.2. Association networks of core carbon, nitrogen and phosphorus cycling genes in each system based on random matrix theory (RMT) showed different topological properties and the MBR nodes showed an indication of higher connectivity. MLSS and DO were shown to be effective in shaping functional gene structures of the systems by statistical analyses. Higher MLSS concentrations resulting in decreased resource availability of the MBR system were thought to promote positive interactions of important functional genes. Together, these findings show the differences of functional potentials of some bioprocesses caused by differing environmental conditions and suggest that higher stress of resource limitation increased positive gene interactions in the MBR system.

  14. Diversity and interactions of microbial functional genes under differing environmental conditions: insights from a membrane bioreactor and an oxidation ditch

    PubMed Central

    Xia, Yu; Hu, Man; Wen, Xianghua; Wang, Xiaohui; Yang, Yunfeng; Zhou, Jizhong

    2016-01-01

    The effect of environmental conditions on the diversity and interactions of microbial communities has caused tremendous interest in microbial ecology. Here, we found that with identical influents but differing operational parameters (mainly mixed liquor suspended solid (MLSS) concentrations, solid retention time (SRT) and dissolved oxygen (DO) concentrations), two full-scale municipal wastewater treatment systems applying oxidation ditch (OD) and membrane bioreactor (MBR) processes harbored a majority of shared genes (87.2%) but had different overall functional gene structures as revealed by two datasets of 12-day time-series generated by a functional gene array-GeoChip 4.2. Association networks of core carbon, nitrogen and phosphorus cycling genes in each system based on random matrix theory (RMT) showed different topological properties and the MBR nodes showed an indication of higher connectivity. MLSS and DO were shown to be effective in shaping functional gene structures of the systems by statistical analyses. Higher MLSS concentrations resulting in decreased resource availability of the MBR system were thought to promote positive interactions of important functional genes. Together, these findings show the differences of functional potentials of some bioprocesses caused by differing environmental conditions and suggest that higher stress of resource limitation increased positive gene interactions in the MBR system. PMID:26743465

  15. Microarray Analysis of Differential Gene Expression Profile Between Human Fetal and Adult Heart.

    PubMed

    Geng, Zhimin; Wang, Jue; Pan, Lulu; Li, Ming; Zhang, Jitai; Cai, Xueli; Chu, Maoping

    2017-04-01

    Although many changes have been discovered during heart maturation, the genetic mechanisms involved in the changes between immature and mature myocardium have only been partially elucidated. Here, gene expression profile changed between the human fetal and adult heart was characterized. A human microarray was applied to define the gene expression signatures of the fetal (13-17 weeks of gestation, n = 4) and adult hearts (30-40 years old, n = 4). Gene ontology analyses, pathway analyses, gene set enrichment analyses, and signal transduction network were performed to predict the function of the differentially expressed genes. Ten mRNAs were confirmed by quantificational real-time polymerase chain reaction. 5547 mRNAs were found to be significantly differentially expressed. "Cell cycle" was the most enriched pathway in the down-regulated genes. EFGR, IGF1R, and ITGB1 play a central role in the regulation of heart development. EGFR, IGF1R, and FGFR2 were the core genes regulating cardiac cell proliferation. The quantificational real-time polymerase chain reaction results were concordant with the microarray data. Our data identified the transcriptional regulation of heart development in the second trimester and the potential regulators that play a prominent role in the regulation of heart development and cardiac cells proliferation.

  16. Analysis of bHLH coding genes using gene co-expression network approach.

    PubMed

    Srivastava, Swati; Sanchita; Singh, Garima; Singh, Noopur; Srivastava, Gaurava; Sharma, Ashok

    2016-07-01

    Network analysis provides a powerful framework for the interpretation of data. It uses novel reference network-based metrices for module evolution. These could be used to identify module of highly connected genes showing variation in co-expression network. In this study, a co-expression network-based approach was used for analyzing the genes from microarray data. Our approach consists of a simple but robust rank-based network construction. The publicly available gene expression data of Solanum tuberosum under cold and heat stresses were considered to create and analyze a gene co-expression network. The analysis provide highly co-expressed module of bHLH coding genes based on correlation values. Our approach was to analyze the variation of genes expression, according to the time period of stress through co-expression network approach. As the result, the seed genes were identified showing multiple connections with other genes in the same cluster. Seed genes were found to be vary in different time periods of stress. These analyzed seed genes may be utilized further as marker genes for developing the stress tolerant plant species.

  17. Prioritizing chronic obstructive pulmonary disease (COPD) candidate genes in COPD-related networks

    PubMed Central

    Zhang, Yihua; Li, Wan; Feng, Yuyan; Guo, Shanshan; Zhao, Xilei; Wang, Yahui; He, Yuehan; He, Weiming; Chen, Lina

    2017-01-01

    Chronic obstructive pulmonary disease (COPD) is a multi-factor disease, which could be caused by many factors, including disturbances of metabolism and protein-protein interactions (PPIs). In this paper, a weighted COPD-related metabolic network and a weighted COPD-related PPI network were constructed base on COPD disease genes and functional information. Candidate genes in these weighted COPD-related networks were prioritized by making use of a gene prioritization method, respectively. Literature review and functional enrichment analysis of the top 100 genes in these two networks suggested the correlation of COPD and these genes. The performance of our gene prioritization method was superior to that of ToppGene and ToppNet for genes from the COPD-related metabolic network or the COPD-related PPI network after assessing using leave-one-out cross-validation, literature validation and functional enrichment analysis. The top-ranked genes prioritized from COPD-related metabolic and PPI networks could promote the better understanding about the molecular mechanism of this disease from different perspectives. The top 100 genes in COPD-related metabolic network or COPD-related PPI network might be potential markers for the diagnosis and treatment of COPD. PMID:29262568

  18. Prioritizing chronic obstructive pulmonary disease (COPD) candidate genes in COPD-related networks.

    PubMed

    Zhang, Yihua; Li, Wan; Feng, Yuyan; Guo, Shanshan; Zhao, Xilei; Wang, Yahui; He, Yuehan; He, Weiming; Chen, Lina

    2017-11-28

    Chronic obstructive pulmonary disease (COPD) is a multi-factor disease, which could be caused by many factors, including disturbances of metabolism and protein-protein interactions (PPIs). In this paper, a weighted COPD-related metabolic network and a weighted COPD-related PPI network were constructed base on COPD disease genes and functional information. Candidate genes in these weighted COPD-related networks were prioritized by making use of a gene prioritization method, respectively. Literature review and functional enrichment analysis of the top 100 genes in these two networks suggested the correlation of COPD and these genes. The performance of our gene prioritization method was superior to that of ToppGene and ToppNet for genes from the COPD-related metabolic network or the COPD-related PPI network after assessing using leave-one-out cross-validation, literature validation and functional enrichment analysis. The top-ranked genes prioritized from COPD-related metabolic and PPI networks could promote the better understanding about the molecular mechanism of this disease from different perspectives. The top 100 genes in COPD-related metabolic network or COPD-related PPI network might be potential markers for the diagnosis and treatment of COPD.

  19. Parallel Algorithms for Monte Carlo Particle Transport Simulation on Exascale Computing Architectures

    NASA Astrophysics Data System (ADS)

    Romano, Paul Kollath

    Monte Carlo particle transport methods are being considered as a viable option for high-fidelity simulation of nuclear reactors. While Monte Carlo methods offer several potential advantages over deterministic methods, there are a number of algorithmic shortcomings that would prevent their immediate adoption for full-core analyses. In this thesis, algorithms are proposed both to ameliorate the degradation in parallel efficiency typically observed for large numbers of processors and to offer a means of decomposing large tally data that will be needed for reactor analysis. A nearest-neighbor fission bank algorithm was proposed and subsequently implemented in the OpenMC Monte Carlo code. A theoretical analysis of the communication pattern shows that the expected cost is O( N ) whereas traditional fission bank algorithms are O(N) at best. The algorithm was tested on two supercomputers, the Intrepid Blue Gene/P and the Titan Cray XK7, and demonstrated nearly linear parallel scaling up to 163,840 processor cores on a full-core benchmark problem. An algorithm for reducing network communication arising from tally reduction was analyzed and implemented in OpenMC. The proposed algorithm groups only particle histories on a single processor into batches for tally purposes---in doing so it prevents all network communication for tallies until the very end of the simulation. The algorithm was tested, again on a full-core benchmark, and shown to reduce network communication substantially. A model was developed to predict the impact of load imbalances on the performance of domain decomposed simulations. The analysis demonstrated that load imbalances in domain decomposed simulations arise from two distinct phenomena: non-uniform particle densities and non-uniform spatial leakage. The dominant performance penalty for domain decomposition was shown to come from these physical effects rather than insufficient network bandwidth or high latency. The model predictions were verified with measured data from simulations in OpenMC on a full-core benchmark problem. Finally, a novel algorithm for decomposing large tally data was proposed, analyzed, and implemented/tested in OpenMC. The algorithm relies on disjoint sets of compute processes and tally servers. The analysis showed that for a range of parameters relevant to LWR analysis, the tally server algorithm should perform with minimal overhead. Tests were performed on Intrepid and Titan and demonstrated that the algorithm did indeed perform well over a wide range of parameters. (Copies available exclusively from MIT Libraries, libraries.mit.edu/docs - docs mit.edu)

  20. Synergistic and Dose-Controlled Regulation of Cellulase Gene Expression in Penicillium oxalicum.

    PubMed

    Li, Zhonghai; Yao, Guangshan; Wu, Ruimei; Gao, Liwei; Kan, Qinbiao; Liu, Meng; Yang, Piao; Liu, Guodong; Qin, Yuqi; Song, Xin; Zhong, Yaohua; Fang, Xu; Qu, Yinbo

    2015-09-01

    Filamentous fungus Penicillium oxalicum produces diverse lignocellulolytic enzymes, which are regulated by the combinations of many transcription factors. Here, a single-gene disruptant library for 470 transcription factors was constructed and systematically screened for cellulase production. Twenty transcription factors (including ClrB, CreA, XlnR, Ace1, AmyR, and 15 unknown proteins) were identified to play putative roles in the activation or repression of cellulase synthesis. Most of these regulators have not been characterized in any fungi before. We identified the ClrB, CreA, XlnR, and AmyR transcription factors as critical dose-dependent regulators of cellulase expression, the core regulons of which were identified by analyzing several transcriptomes and/or secretomes. Synergistic and additive modes of combinatorial control of each cellulase gene by these regulatory factors were achieved, and cellulase expression was fine-tuned in a proper and controlled manner. With one of these targets, the expression of the major intracellular β-glucosidase Bgl2 was found to be dependent on ClrB. The Bgl2-deficient background resulted in a substantial gene activation by ClrB and proved to be closely correlated with the relief of repression mediated by CreA and AmyR during cellulase induction. Our results also signify that probing the synergistic and dose-controlled regulation mechanisms of cellulolytic regulators and using it for reconstruction of expression regulation network (RERN) may be a promising strategy for cellulolytic fungi to develop enzyme hyper-producers. Based on our data, ClrB was identified as focal point for the synergistic activation regulation of cellulase expression by integrating cellulolytic regulators and their target genes, which refined our understanding of transcriptional-regulatory network as a "seesaw model" in which the coordinated regulation of cellulolytic genes is established by counteracting activators and repressors.

  1. Synergistic and Dose-Controlled Regulation of Cellulase Gene Expression in Penicillium oxalicum

    PubMed Central

    Li, Zhonghai; Yao, Guangshan; Wu, Ruimei; Gao, Liwei; Kan, Qinbiao; Liu, Meng; Yang, Piao; Liu, Guodong; Qin, Yuqi; Song, Xin; Zhong, Yaohua; Fang, Xu; Qu, Yinbo

    2015-01-01

    Filamentous fungus Penicillium oxalicum produces diverse lignocellulolytic enzymes, which are regulated by the combinations of many transcription factors. Here, a single-gene disruptant library for 470 transcription factors was constructed and systematically screened for cellulase production. Twenty transcription factors (including ClrB, CreA, XlnR, Ace1, AmyR, and 15 unknown proteins) were identified to play putative roles in the activation or repression of cellulase synthesis. Most of these regulators have not been characterized in any fungi before. We identified the ClrB, CreA, XlnR, and AmyR transcription factors as critical dose-dependent regulators of cellulase expression, the core regulons of which were identified by analyzing several transcriptomes and/or secretomes. Synergistic and additive modes of combinatorial control of each cellulase gene by these regulatory factors were achieved, and cellulase expression was fine-tuned in a proper and controlled manner. With one of these targets, the expression of the major intracellular β-glucosidase Bgl2 was found to be dependent on ClrB. The Bgl2-deficient background resulted in a substantial gene activation by ClrB and proved to be closely correlated with the relief of repression mediated by CreA and AmyR during cellulase induction. Our results also signify that probing the synergistic and dose-controlled regulation mechanisms of cellulolytic regulators and using it for reconstruction of expression regulation network (RERN) may be a promising strategy for cellulolytic fungi to develop enzyme hyper-producers. Based on our data, ClrB was identified as focal point for the synergistic activation regulation of cellulase expression by integrating cellulolytic regulators and their target genes, which refined our understanding of transcriptional-regulatory network as a “seesaw model” in which the coordinated regulation of cellulolytic genes is established by counteracting activators and repressors. PMID:26360497

  2. 300 Gb/s IM/DD based SDM-WDM-PON with laserless ONUs.

    PubMed

    Bao, Fangdi; Morioka, Toshio; Oxenløwe, Leif K; Hu, Hao

    2018-04-02

    A low-cost, high-speed SDM-WDM-PON architecture is proposed by using a multi-core fiber (MCF) and intensity modulation/directly detection (IM/DD). One of the MCF cores is used for sending laser sources from optical line terminal (OLT) to optical network unit (ONU), thus facilitating laserless and colorless ONUs, and providing ease of network management and maintenance. In addition, the wavelengths of the ONUs are controlled on the OLT side, which also enables flexible optical networks. Thanks to the low inter-core crosstalk of a MCF, downstream (DS) and upstream (US) signals are transmitted independently in different cores of the MCF, not only increasing the aggregated capacity but also avoiding the Rayleigh backscattering noise. Finally, a proof-of-principle experiment is performed by using a 7-core fiber, achieving 300 /120 Gb/s aggregated capacity for DS and US (3 × cores, 4 × wavelengths, 25/10 Gb/s per wavelength), respectively.

  3. Feasibility of using tissue microarray cores of paraffin-embedded breast cancer tissue for measurement of gene expression: a proof-of-concept study.

    PubMed

    Drury, Suzanne; Salter, Janine; Baehner, Frederick L; Shak, Steven; Dowsett, Mitch

    2010-06-01

    To determine whether 0.6 mm cores of formalin-fixed paraffin-embedded (FFPE) tissue, as commonly used to construct immunohistochemical tissue microarrays, may be a valid alternative to tissue sections as source material for quantitative real-time PCR-based transcriptional profiling of breast cancer. Four matched 0.6 mm cores of invasive breast tumour and two 10 microm whole sections were taken from eight FFPE blocks. RNA was extracted and reverse transcribed, and TaqMan assays were performed on the 21 genes of the Oncotype DX Breast Cancer assay. Expression of the 16 recurrence-related genes was normalised to the set of five reference genes, and the recurrence score (RS) was calculated. RNA yield was lower from 0.6 mm cores than from 10 microm whole sections, but was still more than sufficient to perform the assay. RS and single gene data from cores were highly comparable with those from whole sections (RS p=0.005). Greater variability was seen between cores than between sections. FFPE sections are preferable to 0.6 mm cores for RNA profiling in order to maximise RNA yield and to allow for standard histopathological assessment. However, 0.6 mm cores are sufficient and would be appropriate to use for large cohort studies.

  4. Role of HCV Core gene of genotype 1a and 3a and host gene Cox-2 in HCV-induced pathogenesis

    PubMed Central

    2011-01-01

    Background Hepatitis C virus (HCV) Core protein is thought to trigger activation of multiple signaling pathways and play a significant role in the alteration of cellular gene expression responsible for HCV pathogenesis leading to hepatocellular carcinoma (HCC). However, the exact molecular mechanism of HCV genome specific pathogenesis remains unclear. We examined the in vitro effects of HCV Core protein of HCV genotype 3a and 1a on the cellular genes involved in oxidative stress and angiogenesis. We also studied the ability of HCV Core and Cox-2 siRNA either alone or in combination to inhibit viral replication and cell proliferation in HCV serum infected Huh-7 cells. Results Over expression of Core gene of HCV 3a genotype showed stronger effect in regulating RNA and protein levels of Cox-2, iNOS, VEGF, p-Akt as compared to HCV-1a Core in hepatocellular carcinoma cell line Huh-7 accompanied by enhanced PGE2 release and cell proliferation. We also observed higher expression levels of above genes in HCV 3a patient's blood and biopsy samples. Interestingly, the Core and Cox-2-specific siRNAs down regulated the Core 3a-enhanced expression of Cox-2, iNOS, VEGF, p-Akt. Furthermore, the combined siRNA treatment also showed a dramatic reduction in viral titer and expression of these genes in HCV serum-infected Huh-7 cells. Taken together, these results demonstrated a differential response by HCV 3a genotype in HCV-induced pathogenesis, which may be due to Core and host factor Cox-2 individually or in combination. Conclusions Collectively, these studies not only suggest a genotype-specific interaction between key players of HCV pathogenesis but also may represent combined viral and host gene silencing as a potential therapeutic strategy. PMID:21457561

  5. The impact of tropical forest logging and oil palm agriculture on the soil microbiome.

    PubMed

    Tripathi, Binu M; Edwards, David P; Mendes, Lucas William; Kim, Mincheol; Dong, Ke; Kim, Hyoki; Adams, Jonathan M

    2016-05-01

    Selective logging and forest conversion to oil palm agriculture are rapidly altering tropical forests. However, functional responses of the soil microbiome to these land-use changes are poorly understood. Using 16S rRNA gene and shotgun metagenomic sequencing, we compared composition and functional attributes of soil biota between unlogged, once-logged and twice-logged rainforest, and areas converted to oil palm plantations in Sabah, Borneo. Although there was no significant effect of logging history, we found a significant difference between the taxonomic and functional composition of both primary and logged forests and oil palm. Oil palm had greater abundances of genes associated with DNA, RNA, protein metabolism and other core metabolic functions, but conversely, lower abundance of genes associated with secondary metabolism and cell-cell interactions, indicating less importance of antagonism or mutualism in the more oligotrophic oil palm environment. Overall, these results show a striking difference in taxonomic composition and functional gene diversity of soil microorganisms between oil palm and forest, but no significant difference between primary forest and forest areas with differing logging history. This reinforces the view that logged forest retains most features and functions of the original soil community. However, networks based on strong correlations between taxonomy and functions showed that network complexity is unexpectedly increased due to both logging and oil palm agriculture, which suggests a pervasive effect of both land-use changes on the interaction of soil microbes. © 2016 John Wiley & Sons Ltd.

  6. SABRE is required for stabilization of root hair patterning in Arabidopsis thaliana.

    PubMed

    Pietra, Stefano; Lang, Patricia; Grebe, Markus

    2015-03-01

    Patterned differentiation of distinct cell types is essential for the development of multicellular organisms. The root epidermis of Arabidopsis thaliana is composed of alternating files of root hair and non-hair cells and represents a model system for studying the control of cell-fate acquisition. Epidermal cell fate is regulated by a network of genes that translate positional information from the underlying cortical cell layer into a specific pattern of differentiated cells. While much is known about the genes of this network, new players continue to be discovered. Here we show that the SABRE (SAB) gene, known to mediate microtubule organization, anisotropic cell growth and planar polarity, has an effect on root epidermal hair cell patterning. Loss of SAB function results in ectopic root hair formation and destabilizes the expression of cell fate and differentiation markers in the root epidermis, including expression of the WEREWOLF (WER) and GLABRA2 (GL2) genes. Double mutant analysis reveal that wer and caprice (cpc) mutants, defective in core components of the epidermal patterning pathway, genetically interact with sab. This suggests that SAB may act on epidermal patterning upstream of WER and CPC. Hence, we provide evidence for a role of SAB in root epidermal patterning by affecting cell-fate stabilization. Our work opens the door for future studies addressing SAB-dependent functions of the cytoskeleton during root epidermal patterning. © 2014 The Authors. Physiologia Plantarum published by John Wiley & Sons Ltd on behalf of Scandinavian Plant Physiology Society.

  7. Mining Gene Regulatory Networks by Neural Modeling of Expression Time-Series.

    PubMed

    Rubiolo, Mariano; Milone, Diego H; Stegmayer, Georgina

    2015-01-01

    Discovering gene regulatory networks from data is one of the most studied topics in recent years. Neural networks can be successfully used to infer an underlying gene network by modeling expression profiles as times series. This work proposes a novel method based on a pool of neural networks for obtaining a gene regulatory network from a gene expression dataset. They are used for modeling each possible interaction between pairs of genes in the dataset, and a set of mining rules is applied to accurately detect the subjacent relations among genes. The results obtained on artificial and real datasets confirm the method effectiveness for discovering regulatory networks from a proper modeling of the temporal dynamics of gene expression profiles.

  8. Flexibility and symmetry of prokaryotic genome rearrangement reveal lineage-associated core-gene-defined genome organizational frameworks.

    PubMed

    Kang, Yu; Gu, Chaohao; Yuan, Lina; Wang, Yue; Zhu, Yanmin; Li, Xinna; Luo, Qibin; Xiao, Jingfa; Jiang, Daquan; Qian, Minping; Ahmed Khan, Aftab; Chen, Fei; Zhang, Zhang; Yu, Jun

    2014-11-25

    The prokaryotic pangenome partitions genes into core and dispensable genes. The order of core genes, albeit assumed to be stable under selection in general, is frequently interrupted by horizontal gene transfer and rearrangement, but how a core-gene-defined genome maintains its stability or flexibility remains to be investigated. Based on data from 30 species, including 425 genomes from six phyla, we grouped core genes into syntenic blocks in the context of a pangenome according to their stability across multiple isolates. A subset of the core genes, often species specific and lineage associated, formed a core-gene-defined genome organizational framework (cGOF). Such cGOFs are either single segmental (one-third of the species analyzed) or multisegmental (the rest). Multisegment cGOFs were further classified into symmetric or asymmetric according to segment orientations toward the origin-terminus axis. The cGOFs in Gram-positive species are exclusively symmetric and often reversible in orientation, as opposed to those of the Gram-negative bacteria, which are all asymmetric and irreversible. Meanwhile, all species showing strong strand-biased gene distribution contain symmetric cGOFs and often specific DnaE (α subunit of DNA polymerase III) isoforms. Furthermore, functional evaluations revealed that cGOF genes are hub associated with regard to cellular activities, and the stability of cGOF provides efficient indexes for scaffold orientation as demonstrated by assembling virtual and empirical genome drafts. cGOFs show species specificity, and the symmetry of multisegmental cGOFs is conserved among taxa and constrained by DNA polymerase-centric strand-biased gene distribution. The definition of species-specific cGOFs provides powerful guidance for genome assembly and other structure-based analysis. Prokaryotic genomes are frequently interrupted by horizontal gene transfer (HGT) and rearrangement. To know whether there is a set of genes not only conserved in position among isolates but also functionally essential for a given species and to further evaluate the stability or flexibility of such genome structures across lineages are of importance. Based on a large number of multi-isolate pangenomic data, our analysis reveals that a subset of core genes is organized into a core-gene-defined genome organizational framework, or cGOF. Furthermore, the lineage-associated cGOFs among Gram-positive and Gram-negative bacteria behave differently: the former, composed of 2 to 4 segments, have their fragments symmetrically rearranged around the origin-terminus axis, whereas the latter show more complex segmentation and are partitioned asymmetrically into chromosomal structures. The definition of cGOFs provides new insights into prokaryotic genome organization and efficient guidance for genome assembly and analysis. Copyright © 2014 Kang et al.

  9. Constructing an integrated gene similarity network for the identification of disease genes.

    PubMed

    Tian, Zhen; Guo, Maozu; Wang, Chunyu; Xing, LinLin; Wang, Lei; Zhang, Yin

    2017-09-20

    Discovering novel genes that are involved human diseases is a challenging task in biomedical research. In recent years, several computational approaches have been proposed to prioritize candidate disease genes. Most of these methods are mainly based on protein-protein interaction (PPI) networks. However, since these PPI networks contain false positives and only cover less half of known human genes, their reliability and coverage are very low. Therefore, it is highly necessary to fuse multiple genomic data to construct a credible gene similarity network and then infer disease genes on the whole genomic scale. We proposed a novel method, named RWRB, to infer causal genes of interested diseases. First, we construct five individual gene (protein) similarity networks based on multiple genomic data of human genes. Then, an integrated gene similarity network (IGSN) is reconstructed based on similarity network fusion (SNF) method. Finally, we employee the random walk with restart algorithm on the phenotype-gene bilayer network, which combines phenotype similarity network, IGSN as well as phenotype-gene association network, to prioritize candidate disease genes. We investigate the effectiveness of RWRB through leave-one-out cross-validation methods in inferring phenotype-gene relationships. Results show that RWRB is more accurate than state-of-the-art methods on most evaluation metrics. Further analysis shows that the success of RWRB is benefited from IGSN which has a wider coverage and higher reliability comparing with current PPI networks. Moreover, we conduct a comprehensive case study for Alzheimer's disease and predict some novel disease genes that supported by literature. RWRB is an effective and reliable algorithm in prioritizing candidate disease genes on the genomic scale. Software and supplementary information are available at http://nclab.hit.edu.cn/~tianzhen/RWRB/ .

  10. A Network of HSPG Core Proteins and HS Modifying Enzymes Regulates Netrin-Dependent Guidance of D-Type Motor Neurons in Caenorhabditis elegans

    PubMed Central

    Gysi, Stephan; Rhiner, Christa; Flibotte, Stephane; Moerman, Donald G.; Hengartner, Michael O.

    2013-01-01

    Heparan sulfate proteoglycans (HSPGs) are proteins with long covalently attached sugar side chains of the heparan sulfate (HS) type. Depending on the cellular context HS chains carry multiple structural modifications such as sulfate residues or epimerized sugars allowing them to bind to a wide range of molecules. HSPGs have been found to play extremely diverse roles in animal development and were shown to interact with certain axon guidance molecules. In this study we describe the role of the Caenorhabditis elegans HSPG core proteins Syndecan (SDN-1) and Glypican (LON-2) and the HS modifying enzymes in the dorsal guidance of D-type motor axons, a process controlled mainly by the conserved axon guidance molecule UNC-6/Netrin. Our genetic analysis established the specific HS code relevant for this axon guidance event. Using two sensitized genetic backgrounds, we isolated novel components influencing D-type motor axon guidance with a link to HSPGs, as well as new alleles of several previously characterized axon guidance genes. Interestingly, the dorsal axon guidance defects induced by mutations in zfp-1 or lin-35 depended on the transgene oxIs12 used to visualize the D-type motor neurons. oxIs12 is a large multi-copy transgene that enlarges the X chromosome by approximately 20%. In a search for genes with a comparable phenotype we found that a mutation in the known dosage compensation gene dpy-21 showed similar axon guidance defects as zfp-1 or lin-35 mutants. Thus, derepression of genes on X, where many genes relevant for HS dependent axon guidance are located, might also influence axon guidance of D-type motor neurons. PMID:24066155

  11. On the Concept of Cis-regulatory Information: From Sequence Motifs to Logic Functions

    NASA Astrophysics Data System (ADS)

    Tarpine, Ryan; Istrail, Sorin

    The regulatory genome is about the “system level organization of the core genomic regulatory apparatus, and how this is the locus of causality underlying the twin phenomena of animal development and animal evolution” (E.H. Davidson. The Regulatory Genome: Gene Regulatory Networks in Development and Evolution, Academic Press, 2006). Information processing in the regulatory genome is done through regulatory states, defined as sets of transcription factors (sequence-specific DNA binding proteins which determine gene expression) that are expressed and active at the same time. The core information processing machinery consists of modular DNA sequence elements, called cis-modules, that interact with transcription factors. The cis-modules “read” the information contained in the regulatory state of the cell through transcription factor binding, “process” it, and directly or indirectly communicate with the basal transcription apparatus to determine gene expression. This endowment of each gene with the information-receiving capacity through their cis-regulatory modules is essential for the response to every possible regulatory state to which it might be exposed during all phases of the life cycle and in all cell types. We present here a set of challenges addressed by our CYRENE research project aimed at studying the cis-regulatory code of the regulatory genome. The CYRENE Project is devoted to (1) the construction of a database, the cis-Lexicon, containing comprehensive information across species about experimentally validated cis-regulatory modules; and (2) the software development of a next-generation genome browser, the cis-Browser, specialized for the regulatory genome. The presentation is anchored on three main computational challenges: the Gene Naming Problem, the Consensus Sequence Bottleneck Problem, and the Logic Function Inference Problem.

  12. Reverse engineering highlights potential principles of large gene regulatory network design and learning.

    PubMed

    Carré, Clément; Mas, André; Krouk, Gabriel

    2017-01-01

    Inferring transcriptional gene regulatory networks from transcriptomic datasets is a key challenge of systems biology, with potential impacts ranging from medicine to agronomy. There are several techniques used presently to experimentally assay transcription factors to target relationships, defining important information about real gene regulatory networks connections. These techniques include classical ChIP-seq, yeast one-hybrid, or more recently, DAP-seq or target technologies. These techniques are usually used to validate algorithm predictions. Here, we developed a reverse engineering approach based on mathematical and computer simulation to evaluate the impact that this prior knowledge on gene regulatory networks may have on training machine learning algorithms. First, we developed a gene regulatory networks-simulating engine called FRANK (Fast Randomizing Algorithm for Network Knowledge) that is able to simulate large gene regulatory networks (containing 10 4 genes) with characteristics of gene regulatory networks observed in vivo. FRANK also generates stable or oscillatory gene expression directly produced by the simulated gene regulatory networks. The development of FRANK leads to important general conclusions concerning the design of large and stable gene regulatory networks harboring scale free properties (built ex nihilo). In combination with supervised (accepting prior knowledge) support vector machine algorithm we (i) address biologically oriented questions concerning our capacity to accurately reconstruct gene regulatory networks and in particular we demonstrate that prior-knowledge structure is crucial for accurate learning, and (ii) draw conclusions to inform experimental design to performed learning able to solve gene regulatory networks in the future. By demonstrating that our predictions concerning the influence of the prior-knowledge structure on support vector machine learning capacity holds true on real data ( Escherichia coli K14 network reconstruction using network and transcriptomic data), we show that the formalism used to build FRANK can to some extent be a reasonable model for gene regulatory networks in real cells.

  13. Coexpression network based on natural variation in human gene expression reveals gene interactions and functions

    PubMed Central

    Nayak, Renuka R.; Kearns, Michael; Spielman, Richard S.; Cheung, Vivian G.

    2009-01-01

    Genes interact in networks to orchestrate cellular processes. Analysis of these networks provides insights into gene interactions and functions. Here, we took advantage of normal variation in human gene expression to infer gene networks, which we constructed using correlations in expression levels of more than 8.5 million gene pairs in immortalized B cells from three independent samples. The resulting networks allowed us to identify biological processes and gene functions. Among the biological pathways, we found processes such as translation and glycolysis that co-occur in the same subnetworks. We predicted the functions of poorly characterized genes, including CHCHD2 and TMEM111, and provided experimental evidence that TMEM111 is part of the endoplasmic reticulum-associated secretory pathway. We also found that IFIH1, a susceptibility gene of type 1 diabetes, interacts with YES1, which plays a role in glucose transport. Furthermore, genes that predispose to the same diseases are clustered nonrandomly in the coexpression network, suggesting that networks can provide candidate genes that influence disease susceptibility. Therefore, our analysis of gene coexpression networks offers information on the role of human genes in normal and disease processes. PMID:19797678

  14. Predicting response to primary chemotherapy: gene expression profiling of paraffin-embedded core biopsy tissue.

    PubMed

    Mina, Lida; Soule, Sharon E; Badve, Sunil; Baehner, Fredrick L; Baker, Joffre; Cronin, Maureen; Watson, Drew; Liu, Mei-Lan; Sledge, George W; Shak, Steve; Miller, Kathy D

    2007-06-01

    Primary chemotherapy provides an ideal opportunity to correlate gene expression with response to treatment. We used paraffin-embedded core biopsies from a completed phase II trial to identify genes that correlate with response to primary chemotherapy. Patients with newly diagnosed stage II or III breast cancer were treated with sequential doxorubicin 75 mg/M2 q2 wks x 3 and docetaxel 40 mg/M2 weekly x 6; treatment order was randomly assigned. Pretreatment core biopsy samples were interrogated for genes that might correlate with pathologic complete response (pCR). In addition to the individual genes, the correlation of the Oncotype DX Recurrence Score with pCR was examined. Of 70 patients enrolled in the parent trial, core biopsies samples with sufficient RNA for gene analyses were available from 45 patients; 9 (20%) had inflammatory breast cancer (IBC). Six (14%) patients achieved a pCR. Twenty-two of the 274 candidate genes assessed correlated with pCR (p < 0.05). Genes correlating with pCR could be grouped into three large clusters: angiogenesis-related genes, proliferation related genes, and invasion-related genes. Expression of estrogen receptor (ER)-related genes and Recurrence Score did not correlate with pCR. In an exploratory analysis we compared gene expression in IBC to non-inflammatory breast cancer; twenty-four (9%) of the genes were differentially expressed (p < 0.05), 5 were upregulated and 19 were downregulated in IBC. Gene expression analysis on core biopsy samples is feasible and identifies candidate genes that correlate with pCR to primary chemotherapy. Gene expression in IBC differs significantly from noninflammatory breast cancer.

  15. Structure versus time in the evolutionary diversification of avian carotenoid metabolic networks.

    PubMed

    Morrison, Erin S; Badyaev, Alexander V

    2018-05-01

    Historical associations of genes and proteins are thought to delineate pathways available to subsequent evolution; however, the effects of past functional involvements on contemporary evolution are rarely quantified. Here, we examined the extent to which the structure of a carotenoid enzymatic network persists in avian evolution. Specifically, we tested whether the evolution of carotenoid networks was most concordant with phylogenetically structured expansion from core reactions of common ancestors or with subsampling of biochemical pathway modules from an ancestral network. We compared structural and historical associations in 467 carotenoid networks of extant and ancestral species and uncovered the overwhelming effect of pre-existing metabolic network structure on carotenoid diversification over the last 50 million years of avian evolution. Over evolutionary time, birds repeatedly subsampled and recombined conserved biochemical modules, which likely maintained the overall structure of the carotenoid metabolic network during avian evolution. These findings explain the recurrent convergence of evolutionary distant species in carotenoid metabolism and weak phylogenetic signal in avian carotenoid evolution. Remarkable retention of an ancient metabolic structure throughout extensive and prolonged ecological diversification in avian carotenoid metabolism illustrates a fundamental requirement of organismal evolution - historical continuity of a deterministic network that links past and present functional associations of its components. © 2018 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2018 European Society For Evolutionary Biology.

  16. Multi-scale structure and topological anomaly detection via a new network statistic: The onion decomposition.

    PubMed

    Hébert-Dufresne, Laurent; Grochow, Joshua A; Allard, Antoine

    2016-08-18

    We introduce a network statistic that measures structural properties at the micro-, meso-, and macroscopic scales, while still being easy to compute and interpretable at a glance. Our statistic, the onion spectrum, is based on the onion decomposition, which refines the k-core decomposition, a standard network fingerprinting method. The onion spectrum is exactly as easy to compute as the k-cores: It is based on the stages at which each vertex gets removed from a graph in the standard algorithm for computing the k-cores. Yet, the onion spectrum reveals much more information about a network, and at multiple scales; for example, it can be used to quantify node heterogeneity, degree correlations, centrality, and tree- or lattice-likeness. Furthermore, unlike the k-core decomposition, the combined degree-onion spectrum immediately gives a clear local picture of the network around each node which allows the detection of interesting subgraphs whose topological structure differs from the global network organization. This local description can also be leveraged to easily generate samples from the ensemble of networks with a given joint degree-onion distribution. We demonstrate the utility of the onion spectrum for understanding both static and dynamic properties on several standard graph models and on many real-world networks.

  17. Combining Phylogenetic and Syntenic Analyses for Understanding the Evolution of TCP ECE Genes in Eudicots

    PubMed Central

    Citerne, Hélène L.; Le Guilloux, Martine; Sannier, Julie; Nadot, Sophie; Damerval, Catherine

    2013-01-01

    TCP ECE genes encode transcription factors which have received much attention for their repeated recruitment in the control of floral symmetry in core eudicots, and more recently in monocots. Major duplications of TCP ECE genes have been described in core eudicots, but the evolutionary history of this gene family is unknown in basal eudicots. Reconstructing the phylogeny of ECE genes in basal eudicots will help set a framework for understanding the functional evolution of these genes. TCP ECE genes were sequenced in all major lineages of basal eudicots and Gunnera which belongs to the sister clade to all other core eudicots. We show that in these lineages they have a complex evolutionary history with repeated duplications. We estimate the timing of the two major duplications already identified in the core eudicots within a timeframe before the divergence of Gunnera and after the divergence of Proteales. We also use a synteny-based approach to examine the extent to which the expansion of TCP ECE genes in diverse eudicot lineages may be due to genome-wide duplications. The three major core-eudicot specific clades share a number of collinear genes, and their common evolutionary history may have originated at the γ event. Genomic comparisons in Arabidopsis thaliana and Solanum lycopersicum highlight their separate polyploid origin, with syntenic fragments with and without TCP ECE genes showing differential gene loss and genomic rearrangements. Comparison between recently available genomes from two basal eudicots Aquilegia coerulea and Nelumbo nucifera suggests that the two TCP ECE paralogs in these species are also derived from large-scale duplications. TCP ECE loci from basal eudicots share many features with the three main core eudicot loci, and allow us to infer the makeup of the ancestral eudicot locus. PMID:24019982

  18. Phylogenetic analysis of the core histone doublet and DNA topo II genes of Marseilleviridae: evidence of proto-eukaryotic provenance.

    PubMed

    Erives, Albert J

    2017-11-28

    While the genomes of eukaryotes and Archaea both encode the histone-fold domain, only eukaryotes encode the core histone paralogs H2A, H2B, H3, and H4. With DNA, these core histones assemble into the nucleosomal octamer underlying eukaryotic chromatin. Importantly, core histones for H2A and H3 are maintained as neofunctionalized paralogs adapted for general bulk chromatin (canonical H2 and H3) or specialized chromatin (H2A.Z enriched at gene promoters and cenH3s enriched at centromeres). In this context, the identification of core histone-like "doublets" in the cytoplasmic replication factories of the Marseilleviridae (MV) is a novel finding with possible relevance to understanding the origin of eukaryotic chromatin. Here, we analyze and compare the core histone doublet genes from all known MV genomes as well as other MV genes relevant to the origin of the eukaryotic replisome. Using different phylogenetic approaches, we show that MV histone domains encode obligate H2B-H2A and H4-H3 dimers of possible proto-eukaryotic origin. MV core histone moieties form sister clades to each of the four eukaryotic clades of canonical and variant core histones. This suggests that MV core histone moieties diverged prior to eukaryotic neofunctionalizations associated with paired linear chromosomes and variant histone octamer assembly. We also show that MV genomes encode a proto-eukaryotic DNA topoisomerase II enzyme that forms a sister clade to eukaryotes. This is a relevant finding given that DNA topo II influences histone deposition and chromatin compaction and is the second most abundant nuclear protein after histones. The combined domain architecture and phylogenomic analyses presented here suggest that a primitive origin for MV histone genes is a more parsimonious explanation than horizontal gene transfers + gene fusions + sufficient divergence to eliminate relatedness to eukaryotic neofunctionalizations within the H2A and H3 clades without loss of relatedness to each of the four core histone clades. We thus suggest MV histone doublet genes and their DNA topo II gene possibly were acquired from an organism with a chromatinized replisome that diverged prior to the origin of eukaryotic core histone variants for H2/H2A.Z and H3/cenH3. These results also imply that core histones were utilized ancestrally in viral DNA compaction and/or protection from host endonucleases.

  19. Topology association analysis in weighted protein interaction network for gene prioritization

    NASA Astrophysics Data System (ADS)

    Wu, Shunyao; Shao, Fengjing; Zhang, Qi; Ji, Jun; Xu, Shaojie; Sun, Rencheng; Sun, Gengxin; Du, Xiangjun; Sui, Yi

    2016-11-01

    Although lots of algorithms for disease gene prediction have been proposed, the weights of edges are rarely taken into account. In this paper, the strengths of topology associations between disease and essential genes are analyzed in weighted protein interaction network. Empirical analysis demonstrates that compared to other genes, disease genes are weakly connected with essential genes in protein interaction network. Based on this finding, a novel global distance measurement for gene prioritization with weighted protein interaction network is proposed in this paper. Positive and negative flow is allocated to disease and essential genes, respectively. Additionally network propagation model is extended for weighted network. Experimental results on 110 diseases verify the effectiveness and potential of the proposed measurement. Moreover, weak links play more important role than strong links for gene prioritization, which is meaningful to deeply understand protein interaction network.

  20. New Scheduling Algorithms for Agile All-Photonic Networks

    NASA Astrophysics Data System (ADS)

    Mehri, Mohammad Saleh; Ghaffarpour Rahbar, Akbar

    2017-12-01

    An optical overlaid star network is a class of agile all-photonic networks that consists of one or more core node(s) at the center of the star network and a number of edge nodes around the core node. In this architecture, a core node may use a scheduling algorithm for transmission of traffic through the network. A core node is responsible for scheduling optical packets that arrive from edge nodes and switching them toward their destinations. Nowadays, most edge nodes use virtual output queue (VOQ) architecture for buffering client packets to achieve high throughput. This paper presents two efficient scheduling algorithms called discretionary iterative matching (DIM) and adaptive DIM. These schedulers find maximum matching in a small number of iterations and provide high throughput and incur low delay. The number of arbiters in these schedulers and the number of messages exchanged between inputs and outputs of a core node are reduced. We show that DIM and adaptive DIM can provide better performance in comparison with iterative round-robin matching with SLIP (iSLIP). SLIP means the act of sliding for a short distance to select one of the requested connections based on the scheduling algorithm.

  1. PhenomeExpress: a refined network analysis of expression datasets by inclusion of known disease phenotypes.

    PubMed

    Soul, Jamie; Hardingham, Timothy E; Boot-Handford, Raymond P; Schwartz, Jean-Marc

    2015-01-29

    We describe a new method, PhenomeExpress, for the analysis of transcriptomic datasets to identify pathogenic disease mechanisms. Our analysis method includes input from both protein-protein interaction and phenotype similarity networks. This introduces valuable information from disease relevant phenotypes, which aids the identification of sub-networks that are significantly enriched in differentially expressed genes and are related to the disease relevant phenotypes. This contrasts with many active sub-network detection methods, which rely solely on protein-protein interaction networks derived from compounded data of many unrelated biological conditions and which are therefore not specific to the context of the experiment. PhenomeExpress thus exploits readily available animal model and human disease phenotype information. It combines this prior evidence of disease phenotypes with the experimentally derived disease data sets to provide a more targeted analysis. Two case studies, in subchondral bone in osteoarthritis and in Pax5 in acute lymphoblastic leukaemia, demonstrate that PhenomeExpress identifies core disease pathways in both mouse and human disease expression datasets derived from different technologies. We also validate the approach by comparison to state-of-the-art active sub-network detection methods, which reveals how it may enhance the detection of molecular phenotypes and provide a more detailed context to those previously identified as possible candidates.

  2. "Us and them": a social network analysis of physicians' professional networks and their attitudes towards EBM.

    PubMed

    Mascia, Daniele; Cicchetti, Americo; Damiani, Gianfranco

    2013-10-22

    Extant research suggests that there is a strong social component to Evidence-Based Medicine (EBM) adoption since professional networks amongst physicians are strongly associated with their attitudes towards EBM. Despite this evidence, it is still unknown whether individual attitudes to use scientific evidence in clinical decision-making influence the position that physicians hold in their professional network. This paper explores how physicians' attitudes towards EBM is related to the network position they occupy within healthcare organizations. Data pertain to a sample of Italian physicians, whose professional network relationships, demographics and work-profile characteristics were collected. A social network analysis was performed to capture the structural importance of physicians in the collaboration network by the means of a core-periphery analysis and the computation of network centrality indicators. Then, regression analysis was used to test the association between the network position of individual clinicians and their attitudes towards EBM. Findings documented that the overall network structure is made up of a dense cohesive core of physicians and of less connected clinicians who occupy the periphery. A negative association between the physicians' attitudes towards EBM and the coreness they exhibited in the professional network was also found. Network centrality indicators confirmed these results documenting a negative association between physicians' propensity to use EBM and their structural importance in the professional network. Attitudes that physicians show towards EBM are related to the part (core or periphery) of the professional networks to which they belong as well as to their structural importance. By identifying virtuous attitudes and behaviors of professionals within their organizations, policymakers and executives may avoid marginalization and stimulate integration and continuity of care, both within and across the boundaries of healthcare providers.

  3. Large-Scale Gene Relocations following an Ancient Genome Triplication Associated with the Diversification of Core Eudicots.

    PubMed

    Wang, Yupeng; Ficklin, Stephen P; Wang, Xiyin; Feltus, F Alex; Paterson, Andrew H

    2016-01-01

    Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots.

  4. Large-Scale Gene Relocations following an Ancient Genome Triplication Associated with the Diversification of Core Eudicots

    PubMed Central

    Wang, Yupeng; Ficklin, Stephen P.; Wang, Xiyin; Feltus, F. Alex; Paterson, Andrew H.

    2016-01-01

    Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots. PMID:27195960

  5. Uncovering disease mechanisms through network biology in the era of Next Generation Sequencing

    NASA Astrophysics Data System (ADS)

    Piñero, Janet; Berenstein, Ariel; Gonzalez-Perez, Abel; Chernomoretz, Ariel; Furlong, Laura I.

    2016-04-01

    Characterizing the behavior of disease genes in the context of biological networks has the potential to shed light on disease mechanisms, and to reveal both new candidate disease genes and therapeutic targets. Previous studies addressing the network properties of disease genes have produced contradictory results. Here we have explored the causes of these discrepancies and assessed the relationship between the network roles of disease genes and their tolerance to deleterious germline variants in human populations leveraging on: the abundance of interactome resources, a comprehensive catalog of disease genes and exome variation data. We found that the most salient network features of disease genes are driven by cancer genes and that genes related to different types of diseases play network roles whose centrality is inversely correlated to their tolerance to likely deleterious germline mutations. This proved to be a multiscale signature, including global, mesoscopic and local network centrality features. Cancer driver genes, the most sensitive to deleterious variants, occupy the most central positions, followed by dominant disease genes and then by recessive disease genes, which are tolerant to variants and isolated within their network modules.

  6. Uncovering disease mechanisms through network biology in the era of Next Generation Sequencing

    PubMed Central

    Piñero, Janet; Berenstein, Ariel; Gonzalez-Perez, Abel; Chernomoretz, Ariel; Furlong, Laura I.

    2016-01-01

    Characterizing the behavior of disease genes in the context of biological networks has the potential to shed light on disease mechanisms, and to reveal both new candidate disease genes and therapeutic targets. Previous studies addressing the network properties of disease genes have produced contradictory results. Here we have explored the causes of these discrepancies and assessed the relationship between the network roles of disease genes and their tolerance to deleterious germline variants in human populations leveraging on: the abundance of interactome resources, a comprehensive catalog of disease genes and exome variation data. We found that the most salient network features of disease genes are driven by cancer genes and that genes related to different types of diseases play network roles whose centrality is inversely correlated to their tolerance to likely deleterious germline mutations. This proved to be a multiscale signature, including global, mesoscopic and local network centrality features. Cancer driver genes, the most sensitive to deleterious variants, occupy the most central positions, followed by dominant disease genes and then by recessive disease genes, which are tolerant to variants and isolated within their network modules. PMID:27080396

  7. Identifying influential spreaders in complex networks based on kshell hybrid method

    NASA Astrophysics Data System (ADS)

    Namtirtha, Amrita; Dutta, Animesh; Dutta, Biswanath

    2018-06-01

    Influential spreaders are the key players in maximizing or controlling the spreading in a complex network. Identifying the influential spreaders using kshell decomposition method has become very popular in the recent time. In the literature, the core nodes i.e. with the largest kshell index of a network are considered as the most influential spreaders. We have studied the kshell method and spreading dynamics of nodes using Susceptible-Infected-Recovered (SIR) epidemic model to understand the behavior of influential spreaders in terms of its topological location in the network. From the study, we have found that every node in the core area is not the most influential spreader. Even a strategically placed lower shell node can also be a most influential spreader. Moreover, the core area can also be situated at the periphery of the network. The existing indexing methods are only designed to identify the most influential spreaders from core nodes and not from lower shells. In this work, we propose a kshell hybrid method to identify highly influential spreaders not only from the core but also from lower shells. The proposed method comprises the parameters such as kshell power, node's degree, contact distance, and many levels of neighbors' influence potential. The proposed method is evaluated using nine real world network datasets. In terms of the spreading dynamics, the experimental results show the superiority of the proposed method over the other existing indexing methods such as the kshell method, the neighborhood coreness centrality, the mixed degree decomposition, etc. Furthermore, the proposed method can also be applied to large-scale networks by considering the three levels of neighbors' influence potential.

  8. Fundamental structures of dynamic social networks.

    PubMed

    Sekara, Vedran; Stopczynski, Arkadiusz; Lehmann, Sune

    2016-09-06

    Social systems are in a constant state of flux, with dynamics spanning from minute-by-minute changes to patterns present on the timescale of years. Accurate models of social dynamics are important for understanding the spreading of influence or diseases, formation of friendships, and the productivity of teams. Although there has been much progress on understanding complex networks over the past decade, little is known about the regularities governing the microdynamics of social networks. Here, we explore the dynamic social network of a densely-connected population of ∼1,000 individuals and their interactions in the network of real-world person-to-person proximity measured via Bluetooth, as well as their telecommunication networks, online social media contacts, geolocation, and demographic data. These high-resolution data allow us to observe social groups directly, rendering community detection unnecessary. Starting from 5-min time slices, we uncover dynamic social structures expressed on multiple timescales. On the hourly timescale, we find that gatherings are fluid, with members coming and going, but organized via a stable core of individuals. Each core represents a social context. Cores exhibit a pattern of recurring meetings across weeks and months, each with varying degrees of regularity. Taken together, these findings provide a powerful simplification of the social network, where cores represent fundamental structures expressed with strong temporal and spatial regularity. Using this framework, we explore the complex interplay between social and geospatial behavior, documenting how the formation of cores is preceded by coordination behavior in the communication networks and demonstrating that social behavior can be predicted with high precision.

  9. Fundamental structures of dynamic social networks

    PubMed Central

    Sekara, Vedran; Stopczynski, Arkadiusz; Lehmann, Sune

    2016-01-01

    Social systems are in a constant state of flux, with dynamics spanning from minute-by-minute changes to patterns present on the timescale of years. Accurate models of social dynamics are important for understanding the spreading of influence or diseases, formation of friendships, and the productivity of teams. Although there has been much progress on understanding complex networks over the past decade, little is known about the regularities governing the microdynamics of social networks. Here, we explore the dynamic social network of a densely-connected population of ∼1,000 individuals and their interactions in the network of real-world person-to-person proximity measured via Bluetooth, as well as their telecommunication networks, online social media contacts, geolocation, and demographic data. These high-resolution data allow us to observe social groups directly, rendering community detection unnecessary. Starting from 5-min time slices, we uncover dynamic social structures expressed on multiple timescales. On the hourly timescale, we find that gatherings are fluid, with members coming and going, but organized via a stable core of individuals. Each core represents a social context. Cores exhibit a pattern of recurring meetings across weeks and months, each with varying degrees of regularity. Taken together, these findings provide a powerful simplification of the social network, where cores represent fundamental structures expressed with strong temporal and spatial regularity. Using this framework, we explore the complex interplay between social and geospatial behavior, documenting how the formation of cores is preceded by coordination behavior in the communication networks and demonstrating that social behavior can be predicted with high precision. PMID:27555584

  10. Trainable Gene Regulation Networks with Applications to Drosophila Pattern Formation

    NASA Technical Reports Server (NTRS)

    Mjolsness, Eric

    2000-01-01

    This chapter will very briefly introduce and review some computational experiments in using trainable gene regulation network models to simulate and understand selected episodes in the development of the fruit fly, Drosophila melanogaster. For details the reader is referred to the papers introduced below. It will then introduce a new gene regulation network model which can describe promoter-level substructure in gene regulation. As described in chapter 2, gene regulation may be thought of as a combination of cis-acting regulation by the extended promoter of a gene (including all regulatory sequences) by way of the transcription complex, and of trans-acting regulation by the transcription factor products of other genes. If we simplify the cis-action by using a phenomenological model which can be tuned to data, such as a unit or other small portion of an artificial neural network, then the full transacting interaction between multiple genes during development can be modelled as a larger network which can again be tuned or trained to data. The larger network will in general need to have recurrent (feedback) connections since at least some real gene regulation networks do. This is the basic modeling approach taken, which describes how a set of recurrent neural networks can be used as a modeling language for multiple developmental processes including gene regulation within a single cell, cell-cell communication, and cell division. Such network models have been called "gene circuits", "gene regulation networks", or "genetic regulatory networks", sometimes without distinguishing the models from the actual modeled systems.

  11. Networked Governance in Three Policy Areas with Implications for the Common Core State Standards Initiative

    ERIC Educational Resources Information Center

    Manna, Paul

    2010-01-01

    Policy makers and researchers now recognize that designing effective institutions to govern policy networks is a major challenge of the 21st Century. Presently, the Common Core State Standards Initiative resembles an emerging network of organizations united around the goal of developing clear and challenging academic expectations for students in…

  12. Robust Learning of High-dimensional Biological Networks with Bayesian Networks

    NASA Astrophysics Data System (ADS)

    Nägele, Andreas; Dejori, Mathäus; Stetter, Martin

    Structure learning of Bayesian networks applied to gene expression data has become a potentially useful method to estimate interactions between genes. However, the NP-hardness of Bayesian network structure learning renders the reconstruction of the full genetic network with thousands of genes unfeasible. Consequently, the maximal network size is usually restricted dramatically to a small set of genes (corresponding with variables in the Bayesian network). Although this feature reduction step makes structure learning computationally tractable, on the downside, the learned structure might be adversely affected due to the introduction of missing genes. Additionally, gene expression data are usually very sparse with respect to the number of samples, i.e., the number of genes is much greater than the number of different observations. Given these problems, learning robust network features from microarray data is a challenging task. This chapter presents several approaches tackling the robustness issue in order to obtain a more reliable estimation of learned network features.

  13. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    PubMed

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.

  14. A hybrid network-based method for the detection of disease-related genes

    NASA Astrophysics Data System (ADS)

    Cui, Ying; Cai, Meng; Dai, Yang; Stanley, H. Eugene

    2018-02-01

    Detecting disease-related genes is crucial in disease diagnosis and drug design. The accepted view is that neighbors of a disease-causing gene in a molecular network tend to cause the same or similar diseases, and network-based methods have been recently developed to identify novel hereditary disease-genes in available biomedical networks. Despite the steady increase in the discovery of disease-associated genes, there is still a large fraction of disease genes that remains under the tip of the iceberg. In this paper we exploit the topological properties of the protein-protein interaction (PPI) network to detect disease-related genes. We compute, analyze, and compare the topological properties of disease genes with non-disease genes in PPI networks. We also design an improved random forest classifier based on these network topological features, and a cross-validation test confirms that our method performs better than previous similar studies.

  15. Identification of GRB2 and GAB1 Coexpression as an Unfavorable Prognostic Factor for Hepatocellular Carcinoma by a Combination of Expression Profile and Network Analysis

    PubMed Central

    Yang, Mei; Wang, Danhua; Yu, Lingxiang; Guo, Chaonan; Guo, Xiaodong; Lin, Na

    2013-01-01

    Aim To screen novel markers for hepatocellular carcinoma (HCC) by a combination of expression profile, interaction network analysis and clinical validation. Methods HCC significant molecules which are differentially expressed or had genetic variations in HCC tissues were obtained from five existing HCC related databases (OncoDB.HCC, HCC.net, dbHCCvar, EHCO and Liverome). Then, the protein-protein interaction (PPI) network of these molecules was constructed. Three topological features of the network ('Degree', 'Betweenness', and 'Closeness') and the k-core algorithm were used to screen candidate HCC markers which play crucial roles in tumorigenesis of HCC. Furthermore, the clinical significance of two candidate HCC markers growth factor receptor-bound 2 (GRB2) and GRB2-associated-binding protein 1 (GAB1) was validated. Results In total, 6179 HCC significant genes and 977 HCC significant proteins were collected from existing HCC related databases. After network analysis, 331 candidate HCC markers were identified. Especially, GAB1 has the highest k-coreness suggesting its central localization in HCC related network, and the interaction between GRB2 and GAB1 has the largest edge-betweenness implying it may be biologically important to the function of HCC related network. As the results of clinical validation, the expression levels of both GRB2 and GAB1 proteins were significantly higher in HCC tissues than those in their adjacent nonneoplastic tissues. More importantly, the combined GRB2 and GAB1 protein expression was significantly associated with aggressive tumor progression and poor prognosis in patients with HCC. Conclusion This study provided an integrative analysis by combining expression profile and interaction network analysis to identify a list of biologically significant HCC related markers and pathways. Further experimental validation indicated that the aberrant expression of GRB2 and GAB1 proteins may be strongly related to tumor progression and prognosis in patients with HCC. The overexpression of GRB2 in combination with upregulation of GAB1 may be an unfavorable prognostic factor for HCC. PMID:24391994

  16. Gene network biological validity based on gene-gene interaction relevance.

    PubMed

    Gómez-Vela, Francisco; Díaz-Díaz, Norberto

    2014-01-01

    In recent years, gene networks have become one of the most useful tools for modeling biological processes. Many inference gene network algorithms have been developed as techniques for extracting knowledge from gene expression data. Ensuring the reliability of the inferred gene relationships is a crucial task in any study in order to prove that the algorithms used are precise. Usually, this validation process can be carried out using prior biological knowledge. The metabolic pathways stored in KEGG are one of the most widely used knowledgeable sources for analyzing relationships between genes. This paper introduces a new methodology, GeneNetVal, to assess the biological validity of gene networks based on the relevance of the gene-gene interactions stored in KEGG metabolic pathways. Hence, a complete KEGG pathway conversion into a gene association network and a new matching distance based on gene-gene interaction relevance are proposed. The performance of GeneNetVal was established with three different experiments. Firstly, our proposal is tested in a comparative ROC analysis. Secondly, a randomness study is presented to show the behavior of GeneNetVal when the noise is increased in the input network. Finally, the ability of GeneNetVal to detect biological functionality of the network is shown.

  17. Optimization of neural network architecture using genetic programming improves detection and modeling of gene-gene interactions in studies of human diseases

    PubMed Central

    Ritchie, Marylyn D; White, Bill C; Parker, Joel S; Hahn, Lance W; Moore, Jason H

    2003-01-01

    Background Appropriate definition of neural network architecture prior to data analysis is crucial for successful data mining. This can be challenging when the underlying model of the data is unknown. The goal of this study was to determine whether optimizing neural network architecture using genetic programming as a machine learning strategy would improve the ability of neural networks to model and detect nonlinear interactions among genes in studies of common human diseases. Results Using simulated data, we show that a genetic programming optimized neural network approach is able to model gene-gene interactions as well as a traditional back propagation neural network. Furthermore, the genetic programming optimized neural network is better than the traditional back propagation neural network approach in terms of predictive ability and power to detect gene-gene interactions when non-functional polymorphisms are present. Conclusion This study suggests that a machine learning strategy for optimizing neural network architecture may be preferable to traditional trial-and-error approaches for the identification and characterization of gene-gene interactions in common, complex human diseases. PMID:12846935

  18. Deep Evolutionary Comparison of Gene Expression Identifies Parallel Recruitment of Trans-Factors in Two Independent Origins of C4 Photosynthesis

    PubMed Central

    Kümpers, Britta M. C.; Smith-Unna, Richard D.; Hibberd, Julian M.

    2014-01-01

    With at least 60 independent origins spanning monocotyledons and dicotyledons, the C4 photosynthetic pathway represents one of the most remarkable examples of convergent evolution. The recurrent evolution of this highly complex trait involving alterations to leaf anatomy, cell biology and biochemistry allows an increase in productivity by ∼50% in tropical and subtropical areas. The extent to which separate lineages of C4 plants use the same genetic networks to maintain C4 photosynthesis is unknown. We developed a new informatics framework to enable deep evolutionary comparison of gene expression in species lacking reference genomes. We exploited this to compare gene expression in species representing two independent C4 lineages (Cleome gynandra and Zea mays) whose last common ancestor diverged ∼140 million years ago. We define a cohort of 3,335 genes that represent conserved components of leaf and photosynthetic development in these species. Furthermore, we show that genes encoding proteins of the C4 cycle are recruited into networks defined by photosynthesis-related genes. Despite the wide evolutionary separation and independent origins of the C4 phenotype, we report that these species use homologous transcription factors to both induce C4 photosynthesis and to maintain the cell specific gene expression required for the pathway to operate. We define a core molecular signature associated with leaf and photosynthetic maturation that is likely shared by angiosperm species derived from the last common ancestor of the monocotyledons and dicotyledons. We show that deep evolutionary comparisons of gene expression can reveal novel insight into the molecular convergence of highly complex phenotypes and that parallel evolution of trans-factors underpins the repeated appearance of C4 photosynthesis. Thus, exploitation of extant natural variation associated with complex traits can be used to identify regulators. Moreover, the transcription factors that are shared by independent C4 lineages are key targets for engineering the C4 pathway into C3 crops such as rice. PMID:24901697

  19. Study of hepatitis B virus gene mutations with enzymatic colorimetry-based DNA microarray.

    PubMed

    Mao, Hailei; Wang, Huimin; Zhang, Donglei; Mao, Hongju; Zhao, Jianlong; Shi, Jian; Cui, Zhichu

    2006-01-01

    To establish a modified microarray method for detecting HBV gene mutations in the clinic. Site-specific oligonucleotide probes were immobilized to microarray slides and hybridized to biotin-labeled HBV gene fragments amplified from two-step PCR. Hybridized targets were transferred to nitrocellulose membranes, followed by intensity measurement using BCIP/NBT colorimetry. HBV genes from 99 Hepatitis B patients and 40 healthy blood donors were analyzed. Mutation frequencies of HBV pre-core/core and basic core promoter (BCP) regions were found to be significantly higher in the patient group (42%, 40% versus 2.5%, 5%, P < 0.01). Compared with a traditional fluorescence method, the colorimetry method exhibited the same level of sensitivity and reproducibility. An enzymatic colorimetry-based DNA microarray assay was successfully established to monitor HBV mutations. Pre-core/core and BCP mutations of HBV genes could be major causes of HBV infection in HBeAg-negative patients and could also be relevant to chronicity and aggravation of hepatitis B.

  20. Topology of genetic associations between regional gray matter volume and intellectual ability: Evidence for a high capacity network.

    PubMed

    Bohlken, Marc M; Brouwer, Rachel M; Mandl, René C W; Hedman, Anna M; van den Heuvel, Martijn P; van Haren, Neeltje E M; Kahn, René S; Hulshoff Pol, Hilleke E

    2016-01-01

    Intelligence is associated with a network of distributed gray matter areas including the frontal and parietal higher association cortices and primary processing areas of the temporal and occipital lobes. Efficient information transfer between gray matter regions implicated in intelligence is thought to be critical for this trait to emerge. Genetic factors implicated in intelligence and gray matter may promote a high capacity for information transfer. Whether these genetic factors act globally or on local gray matter areas separately is not known. Brain maps of phenotypic and genetic associations between gray matter volume and intelligence were made using structural equation modeling of 3T MRI T1-weighted scans acquired in 167 adult twins of the newly acquired U-TWIN cohort. Subsequently, structural connectivity analyses (DTI) were performed to test the hypothesis that gray matter regions associated with intellectual ability form a densely connected core. Gray matter regions associated with intellectual ability were situated in the right prefrontal, bilateral temporal, bilateral parietal, right occipital and subcortical regions. Regions implicated in intelligence had high structural connectivity density compared to 10,000 reference networks (p=0.031). The genetic association with intelligence was for 39% explained by a genetic source unique to these regions (independent of total brain volume), this source specifically implicated the right supramarginal gyrus. Using a twin design, we show that intelligence is genetically represented in a spatially distributed and densely connected network of gray matter regions providing a high capacity infrastructure. Although genes for intelligence have overlap with those for total brain volume, we present evidence that there are genes for intelligence that act specifically on the subset of brain areas that form an efficient brain network. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Systematic Evaluation of Molecular Networks for Discovery of Disease Genes. | Office of Cancer Genomics

    Cancer.gov

    Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall.

  2. SiBIC: a web server for generating gene set networks based on biclusters obtained by maximal frequent itemset mining.

    PubMed

    Takahashi, Kei-ichiro; Takigawa, Ichigaku; Mamitsuka, Hiroshi

    2013-01-01

    Detecting biclusters from expression data is useful, since biclusters are coexpressed genes under only part of all given experimental conditions. We present a software called SiBIC, which from a given expression dataset, first exhaustively enumerates biclusters, which are then merged into rather independent biclusters, which finally are used to generate gene set networks, in which a gene set assigned to one node has coexpressed genes. We evaluated each step of this procedure: 1) significance of the generated biclusters biologically and statistically, 2) biological quality of merged biclusters, and 3) biological significance of gene set networks. We emphasize that gene set networks, in which nodes are not genes but gene sets, can be more compact than usual gene networks, meaning that gene set networks are more comprehensible. SiBIC is available at http://utrecht.kuicr.kyoto-u.ac.jp:8080/miami/faces/index.jsp.

  3. Diurnal Transcriptome and Gene Network Represented through Sparse Modeling in Brachypodium distachyon.

    PubMed

    Koda, Satoru; Onda, Yoshihiko; Matsui, Hidetoshi; Takahagi, Kotaro; Yamaguchi-Uehara, Yukiko; Shimizu, Minami; Inoue, Komaki; Yoshida, Takuhiro; Sakurai, Tetsuya; Honda, Hiroshi; Eguchi, Shinto; Nishii, Ryuei; Mochida, Keiichi

    2017-01-01

    We report the comprehensive identification of periodic genes and their network inference, based on a gene co-expression analysis and an Auto-Regressive eXogenous (ARX) model with a group smoothly clipped absolute deviation (SCAD) method using a time-series transcriptome dataset in a model grass, Brachypodium distachyon . To reveal the diurnal changes in the transcriptome in B. distachyon , we performed RNA-seq analysis of its leaves sampled through a diurnal cycle of over 48 h at 4 h intervals using three biological replications, and identified 3,621 periodic genes through our wavelet analysis. The expression data are feasible to infer network sparsity based on ARX models. We found that genes involved in biological processes such as transcriptional regulation, protein degradation, and post-transcriptional modification and photosynthesis are significantly enriched in the periodic genes, suggesting that these processes might be regulated by circadian rhythm in B. distachyon . On the basis of the time-series expression patterns of the periodic genes, we constructed a chronological gene co-expression network and identified putative transcription factors encoding genes that might be involved in the time-specific regulatory transcriptional network. Moreover, we inferred a transcriptional network composed of the periodic genes in B. distachyon , aiming to identify genes associated with other genes through variable selection by grouping time points for each gene. Based on the ARX model with the group SCAD regularization using our time-series expression datasets of the periodic genes, we constructed gene networks and found that the networks represent typical scale-free structure. Our findings demonstrate that the diurnal changes in the transcriptome in B. distachyon leaves have a sparse network structure, demonstrating the spatiotemporal gene regulatory network over the cyclic phase transitions in B. distachyon diurnal growth.

  4. Identification of a core-periphery structure among participants of a business climate survey. An investigation based on the ZEW survey data

    NASA Astrophysics Data System (ADS)

    Stolzenburg, U.; Lux, T.

    2011-12-01

    Processes of social opinion formation might be dominated by a set of closely connected agents who constitute the cohesive `core' of a network and have a higher influence on the overall outcome of the process than those agents in the more sparsely connected `periphery'. Here we explore whether such a perspective could shed light on the dynamics of a well known economic sentiment index. To this end, we hypothesize that the respondents of the survey under investigation form a core-periphery network, and we identify those agents that define the core (in a discrete setting) or the proximity of each agent to the core (in a continuous setting). As it turns out, there is significant correlation between the so identified cores of different survey questions. Both the discrete and the continuous cores allow an almost perfect replication of the original series with a reduced data set of core members or weighted entries according to core proximity. Using a monthly time series on industrial production in Germany, we also compared experts' predictions with the real economic development. The core members identified in the discrete setting showed significantly better prediction capabilities than those agents assigned to the periphery of the network.

  5. Clock Genes and Altered Sleep–Wake Rhythms: Their Role in the Development of Psychiatric Disorders

    PubMed Central

    Charrier, Annaëlle; Olliac, Bertrand; Roubertoux, Pierre; Tordjman, Sylvie

    2017-01-01

    In mammals, the circadian clocks network (central and peripheral oscillators) controls circadian rhythms and orchestrates the expression of a range of downstream genes, allowing the organism to anticipate and adapt to environmental changes. Beyond their role in circadian rhythms, several studies have highlighted that circadian clock genes may have a more widespread physiological effect on cognition, mood, and reward-related behaviors. Furthermore, single nucleotide polymorphisms in core circadian clock genes have been associated with psychiatric disorders (such as autism spectrum disorder, schizophrenia, anxiety disorders, major depressive disorder, bipolar disorder, and attention deficit hyperactivity disorder). However, the underlying mechanisms of these associations remain to be ascertained and the cause–effect relationships are not clearly established. The objective of this article is to clarify the role of clock genes and altered sleep–wake rhythms in the development of psychiatric disorders (sleep problems are often observed at early onset of psychiatric disorders). First, the molecular mechanisms of circadian rhythms are described. Then, the relationships between disrupted circadian rhythms, including sleep–wake rhythms, and psychiatric disorders are discussed. Further research may open interesting perspectives with promising avenues for early detection and therapeutic intervention in psychiatric disorders. PMID:28468274

  6. Clock Genes and Altered Sleep-Wake Rhythms: Their Role in the Development of Psychiatric Disorders.

    PubMed

    Charrier, Annaëlle; Olliac, Bertrand; Roubertoux, Pierre; Tordjman, Sylvie

    2017-04-29

    In mammals, the circadian clocks network (central and peripheral oscillators) controls circadian rhythms and orchestrates the expression of a range of downstream genes, allowing the organism to anticipate and adapt to environmental changes. Beyond their role in circadian rhythms, several studies have highlighted that circadian clock genes may have a more widespread physiological effect on cognition, mood, and reward-related behaviors. Furthermore, single nucleotide polymorphisms in core circadian clock genes have been associated with psychiatric disorders (such as autism spectrum disorder, schizophrenia, anxiety disorders, major depressive disorder, bipolar disorder, and attention deficit hyperactivity disorder). However, the underlying mechanisms of these associations remain to be ascertained and the cause-effect relationships are not clearly established. The objective of this article is to clarify the role of clock genes and altered sleep-wake rhythms in the development of psychiatric disorders (sleep problems are often observed at early onset of psychiatric disorders). First, the molecular mechanisms of circadian rhythms are described. Then, the relationships between disrupted circadian rhythms, including sleep-wake rhythms, and psychiatric disorders are discussed. Further research may open interesting perspectives with promising avenues for early detection and therapeutic intervention in psychiatric disorders.

  7. Understanding the implementation of evidence-based care: a structural network approach.

    PubMed

    Parchman, Michael L; Scoglio, Caterina M; Schumm, Phillip

    2011-02-24

    Recent study of complex networks has yielded many new insights into phenomenon such as social networks, the internet, and sexually transmitted infections. The purpose of this analysis is to examine the properties of a network created by the 'co-care' of patients within one region of the Veterans Health Affairs. Data were obtained for all outpatient visits from 1 October 2006 to 30 September 2008 within one large Veterans Integrated Service Network. Types of physician within each clinic were nodes connected by shared patients, with a weighted link representing the number of shared patients between each connected pair. Network metrics calculated included edge weights, node degree, node strength, node coreness, and node betweenness. Log-log plots were used to examine the distribution of these metrics. Sizes of k-core networks were also computed under multiple conditions of node removal. There were 4,310,465 encounters by 266,710 shared patients between 722 provider types (nodes) across 41 stations or clinics resulting in 34,390 edges. The number of other nodes to which primary care provider nodes have a connection (172.7) is 42% greater than that of general surgeons and two and one-half times as high as cardiology. The log-log plot of the edge weight distribution appears to be linear in nature, revealing a 'scale-free' characteristic of the network, while the distributions of node degree and node strength are less so. The analysis of the k-core network sizes under increasing removal of primary care nodes shows that about 10 most connected primary care nodes play a critical role in keeping the k-core networks connected, because their removal disintegrates the highest k-core network. Delivery of healthcare in a large healthcare system such as that of the US Department of Veterans Affairs (VA) can be represented as a complex network. This network consists of highly connected provider nodes that serve as 'hubs' within the network, and demonstrates some 'scale-free' properties. By using currently available tools to explore its topology, we can explore how the underlying connectivity of such a system affects the behavior of providers, and perhaps leverage that understanding to improve quality and outcomes of care.

  8. Degrees of separation as a statistical tool for evaluating candidate genes.

    PubMed

    Nelson, Ronald M; Pettersson, Mats E

    2014-12-01

    Selection of candidate genes is an important step in the exploration of complex genetic architecture. The number of gene networks available is increasing and these can provide information to help with candidate gene selection. It is currently common to use the degree of connectedness in gene networks as validation in Genome Wide Association (GWA) and Quantitative Trait Locus (QTL) mapping studies. However, it can cause misleading results if not validated properly. Here we present a method and tool for validating the gene pairs from GWA studies given the context of the network they co-occur in. It ensures that proposed interactions and gene associations are not statistical artefacts inherent to the specific gene network architecture. The CandidateBacon package provides an easy and efficient method to calculate the average degree of separation (DoS) between pairs of genes to currently available gene networks. We show how these empirical estimates of average connectedness are used to validate candidate gene pairs. Validation of interacting genes by comparing their connectedness with the average connectedness in the gene network will provide support for said interactions by utilising the growing amount of gene network information available. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. The Matchmaker Exchange: a platform for rare disease gene discovery.

    PubMed

    Philippakis, Anthony A; Azzariti, Danielle R; Beltran, Sergi; Brookes, Anthony J; Brownstein, Catherine A; Brudno, Michael; Brunner, Han G; Buske, Orion J; Carey, Knox; Doll, Cassie; Dumitriu, Sergiu; Dyke, Stephanie O M; den Dunnen, Johan T; Firth, Helen V; Gibbs, Richard A; Girdea, Marta; Gonzalez, Michael; Haendel, Melissa A; Hamosh, Ada; Holm, Ingrid A; Huang, Lijia; Hurles, Matthew E; Hutton, Ben; Krier, Joel B; Misyura, Andriy; Mungall, Christopher J; Paschall, Justin; Paten, Benedict; Robinson, Peter N; Schiettecatte, François; Sobreira, Nara L; Swaminathan, Ganesh J; Taschner, Peter E; Terry, Sharon F; Washington, Nicole L; Züchner, Stephan; Boycott, Kym M; Rehm, Heidi L

    2015-10-01

    There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. However, serendipity has never proven to be a reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. Three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow. © 2015 WILEY PERIODICALS, INC.

  10. The hnRNP-Htt axis regulates necrotic cell death induced by transcriptional repression through impaired RNA splicing.

    PubMed

    Mao, Y; Tamura, T; Yuki, Y; Abe, D; Tamada, Y; Imoto, S; Tanaka, H; Homma, H; Tagawa, K; Miyano, S; Okazawa, H

    2016-04-28

    In this study, we identify signaling network of necrotic cell death induced by transcriptional repression (TRIAD) by α-amanitin (AMA), the selective RNA polymerase II inhibitor, as a model of neurodegenerative cell death. We performed genetic screen of a knockdown (KD) fly library by measuring the ratio of transformation from pupa to larva (PL ratio) under TRIAD, and selected the cell death-promoting genes. Systems biology analysis of the positive genes mapped on protein-protein interaction databases predicted the signaling network of TRIAD and the core pathway including heterogeneous nuclear ribonucleoproteins (hnRNPs) and huntingtin (Htt). RNA sequencing revealed that AMA impaired transcription and RNA splicing of Htt, which is known as an endoplasmic reticulum (ER)-stabilizing molecule. The impairment in RNA splicing and PL ratio was rescued by overexpresion of hnRNP that had been also affected by transcriptional repression. Fly genetics with suppressor or expresser of Htt and hnRNP worsened or ameliorated the decreased PL ratio by AMA, respectively. Collectively, these results suggested involvement of RNA splicing and a regulatory role of the hnRNP-Htt axis in the process of the transcriptional repression-induced necrosis.

  11. The Matchmaker Exchange: A Platform for Rare Disease Gene Discovery

    PubMed Central

    Philippakis, Anthony A.; Azzariti, Danielle R.; Beltran, Sergi; Brookes, Anthony J.; Brownstein, Catherine A.; Brudno, Michael; Brunner, Han G.; Buske, Orion J.; Carey, Knox; Doll, Cassie; Dumitriu, Sergiu; Dyke, Stephanie O.M.; den Dunnen, Johan T.; Firth, Helen V.; Gibbs, Richard A.; Girdea, Marta; Gonzalez, Michael; Haendel, Melissa A.; Hamosh, Ada; Holm, Ingrid A.; Huang, Lijia; Hurles, Matthew E.; Hutton, Ben; Krier, Joel B.; Misyura, Andriy; Mungall, Christopher J.; Paschall, Justin; Paten, Benedict; Robinson, Peter N.; Schiettecatte, François; Sobreira, Nara L.; Swaminathan, Ganesh J.; Taschner, Peter E.; Terry, Sharon F.; Washington, Nicole L.; Züchner, Stephan; Boycott, Kym M.; Rehm, Heidi L.

    2015-01-01

    There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for “the needle in a haystack” to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can “match” these cases to build evidence for causality. However, serendipity has never proven to be a reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. Three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow. PMID:26295439

  12. The Matchmaker Exchange: A Platform for Rare Disease Gene Discovery

    DOE PAGES

    Philippakis, Anthony A.; Azzariti, Danielle R.; Beltran, Sergi; ...

    2015-09-17

    There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. However, serendipity has never proven to be amore » reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. In conclusion, three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow.« less

  13. The Matchmaker Exchange: A Platform for Rare Disease Gene Discovery

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Philippakis, Anthony A.; Azzariti, Danielle R.; Beltran, Sergi

    There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. However, serendipity has never proven to be amore » reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. In conclusion, three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow.« less

  14. Endophenotype Network Models: Common Core of Complex Diseases

    PubMed Central

    Ghiassian, Susan Dina; Menche, Jörg; Chasman, Daniel I.; Giulianini, Franco; Wang, Ruisheng; Ricchiuto, Piero; Aikawa, Masanori; Iwata, Hiroshi; Müller, Christian; Zeller, Tania; Sharma, Amitabh; Wild, Philipp; Lackner, Karl; Singh, Sasha; Ridker, Paul M.; Blankenberg, Stefan; Barabási, Albert-László; Loscalzo, Joseph

    2016-01-01

    Historically, human diseases have been differentiated and categorized based on the organ system in which they primarily manifest. Recently, an alternative view is emerging that emphasizes that different diseases often have common underlying mechanisms and shared intermediate pathophenotypes, or endo(pheno)types. Within this framework, a specific disease’s expression is a consequence of the interplay between the relevant endophenotypes and their local, organ-based environment. Important examples of such endophenotypes are inflammation, fibrosis, and thrombosis and their essential roles in many developing diseases. In this study, we construct endophenotype network models and explore their relation to different diseases in general and to cardiovascular diseases in particular. We identify the local neighborhoods (module) within the interconnected map of molecular components, i.e., the subnetworks of the human interactome that represent the inflammasome, thrombosome, and fibrosome. We find that these neighborhoods are highly overlapping and significantly enriched with disease-associated genes. In particular they are also enriched with differentially expressed genes linked to cardiovascular disease (risk). Finally, using proteomic data, we explore how macrophage activation contributes to our understanding of inflammatory processes and responses. The results of our analysis show that inflammatory responses initiate from within the cross-talk of the three identified endophenotypic modules. PMID:27278246

  15. Endophenotype Network Models: Common Core of Complex Diseases

    NASA Astrophysics Data System (ADS)

    Ghiassian, Susan Dina; Menche, Jörg; Chasman, Daniel I.; Giulianini, Franco; Wang, Ruisheng; Ricchiuto, Piero; Aikawa, Masanori; Iwata, Hiroshi; Müller, Christian; Zeller, Tania; Sharma, Amitabh; Wild, Philipp; Lackner, Karl; Singh, Sasha; Ridker, Paul M.; Blankenberg, Stefan; Barabási, Albert-László; Loscalzo, Joseph

    2016-06-01

    Historically, human diseases have been differentiated and categorized based on the organ system in which they primarily manifest. Recently, an alternative view is emerging that emphasizes that different diseases often have common underlying mechanisms and shared intermediate pathophenotypes, or endo(pheno)types. Within this framework, a specific disease’s expression is a consequence of the interplay between the relevant endophenotypes and their local, organ-based environment. Important examples of such endophenotypes are inflammation, fibrosis, and thrombosis and their essential roles in many developing diseases. In this study, we construct endophenotype network models and explore their relation to different diseases in general and to cardiovascular diseases in particular. We identify the local neighborhoods (module) within the interconnected map of molecular components, i.e., the subnetworks of the human interactome that represent the inflammasome, thrombosome, and fibrosome. We find that these neighborhoods are highly overlapping and significantly enriched with disease-associated genes. In particular they are also enriched with differentially expressed genes linked to cardiovascular disease (risk). Finally, using proteomic data, we explore how macrophage activation contributes to our understanding of inflammatory processes and responses. The results of our analysis show that inflammatory responses initiate from within the cross-talk of the three identified endophenotypic modules.

  16. A network pharmacology approach to determine active ingredients and rationality of herb combinations of Modified-Simiaowan for treatment of gout.

    PubMed

    Zhao, Fangli; Guochun, Li; Yang, Yanhua; Shi, Le; Xu, Li; Yin, Lian

    2015-06-20

    Modified Simiaowan (MSW) is a traditional Chinese medicine (TCM) formula and is widely used as a clinically medication formula for its efficiency in treating gouty diseases.To predict the active ingredients in MSW and uncover the rationality of herb combinations of MSW. Three drug-target networks including the "candidate ingredient-target network" (cI-cT) that links the candidate ingredients and targets, the "core ingredient-target-pathway network" connecting core potential ingredients and targets through related pathways, and the "rationality of herb combinations of MSW network", which was derived from the cI-cT network, were developed to dissect the active ingredients in MSW and relationship between ingredients in herb combinations and their targets for gouty diseases. On the other hand, herbal ingredients comparisons were also conducted based on six physicochemical properties to investigate whether the herbs in MSW are similar in chemicals. Moreover, HUVEC viability and expression levels of ICAM-1 induced by monosodium urate (MSU) crystals were assessed to determine the activities of potential ingredients in MSW. Predicted by the core ingredient-target-pathway network, we collected 30 core ingredients in MSW and 25 inflammatory cytokines and uric acid synthetase or transporters, which are effective for gouty treatment through some related pathways. Experimental results also confirmed that those core ingredients could significantly increase HUVEC viability and attenuate the expression of ICAM-1, which supported the effectiveness of MSW in treating gouty diseases. Moreover, heat-clearing and dampness-eliminating herbs in MSW have similar physicochemical properties, which stimulate all the inflammatory and uric acid-lowing targets respectively, while the core drug and basic prescription in MSW stimulate the major and almost all the core targets, respectively. Our work successfully predicts the active ingredients in MSW and explains the cooperation between these ingredients and corresponding targets through related pathways for gouty diseases, and provides basis for an alternative approach to investigate the rationality of herb combinations of MSW on the network pharmacology level, which might be beneficial to drug development and applications. Copyright © 2015. Published by Elsevier Ireland Ltd.

  17. Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases.

    PubMed

    Mezlini, Aziz M; Goldenberg, Anna

    2017-10-01

    Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conflux, to find disease associated genes from exome sequencing data using networks as a prior. There are two main advantages to using networks within a probabilistic graphical model. First, networks are noisy and incomplete, a substantial impediment to gene discovery. Incorporating networks into the structure of a probabilistic models for gene inference has less impact on the solution than relying on the noisy network structure directly. Second, using a Bayesian framework we can keep track of the uncertainty of each gene being associated with the phenotype rather than returning a fixed list of genes. We first show that using networks clearly improves gene detection compared to individual gene testing. We then show consistently improved performance of Conflux compared to the state-of-the-art diffusion network-based method Hotnet2 and a variety of other network and variant aggregation methods, using randomly generated and literature-reported gene sets. We test Hotnet2 and Conflux on several network configurations to reveal biases and patterns of false positives and false negatives in each case. Our experiments show that our novel Bayesian framework Conflux incorporates many of the advantages of the current state-of-the-art methods, while offering more flexibility and improved power in many gene-disease association scenarios.

  18. Discovering disease-associated genes in weighted protein-protein interaction networks

    NASA Astrophysics Data System (ADS)

    Cui, Ying; Cai, Meng; Stanley, H. Eugene

    2018-04-01

    Although there have been many network-based attempts to discover disease-associated genes, most of them have not taken edge weight - which quantifies their relative strength - into consideration. We use connection weights in a protein-protein interaction (PPI) network to locate disease-related genes. We analyze the topological properties of both weighted and unweighted PPI networks and design an improved random forest classifier to distinguish disease genes from non-disease genes. We use a cross-validation test to confirm that weighted networks are better able to discover disease-associated genes than unweighted networks, which indicates that including link weight in the analysis of network properties provides a better model of complex genotype-phenotype associations.

  19. The transfer and transformation of collective network information in gene-matched networks.

    PubMed

    Kitsukawa, Takashi; Yagi, Takeshi

    2015-10-09

    Networks, such as the human society network, social and professional networks, and biological system networks, contain vast amounts of information. Information signals in networks are distributed over nodes and transmitted through intricately wired links, making the transfer and transformation of such information difficult to follow. Here we introduce a novel method for describing network information and its transfer using a model network, the Gene-matched network (GMN), in which nodes (neurons) possess attributes (genes). In the GMN, nodes are connected according to their expression of common genes. Because neurons have multiple genes, the GMN is cluster-rich. We show that, in the GMN, information transfer and transformation were controlled systematically, according to the activity level of the network. Furthermore, information transfer and transformation could be traced numerically with a vector using genes expressed in the activated neurons, the active-gene array, which was used to assess the relative activity among overlapping neuronal groups. Interestingly, this coding style closely resembles the cell-assembly neural coding theory. The method introduced here could be applied to many real-world networks, since many systems, including human society and various biological systems, can be represented as a network of this type.

  20. Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks.

    PubMed

    Blatti, Charles; Sinha, Saurabh

    2016-07-15

    Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or 'properties' such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the properties or the genes themselves. It is believed that known biological relationships among genes and their many properties may be exploited to more accurately reveal commonalities of a gene set. Previous work has sought to achieve this by building biological networks that combine multiple types of gene-gene or gene-property relationships, and performing network analysis to identify other genes and properties most relevant to a given gene set. Most existing network-based approaches for recognizing genes or annotations relevant to a given gene set collapse information about different properties to simplify (homogenize) the networks. We present a network-based method for ranking genes or properties related to a given gene set. Such related genes or properties are identified from among the nodes of a large, heterogeneous network of biological information. Our method involves a random walk with restarts, performed on an initial network with multiple node and edge types that preserve more of the original, specific property information than current methods that operate on homogeneous networks. In this first stage of our algorithm, we find the properties that are the most relevant to the given gene set and extract a subnetwork of the original network, comprising only these relevant properties. We then re-rank genes by their similarity to the given gene set, based on a second random walk with restarts, performed on the above subnetwork. We demonstrate the effectiveness of this algorithm for ranking genes related to Drosophila embryonic development and aggressive responses in the brains of social animals. DRaWR was implemented as an R package available at veda.cs.illinois.edu/DRaWR. blatti@illinois.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  1. Shell-corona microgels from double interpenetrating networks.

    PubMed

    Rudyak, Vladimir Yu; Gavrilov, Alexey A; Kozhunova, Elena Yu; Chertovich, Alexander V

    2018-04-18

    Polymer microgels with a dense outer shell offer outstanding features as universal carriers for different guest molecules. In this paper, microgels formed by an interpenetrating network comprised of collapsed and swollen subnetworks are investigated using dissipative particle dynamics (DPD) computer simulations, and it is found that such systems can form classical core-corona structures, shell-corona structures, and core-shell-corona structures, depending on the subchain length and molecular mass of the system. The core-corona structures consisting of a dense core and soft corona are formed at small microgel sizes when the subnetworks are able to effectively separate in space. The most interesting shell-corona structures consist of a soft cavity in a dense shell surrounded with a loose corona, and are found at intermediate gel sizes; the area of their existence depends on the subchain length and the corresponding mesh size. At larger molecular masses the collapsing network forms additional cores inside the soft cavity, leading to the core-shell-corona structure.

  2. Clinical features and ryanodine receptor type 1 gene mutation analysis in a Chinese family with central core disease.

    PubMed

    Chang, Xingzhi; Jin, Yiwen; Zhao, Haijuan; Huang, Qionghui; Wang, Jingmin; Yuan, Yun; Han, Ying; Qin, Jiong

    2013-03-01

    Central core disease is a rare inherited neuromuscular disorder caused by mutations in ryanodine receptor type 1 gene. The clinical phenotype of the disease is highly variable. We report a Chinese pedigree with central core disease confirmed by the gene sequencing. All 3 patients in the family presented with mild proximal limb weakness. The serum level of creatine kinase was normal, and electromyography suggested myogenic changes. The histologic analysis of muscle biopsy showed identical central core lesions in almost all of the muscle fibers in the index case. Exon 90-106 in the C-terminal domain of the ryanodine receptor type 1 gene was amplified using polymerase chain reaction. One heterozygous missense mutation G14678A (Arg4893Gln) in exon 102 was identified in all 3 patients. This is the first report of a familial case of central core disease confirmed by molecular study in mainland China.

  3. The Orphan Disease Networks

    PubMed Central

    Zhang, Minlu; Zhu, Cheng; Jacomy, Alexis; Lu, Long J.; Jegga, Anil G.

    2011-01-01

    The low prevalence rate of orphan diseases (OD) requires special combined efforts to improve diagnosis, prevention, and discovery of novel therapeutic strategies. To identify and investigate relationships based on shared genes or shared functional features, we have conducted a bioinformatic-based global analysis of all orphan diseases with known disease-causing mutant genes. Starting with a bipartite network of known OD and OD-causing mutant genes and using the human protein interactome, we first construct and topologically analyze three networks: the orphan disease network, the orphan disease-causing mutant gene network, and the orphan disease-causing mutant gene interactome. Our results demonstrate that in contrast to the common disease-causing mutant genes that are predominantly nonessential, a majority of orphan disease-causing mutant genes are essential. In confirmation of this finding, we found that OD-causing mutant genes are topologically important in the protein interactome and are ubiquitously expressed. Additionally, functional enrichment analysis of those genes in which mutations cause ODs shows that a majority result in premature death or are lethal in the orthologous mouse gene knockout models. To address the limitations of traditional gene-based disease networks, we also construct and analyze OD networks on the basis of shared enriched features (biological processes, cellular components, pathways, phenotypes, and literature citations). Analyzing these functionally-linked OD networks, we identified several additional OD-OD relations that are both phenotypically similar and phenotypically diverse. Surprisingly, we observed that the wiring of the gene-based and other feature-based OD networks are largely different; this suggests that the relationship between ODs cannot be fully captured by the gene-based network alone. PMID:21664998

  4. Reconstructing directed gene regulatory network by only gene expression data.

    PubMed

    Zhang, Lu; Feng, Xi Kang; Ng, Yen Kaow; Li, Shuai Cheng

    2016-08-18

    Accurately identifying gene regulatory network is an important task in understanding in vivo biological activities. The inference of such networks is often accomplished through the use of gene expression data. Many methods have been developed to evaluate gene expression dependencies between transcription factor and its target genes, and some methods also eliminate transitive interactions. The regulatory (or edge) direction is undetermined if the target gene is also a transcription factor. Some methods predict the regulatory directions in the gene regulatory networks by locating the eQTL single nucleotide polymorphism, or by observing the gene expression changes when knocking out/down the candidate transcript factors; regrettably, these additional data are usually unavailable, especially for the samples deriving from human tissues. In this study, we propose the Context Based Dependency Network (CBDN), a method that is able to infer gene regulatory networks with the regulatory directions from gene expression data only. To determine the regulatory direction, CBDN computes the influence of source to target by evaluating the magnitude changes of expression dependencies between the target gene and the others with conditioning on the source gene. CBDN extends the data processing inequality by involving the dependency direction to distinguish between direct and transitive relationship between genes. We also define two types of important regulators which can influence a majority of the genes in the network directly or indirectly. CBDN can detect both of these two types of important regulators by averaging the influence functions of candidate regulator to the other genes. In our experiments with simulated and real data, even with the regulatory direction taken into account, CBDN outperforms the state-of-the-art approaches for inferring gene regulatory network. CBDN identifies the important regulators in the predicted network: 1. TYROBP influences a batch of genes that are related to Alzheimer's disease; 2. ZNF329 and RB1 significantly regulate those 'mesenchymal' gene expression signature genes for brain tumors. By merely leveraging gene expression data, CBDN can efficiently infer the existence of gene-gene interactions as well as their regulatory directions. The constructed networks are helpful in the identification of important regulators for complex diseases.

  5. The Streptococcus pyogenes serotype M49 Nra-Ralp3 transcriptional regulatory network and its control of virulence factor expression from the novel eno ralp3 epf sagA pathogenicity region.

    PubMed

    Kreikemeyer, Bernd; Nakata, Masanobu; Köller, Thomas; Hildisch, Hendrikje; Kourakos, Vassilios; Standar, Kerstin; Kawabata, Shigetada; Glocker, Michael O; Podbielski, Andreas

    2007-12-01

    Many Streptococcus pyogenes (group A streptococcus [GAS]) virulence factor- and transcriptional regulator-encoding genes cluster together in discrete genomic regions. Nra is a central regulator of the FCT region. Previous studies exclusively described Nra as a transcriptional repressor of adhesin and toxin genes. Here transcriptome and proteome analysis of a serotype M49 GAS strain and an isogenic Nra mutant of this strain revealed the complete Nra regulon profile. Nra is active in all growth phases tested, with the largest regulon in the transition phase. Almost exclusively, virulence factor-encoding genes are repressed by Nra; these genes include the GAS pilus operon, the capsule synthesis operon, the cytolysin-mediated translocation system genes, all Mga region core virulence genes, and genes encoding other regulators, like the Ihk/Irr system, Rgg, and two additional RofA-like protein family regulators. Surprisingly, our experiments revealed that Nra additionally acts as a positive regulator, mostly for genes encoding proteins and enzymes with metabolic functions. Epidemiological investigations revealed strong genetic linkage of one particular Nra-repressed regulator, Ralp3 (SPy0735), with a gene encoding Epf (extracellular protein factor from Streptococcus suis). In a serotype-specific fashion, this ralp3 epf gene block is integrated, most likely via transposition, into the eno sagA virulence gene block, which is present in all GAS serotypes. In GAS serotypes M1, M4, M12, M28, and M49 this novel discrete genetic region is therefore designated the eno ralp3 epf sagA (ERES) pathogenicity region. Functional experiments showed that Epf is a novel GAS plasminogen-binding protein and revealed that Ralp3 activity counteracts Nra and MsmR regulatory activity. In addition to the Mga and FCT regions, the ERES region is the third discrete chromosomal pathogenicity region. All of these regions are transcriptionally linked, adding another level of complexity to the known GAS growth phase-dependent regulatory network.

  6. The Streptococcus pyogenes Serotype M49 Nra-Ralp3 Transcriptional Regulatory Network and Its Control of Virulence Factor Expression from the Novel eno ralp3 epf sagA Pathogenicity Region▿ †

    PubMed Central

    Kreikemeyer, Bernd; Nakata, Masanobu; Köller, Thomas; Hildisch, Hendrikje; Kourakos, Vassilios; Standar, Kerstin; Kawabata, Shigetada; Glocker, Michael O.; Podbielski, Andreas

    2007-01-01

    Many Streptococcus pyogenes (group A streptococcus [GAS]) virulence factor- and transcriptional regulator-encoding genes cluster together in discrete genomic regions. Nra is a central regulator of the FCT region. Previous studies exclusively described Nra as a transcriptional repressor of adhesin and toxin genes. Here transcriptome and proteome analysis of a serotype M49 GAS strain and an isogenic Nra mutant of this strain revealed the complete Nra regulon profile. Nra is active in all growth phases tested, with the largest regulon in the transition phase. Almost exclusively, virulence factor-encoding genes are repressed by Nra; these genes include the GAS pilus operon, the capsule synthesis operon, the cytolysin-mediated translocation system genes, all Mga region core virulence genes, and genes encoding other regulators, like the Ihk/Irr system, Rgg, and two additional RofA-like protein family regulators. Surprisingly, our experiments revealed that Nra additionally acts as a positive regulator, mostly for genes encoding proteins and enzymes with metabolic functions. Epidemiological investigations revealed strong genetic linkage of one particular Nra-repressed regulator, Ralp3 (SPy0735), with a gene encoding Epf (extracellular protein factor from Streptococcus suis). In a serotype-specific fashion, this ralp3 epf gene block is integrated, most likely via transposition, into the eno sagA virulence gene block, which is present in all GAS serotypes. In GAS serotypes M1, M4, M12, M28, and M49 this novel discrete genetic region is therefore designated the eno ralp3 epf sagA (ERES) pathogenicity region. Functional experiments showed that Epf is a novel GAS plasminogen-binding protein and revealed that Ralp3 activity counteracts Nra and MsmR regulatory activity. In addition to the Mga and FCT regions, the ERES region is the third discrete chromosomal pathogenicity region. All of these regions are transcriptionally linked, adding another level of complexity to the known GAS growth phase-dependent regulatory network. PMID:17893125

  7. Meta-analysis of gene expression patterns in animal models of prenatal alcohol exposure suggests role for protein synthesis inhibition and chromatin remodeling

    PubMed Central

    Rogic, Sanja; Wong, Albertina; Pavlidis, Paul

    2017-01-01

    Background Prenatal alcohol exposure (PAE) can result in an array of morphological, behavioural and neurobiological deficits that can range in their severity. Despite extensive research in the field and a significant progress made, especially in understanding the range of possible malformations and neurobehavioral abnormalities, the molecular mechanisms of alcohol responses in development are still not well understood. There have been multiple transcriptomic studies looking at the changes in gene expression after PAE in animal models, however there is a limited apparent consensus among the reported findings. In an effort to address this issue, we performed a comprehensive re-analysis and meta-analysis of all suitable, publically available expression data sets. Methods We assembled ten microarray data sets of gene expression after PAE in mouse and rat models consisting of samples from a total of 63 ethanol-exposed and 80 control animals. We re-analyzed each data set for differential expression and then used the results to perform meta-analyses considering all data sets together or grouping them by time or duration of exposure (pre- and post-natal, acute and chronic, respectively). We performed network and Gene Ontology enrichment analysis to further characterize the identified signatures. Results For each sub-analysis we identified signatures of differential expressed genes that show support from multiple studies. Overall, the changes in gene expression were more extensive after acute ethanol treatment during prenatal development than in other models. Considering the analysis of all the data together, we identified a robust core signature of 104 genes down-regulated after PAE, with no up-regulated genes. Functional analysis reveals over-representation of genes involved in protein synthesis, mRNA splicing and chromatin organization. Conclusions Our meta-analysis shows that existing studies, despite superficial dissimilarity in findings, share features that allow us to identify a common core signature set of transcriptome changes in PAE. This is an important step to identifying the biological processes that underlie the etiology of FASD. PMID:26996386

  8. Reconstructing genome-wide regulatory network of E. coli using transcriptome data and predicted transcription factor activities

    PubMed Central

    2011-01-01

    Background Gene regulatory networks play essential roles in living organisms to control growth, keep internal metabolism running and respond to external environmental changes. Understanding the connections and the activity levels of regulators is important for the research of gene regulatory networks. While relevance score based algorithms that reconstruct gene regulatory networks from transcriptome data can infer genome-wide gene regulatory networks, they are unfortunately prone to false positive results. Transcription factor activities (TFAs) quantitatively reflect the ability of the transcription factor to regulate target genes. However, classic relevance score based gene regulatory network reconstruction algorithms use models do not include the TFA layer, thus missing a key regulatory element. Results This work integrates TFA prediction algorithms with relevance score based network reconstruction algorithms to reconstruct gene regulatory networks with improved accuracy over classic relevance score based algorithms. This method is called Gene expression and Transcription factor activity based Relevance Network (GTRNetwork). Different combinations of TFA prediction algorithms and relevance score functions have been applied to find the most efficient combination. When the integrated GTRNetwork method was applied to E. coli data, the reconstructed genome-wide gene regulatory network predicted 381 new regulatory links. This reconstructed gene regulatory network including the predicted new regulatory links show promising biological significances. Many of the new links are verified by known TF binding site information, and many other links can be verified from the literature and databases such as EcoCyc. The reconstructed gene regulatory network is applied to a recent transcriptome analysis of E. coli during isobutanol stress. In addition to the 16 significantly changed TFAs detected in the original paper, another 7 significantly changed TFAs have been detected by using our reconstructed network. Conclusions The GTRNetwork algorithm introduces the hidden layer TFA into classic relevance score-based gene regulatory network reconstruction processes. Integrating the TFA biological information with regulatory network reconstruction algorithms significantly improves both detection of new links and reduces that rate of false positives. The application of GTRNetwork on E. coli gene transcriptome data gives a set of potential regulatory links with promising biological significance for isobutanol stress and other conditions. PMID:21668997

  9. Orthoscape: a cytoscape application for grouping and visualization KEGG based gene networks by taxonomy and homology principles.

    PubMed

    Mustafin, Zakhar Sergeevich; Lashin, Sergey Alexandrovich; Matushkin, Yury Georgievich; Gunbin, Konstantin Vladimirovich; Afonnikov, Dmitry Arkadievich

    2017-01-27

    There are many available software tools for visualization and analysis of biological networks. Among them, Cytoscape ( http://cytoscape.org/ ) is one of the most comprehensive packages, with many plugins and applications which extends its functionality by providing analysis of protein-protein interaction, gene regulatory and gene co-expression networks, metabolic, signaling, neural as well as ecological-type networks including food webs, communities networks etc. Nevertheless, only three plugins tagged 'network evolution' found in Cytoscape official app store and in literature. We have developed a new Cytoscape 3.0 application Orthoscape aimed to facilitate evolutionary analysis of gene networks and visualize the results. Orthoscape aids in analysis of evolutionary information available for gene sets and networks by highlighting: (1) the orthology relationships between genes; (2) the evolutionary origin of gene network components; (3) the evolutionary pressure mode (diversifying or stabilizing, negative or positive selection) of orthologous groups in general and/or branch-oriented mode. The distinctive feature of Orthoscape is the ability to control all data analysis steps via user-friendly interface. Orthoscape allows its users to analyze gene networks or separated gene sets in the context of evolution. At each step of data analysis, Orthoscape also provides for convenient visualization and data manipulation.

  10. Network planning study of the metro-optical-network-oriented 3G application

    NASA Astrophysics Data System (ADS)

    Gong, Qian; Xu, Rong; Lin, Jin Tong

    2005-02-01

    To compare with the 2G mobile communication, 3G technologies can supply the perfect service scope and performance. 3G is the trend of the mobile communication. So now to build the transmission network, it is needed to consider how the transmission network to support the 3G applications. For the 3G network architecture, it include the 2 part: Utran access network and core network. So the metro optical network should consider how to build the network to adapt the 3G applications. Include the metro core and access layer. In the metro core, we should consider the network should evolved towards the Mesh architecture with ASON function to realize the fast protection and restoration, quick end-to-end service provision, and high capacity cross-connect matrix etc. In the access layer, the network should have the ability to access the 3G services such as ATM interface with IMA function. In addition, the traffic grooming should be provided to improve the bandwidth utility. In this paper, first we present the MCC network situation, the network planning model will be introduced. Then we present the topology architecture, node capacity and traffic forecast. At last, based on our analysis, we will give a total solution to MCC to build their metro optical network toward to the mesh network with the consideration of 3G services.

  11. Using protein-protein interactions for refining gene networks estimated from microarray data by Bayesian networks.

    PubMed

    Nariai, N; Kim, S; Imoto, S; Miyano, S

    2004-01-01

    We propose a statistical method to estimate gene networks from DNA microarray data and protein-protein interactions. Because physical interactions between proteins or multiprotein complexes are likely to regulate biological processes, using only mRNA expression data is not sufficient for estimating a gene network accurately. Our method adds knowledge about protein-protein interactions to the estimation method of gene networks under a Bayesian statistical framework. In the estimated gene network, a protein complex is modeled as a virtual node based on principal component analysis. We show the effectiveness of the proposed method through the analysis of Saccharomyces cerevisiae cell cycle data. The proposed method improves the accuracy of the estimated gene networks, and successfully identifies some biological facts.

  12. Early neuroimaging markers of FOXP2 intragenic deletion.

    PubMed

    Liégeois, Frédérique J; Hildebrand, Michael S; Bonthrone, Alexandra; Turner, Samantha J; Scheffer, Ingrid E; Bahlo, Melanie; Connelly, Alan; Morgan, Angela T

    2016-10-13

    FOXP2 is the major gene associated with severe, persistent, developmental speech and language disorders. While studies in the original family in which a FOXP2 mutation was found showed volume reduction and reduced activation in core language and speech networks, there have been no imaging studies of different FOXP2 mutations. We conducted a multimodal MRI study in an eight-year-old boy (A-II) with a de novo FOXP2 intragenic deletion. A-II showed marked bilateral volume reductions in the hippocampus, thalamus, globus pallidus, and caudate nucleus compared with 26 control males (effect sizes from -1 to -3). He showed no detectable functional MRI activity when repeating nonsense words. The hippocampus is implicated for the first time in FOXP2 diseases. We conclude that FOXP2 anomaly is either directly or indirectly associated with atypical development of widespread subcortical networks early in life.

  13. Genes under weaker stabilizing selection increase network evolvability and rapid regulatory adaptation to an environmental shift.

    PubMed

    Laarits, T; Bordalo, P; Lemos, B

    2016-08-01

    Regulatory networks play a central role in the modulation of gene expression, the control of cellular differentiation, and the emergence of complex phenotypes. Regulatory networks could constrain or facilitate evolutionary adaptation in gene expression levels. Here, we model the adaptation of regulatory networks and gene expression levels to a shift in the environment that alters the optimal expression level of a single gene. Our analyses show signatures of natural selection on regulatory networks that both constrain and facilitate rapid evolution of gene expression level towards new optima. The analyses are interpreted from the standpoint of neutral expectations and illustrate the challenge to making inferences about network adaptation. Furthermore, we examine the consequence of variable stabilizing selection across genes on the strength and direction of interactions in regulatory networks and in their subsequent adaptation. We observe that directional selection on a highly constrained gene previously under strong stabilizing selection was more efficient when the gene was embedded within a network of partners under relaxed stabilizing selection pressure. The observation leads to the expectation that evolutionarily resilient regulatory networks will contain optimal ratios of genes whose expression is under weak and strong stabilizing selection. Altogether, our results suggest that the variable strengths of stabilizing selection across genes within regulatory networks might itself contribute to the long-term adaptation of complex phenotypes. © 2016 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2016 European Society For Evolutionary Biology.

  14. Integration of a splicing regulatory network within the meiotic gene expression program of Saccharomyces cerevisiae

    PubMed Central

    Munding, Elizabeth M.; Igel, A. Haller; Shiue, Lily; Dorighi, Kristel M.; Treviño, Lisa R.; Ares, Manuel

    2010-01-01

    Splicing regulatory networks are essential components of eukaryotic gene expression programs, yet little is known about how they are integrated with transcriptional regulatory networks into coherent gene expression programs. Here we define the MER1 splicing regulatory network and examine its role in the gene expression program during meiosis in budding yeast. Mer1p splicing factor promotes splicing of just four pre-mRNAs. All four Mer1p-responsive genes also require Nam8p for splicing activation by Mer1p; however, other genes require Nam8p but not Mer1p, exposing an overlapping meiotic splicing network controlled by Nam8p. MER1 mRNA and three of the four Mer1p substrate pre-mRNAs are induced by the transcriptional regulator Ume6p. This unusual arrangement delays expression of Mer1p-responsive genes relative to other genes under Ume6p control. Products of Mer1p-responsive genes are required for initiating and completing recombination and for activation of Ndt80p, the activator of the transcriptional network required for subsequent steps in the program. Thus, the MER1 splicing regulatory network mediates the dependent relationship between the UME6 and NDT80 transcriptional regulatory networks in the meiotic gene expression program. This study reveals how splicing regulatory networks can be interlaced with transcriptional regulatory networks in eukaryotic gene expression programs. PMID:21123654

  15. Construct and Compare Gene Coexpression Networks with DAPfinder and DAPview.

    PubMed

    Skinner, Jeff; Kotliarov, Yuri; Varma, Sudhir; Mine, Karina L; Yambartsev, Anatoly; Simon, Richard; Huyen, Yentram; Morgun, Andrey

    2011-07-14

    DAPfinder and DAPview are novel BRB-ArrayTools plug-ins to construct gene coexpression networks and identify significant differences in pairwise gene-gene coexpression between two phenotypes. Each significant difference in gene-gene association represents a Differentially Associated Pair (DAP). Our tools include several choices of filtering methods, gene-gene association metrics, statistical testing methods and multiple comparison adjustments. Network results are easily displayed in Cytoscape. Analyses of glioma experiments and microarray simulations demonstrate the utility of these tools. DAPfinder is a new friendly-user tool for reconstruction and comparison of biological networks.

  16. Genome-wide survey of the seagrass Zostera muelleri suggests modification of the ethylene signalling network.

    PubMed

    Golicz, Agnieszka A; Schliep, Martin; Lee, Huey Tyng; Larkum, Anthony W D; Dolferus, Rudy; Batley, Jacqueline; Chan, Chon-Kit Kenneth; Sablok, Gaurav; Ralph, Peter J; Edwards, David

    2015-03-01

    Seagrasses are flowering plants which grow fully submerged in the marine environment. They have evolved a range of adaptations to environmental challenges including light attenuation through water, the physical stress of wave action and tidal currents, high concentrations of salt, oxygen deficiency in marine sediment, and water-borne pollination. Although, seagrasses are a key stone species of the costal ecosystems, many questions regarding seagrass biology and evolution remain unanswered. Genome sequence data for the widespread Australian seagrass species Zostera muelleri were generated and the unassembled data were compared with the annotated genes of five sequenced plant species (Arabidopsis thaliana, Oryza sativa, Phoenix dactylifera, Musa acuminata, and Spirodela polyrhiza). Genes which are conserved between Z. muelleri and the five plant species were identified, together with genes that have been lost in Z. muelleri. The effect of gene loss on biological processes was assessed on the gene ontology classification level. Gene loss in Z. muelleri appears to influence some core biological processes such as ethylene biosynthesis. This study provides a foundation for further studies of seagrass evolution as well as the hormonal regulation of plant growth and development. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  17. A long-time limit for world subway networks.

    PubMed

    Roth, Camille; Kang, Soong Moon; Batty, Michael; Barthelemy, Marc

    2012-10-07

    We study the temporal evolution of the structure of the world's largest subway networks in an exploratory manner. We show that, remarkably, all these networks converge to a shape that shares similar generic features despite their geographical and economic differences. This limiting shape is made of a core with branches radiating from it. For most of these networks, the average degree of a node (station) within the core has a value of order 2.5 and the proportion of k = 2 nodes in the core is larger than 60 per cent. The number of branches scales roughly as the square root of the number of stations, the current proportion of branches represents about half of the total number of stations, and the average diameter of branches is about twice the average radial extension of the core. Spatial measures such as the number of stations at a given distance to the barycentre display a first regime which grows as r(2) followed by another regime with different exponents, and eventually saturates. These results--difficult to interpret in the framework of fractal geometry--confirm and yield a natural explanation in the geometric picture of this core and their branches: the first regime corresponds to a uniform core, while the second regime is controlled by the interstation spacing on branches. The apparent convergence towards a unique network shape in the temporal limit suggests the existence of dominant, universal mechanisms governing the evolution of these structures.

  18. A long-time limit for world subway networks

    PubMed Central

    Roth, Camille; Kang, Soong Moon; Batty, Michael; Barthelemy, Marc

    2012-01-01

    We study the temporal evolution of the structure of the world's largest subway networks in an exploratory manner. We show that, remarkably, all these networks converge to a shape that shares similar generic features despite their geographical and economic differences. This limiting shape is made of a core with branches radiating from it. For most of these networks, the average degree of a node (station) within the core has a value of order 2.5 and the proportion of k = 2 nodes in the core is larger than 60 per cent. The number of branches scales roughly as the square root of the number of stations, the current proportion of branches represents about half of the total number of stations, and the average diameter of branches is about twice the average radial extension of the core. Spatial measures such as the number of stations at a given distance to the barycentre display a first regime which grows as r2 followed by another regime with different exponents, and eventually saturates. These results—difficult to interpret in the framework of fractal geometry—confirm and yield a natural explanation in the geometric picture of this core and their branches: the first regime corresponds to a uniform core, while the second regime is controlled by the interstation spacing on branches. The apparent convergence towards a unique network shape in the temporal limit suggests the existence of dominant, universal mechanisms governing the evolution of these structures. PMID:22593096

  19. On the Interplay between the Evolvability and Network Robustness in an Evolutionary Biological Network: A Systems Biology Approach

    PubMed Central

    Chen, Bor-Sen; Lin, Ying-Po

    2011-01-01

    In the evolutionary process, the random transmission and mutation of genes provide biological diversities for natural selection. In order to preserve functional phenotypes between generations, gene networks need to evolve robustly under the influence of random perturbations. Therefore, the robustness of the phenotype, in the evolutionary process, exerts a selection force on gene networks to keep network functions. However, gene networks need to adjust, by variations in genetic content, to generate phenotypes for new challenges in the network’s evolution, ie, the evolvability. Hence, there should be some interplay between the evolvability and network robustness in evolutionary gene networks. In this study, the interplay between the evolvability and network robustness of a gene network and a biochemical network is discussed from a nonlinear stochastic system point of view. It was found that if the genetic robustness plus environmental robustness is less than the network robustness, the phenotype of the biological network is robust in evolution. The tradeoff between the genetic robustness and environmental robustness in evolution is discussed from the stochastic stability robustness and sensitivity of the nonlinear stochastic biological network, which may be relevant to the statistical tradeoff between bias and variance, the so-called bias/variance dilemma. Further, the tradeoff could be considered as an antagonistic pleiotropic action of a gene network and discussed from the systems biology perspective. PMID:22084563

  20. Identification of potential crucial genes and construction of microRNA-mRNA negative regulatory networks in osteosarcoma.

    PubMed

    Pan, Yue; Lu, Lingyun; Chen, Junquan; Zhong, Yong; Dai, Zhehao

    2018-01-01

    This study aimed to identify potential crucial genes and construction of microRNA-mRNA negative regulatory networks in osteosarcoma by comprehensive bioinformatics analysis. Data of gene expression profiles (GSE28424) and miRNA expression profiles (GSE28423) were downloaded from GEO database. The differentially expressed genes (DEGs) and miRNAs (DEMIs) were obtained by R Bioconductor packages. Functional and enrichment analyses of selected genes were performed using DAVID database. Protein-protein interaction (PPI) network was constructed by STRING and visualized in Cytoscape. The relationships among the DEGs and module in PPI network were analyzed by plug-in NetworkAnalyzer and MCODE seperately. Through the TargetScan and comparing target genes with DEGs, the miRNA-mRNA regulation network was established. Totally 346 DEGs and 90 DEMIs were found to be differentially expressed. These DEGs were enriched in biological processes and KEGG pathway of inflammatory immune response. 25 genes in the PPI network were selected as hub genes. Top 10 hub genes were TYROBP, HLA-DRA, VWF, PPBP, SERPING1, HLA-DPA1, SERPINA1, KIF20A, FERMT3, HLA-E. PPI network of DEGs followed a pattern of power law network and met the characteristics of small-world network. MCODE analysis identified 4 clusters and the most significant cluster consisted of 11 nodes and 55 edges. SEPP1, CKS2, TCAP, BPI were identified as the seed genes in their own clusters, respectively. The miRNA-mRNA regulation network which was composed of 89 pairs was established. MiR-210 had the highest connectivity with 12 target genes. Among the predicted target of MiR-96, HLA-DPA1 and TYROBP were the hub genes. Our study indicated possible differentially expressed genes and miRNA, and microRNA-mRNA negative regulatory networks in osteosarcoma by bioinformatics analysis, which may provide novel insights for unraveling pathogenesis of osteosarcoma.

  1. Evolutionary re-wiring of p63 and the epigenomic regulatory landscape in keratinocytes and its potential implications on species-specific gene expression and phenotypes

    PubMed Central

    Sethi, Isha; Gluck, Christian; Zhou, Huiqing

    2017-01-01

    Abstract Although epidermal keratinocyte development and differentiation proceeds in similar fashion between humans and mice, evolutionary pressures have also wrought significant species-specific physiological differences. These differences between species could arise in part, by the rewiring of regulatory network due to changes in the global targets of lineage-specific transcriptional master regulators such as p63. Here we have performed a systematic and comparative analysis of the p63 target gene network within the integrated framework of the transcriptomic and epigenomic landscape of mouse and human keratinocytes. We determined that there exists a core set of ∼1600 genomic regions distributed among enhancers and super-enhancers, which are conserved and occupied by p63 in keratinocytes from both species. Notably, these DNA segments are typified by consensus p63 binding motifs under purifying selection and are associated with genes involved in key keratinocyte and skin-centric biological processes. However, the majority of the p63-bound mouse target regions consist of either murine-specific DNA elements that are not alignable to the human genome or exhibit no p63 binding in the orthologous syntenic regions, typifying an occupancy lost subset. Our results suggest that these evolutionarily divergent regions have undergone significant turnover of p63 binding sites and are associated with an underlying inactive and inaccessible chromatin state, indicative of their selective functional activity in the transcriptional regulatory network in mouse but not human. Furthermore, we demonstrate that this selective targeting of genes by p63 correlates with subtle, but measurable transcriptional differences in mouse and human keratinocytes that converges on major metabolic processes, which often exhibit species-specific trends. Collectively our study offers possible molecular explanation for the observable phenotypic differences between the mouse and human skin and broadly informs on the prevailing principles that govern the tug-of-war between evolutionary forces of rigidity and plasticity over transcriptional regulatory programs. PMID:28505376

  2. Robustness in Regulatory Interaction Networks. A Generic Approach with Applications at Different Levels: Physiologic, Metabolic and Genetic

    PubMed Central

    Demongeot, Jacques; Ben Amor, Hedi; Elena, Adrien; Gillois, Pierre; Noual, Mathilde; Sené, Sylvain

    2009-01-01

    Regulatory interaction networks are often studied on their dynamical side (existence of attractors, study of their stability). We focus here also on their robustness, that is their ability to offer the same spatiotemporal patterns and to resist to external perturbations such as losses of nodes or edges in the networks interactions architecture, changes in their environmental boundary conditions as well as changes in the update schedule (or updating mode) of the states of their elements (e.g., if these elements are genes, their synchronous coexpression mode versus their sequential expression). We define the generic notions of boundary, core, and critical vertex or edge of the underlying interaction graph of the regulatory network, whose disappearance causes dramatic changes in the number and nature of attractors (e.g., passage from a bistable behaviour to a unique periodic regime) or in the range of their basins of stability. The dynamic transition of states will be presented in the framework of threshold Boolean automata rules. A panorama of applications at different levels will be given: brain and plant morphogenesis, bulbar cardio-respiratory regulation, glycolytic/oxidative metabolic coupling, and eventually cell cycle and feather morphogenesis genetic control. PMID:20057955

  3. MYC interaction with the tumor suppressive SWI/SNF complex member INI1 regulates transcription and cellular transformation

    PubMed Central

    Stojanova, Angelina; Tu, William B.; Ponzielli, Romina; Kotlyar, Max; Chan, Pak-Kei; Boutros, Paul C.; Khosravi, Fereshteh; Jurisica, Igor; Raught, Brian; Penn, Linda Z.

    2016-01-01

    ABSTRACT MYC is a key driver of cellular transformation and is deregulated in most human cancers. Studies of MYC and its interactors have provided mechanistic insight into its role as a regulator of gene transcription. MYC has been previously linked to chromatin regulation through its interaction with INI1 (SMARCB1/hSNF5/BAF47), a core member of the SWI/SNF chromatin remodeling complex. INI1 is a potent tumor suppressor that is inactivated in several types of cancers, most prominently as the hallmark alteration in pediatric malignant rhabdoid tumors. However, the molecular and functional interaction of MYC and INI1 remains unclear. Here, we characterize the MYC-INI1 interaction in mammalian cells, mapping their minimal binding domains to functionally significant regions of MYC (leucine zipper) and INI1 (repeat motifs), and demonstrating that the interaction does not interfere with MYC-MAX interaction. Protein-protein interaction network analysis expands the MYC-INI1 interaction to the SWI/SNF complex and a larger network of chromatin regulatory complexes. Genome-wide analysis reveals that the DNA-binding regions and target genes of INI1 significantly overlap with those of MYC. In an INI1-deficient rhabdoid tumor system, we observe that with re-expression of INI1, MYC and INI1 bind to common target genes and have opposing effects on gene expression. Functionally, INI1 re-expression suppresses cell proliferation and MYC-potentiated transformation. Our findings thus establish the antagonistic roles of the INI1 and MYC transcriptional regulators in mediating cellular and oncogenic functions. PMID:27267444

  4. Meta-analysis of heat- and chemically upregulated chaperone genes in plant and human cells

    PubMed Central

    Finka, Andrija; Mattoo, Rayees U. H.

    2010-01-01

    Molecular chaperones are central to cellular protein homeostasis. In mammals, protein misfolding diseases and aging cause inflammation and progressive tissue loss, in correlation with the accumulation of toxic protein aggregates and the defective expression of chaperone genes. Bacteria and non-diseased, non-aged eukaryotic cells effectively respond to heat shock by inducing the accumulation of heat-shock proteins (HSPs), many of which molecular chaperones involved in protein homeostasis, in reducing stress damages and promoting cellular recovery and thermotolerance. We performed a meta-analysis of published microarray data and compared expression profiles of HSP genes from mammalian and plant cells in response to heat or isothermal treatments with drugs. The differences and overlaps between HSP and chaperone genes were analyzed, and expression patterns were clustered and organized in a network. HSPs and chaperones only partly overlapped. Heat-shock induced a subset of chaperones primarily targeted to the cytoplasm and organelles but not to the endoplasmic reticulum, which organized into a network with a central core of Hsp90s, Hsp70s, and sHSPs. Heat was best mimicked by isothermal treatments with Hsp90 inhibitors, whereas less toxic drugs, some of which non-steroidal anti-inflammatory drugs, weakly expressed different subsets of Hsp chaperones. This type of analysis may uncover new HSP-inducing drugs to improve protein homeostasis in misfolding and aging diseases. Electronic supplementary material The online version of this article (doi:10.1007/s12192-010-0216-8) contains supplementary material, which is available to authorized users. PMID:20694844

  5. Sex networking of young men who have sex with men in densely connected saunas in Hong Kong.

    PubMed

    Poon, Chin Man; Lee, Shui Shan

    2013-12-01

    Some men who have sex with men (MSM) meet and have sex with male partners at gay saunas, the connections between which are little explored for designing HIV prevention measures. This study aims to describe the network configuration of gay saunas and explore its relationship with risk behavior of MSM in the respective sauna communities, in the city of Hong Kong. Using venue-based sampling, 205 MSM were recruited in 8 saunas in July 2011 for a cross-sectional anonymous questionnaire survey. A network of saunas was constructed based on the proportion of clients shared between them. Core saunas with higher intensity of linkages were delineated from core-periphery analysis. Men who have sex with men in core saunas were compared with those in peripheral ones in terms of their demographics and risk behavioral profiles. Eight core saunas were differentiated from a highly connected sauna network, consisting of 13 saunas with a diameter of 2. Men who have sex with men visiting core saunas were more likely to be younger and users of the Internet for sex networking (odds ratio, 5.43; 95% confidence interval, 1.84-16.01). On average, they visited 1.7 saunas and had 2.6 sauna partners over a 1-month period, which were both significantly higher than those for MSM in peripheral saunas. However, there was no association between having unprotected anal sex and visiting core saunas. Sauna affiliation patterns were age dependent and geographically related. Saunas were not homogeneously connected with each other. Prioritization may be considered so that public health interventions can be targeted at saunas in denser networks. An assortative mixing in age among MSM in sauna community informs planning for client-specific venue-based prevention programs.

  6. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods.

    PubMed

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E; Re, Matteo

    2014-06-01

    In the context of "network medicine", gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different "informativeness" embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both local and global learning strategies, able to exploit the overall topology of the network. Copyright © 2014 The Authors. Published by Elsevier B.V. All rights reserved.

  7. The Variable Regions of Lactobacillus rhamnosus Genomes Reveal the Dynamic Evolution of Metabolic and Host-Adaptation Repertoires

    PubMed Central

    Ceapa, Corina; Davids, Mark; Ritari, Jarmo; Lambert, Jolanda; Wels, Michiel; Douillard, François P.; Smokvina, Tamara; de Vos, Willem M.; Knol, Jan; Kleerebezem, Michiel

    2016-01-01

    Lactobacillus rhamnosus is a diverse Gram-positive species with strains isolated from different ecological niches. Here, we report the genome sequence analysis of 40 diverse strains of L. rhamnosus and their genomic comparison, with a focus on the variable genome. Genomic comparison of 40 L. rhamnosus strains discriminated the conserved genes (core genome) and regions of plasticity involving frequent rearrangements and horizontal transfer (variome). The L. rhamnosus core genome encompasses 2,164 genes, out of 4,711 genes in total (the pan-genome). The accessory genome is dominated by genes encoding carbohydrate transport and metabolism, extracellular polysaccharides (EPS) biosynthesis, bacteriocin production, pili production, the cas system, and the associated clustered regularly interspaced short palindromic repeat (CRISPR) loci, and more than 100 transporter functions and mobile genetic elements like phages, plasmid genes, and transposons. A clade distribution based on amino acid differences between core (shared) proteins matched with the clade distribution obtained from the presence–absence of variable genes. The phylogenetic and variome tree overlap indicated that frequent events of gene acquisition and loss dominated the evolutionary segregation of the strains within this species, which is paralleled by evolutionary diversification of core gene functions. The CRISPR-Cas system could have contributed to this evolutionary segregation. Lactobacillus rhamnosus strains contain the genetic and metabolic machinery with strain-specific gene functions required to adapt to a large range of environments. A remarkable congruency of the evolutionary relatedness of the strains’ core and variome functions, possibly favoring interspecies genetic exchanges, underlines the importance of gene-acquisition and loss within the L. rhamnosus strain diversification. PMID:27358423

  8. Comparison of neuronal spike exchange methods on a Blue Gene/P supercomputer.

    PubMed

    Hines, Michael; Kumar, Sameer; Schürmann, Felix

    2011-01-01

    For neural network simulations on parallel machines, interprocessor spike communication can be a significant portion of the total simulation time. The performance of several spike exchange methods using a Blue Gene/P (BG/P) supercomputer has been tested with 8-128 K cores using randomly connected networks of up to 32 M cells with 1 k connections per cell and 4 M cells with 10 k connections per cell, i.e., on the order of 4·10(10) connections (K is 1024, M is 1024(2), and k is 1000). The spike exchange methods used are the standard Message Passing Interface (MPI) collective, MPI_Allgather, and several variants of the non-blocking Multisend method either implemented via non-blocking MPI_Isend, or exploiting the possibility of very low overhead direct memory access (DMA) communication available on the BG/P. In all cases, the worst performing method was that using MPI_Isend due to the high overhead of initiating a spike communication. The two best performing methods-the persistent Multisend method using the Record-Replay feature of the Deep Computing Messaging Framework DCMF_Multicast; and a two-phase multisend in which a DCMF_Multicast is used to first send to a subset of phase one destination cores, which then pass it on to their subset of phase two destination cores-had similar performance with very low overhead for the initiation of spike communication. Departure from ideal scaling for the Multisend methods is almost completely due to load imbalance caused by the large variation in number of cells that fire on each processor in the interval between synchronization. Spike exchange time itself is negligible since transmission overlaps with computation and is handled by a DMA controller. We conclude that ideal performance scaling will be ultimately limited by imbalance between incoming processor spikes between synchronization intervals. Thus, counterintuitively, maximization of load balance requires that the distribution of cells on processors should not reflect neural net architecture but be randomly distributed so that sets of cells which are burst firing together should be on different processors with their targets on as large a set of processors as possible.

  9. Shared protection based virtual network mapping in space division multiplexing optical networks

    NASA Astrophysics Data System (ADS)

    Zhang, Huibin; Wang, Wei; Zhao, Yongli; Zhang, Jie

    2018-05-01

    Space Division Multiplexing (SDM) has been introduced to improve the capacity of optical networks. In SDM optical networks, there are multiple cores/modes in each fiber link, and spectrum resources are multiplexed in both frequency and core/modes dimensions. Enabled by network virtualization technology, one SDM optical network substrate can be shared by several virtual networks operators. Similar with point-to-point connection services, virtual networks (VN) also need certain survivability to guard against network failures. Based on customers' heterogeneous requirements on the survivability of their virtual networks, this paper studies the shared protection based VN mapping problem and proposes a Minimum Free Frequency Slots (MFFS) mapping algorithm to improve spectrum efficiency. Simulation results show that the proposed algorithm can optimize SDM optical networks significantly in terms of blocking probability and spectrum utilization.

  10. Enhancing gene regulatory network inference through data integration with markov random fields

    DOE PAGES

    Banf, Michael; Rhee, Seung Y.

    2017-02-01

    Here, a gene regulatory network links transcription factors to their target genes and represents a map of transcriptional regulation. Much progress has been made in deciphering gene regulatory networks computationally. However, gene regulatory network inference for most eukaryotic organisms remain challenging. To improve the accuracy of gene regulatory network inference and facilitate candidate selection for experimentation, we developed an algorithm called GRACE (Gene Regulatory network inference ACcuracy Enhancement). GRACE exploits biological a priori and heterogeneous data integration to generate high- confidence network predictions for eukaryotic organisms using Markov Random Fields in a semi-supervised fashion. GRACE uses a novel optimization schememore » to integrate regulatory evidence and biological relevance. It is particularly suited for model learning with sparse regulatory gold standard data. We show GRACE’s potential to produce high confidence regulatory networks compared to state of the art approaches using Drosophila melanogaster and Arabidopsis thaliana data. In an A. thaliana developmental gene regulatory network, GRACE recovers cell cycle related regulatory mechanisms and further hypothesizes several novel regulatory links, including a putative control mechanism of vascular structure formation due to modifications in cell proliferation.« less

  11. Enhancing gene regulatory network inference through data integration with markov random fields

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Banf, Michael; Rhee, Seung Y.

    Here, a gene regulatory network links transcription factors to their target genes and represents a map of transcriptional regulation. Much progress has been made in deciphering gene regulatory networks computationally. However, gene regulatory network inference for most eukaryotic organisms remain challenging. To improve the accuracy of gene regulatory network inference and facilitate candidate selection for experimentation, we developed an algorithm called GRACE (Gene Regulatory network inference ACcuracy Enhancement). GRACE exploits biological a priori and heterogeneous data integration to generate high- confidence network predictions for eukaryotic organisms using Markov Random Fields in a semi-supervised fashion. GRACE uses a novel optimization schememore » to integrate regulatory evidence and biological relevance. It is particularly suited for model learning with sparse regulatory gold standard data. We show GRACE’s potential to produce high confidence regulatory networks compared to state of the art approaches using Drosophila melanogaster and Arabidopsis thaliana data. In an A. thaliana developmental gene regulatory network, GRACE recovers cell cycle related regulatory mechanisms and further hypothesizes several novel regulatory links, including a putative control mechanism of vascular structure formation due to modifications in cell proliferation.« less

  12. What a difference an X or Y makes: sex chromosomes, gene dose, and epigenetics in sexual differentiation

    PubMed Central

    Arnold, Arthur P.; Chen, Xuqi; Itoh, Yuichiro

    2014-01-01

    Summary A modern general theory of sex determination and sexual differentiation identifies the factors that cause sexual bias in gene networks, leading to sex differences in physiology and disease. The primary sex-biasing factors are those encoded on the sex chromosomes that are inherently different in the male and female zygote. These factors, and downstream factors such as gonadal hormones, act directly on tissues to produce sex differences, and to antagonize each other to reduce sex differences. Recent study of mouse models such as the Four Core Genotypes has begun to distinguish between direct effects of sex chromosome complement (XX vs. XY) and hormonal effects. Several lines of evidence implicate epigenetic processes in the control of sex differences, although a great deal of more information is needed about sex differences in the epigenome. PMID:23027446

  13. Stationary and structural control in gene regulatory networks: basic concepts

    NASA Astrophysics Data System (ADS)

    Dougherty, Edward R.; Pal, Ranadip; Qian, Xiaoning; Bittner, Michael L.; Datta, Aniruddha

    2010-01-01

    A major reason for constructing gene regulatory networks is to use them as models for determining therapeutic intervention strategies by deriving ways of altering their long-run dynamics in such a way as to reduce the likelihood of entering undesirable states. In general, two paradigms have been taken for gene network intervention: (1) stationary external control is based on optimally altering the status of a control gene (or genes) over time to drive network dynamics; and (2) structural intervention involves an optimal one-time change of the network structure (wiring) to beneficially alter the long-run behaviour of the network. These intervention approaches have mainly been developed within the context of the probabilistic Boolean network model for gene regulation. This article reviews both types of intervention and applies them to reducing the metastatic competence of cells via intervention in a melanoma-related network.

  14. Differentially Coexpressed Disease Gene Identification Based on Gene Coexpression Network.

    PubMed

    Jiang, Xue; Zhang, Han; Quan, Xiongwen

    2016-01-01

    Screening disease-related genes by analyzing gene expression data has become a popular theme. Traditional disease-related gene selection methods always focus on identifying differentially expressed gene between case samples and a control group. These traditional methods may not fully consider the changes of interactions between genes at different cell states and the dynamic processes of gene expression levels during the disease progression. However, in order to understand the mechanism of disease, it is important to explore the dynamic changes of interactions between genes in biological networks at different cell states. In this study, we designed a novel framework to identify disease-related genes and developed a differentially coexpressed disease-related gene identification method based on gene coexpression network (DCGN) to screen differentially coexpressed genes. We firstly constructed phase-specific gene coexpression network using time-series gene expression data and defined the conception of differential coexpression of genes in coexpression network. Then, we designed two metrics to measure the value of gene differential coexpression according to the change of local topological structures between different phase-specific networks. Finally, we conducted meta-analysis of gene differential coexpression based on the rank-product method. Experimental results demonstrated the feasibility and effectiveness of DCGN and the superior performance of DCGN over other popular disease-related gene selection methods through real-world gene expression data sets.

  15. Network diffusion-based analysis of high-throughput data for the detection of differentially enriched modules

    PubMed Central

    Bersanelli, Matteo; Mosca, Ettore; Remondini, Daniel; Castellani, Gastone; Milanesi, Luciano

    2016-01-01

    A relation exists between network proximity of molecular entities in interaction networks, functional similarity and association with diseases. The identification of network regions associated with biological functions and pathologies is a major goal in systems biology. We describe a network diffusion-based pipeline for the interpretation of different types of omics in the context of molecular interaction networks. We introduce the network smoothing index, a network-based quantity that allows to jointly quantify the amount of omics information in genes and in their network neighbourhood, using network diffusion to define network proximity. The approach is applicable to both descriptive and inferential statistics calculated on omics data. We also show that network resampling, applied to gene lists ranked by quantities derived from the network smoothing index, indicates the presence of significantly connected genes. As a proof of principle, we identified gene modules enriched in somatic mutations and transcriptional variations observed in samples of prostate adenocarcinoma (PRAD). In line with the local hypothesis, network smoothing index and network resampling underlined the existence of a connected component of genes harbouring molecular alterations in PRAD. PMID:27731320

  16. HCV core protein induces hepatic lipid accumulation by activating SREBP1 and PPAR{gamma}

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Kook Hwan; Hong, Sung Pyo; Kim, KyeongJin

    2007-04-20

    Hepatic steatosis is a common feature in patients with chronic hepatitis C virus (HCV) infection. HCV core protein plays an important role in the development of hepatic steatosis in HCV infection. Because SREBP1 (sterol regulatory element binding protein 1) and PPAR{gamma} (peroxisome proliferators-activated receptor {gamma}) are involved in the regulation of lipid metabolism of hepatocyte, we sought to determine whether HCV core protein may impair the expression and activity of SREBP1 and PPAR{gamma}. In this study, it was demonstrated that HCV core protein increases the gene expression of SREBP1 not only in Chang liver, Huh7, and HepG2 cells transiently transfectedmore » with HCV core protein expression plasmid, but also in Chang liver-core stable cells. Furthermore, HCV core protein enhanced the transcriptional activity of SREBP1. In addition, HCV core protein elevated PPAR{gamma} transcriptional activity. However, HCV core protein had no effect on PPAR{gamma} gene expression. Finally, we showed that HCV core protein stimulates the genes expression of lipogenic enzyme and fatty acid uptake associated protein. Therefore, our finding provides a new insight into the mechanism of hepatic steatosis by HCV infection.« less

  17. Laplacian normalization and random walk on heterogeneous networks for disease-gene prioritization.

    PubMed

    Zhao, Zhi-Qin; Han, Guo-Sheng; Yu, Zu-Guo; Li, Jinyan

    2015-08-01

    Random walk on heterogeneous networks is a recently emerging approach to effective disease gene prioritization. Laplacian normalization is a technique capable of normalizing the weight of edges in a network. We use this technique to normalize the gene matrix and the phenotype matrix before the construction of the heterogeneous network, and also use this idea to define the transition matrices of the heterogeneous network. Our method has remarkably better performance than the existing methods for recovering known gene-phenotype relationships. The Shannon information entropy of the distribution of the transition probabilities in our networks is found to be smaller than the networks constructed by the existing methods, implying that a higher number of top-ranked genes can be verified as disease genes. In fact, the most probable gene-phenotype relationships ranked within top 3 or top 5 in our gene lists can be confirmed by the OMIM database for many cases. Our algorithms have shown remarkably superior performance over the state-of-the-art algorithms for recovering gene-phenotype relationships. All Matlab codes can be available upon email request. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. Integration of multi-omics data for integrative gene regulatory network inference.

    PubMed

    Zarayeneh, Neda; Ko, Euiseong; Oh, Jung Hun; Suh, Sang; Liu, Chunyu; Gao, Jean; Kim, Donghyun; Kang, Mingon

    2017-01-01

    Gene regulatory networks provide comprehensive insights and indepth understanding of complex biological processes. The molecular interactions of gene regulatory networks are inferred from a single type of genomic data, e.g., gene expression data in most research. However, gene expression is a product of sequential interactions of multiple biological processes, such as DNA sequence variations, copy number variations, histone modifications, transcription factors, and DNA methylations. The recent rapid advances of high-throughput omics technologies enable one to measure multiple types of omics data, called 'multi-omics data', that represent the various biological processes. In this paper, we propose an Integrative Gene Regulatory Network inference method (iGRN) that incorporates multi-omics data and their interactions in gene regulatory networks. In addition to gene expressions, copy number variations and DNA methylations were considered for multi-omics data in this paper. The intensive experiments were carried out with simulation data, where iGRN's capability that infers the integrative gene regulatory network is assessed. Through the experiments, iGRN shows its better performance on model representation and interpretation than other integrative methods in gene regulatory network inference. iGRN was also applied to a human brain dataset of psychiatric disorders, and the biological network of psychiatric disorders was analysed.

  19. Integration of multi-omics data for integrative gene regulatory network inference

    PubMed Central

    Zarayeneh, Neda; Ko, Euiseong; Oh, Jung Hun; Suh, Sang; Liu, Chunyu; Gao, Jean; Kim, Donghyun

    2017-01-01

    Gene regulatory networks provide comprehensive insights and indepth understanding of complex biological processes. The molecular interactions of gene regulatory networks are inferred from a single type of genomic data, e.g., gene expression data in most research. However, gene expression is a product of sequential interactions of multiple biological processes, such as DNA sequence variations, copy number variations, histone modifications, transcription factors, and DNA methylations. The recent rapid advances of high-throughput omics technologies enable one to measure multiple types of omics data, called ‘multi-omics data’, that represent the various biological processes. In this paper, we propose an Integrative Gene Regulatory Network inference method (iGRN) that incorporates multi-omics data and their interactions in gene regulatory networks. In addition to gene expressions, copy number variations and DNA methylations were considered for multi-omics data in this paper. The intensive experiments were carried out with simulation data, where iGRN’s capability that infers the integrative gene regulatory network is assessed. Through the experiments, iGRN shows its better performance on model representation and interpretation than other integrative methods in gene regulatory network inference. iGRN was also applied to a human brain dataset of psychiatric disorders, and the biological network of psychiatric disorders was analysed. PMID:29354189

  20. Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model.

    PubMed

    Ni, Jingchao; Koyuturk, Mehmet; Tong, Hanghang; Haines, Jonathan; Xu, Rong; Zhang, Xiang

    2016-11-10

    Accurately prioritizing candidate disease genes is an important and challenging problem. Various network-based methods have been developed to predict potential disease genes by utilizing the disease similarity network and molecular networks such as protein interaction or gene co-expression networks. Although successful, a common limitation of the existing methods is that they assume all diseases share the same molecular network and a single generic molecular network is used to predict candidate genes for all diseases. However, different diseases tend to manifest in different tissues, and the molecular networks in different tissues are usually different. An ideal method should be able to incorporate tissue-specific molecular networks for different diseases. In this paper, we develop a robust and flexible method to integrate tissue-specific molecular networks for disease gene prioritization. Our method allows each disease to have its own tissue-specific network(s). We formulate the problem of candidate gene prioritization as an optimization problem based on network propagation. When there are multiple tissue-specific networks available for a disease, our method can automatically infer the relative importance of each tissue-specific network. Thus it is robust to the noisy and incomplete network data. To solve the optimization problem, we develop fast algorithms which have linear time complexities in the number of nodes in the molecular networks. We also provide rigorous theoretical foundations for our algorithms in terms of their optimality and convergence properties. Extensive experimental results show that our method can significantly improve the accuracy of candidate gene prioritization compared with the state-of-the-art methods. In our experiments, we compare our methods with 7 popular network-based disease gene prioritization algorithms on diseases from Online Mendelian Inheritance in Man (OMIM) database. The experimental results demonstrate that our methods recover true associations more accurately than other methods in terms of AUC values, and the performance differences are significant (with paired t-test p-values less than 0.05). This validates the importance to integrate tissue-specific molecular networks for studying disease gene prioritization and show the superiority of our network models and ranking algorithms toward this purpose. The source code and datasets are available at http://nijingchao.github.io/CRstar/ .

  1. Intrafocal heterogeneity of ERG protein expression and gene fusion pattern in prostate cancer.

    PubMed

    Suh, Ja Hee; Park, Jeong Hwan; Lee, Cheol; Moon, Kyung Chul

    2017-10-01

    Prostate cancer is considered to be highly heterogeneous, with various morphologic features and biologic behaviors. The TMPRSS2-ERG gene fusion is the most frequently observed genetic aberration in prostate cancer. The aim of this study was to elucidate the intrafocal heterogeneity of ERG gene fusion status. ERG immunohistochemistry (IHC) was performed in samples from 168 prostate cancer patients who had undergone radical prostatectomy, and 40 cases showing ERG-positive IHC staining were selected for tissue microarray (TMA) construction. Two to six representative cores were selected from each tumor focus. In the cases with heterogeneous ERG IHC staining intensity, the areas showing different intensities were separately selected. Using the TMA blocks, IHC and fluorescence in situ hybridization (FISH) were conducted to evaluate the heterogeneity of ERG protein expression and ERG fusion gene patterns, respectively, in a single tumor focus. Heterogeneity of ERG IHC staining was defined as the simultaneous presence of negative and positive cores in the same tumor focus. Heterogeneity of ERG FISH was defined by the presence of cores with positive and negative FISH signals or cores with break-apart and interstitial deletion FISH signals in the same tumor focus. A total of 202 TMA cores were isolated from 40 ERG-positive cases. Of the 202 total cores, 19 were negative for ERG IHC staining, and 46 showed 1+, 52 showed 2+, and 85 showed 3+ ERG staining intensity. Eleven cores were negative for ERG FISH signal, 119 cores showed ERG break-apart FISH signals, and the remaining 72 cores revealed interstitial deletion. Intrafocal heterogeneity of ERG IHC staining was found in 20% (8/40) of cases, and intrafocal heterogeneity of ERG gene fusion pattern was found in 32.5% (13/40) of cases. In summary, this study showed significantly frequent intrafocal heterogeneity of ERG protein expression, gene fusion status and fusion pattern. This heterogeneity can be caused by the development of subclones during cancer progression or the intermingling of different tumors. © 2017 Wiley Periodicals, Inc.

  2. Analysis of the SOS response of Vibrio and other bacteria with multiple chromosomes.

    PubMed

    Sanchez-Alberola, Neus; Campoy, Susana; Barbé, Jordi; Erill, Ivan

    2012-02-03

    The SOS response is a well-known regulatory network present in most bacteria and aimed at addressing DNA damage. It has also been linked extensively to stress-induced mutagenesis, virulence and the emergence and dissemination of antibiotic resistance determinants. Recently, the SOS response has been shown to regulate the activity of integrases in the chromosomal superintegrons of the Vibrionaceae, which encompasses a wide range of pathogenic species harboring multiple chromosomes. Here we combine in silico and in vitro techniques to perform a comparative genomics analysis of the SOS regulon in the Vibrionaceae, and we extend the methodology to map this transcriptional network in other bacterial species harboring multiple chromosomes. Our analysis provides the first comprehensive description of the SOS response in a family (Vibrionaceae) that includes major human pathogens. It also identifies several previously unreported members of the SOS transcriptional network, including two proteins of unknown function. The analysis of the SOS response in other bacterial species with multiple chromosomes uncovers additional regulon members and reveals that there is a conserved core of SOS genes, and that specialized additions to this basic network take place in different phylogenetic groups. Our results also indicate that across all groups the main elements of the SOS response are always found in the large chromosome, whereas specialized additions are found in the smaller chromosomes and plasmids. Our findings confirm that the SOS response of the Vibrionaceae is strongly linked with pathogenicity and dissemination of antibiotic resistance, and suggest that the characterization of the newly identified members of this regulon could provide key insights into the pathogenesis of Vibrio. The persistent location of key SOS genes in the large chromosome across several bacterial groups confirms that the SOS response plays an essential role in these organisms and sheds light into the mechanisms of evolution of global transcriptional networks involved in adaptability and rapid response to environmental changes, suggesting that small chromosomes may act as evolutionary test beds for the rewiring of transcriptional networks.

  3. Super-channel oriented routing, spectrum and core assignment under crosstalk limit in spatial division multiplexing elastic optical networks

    NASA Astrophysics Data System (ADS)

    Zhao, Yongli; Zhu, Ye; Wang, Chunhui; Yu, Xiaosong; Liu, Chuan; Liu, Binglin; Zhang, Jie

    2017-07-01

    With the capacity increasing in optical networks enabled by spatial division multiplexing (SDM) technology, spatial division multiplexing elastic optical networks (SDM-EONs) attract much attention from both academic and industry. Super-channel is an important type of service provisioning in SDM-EONs. This paper focuses on the issue of super-channel construction in SDM-EONs. Mixed super-channel oriented routing, spectrum and core assignment (MS-RSCA) algorithm is proposed in SDM-EONs considering inter-core crosstalk. Simulation results show that MS-RSCA can improve spectrum resource utilization and reduce blocking probability significantly compared with the baseline RSCA algorithms.

  4. Inference of cancer-specific gene regulatory networks using soft computing rules.

    PubMed

    Wang, Xiaosheng; Gotoh, Osamu

    2010-03-24

    Perturbations of gene regulatory networks are essentially responsible for oncogenesis. Therefore, inferring the gene regulatory networks is a key step to overcoming cancer. In this work, we propose a method for inferring directed gene regulatory networks based on soft computing rules, which can identify important cause-effect regulatory relations of gene expression. First, we identify important genes associated with a specific cancer (colon cancer) using a supervised learning approach. Next, we reconstruct the gene regulatory networks by inferring the regulatory relations among the identified genes, and their regulated relations by other genes within the genome. We obtain two meaningful findings. One is that upregulated genes are regulated by more genes than downregulated ones, while downregulated genes regulate more genes than upregulated ones. The other one is that tumor suppressors suppress tumor activators and activate other tumor suppressors strongly, while tumor activators activate other tumor activators and suppress tumor suppressors weakly, indicating the robustness of biological systems. These findings provide valuable insights into the pathogenesis of cancer.

  5. Structural stability of interaction networks against negative external fields

    NASA Astrophysics Data System (ADS)

    Yoon, S.; Goltsev, A. V.; Mendes, J. F. F.

    2018-04-01

    We explore structural stability of weighted and unweighted networks of positively interacting agents against a negative external field. We study how the agents support the activity of each other to confront the negative field, which suppresses the activity of agents and can lead to collapse of the whole network. The competition between the interactions and the field shape the structure of stable states of the system. In unweighted networks (uniform interactions) the stable states have the structure of k -cores of the interaction network. The interplay between the topology and the distribution of weights (heterogeneous interactions) impacts strongly the structural stability against a negative field, especially in the case of fat-tailed distributions of weights. We show that apart from critical slowing down there is also a critical change in the system structure that precedes the network collapse. The change can serve as an early warning of the critical transition. To characterize changes of network structure we develop a method based on statistical analysis of the k -core organization and so-called "corona" clusters belonging to the k -cores.

  6. Anatomy of an online misinformation network.

    PubMed

    Shao, Chengcheng; Hui, Pik-Mai; Wang, Lei; Jiang, Xinwen; Flammini, Alessandro; Menczer, Filippo; Ciampaglia, Giovanni Luca

    2018-01-01

    Massive amounts of fake news and conspiratorial content have spread over social media before and after the 2016 US Presidential Elections despite intense fact-checking efforts. How do the spread of misinformation and fact-checking compete? What are the structural and dynamic characteristics of the core of the misinformation diffusion network, and who are its main purveyors? How to reduce the overall amount of misinformation? To explore these questions we built Hoaxy, an open platform that enables large-scale, systematic studies of how misinformation and fact-checking spread and compete on Twitter. Hoaxy captures public tweets that include links to articles from low-credibility and fact-checking sources. We perform k-core decomposition on a diffusion network obtained from two million retweets produced by several hundred thousand accounts over the six months before the election. As we move from the periphery to the core of the network, fact-checking nearly disappears, while social bots proliferate. The number of users in the main core reaches equilibrium around the time of the election, with limited churn and increasingly dense connections. We conclude by quantifying how effectively the network can be disrupted by penalizing the most central nodes. These findings provide a first look at the anatomy of a massive online misinformation diffusion network.

  7. Anatomy of an online misinformation network

    PubMed Central

    Wang, Lei; Jiang, Xinwen; Flammini, Alessandro; Ciampaglia, Giovanni Luca

    2018-01-01

    Massive amounts of fake news and conspiratorial content have spread over social media before and after the 2016 US Presidential Elections despite intense fact-checking efforts. How do the spread of misinformation and fact-checking compete? What are the structural and dynamic characteristics of the core of the misinformation diffusion network, and who are its main purveyors? How to reduce the overall amount of misinformation? To explore these questions we built Hoaxy, an open platform that enables large-scale, systematic studies of how misinformation and fact-checking spread and compete on Twitter. Hoaxy captures public tweets that include links to articles from low-credibility and fact-checking sources. We perform k-core decomposition on a diffusion network obtained from two million retweets produced by several hundred thousand accounts over the six months before the election. As we move from the periphery to the core of the network, fact-checking nearly disappears, while social bots proliferate. The number of users in the main core reaches equilibrium around the time of the election, with limited churn and increasingly dense connections. We conclude by quantifying how effectively the network can be disrupted by penalizing the most central nodes. These findings provide a first look at the anatomy of a massive online misinformation diffusion network. PMID:29702657

  8. Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.

    PubMed

    Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J

    2016-11-04

    Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred-B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks - maize1 and maize2 exhibit similar levels of intra subgenome correlations. Intriguingly, the level of inter subgenome co-expression was similar to the level of intra subgenome correlations, and genes from specific subgenomes were not likely to be the enriched in co-expression network modules and the hub genes were not predominantly from any specific subgenomes in maize. Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types, duplication ages and co-expression consequences.

  9. Global Mapping of the Yeast Genetic Interaction Network

    NASA Astrophysics Data System (ADS)

    Tong, Amy Hin Yan; Lesage, Guillaume; Bader, Gary D.; Ding, Huiming; Xu, Hong; Xin, Xiaofeng; Young, James; Berriz, Gabriel F.; Brost, Renee L.; Chang, Michael; Chen, YiQun; Cheng, Xin; Chua, Gordon; Friesen, Helena; Goldberg, Debra S.; Haynes, Jennifer; Humphries, Christine; He, Grace; Hussein, Shamiza; Ke, Lizhu; Krogan, Nevan; Li, Zhijian; Levinson, Joshua N.; Lu, Hong; Ménard, Patrice; Munyana, Christella; Parsons, Ainslie B.; Ryan, Owen; Tonikian, Raffi; Roberts, Tania; Sdicu, Anne-Marie; Shapiro, Jesse; Sheikh, Bilal; Suter, Bernhard; Wong, Sharyl L.; Zhang, Lan V.; Zhu, Hongwei; Burd, Christopher G.; Munro, Sean; Sander, Chris; Rine, Jasper; Greenblatt, Jack; Peter, Matthias; Bretscher, Anthony; Bell, Graham; Roth, Frederick P.; Brown, Grant W.; Andrews, Brenda; Bussey, Howard; Boone, Charles

    2004-02-01

    A genetic interaction network containing ~1000 genes and ~4000 interactions was mapped by crossing mutations in 132 different query genes into a set of ~4700 viable gene yeast deletion mutants and scoring the double mutant progeny for fitness defects. Network connectivity was predictive of function because interactions often occurred among functionally related genes, and similar patterns of interactions tended to identify components of the same pathway. The genetic network exhibited dense local neighborhoods; therefore, the position of a gene on a partially mapped network is predictive of other genetic interactions. Because digenic interactions are common in yeast, similar networks may underlie the complex genetics associated with inherited phenotypes in other organisms.

  10. Gene network-based analysis identifies two potential subtypes of small intestinal neuroendocrine tumors.

    PubMed

    Kidd, Mark; Modlin, Irvin M; Drozdov, Ignat

    2014-07-15

    Tumor transcriptomes contain information of critical value to understanding the different capacities of a cell at both a physiological and pathological level. In terms of clinical relevance, they provide information regarding the cellular "toolbox" e.g., pathways associated with malignancy and metastasis or drug dependency. Exploration of this resource can therefore be leveraged as a translational tool to better manage and assess neoplastic behavior. The availability of public genome-wide expression datasets, provide an opportunity to reassess neuroendocrine tumors at a more fundamental level. We hypothesized that stringent analysis of expression profiles as well as regulatory networks of the neoplastic cell would provide novel information that facilitates further delineation of the genomic basis of small intestinal neuroendocrine tumors. We re-analyzed two publically available small intestinal tumor transcriptomes using stringent quality control parameters and network-based approaches and validated expression of core secretory regulatory elements e.g., CPE, PCSK1, secretogranins, including genes involved in depolarization e.g., SCN3A, as well as transcription factors associated with neurodevelopment (NKX2-2, NeuroD1, INSM1) and glucose homeostasis (APLP1). The candidate metastasis-associated transcription factor, ST18, was highly expressed (>14-fold, p < 0.004). Genes previously associated with neoplasia, CEBPA and SDHD, were decreased in expression (-1.5 - -2, p < 0.02). Genomic interrogation indicated that intestinal tumors may consist of two different subtypes, serotonin-producing neoplasms and serotonin/substance P/tachykinin lesions. QPCR validation in an independent dataset (n = 13 neuroendocrine tumors), confirmed up-regulated expression of 87% of genes (13/15). An integrated cellular transcriptomic analysis of small intestinal neuroendocrine tumors identified that they are regulated at a developmental level, have key activation of hypoxic pathways (a known regulator of malignant stem cell phenotypes) as well as activation of genes involved in apoptosis and proliferation. Further refinement of these analyses by RNAseq studies of large-scale databases will enable definition of individual master regulators and facilitate the development of novel tissue and blood-based tools to better understand diagnose and treat tumors.

  11. Transcriptome profiling analysis reveals biomarkers in colon cancer samples of various differentiation

    PubMed Central

    Yu, Tonghu; Zhang, Huaping; Qi, Hong

    2018-01-01

    The aim of the present study was to investigate more colon cancer-related genes in different stages. Gene expression profile E-GEOD-62932 was extracted for differentially expressed gene (DEG) screening. Series test of cluster analysis was used to obtain significant trending models. Based on the Gene Ontology and Kyoto Encyclopedia of Genes and Genomes databases, functional and pathway enrichment analysis were processed and a pathway relation network was constructed. Gene co-expression network and gene signal network were constructed for common DEGs. The DEGs with the same trend were clustered and in total, 16 clusters with statistical significance were obtained. The screened DEGs were enriched into small molecule metabolic process and metabolic pathways. The pathway relation network was constructed with 57 nodes. A total of 328 common DEGs were obtained. Gene signal network was constructed with 71 nodes. Gene co-expression network was constructed with 161 nodes and 211 edges. ABCD3, CPT2, AGL and JAM2 are potential biomarkers for the diagnosis of colon cancer. PMID:29928385

  12. Gene networks and the evolution of plant morphology.

    PubMed

    Das Gupta, Mainak; Tsiantis, Miltos

    2018-06-06

    Elaboration of morphology depends on the precise orchestration of gene expression by key regulatory genes. The hierarchy and relationship among the participating genes is commonly known as gene regulatory network (GRN). Therefore, the evolution of morphology ultimately occurs by the rewiring of gene network structures or by the co-option of gene networks to novel domains. The availability of high-resolution expression data combined with powerful statistical tools have opened up new avenues to formulate and test hypotheses on how diverse gene networks influence trait development and diversity. Here we summarize recent studies based on both big-data and genetics approaches to understand the evolution of plant form and physiology. We also discuss recent genome-wide investigations on how studying open-chromatin regions may help study the evolution of gene expression patterns. Copyright © 2018. Published by Elsevier Ltd.

  13. Robust gene network analysis reveals alteration of the STAT5a network as a hallmark of prostate cancer.

    PubMed

    Reddy, Anupama; Huang, C Chris; Liu, Huiqing; Delisi, Charles; Nevalainen, Marja T; Szalma, Sandor; Bhanot, Gyan

    2010-01-01

    We develop a general method to identify gene networks from pair-wise correlations between genes in a microarray data set and apply it to a public prostate cancer gene expression data from 69 primary prostate tumors. We define the degree of a node as the number of genes significantly associated with the node and identify hub genes as those with the highest degree. The correlation network was pruned using transcription factor binding information in VisANT (http://visant.bu.edu/) as a biological filter. The reliability of hub genes was determined using a strict permutation test. Separate networks for normal prostate samples, and prostate cancer samples from African Americans (AA) and European Americans (EA) were generated and compared. We found that the same hubs control disease progression in AA and EA networks. Combining AA and EA samples, we generated networks for low low (<7) and high (≥7) Gleason grade tumors. A comparison of their major hubs with those of the network for normal samples identified two types of changes associated with disease: (i) Some hub genes increased their degree in the tumor network compared to their degree in the normal network, suggesting that these genes are associated with gain of regulatory control in cancer (e.g. possible turning on of oncogenes). (ii) Some hubs reduced their degree in the tumor network compared to their degree in the normal network, suggesting that these genes are associated with loss of regulatory control in cancer (e.g. possible loss of tumor suppressor genes). A striking result was that for both AA and EA tumor samples, STAT5a, CEBPB and EGR1 are major hubs that gain neighbors compared to the normal prostate network. Conversely, HIF-lα is a major hub that loses connections in the prostate cancer network compared to the normal prostate network. We also find that the degree of these hubs changes progressively from normal to low grade to high grade disease, suggesting that these hubs are master regulators of prostate cancer and marks disease progression. STAT5a was identified as a central hub, with ~120 neighbors in the prostate cancer network and only 81 neighbors in the normal prostate network. Of the 120 neighbors of STAT5a, 57 are known cancer related genes, known to be involved in functional pathways associated with tumorigenesis. Our method is general and can easily be extended to identify and study networks associated with any two phenotypes.

  14. MINER: exploratory analysis of gene interaction networks by machine learning from expression data.

    PubMed

    Kadupitige, Sidath Randeni; Leung, Kin Chun; Sellmeier, Julia; Sivieng, Jane; Catchpoole, Daniel R; Bain, Michael E; Gaëta, Bruno A

    2009-12-03

    The reconstruction of gene regulatory networks from high-throughput "omics" data has become a major goal in the modelling of living systems. Numerous approaches have been proposed, most of which attempt only "one-shot" reconstruction of the whole network with no intervention from the user, or offer only simple correlation analysis to infer gene dependencies. We have developed MINER (Microarray Interactive Network Exploration and Representation), an application that combines multivariate non-linear tree learning of individual gene regulatory dependencies, visualisation of these dependencies as both trees and networks, and representation of known biological relationships based on common Gene Ontology annotations. MINER allows biologists to explore the dependencies influencing the expression of individual genes in a gene expression data set in the form of decision, model or regression trees, using their domain knowledge to guide the exploration and formulate hypotheses. Multiple trees can then be summarised in the form of a gene network diagram. MINER is being adopted by several of our collaborators and has already led to the discovery of a new significant regulatory relationship with subsequent experimental validation. Unlike most gene regulatory network inference methods, MINER allows the user to start from genes of interest and build the network gene-by-gene, incorporating domain expertise in the process. This approach has been used successfully with RNA microarray data but is applicable to other quantitative data produced by high-throughput technologies such as proteomics and "next generation" DNA sequencing.

  15. Analysis of the core genome and pangenome of Pseudomonas putida.

    PubMed

    Udaondo, Zulema; Molina, Lázaro; Segura, Ana; Duque, Estrella; Ramos, Juan L

    2016-10-01

    Pseudomonas putida are strict aerobes that proliferate in a range of temperate niches and are of interest for environmental applications due to their capacity to degrade pollutants and ability to promote plant growth. Furthermore solvent-tolerant strains are useful for biosynthesis of added-value chemicals. We present a comprehensive comparative analysis of nine strains and the first characterization of the Pseudomonas putida pangenome. The core genome of P. putida comprises approximately 3386 genes. The most abundant genes within the core genome are those that encode nutrient transporters. Other conserved genes include those for central carbon metabolism through the Entner-Doudoroff pathway, the pentose phosphate cycle, arginine and proline metabolism, and pathways for degradation of aromatic chemicals. Genes that encode transporters, enzymes and regulators for amino acid metabolism (synthesis and degradation) are all part of the core genome, as well as various electron transporters, which enable aerobic metabolism under different oxygen regimes. Within the core genome are 30 genes for flagella biosynthesis and 12 key genes for biofilm formation. Pseudomonas putida strains share 85% of the coding regions with Pseudomonas aeruginosa; however, in P. putida, virulence factors such as exotoxins and type III secretion systems are absent. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.

  16. Comparative analysis of protein interactome networks prioritizes candidate genes with cancer signatures.

    PubMed

    Li, Yongsheng; Sahni, Nidhi; Yi, Song

    2016-11-29

    Comprehensive understanding of human cancer mechanisms requires the identification of a thorough list of cancer-associated genes, which could serve as biomarkers for diagnoses and therapies in various types of cancer. Although substantial progress has been made in functional studies to uncover genes involved in cancer, these efforts are often time-consuming and costly. Therefore, it remains challenging to comprehensively identify cancer candidate genes. Network-based methods have accelerated this process through the analysis of complex molecular interactions in the cell. However, the extent to which various interactome networks can contribute to prediction of candidate genes responsible for cancer is still enigmatic. In this study, we evaluated different human protein-protein interactome networks and compared their application to cancer gene prioritization. Our results indicate that network analyses can increase the power to identify novel cancer genes. In particular, such predictive power can be enhanced with the use of unbiased systematic protein interaction maps for cancer gene prioritization. Functional analysis reveals that the top ranked genes from network predictions co-occur often with cancer-related terms in literature, and further, these candidate genes are indeed frequently mutated across cancers. Finally, our study suggests that integrating interactome networks with other omics datasets could provide novel insights into cancer-associated genes and underlying molecular mechanisms.

  17. Analysis of the SWI/SNF chromatin-remodeling complex during early heart development and BAF250a repression cardiac gene transcription during P19 cell differentiation

    PubMed Central

    Singh, Ajeet Pratap; Archer, Trevor K.

    2014-01-01

    The regulatory networks of differentiation programs and the molecular mechanisms of lineage-specific gene regulation in mammalian embryos remain only partially defined. We document differential expression and temporal switching of BRG1-associated factor (BAF) subunits, core pluripotency factors and cardiac-specific genes during post-implantation development and subsequent early organogenesis. Using affinity purification of BRG1 ATPase coupled to mass spectrometry, we characterized the cardiac-enriched remodeling complexes present in E8.5 mouse embryos. The relative abundance and combinatorial assembly of the BAF subunits provides functional specificity to Switch/Sucrose NonFermentable (SWI/SNF) complexes resulting in a unique gene expression profile in the developing heart. Remarkably, the specific depletion of the BAF250a subunit demonstrated differential effects on cardiac-specific gene expression and resulted in arrhythmic contracting cardiomyocytes in vitro. Indeed, the BAF250a physically interacts and functionally cooperates with Nucleosome Remodeling and Histone Deacetylase (NURD) complex subunits to repressively regulate chromatin structure of the cardiac genes by switching open and poised chromatin marks associated with active and repressed gene expression. Finally, BAF250a expression modulates BRG1 occupancy at the loci of cardiac genes regulatory regions in P19 cell differentiation. These findings reveal specialized and novel cardiac-enriched SWI/SNF chromatin-remodeling complexes, which are required for heart formation and critical for cardiac gene expression regulation at the early stages of heart development. PMID:24335282

  18. DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.

    PubMed

    Mistry, Divya; Wise, Roger P; Dickerson, Julie A

    2017-01-01

    Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. The python package can be obtained from git.io/diffslcpy. The R implementation and code to reproduce the analysis is available via git.io/diffslc.

  19. Machine Learning–Based Differential Network Analysis: A Study of Stress-Responsive Transcriptomes in Arabidopsis[W

    PubMed Central

    Ma, Chuang; Xin, Mingming; Feldmann, Kenneth A.; Wang, Xiangfeng

    2014-01-01

    Machine learning (ML) is an intelligent data mining technique that builds a prediction model based on the learning of prior knowledge to recognize patterns in large-scale data sets. We present an ML-based methodology for transcriptome analysis via comparison of gene coexpression networks, implemented as an R package called machine learning–based differential network analysis (mlDNA) and apply this method to reanalyze a set of abiotic stress expression data in Arabidopsis thaliana. The mlDNA first used a ML-based filtering process to remove nonexpressed, constitutively expressed, or non-stress-responsive “noninformative” genes prior to network construction, through learning the patterns of 32 expression characteristics of known stress-related genes. The retained “informative” genes were subsequently analyzed by ML-based network comparison to predict candidate stress-related genes showing expression and network differences between control and stress networks, based on 33 network topological characteristics. Comparative evaluation of the network-centric and gene-centric analytic methods showed that mlDNA substantially outperformed traditional statistical testing–based differential expression analysis at identifying stress-related genes, with markedly improved prediction accuracy. To experimentally validate the mlDNA predictions, we selected 89 candidates out of the 1784 predicted salt stress–related genes with available SALK T-DNA mutagenesis lines for phenotypic screening and identified two previously unreported genes, mutants of which showed salt-sensitive phenotypes. PMID:24520154

  20. Regulatory network involving miRNAs and genes in serous ovarian carcinoma

    PubMed Central

    Zhao, Haiyan; Xu, Hao; Xue, Luchen

    2017-01-01

    Serous ovarian carcinoma (SOC) is one of the most life-threatening types of gynecological malignancy, but the pathogenesis of SOC remains unknown. Previous studies have indicated that differentially expressed genes and microRNAs (miRNAs) serve important functions in SOC. However, genes and miRNAs are identified in a disperse form, and limited information is known about the regulatory association between miRNAs and genes in SOC. In the present study, three regulatory networks were hierarchically constructed, including a differentially-expressed network, a related network and a global network to reveal associations between each factor. In each network, there were three types of factors, which were genes, miRNAs and transcription factors that interact with each other. Focus was placed on the differentially-expressed network, in which all genes and miRNAs were differentially expressed and therefore may have affected the development of SOC. Following the comparison and analysis between the three networks, a number of signaling pathways which demonstrated differentially expressed elements were highlighted. Subsequently, the upstream and downstream elements of differentially expressed miRNAs and genes were listed, and a number of key elements (differentially expressed miRNAs, genes and TFs predicted using the P-match method) were analyzed. The differentially expressed network partially illuminated the pathogenesis of SOC. It was hypothesized that if there was no differential expression of miRNAs and genes, SOC may be prevented and treatment may be identified. The present study provided a theoretical foundation for gene therapy for SOC. PMID:29113276

  1. A human functional protein interaction network and its application to cancer data analysis

    PubMed Central

    2010-01-01

    Background One challenge facing biologists is to tease out useful information from massive data sets for further analysis. A pathway-based analysis may shed light by projecting candidate genes onto protein functional relationship networks. We are building such a pathway-based analysis system. Results We have constructed a protein functional interaction network by extending curated pathways with non-curated sources of information, including protein-protein interactions, gene coexpression, protein domain interaction, Gene Ontology (GO) annotations and text-mined protein interactions, which cover close to 50% of the human proteome. By applying this network to two glioblastoma multiforme (GBM) data sets and projecting cancer candidate genes onto the network, we found that the majority of GBM candidate genes form a cluster and are closer than expected by chance, and the majority of GBM samples have sequence-altered genes in two network modules, one mainly comprising genes whose products are localized in the cytoplasm and plasma membrane, and another comprising gene products in the nucleus. Both modules are highly enriched in known oncogenes, tumor suppressors and genes involved in signal transduction. Similar network patterns were also found in breast, colorectal and pancreatic cancers. Conclusions We have built a highly reliable functional interaction network upon expert-curated pathways and applied this network to the analysis of two genome-wide GBM and several other cancer data sets. The network patterns revealed from our results suggest common mechanisms in the cancer biology. Our system should provide a foundation for a network or pathway-based analysis platform for cancer and other diseases. PMID:20482850

  2. A Risk Stratification Model for Lung Cancer Based on Gene Coexpression Network and Deep Learning

    PubMed Central

    2018-01-01

    Risk stratification model for lung cancer with gene expression profile is of great interest. Instead of previous models based on individual prognostic genes, we aimed to develop a novel system-level risk stratification model for lung adenocarcinoma based on gene coexpression network. Using multiple microarray, gene coexpression network analysis was performed to identify survival-related networks. A deep learning based risk stratification model was constructed with representative genes of these networks. The model was validated in two test sets. Survival analysis was performed using the output of the model to evaluate whether it could predict patients' survival independent of clinicopathological variables. Five networks were significantly associated with patients' survival. Considering prognostic significance and representativeness, genes of the two survival-related networks were selected for input of the model. The output of the model was significantly associated with patients' survival in two test sets and training set (p < 0.00001, p < 0.0001 and p = 0.02 for training and test sets 1 and 2, resp.). In multivariate analyses, the model was associated with patients' prognosis independent of other clinicopathological features. Our study presents a new perspective on incorporating gene coexpression networks into the gene expression signature and clinical application of deep learning in genomic data science for prognosis prediction. PMID:29581968

  3. Identification of Human Disease Genes from Interactome Network Using Graphlet Interaction

    PubMed Central

    Yang, Lun; Wei, Dong-Qing; Qi, Ying-Xin; Jiang, Zong-Lai

    2014-01-01

    Identifying genes related to human diseases, such as cancer and cardiovascular disease, etc., is an important task in biomedical research because of its applications in disease diagnosis and treatment. Interactome networks, especially protein-protein interaction networks, had been used to disease genes identification based on the hypothesis that strong candidate genes tend to closely relate to each other in some kinds of measure on the network. We proposed a new measure to analyze the relationship between network nodes which was called graphlet interaction. The graphlet interaction contained 28 different isomers. The results showed that the numbers of the graphlet interaction isomers between disease genes in interactome networks were significantly larger than random picked genes, while graphlet signatures were not. Then, we designed a new type of score, based on the network properties, to identify disease genes using graphlet interaction. The genes with higher scores were more likely to be disease genes, and all candidate genes were ranked according to their scores. Then the approach was evaluated by leave-one-out cross-validation. The precision of the current approach achieved 90% at about 10% recall, which was apparently higher than the previous three predominant algorithms, random walk, Endeavour and neighborhood based method. Finally, the approach was applied to predict new disease genes related to 4 common diseases, most of which were identified by other independent experimental researches. In conclusion, we demonstrate that the graphlet interaction is an effective tool to analyze the network properties of disease genes, and the scores calculated by graphlet interaction is more precise in identifying disease genes. PMID:24465923

  4. Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases.

    PubMed

    Berger, Seth I; Posner, Jeremy M; Ma'ayan, Avi

    2007-10-04

    In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or interactions are predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.

  5. A statistical method for measuring activation of gene regulatory networks.

    PubMed

    Esteves, Gustavo H; Reis, Luiz F L

    2018-06-13

    Gene expression data analysis is of great importance for modern molecular biology, given our ability to measure the expression profiles of thousands of genes and enabling studies rooted in systems biology. In this work, we propose a simple statistical model for the activation measuring of gene regulatory networks, instead of the traditional gene co-expression networks. We present the mathematical construction of a statistical procedure for testing hypothesis regarding gene regulatory network activation. The real probability distribution for the test statistic is evaluated by a permutation based study. To illustrate the functionality of the proposed methodology, we also present a simple example based on a small hypothetical network and the activation measuring of two KEGG networks, both based on gene expression data collected from gastric and esophageal samples. The two KEGG networks were also analyzed for a public database, available through NCBI-GEO, presented as Supplementary Material. This method was implemented in an R package that is available at the BioConductor project website under the name maigesPack.

  6. A Consensus Network of Gene Regulatory Factors in the Human Frontal Lobe

    PubMed Central

    Berto, Stefano; Perdomo-Sabogal, Alvaro; Gerighausen, Daniel; Qin, Jing; Nowick, Katja

    2016-01-01

    Cognitive abilities, such as memory, learning, language, problem solving, and planning, involve the frontal lobe and other brain areas. Not much is known yet about the molecular basis of cognitive abilities, but it seems clear that cognitive abilities are determined by the interplay of many genes. One approach for analyzing the genetic networks involved in cognitive functions is to study the coexpression networks of genes with known importance for proper cognitive functions, such as genes that have been associated with cognitive disorders like intellectual disability (ID) or autism spectrum disorders (ASD). Because many of these genes are gene regulatory factors (GRFs) we aimed to provide insights into the gene regulatory networks active in the human frontal lobe. Using genome wide human frontal lobe expression data from 10 independent data sets, we first derived 10 individual coexpression networks for all GRFs including their potential target genes. We observed a high level of variability among these 10 independently derived networks, pointing out that relying on results from a single study can only provide limited biological insights. To instead focus on the most confident information from these 10 networks we developed a method for integrating such independently derived networks into a consensus network. This consensus network revealed robust GRF interactions that are conserved across the frontal lobes of different healthy human individuals. Within this network, we detected a strong central module that is enriched for 166 GRFs known to be involved in brain development and/or cognitive disorders. Interestingly, several hubs of the consensus network encode for GRFs that have not yet been associated with brain functions. Their central role in the network suggests them as excellent new candidates for playing an essential role in the regulatory network of the human frontal lobe, which should be investigated in future studies. PMID:27014338

  7. Characterization of new mutants in the early part of the yeast secretory pathway isolated by a (/sup 3/H)mannose suicide selection

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Newman, A.P.; Ferro-Novick, S.

    We have adapted a (/sup 3/H)mannose suicide selection to identify mutations in additional genes which function in the early part of the yeast secretory pathway. Thus far this protocol has led to the identification of two new genes which are implicated in this process, as well as additional alleles of previously identified genes. The new mutants, bet1 and bet2, are temperature sensitive for growth and protein transport. Thin section analysis has revealed the accumulation of a network of endoplasmic reticulum (ER) at the restrictive temperature (37/sup 0/C). Precursors of exported proteins that accumulate in the cell at 37/sup 0/C aremore » terminally core glycosylated. These observations suggest that the transport of precursors is blocked subsequent to translocation into the ER but before entry into the Golgi apparatus. The bet1 and bet2 mutants define two new complementation groups which have the same properties as previously identified ER-accumulating mutants. This and previous findings suggest that protein exit from the ER and entry into the Golgi apparatus is a complex process requiring at least 11 genes.« less

  8. Comparative transcriptome analysis reveals insights into the streamlined genomes of haplosclerid demosponges

    NASA Astrophysics Data System (ADS)

    Guzman, Christine; Conaco, Cecilia

    2016-01-01

    Sponges (Porifera) are one of the most ancestral metazoan groups. They are characterized by a simple body plan lacking the true tissues and organ systems found in other animals. Members of this phylum display a remarkable diversity of form and function and yet little is known about the composition and complexity of their genomes. In this study, we sequenced the transcriptomes of two marine haplosclerid sponges belonging to Demospongiae, the largest and most diverse class within phylum Porifera, and compared their gene content with members of other sponge classes. We recovered 44,693 and 50,067 transcripts expressed in adult tissues of Haliclona amboinensis and Haliclona tubifera, respectively. These transcripts translate into 20,280 peptides in H. amboinensis and 18,000 peptides in H. tubifera. Genes associated with important signaling and metabolic pathways, regulatory networks, as well as genes that may be important in the organismal stress response, were identified in the transcriptomes. Futhermore, lineage-specific innovations were identified that may be correlated with observed sponge characters and ecological adaptations. The core gene complement expressed within the tissues of adult haplosclerid demosponges may represent a streamlined and flexible genetic toolkit that underlies the ecological success and resilience of sponges to environmental stress.

  9. Core histone genes of Giardia intestinalis: genomic organization, promoter structure, and expression

    PubMed Central

    Yee, Janet; Tang, Anita; Lau, Wei-Ling; Ritter, Heather; Delport, Dewald; Page, Melissa; Adam, Rodney D; Müller, Miklós; Wu, Gang

    2007-01-01

    Background Giardia intestinalis is a protist found in freshwaters worldwide, and is the most common cause of parasitic diarrhea in humans. The phylogenetic position of this parasite is still much debated. Histones are small, highly conserved proteins that associate tightly with DNA to form chromatin within the nucleus. There are two classes of core histone genes in higher eukaryotes: DNA replication-independent histones and DNA replication-dependent ones. Results We identified two copies each of the core histone H2a, H2b and H3 genes, and three copies of the H4 gene, at separate locations on chromosomes 3, 4 and 5 within the genome of Giardia intestinalis, but no gene encoding a H1 linker histone could be recognized. The copies of each gene share extensive DNA sequence identities throughout their coding and 5' noncoding regions, which suggests these copies have arisen from relatively recent gene duplications or gene conversions. The transcription start sites are at triplet A sequences 1–27 nucleotides upstream of the translation start codon for each gene. We determined that a 50 bp region upstream from the start of the histone H4 coding region is the minimal promoter, and a highly conserved 15 bp sequence called the histone motif (him) is essential for its activity. The Giardia core histone genes are constitutively expressed at approximately equivalent levels and their mRNAs are polyadenylated. Competition gel-shift experiments suggest that a factor within the protein complex that binds him may also be a part of the protein complexes that bind other promoter elements described previously in Giardia. Conclusion In contrast to other eukaryotes, the Giardia genome has only a single class of core histone genes that encode replication-independent histones. Our inability to locate a gene encoding the linker histone H1 leads us to speculate that the H1 protein may not be required for the compaction of Giardia's small and gene-rich genome. PMID:17425802

  10. F-MAP: A Bayesian approach to infer the gene regulatory network using external hints

    PubMed Central

    Shahdoust, Maryam; Mahjub, Hossein; Sadeghi, Mehdi

    2017-01-01

    The Common topological features of related species gene regulatory networks suggest reconstruction of the network of one species by using the further information from gene expressions profile of related species. We present an algorithm to reconstruct the gene regulatory network named; F-MAP, which applies the knowledge about gene interactions from related species. Our algorithm sets a Bayesian framework to estimate the precision matrix of one species microarray gene expressions dataset to infer the Gaussian Graphical model of the network. The conjugate Wishart prior is used and the information from related species is applied to estimate the hyperparameters of the prior distribution by using the factor analysis. Applying the proposed algorithm on six related species of drosophila shows that the precision of reconstructed networks is improved considerably compared to the precision of networks constructed by other Bayesian approaches. PMID:28938012

  11. Episodic specificity induction impacts activity in a core brain network during construction of imagined future experiences

    PubMed Central

    Madore, Kevin P.; Szpunar, Karl K.; Addis, Donna Rose; Schacter, Daniel L.

    2016-01-01

    Recent behavioral work suggests that an episodic specificity induction—brief training in recollecting the details of a past experience—enhances performance on subsequent tasks that rely on episodic retrieval, including imagining future experiences, solving open-ended problems, and thinking creatively. Despite these far-reaching behavioral effects, nothing is known about the neural processes impacted by an episodic specificity induction. Related neuroimaging work has linked episodic retrieval with a core network of brain regions that supports imagining future experiences. We tested the hypothesis that key structures in this network are influenced by the specificity induction. Participants received the specificity induction or one of two control inductions and then generated future events and semantic object comparisons during fMRI scanning. After receiving the specificity induction compared with the control, participants exhibited significantly more activity in several core network regions during the construction of imagined events over object comparisons, including the left anterior hippocampus, right inferior parietal lobule, right posterior cingulate cortex, and right ventral precuneus. Induction-related differences in the episodic detail of imagined events significantly modulated induction-related differences in the construction of imagined events in the left anterior hippocampus and right inferior parietal lobule. Resting-state functional connectivity analyses with hippocampal and inferior parietal lobule seed regions and the rest of the brain also revealed significantly stronger core network coupling following the specificity induction compared with the control. These findings provide evidence that an episodic specificity induction selectively targets episodic processes that are commonly linked to key core network regions, including the hippocampus. PMID:27601666

  12. HGPEC: a Cytoscape app for prediction of novel disease-gene and disease-disease associations and evidence collection based on a random walk on heterogeneous network.

    PubMed

    Le, Duc-Hau; Pham, Van-Huy

    2017-06-15

    Finding gene-disease and disease-disease associations play important roles in the biomedical area and many prioritization methods have been proposed for this goal. Among them, approaches based on a heterogeneous network of genes and diseases are considered state-of-the-art ones, which achieve high prediction performance and can be used for diseases with/without known molecular basis. Here, we developed a Cytoscape app, namely HGPEC, based on a random walk with restart algorithm on a heterogeneous network of genes and diseases. This app can prioritize candidate genes and diseases by employing a heterogeneous network consisting of a network of genes/proteins and a phenotypic disease similarity network. Based on the rankings, novel disease-gene and disease-disease associations can be identified. These associations can be supported with network- and rank-based visualization as well as evidences and annotations from biomedical data. A case study on prediction of novel breast cancer-associated genes and diseases shows the abilities of HGPEC. In addition, we showed prominence in the performance of HGPEC compared to other tools for prioritization of candidate disease genes. Taken together, our app is expected to effectively predict novel disease-gene and disease-disease associations and support network- and rank-based visualization as well as biomedical evidences for such the associations.

  13. Molecular epidemiology of Epizootic haematopoietic necrosis virus (EHNV).

    PubMed

    Hick, Paul M; Subramaniam, Kuttichantran; Thompson, Patrick M; Waltzek, Thomas B; Becker, Joy A; Whittington, Richard J

    2017-11-01

    Low genetic diversity of Epizootic haematopoietic necrosis virus (EHNV) was determined for the complete genome of 16 isolates spanning the natural range of hosts, geography and time since the first outbreaks of disease. Genomes ranged from 125,591-127,487 nucleotides with 97.47% pairwise identity and 106-109 genes. All isolates shared 101 core genes with 121 potential genes predicted within the pan-genome of this collection. There was high conservation within 90,181 nucleotides of the core genes with isolates separated by average genetic distance of 3.43 × 10 -4 substitutions per site. Evolutionary analysis of the core genome strongly supported historical epidemiological evidence of iatrogenic spread of EHNV to naïve hosts and establishment of endemic status in discrete ecological niches. There was no evidence of structural genome reorganization, however, the complement of non-core genes and variation in repeat elements enabled fine scale molecular epidemiological investigation of this unpredictable pathogen of fish. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Machine Learning-Assisted Network Inference Approach to Identify a New Class of Genes that Coordinate the Functionality of Cancer Networks.

    PubMed

    Ghanat Bari, Mehrab; Ung, Choong Yong; Zhang, Cheng; Zhu, Shizhen; Li, Hu

    2017-08-01

    Emerging evidence indicates the existence of a new class of cancer genes that act as "signal linkers" coordinating oncogenic signals between mutated and differentially expressed genes. While frequently mutated oncogenes and differentially expressed genes, which we term Class I cancer genes, are readily detected by most analytical tools, the new class of cancer-related genes, i.e., Class II, escape detection because they are neither mutated nor differentially expressed. Given this hypothesis, we developed a Machine Learning-Assisted Network Inference (MALANI) algorithm, which assesses all genes regardless of expression or mutational status in the context of cancer etiology. We used 8807 expression arrays, corresponding to 9 cancer types, to build more than 2 × 10 8 Support Vector Machine (SVM) models for reconstructing a cancer network. We found that ~3% of ~19,000 not differentially expressed genes are Class II cancer gene candidates. Some Class II genes that we found, such as SLC19A1 and ATAD3B, have been recently reported to associate with cancer outcomes. To our knowledge, this is the first study that utilizes both machine learning and network biology approaches to uncover Class II cancer genes in coordinating functionality in cancer networks and will illuminate our understanding of how genes are modulated in a tissue-specific network contribute to tumorigenesis and therapy development.

  15. Leveraging multiple gene networks to prioritize GWAS candidate genes via network representation learning.

    PubMed

    Wu, Mengmeng; Zeng, Wanwen; Liu, Wenqiang; Lv, Hairong; Chen, Ting; Jiang, Rui

    2018-06-03

    Genome-wide association studies (GWAS) have successfully discovered a number of disease-associated genetic variants in the past decade, providing an unprecedented opportunity for deciphering genetic basis of human inherited diseases. However, it is still a challenging task to extract biological knowledge from the GWAS data, due to such issues as missing heritability and weak interpretability. Indeed, the fact that the majority of discovered loci fall into noncoding regions without clear links to genes has been preventing the characterization of their functions and appealing for a sophisticated approach to bridge genetic and genomic studies. Towards this problem, network-based prioritization of candidate genes, which performs integrated analysis of gene networks with GWAS data, has emerged as a promising direction and attracted much attention. However, most existing methods overlook the sparse and noisy properties of gene networks and thus may lead to suboptimal performance. Motivated by this understanding, we proposed a novel method called REGENT for integrating multiple gene networks with GWAS data to prioritize candidate genes for complex diseases. We leveraged a technique called the network representation learning to embed a gene network into a compact and robust feature space, and then designed a hierarchical statistical model to integrate features of multiple gene networks with GWAS data for the effective inference of genes associated with a disease of interest. We applied our method to six complex diseases and demonstrated the superior performance of REGENT over existing approaches in recovering known disease-associated genes. We further conducted a pathway analysis and showed that the ability of REGENT to discover disease-associated pathways. We expect to see applications of our method to a broad spectrum of diseases for post-GWAS analysis. REGENT is freely available at https://github.com/wmmthu/REGENT. Copyright © 2018 Elsevier Inc. All rights reserved.

  16. Spatial-spectral flexible optical networking: enabling switching solutions for a simplified and efficient SDM network platform

    NASA Astrophysics Data System (ADS)

    Tomkos, I.; Zakynthinos, P.; Klonidis, D.; Marom, D.; Sygletos, S.; Ellis, A.; Salvadori, E.; Siracusa, D.; Angelou, M.; Papastergiou, G.; Psaila, N.; Ferran, J. F.; Ben-Ezra, S.; Jimenez, F.; Fernández-Palacios, J. P.

    2013-12-01

    The traffic carried by core optical networks grows at a steady but remarkable pace of 30-40% year-over-year. Optical transmissions and networking advancements continue to satisfy the traffic requirements by delivering the content over the network infrastructure in a cost and energy efficient manner. Such core optical networks serve the information traffic demands in a dynamic way, in response to requirements for shifting of traffics demands, both temporally (day/night) and spatially (business district/residential). However as we are approaching fundamental spectral efficiency limits of singlemode fibers, the scientific community is pursuing recently the development of an innovative, all-optical network architecture introducing the spatial degree of freedom when designing/operating future transport networks. Spacedivision- multiplexing through the use of bundled single mode fibers, and/or multi-core fibers and/or few-mode fibers can offer up to 100-fold capacity increase in future optical networks. The EU INSPACE project is working on the development of a complete spatial-spectral flexible optical networking solution, offering the network ultra-high capacity, flexibility and energy efficiency required to meet the challenges of delivering exponentially growing traffic demands in the internet over the next twenty years. In this paper we will present the motivation and main research activities of the INSPACE consortium towards the realization of the overall project solution.

  17. Integration of biological networks and gene expression data using Cytoscape

    PubMed Central

    Cline, Melissa S; Smoot, Michael; Cerami, Ethan; Kuchinsky, Allan; Landys, Nerius; Workman, Chris; Christmas, Rowan; Avila-Campilo, Iliana; Creech, Michael; Gross, Benjamin; Hanspers, Kristina; Isserlin, Ruth; Kelley, Ryan; Killcoyne, Sarah; Lotia, Samad; Maere, Steven; Morris, John; Ono, Keiichiro; Pavlovic, Vuk; Pico, Alexander R; Vailaya, Aditya; Wang, Peng-Liang; Adler, Annette; Conklin, Bruce R; Hood, Leroy; Kuiper, Martin; Sander, Chris; Schmulevich, Ilya; Schwikowski, Benno; Warner, Guy J; Ideker, Trey; Bader, Gary D

    2013-01-01

    Cytoscape is a free software package for visualizing, modeling and analyzing molecular and genetic interaction networks. This protocol explains how to use Cytoscape to analyze the results of mRNA expression profiling, and other functional genomics and proteomics experiments, in the context of an interaction network obtained for genes of interest. Five major steps are described: (i) obtaining a gene or protein network, (ii) displaying the network using layout algorithms, (iii) integrating with gene expression and other functional attributes, (iv) identifying putative complexes and functional modules and (v) identifying enriched Gene Ontology annotations in the network. These steps provide a broad sample of the types of analyses performed by Cytoscape. PMID:17947979

  18. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.

    PubMed

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-03-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.

  19. Enhancing biological relevance of a weighted gene co-expression network for functional module identification.

    PubMed

    Prom-On, Santitham; Chanthaphan, Atthawut; Chan, Jonathan Hoyin; Meechai, Asawin

    2011-02-01

    Relationships among gene expression levels may be associated with the mechanisms of the disease. While identifying a direct association such as a difference in expression levels between case and control groups links genes to disease mechanisms, uncovering an indirect association in the form of a network structure may help reveal the underlying functional module associated with the disease under scrutiny. This paper presents a method to improve the biological relevance in functional module identification from the gene expression microarray data by enhancing the structure of a weighted gene co-expression network using minimum spanning tree. The enhanced network, which is called a backbone network, contains only the essential structural information to represent the gene co-expression network. The entire backbone network is decoupled into a number of coherent sub-networks, and then the functional modules are reconstructed from these sub-networks to ensure minimum redundancy. The method was tested with a simulated gene expression dataset and case-control expression datasets of autism spectrum disorder and colorectal cancer studies. The results indicate that the proposed method can accurately identify clusters in the simulated dataset, and the functional modules of the backbone network are more biologically relevant than those obtained from the original approach.

  20. Network-Based Integration of GWAS and Gene Expression Identifies a HOX-Centric Network Associated with Serous Ovarian Cancer Risk.

    PubMed

    Kar, Siddhartha P; Tyrer, Jonathan P; Li, Qiyuan; Lawrenson, Kate; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Chenevix-Trench, Georgia; Baker, Helen; Bandera, Elisa V; Bean, Yukie T; Beckmann, Matthias W; Berchuck, Andrew; Bisogna, Maria; Bjørge, Line; Bogdanova, Natalia; Brinton, Louise; Brooks-Wilson, Angela; Butzow, Ralf; Campbell, Ian; Carty, Karen; Chang-Claude, Jenny; Chen, Yian Ann; Chen, Zhihua; Cook, Linda S; Cramer, Daniel; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas F; Edwards, Robert P; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus K; Hosono, Satoyo; Iversen, Edwin S; Jakubowska, Anna; Paul, James; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kjaer, Susanne K; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph; Kiemeney, Lambertus A; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; McNeish, Iain A; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Nevanlinna, Heli; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schildkraut, Joellen M; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston-Campbell, Lara E; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S; van Altena, Anne M; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A; Monteiro, Alvaro N A; Freedman, Matthew L; Gayther, Simon A; Pharoah, Paul D P

    2015-10-01

    Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by coexpression may also be enriched for additional EOC risk associations. We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly coexpressed with each selected TF gene in the unified microarray dataset of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this dataset were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P < 0.05 and FDR < 0.05). These results were replicated (P < 0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network. We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development. Network analysis integrating large, context-specific datasets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization. ©2015 American Association for Cancer Research.

  1. Network-based integration of GWAS and gene expression identifies a HOX-centric network associated with serous ovarian cancer risk

    PubMed Central

    Kar, Siddhartha P.; Tyrer, Jonathan P.; Li, Qiyuan; Lawrenson, Kate; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Chenevix-Trench, Georgia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie T.; Beckmann, Matthias W.; Berchuck, Andrew; Bisogna, Maria; Bjørge, Line; Bogdanova, Natalia; Brinton, Louise; Brooks-Wilson, Angela; Butzow, Ralf; Campbell, Ian; Carty, Karen; Chang-Claude, Jenny; Chen, Yian Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas F.; Edwards, Robert P.; Ekici, Arif B.; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus K.; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Paul, James; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kjaer, Susanne K.; Kelemen, Linda E.; Kellar, Melissa; Kelley, Joseph; Kiemeney, Lambertus A.; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain A.; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Phelan, Catherine M.; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston-Campbell, Lara E.; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Monteiro, Alvaro N. A.; Freedman, Matthew L.; Gayther, Simon A.; Pharoah, Paul D. P.

    2015-01-01

    Background Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by co-expression may also be enriched for additional EOC risk associations. Methods We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly co-expressed with each selected TF gene in the unified microarray data set of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this data set were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). Results Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P<0.05 and FDR<0.05). These results were replicated (P<0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network. Conclusion We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development. Impact Network analysis integrating large, context-specific data sets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization. PMID:26209509

  2. Flexible, silver nanowire network nickel hydroxide core-shell electrodes for supercapacitors

    NASA Astrophysics Data System (ADS)

    Yuksel, Recep; Coskun, Sahin; Kalay, Yunus Eren; Unalan, Husnu Emrah

    2016-10-01

    We present a novel one-dimensional coaxial architecture composed of silver nanowire (Ag NW) network core and nickel hydroxide (Ni(OH)2) shell for the realization of coaxial nanocomposite electrode materials for supercapacitors. Ag NWs are formed conductive networks via spray coating onto polyethylene terephthalate (PET) substrates and Ni(OH)2 is gradually electrodeposited onto the Ag NW network to fabricate core-shell electrodes for supercapacitors. Synergy of highly conductive Ag NWs and high capacitive Ni(OH)2 facilitate ion and electron transport, enhance electrochemical properties and result in a specific capacitance of 1165.2 F g-1 at a current density of 3 A g-1. After 3000 cycles, fabricated nanocomposite electrodes show 93% capacity retention. The rational design explored in this study points out the potential of nanowire based coaxial energy storage devices.

  3. Rapid startup of thermophilic anaerobic digester to remove tetracycline and sulfonamides resistance genes from sewage sludge.

    PubMed

    Xu, Rui; Yang, Zhao-Hui; Wang, Qing-Peng; Bai, Yang; Liu, Jian-Bo; Zheng, Yue; Zhang, Yan-Ru; Xiong, Wei-Ping; Ahmad, Kito; Fan, Chang-Zheng

    2018-01-15

    Spread of antibiotic resistance genes (ARGs) originating from sewage sludge is highlighted as an eminent health threat. This study established a thermophilic anaerobic digester using one-step startup strategy to quickly remove tetracycline and sulfonamides resistance genes from sewage sludge. At least 20days were saved in the startup period from mesophilic to thermophilic condition. Based on the results of 16S rDNA amplicons sequencing and predicted metagenomic method, the successful startup largely relied on the fast colonization of core thermophilic microbial population (e.g. Firmicutes, Proteobacteria, Actinobacteria). Microbial metabolic gene pathways for substrate degradation and methane production was also increased by one-step mode. In addition, real-time quantitative PCR approach revealed that most targeted tetracycline and sulfonamides resistance genes ARGs (sulI, tetA, tetO, tetX) were substantially removed during thermophilic digestion (removal efficiency>80%). Network analysis showed that the elimination of ARGs was attributed to the decline of their horizontal (intI1 item) and vertical (potential hosts) transfer-related elements under high-temperature. This research demonstrated that rapid startup thermophilic anaerobic digestion of wastewater solids would be a suitable technology for reducing quantities of various ARGs. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. Consensus between Pipelines in Structural Brain Networks

    PubMed Central

    Parker, Christopher S.; Deligianni, Fani; Cardoso, M. Jorge; Daga, Pankaj; Modat, Marc; Dayan, Michael; Clark, Chris A.

    2014-01-01

    Structural brain networks may be reconstructed from diffusion MRI tractography data and have great potential to further our understanding of the topological organisation of brain structure in health and disease. Network reconstruction is complex and involves a series of processesing methods including anatomical parcellation, registration, fiber orientation estimation and whole-brain fiber tractography. Methodological choices at each stage can affect the anatomical accuracy and graph theoretical properties of the reconstructed networks, meaning applying different combinations in a network reconstruction pipeline may produce substantially different networks. Furthermore, the choice of which connections are considered important is unclear. In this study, we assessed the similarity between structural networks obtained using two independent state-of-the-art reconstruction pipelines. We aimed to quantify network similarity and identify the core connections emerging most robustly in both pipelines. Similarity of network connections was compared between pipelines employing different atlases by merging parcels to a common and equivalent node scale. We found a high agreement between the networks across a range of fiber density thresholds. In addition, we identified a robust core of highly connected regions coinciding with a peak in similarity across network density thresholds, and replicated these results with atlases at different node scales. The binary network properties of these core connections were similar between pipelines but showed some differences in atlases across node scales. This study demonstrates the utility of applying multiple structural network reconstrution pipelines to diffusion data in order to identify the most important connections for further study. PMID:25356977

  5. Systematic Evaluation of Molecular Networks for Discovery of Disease Genes.

    PubMed

    Huang, Justin K; Carlin, Daniel E; Yu, Michael Ku; Zhang, Wei; Kreisberg, Jason F; Tamayo, Pablo; Ideker, Trey

    2018-04-25

    Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall. A general tendency is that performance scales with network size, suggesting that new interaction discovery currently outweighs the detrimental effects of false positives. Correcting for size, we find that the DIP network provides the highest efficiency (value per interaction). Based on these results, we create a parsimonious composite network with both high efficiency and performance. This work provides a benchmark for selection of molecular networks in human disease research. Copyright © 2018 Elsevier Inc. All rights reserved.

  6. cGRNB: a web server for building combinatorial gene regulatory networks through integrated engineering of seed-matching sequence information and gene expression datasets.

    PubMed

    Xu, Huayong; Yu, Hui; Tu, Kang; Shi, Qianqian; Wei, Chaochun; Li, Yuan-Yuan; Li, Yi-Xue

    2013-01-01

    We are witnessing rapid progress in the development of methodologies for building the combinatorial gene regulatory networks involving both TFs (Transcription Factors) and miRNAs (microRNAs). There are a few tools available to do these jobs but most of them are not easy to use and not accessible online. A web server is especially needed in order to allow users to upload experimental expression datasets and build combinatorial regulatory networks corresponding to their particular contexts. In this work, we compiled putative TF-gene, miRNA-gene and TF-miRNA regulatory relationships from forward-engineering pipelines and curated them as built-in data libraries. We streamlined the R codes of our two separate forward-and-reverse engineering algorithms for combinatorial gene regulatory network construction and formalized them as two major functional modules. As a result, we released the cGRNB (combinatorial Gene Regulatory Networks Builder): a web server for constructing combinatorial gene regulatory networks through integrated engineering of seed-matching sequence information and gene expression datasets. The cGRNB enables two major network-building modules, one for MPGE (miRNA-perturbed gene expression) datasets and the other for parallel miRNA/mRNA expression datasets. A miRNA-centered two-layer combinatorial regulatory cascade is the output of the first module and a comprehensive genome-wide network involving all three types of combinatorial regulations (TF-gene, TF-miRNA, and miRNA-gene) are the output of the second module. In this article we propose cGRNB, a web server for building combinatorial gene regulatory networks through integrated engineering of seed-matching sequence information and gene expression datasets. Since parallel miRNA/mRNA expression datasets are rapidly accumulated by the advance of next-generation sequencing techniques, cGRNB will be very useful tool for researchers to build combinatorial gene regulatory networks based on expression datasets. The cGRNB web-server is free and available online at http://www.scbit.org/cgrnb.

  7. Reveal genes functionally associated with ACADS by a network study.

    PubMed

    Chen, Yulong; Su, Zhiguang

    2015-09-15

    Establishing a systematic network is aimed at finding essential human gene-gene/gene-disease pathway by means of network inter-connecting patterns and functional annotation analysis. In the present study, we have analyzed functional gene interactions of short-chain acyl-coenzyme A dehydrogenase gene (ACADS). ACADS plays a vital role in free fatty acid β-oxidation and regulates energy homeostasis. Modules of highly inter-connected genes in disease-specific ACADS network are derived by integrating gene function and protein interaction data. Among the 8 genes in ACADS web retrieved from both STRING and GeneMANIA, ACADS is effectively conjoined with 4 genes including HAHDA, HADHB, ECHS1 and ACAT1. The functional analysis is done via ontological briefing and candidate disease identification. We observed that the highly efficient-interlinked genes connected with ACADS are HAHDA, HADHB, ECHS1 and ACAT1. Interestingly, the ontological aspect of genes in the ACADS network reveals that ACADS, HAHDA and HADHB play equally vital roles in fatty acid metabolism. The gene ACAT1 together with ACADS indulges in ketone metabolism. Our computational gene web analysis also predicts potential candidate disease recognition, thus indicating the involvement of ACADS, HAHDA, HADHB, ECHS1 and ACAT1 not only with lipid metabolism but also with infant death syndrome, skeletal myopathy, acute hepatic encephalopathy, Reye-like syndrome, episodic ketosis, and metabolic acidosis. The current study presents a comprehensible layout of ACADS network, its functional strategies and candidate disease approach associated with ACADS network. Copyright © 2015 Elsevier B.V. All rights reserved.

  8. Seven-core multicore fiber transmissions for passive optical network.

    PubMed

    Zhu, B; Taunay, T F; Yan, M F; Fini, J M; Fishteyn, M; Monberg, E M; Dimarcello, F V

    2010-05-24

    We design and fabricate a novel multicore fiber (MCF), with seven cores arranged in a hexagonal array. The fiber properties of MCF including low crosstalk, attenuation and splice loss are described. A new tapered MCF connector (TMC), showing ultra-low crosstalk and losses, is also designed and fabricated for coupling the individual signals in-and-out of the MCF. We further propose a novel network configuration using parallel transmissions with the MCF and TMC for passive optical network (PON). To the best of our knowledge, we demonstrate the first bi-directional parallel transmissions of 1310 nm and 1490 nm signals over 11.3-km of seven-core MCF with 64-way splitter for PON.

  9. Quantum key distribution in multicore fibre for secure radio access networks

    NASA Astrophysics Data System (ADS)

    Llorente, Roberto; Provot, Antoine; Morant, Maria

    2018-01-01

    Broadband access in optical domain usually focuses in providing a pervasive cost-effective high bitrate communication in a given area. Nowadays, it is of utmost interest also to be able to provide a secure communication to the costumers in the area. Wireless access networks rely on optical domain for both fronthaul and backhaul of the radio access network (C-RAN). Multicore fiber (MCF) has been proposed as a promising candidate for the optical media of choice in nextgeneration wireless. The capacity demand of next-generation 5G networks makes interesting the use of high-capacity optical solutions as space-division multiplexing of different signals over MCF media. This work addresses secure MCF communication supporting C-RAN architectures. The paper proposes the use of one core in the MCF to transport securely an optical quantum key encoding altogether with end-to-end wireless signal transmitted in the remaining cores in radio-over-fiber (RoF). The RoF wireless signals are suitable for radio access fronthaul and backhaul. The theoretical principle and simulation analysis of quantum key distribution (QKD) are presented in this paper. The potential impact of optical RoF transmission crosstalk impairments is assessed experimentally considering different cellular signals on the remaining optical cores in the MCF. The experimental results report fronthaul performance over a four-core optical fiber with RoF transmission of full-standard CDMA signals providing 3.5G services in one core, HSPA+ signals providing 3.9G services in the second core and 3GPP LTEAdvanced signals providing 4G services in the third core, considering that the QKD signal is allocated in the fourth core.

  10. WGCNA: an R package for weighted correlation network analysis.

    PubMed

    Langfelder, Peter; Horvath, Steve

    2008-12-29

    Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e.g. cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial. The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings. The WGCNA package provides R functions for weighted correlation network analysis, e.g. co-expression network analysis of gene expression data. The R package along with its source code and additional material are freely available at http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/Rpackages/WGCNA.

  11. Identification and characterization of the autophagy-related genes Atg12 and Atg5 in hydra.

    PubMed

    Dixit, Nishikant S; Shravage, Bhupendra V; Ghaskadbi, Surendra

    2017-01-01

    Autophagy is an evolutionarily conserved process in eukaryotic cells that is involved in the degradation of cytoplasmic contents including organelles via the lysosome. Hydra is an early metazoan which exhibits simple tissue grade organization, a primitive nervous system, and is one of the classical non-bilaterian models extensively used in evo-devo research. Here, we describe the characterization of two core autophagy genes, Atg12 and Atg5, from hydra. In silico analyses including sequence similarity, domain analysis, and phylogenetic analysis demonstrate the conservation of these genes across eukaryotes. The predicted 3D structure of hydra Atg12 showed very little variance when compared to human Atg12 and yeast Atg12, whereas the hydra Atg5 predicted 3D structure was found to be variable, when compared with its human and yeast homologs. Strikingly, whole mount in situ hybridization showed high expression of Atg12 transcripts specifically in nematoblasts, whereas Atg5 transcripts were found to be expressed strongly in budding region and growing buds. This study may provide a framework to understand the evolution of autophagy networks in higher eukaryotes.

  12. An intersection network based on combining SNP co-association and RNA co-expression networks for feed utilization traits in Japanese Black cattle.

    PubMed

    Okada, D; Endo, S; Matsuda, H; Ogawa, S; Taniguchi, Y; Katsuta, T; Watanabe, T; Iwaisaki, H

    2018-05-12

    Genome-wide association studies (GWAS) of quantitative traits have detected numerous genetic associations, but they encounter difficulties in pinpointing prominent candidate genes and inferring gene networks. The present study used a systems genetics approach integrating GWAS results with external RNA-expression data to detect candidate gene networks in feed utilization and growth traits of Japanese Black cattle, which are matters of concern. A SNP co-association network was derived from significant correlations between SNPs with effects estimated by GWAS across seven phenotypic traits. The resulting network genes contained significant numbers of annotations related to the traits. Using bovine transcriptome data from a public database, an RNA co-expression network was inferred based on the similarity of expression patterns across different tissues. An intersection network was then generated by superimposing the SNP and RNA networks and extracting shared interactions. This intersection network contained four tissue-specific modules: nervous system, reproductive system, muscular system, and glands. To characterize the structure (topographical properties) of the three networks, their scale-free properties were evaluated, which revealed that the intersection network was the most scale-free. In the sub-network containing the most connected transcription factors (URI1, ROCK2 and ETV6), most genes were widely expressed across tissues, and genes previously shown to be involved in the traits were found. Results indicated that the current approach might be used to construct a gene network that better reflects biological information, providing encouragement for the genetic dissection of economically important quantitative traits.

  13. NASA's Next Generation Space Geodesy Network

    NASA Technical Reports Server (NTRS)

    Desai, S. D.; Gross, R. S.; Hilliard, L.; Lemoine, F. G.; Long, J. L.; Ma, C.; McGarry, J. F.; Merkowitz, S. M.; Murphy, D.; Noll, C. E.; hide

    2012-01-01

    NASA's Space Geodesy Project (SGP) is developing a prototype core site for a next generation Space Geodetic Network (SGN). Each of the sites in this planned network co-locate current state-of-the-art stations from all four space geodetic observing systems, GNSS, SLR, VLBI, and DORIS, with the goal of achieving modern requirements for the International Terrestrial Reference Frame (ITRF). In particular, the driving ITRF requirements for this network are 1.0 mm in accuracy and 0.1 mm/yr in stability, a factor of 10-20 beyond current capabilities. Development of the prototype core site, located at NASA's Geophysical and Astronomical Observatory at the Goddard Space Flight Center, started in 2011 and will be completed by the end of 2013. In January 2012, two operational GNSS stations, GODS and GOON, were established at the prototype site within 100 m of each other. Both stations are being proposed for inclusion into the IGS network. In addition, work is underway for the inclusion of next generation SLR and VLBI stations along with a modern DORIS station. An automated survey system is being developed to measure inter-technique vectorties, and network design studies are being performed to define the appropriate number and distribution of these next generation space geodetic core sites that are required to achieve the driving ITRF requirements. We present the status of this prototype next generation space geodetic core site, results from the analysis of data from the established geodetic stations, and results from the ongoing network design studies.

  14. Methods of information geometry in computational system biology (consistency between chemical and biological evolution).

    PubMed

    Astakhov, Vadim

    2009-01-01

    Interest in simulation of large-scale metabolic networks, species development, and genesis of various diseases requires new simulation techniques to accommodate the high complexity of realistic biological networks. Information geometry and topological formalisms are proposed to analyze information processes. We analyze the complexity of large-scale biological networks as well as transition of the system functionality due to modification in the system architecture, system environment, and system components. The dynamic core model is developed. The term dynamic core is used to define a set of causally related network functions. Delocalization of dynamic core model provides a mathematical formalism to analyze migration of specific functions in biosystems which undergo structure transition induced by the environment. The term delocalization is used to describe these processes of migration. We constructed a holographic model with self-poetic dynamic cores which preserves functional properties under those transitions. Topological constraints such as Ricci flow and Pfaff dimension were found for statistical manifolds which represent biological networks. These constraints can provide insight on processes of degeneration and recovery which take place in large-scale networks. We would like to suggest that therapies which are able to effectively implement estimated constraints, will successfully adjust biological systems and recover altered functionality. Also, we mathematically formulate the hypothesis that there is a direct consistency between biological and chemical evolution. Any set of causal relations within a biological network has its dual reimplementation in the chemistry of the system environment.

  15. Combinatorial optimization games

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Deng, X.; Ibaraki, Toshihide; Nagamochi, Hiroshi

    1997-06-01

    We introduce a general integer programming formulation for a class of combinatorial optimization games, which immediately allows us to improve the algorithmic result for finding amputations in the core (an important solution concept in cooperative game theory) of the network flow game on simple networks by Kalai and Zemel. An interesting result is a general theorem that the core for this class of games is nonempty if and only if a related linear program has an integer optimal solution. We study the properties for this mathematical condition to hold for several interesting problems, and apply them to resolve algorithmic andmore » complexity issues for their cores along the line as put forward in: decide whether the core is empty; if the core is empty, find an imputation in the core; given an imputation x, test whether x is in the core. We also explore the properties of totally balanced games in this succinct formulation of cooperative games.« less

  16. Gene transcripts selectively down-regulated in the shell of the nucleus accumbens long after heroin self-administration are up-regulated in the core independent of response contingency.

    PubMed

    Jacobs, Edwin H; de Vries, Taco J; Smit, August B; Schoffelmeer, Anton N M

    2004-01-01

    Long-term drug-induced alterations in neurotransmission within the nucleus accumbens (NAc) shell and core may underlie relapse to drug-seeking behavior and drug-taking upon re-exposure to drugs and drug-associated stimuli (cues) during abstinence. Using an open screening strategy, we recently identified 25 gene transcripts, encoding for proteins involved in neuronal functioning and structure that are down-regulated in rat NAc shell after contingent (active), but not after non-contingent (passive), heroin administration. Studying the expression of the same transcripts in the NAc core by means of quantitative PCR, we now demonstrate that most of these transcripts are up-regulated in that NAc subregion long (3 weeks) after heroin self-administration in rats. A similar up-regulation in gene expression was also apparent in the NAc core of animals with a history of non-contingent heroin administration (yoked controls). These data indicate that heroin self-administration differentially regulates genes in the NAc core as compared with the shell. Moreover, whereas cognitive processes involved in active drug self-administration (e.g., instrumental learning) seems to direct gene expression in the NAc shell, neuroplasticity in the NAc core may be due to the pharmacological effects of heroin (including Pavlovian conditioning), as expressed in rats upon contingent as well as non-contingent administration of heroin.

  17. Eukaryotic genomes may exhibit up to 10 generic classes of gene promoters.

    PubMed

    Gagniuc, Paul; Ionescu-Tirgoviste, Constantin

    2012-09-28

    The main function of gene promoters appears to be the integration of different gene products in their biological pathways in order to maintain homeostasis. Generally, promoters have been classified in two major classes, namely TATA and CpG. Nevertheless, many genes using the same combinatorial formation of transcription factors have different gene expression patterns. Accordingly, we tried to ask ourselves some fundamental questions: Why certain genes have an overall predisposition for higher gene expression levels than others? What causes such a predisposition? Is there a structural relationship of these sequences in different tissues? Is there a strong phylogenetic relationship between promoters of closely related species? In order to gain valuable insights into different promoter regions, we obtained a series of image-based patterns which allowed us to identify 10 generic classes of promoters. A comprehensive analysis was undertaken for promoter sequences from Arabidopsis thaliana, Drosophila melanogaster, Homo sapiens and Oryza sativa, and a more extensive analysis of tissue-specific promoters in humans. We observed a clear preference for these species to use certain classes of promoters for specific biological processes. Moreover, in humans, we found that different tissues use distinct classes of promoters, reflecting an emerging promoter network. Depending on the tissue type, comparisons made between these classes of promoters reveal a complementarity between their patterns whereas some other classes of promoters have been observed to occur in competition. Furthermore, we also noticed the existence of some transitional states between these classes of promoters that may explain certain evolutionary mechanisms, which suggest a possible predisposition for specific levels of gene expression and perhaps for a different number of factors responsible for triggering gene expression. Our conclusions are based on comprehensive data from three different databases and a new computer model whose core is using Kappa index of coincidence. To fully understand the connections between gene promoters and gene expression, we analyzed thousands of promoter sequences using our Kappa Index of Coincidence method and a specialized Optical Character Recognition (OCR) neural network. Under our criteria, 10 classes of promoters were detected. In addition, the existence of "transitional" promoters suggests that there is an evolutionary weighted continuum between classes, depending perhaps upon changes in their gene products.

  18. Gene Expression Network Reconstruction by Convex Feature Selection when Incorporating Genetic Perturbations

    PubMed Central

    Logsdon, Benjamin A.; Mezey, Jason

    2010-01-01

    Cellular gene expression measurements contain regulatory information that can be used to discover novel network relationships. Here, we present a new algorithm for network reconstruction powered by the adaptive lasso, a theoretically and empirically well-behaved method for selecting the regulatory features of a network. Any algorithms designed for network discovery that make use of directed probabilistic graphs require perturbations, produced by either experiments or naturally occurring genetic variation, to successfully infer unique regulatory relationships from gene expression data. Our approach makes use of appropriately selected cis-expression Quantitative Trait Loci (cis-eQTL), which provide a sufficient set of independent perturbations for maximum network resolution. We compare the performance of our network reconstruction algorithm to four other approaches: the PC-algorithm, QTLnet, the QDG algorithm, and the NEO algorithm, all of which have been used to reconstruct directed networks among phenotypes leveraging QTL. We show that the adaptive lasso can outperform these algorithms for networks of ten genes and ten cis-eQTL, and is competitive with the QDG algorithm for networks with thirty genes and thirty cis-eQTL, with rich topologies and hundreds of samples. Using this novel approach, we identify unique sets of directed relationships in Saccharomyces cerevisiae when analyzing genome-wide gene expression data for an intercross between a wild strain and a lab strain. We recover novel putative network relationships between a tyrosine biosynthesis gene (TYR1), and genes involved in endocytosis (RCY1), the spindle checkpoint (BUB2), sulfonate catabolism (JLP1), and cell-cell communication (PRM7). Our algorithm provides a synthesis of feature selection methods and graphical model theory that has the potential to reveal new directed regulatory relationships from the analysis of population level genetic and gene expression data. PMID:21152011

  19. GFD-Net: A novel semantic similarity methodology for the analysis of gene networks.

    PubMed

    Díaz-Montaña, Juan J; Díaz-Díaz, Norberto; Gómez-Vela, Francisco

    2017-04-01

    Since the popularization of biological network inference methods, it has become crucial to create methods to validate the resulting models. Here we present GFD-Net, the first methodology that applies the concept of semantic similarity to gene network analysis. GFD-Net combines the concept of semantic similarity with the use of gene network topology to analyze the functional dissimilarity of gene networks based on Gene Ontology (GO). The main innovation of GFD-Net lies in the way that semantic similarity is used to analyze gene networks taking into account the network topology. GFD-Net selects a functionality for each gene (specified by a GO term), weights each edge according to the dissimilarity between the nodes at its ends and calculates a quantitative measure of the network functional dissimilarity, i.e. a quantitative value of the degree of dissimilarity between the connected genes. The robustness of GFD-Net as a gene network validation tool was demonstrated by performing a ROC analysis on several network repositories. Furthermore, a well-known network was analyzed showing that GFD-Net can also be used to infer knowledge. The relevance of GFD-Net becomes more evident in Section "GFD-Net applied to the study of human diseases" where an example of how GFD-Net can be applied to the study of human diseases is presented. GFD-Net is available as an open-source Cytoscape app which offers a user-friendly interface to configure and execute the algorithm as well as the ability to visualize and interact with the results(http://apps.cytoscape.org/apps/gfdnet). Copyright © 2017 Elsevier Inc. All rights reserved.

  20. The Sheffield RNAi Screening Facility (SRSF): portfolio growth and technology development.

    PubMed

    Brown, Stephen

    2014-05-01

    The Sheffield RNAi Screening Facility (SRSF) (www.rnai.group.shef.ac.uk) was established in 2008 with Wellcome Trust and University of Sheffield funding, with the task to provide the first UK RNAi screening resource for academic groups interested in identifying genes required in a diverse range of biological processes using Drosophila cell culture. The SRSF has carried out a wide range of screens varying in sizes from bespoke small-scale libraries, targeting a few hundred genes, to high-throughput, genome-wide studies. The SRSF has grown and improved with a dedicated partnership of its academic customers based mainly in the UK. We are part of the UK Academics Functional Genomics Network, participating in organizing an annual meeting in London and are part of the University of Sheffield's D3N (www.d3n.org.uk), connecting academics, biotech and pharmaceutical companies with a multidisciplinary network in Drug Discovery and Development. Recently, the SRSF has been funded by the Yorkshire Cancer Research Fund to perform genome-wide RNAi screens using human cells as part of a core facility for regional Yorkshire Universities and screens are now underway. Overall the SRSF has carried out more than 40 screens from Drosophila and human cell culture experiments.

  1. A novel method to identify pathways associated with renal cell carcinoma based on a gene co-expression network

    PubMed Central

    RUAN, XIYUN; LI, HONGYUN; LIU, BO; CHEN, JIE; ZHANG, SHIBAO; SUN, ZEQIANG; LIU, SHUANGQING; SUN, FAHAI; LIU, QINGYONG

    2015-01-01

    The aim of the present study was to develop a novel method for identifying pathways associated with renal cell carcinoma (RCC) based on a gene co-expression network. A framework was established where a co-expression network was derived from the database as well as various co-expression approaches. First, the backbone of the network based on differentially expressed (DE) genes between RCC patients and normal controls was constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database. The differentially co-expressed links were detected by Pearson’s correlation, the empirical Bayesian (EB) approach and Weighted Gene Co-expression Network Analysis (WGCNA). The co-expressed gene pairs were merged by a rank-based algorithm. We obtained 842; 371; 2,883 and 1,595 co-expressed gene pairs from the co-expression networks of the STRING database, Pearson’s correlation EB method and WGCNA, respectively. Two hundred and eighty-one differentially co-expressed (DC) gene pairs were obtained from the merged network using this novel method. Pathway enrichment analysis based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and the network enrichment analysis (NEA) method were performed to verify feasibility of the merged method. Results of the KEGG and NEA pathway analyses showed that the network was associated with RCC. The suggested method was computationally efficient to identify pathways associated with RCC and has been identified as a useful complement to traditional co-expression analysis. PMID:26058425

  2. Predictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks

    PubMed Central

    Marbach, Daniel; Roy, Sushmita; Ay, Ferhat; Meyer, Patrick E.; Candeias, Rogerio; Kahveci, Tamer; Bristow, Christopher A.; Kellis, Manolis

    2012-01-01

    Gaining insights on gene regulation from large-scale functional data sets is a grand challenge in systems biology. In this article, we develop and apply methods for transcriptional regulatory network inference from diverse functional genomics data sets and demonstrate their value for gene function and gene expression prediction. We formulate the network inference problem in a machine-learning framework and use both supervised and unsupervised methods to predict regulatory edges by integrating transcription factor (TF) binding, evolutionarily conserved sequence motifs, gene expression, and chromatin modification data sets as input features. Applying these methods to Drosophila melanogaster, we predict ∼300,000 regulatory edges in a network of ∼600 TFs and 12,000 target genes. We validate our predictions using known regulatory interactions, gene functional annotations, tissue-specific expression, protein–protein interactions, and three-dimensional maps of chromosome conformation. We use the inferred network to identify putative functions for hundreds of previously uncharacterized genes, including many in nervous system development, which are independently confirmed based on their tissue-specific expression patterns. Last, we use the regulatory network to predict target gene expression levels as a function of TF expression, and find significantly higher predictive power for integrative networks than for motif or ChIP-based networks. Our work reveals the complementarity between physical evidence of regulatory interactions (TF binding, motif conservation) and functional evidence (coordinated expression or chromatin patterns) and demonstrates the power of data integration for network inference and studies of gene regulation at the systems level. PMID:22456606

  3. Integrative Genomics Reveals Novel Molecular Pathways and Gene Networks for Coronary Artery Disease

    PubMed Central

    Mäkinen, Ville-Petteri; Civelek, Mete; Meng, Qingying; Zhang, Bin; Zhu, Jun; Levian, Candace; Huan, Tianxiao; Segrè, Ayellet V.; Ghosh, Sujoy; Vivar, Juan; Nikpay, Majid; Stewart, Alexandre F. R.; Nelson, Christopher P.; Willenborg, Christina; Erdmann, Jeanette; Blakenberg, Stefan; O'Donnell, Christopher J.; März, Winfried; Laaksonen, Reijo; Epstein, Stephen E.; Kathiresan, Sekar; Shah, Svati H.; Hazen, Stanley L.; Reilly, Muredach P.; Lusis, Aldons J.; Samani, Nilesh J.; Schunkert, Heribert; Quertermous, Thomas; McPherson, Ruth; Yang, Xia; Assimes, Themistocles L.

    2014-01-01

    The majority of the heritability of coronary artery disease (CAD) remains unexplained, despite recent successes of genome-wide association studies (GWAS) in identifying novel susceptibility loci. Integrating functional genomic data from a variety of sources with a large-scale meta-analysis of CAD GWAS may facilitate the identification of novel biological processes and genes involved in CAD, as well as clarify the causal relationships of established processes. Towards this end, we integrated 14 GWAS from the CARDIoGRAM Consortium and two additional GWAS from the Ottawa Heart Institute (25,491 cases and 66,819 controls) with 1) genetics of gene expression studies of CAD-relevant tissues in humans, 2) metabolic and signaling pathways from public databases, and 3) data-driven, tissue-specific gene networks from a multitude of human and mouse experiments. We not only detected CAD-associated gene networks of lipid metabolism, coagulation, immunity, and additional networks with no clear functional annotation, but also revealed key driver genes for each CAD network based on the topology of the gene regulatory networks. In particular, we found a gene network involved in antigen processing to be strongly associated with CAD. The key driver genes of this network included glyoxalase I (GLO1) and peptidylprolyl isomerase I (PPIL1), which we verified as regulatory by siRNA experiments in human aortic endothelial cells. Our results suggest genetic influences on a diverse set of both known and novel biological processes that contribute to CAD risk. The key driver genes for these networks highlight potential novel targets for further mechanistic studies and therapeutic interventions. PMID:25033284

  4. Differential Engagement of Brain Regions within a "Core" Network during Scene Construction

    ERIC Educational Resources Information Center

    Summerfield, Jennifer J.; Hassabis, Demis; Maguire, Eleanor A.

    2010-01-01

    Reliving past events and imagining potential future events engages a well-established "core" network of brain areas. How the brain constructs, or reconstructs, these experiences or scenes has been debated extensively in the literature, but remains poorly understood. Here we designed a novel task to investigate this (re)constructive process by…

  5. A palladium-doped ceria@carbon core-sheath nanowire network: a promising catalyst support for alcohol electrooxidation reactions.

    PubMed

    Tan, Qiang; Du, Chunyu; Sun, Yongrong; Du, Lei; Yin, Geping; Gao, Yunzhi

    2015-08-28

    A novel palladium-doped ceria and carbon core-sheath nanowire network (Pd-CeO2@C CSNWN) is synthesized by a template-free and surfactant-free solvothermal process, followed by high temperature carbonization. This hierarchical network serves as a new class of catalyst support to enhance the activity and durability of noble metal catalysts for alcohol oxidation reactions. Its supported Pd nanoparticles, Pd/(Pd-CeO2@C CSNWN), exhibit >9 fold increase in activity toward the ethanol oxidation over the state-of-the-art Pd/C catalyst, which is the highest among the reported Pd systems. Moreover, stability tests show a virtually unchanged activity after 1000 cycles. The high activity is mainly attributed to the superior oxygen-species releasing capability of Pd-doped CeO2 nanowires by accelerating the removal of the poisoning intermediate. The unique interconnected one-dimensional core-sheath structure is revealed to facilitate immobilization of the metal catalysts, leading to the improved durability. This core-sheath nanowire network opens up a new strategy for catalyst performance optimization for next-generation fuel cells.

  6. A family of selfish minicircular chromosomes with jumbled chloroplast gene fragments from a dinoflagellate.

    PubMed

    Zhang, Z; Cavalier-Smith, T; Green, B R

    2001-08-01

    Chloroplast genes of several dinoflagellate species are located on unigenic DNA minicircular chromosomes. We have now completely sequenced five aberrant minicircular chromosomes from the dinoflagellate Heterocapsa triquetra. These probably nonfunctional DNA circles lack complete genes, with each being composed of several short fragments of two or three different chloroplast genes and a common conserved region with a tripartite 9G-9A-9G core like the putative replicon origin of functional single-gene circular chloroplast chromosomes. Their sequences imply that all five circles evolved by differential deletions and duplications from common ancestral circles bearing fragments of four genes: psbA, psbC, 16S rRNA, and 23S rRNA. It appears that recombination between separate unigenic chromosomes initially gave intermediate heterodimers, which were subsequently stabilized by deletions that included part or all of one putative replicon origin. We suggest that homologous recombination at the 9G-9A-9G core regions produced a psbA/psbC heterodimer which generated two distinct chimeric circles by differential deletions and duplications. A 23S/16S rRNA heterodimer more likely formed by illegitimate recombination between 16S and 23S rRNA genes. Homologous recombination between the 9G-9A-9G core regions of both heterodimers and additional differential deletions and duplications could then have yielded the other three circles. Near identity of the gene fragments and 9G-9A-9G cores, despite diverging adjacent regions, may be maintained by gene conversion. The conserved organization of the 9G-9A-9G cores alone favors the idea that they are replicon origins and suggests that they may enable the aberrant minicircles to parasitize the chloroplast's replication machinery as selfish circles.

  7. Anti-Sigma Factors in E. coli: Common Regulatory Mechanisms Controlling Sigma Factors Availability

    PubMed Central

    Treviño-Quintanilla, Luis Gerardo; Freyre-González, Julio Augusto; Martínez-Flores, Irma

    2013-01-01

    In bacteria, transcriptional regulation is a key step in cellular gene expression. All bacteria contain a core RNA polymerase that is catalytically competent but requires an additional σ factor for specific promoter recognition and correct transcriptional initiation. The RNAP core is not able to selectively bind to a given σ factor. In contrast, different σ factors have different affinities for the RNAP core. As a consequence, the concentration of alternate σ factors requires strict regulation in order to properly control the delicate interplay among them, which favors the competence for the RNAP core. This control is archived by different σ/anti-σ controlling mechanisms that shape complex regulatory networks and cascades, and enable the response to sudden environmental cues, whose global understanding is a current challenge for systems biology. Although there have been a number of excellent studies on each of these σ/anti-σ post-transcriptional regulatory systems, no comprehensive comparison of these mechanisms in a single model organism has been conducted. Here, we survey all these systems in E. coli dissecting and analyzing their inner workings and highlightin their differences. Then, following an integral approach, we identify their commonalities and outline some of the principles exploited by the cell to effectively and globally reprogram the transcriptional machinery. These principles provide guidelines for developing biological synthetic circuits enabling an efficient and robust response to sudden stimuli. PMID:24396271

  8. Maximizing capture of gene co-expression relationships through pre-clustering of input expression samples: an Arabidopsis case study.

    PubMed

    Feltus, F Alex; Ficklin, Stephen P; Gibson, Scott M; Smith, Melissa C

    2013-06-05

    In genomics, highly relevant gene interaction (co-expression) networks have been constructed by finding significant pair-wise correlations between genes in expression datasets. These networks are then mined to elucidate biological function at the polygenic level. In some cases networks may be constructed from input samples that measure gene expression under a variety of different conditions, such as for different genotypes, environments, disease states and tissues. When large sets of samples are obtained from public repositories it is often unmanageable to associate samples into condition-specific groups, and combining samples from various conditions has a negative effect on network size. A fixed significance threshold is often applied also limiting the size of the final network. Therefore, we propose pre-clustering of input expression samples to approximate condition-specific grouping of samples and individual network construction of each group as a means for dynamic significance thresholding. The net effect is increase sensitivity thus maximizing the total co-expression relationships in the final co-expression network compendium. A total of 86 Arabidopsis thaliana co-expression networks were constructed after k-means partitioning of 7,105 publicly available ATH1 Affymetrix microarray samples. We term each pre-sorted network a Gene Interaction Layer (GIL). Random Matrix Theory (RMT), an un-supervised thresholding method, was used to threshold each of the 86 networks independently, effectively providing a dynamic (non-global) threshold for the network. The overall gene count across all GILs reached 19,588 genes (94.7% measured gene coverage) and 558,022 unique co-expression relationships. In comparison, network construction without pre-sorting of input samples yielded only 3,297 genes (15.9%) and 129,134 relationships. in the global network. Here we show that pre-clustering of microarray samples helps approximate condition-specific networks and allows for dynamic thresholding using un-supervised methods. Because RMT ensures only highly significant interactions are kept, the GIL compendium consists of 558,022 unique high quality A. thaliana co-expression relationships across almost all of the measurable genes on the ATH1 array. For A. thaliana, these networks represent the largest compendium to date of significant gene co-expression relationships, and are a means to explore complex pathway, polygenic, and pleiotropic relationships for this focal model plant. The networks can be explored at sysbio.genome.clemson.edu. Finally, this method is applicable to any large expression profile collection for any organism and is best suited where a knowledge-independent network construction method is desired.

  9. Maximizing capture of gene co-expression relationships through pre-clustering of input expression samples: an Arabidopsis case study

    PubMed Central

    2013-01-01

    Background In genomics, highly relevant gene interaction (co-expression) networks have been constructed by finding significant pair-wise correlations between genes in expression datasets. These networks are then mined to elucidate biological function at the polygenic level. In some cases networks may be constructed from input samples that measure gene expression under a variety of different conditions, such as for different genotypes, environments, disease states and tissues. When large sets of samples are obtained from public repositories it is often unmanageable to associate samples into condition-specific groups, and combining samples from various conditions has a negative effect on network size. A fixed significance threshold is often applied also limiting the size of the final network. Therefore, we propose pre-clustering of input expression samples to approximate condition-specific grouping of samples and individual network construction of each group as a means for dynamic significance thresholding. The net effect is increase sensitivity thus maximizing the total co-expression relationships in the final co-expression network compendium. Results A total of 86 Arabidopsis thaliana co-expression networks were constructed after k-means partitioning of 7,105 publicly available ATH1 Affymetrix microarray samples. We term each pre-sorted network a Gene Interaction Layer (GIL). Random Matrix Theory (RMT), an un-supervised thresholding method, was used to threshold each of the 86 networks independently, effectively providing a dynamic (non-global) threshold for the network. The overall gene count across all GILs reached 19,588 genes (94.7% measured gene coverage) and 558,022 unique co-expression relationships. In comparison, network construction without pre-sorting of input samples yielded only 3,297 genes (15.9%) and 129,134 relationships. in the global network. Conclusions Here we show that pre-clustering of microarray samples helps approximate condition-specific networks and allows for dynamic thresholding using un-supervised methods. Because RMT ensures only highly significant interactions are kept, the GIL compendium consists of 558,022 unique high quality A. thaliana co-expression relationships across almost all of the measurable genes on the ATH1 array. For A. thaliana, these networks represent the largest compendium to date of significant gene co-expression relationships, and are a means to explore complex pathway, polygenic, and pleiotropic relationships for this focal model plant. The networks can be explored at sysbio.genome.clemson.edu. Finally, this method is applicable to any large expression profile collection for any organism and is best suited where a knowledge-independent network construction method is desired. PMID:23738693

  10. CMIP: a software package capable of reconstructing genome-wide regulatory networks using gene expression data.

    PubMed

    Zheng, Guangyong; Xu, Yaochen; Zhang, Xiujun; Liu, Zhi-Ping; Wang, Zhuo; Chen, Luonan; Zhu, Xin-Guang

    2016-12-23

    A gene regulatory network (GRN) represents interactions of genes inside a cell or tissue, in which vertexes and edges stand for genes and their regulatory interactions respectively. Reconstruction of gene regulatory networks, in particular, genome-scale networks, is essential for comparative exploration of different species and mechanistic investigation of biological processes. Currently, most of network inference methods are computationally intensive, which are usually effective for small-scale tasks (e.g., networks with a few hundred genes), but are difficult to construct GRNs at genome-scale. Here, we present a software package for gene regulatory network reconstruction at a genomic level, in which gene interaction is measured by the conditional mutual information measurement using a parallel computing framework (so the package is named CMIP). The package is a greatly improved implementation of our previous PCA-CMI algorithm. In CMIP, we provide not only an automatic threshold determination method but also an effective parallel computing framework for network inference. Performance tests on benchmark datasets show that the accuracy of CMIP is comparable to most current network inference methods. Moreover, running tests on synthetic datasets demonstrate that CMIP can handle large datasets especially genome-wide datasets within an acceptable time period. In addition, successful application on a real genomic dataset confirms its practical applicability of the package. This new software package provides a powerful tool for genomic network reconstruction to biological community. The software can be accessed at http://www.picb.ac.cn/CMIP/ .

  11. Efficient Reverse-Engineering of a Developmental Gene Regulatory Network

    PubMed Central

    Cicin-Sain, Damjan; Ashyraliyev, Maksat; Jaeger, Johannes

    2012-01-01

    Understanding the complex regulatory networks underlying development and evolution of multi-cellular organisms is a major problem in biology. Computational models can be used as tools to extract the regulatory structure and dynamics of such networks from gene expression data. This approach is called reverse engineering. It has been successfully applied to many gene networks in various biological systems. However, to reconstitute the structure and non-linear dynamics of a developmental gene network in its spatial context remains a considerable challenge. Here, we address this challenge using a case study: the gap gene network involved in segment determination during early development of Drosophila melanogaster. A major problem for reverse-engineering pattern-forming networks is the significant amount of time and effort required to acquire and quantify spatial gene expression data. We have developed a simplified data processing pipeline that considerably increases the throughput of the method, but results in data of reduced accuracy compared to those previously used for gap gene network inference. We demonstrate that we can infer the correct network structure using our reduced data set, and investigate minimal data requirements for successful reverse engineering. Our results show that timing and position of expression domain boundaries are the crucial features for determining regulatory network structure from data, while it is less important to precisely measure expression levels. Based on this, we define minimal data requirements for gap gene network inference. Our results demonstrate the feasibility of reverse-engineering with much reduced experimental effort. This enables more widespread use of the method in different developmental contexts and organisms. Such systematic application of data-driven models to real-world networks has enormous potential. Only the quantitative investigation of a large number of developmental gene regulatory networks will allow us to discover whether there are rules or regularities governing development and evolution of complex multi-cellular organisms. PMID:22807664

  12. Analysis of gene network robustness based on saturated fixed point attractors

    PubMed Central

    2014-01-01

    The analysis of gene network robustness to noise and mutation is important for fundamental and practical reasons. Robustness refers to the stability of the equilibrium expression state of a gene network to variations of the initial expression state and network topology. Numerical simulation of these variations is commonly used for the assessment of robustness. Since there exists a great number of possible gene network topologies and initial states, even millions of simulations may be still too small to give reliable results. When the initial and equilibrium expression states are restricted to being saturated (i.e., their elements can only take values 1 or −1 corresponding to maximum activation and maximum repression of genes), an analytical gene network robustness assessment is possible. We present this analytical treatment based on determination of the saturated fixed point attractors for sigmoidal function models. The analysis can determine (a) for a given network, which and how many saturated equilibrium states exist and which and how many saturated initial states converge to each of these saturated equilibrium states and (b) for a given saturated equilibrium state or a given pair of saturated equilibrium and initial states, which and how many gene networks, referred to as viable, share this saturated equilibrium state or the pair of saturated equilibrium and initial states. We also show that the viable networks sharing a given saturated equilibrium state must follow certain patterns. These capabilities of the analytical treatment make it possible to properly define and accurately determine robustness to noise and mutation for gene networks. Previous network research conclusions drawn from performing millions of simulations follow directly from the results of our analytical treatment. Furthermore, the analytical results provide criteria for the identification of model validity and suggest modified models of gene network dynamics. The yeast cell-cycle network is used as an illustration of the practical application of this analytical treatment. PMID:24650364

  13. Integrated Module and Gene-Specific Regulatory Inference Implicates Upstream Signaling Networks

    PubMed Central

    Roy, Sushmita; Lagree, Stephen; Hou, Zhonggang; Thomson, James A.; Stewart, Ron; Gasch, Audrey P.

    2013-01-01

    Regulatory networks that control gene expression are important in diverse biological contexts including stress response and development. Each gene's regulatory program is determined by module-level regulation (e.g. co-regulation via the same signaling system), as well as gene-specific determinants that can fine-tune expression. We present a novel approach, Modular regulatory network learning with per gene information (MERLIN), that infers regulatory programs for individual genes while probabilistically constraining these programs to reveal module-level organization of regulatory networks. Using edge-, regulator- and module-based comparisons of simulated networks of known ground truth, we find MERLIN reconstructs regulatory programs of individual genes as well or better than existing approaches of network reconstruction, while additionally identifying modular organization of the regulatory networks. We use MERLIN to dissect global transcriptional behavior in two biological contexts: yeast stress response and human embryonic stem cell differentiation. Regulatory modules inferred by MERLIN capture co-regulatory relationships between signaling proteins and downstream transcription factors thereby revealing the upstream signaling systems controlling transcriptional responses. The inferred networks are enriched for regulators with genetic or physical interactions, supporting the inference, and identify modules of functionally related genes bound by the same transcriptional regulators. Our method combines the strengths of per-gene and per-module methods to reveal new insights into transcriptional regulation in stress and development. PMID:24146602

  14. Transcriptional network inference from functional similarity and expression data: a global supervised approach.

    PubMed

    Ambroise, Jérôme; Robert, Annie; Macq, Benoit; Gala, Jean-Luc

    2012-01-06

    An important challenge in system biology is the inference of biological networks from postgenomic data. Among these biological networks, a gene transcriptional regulatory network focuses on interactions existing between transcription factors (TFs) and and their corresponding target genes. A large number of reverse engineering algorithms were proposed to infer such networks from gene expression profiles, but most current methods have relatively low predictive performances. In this paper, we introduce the novel TNIFSED method (Transcriptional Network Inference from Functional Similarity and Expression Data), that infers a transcriptional network from the integration of correlations and partial correlations of gene expression profiles and gene functional similarities through a supervised classifier. In the current work, TNIFSED was applied to predict the transcriptional network in Escherichia coli and in Saccharomyces cerevisiae, using datasets of 445 and 170 affymetrix arrays, respectively. Using the area under the curve of the receiver operating characteristics and the F-measure as indicators, we showed the predictive performance of TNIFSED to be better than unsupervised state-of-the-art methods. TNIFSED performed slightly worse than the supervised SIRENE algorithm for the target genes identification of the TF having a wide range of yet identified target genes but better for TF having only few identified target genes. Our results indicate that TNIFSED is complementary to the SIRENE algorithm, and particularly suitable to discover target genes of "orphan" TFs.

  15. EgoNet: identification of human disease ego-network modules

    PubMed Central

    2014-01-01

    Background Mining novel biomarkers from gene expression profiles for accurate disease classification is challenging due to small sample size and high noise in gene expression measurements. Several studies have proposed integrated analyses of microarray data and protein-protein interaction (PPI) networks to find diagnostic subnetwork markers. However, the neighborhood relationship among network member genes has not been fully considered by those methods, leaving many potential gene markers unidentified. The main idea of this study is to take full advantage of the biological observation that genes associated with the same or similar diseases commonly reside in the same neighborhood of molecular networks. Results We present EgoNet, a novel method based on egocentric network-analysis techniques, to exhaustively search and prioritize disease subnetworks and gene markers from a large-scale biological network. When applied to a triple-negative breast cancer (TNBC) microarray dataset, the top selected modules contain both known gene markers in TNBC and novel candidates, such as RAD51 and DOK1, which play a central role in their respective ego-networks by connecting many differentially expressed genes. Conclusions Our results suggest that EgoNet, which is based on the ego network concept, allows the identification of novel biomarkers and provides a deeper understanding of their roles in complex diseases. PMID:24773628

  16. Gene Regulatory Network Inferences Using a Maximum-Relevance and Maximum-Significance Strategy

    PubMed Central

    Liu, Wei; Zhu, Wen; Liao, Bo; Chen, Xiangtao

    2016-01-01

    Recovering gene regulatory networks from expression data is a challenging problem in systems biology that provides valuable information on the regulatory mechanisms of cells. A number of algorithms based on computational models are currently used to recover network topology. However, most of these algorithms have limitations. For example, many models tend to be complicated because of the “large p, small n” problem. In this paper, we propose a novel regulatory network inference method called the maximum-relevance and maximum-significance network (MRMSn) method, which converts the problem of recovering networks into a problem of how to select the regulator genes for each gene. To solve the latter problem, we present an algorithm that is based on information theory and selects the regulator genes for a specific gene by maximizing the relevance and significance. A first-order incremental search algorithm is used to search for regulator genes. Eventually, a strict constraint is adopted to adjust all of the regulatory relationships according to the obtained regulator genes and thus obtain the complete network structure. We performed our method on five different datasets and compared our method to five state-of-the-art methods for network inference based on information theory. The results confirm the effectiveness of our method. PMID:27829000

  17. Statistical indicators of collective behavior and functional clusters in gene networks of yeast

    NASA Astrophysics Data System (ADS)

    Živković, J.; Tadić, B.; Wick, N.; Thurner, S.

    2006-03-01

    We analyze gene expression time-series data of yeast (S. cerevisiae) measured along two full cell-cycles. We quantify these data by using q-exponentials, gene expression ranking and a temporal mean-variance analysis. We construct gene interaction networks based on correlation coefficients and study the formation of the corresponding giant components and minimum spanning trees. By coloring genes according to their cell function we find functional clusters in the correlation networks and functional branches in the associated trees. Our results suggest that a percolation point of functional clusters can be identified on these gene expression correlation networks.

  18. Position-Specific HIV Risk in a Large Network of Homeless Youths

    PubMed Central

    Barman-Adhikari, Anamika; Milburn, Norweeta G.; Monro, William

    2012-01-01

    Objectives. We examined interconnections among runaway and homeless youths (RHYs) and how aggregated network structure position was associated with HIV risk in this population. Methods. We collected individual and social network data from 136 RHYs. On the basis of these data, we generated a sociomatrix, accomplished network visualization with a “spring embedder,” and examined k-cores. We used multivariate logistic regression models to assess associations between peripheral and nonperipheral network position and recent unprotected sexual intercourse. Results. Small numbers of nominations at the individual level aggregated into a large social network with a visible core, periphery, and small clusters. Female youths were more likely to be in the core, as were youths who had been homeless for 2 years or more. Youths at the periphery were less likely to report unprotected intercourse and had been homeless for a shorter duration. Conclusions. HIV risk was a function of risk-taking youths' connections with one another and was associated with position in the overall network structure. Social network–based prevention programs, young women's housing and health programs, and housing-first programs for peripheral youths could be effective strategies for preventing HIV among this population. PMID:22095350

  19. Highly stable multi-wall carbon nanotubes@poly(3,4-ethylenedioxythiophene)/poly(styrene sulfonate) core-shell composites with three-dimensional porous nano-network for electrochemical capacitors

    NASA Astrophysics Data System (ADS)

    Zhou, Haihan; Han, Gaoyi; Chang, Yunzhen; Fu, Dongying; Xiao, Yaoming

    2015-01-01

    A facile and feasible electrochemical polymerization method has been used to construct the multi-wall carbon nanotubes@poly(3,4-ethylenedioxythiophene)/poly(styrene sulfonate) (MWCNTs@PEDOT/PSS) core-shell composites with three-dimensional (3D) porous nano-network microstructure. The composites are characterized with Fourier transform infrared spectroscopy, scanning electron microscope, and transmission electron microscopy. This special core-shell nanostructure can significantly reduce the ions diffusion distance and the 3D porous nano-network microstructure effectively enlarges the electrode/electrolyte interface. The electrochemical tests including cyclic voltammetry, galvanostatic charge/discharge measurements, and electrochemical impedance spectroscopy tests are performed, the results manifest the MWCNTs@PEDOT/PSS core-shell composites have superior capacitive behaviors and excellent cyclic stability, and a high areal capacitance of 98.1 mF cm-2 is achieved at 5 mV s-1 cyclic voltammetry scan. Furthermore, the MWCNTs@PEDOT/PSS composites exhibit obviously superior capacitive performance than that of PEDOT/PSS and PEDOT/Cl electrodes, indicating the effective composite of MWCNTs and PEDOT noticeably boosts the capacitive performance of PEDOT-based electrodes for electrochemical energy storage. Such a highly stable core-shell 3D network structural composite is very promising to be used as electrode materials for the high-performance electrochemical capacitors.

  20. Detecting recurrent gene mutation in interaction network context using multi-scale graph diffusion.

    PubMed

    Babaei, Sepideh; Hulsman, Marc; Reinders, Marcel; de Ridder, Jeroen

    2013-01-23

    Delineating the molecular drivers of cancer, i.e. determining cancer genes and the pathways which they deregulate, is an important challenge in cancer research. In this study, we aim to identify pathways of frequently mutated genes by exploiting their network neighborhood encoded in the protein-protein interaction network. To this end, we introduce a multi-scale diffusion kernel and apply it to a large collection of murine retroviral insertional mutagenesis data. The diffusion strength plays the role of scale parameter, determining the size of the network neighborhood that is taken into account. As a result, in addition to detecting genes with frequent mutations in their genomic vicinity, we find genes that harbor frequent mutations in their interaction network context. We identify densely connected components of known and putatively novel cancer genes and demonstrate that they are strongly enriched for cancer related pathways across the diffusion scales. Moreover, the mutations in the clusters exhibit a significant pattern of mutual exclusion, supporting the conjecture that such genes are functionally linked. Using multi-scale diffusion kernel, various infrequently mutated genes are found to harbor significant numbers of mutations in their interaction network neighborhood. Many of them are well-known cancer genes. The results demonstrate the importance of defining recurrent mutations while taking into account the interaction network context. Importantly, the putative cancer genes and networks detected in this study are found to be significant at different diffusion scales, confirming the necessity of a multi-scale analysis.

  1. Selection Shapes Transcriptional Logic and Regulatory Specialization in Genetic Networks.

    PubMed

    Fogelmark, Karl; Peterson, Carsten; Troein, Carl

    2016-01-01

    Living organisms need to regulate their gene expression in response to environmental signals and internal cues. This is a computational task where genes act as logic gates that connect to form transcriptional networks, which are shaped at all scales by evolution. Large-scale mutations such as gene duplications and deletions add and remove network components, whereas smaller mutations alter the connections between them. Selection determines what mutations are accepted, but its importance for shaping the resulting networks has been debated. To investigate the effects of selection in the shaping of transcriptional networks, we derive transcriptional logic from a combinatorially powerful yet tractable model of the binding between DNA and transcription factors. By evolving the resulting networks based on their ability to function as either a simple decision system or a circadian clock, we obtain information on the regulation and logic rules encoded in functional transcriptional networks. Comparisons are made between networks evolved for different functions, as well as with structurally equivalent but non-functional (neutrally evolved) networks, and predictions are validated against the transcriptional network of E. coli. We find that the logic rules governing gene expression depend on the function performed by the network. Unlike the decision systems, the circadian clocks show strong cooperative binding and negative regulation, which achieves tight temporal control of gene expression. Furthermore, we find that transcription factors act preferentially as either activators or repressors, both when binding multiple sites for a single target gene and globally in the transcriptional networks. This separation into positive and negative regulators requires gene duplications, which highlights the interplay between mutation and selection in shaping the transcriptional networks.

  2. PLGA/polymeric liposome for targeted drug and gene co-delivery.

    PubMed

    Wang, Hanjie; Zhao, Peiqi; Su, Wenya; Wang, Sheng; Liao, Zhenyu; Niu, Ruifang; Chang, Jin

    2010-11-01

    Chemotherapy is one of the most effective approaches to treat cancers in the clinic, but the problems, such as multidrug resistance (MDR), low bioavailability and toxicity, severely constrain the further application of chemotherapy. Our group recently reported that cationic PLGA/folate coated PEGlated polymeric liposome core-shell nanoparticles (PLGA/FPL NPs). It was self-assembled from a hydrophobic PLGA core and a hydrophilic folate coated PEGlated lipid shell for targeting co-delivery of drug and gene. Hydrophobic drugs can be incorporated into the core and the cationic shell of the drug-loaded nanoparticles can be used to bind DNA. The drug-loaded PLGA/FPL NPs/DNA complexes offer advantages to overcome these problems mentioned above, such as co-delivery of drugs and DNA to improving the chemosensitivity of cancer cells at a gene level, and targeting delivery of drug to the cancer tissue that enhance the bioavailability and reduce the toxicity. The experiment showed that nanoparticles have core-shell structure with nanosize, sustained drug release profile and good DNA-binding ability. Importantly, the core-shell nanoparticles achieve the possibility of co-delivering drugs and genes to the same cells with high gene transfection and drug delivery efficiency. Our data suggest that the PLGA/FPL NPs may be a useful drug and gene co-delivery system. Copyright © 2010 Elsevier Ltd. All rights reserved.

  3. [Fanconi anemia: genes and function(s) revisited].

    PubMed

    Papadopoulo, Dora; Moustacchi, Ethel

    2005-01-01

    Fanconi anemia (FA), a rare inherited disorder, exhibits a complex phenotype including progressive bone marrow failure, congenital malformations and increased risk of cancers, mainly acute myeloid leukaemia. At the cellular level, FA is characterized by hypersensitivity to DNA cross-linking agents and by high frequencies of induced chromosomal aberrations, a property used for diagnosis. FA results from mutations in one of the eleven FANC (FANCA to FANCJ) genes. Nine of them have been identified. In addition, FANCD1 gene has been shown to be identical to BRCA2, one of the two breast cancer susceptibility genes. Seven of the FANC proteins form a complex, which exists in four different forms depending of its subcellular localisation. Four FANC proteins (D1(BRCA2), D2, I and J) are not associated to the complex. The presence of the nuclear form of the FA core complex is necessary for the mono-ubiquitinylation of FANCD2 protein, a modification required for its re-localization to nuclear foci, likely to be sites of DNA repair. A clue towards understanding the molecular function of the FANC genes comes from the recently identified connection of FANC to the BRCA1, ATM, NBS1 and ATR genes. Two of the FANC proteins (A and D2) directly interact with BRCA1, which in turn interacts with the MRE11/RAD50/NBS1 complex, which is one of the key components in the mechanisms involved in the cellular response to DNA double strand breaks (DSB). Moreover, ATM, a protein kinase that plays a central role in the network of DSB signalling, phosphorylates in vitro and in vivo FANCD2 in response to ionising radiations. Moreover, the NBS1 protein and the monoubiquitinated form of FANCD2 seem to act together in response to DNA crosslinking agents. Taken together with the previously reported impaired DSB and DNA interstrand crosslinks repair in FA cells, the connection of FANC genes to the ATM, ATR, NBS1 and BRCA1 links the FANC genes function to the finely orchestrated network involved in the sensing, signalling and repair of DNA replication-blocking lesions.

  4. Gene Coexpression Network Alignment and Conservation of Gene Modules between Two Grass Species: Maize and Rice[C][W][OA

    PubMed Central

    Ficklin, Stephen P.; Feltus, F. Alex

    2011-01-01

    One major objective for plant biology is the discovery of molecular subsystems underlying complex traits. The use of genetic and genomic resources combined in a systems genetics approach offers a means for approaching this goal. This study describes a maize (Zea mays) gene coexpression network built from publicly available expression arrays. The maize network consisted of 2,071 loci that were divided into 34 distinct modules that contained 1,928 enriched functional annotation terms and 35 cofunctional gene clusters. Of note, 391 maize genes of unknown function were found to be coexpressed within modules along with genes of known function. A global network alignment was made between this maize network and a previously described rice (Oryza sativa) coexpression network. The IsoRankN tool was used, which incorporates both gene homology and network topology for the alignment. A total of 1,173 aligned loci were detected between the two grass networks, which condensed into 154 conserved subgraphs that preserved 4,758 coexpression edges in rice and 6,105 coexpression edges in maize. This study provides an early view into maize coexpression space and provides an initial network-based framework for the translation of functional genomic and genetic information between these two vital agricultural species. PMID:21606319

  5. Gene coexpression network alignment and conservation of gene modules between two grass species: maize and rice.

    PubMed

    Ficklin, Stephen P; Feltus, F Alex

    2011-07-01

    One major objective for plant biology is the discovery of molecular subsystems underlying complex traits. The use of genetic and genomic resources combined in a systems genetics approach offers a means for approaching this goal. This study describes a maize (Zea mays) gene coexpression network built from publicly available expression arrays. The maize network consisted of 2,071 loci that were divided into 34 distinct modules that contained 1,928 enriched functional annotation terms and 35 cofunctional gene clusters. Of note, 391 maize genes of unknown function were found to be coexpressed within modules along with genes of known function. A global network alignment was made between this maize network and a previously described rice (Oryza sativa) coexpression network. The IsoRankN tool was used, which incorporates both gene homology and network topology for the alignment. A total of 1,173 aligned loci were detected between the two grass networks, which condensed into 154 conserved subgraphs that preserved 4,758 coexpression edges in rice and 6,105 coexpression edges in maize. This study provides an early view into maize coexpression space and provides an initial network-based framework for the translation of functional genomic and genetic information between these two vital agricultural species.

  6. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment

    PubMed Central

    Uddin, Raihan; Singh, Shiva M.

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in “learning and memory” related functions and pathways. Subsequent differential network analysis of this “learning and memory” module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning. PMID:29066959

  7. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

    PubMed

    Uddin, Raihan; Singh, Shiva M

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning.

  8. WGCNA: an R package for weighted correlation network analysis

    PubMed Central

    Langfelder, Peter; Horvath, Steve

    2008-01-01

    Background Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e.g. cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial. Results The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings. Conclusion The WGCNA package provides R functions for weighted correlation network analysis, e.g. co-expression network analysis of gene expression data. The R package along with its source code and additional material are freely available at . PMID:19114008

  9. Network topologies and convergent aetiologies arising from deletions and duplications observed in individuals with autism.

    PubMed

    Noh, Hyun Ji; Ponting, Chris P; Boulding, Hannah C; Meader, Stephen; Betancur, Catalina; Buxbaum, Joseph D; Pinto, Dalila; Marshall, Christian R; Lionel, Anath C; Scherer, Stephen W; Webber, Caleb

    2013-06-01

    Autism Spectrum Disorders (ASD) are highly heritable and characterised by impairments in social interaction and communication, and restricted and repetitive behaviours. Considering four sets of de novo copy number variants (CNVs) identified in 181 individuals with autism and exploiting mouse functional genomics and known protein-protein interactions, we identified a large and significantly interconnected interaction network. This network contains 187 genes affected by CNVs drawn from 45% of the patients we considered and 22 genes previously implicated in ASD, of which 192 form a single interconnected cluster. On average, those patients with copy number changed genes from this network possess changes in 3 network genes, suggesting that epistasis mediated through the network is extensive. Correspondingly, genes that are highly connected within the network, and thus whose copy number change is predicted by the network to be more phenotypically consequential, are significantly enriched among patients that possess only a single ASD-associated network copy number changed gene (p = 0.002). Strikingly, deleted or disrupted genes from the network are significantly enriched in GO-annotated positive regulators (2.3-fold enrichment, corrected p = 2×10(-5)), whereas duplicated genes are significantly enriched in GO-annotated negative regulators (2.2-fold enrichment, corrected p = 0.005). The direction of copy change is highly informative in the context of the network, providing the means through which perturbations arising from distinct deletions or duplications can yield a common outcome. These findings reveal an extensive ASD-associated molecular network, whose topology indicates ASD-relevant mutational deleteriousness and that mechanistically details how convergent aetiologies can result extensively from CNVs affecting pathways causally implicated in ASD.

  10. Poly-dimensional network comparative analysis reveals the pure pharmacological mechanism of baicalin in the targeted network of mouse cerebral ischemia.

    PubMed

    Liu, Qiong; Liu, Jun; Wang, Pengqian; Zhang, Yingying; Li, Bing; Yu, Yanan; Dang, Haixia; Li, Haixia; Zhang, Xiaoxu; Wang, Zhong

    2017-07-01

    This study aimed to investigate the pure pharmacological mechanisms of baicalin/baicalein (BA) in the targeted network of mouse cerebral ischemia using a poly-dimensional network comparative analysis. Eighty mice with induced focal cerebral ischemia were randomly divided into four groups: BA, Concha Margaritifera (CM), vehicle and sham group. A poly-dimensional comparative analysis of the expression levels of 374 stroke-related genes in each of the four groups was performed using MetaCore. BA significantly reduced the ischemic infarct volume (P<0.05), whereas CM was ineffective. Two processes and 10 network nodes were shared between "BA vs CM" and vehicle, but there were no overlapping pathways. Two pathways, three processes and 12 network nodes overlapped in "BA vs CM" and BA. The pure pharmacological mechanism of BA resulted in targeting of pathways related to development, G-protein signaling, apoptosis, signal transduction and immunity. The biological processes affected by BA were primarily found to correlate with apoptotic, anti-apoptotic and neurophysiological processes. Three network nodes changed from up-regulation to down-regulation, while mitogen-activated protein kinase kinase 6 (MAP2K6, also known as MEK6) changed from down-regulation to up-regulation in "BA vs CM" and vehicle. The changed nodes were all related to cell death and development. The pure pharmacological mechanism of BA is related to immunity, apoptosis, development, cytoskeletal remodeling, transduction and neurophysiology, as ascertained using a poly-dimensional network comparative analysis. Copyright © 2017. Published by Elsevier B.V.

  11. ProphTools: general prioritization tools for heterogeneous biological networks.

    PubMed

    Navarro, Carmen; Martínez, Victor; Blanco, Armando; Cano, Carlos

    2017-12-01

    Networks have been proven effective representations for the analysis of biological data. As such, there exist multiple methods to extract knowledge from biological networks. However, these approaches usually limit their scope to a single biological entity type of interest or they lack the flexibility to analyze user-defined data. We developed ProphTools, a flexible open-source command-line tool that performs prioritization on a heterogeneous network. ProphTools prioritization combines a Flow Propagation algorithm similar to a Random Walk with Restarts and a weighted propagation method. A flexible model for the representation of a heterogeneous network allows the user to define a prioritization problem involving an arbitrary number of entity types and their interconnections. Furthermore, ProphTools provides functionality to perform cross-validation tests, allowing users to select the best network configuration for a given problem. ProphTools core prioritization methodology has already been proven effective in gene-disease prioritization and drug repositioning. Here we make ProphTools available to the scientific community as flexible, open-source software and perform a new proof-of-concept case study on long noncoding RNAs (lncRNAs) to disease prioritization. ProphTools is robust prioritization software that provides the flexibility not present in other state-of-the-art network analysis approaches, enabling researchers to perform prioritization tasks on any user-defined heterogeneous network. Furthermore, the application to lncRNA-disease prioritization shows that ProphTools can reach the performance levels of ad hoc prioritization tools without losing its generality. © The Authors 2017. Published by Oxford University Press.

  12. A palladium-doped ceria@carbon core-sheath nanowire network: a promising catalyst support for alcohol electrooxidation reactions

    NASA Astrophysics Data System (ADS)

    Tan, Qiang; Du, Chunyu; Sun, Yongrong; Du, Lei; Yin, Geping; Gao, Yunzhi

    2015-08-01

    A novel palladium-doped ceria and carbon core-sheath nanowire network (Pd-CeO2@C CSNWN) is synthesized by a template-free and surfactant-free solvothermal process, followed by high temperature carbonization. This hierarchical network serves as a new class of catalyst support to enhance the activity and durability of noble metal catalysts for alcohol oxidation reactions. Its supported Pd nanoparticles, Pd/(Pd-CeO2@C CSNWN), exhibit >9 fold increase in activity toward the ethanol oxidation over the state-of-the-art Pd/C catalyst, which is the highest among the reported Pd systems. Moreover, stability tests show a virtually unchanged activity after 1000 cycles. The high activity is mainly attributed to the superior oxygen-species releasing capability of Pd-doped CeO2 nanowires by accelerating the removal of the poisoning intermediate. The unique interconnected one-dimensional core-sheath structure is revealed to facilitate immobilization of the metal catalysts, leading to the improved durability. This core-sheath nanowire network opens up a new strategy for catalyst performance optimization for next-generation fuel cells.A novel palladium-doped ceria and carbon core-sheath nanowire network (Pd-CeO2@C CSNWN) is synthesized by a template-free and surfactant-free solvothermal process, followed by high temperature carbonization. This hierarchical network serves as a new class of catalyst support to enhance the activity and durability of noble metal catalysts for alcohol oxidation reactions. Its supported Pd nanoparticles, Pd/(Pd-CeO2@C CSNWN), exhibit >9 fold increase in activity toward the ethanol oxidation over the state-of-the-art Pd/C catalyst, which is the highest among the reported Pd systems. Moreover, stability tests show a virtually unchanged activity after 1000 cycles. The high activity is mainly attributed to the superior oxygen-species releasing capability of Pd-doped CeO2 nanowires by accelerating the removal of the poisoning intermediate. The unique interconnected one-dimensional core-sheath structure is revealed to facilitate immobilization of the metal catalysts, leading to the improved durability. This core-sheath nanowire network opens up a new strategy for catalyst performance optimization for next-generation fuel cells. Electronic supplementary information (ESI) available. See DOI: 10.1039/c5nr03023d

  13. Construction of regulatory networks using expression time-series data of a genotyped population.

    PubMed

    Yeung, Ka Yee; Dombek, Kenneth M; Lo, Kenneth; Mittler, John E; Zhu, Jun; Schadt, Eric E; Bumgarner, Roger E; Raftery, Adrian E

    2011-11-29

    The inference of regulatory and biochemical networks from large-scale genomics data is a basic problem in molecular biology. The goal is to generate testable hypotheses of gene-to-gene influences and subsequently to design bench experiments to confirm these network predictions. Coexpression of genes in large-scale gene-expression data implies coregulation and potential gene-gene interactions, but provide little information about the direction of influences. Here, we use both time-series data and genetics data to infer directionality of edges in regulatory networks: time-series data contain information about the chronological order of regulatory events and genetics data allow us to map DNA variations to variations at the RNA level. We generate microarray data measuring time-dependent gene-expression levels in 95 genotyped yeast segregants subjected to a drug perturbation. We develop a Bayesian model averaging regression algorithm that incorporates external information from diverse data types to infer regulatory networks from the time-series and genetics data. Our algorithm is capable of generating feedback loops. We show that our inferred network recovers existing and novel regulatory relationships. Following network construction, we generate independent microarray data on selected deletion mutants to prospectively test network predictions. We demonstrate the potential of our network to discover de novo transcription-factor binding sites. Applying our construction method to previously published data demonstrates that our method is competitive with leading network construction algorithms in the literature.

  14. Transcriptional dynamics of a conserved gene expression network associated with craniofacial divergence in Arctic charr.

    PubMed

    Ahi, Ehsan Pashay; Kapralova, Kalina Hristova; Pálsson, Arnar; Maier, Valerie Helene; Gudbrandsson, Jóhannes; Snorrason, Sigurdur S; Jónsson, Zophonías O; Franzdóttir, Sigrídur Rut

    2014-01-01

    Understanding the molecular basis of craniofacial variation can provide insights into key developmental mechanisms of adaptive changes and their role in trophic divergence and speciation. Arctic charr (Salvelinus alpinus) is a polymorphic fish species, and, in Lake Thingvallavatn in Iceland, four sympatric morphs have evolved distinct craniofacial structures. We conducted a gene expression study on candidates from a conserved gene coexpression network, focusing on the development of craniofacial elements in embryos of two contrasting Arctic charr morphotypes (benthic and limnetic). Four Arctic charr morphs were studied: one limnetic and two benthic morphs from Lake Thingvallavatn and a limnetic reference aquaculture morph. The presence of morphological differences at developmental stages before the onset of feeding was verified by morphometric analysis. Following up on our previous findings that Mmp2 and Sparc were differentially expressed between morphotypes, we identified a network of genes with conserved coexpression across diverse vertebrate species. A comparative expression study of candidates from this network in developing heads of the four Arctic charr morphs verified the coexpression relationship of these genes and revealed distinct transcriptional dynamics strongly correlated with contrasting craniofacial morphologies (benthic versus limnetic). A literature review and Gene Ontology analysis indicated that a significant proportion of the network genes play a role in extracellular matrix organization and skeletogenesis, and motif enrichment analysis of conserved noncoding regions of network candidates predicted a handful of transcription factors, including Ap1 and Ets2, as potential regulators of the gene network. The expression of Ets2 itself was also found to associate with network gene expression. Genes linked to glucocorticoid signalling were also studied, as both Mmp2 and Sparc are responsive to this pathway. Among those, several transcriptional targets and upstream regulators showed differential expression between the contrasting morphotypes. Interestingly, although selected network genes showed overlapping expression patterns in situ and no morph differences, Timp2 expression patterns differed between morphs. Our comparative study of transcriptional dynamics in divergent craniofacial morphologies of Arctic charr revealed a conserved network of coexpressed genes sharing functional roles in structural morphogenesis. We also implicate transcriptional regulators of the network as targets for future functional studies.

  15. PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.

    PubMed

    Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay

    2015-12-01

    A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Construction of local gene network for revealing different liver function of rats fed deep-fried oil with or without resistant starch.

    PubMed

    Wang, Zhiwei; Liao, Tianqi; Zhou, Zhongkai; Wang, Yuyang; Diao, Yongjia; Strappe, Padraig; Prenzler, Paul; Ayton, Jamie; Blanchard, Chris

    2016-09-06

    To study the mechanism underlying the liver damage induced by deep-fried oil (DO) consumption and the beneficial effects from resistant starch (RS) supplement, differential gene expression and pathway network were analyzed based on RNA sequencing data from rats. The up/down regulated genes and corresponding signaling pathways were used to construct a novel local gene network (LGN). The topology of the network showed characteristics of small-world network, with some pathways demonstrating a high degree. Some changes in genes led to a larger probability occurrence of disease or infection with DO intake. More importantly, the main pathways were found to be almost the same between the two LGNs (30 pathways overlapped in total 48) with gene expression profile. This finding may indicate that RS supplement in DO-containing diet may mainly regulate the genes that related to DO damage, and RS in the diet may provide direct signals to the liver cells and modulate its effect through a network involving complex gene regulatory events. It is the first attempt to reveal the mechanism of the attenuation of liver dysfunction from RS supplement in the DO-containing diet using differential gene expression and pathway network. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  17. Evidence that the Dictyostelium STAT protein Dd-STATa plays a role in the differentiation of inner basal disc cells and identification of a promoter element essential for expression in these cells.

    PubMed

    Shimada, Nao; Maruo, Toshinari; Maeda, Mineko; Urushihara, Hideko; Kawata, Takefumi

    2005-02-01

    Dd-STATa, a Dictyostelium homolog of the metazoan STAT (signal transducers and activators of transcription) proteins, is necessary in the slug for correct entry into culmination. Dd-STATa-null mutant fails to culminate and its phenotype correlates with the loss of a funnel-shaped core region, the pstAB core region, which expresses both the ecmA and ecmB genes. To understand how the differentiation of pstAB core cells is regulated, we identified an EST that is expressed in the core cells of normal slugs but down-regulated in the Dd-STATa-null mutant. This EST, SSK348, encodes a close homolog of the Dictyostelium acetyl-CoA synthetase (ACS). A promoter fragment of the cognate gene, aslA (acetyl-CoA synthetase-like A), was fused to a lacZ reporter and the expression pattern determined. As expected from the behavior of the endogenous aslA gene, the aslA::lacZ fusion gene is not expressed in Dd-STATa-null slugs. In parental cells, the aslA promoter is first activated in the funnel-shaped core cells located at the slug anterior, the "pstAB core." During culmination, the pstAB core cells move down, through the prespore cells, to form the inner part of the basal disc. As the spore mass climbs the stalk, the aslA gene comes to be expressed in cells of the upper and lower cups, structures that cradle the spore head. Deletion and point mutation analyses of the promoter identified an AT-rich sequence that is necessary for expression in the pstAB core. This acts in combination with repressor regions that prevent ectopic aslA expression in the pre-stalk regions of slugs and the stalks of culminants. Thus, this study confirms that Dd-STATa is necessary for the differentiation of pstAB core cells, by showing that it is needed for the activation of the aslA gene. It also identifies aslA promoter elements that are likely to be regulated, directly or indirectly, by Dd-STATa.

  18. No3CoGP: non-conserved and conserved coexpressed gene pairs.

    PubMed

    Mal, Chittabrata; Aftabuddin, Md; Kundu, Sudip

    2014-12-08

    Analyzing the microarray data of different conditions, one can identify the conserved and condition-specific genes and gene modules, and thus can infer the underlying cellular activities. All the available tools based on Bioconductor and R packages differ in how they extract differential coexpression and at what level they study. There is a need for a user-friendly, flexible tool which can start analysis using raw or preprocessed microarray data and can report different levels of useful information. We present a GUI software, No3CoGP: Non-Conserved and Conserved Coexpressed Gene Pairs which takes Affymetrix microarray data (.CEL files or log2 normalized.txt files) along with annotation file (.csv file), Chip Definition File (CDF file) and probe file as inputs, utilizes the concept of network density cut-off and Fisher's z-test to extract biologically relevant information. It can identify four possible types of gene pairs based on their coexpression relationships. These are (i) gene pair showing coexpression in one condition but not in the other, (ii) gene pair which is positively coexpressed in one condition but negatively coexpressed in the other condition, (iii) positively and (iv) negatively coexpressed in both the conditions. Further, it can generate modules of coexpressed genes. Easy-to-use GUI interface enables researchers without knowledge in R language to use No3CoGP. Utilization of one or more CPU cores, depending on the availability, speeds up the program. The output files stored in the respective directories under the user-defined project offer the researchers to unravel condition-specific functionalities of gene, gene sets or modules.

  19. Dynamic network reconstruction from gene expression data applied to immune response during bacterial infection.

    PubMed

    Guthke, Reinhard; Möller, Ulrich; Hoffmann, Martin; Thies, Frank; Töpfer, Susanne

    2005-04-15

    The immune response to bacterial infection represents a complex network of dynamic gene and protein interactions. We present an optimized reverse engineering strategy aimed at a reconstruction of this kind of interaction networks. The proposed approach is based on both microarray data and available biological knowledge. The main kinetics of the immune response were identified by fuzzy clustering of gene expression profiles (time series). The number of clusters was optimized using various evaluation criteria. For each cluster a representative gene with a high fuzzy-membership was chosen in accordance with available physiological knowledge. Then hypothetical network structures were identified by seeking systems of ordinary differential equations, whose simulated kinetics could fit the gene expression profiles of the cluster-representative genes. For the construction of hypothetical network structures singular value decomposition (SVD) based methods and a newly introduced heuristic Network Generation Method here were compared. It turned out that the proposed novel method could find sparser networks and gave better fits to the experimental data. Reinhard.Guthke@hki-jena.de.

  20. Core network infrastructure supporting the VLT at ESO Paranal in Chile

    NASA Astrophysics Data System (ADS)

    Reay, Harold

    2000-06-01

    In October 1997 a number of projects were started at ESO's Paranal Observatory at Cerro Paranal in Chile to upgrade the communications infrastructure in place at the time. The planned upgrades were to internal systems such as computer data networks and telephone installations and also data links connecting Paranal to other ESO sites. This paper details the installation work carried out on the Paranal Core Network (PCN) during the period of October 1997 to December 1999. These installations were to provide both short term solutions to the requirement for reliable high bandwidth network connectivity between Paranal and ESO HQ in Garching, Germany in time for UTI (Antu) first light and perhaps more importantly, to provide the core systems necessary for a site moving towards operational status. This paper explains the reasons for using particular cable types, network topology, and fiber backbone design and implementation. We explain why it was decided to install the PCN in two distinct stages and how equipment used in temporary installations was re-used in the Very Large Telescope networks. Finally we describe the tools used to monitor network and satellite link performance and will discuss whether network backbone bandwidth meets the expected utilization and how this bandwidth can easily be increased in the future should there be a requirement.

  1. Reverse-engineering of gene networks for regulating early blood development from single-cell measurements.

    PubMed

    Wei, Jiangyong; Hu, Xiaohua; Zou, Xiufen; Tian, Tianhai

    2017-12-28

    Recent advances in omics technologies have raised great opportunities to study large-scale regulatory networks inside the cell. In addition, single-cell experiments have measured the gene and protein activities in a large number of cells under the same experimental conditions. However, a significant challenge in computational biology and bioinformatics is how to derive quantitative information from the single-cell observations and how to develop sophisticated mathematical models to describe the dynamic properties of regulatory networks using the derived quantitative information. This work designs an integrated approach to reverse-engineer gene networks for regulating early blood development based on singel-cell experimental observations. The wanderlust algorithm is initially used to develop the pseudo-trajectory for the activities of a number of genes. Since the gene expression data in the developed pseudo-trajectory show large fluctuations, we then use Gaussian process regression methods to smooth the gene express data in order to obtain pseudo-trajectories with much less fluctuations. The proposed integrated framework consists of both bioinformatics algorithms to reconstruct the regulatory network and mathematical models using differential equations to describe the dynamics of gene expression. The developed approach is applied to study the network regulating early blood cell development. A graphic model is constructed for a regulatory network with forty genes and a dynamic model using differential equations is developed for a network of nine genes. Numerical results suggests that the proposed model is able to match experimental data very well. We also examine the networks with more regulatory relations and numerical results show that more regulations may exist. We test the possibility of auto-regulation but numerical simulations do not support the positive auto-regulation. In addition, robustness is used as an importantly additional criterion to select candidate networks. The research results in this work shows that the developed approach is an efficient and effective method to reverse-engineer gene networks using single-cell experimental observations.

  2. The HOX genes are expressed, in vivo, in human tooth germs: in vitro cAMP exposure of dental pulp cells results in parallel HOX network activation and neuronal differentiation.

    PubMed

    D'Antò, Vincenzo; Cantile, Monica; D'Armiento, Maria; Schiavo, Giulia; Spagnuolo, Gianrico; Terracciano, Luigi; Vecchione, Raffaela; Cillo, Clemente

    2006-03-01

    Homeobox-containing genes play a crucial role in odontogenesis. After the detection of Dlx and Msx genes in overlapping domains along maxillary and mandibular processes, a homeobox odontogenic code has been proposed to explain the interaction between different homeobox genes during dental lamina patterning. No role has so far been assigned to the Hox gene network in the homeobox odontogenic code due to studies on specific Hox genes and evolutionary considerations. Despite its involvement in early patterning during embryonal development, the HOX gene network, the most repeat-poor regions of the human genome, controls the phenotype identity of adult eukaryotic cells. Here, according to our results, the HOX gene network appears to be active in human tooth germs between 18 and 24 weeks of development. The immunohistochemical localization of specific HOX proteins mostly concerns the epithelial tooth germ compartment. Furthermore, only a few genes of the network are active in embryonal retromolar tissues, as well as in ectomesenchymal dental pulp cells (DPC) grown in vitro from adult human molar. Exposure of DPCs to cAMP induces the expression of from three to nine total HOX genes of the network in parallel with phenotype modifications with traits of neuronal differentiation. Our observations suggest that: (i) by combining its component genes, the HOX gene network determines the phenotype identity of epithelial and ectomesenchymal cells interacting in the generation of human tooth germ; (ii) cAMP treatment activates the HOX network and induces, in parallel, a neuronal-like phenotype in human primary ectomesenchymal dental pulp cells. 2005 Wiley-Liss, Inc.

  3. Identification and Analyses of AUX-IAA target genes controlling multiple pathways in developing fiber cells of Gossypium hirsutum L

    PubMed Central

    Nigam, Deepti; Sawant, Samir V

    2013-01-01

    Technological development led to an increased interest in systems biological approaches in plants to characterize developmental mechanism and candidate genes relevant to specific tissue or cell morphology. AUX-IAA proteins are important plant-specific putative transcription factors. There are several reports on physiological response of this family in Arabidopsis but in cotton fiber the transcriptional network through which AUX-IAA regulated its target genes is still unknown. in-silico modelling of cotton fiber development specific gene expression data (108 microarrays and 22,737 genes) using Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNe) reveals 3690 putative AUX-IAA target genes of which 139 genes were known to be AUX-IAA co-regulated within Arabidopsis. Further AUX-IAA targeted gene regulatory network (GRN) had substantial impact on the transcriptional dynamics of cotton fiber, as showed by, altered TF networks, and Gene Ontology (GO) biological processes and metabolic pathway associated with its target genes. Analysis of the AUX-IAA-correlated gene network reveals multiple functions for AUX-IAA target genes such as unidimensional cell growth, cellular nitrogen compound metabolic process, nucleosome organization, DNA-protein complex and process related to cell wall. These candidate networks/pathways have a variety of profound impacts on such cellular functions as stress response, cell proliferation, and cell differentiation. While these functions are fairly broad, their underlying TF networks may provide a global view of AUX-IAA regulated gene expression and a GRN that guides future studies in understanding role of AUX-IAA box protein and its targets regulating fiber development. PMID:24497725

  4. Mapping edge-based traffic measurements onto the internal links in MPLS network

    NASA Astrophysics Data System (ADS)

    Zhao, Guofeng; Tang, Hong; Zhang, Yi

    2004-09-01

    Applying multi-protocol label switching techniques to IP-based backbone for traffic engineering goals has shown advantageous. Obtaining a volume of load on each internal link of the network is crucial for traffic engineering applying. Though collecting can be available for each link, such as applying traditional SNMP scheme, the approach may cause heavy processing load and sharply degrade the throughput of the core routers. Then monitoring merely at the edge of the network and mapping the measurements onto the core provides a good alternative way. In this paper, we explore a scheme for traffic mapping with edge-based measurements in MPLS network. It is supposed that the volume of traffic on each internal link over the domain would be mapped onto by measurements available only at ingress nodes. We apply path-based measurements at ingress nodes without enabling measurements in the core of the network. We propose a method that can infer a path from the ingress to the egress node using label distribution protocol without collecting routing data from core routers. Based on flow theory and queuing theory, we prove that our approach is effective and present the algorithm for traffic mapping. We also show performance simulation results that indicate potential of our approach.

  5. Text mining and network analysis to find functional associations of genes in high altitude diseases.

    PubMed

    Bhasuran, Balu; Subramanian, Devika; Natarajan, Jeyakumar

    2018-05-02

    Travel to elevations above 2500 m is associated with the risk of developing one or more forms of acute altitude illness such as acute mountain sickness (AMS), high altitude cerebral edema (HACE) or high altitude pulmonary edema (HAPE). Our work aims to identify the functional association of genes involved in high altitude diseases. In this work we identified the gene networks responsible for high altitude diseases by using the principle of gene co-occurrence statistics from literature and network analysis. First, we mined the literature data from PubMed on high-altitude diseases, and extracted the co-occurring gene pairs. Next, based on their co-occurrence frequency, gene pairs were ranked. Finally, a gene association network was created using statistical measures to explore potential relationships. Network analysis results revealed that EPO, ACE, IL6 and TNF are the top five genes that were found to co-occur with 20 or more genes, while the association between EPAS1 and EGLN1 genes is strongly substantiated. The network constructed from this study proposes a large number of genes that work in-toto in high altitude conditions. Overall, the result provides a good reference for further study of the genetic relationships in high altitude diseases. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. Enriching regulatory networks by bootstrap learning using optimised GO-based gene similarity and gene links mined from PubMed abstracts

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Taylor, Ronald C.; Sanfilippo, Antonio P.; McDermott, Jason E.

    2011-02-18

    Transcriptional regulatory networks are being determined using “reverse engineering” methods that infer connections based on correlations in gene state. Corroboration of such networks through independent means such as evidence from the biomedical literature is desirable. Here, we explore a novel approach, a bootstrapping version of our previous Cross-Ontological Analytic method (XOA) that can be used for semi-automated annotation and verification of inferred regulatory connections, as well as for discovery of additional functional relationships between the genes. First, we use our annotation and network expansion method on a biological network learned entirely from the literature. We show how new relevant linksmore » between genes can be iteratively derived using a gene similarity measure based on the Gene Ontology that is optimized on the input network at each iteration. Second, we apply our method to annotation, verification, and expansion of a set of regulatory connections found by the Context Likelihood of Relatedness algorithm.« less

  7. Interfacing cellular networks of S. cerevisiae and E. coli: Connecting dynamic and genetic information

    PubMed Central

    2013-01-01

    Background In recent years, various types of cellular networks have penetrated biology and are nowadays used omnipresently for studying eukaryote and prokaryote organisms. Still, the relation and the biological overlap among phenomenological and inferential gene networks, e.g., between the protein interaction network and the gene regulatory network inferred from large-scale transcriptomic data, is largely unexplored. Results We provide in this study an in-depth analysis of the structural, functional and chromosomal relationship between a protein-protein network, a transcriptional regulatory network and an inferred gene regulatory network, for S. cerevisiae and E. coli. Further, we study global and local aspects of these networks and their biological information overlap by comparing, e.g., the functional co-occurrence of Gene Ontology terms by exploiting the available interaction structure among the genes. Conclusions Although the individual networks represent different levels of cellular interactions with global structural and functional dissimilarities, we observe crucial functions of their network interfaces for the assembly of protein complexes, proteolysis, transcription, translation, metabolic and regulatory interactions. Overall, our results shed light on the integrability of these networks and their interfacing biological processes. PMID:23663484

  8. Identification and validation of differentially expressed transcripts by RNA-sequencing of formalin-fixed, paraffin-embedded (FFPE) lung tissue from patients with Idiopathic Pulmonary Fibrosis.

    PubMed

    Vukmirovic, Milica; Herazo-Maya, Jose D; Blackmon, John; Skodric-Trifunovic, Vesna; Jovanovic, Dragana; Pavlovic, Sonja; Stojsic, Jelena; Zeljkovic, Vesna; Yan, Xiting; Homer, Robert; Stefanovic, Branko; Kaminski, Naftali

    2017-01-12

    Idiopathic Pulmonary Fibrosis (IPF) is a lethal lung disease of unknown etiology. A major limitation in transcriptomic profiling of lung tissue in IPF has been a dependence on snap-frozen fresh tissues (FF). In this project we sought to determine whether genome scale transcript profiling using RNA Sequencing (RNA-Seq) could be applied to archived Formalin-Fixed Paraffin-Embedded (FFPE) IPF tissues. We isolated total RNA from 7 IPF and 5 control FFPE lung tissues and performed 50 base pair paired-end sequencing on Illumina 2000 HiSeq. TopHat2 was used to map sequencing reads to the human genome. On average ~62 million reads (53.4% of ~116 million reads) were mapped per sample. 4,131 genes were differentially expressed between IPF and controls (1,920 increased and 2,211 decreased (FDR < 0.05). We compared our results to differentially expressed genes calculated from a previously published dataset generated from FF tissues analyzed on Agilent microarrays (GSE47460). The overlap of differentially expressed genes was very high (760 increased and 1,413 decreased, FDR < 0.05). Only 92 differentially expressed genes changed in opposite directions. Pathway enrichment analysis performed using MetaCore confirmed numerous IPF relevant genes and pathways including extracellular remodeling, TGF-beta, and WNT. Gene network analysis of MMP7, a highly differentially expressed gene in both datasets, revealed the same canonical pathways and gene network candidates in RNA-Seq and microarray data. For validation by NanoString nCounter® we selected 35 genes that had a fold change of 2 in at least one dataset (10 discordant, 10 significantly differentially expressed in one dataset only and 15 concordant genes). High concordance of fold change and FDR was observed for each type of the samples (FF vs FFPE) with both microarrays (r = 0.92) and RNA-Seq (r = 0.90) and the number of discordant genes was reduced to four. Our results demonstrate that RNA sequencing of RNA obtained from archived FFPE lung tissues is feasible. The results obtained from FFPE tissue are highly comparable to FF tissues. The ability to perform RNA-Seq on archived FFPE IPF tissues should greatly enhance the availability of tissue biopsies for research in IPF.

  9. Thermodynamic Constraints Improve Metabolic Networks.

    PubMed

    Krumholz, Elias W; Libourel, Igor G L

    2017-08-08

    In pursuit of establishing a realistic metabolic phenotypic space, the reversibility of reactions is thermodynamically constrained in modern metabolic networks. The reversibility constraints follow from heuristic thermodynamic poise approximations that take anticipated cellular metabolite concentration ranges into account. Because constraints reduce the feasible space, draft metabolic network reconstructions may need more extensive reconciliation, and a larger number of genes may become essential. Notwithstanding ubiquitous application, the effect of reversibility constraints on the predictive capabilities of metabolic networks has not been investigated in detail. Instead, work has focused on the implementation and validation of the thermodynamic poise calculation itself. With the advance of fast linear programming-based network reconciliation, the effects of reversibility constraints on network reconciliation and gene essentiality predictions have become feasible and are the subject of this study. Networks with thermodynamically informed reversibility constraints outperformed gene essentiality predictions compared to networks that were constrained with randomly shuffled constraints. Unconstrained networks predicted gene essentiality as accurately as thermodynamically constrained networks, but predicted substantially fewer essential genes. Networks that were reconciled with sequence similarity data and strongly enforced reversibility constraints outperformed all other networks. We conclude that metabolic network analysis confirmed the validity of the thermodynamic constraints, and that thermodynamic poise information is actionable during network reconciliation. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  10. STARNET 2: a web-based tool for accelerating discovery of gene regulatory networks using microarray co-expression data

    PubMed Central

    Jupiter, Daniel; Chen, Hailin; VanBuren, Vincent

    2009-01-01

    Background Although expression microarrays have become a standard tool used by biologists, analysis of data produced by microarray experiments may still present challenges. Comparison of data from different platforms, organisms, and labs may involve complicated data processing, and inferring relationships between genes remains difficult. Results STARNET 2 is a new web-based tool that allows post hoc visual analysis of correlations that are derived from expression microarray data. STARNET 2 facilitates user discovery of putative gene regulatory networks in a variety of species (human, rat, mouse, chicken, zebrafish, Drosophila, C. elegans, S. cerevisiae, Arabidopsis and rice) by graphing networks of genes that are closely co-expressed across a large heterogeneous set of preselected microarray experiments. For each of the represented organisms, raw microarray data were retrieved from NCBI's Gene Expression Omnibus for a selected Affymetrix platform. All pairwise Pearson correlation coefficients were computed for expression profiles measured on each platform, respectively. These precompiled results were stored in a MySQL database, and supplemented by additional data retrieved from NCBI. A web-based tool allows user-specified queries of the database, centered at a gene of interest. The result of a query includes graphs of correlation networks, graphs of known interactions involving genes and gene products that are present in the correlation networks, and initial statistical analyses. Two analyses may be performed in parallel to compare networks, which is facilitated by the new HEATSEEKER module. Conclusion STARNET 2 is a useful tool for developing new hypotheses about regulatory relationships between genes and gene products, and has coverage for 10 species. Interpretation of the correlation networks is supported with a database of previously documented interactions, a test for enrichment of Gene Ontology terms, and heat maps of correlation distances that may be used to compare two networks. The list of genes in a STARNET network may be useful in developing a list of candidate genes to use for the inference of causal networks. The tool is freely available at , and does not require user registration. PMID:19828039

  11. The rapidly emerging ESBL-producing Escherichia coli O25-ST131 clone carries LPS core synthesis genes of the K-12 type.

    PubMed

    Szijártó, Valéria; Pal, Tibor; Nagy, Gabor; Nagy, Eszter; Ghazawi, Akela; al-Haj, Mohammed; El Kurdi, Sylvia; Sonnevend, Agnes

    2012-07-01

    The clone Escherichia coli O25 ST131, typically producing extended-spectrum beta-lactamases (ESBLs), has spread globally and became the dominant type among extraintestinal isolates at many parts of the world. However, the reasons behind the emergence and success of this clone are only partially understood. We compared the core type genes by PCR of ESBL-producing and ESBL-nonproducing strains isolated from urinary tract infections in the United Arab Emirates and found a surprisingly high frequency of the K-12 core type (44.6%) among members of the former group, while in the latter one, it was as low (3.7%), as reported earlier. The high figure was almost entirely attributable to the presence of members of the clone O25 ST131 among ESBL producers. Strains from the same clone isolated in Europe also carried the K-12 core type genes. Sequencing the entire core operon of an O25 ST131 isolate revealed a high level of similarity to known K-12 core gene sequences and an almost complete identity with a recently sequenced non-O25 ST131 fecal isolate. The exact chemical structure and whether and how this unusual core type contributed to the sudden emergence of ST131 require further investigations. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  12. Comparative Genomics of Bacteriophage of the Genus Seuratvirus

    PubMed Central

    Sazinas, Pavelas; Redgwell, Tamsin; Rihtman, Branko; Grigonyte, Aurelija; Michniewski, Slawomir; Scanlan, David J; Hobman, Jon

    2018-01-01

    Abstract Despite being more abundant and having smaller genomes than their bacterial host, relatively few bacteriophages have had their genomes sequenced. Here, we isolated 14 bacteriophages from cattle slurry and performed de novo genome sequencing, assembly, and annotation. The commonly used marker genes polB and terL showed these bacteriophages to be closely related to members of the genus Seuratvirus. We performed a core-gene analysis using the 14 new and four closely related genomes. A total of 58 core genes were identified, the majority of which has no known function. These genes were used to construct a core-gene phylogeny, the results of which confirmed the new isolates to be part of the genus Seuratvirus and expanded the number of species within this genus to four. All bacteriophages within the genus contained the genes queCDE encoding enzymes involved in queuosine biosynthesis. We suggest these genes are carried as a mechanism to modify DNA in order to protect these bacteriophages against host endonucleases. PMID:29272407

  13. Community core detection in transportation networks

    NASA Astrophysics Data System (ADS)

    De Leo, Vincenzo; Santoboni, Giovanni; Cerina, Federica; Mureddu, Mario; Secchi, Luca; Chessa, Alessandro

    2013-10-01

    This work analyzes methods for the identification and the stability under perturbation of a territorial community structure with specific reference to transportation networks. We considered networks of commuters for a city and an insular region. In both cases, we have studied the distribution of commuters’ trips (i.e., home-to-work trips and vice versa). The identification and stability of the communities’ cores are linked to the land-use distribution within the zone system, and therefore their proper definition may be useful to transport planners.

  14. On construction of stochastic genetic networks based on gene expression sequences.

    PubMed

    Ching, Wai-Ki; Ng, Michael M; Fung, Eric S; Akutsu, Tatsuya

    2005-08-01

    Reconstruction of genetic regulatory networks from time series data of gene expression patterns is an important research topic in bioinformatics. Probabilistic Boolean Networks (PBNs) have been proposed as an effective model for gene regulatory networks. PBNs are able to cope with uncertainty, corporate rule-based dependencies between genes and discover the sensitivity of genes in their interactions with other genes. However, PBNs are unlikely to use directly in practice because of huge amount of computational cost for obtaining predictors and their corresponding probabilities. In this paper, we propose a multivariate Markov model for approximating PBNs and describing the dynamics of a genetic network for gene expression sequences. The main contribution of the new model is to preserve the strength of PBNs and reduce the complexity of the networks. The number of parameters of our proposed model is O(n2) where n is the number of genes involved. We also develop efficient estimation methods for solving the model parameters. Numerical examples on synthetic data sets and practical yeast data sequences are given to demonstrate the effectiveness of the proposed model.

  15. Meeting the memory challenges of brain-scale network simulation.

    PubMed

    Kunkel, Susanne; Potjans, Tobias C; Eppler, Jochen M; Plesser, Hans Ekkehard; Morrison, Abigail; Diesmann, Markus

    2011-01-01

    The development of high-performance simulation software is crucial for studying the brain connectome. Using connectome data to generate neurocomputational models requires software capable of coping with models on a variety of scales: from the microscale, investigating plasticity, and dynamics of circuits in local networks, to the macroscale, investigating the interactions between distinct brain regions. Prior to any serious dynamical investigation, the first task of network simulations is to check the consistency of data integrated in the connectome and constrain ranges for yet unknown parameters. Thanks to distributed computing techniques, it is possible today to routinely simulate local cortical networks of around 10(5) neurons with up to 10(9) synapses on clusters and multi-processor shared-memory machines. However, brain-scale networks are orders of magnitude larger than such local networks, in terms of numbers of neurons and synapses as well as in terms of computational load. Such networks have been investigated in individual studies, but the underlying simulation technologies have neither been described in sufficient detail to be reproducible nor made publicly available. Here, we discover that as the network model sizes approach the regime of meso- and macroscale simulations, memory consumption on individual compute nodes becomes a critical bottleneck. This is especially relevant on modern supercomputers such as the Blue Gene/P architecture where the available working memory per CPU core is rather limited. We develop a simple linear model to analyze the memory consumption of the constituent components of neuronal simulators as a function of network size and the number of cores used. This approach has multiple benefits. The model enables identification of key contributing components to memory saturation and prediction of the effects of potential improvements to code before any implementation takes place. As a consequence, development cycles can be shorter and less expensive. Applying the model to our freely available Neural Simulation Tool (NEST), we identify the software components dominant at different scales, and develop general strategies for reducing the memory consumption, in particular by using data structures that exploit the sparseness of the local representation of the network. We show that these adaptations enable our simulation software to scale up to the order of 10,000 processors and beyond. As memory consumption issues are likely to be relevant for any software dealing with complex connectome data on such architectures, our approach and our findings should be useful for researchers developing novel neuroinformatics solutions to the challenges posed by the connectome project.

  16. Conserved Curvature of RNA Polymerase I Core Promoter Beyond rRNA Genes: The Case of the Tritryps

    PubMed Central

    Smircich, Pablo; Duhagon, María Ana; Garat, Beatriz

    2015-01-01

    In trypanosomatids, the RNA polymerase I (RNAPI)-dependent promoters controlling the ribosomal RNA (rRNA) genes have been well identified. Although the RNAPI transcription machinery recognizes the DNA conformation instead of the DNA sequence of promoters, no conformational study has been reported for these promoters. Here we present the in silico analysis of the intrinsic DNA curvature of the rRNA gene core promoters in Trypanosoma brucei, Trypanosoma cruzi, and Leishmania major. We found that, in spite of the absence of sequence conservation, these promoters hold conformational properties similar to other eukaryotic rRNA promoters. Our results also indicated that the intrinsic DNA curvature pattern is conserved within the Leishmania genus and also among strains of T. cruzi and T. brucei. Furthermore, we analyzed the impact of point mutations on the intrinsic curvature and their impact on the promoter activity. Furthermore, we found that the core promoters of protein-coding genes transcribed by RNAPI in T. brucei show the same conserved conformational characteristics. Overall, our results indicate that DNA intrinsic curvature of the rRNA gene core promoters is conserved in these ancient eukaryotes and such conserved curvature might be a requirement of RNAPI machinery for transcription of not only rRNA genes but also protein-coding genes. PMID:26718450

  17. Systematic identification of an integrative network module during senescence from time-series gene expression.

    PubMed

    Park, Chihyun; Yun, So Jeong; Ryu, Sung Jin; Lee, Soyoung; Lee, Young-Sam; Yoon, Youngmi; Park, Sang Chul

    2017-03-15

    Cellular senescence irreversibly arrests growth of human diploid cells. In addition, recent studies have indicated that senescence is a multi-step evolving process related to important complex biological processes. Most studies analyzed only the genes and their functions representing each senescence phase without considering gene-level interactions and continuously perturbed genes. It is necessary to reveal the genotypic mechanism inferred by affected genes and their interaction underlying the senescence process. We suggested a novel computational approach to identify an integrative network which profiles an underlying genotypic signature from time-series gene expression data. The relatively perturbed genes were selected for each time point based on the proposed scoring measure denominated as perturbation scores. Then, the selected genes were integrated with protein-protein interactions to construct time point specific network. From these constructed networks, the conserved edges across time point were extracted for the common network and statistical test was performed to demonstrate that the network could explain the phenotypic alteration. As a result, it was confirmed that the difference of average perturbation scores of common networks at both two time points could explain the phenotypic alteration. We also performed functional enrichment on the common network and identified high association with phenotypic alteration. Remarkably, we observed that the identified cell cycle specific common network played an important role in replicative senescence as a key regulator. Heretofore, the network analysis from time series gene expression data has been focused on what topological structure was changed over time point. Conversely, we focused on the conserved structure but its context was changed in course of time and showed it was available to explain the phenotypic changes. We expect that the proposed method will help to elucidate the biological mechanism unrevealed by the existing approaches.

  18. “Guilt by Association” Is the Exception Rather Than the Rule in Gene Networks

    PubMed Central

    Gillis, Jesse; Pavlidis, Paul

    2012-01-01

    Gene networks are commonly interpreted as encoding functional information in their connections. An extensively validated principle called guilt by association states that genes which are associated or interacting are more likely to share function. Guilt by association provides the central top-down principle for analyzing gene networks in functional terms or assessing their quality in encoding functional information. In this work, we show that functional information within gene networks is typically concentrated in only a very few interactions whose properties cannot be reliably related to the rest of the network. In effect, the apparent encoding of function within networks has been largely driven by outliers whose behaviour cannot even be generalized to individual genes, let alone to the network at large. While experimentalist-driven analysis of interactions may use prior expert knowledge to focus on the small fraction of critically important data, large-scale computational analyses have typically assumed that high-performance cross-validation in a network is due to a generalizable encoding of function. Because we find that gene function is not systemically encoded in networks, but dependent on specific and critical interactions, we conclude it is necessary to focus on the details of how networks encode function and what information computational analyses use to extract functional meaning. We explore a number of consequences of this and find that network structure itself provides clues as to which connections are critical and that systemic properties, such as scale-free-like behaviour, do not map onto the functional connectivity within networks. PMID:22479173

  19. Small Core, Big Network: A Comprehensive Approach to GIS Teaching Practice Based on Digital Three-Dimensional Campus Reconstruction

    ERIC Educational Resources Information Center

    Cheng, Liang; Zhang, Wen; Wang, Jiechen; Li, Manchun; Zhong, Lishan

    2014-01-01

    Geographic information science (GIS) features a wide range of disciplines and has broad applicability. Challenges associated with rapidly developing GIS technology and the currently limited teaching and practice materials hinder universities from cultivating highly skilled GIS graduates. Based on the idea of "small core, big network," a…

  20. GENE EXPRESSION NETWORKS

    EPA Science Inventory

    "Gene expression network" is the term used to describe the interplay, simple or complex, between two or more gene products in performing a specific cellular function. Although the delineation of such networks is complicated by the existence of multiple and subtle types of intera...

  1. Gene regulatory networks and the underlying biology of developmental toxicity

    EPA Science Inventory

    Embryonic cells are specified by large-scale networks of functionally linked regulatory genes. Knowledge of the relevant gene regulatory networks is essential for understanding phenotypic heterogeneity that emerges from disruption of molecular functions, cellular processes or sig...

  2. Finding novel relationships with integrated gene-gene association network analysis of Synechocystis sp. PCC 6803 using species-independent text-mining.

    PubMed

    Kreula, Sanna M; Kaewphan, Suwisa; Ginter, Filip; Jones, Patrik R

    2018-01-01

    The increasing move towards open access full-text scientific literature enhances our ability to utilize advanced text-mining methods to construct information-rich networks that no human will be able to grasp simply from 'reading the literature'. The utility of text-mining for well-studied species is obvious though the utility for less studied species, or those with no prior track-record at all, is not clear. Here we present a concept for how advanced text-mining can be used to create information-rich networks even for less well studied species and apply it to generate an open-access gene-gene association network resource for Synechocystis sp. PCC 6803, a representative model organism for cyanobacteria and first case-study for the methodology. By merging the text-mining network with networks generated from species-specific experimental data, network integration was used to enhance the accuracy of predicting novel interactions that are biologically relevant. A rule-based algorithm (filter) was constructed in order to automate the search for novel candidate genes with a high degree of likely association to known target genes by (1) ignoring established relationships from the existing literature, as they are already 'known', and (2) demanding multiple independent evidences for every novel and potentially relevant relationship. Using selected case studies, we demonstrate the utility of the network resource and filter to ( i ) discover novel candidate associations between different genes or proteins in the network, and ( ii ) rapidly evaluate the potential role of any one particular gene or protein. The full network is provided as an open-source resource.

  3. Gene expression complex networks: synthesis, identification, and analysis.

    PubMed

    Lopes, Fabrício M; Cesar, Roberto M; Costa, Luciano Da F

    2011-10-01

    Thanks to recent advances in molecular biology, allied to an ever increasing amount of experimental data, the functional state of thousands of genes can now be extracted simultaneously by using methods such as cDNA microarrays and RNA-Seq. Particularly important related investigations are the modeling and identification of gene regulatory networks from expression data sets. Such a knowledge is fundamental for many applications, such as disease treatment, therapeutic intervention strategies and drugs design, as well as for planning high-throughput new experiments. Methods have been developed for gene networks modeling and identification from expression profiles. However, an important open problem regards how to validate such approaches and its results. This work presents an objective approach for validation of gene network modeling and identification which comprises the following three main aspects: (1) Artificial Gene Networks (AGNs) model generation through theoretical models of complex networks, which is used to simulate temporal expression data; (2) a computational method for gene network identification from the simulated data, which is founded on a feature selection approach where a target gene is fixed and the expression profile is observed for all other genes in order to identify a relevant subset of predictors; and (3) validation of the identified AGN-based network through comparison with the original network. The proposed framework allows several types of AGNs to be generated and used in order to simulate temporal expression data. The results of the network identification method can then be compared to the original network in order to estimate its properties and accuracy. Some of the most important theoretical models of complex networks have been assessed: the uniformly-random Erdös-Rényi (ER), the small-world Watts-Strogatz (WS), the scale-free Barabási-Albert (BA), and geographical networks (GG). The experimental results indicate that the inference method was sensitive to average degree variation, decreasing its network recovery rate with the increase of . The signal size was important for the inference method to get better accuracy in the network identification rate, presenting very good results with small expression profiles. However, the adopted inference method was not sensible to recognize distinct structures of interaction among genes, presenting a similar behavior when applied to different network topologies. In summary, the proposed framework, though simple, was adequate for the validation of the inferred networks by identifying some properties of the evaluated method, which can be extended to other inference methods.

  4. Ontology-based literature mining of E. coli vaccine-associated gene interaction networks.

    PubMed

    Hur, Junguk; Özgür, Arzucan; He, Yongqun

    2017-03-14

    Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of these gene interaction networks identified top ranked E. coli genes and 6 INO interaction types (e.g., regulation and gene expression). Vaccine-related E. coli gene-gene interaction network was constructed using ontology-based literature mining strategy, which identified important E. coli vaccine genes and their interactions with other genes through specific interaction types.

  5. Evaluation of the streamflow-gaging network of Texas and a proposed core network

    USGS Publications Warehouse

    Slade, Raymond M.; Howard, Teresa; Anaya, Roberto

    2001-01-01

    The U.S. Geological Survey streamflowgaging network in Texas is operated as part of the National Streamgaging Program and is jointly funded by the Geological Survey and Federal, State, and local agencies. This report documents an evaluation of the existing (as of October 1, 1999) network with regard to four major objectives of streamflow data; and on the basis of that evaluation, proposes a core network of streamflowgaging stations that best meets those objectives. The objectives are (1) regionalization (estimate flows or flow characteristics at ungaged sites in 11 hydrologically similar regions), (2) major flow (obtain flow rates and volumes in large streams), (3) outflow from the State (account for streamflow leaving the State), and (4) streamflow conditions assessment (assess current conditions with regard to long-term data, and define temporal trends in flow). The network analysis resulted in a proposed core network of 263 stations. Of those 263 stations, 43 were discontinued as of October 1, 1999, and 15 were partial-record stations. Fifty-five of the proposed core-network stations meet two of the four major objectives, 16 stations meet three objectives, and 1 station meets all four. One-hundred eighty-five stations with a median record length of 33 years were selected to meet the regionalization objective. Ninety-two stations with a median record length of about 62 years were selected to meet the major-flow objective. Twenty-six stations with a median record length of 59 years were selected to meet the outflow from the State objective. Fifty stations with a median record length of 53 years were selected to meet the streamflow conditions assessment objective.

  6. The role of interpersonal communication in the process of knowledge mobilization within a community-based organization: a network analysis.

    PubMed

    Gainforth, Heather L; Latimer-Cheung, Amy E; Athanasopoulos, Peter; Moore, Spencer; Ginis, Kathleen A Martin

    2014-05-22

    Diffusion of innovations theory has been widely used to explain knowledge mobilization of research findings. This theory posits that individuals who are more interpersonally connected within an organization may be more likely to adopt an innovation (e.g., research evidence) than individuals who are less interconnected. Research examining this tenet of diffusion of innovations theory in the knowledge mobilization literature is limited. The purpose of the present study was to use network analysis to examine the role of interpersonal communication in the adoption and mobilization of the physical activity guidelines for people with spinal cord injury (SCI) among staff in a community-based organization (CBO). The study used a cross-sectional, whole-network design. In total, 56 staff completed the network survey. Adoption of the guidelines was assessed using Rogers' innovation-decision process and interpersonal communication was assessed using an online network instrument. The patterns of densities observed within the network were indicative of a core-periphery structure revealing that interpersonal communication was greater within the core than between the core and periphery and within the periphery. Membership in the core, as opposed to membership in the periphery, was associated with greater knowledge of the evidence-based physical activity resources available and engagement in physical activity promotion behaviours (ps < 0.05). Greater in-degree centrality was associated with adoption of evidence-based behaviours (p < 0.05). Findings suggest that interpersonal communication is associated with knowledge mobilization and highlight how the network structure could be improved for further dissemination efforts. diffusion of innovations; network analysis; community-based organization; knowledge mobilization; knowledge translation, interpersonal communication.

  7. Selection Shapes Transcriptional Logic and Regulatory Specialization in Genetic Networks

    PubMed Central

    Fogelmark, Karl; Peterson, Carsten; Troein, Carl

    2016-01-01

    Background Living organisms need to regulate their gene expression in response to environmental signals and internal cues. This is a computational task where genes act as logic gates that connect to form transcriptional networks, which are shaped at all scales by evolution. Large-scale mutations such as gene duplications and deletions add and remove network components, whereas smaller mutations alter the connections between them. Selection determines what mutations are accepted, but its importance for shaping the resulting networks has been debated. Methodology To investigate the effects of selection in the shaping of transcriptional networks, we derive transcriptional logic from a combinatorially powerful yet tractable model of the binding between DNA and transcription factors. By evolving the resulting networks based on their ability to function as either a simple decision system or a circadian clock, we obtain information on the regulation and logic rules encoded in functional transcriptional networks. Comparisons are made between networks evolved for different functions, as well as with structurally equivalent but non-functional (neutrally evolved) networks, and predictions are validated against the transcriptional network of E. coli. Principal Findings We find that the logic rules governing gene expression depend on the function performed by the network. Unlike the decision systems, the circadian clocks show strong cooperative binding and negative regulation, which achieves tight temporal control of gene expression. Furthermore, we find that transcription factors act preferentially as either activators or repressors, both when binding multiple sites for a single target gene and globally in the transcriptional networks. This separation into positive and negative regulators requires gene duplications, which highlights the interplay between mutation and selection in shaping the transcriptional networks. PMID:26927540

  8. Finding pathway-modulating genes from a novel Ontology Fingerprint-derived gene network.

    PubMed

    Qin, Tingting; Matmati, Nabil; Tsoi, Lam C; Mohanty, Bidyut K; Gao, Nan; Tang, Jijun; Lawson, Andrew B; Hannun, Yusuf A; Zheng, W Jim

    2014-10-01

    To enhance our knowledge regarding biological pathway regulation, we took an integrated approach, using the biomedical literature, ontologies, network analyses and experimental investigation to infer novel genes that could modulate biological pathways. We first constructed a novel gene network via a pairwise comparison of all yeast genes' Ontology Fingerprints--a set of Gene Ontology terms overrepresented in the PubMed abstracts linked to a gene along with those terms' corresponding enrichment P-values. The network was further refined using a Bayesian hierarchical model to identify novel genes that could potentially influence the pathway activities. We applied this method to the sphingolipid pathway in yeast and found that many top-ranked genes indeed displayed altered sphingolipid pathway functions, initially measured by their sensitivity to myriocin, an inhibitor of de novo sphingolipid biosynthesis. Further experiments confirmed the modulation of the sphingolipid pathway by one of these genes, PFA4, encoding a palmitoyl transferase. Comparative analysis showed that few of these novel genes could be discovered by other existing methods. Our novel gene network provides a unique and comprehensive resource to study pathway modulations and systems biology in general. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Discovering hidden relationships between renal diseases and regulated genes through 3D network visualizations

    PubMed Central

    2010-01-01

    Background In a recent study, two-dimensional (2D) network layouts were used to visualize and quantitatively analyze the relationship between chronic renal diseases and regulated genes. The results revealed complex relationships between disease type, gene specificity, and gene regulation type, which led to important insights about the underlying biological pathways. Here we describe an attempt to extend our understanding of these complex relationships by reanalyzing the data using three-dimensional (3D) network layouts, displayed through 2D and 3D viewing methods. Findings The 3D network layout (displayed through the 3D viewing method) revealed that genes implicated in many diseases (non-specific genes) tended to be predominantly down-regulated, whereas genes regulated in a few diseases (disease-specific genes) tended to be up-regulated. This new global relationship was quantitatively validated through comparison to 1000 random permutations of networks of the same size and distribution. Our new finding appeared to be the result of using specific features of the 3D viewing method to analyze the 3D renal network. Conclusions The global relationship between gene regulation and gene specificity is the first clue from human studies that there exist common mechanisms across several renal diseases, which suggest hypotheses for the underlying mechanisms. Furthermore, the study suggests hypotheses for why the 3D visualization helped to make salient a new regularity that was difficult to detect in 2D. Future research that tests these hypotheses should enable a more systematic understanding of when and how to use 3D network visualizations to reveal complex regularities in biological networks. PMID:21070623

  10. The effects of graded levels of calorie restriction: VII. Topological rearrangement of hypothalamic aging networks.

    PubMed

    Derous, Davina; Mitchell, Sharon E; Green, Cara L; Wang, Yingchun; Han, Jing Dong J; Chen, Luonan; Promislow, Daniel E L; Lusseau, David; Speakman, John R; Douglas, Alex

    2016-05-01

    Connectivity in a gene-gene network declines with age, typically within gene clusters. We explored the effect of short-term (3 months) graded calorie restriction (CR) (up to 40 %) on network structure of aging-associated genes in the murine hypothalamus by using conditional mutual information. The networks showed a topological rearrangement when exposed to graded CR with a higher relative within cluster connectivity at 40CR. We observed changes in gene centrality concordant with changes in CR level, with Ppargc1a, and Ppt1 having increased centrality and Etfdh, Traf3 and Abcc1 decreased centrality as CR increased. This change in gene centrality in a graded manner with CR, occurred in the absence of parallel changes in gene expression levels. This study emphasizes the importance of augmenting traditional differential gene expression analyses to better understand structural changes in the transcriptome. Overall our results suggested that CR induced changes in centrality of biological relevant genes that play an important role in preventing the age-associated loss of network integrity irrespective of their gene expression levels.

  11. The effects of graded levels of calorie restriction: VII. Topological rearrangement of hypothalamic aging networks

    PubMed Central

    Derous, Davina; Mitchell, Sharon E.; Green, Cara L.; Wang, Yingchun; Han, Jing Dong J.; Chen, Luonan; Promislow, Daniel E.L.; Lusseau, David; Speakman, John R.; Douglas, Alex

    2016-01-01

    Connectivity in a gene-gene network declines with age, typically within gene clusters. We explored the effect of short-term (3 months) graded calorie restriction (CR) (up to 40 %) on network structure of aging-associated genes in the murine hypothalamus by using conditional mutual information. The networks showed a topological rearrangement when exposed to graded CR with a higher relative within cluster connectivity at 40CR. We observed changes in gene centrality concordant with changes in CR level, with Ppargc1a, and Ppt1 having increased centrality and Etfdh, Traf3 and Abcc1 decreased centrality as CR increased. This change in gene centrality in a graded manner with CR, occurred in the absence of parallel changes in gene expression levels. This study emphasizes the importance of augmenting traditional differential gene expression analyses to better understand structural changes in the transcriptome. Overall our results suggested that CR induced changes in centrality of biological relevant genes that play an important role in preventing the age-associated loss of network integrity irrespective of their gene expression levels. PMID:27115072

  12. Gene Discovery of Characteristic Metabolic Pathways in the Tea Plant (Camellia sinensis) Using ‘Omics’-Based Network Approaches: A Future Perspective

    PubMed Central

    Zhang, Shihua; Zhang, Liang; Tai, Yuling; Wang, Xuewen; Ho, Chi-Tang; Wan, Xiaochun

    2018-01-01

    Characteristic secondary metabolites, including flavonoids, theanine and caffeine, in the tea plant (Camellia sinensis) are the primary sources of the rich flavors, fresh taste, and health benefits of tea. The decoding of genes involved in these characteristic components is still significantly lagging, which lays an obstacle for applied genetic improvement and metabolic engineering. With the popularity of high-throughout transcriptomics and metabolomics, ‘omics’-based network approaches, such as gene co-expression network and gene-to-metabolite network, have emerged as powerful tools for gene discovery of plant-specialized (secondary) metabolism. Thus, it is pivotal to summarize and introduce such system-based strategies in facilitating gene identification of characteristic metabolic pathways in the tea plant (or other plants). In this review, we describe recent advances in transcriptomics and metabolomics for transcript and metabolite profiling, and highlight ‘omics’-based network strategies using successful examples in model and non-model plants. Further, we summarize recent progress in ‘omics’ analysis for gene identification of characteristic metabolites in the tea plant. Limitations of the current strategies are discussed by comparison with ‘omics’-based network approaches. Finally, we demonstrate the potential of introducing such network strategies in the tea plant, with a prospects ending for a promising network discovery of characteristic metabolite genes in the tea plant. PMID:29915604

  13. An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

    PubMed

    Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

    2017-04-12

    Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs) (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.

  14. Investigating the Effects of Imputation Methods for Modelling Gene Networks Using a Dynamic Bayesian Network from Gene Expression Data

    PubMed Central

    CHAI, Lian En; LAW, Chow Kuan; MOHAMAD, Mohd Saberi; CHONG, Chuii Khim; CHOON, Yee Wen; DERIS, Safaai; ILLIAS, Rosli Md

    2014-01-01

    Background: Gene expression data often contain missing expression values. Therefore, several imputation methods have been applied to solve the missing values, which include k-nearest neighbour (kNN), local least squares (LLS), and Bayesian principal component analysis (BPCA). However, the effects of these imputation methods on the modelling of gene regulatory networks from gene expression data have rarely been investigated and analysed using a dynamic Bayesian network (DBN). Methods: In the present study, we separately imputed datasets of the Escherichia coli S.O.S. DNA repair pathway and the Saccharomyces cerevisiae cell cycle pathway with kNN, LLS, and BPCA, and subsequently used these to generate gene regulatory networks (GRNs) using a discrete DBN. We made comparisons on the basis of previous studies in order to select the gene network with the least error. Results: We found that BPCA and LLS performed better on larger networks (based on the S. cerevisiae dataset), whereas kNN performed better on smaller networks (based on the E. coli dataset). Conclusion: The results suggest that the performance of each imputation method is dependent on the size of the dataset, and this subsequently affects the modelling of the resultant GRNs using a DBN. In addition, on the basis of these results, a DBN has the capacity to discover potential edges, as well as display interactions, between genes. PMID:24876803

  15. Rhythmogenic neuronal networks, emergent leaders, and k-cores.

    PubMed

    Schwab, David J; Bruinsma, Robijn F; Feldman, Jack L; Levine, Alex J

    2010-11-01

    Neuronal network behavior results from a combination of the dynamics of individual neurons and the connectivity of the network that links them together. We study a simplified model, based on the proposal of Feldman and Del Negro (FDN) [Nat. Rev. Neurosci. 7, 232 (2006)], of the preBötzinger Complex, a small neuronal network that participates in the control of the mammalian breathing rhythm through periodic firing bursts. The dynamics of this randomly connected network of identical excitatory neurons differ from those of a uniformly connected one. Specifically, network connectivity determines the identity of emergent leader neurons that trigger the firing bursts. When neuronal desensitization is controlled by the number of input signals to the neurons (as proposed by FDN), the network's collective desensitization--required for successful burst termination--is mediated by k-core clusters of neurons.

  16. Cationic niosomes an effective gene carrier composed of novel spermine-derivative cationic lipids: effect of central core structures.

    PubMed

    Opanasopit, Praneet; Leksantikul, Lalita; Niyomtham, Nattisa; Rojanarata, Theerasak; Ngawhirunpat, Tanasait; Yingyongnarongkul, Boon-Ek

    2017-05-01

    Cationic niosomes formulated from Span 20, cholesterol (Chol) and novel spermine-based cationic lipids of multiple central core structures (di(oxyethyl)amino, di(oxyethyl)amino carboxy, 3-amino-1,2-dioxypropyl and 2-amino-1,3-dioxypropyl) were successfully prepared for improving transfection efficiency in vitro. The niosomes composed of spermine cationic lipid with central core structure of di(oxyethyl)amino revealed the highest gene transfection efficiency. To investigate the factors affecting gene transfection and cell viability including differences in the central core structures of cationic lipids, the composition of vesicles, molar ratio of cationic lipids in formulations and the weight ratio of niosomes to DNA. Cationic niosomes composed of nonionic surfactants (Span20), cholesterol and spermine-based cationic lipids of multiple central core structures were formulated. Gene transfection and cell viability were evaluated on a human cervical carcinoma cell line (HeLa cells) using pDNA encoding green fluorescent protein (pEGFP-C2). The morphology, size and charge were also characterized. High transfection efficiency was obtained from cationic niosomes composed of Span20:Chol:cationic lipid at the molar ratio of 2.5:2.5:0.5 mM. Cationic lipids with di(oxyethyl)amino as a central core structure exhibited highest transfection efficiency. In addition, there was also no serum effect on transfection efficiency. These novel cationic niosomes may constitute a good alternative carrier for gene transfection.

  17. Massive-scale gene co-expression network construction and robustness testing using random matrix theory.

    PubMed

    Gibson, Scott M; Ficklin, Stephen P; Isaacson, Sven; Luo, Feng; Feltus, Frank A; Smith, Melissa C

    2013-01-01

    The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust.

  18. Weighted gene co‑expression network analysis in identification of key genes and networks for ischemic‑reperfusion remodeling myocardium.

    PubMed

    Guo, Nan; Zhang, Nan; Yan, Liqiu; Lian, Zheng; Wang, Jiawang; Lv, Fengfeng; Wang, Yunfei; Cao, Xufen

    2018-06-14

    Acute myocardial infarction induces ventricular remodeling, which is implicated in dilated heart and heart failure. The pathogenical mechanism of myocardium remodeling remains to be elucidated. The aim of the present study was to identify key genes and networks for myocardium remodeling following ischemia‑reperfusion (IR). First, the mRNA expression data from the National Center for Biotechnology Information database were downloaded to identify differences in mRNA expression of the IR heart at days 2 and 7. Then, weighted gene co‑expression network analysis, hierarchical clustering, protein‑protein interaction (PPI) network, Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway were used to identify key genes and networks for the heart remodeling process following IR. A total of 3,321 differentially expressed genes were identified during the heart remodeling process. A total of 6 modules were identified through gene co‑expression network analysis. GO and KEGG analysis results suggested that each module represented a different biological function and was associated with different pathways. Finally, hub genes of each module were identified by PPI network construction. The present study revealed that heart remodeling following IR is a complicated process, involving extracellular matrix organization, neural development, apoptosis and energy metabolism. The dysregulated genes, including SRC proto‑oncogene, non‑receptor tyrosine kinase, discs large MAGUK scaffold protein 1, ATP citrate lyase, RAN, member RAS oncogene family, tumor protein p53, and polo like kinase 2, may be essential for heart remodeling following IR and may be used as potential targets for the inhibition of heart remodeling following acute myocardial infarction.

  19. Arabidopsis CML38, a Calcium Sensor That Localizes to Ribonucleoprotein Complexes under Hypoxia Stress1[OPEN

    PubMed Central

    McClintock, Carlee; Li, Tian

    2016-01-01

    During waterlogging and the associated oxygen deprivation stress, plants respond by the induction of adaptive programs, including the redirected expression of gene networks toward the synthesis of core hypoxia-response proteins. Among these core response proteins in Arabidopsis (Arabidopsis thaliana) is the calcium sensor CML38, a protein related to regulator of gene silencing calmodulin-like proteins (rgsCaMs). CML38 transcripts are up-regulated more than 300-fold in roots within 6 h of hypoxia treatment. Transfer DNA insertional mutants of CML38 show an enhanced sensitivity to hypoxia stress, with lowered survival and more severe inhibition of root and shoot growth. By using yellow fluorescent protein (YFP) translational fusions, CML38 protein was found to be localized to cytosolic granule structures similar in morphology to hypoxia-induced stress granules. Immunoprecipitation of CML38 from the roots of hypoxia-challenged transgenic plants harboring CML38pro::CML38:YFP followed by liquid chromatography-tandem mass spectrometry analysis revealed the presence of protein targets associated with messenger RNA ribonucleoprotein (mRNP) complexes including stress granules, which are known to accumulate as messenger RNA storage and triage centers during hypoxia. This finding is further supported by the colocalization of CML38 with the mRNP stress granule marker RNA Binding Protein 47 (RBP47) upon cotransfection of Nicotiana benthamiana leaves. Ruthenium Red treatment results in the loss of CML38 signal in cytosolic granules, suggesting that calcium is necessary for stress granule association. These results confirm that CML38 is a core hypoxia response calcium sensor protein and suggest that it serves as a potential calcium signaling target within stress granules and other mRNPs that accumulate during flooding stress responses. PMID:26634999

  20. Structural Insights into the Free-Standing Condensation Enzyme SgcC5 Catalyzing Ester-Bond Formation in the Biosynthesis of the Enediyne Antitumor Antibiotic C-1027.

    PubMed

    Chang, Chin-Yuan; Lohman, Jeremy R; Huang, Tingting; Michalska, Karolina; Bigelow, Lance; Rudolf, Jeffrey D; Jedrzejczak, Robert; Yan, Xiaohui; Ma, Ming; Babnigg, Gyorgy; Joachimiak, Andrzej; Phillips, George N; Shen, Ben

    2018-03-21

    C-1027 is a chromoprotein enediyne antitumor antibiotic, consisting of the CagA apoprotein and the C-1027 chromophore. The C-1027 chromophore features a nine-membered enediyne core appended with three peripheral moieties, including an ( S)-3-chloro-5-hydroxy-β-tyrosine. In a convergent biosynthesis of the C-1027 chromophore, the ( S)-3-chloro-5-hydroxy-β-tyrosine moiety is appended to the enediyne core by the free-standing condensation enzyme SgcC5. Unlike canonical condensation domains from the modular nonribosomal peptide synthetases that catalyze amide-bond formation, SgcC5 catalyzes ester-bond formation, as demonstrated in vitro, between SgcC2-tethered ( S)-3-chloro-5-hydroxy-β-tyrosine and ( R)-1-phenyl-1,2-ethanediol, a mimic of the enediyne core as an acceptor substrate. Here, we report that (i) genes encoding SgcC5 homologues are widespread among both experimentally confirmed and bioinformatically predicted enediyne biosynthetic gene clusters, forming a new clade of condensation enzymes, (ii) SgcC5 shares a similar overall structure with the canonical condensation domains but forms a homodimer in solution, the active site of which is located in a cavity rather than a tunnel typically seen in condensation domains, and (iii) the catalytic histidine of SgcC5 activates the 2-hydroxyl group, while a hydrogen-bond network in SgcC5 prefers the R-enantiomer of the acceptor substrate, accounting for the regio- and stereospecific ester-bond formation between SgcC2-tethered ( S)-3-chloro-5-hydroxy-β-tyrosine and ( R)-1-phenyl-1,2-ethanediol upon acid-base catalysis. These findings expand the catalytic repertoire and reveal new insights into the structure and mechanism of condensation enzymes.

Top