Sample records for identifying gene interaction

  1. A Penalized Robust Method for Identifying Gene-Environment Interactions

    PubMed Central

    Shi, Xingjie; Liu, Jin; Huang, Jian; Zhou, Yong; Xie, Yang; Ma, Shuangge

    2015-01-01

    In high-throughput studies, an important objective is to identify gene-environment interactions associated with disease outcomes and phenotypes. Many commonly adopted methods assume specific parametric or semiparametric models, which may be subject to model mis-specification. In addition, they usually use significance level as the criterion for selecting important interactions. In this study, we adopt rank-based estimation, which is much less sensitive to model specification than some of the existing methods and includes several commonly encountered data and models as special cases. Penalization is adopted for the identification of gene-environment interactions. It achieves simultaneous estimation and identification and does not rely on significance level. For computational feasibility, a smoothed rank estimation is further proposed. Simulation shows that under certain scenarios, for example with contaminated or heavy-tailed data, the proposed method can significantly outperform the existing alternatives with more accurate identification. We analyze a lung cancer prognosis study with gene expression measurements under the AFT (accelerated failure time) model. The proposed method identifies interactions different from those using the alternatives. Some of the identified genes have important implications. PMID:24616063

  2. LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network.

    PubMed

    Wei, Pi-Jing; Zhang, Di; Xia, Junfeng; Zheng, Chun-Hou

    2016-12-23

    Cancer is a complex disease characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of next-generation sequencing technology, multiple types of omics data, such as cancer genomic, epigenomic and transcriptomic data, can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contribute to tumorigenesis, from millions of functionally neutral passenger mutations. In this paper, in order to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. Then we integrated the mutation data and expression data into the gene-gene interaction network. Lastly, a greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. Also, it can identify not only frequently mutated drivers, but also rare candidate driver genes.

  3. TGMI: an efficient algorithm for identifying pathway regulators through evaluation of triple-gene mutual interaction

    PubMed Central

    Gunasekara, Chathura; Zhang, Kui; Deng, Wenping; Brown, Laura

    2018-01-01

    Despite their important roles, the regulators for most metabolic pathways and biological processes remain elusive. Presently, methods for identifying metabolic pathway and biological process regulators are intensively sought after. We developed a novel algorithm called triple-gene mutual interaction (TGMI) for identifying these regulators using high-throughput gene expression data. It first calculates the regulatory interactions among triple gene blocks (two pathway genes and one transcription factor (TF)) using conditional mutual information, and then identifies significantly interacting triple genes using a novel mutual interaction measure (MIM), which was substantiated to reflect the strengths of regulatory interactions within each triple gene block. TGMI calculates the MIM for each triple gene block and then examines its statistical significance using bootstrap. Finally, the frequencies of all TFs present in all significantly interacting triple gene blocks are calculated and ranked. We showed that the TFs with higher frequencies were usually genuine pathway regulators upon evaluating multiple pathways in plants, animals and yeast. Comparison of TGMI with several other algorithms demonstrated its higher accuracy. Therefore, TGMI will be a valuable tool that can help biologists to identify regulators of metabolic pathways and biological processes from the rapidly growing volume of high-throughput gene expression data in public repositories. PMID:29579312
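The first TGMI step relies on conditional mutual information within a triple of two pathway genes and one TF. The following is a minimal sketch of a plug-in estimate of I(X; Y | Z) for discretized expression levels. The function name, the discretization into integer levels, and the choice of which gene is conditioned on are assumptions for illustration; the abstract does not define the MIM itself, so it is not reproduced here.

```python
from collections import Counter
from math import log2


def conditional_mutual_information(x, y, z):
    """Plug-in estimate of I(X; Y | Z) in bits for discrete sequences.

    x, y, z are equal-length sequences of discretized expression levels
    (hypothetical inputs, e.g. two pathway genes and one TF).
    """
    n = len(x)
    if n == 0 or not (n == len(y) == len(z)):
        raise ValueError("x, y and z must be non-empty and of equal length")

    # Joint and marginal counts needed by the formula
    # I(X;Y|Z) = sum p(x,y,z) * log2( p(x,y,z) p(z) / (p(x,z) p(y,z)) ).
    c_xyz = Counter(zip(x, y, z))
    c_xz = Counter(zip(x, z))
    c_yz = Counter(zip(y, z))
    c_z = Counter(z)

    cmi = 0.0
    for (xi, yi, zi), n_xyz in c_xyz.items():
        # The sample size n cancels inside the logarithm.
        ratio = n_xyz * c_z[zi] / (c_xz[(xi, zi)] * c_yz[(yi, zi)])
        cmi += (n_xyz / n) * log2(ratio)
    return cmi
```

With a constant conditioning variable the estimate reduces to ordinary mutual information, which gives a quick sanity check on toy data.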

  4. Gene-environment interaction involving recently identified colorectal cancer susceptibility loci

    PubMed Central

    Kantor, Elizabeth D.; Hutter, Carolyn M.; Minnier, Jessica; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cotterchio, Michelle; Du, Mengmeng; Duggan, David; Fuchs, Charles S.; Giovannucci, Edward L.; Gong, Jian; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Jenkins, Mark A.; Jiao, Shuo; Kolonel, Laurence N.; Le Marchand, Loic; Lemire, Mathieu; Ma, Jing; Newcomb, Polly A.; Ochs-Balcom, Heather M.; Pflugeisen, Bethann M.; Potter, John D.; Rudolph, Anja; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; Stelling, Deanna L.; Thomas, Fridtjof; Thornquist, Mark; Ulrich, Cornelia M.; Warnick, Greg S.; Zanke, Brent W.; Peters, Ulrike; Hsu, Li; White, Emily

    2014-01-01

    BACKGROUND Genome-wide association studies have identified several single nucleotide polymorphisms (SNPs) that are associated with risk of colorectal cancer (CRC). Prior research has evaluated the presence of gene-environment interaction involving the first 10 identified susceptibility loci, but little work has been conducted on interaction involving SNPs at recently identified susceptibility loci, including: rs10911251, rs6691170, rs6687758, rs11903757, rs10936599, rs647161, rs1321311, rs719725, rs1665650, rs3824999, rs7136702, rs11169552, rs59336, rs3217810, rs4925386, and rs2423279. METHODS Data on 9160 cases and 9280 controls from the Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) and Colon Cancer Family Registry (CCFR) were used to evaluate the presence of interaction involving the above-listed SNPs and sex, body mass index (BMI), alcohol consumption, smoking, aspirin use, post-menopausal hormone (PMH) use, as well as intake of dietary calcium, dietary fiber, dietary folate, red meat, processed meat, fruit, and vegetables. Interaction was evaluated using a fixed-effects meta-analysis of an efficient Empirical Bayes estimator, and permutation was used to account for multiple comparisons. RESULTS None of the permutation-adjusted p-values reached statistical significance. CONCLUSIONS The associations between recently identified genetic susceptibility loci and CRC are not strongly modified by sex, BMI, alcohol, smoking, aspirin, PMH use, and various dietary factors. IMPACT Results suggest no evidence of strong gene-environment interactions involving the recently identified 16 susceptibility loci for CRC taken one at a time. PMID:24994789

  5. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature.

    PubMed

    Ye, Ning; Yin, Hengfu; Liu, Jingjing; Dai, Xiaogang; Yin, Tongming

    2015-01-01

    The huge amount of gene expression data generated by microarray and next-generation sequencing technologies presents challenges for exploiting their biological meaning. When searching for coexpressed genes, the data mining process is largely affected by the selection of algorithms. Thus, it is highly desirable to provide multiple algorithm options in a user-friendly analytical toolkit for exploring gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, namely the mean, the regression, the delegate, and the ensemble models, to identify coexpressed genes, and enables users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. The utility of this analytical toolkit is demonstrated by analyzing two real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows users to choose among multiple algorithms for analyzing gene expression signatures.

  6. Genome-wide gene by lead exposure interaction analysis identifies UNC5D as a candidate gene for neurodevelopment.

    PubMed

    Wang, Zhaoxi; Claus Henn, Birgit; Wang, Chaolong; Wei, Yongyue; Su, Li; Sun, Ryan; Chen, Han; Wagner, Peter J; Lu, Quan; Lin, Xihong; Wright, Robert; Bellinger, David; Kile, Molly; Mazumdar, Maitreyi; Tellez-Rojo, Martha Maria; Schnaas, Lourdes; Christiani, David C

    2017-07-28

    Neurodevelopment is a complex process involving both genetic and environmental factors. Prenatal exposure to lead (Pb) has been associated with lower performance on neurodevelopmental tests. Adverse neurodevelopmental outcomes are more frequent and/or more severe when toxic exposures interact with genetic susceptibility. To explore possible loci associated with increased susceptibility to prenatal Pb exposure, we performed a genome-wide gene-environment interaction study (GWIS) in young children from Mexico (n = 390) and Bangladesh (n = 497). Prenatal Pb exposure was estimated by cord blood Pb concentration. Neurodevelopment was assessed using the Bayley Scales of Infant Development. We identified a locus on chromosome 8, containing UNC5D, and demonstrated evidence of its genome-wide significance with mental composite scores (rs9642758, p_meta = 4.35 × 10⁻⁶). Within this locus, the joint effects of two independent single nucleotide polymorphisms (SNPs, rs9642758 and rs10503970) had a p-value of 4.38 × 10⁻⁹ for mental composite scores. Correlating GWIS results with in vitro transcriptomic profiles identified one common gene, SLC1A5, which is involved in synaptic function, neuronal development, and excitotoxicity. Further analysis revealed interconnected interactions that formed a large network of 52 genes enriched with oxidative stress genes and neurodevelopmental genes. Our findings suggest that certain genetic polymorphisms within/near genes relevant to neurodevelopment might modify the toxic effects of Pb exposure via oxidative stress.

  7. Meta-analysis identifies gene-by-environment interactions as demonstrated in a study of 4,965 mice.

    PubMed

    Kang, Eun Yong; Han, Buhm; Furlotte, Nicholas; Joo, Jong Wha J; Shih, Diana; Davis, Richard C; Lusis, Aldons J; Eskin, Eleazar

    2014-01-01

    Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta
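The idea that interactions can be read as between-study heterogeneity can be illustrated with the standard heterogeneity quantities used in meta-analysis. The sketch below computes the inverse-variance pooled effect, Cochran's Q, and the DerSimonian-Laird between-study variance for one locus. These are textbook quantities chosen here as an assumption; the abstract does not specify the authors' exact statistic, and the function and argument names are hypothetical.

```python
def heterogeneity(effects, std_errors):
    """Return (pooled_effect, cochran_q, tau_squared) for one locus.

    effects    -- per-study effect estimates of the locus on the trait
    std_errors -- matching per-study standard errors
    Large Q (and tau_squared > 0) indicates that the effect varies across
    studies, i.e. across environments when studies differ in environment.
    """
    k = len(effects)
    if k < 2 or k != len(std_errors):
        raise ValueError("need at least two studies with matching lengths")

    # Inverse-variance weights and fixed-effect pooled estimate.
    weights = [1.0 / se ** 2 for se in std_errors]
    w_sum = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, effects)) / w_sum

    # Cochran's Q: weighted squared deviations from the pooled estimate.
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, effects))

    # DerSimonian-Laird moment estimator of the between-study variance.
    c = w_sum - sum(w * w for w in weights) / w_sum
    tau_squared = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    return pooled, q, tau_squared
```

Under the null of no heterogeneity, Q is conventionally compared with a chi-square distribution with k - 1 degrees of freedom.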

  8. Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 Mice

    PubMed Central

    Joo, Jong Wha J.; Shih, Diana; Davis, Richard C.; Lusis, Aldons J.; Eskin, Eleazar

    2014-01-01

    Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta

  9. Integrative Analysis of GWASs, Human Protein Interaction, and Gene Expression Identified Gene Modules Associated With BMDs

    PubMed Central

    He, Hao; Zhang, Lei; Li, Jian; Wang, Yu-Ping; Zhang, Ji-Gang; Shen, Jie; Guo, Yan-Fang

    2014-01-01

    Context: To date, few systems genetics studies in the bone field have been performed. We designed our study from a systems-level perspective by integrating genome-wide association studies (GWASs), human protein-protein interaction (PPI) network, and gene expression to identify gene modules contributing to osteoporosis risk. Methods: First we searched for modules significantly enriched with bone mineral density (BMD)-associated genes in human PPI network by using 2 large meta-analysis GWAS datasets through a dense module search algorithm. One included 7 individual GWAS samples (Meta7). The other was from the Genetic Factors for Osteoporosis Consortium (GEFOS2). One was assigned as a discovery dataset and the other as an evaluation dataset, and vice versa. Results: In total, 42 modules and 129 modules were identified significantly in both Meta7 and GEFOS2 datasets for femoral neck and spine BMD, respectively. There were 3340 modules identified for hip BMD only in Meta7. As candidate modules, they were assessed for the biological relevance to BMD by gene set enrichment analysis in 2 expression profiles generated from circulating monocytes in subjects with low versus high BMD values. Interestingly, there were 2 modules significantly enriched in monocytes from the low BMD group in both gene expression datasets (nominal P value <.05). Two modules had 16 nonredundant genes. Functional enrichment analysis revealed that both modules were enriched for genes involved in Wnt receptor signaling and osteoblast differentiation. Conclusion: We highlighted 2 modules and novel genes playing important roles in the regulation of bone mass, providing important clues for therapeutic approaches for osteoporosis. PMID:25119315

  10. Gene interactions in the DNA damage-response pathway identified by genome-wide RNA-interference analysis of synthetic lethality

    PubMed Central

    van Haaften, Gijs; Vastenhouw, Nadine L.; Nollen, Ellen A. A.; Plasterk, Ronald H. A.; Tijsterman, Marcel

    2004-01-01

    Here, we describe a systematic search for synthetic gene interactions in a multicellular organism, the nematode Caenorhabditis elegans. We established a high-throughput method to determine synthetic gene interactions by genome-wide RNA interference and identified genes that are required to protect the germ line against DNA double-strand breaks. Besides known DNA-repair proteins such as the C. elegans orthologs of TopBP1, RPA2, and RAD51, eight genes previously unassociated with a double-strand-break response were identified. Knockdown of these genes increased sensitivity to ionizing radiation and camptothecin and resulted in increased chromosomal nondisjunction. All genes have human orthologs that may play a role in human carcinogenesis. PMID:15326288

  11. Identifying candidate genes affecting developmental time in Drosophila melanogaster: pervasive pleiotropy and gene-by-environment interaction

    PubMed Central

    Mensch, Julián; Lavagnino, Nicolás; Carreira, Valeria Paula; Massaldi, Ana; Hasson, Esteban; Fanara, Juan José

    2008-01-01

    Background Understanding the genetic architecture of ecologically relevant adaptive traits requires the contribution of developmental and evolutionary biology. The time to reach the age of reproduction is a complex life history trait commonly known as developmental time. In particular, in holometabolous insects that occupy ephemeral habitats, like fruit flies, the impact of developmental time on fitness is further exaggerated. The present work is one of the first systematic studies of the genetic basis of developmental time, in which we also evaluate the impact of environmental variation on the expression of the trait. Results We analyzed 179 co-isogenic single P[GT1]-element insertion lines of Drosophila melanogaster to identify novel genes affecting developmental time in flies reared at 25°C. Sixty percent of the lines showed a heterochronic phenotype, suggesting that a large number of genes affect this trait. Mutant lines for the genes Merlin and Karl showed the most extreme phenotypes exhibiting a developmental time reduction and increase, respectively, of over 2 days and 4 days relative to the control (a co-isogenic P-element insertion free line). In addition, a subset of 42 lines selected at random from the initial set of 179 lines was screened at 17°C. Interestingly, the gene-by-environment interaction accounted for 52% of total phenotypic variance. Plastic reaction norms were found for a large number of developmental time candidate genes. Conclusion We identified components of several integrated time-dependent pathways affecting egg-to-adult developmental time in Drosophila. At the same time, we also show that many heterochronic phenotypes may arise from changes in genes involved in several developmental mechanisms that do not explicitly control the timing of specific events. We also demonstrate that many developmental time genes have pleiotropic effects on several adult traits and that the action of most of them is sensitive to temperature during

  12. Gene-Gene and Gene-Environment Interactions in Ulcerative Colitis

    PubMed Central

    Wang, Ming-Hsi; Fiocchi, Claudio; Zhu, Xiaofeng; Ripke, Stephan; Kamboh, M. Ilyas; Rebert, Nancy; Duerr, Richard H.; Achkar, Jean-Paul

    2014-01-01

    Genome-wide association studies (GWAS) have identified at least 133 ulcerative colitis (UC) associated loci. The role of genetic factors in clinical practice is not clearly defined. The relevance of genetic variants to disease pathogenesis is still uncertain because gene-gene and gene-environment interactions have not been characterized. We examined the predictive value of combining the 133 UC risk loci with genetic interactions in an ongoing inflammatory bowel disease (IBD) GWAS. The Wellcome Trust Case-Control Consortium (WTCCC) IBD GWAS was used as a replication cohort. We applied logic regression (LR), a novel adaptive regression methodology, to search for high order interactions. Exploratory genotype correlations with UC sub-phenotypes (extent of disease, need of surgery, age of onset, extra-intestinal manifestations and primary sclerosing cholangitis (PSC)) were conducted. The combination of 133 UC loci yielded good UC risk predictability (area under the curve [AUC] of 0.86). A higher cumulative allele score predicted higher UC risk. Through LR, several lines of evidence for genetic interactions were identified and successfully replicated in the WTCCC cohort. The genetic interactions combined with the gene-smoking interaction significantly improved predictability in the model (AUC, from 0.86 to 0.89, P=3.26E-05). Explained UC variance increased from 37% to 42% after adding the interaction terms. A within-case analysis found a suggestive genetic association with PSC. Our study demonstrates that the LR methodology allows the identification and replication of high order genetic interactions in UC GWAS datasets. UC risk can be predicted by the 133 loci, and prediction is improved by adding gene-gene and gene-environment interactions. PMID:24241240
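The cumulative allele score and AUC mentioned in this abstract can be sketched as follows. This assumes an unweighted score (a plain count of risk alleles across loci), which the abstract does not confirm, and it computes AUC by direct pairwise comparison of case and control scores. Function names are hypothetical.

```python
def allele_score(risk_allele_counts):
    """Unweighted cumulative allele score: total risk alleles across loci.

    risk_allele_counts -- one value per locus, each 0, 1 or 2.
    """
    return sum(risk_allele_counts)


def auc(case_scores, control_scores):
    """Area under the ROC curve as the probability that a randomly chosen
    case scores higher than a randomly chosen control (ties count half)."""
    if not case_scores or not control_scores:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))
```

The pairwise form is quadratic in sample size, which is fine for illustration; a rank-based computation would be the usual choice for GWAS-scale cohorts.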

  13. Computational modeling identifies key gene regulatory interactions underlying phenobarbital-mediated tumor promotion

    PubMed Central

    Luisier, Raphaëlle; Unterberger, Elif B.; Goodman, Jay I.; Schwarz, Michael; Moggs, Jonathan; Terranova, Rémi; van Nimwegen, Erik

    2014-01-01

    Gene regulatory interactions underlying the early stages of non-genotoxic carcinogenesis are poorly understood. Here, we have identified key candidate regulators of phenobarbital (PB)-mediated mouse liver tumorigenesis, a well-characterized model of non-genotoxic carcinogenesis, by applying a new computational modeling approach to a comprehensive collection of in vivo gene expression studies. We have combined our previously developed motif activity response analysis (MARA), which models gene expression patterns in terms of computationally predicted transcription factor binding sites with singular value decomposition (SVD) of the inferred motif activities, to disentangle the roles that different transcriptional regulators play in specific biological pathways of tumor promotion. Furthermore, transgenic mouse models enabled us to identify which of these regulatory activities was downstream of constitutive androstane receptor and β-catenin signaling, both crucial components of PB-mediated liver tumorigenesis. We propose novel roles for E2F and ZFP161 in PB-mediated hepatocyte proliferation and suggest that PB-mediated suppression of ESR1 activity contributes to the development of a tumor-prone environment. Our study shows that combining MARA with SVD allows for automated identification of independent transcription regulatory programs within a complex in vivo tissue environment and provides novel mechanistic insights into PB-mediated hepatocarcinogenesis. PMID:24464994

  14. Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks.

    PubMed

    Wang, Jian; Xie, Dong; Lin, Hongfei; Yang, Zhihao; Zhang, Yijia

    2012-06-21

    Protein complexes are important to many biological processes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, the high false-positive rate of PPIs makes identification challenging. A protein semantic similarity measure is proposed in this study, based on the ontology structure of Gene Ontology (GO) terms and GO annotations, to estimate the reliability of interactions in PPI networks. Interaction pairs with low GO semantic similarity are removed from the network as unreliable interactions. Then, a cluster-expanding algorithm is used to detect complexes with core-attachment structure on the filtered network. Our method is applied to three different yeast PPI networks. The effectiveness of our method is examined on two benchmark complex datasets. Experimental results show that our method performed better than other state-of-the-art approaches in most evaluation metrics. The method detects protein complexes from large-scale PPI networks by filtering on GO semantic similarity. Removing interactions with low GO similarity significantly improves the performance of complex identification. The expanding strategy is also effective in identifying attachment proteins of complexes.
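The filtering step, removing interaction pairs whose annotation similarity is low, can be sketched as below. The similarity used here is a plain Jaccard overlap of annotation term sets, a deliberately crude stand-in: the measure in the study uses the GO ontology structure, which is not reproduced. The threshold value, function names, and term labels are assumptions.

```python
def jaccard(terms_a, terms_b):
    """Jaccard overlap of two annotation term sets (stand-in similarity)."""
    union = terms_a | terms_b
    return len(terms_a & terms_b) / len(union) if union else 0.0


def filter_interactions(edges, annotations, threshold=0.3):
    """Keep only interactions whose annotation similarity meets the threshold.

    edges       -- iterable of (protein_a, protein_b) pairs
    annotations -- dict mapping protein -> set of annotation terms
    threshold   -- hypothetical cut-off; unannotated proteins score 0.0
    """
    kept = []
    for a, b in edges:
        similarity = jaccard(annotations.get(a, set()), annotations.get(b, set()))
        if similarity >= threshold:
            kept.append((a, b))
    return kept


# Placeholder proteins and terms, not real GO identifiers.
example_annotations = {
    "A": {"term1", "term2"},
    "B": {"term1", "term2", "term3"},
    "C": {"term9"},
}
example_edges = [("A", "B"), ("A", "C"), ("B", "D")]
reliable = filter_interactions(example_edges, example_annotations)
```

The complex-detection stage would then run on the retained edges only.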

  15. Machine Learning for Detecting Gene-Gene Interactions

    PubMed Central

    McKinney, Brett A.; Reif, David M.; Ritchie, Marylyn D.; Moore, Jason H.

    2011-01-01

    Complex interactions among genes and environmental factors are known to play a role in common human disease aetiology. There is a growing body of evidence to suggest that complex interactions are ‘the norm’ and, rather than amounting to a small perturbation to classical Mendelian genetics, interactions may be the predominant effect. Traditional statistical methods are not well suited for detecting such interactions, especially when the data are high dimensional (many attributes or independent variables) or when interactions occur between more than two polymorphisms. In this review, we discuss machine-learning models and algorithms for identifying and characterising susceptibility genes in common, complex, multifactorial human diseases. We focus on the following machine-learning methods that have been used to detect gene-gene interactions: neural networks, cellular automata, random forests, and multifactor dimensionality reduction. We conclude with some ideas about how these methods and others can be integrated into a comprehensive and flexible framework for data mining and knowledge discovery in human genetics. PMID:16722772

  16. Knowledge-Driven Analysis Identifies a Gene–Gene Interaction Affecting High-Density Lipoprotein Cholesterol Levels in Multi-Ethnic Populations

    PubMed Central

    Ma, Li; Brautbar, Ariel; Boerwinkle, Eric; Sing, Charles F.

    2012-01-01

    Total cholesterol, low-density lipoprotein cholesterol, triglyceride, and high-density lipoprotein cholesterol (HDL-C) levels are among the most important risk factors for coronary artery disease. We tested for gene–gene interactions affecting the level of these four lipids based on prior knowledge of established genome-wide association study (GWAS) hits, protein–protein interactions, and pathway information. Using genotype data from 9,713 European Americans from the Atherosclerosis Risk in Communities (ARIC) study, we identified an interaction between HMGCR and a locus near LIPC in their effect on HDL-C levels (Bonferroni corrected Pc = 0.002). Using an adaptive locus-based validation procedure, we successfully validated this gene–gene interaction in the European American cohorts from the Framingham Heart Study (Pc = 0.002) and the Multi-Ethnic Study of Atherosclerosis (MESA; Pc = 0.006). The interaction between these two loci is also significant in the African American sample from ARIC (Pc = 0.004) and in the Hispanic American sample from MESA (Pc = 0.04). Both HMGCR and LIPC are involved in the metabolism of lipids, and genome-wide association studies have previously identified LIPC as associated with levels of HDL-C. However, the effect on HDL-C of the novel gene–gene interaction reported here is twice as pronounced as that predicted by the sum of the marginal effects of the two loci. In conclusion, based on a knowledge-driven analysis of epistasis, together with a new locus-based validation method, we successfully identified and validated an interaction affecting a complex trait in multi-ethnic populations. PMID:22654671

  17. Identifying cooperative transcriptional regulations using protein–protein interactions

    PubMed Central

    Nagamine, Nobuyoshi; Kawada, Yuji; Sakakibara, Yasubumi

    2005-01-01

    Cooperative transcriptional activation among multiple transcription factors (TFs) is important for understanding the mechanisms of complex transcriptional regulation in eukaryotes. Previous studies have attempted to find cooperative TFs based on gene expression data, with gene expression profiles as a measure of similarity of gene regulation. In this paper, we use protein–protein interaction data to infer synergistic binding of cooperative TFs. Our fundamental idea is based on the assumption that genes contributing to a similar biological process are regulated under the same control mechanism. First, the protein–protein interaction networks are used to calculate the similarity of biological processes among genes. Second, we integrate this similarity and the chromatin immuno-precipitation data to identify cooperative TFs. Our computational experiments in yeast show that our method successfully identified eight pairs of cooperative TFs that are supported by literature evidence but could not be identified by the previous method. Further, 12 new possible pairs have been inferred and we have examined their biological relevance. However, since protein–protein interaction data typically contain many false positives, we propose a method combining various biological data to increase the prediction accuracy. PMID:16126847

  18. Identifying genetic loci affecting antidepressant drug response in depression using drug–gene interaction models

    PubMed Central

    Noordam, Raymond; Avery, Christy L; Visser, Loes E; Stricker, Bruno H

    2016-01-01

    Antidepressants are often only moderately successful in decreasing the severity of depressive symptoms. In part, antidepressant treatment response in patients with depression is genetically determined. However, although a large number of studies have aimed to identify genetic variants associated with antidepressant drug response in depression, only a few variants have been repeatedly identified. In the present review, we discuss the methodological challenges and limitations of the studies conducted on this topic to date (e.g., ‘treated-only design’, statistical power), and we discuss how drug–gene interaction models specifically can be used to better identify genetic variants associated with antidepressant drug response in depression. PMID:27248517

  19. Bayesian Variable Selection for Hierarchical Gene-Environment and Gene-Gene Interactions

    PubMed Central

    Liu, Changlu; Ma, Jianzhong; Amos, Christopher I.

    2014-01-01

    We propose a Bayesian hierarchical mixture model framework that allows us to investigate the genetic and environmental effects, gene by gene interactions and gene by environment interactions in the same model. Our approach incorporates the natural hierarchical structure between the main effects and interaction effects into a mixture model, such that our methods tend to remove the irrelevant interaction effects more effectively, resulting in more robust and parsimonious models. We consider both strong and weak hierarchical models. For a strong hierarchical model, both of the main effects between interacting factors must be present for the interactions to be considered in the model development, while for a weak hierarchical model, only one of the two main effects is required to be present for the interaction to be evaluated. Our simulation results show that the proposed strong and weak hierarchical mixture models work well in controlling false positive rates and provide a powerful approach for identifying the predisposing effects and interactions in gene-environment interaction studies, in comparison with the naive model that does not impose this hierarchical constraint in most of the scenarios simulated. We illustrated our approach using data for lung cancer and cutaneous melanoma. PMID:25154630
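
    The strong and weak hierarchy constraints described in this abstract can be illustrated with a small sketch. It only encodes the deterministic eligibility rule (both main effects required versus at least one), not the Bayesian mixture model itself; the function name and the factor labels G1, G2, and E are hypothetical and chosen for illustration.

```python
from itertools import combinations


def eligible_interactions(active_main_effects, all_factors, hierarchy="strong"):
    """List the factor pairs whose interaction term may be considered.

    strong: both main effects of the pair must be present.
    weak:   at least one main effect of the pair must be present.
    """
    if hierarchy not in ("strong", "weak"):
        raise ValueError("hierarchy must be 'strong' or 'weak'")
    required = 2 if hierarchy == "strong" else 1
    active = set(active_main_effects)
    pairs = []
    for a, b in combinations(sorted(all_factors), 2):
        n_present = (a in active) + (b in active)
        if n_present >= required:
            pairs.append((a, b))
    return pairs


# Hypothetical factors: two genes and one environmental exposure.
factors = ["G1", "G2", "E"]
print(eligible_interactions({"G1", "E"}, factors, "strong"))
print(eligible_interactions({"G1"}, factors, "weak"))
```

    Under the strong rule only the pair whose two main effects are both active is eligible; under the weak rule every pair that involves the single active main effect is eligible.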

  20. Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases.

    PubMed

    Berger, Seth I; Posner, Jeremy M; Ma'ayan, Avi

    2007-10-04

    In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three-color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.

  1. Gene-gene and gene-environment interactions: new insights into the prevention, detection and management of coronary artery disease.

    PubMed

    Lanktree, Matthew B; Hegele, Robert A

    2009-02-26

    Despite the recent success of genome-wide association studies (GWASs) in identifying loci consistently associated with coronary artery disease (CAD), a large proportion of the genetic components of CAD and its metabolic risk factors, including plasma lipids, type 2 diabetes and body mass index, remain unattributed. Gene-gene and gene-environment interactions might produce a meaningful improvement in quantification of the genetic determinants of CAD. Testing for gene-gene and gene-environment interactions is thus a new frontier for large-scale GWASs of CAD. There are several anecdotal examples of monogenic susceptibility to CAD in which the phenotype was worsened by an adverse environment. In addition, small-scale candidate gene association studies with functional hypotheses have identified gene-environment interactions. For future evaluation of gene-gene and gene-environment interactions to achieve the same success as the single gene associations reported in recent GWASs, it will be important to pre-specify agreed standards of study design and statistical power, environmental exposure measurement, phenomic characterization and analytical strategies. Here we discuss these issues, particularly in relation to the investigation and potential clinical utility of gene-gene and gene-environment interactions in CAD.

  2. Simple F Test Reveals Gene-Gene Interactions in Case-Control Studies

    PubMed Central

    Chen, Guanjie; Yuan, Ao; Zhou, Jie; Bentley, Amy R.; Adeyemo, Adebowale; Rotimi, Charles N.

    2012-01-01

    Missing heritability is still a challenge for Genome Wide Association Studies (GWAS). Gene-gene interactions may partially explain this residual genetic influence and contribute broadly to complex disease. To analyze the gene-gene interactions in case-control studies of complex disease, we propose a simple, non-parametric method that utilizes the F-statistic. This approach consists of three steps. First, we examine the joint distribution of a pair of SNPs in cases and controls separately. Second, an F-test is used to evaluate the ratio of dependence in cases to that of controls. Finally, results are adjusted for multiple tests. This method was used to evaluate gene-gene interactions that are associated with risk of Type 2 Diabetes among African Americans in the Howard University Family Study. We identified 18 gene-gene interactions (P < 0.0001). Compared with the commonly used logistic regression method, we demonstrate that the F-ratio test is an efficient approach to measuring gene-gene interactions, especially for studies with limited sample size. PMID:22837643

  3. Ontology-based literature mining of E. coli vaccine-associated gene interaction networks.

    PubMed

    Hur, Junguk; Özgür, Arzucan; He, Yongqun

    2017-03-14

    Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, despite extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. For more rational development of effective and safe E. coli vaccines, it is important to better understand E. coli vaccine-associated gene interaction networks. In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interaction types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of

  4. Controllability analysis of the directed human protein interaction network identifies disease genes and drug targets

    PubMed Central

    Vinayagam, Arunachalam; Gibson, Travis E.; Lee, Ho-Joon; Yilmazel, Bahar; Roesel, Charles; Hu, Yanhui; Kwon, Young; Sharma, Amitabh; Liu, Yang-Yu; Perrimon, Norbert; Barabási, Albert-László

    2016-01-01

    The protein–protein interaction (PPI) network is crucial for cellular information processing and decision-making. With suitable inputs, PPI networks drive the cells to diverse functional outcomes such as cell proliferation or cell death. Here, we characterize the structural controllability of a large directed human PPI network comprising 6,339 proteins and 34,813 interactions. This network allows us to classify proteins as “indispensable,” “neutral,” or “dispensable,” which correlates to increasing, no effect, or decreasing the number of driver nodes in the network upon removal of that protein. We find that 21% of the proteins in the PPI network are indispensable. Interestingly, these indispensable proteins are the primary targets of disease-causing mutations, human viruses, and drugs, suggesting that altering a network’s control property is critical for the transition between healthy and disease states. Furthermore, analyzing copy number alterations data from 1,547 cancer patients reveals that 56 genes that are frequently amplified or deleted in nine different cancers are indispensable. Among the 56 genes, 46 of them have not been previously associated with cancer. This suggests that controllability analysis is very useful in identifying novel disease genes and potential drug targets. PMID:27091990
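
    The node classification described in this abstract can be sketched on a toy graph. The abstract does not spell out how driver nodes are counted; the sketch below assumes the common maximum-matching formulation of structural controllability, in which the number of driver nodes is the number of nodes minus the size of a maximum matching (with a minimum of one). All function names and the example graphs are hypothetical.

```python
def max_matching(nodes, edges):
    """Size of a maximum matching, treating each directed edge u -> v as a
    link between the 'out' copy of u and the 'in' copy of v."""
    succ = {u: [] for u in nodes}
    for u, v in edges:
        if u in succ and v in succ:
            succ[u].append(v)
    matched_to = {}  # 'in' copy of v -> 'out' copy of u currently matched to it

    def augment(u, seen):
        # Standard augmenting-path search (fine for small toy graphs).
        for v in succ[u]:
            if v in seen:
                continue
            seen.add(v)
            if v not in matched_to or augment(matched_to[v], seen):
                matched_to[v] = u
                return True
        return False

    return sum(augment(u, set()) for u in nodes)


def n_drivers(nodes, edges):
    """Assumed driver-node count: unmatched nodes, but never fewer than one."""
    return max(len(nodes) - max_matching(nodes, edges), 1)


def classify(nodes, edges):
    """Label each node by how its removal changes the driver-node count."""
    base = n_drivers(nodes, edges)
    labels = {}
    for x in nodes:
        rest = [n for n in nodes if n != x]
        kept = [(u, v) for u, v in edges if x not in (u, v)]
        after = n_drivers(rest, kept)
        if after > base:
            labels[x] = "indispensable"
        elif after < base:
            labels[x] = "dispensable"
        else:
            labels[x] = "neutral"
    return labels


# Toy examples: a directed path 1 -> 2 -> 3 and a star a -> b, a -> c.
print(classify([1, 2, 3], [(1, 2), (2, 3)]))
print(classify(["a", "b", "c"], [("a", "b"), ("a", "c")]))
```

    In the path, removing the middle node leaves two isolated nodes and raises the driver count, so it is labelled indispensable; in the star, removing either leaf lowers the count, so the leaves are labelled dispensable.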

  5. Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions.

    PubMed

    Hur, Junguk; Özgür, Arzucan; Xiang, Zuoshuang; He, Yongqun

    2015-01-01

    Literature mining of gene-gene interactions has been enhanced by ontology-based name classifications. However, in biomedical literature mining, interaction keywords have not been carefully studied and used beyond a collection of keywords. In this study, we report the development of a new Interaction Network Ontology (INO) that classifies >800 interaction keywords and incorporates interaction terms from the PSI Molecular Interactions (PSI-MI) and Gene Ontology (GO). Using INO-based literature mining results, a modified Fisher's exact test was established to analyze significantly over- and under-represented enriched gene-gene interaction types within a specific area. Such a strategy was applied to study the vaccine-mediated gene-gene interactions using all PubMed abstracts. The Vaccine Ontology (VO) and INO were used to support the retrieval of vaccine terms and interaction keywords from the literature. INO is aligned with the Basic Formal Ontology (BFO) and imports terms from 10 other existing ontologies. Current INO includes 540 terms. In terms of interaction-related terms, INO imports and aligns PSI-MI and GO interaction terms and includes over 100 newly generated ontology terms with 'INO_' prefix. A new annotation property, 'has literature mining keywords', was generated to allow the listing of different keywords mapping to the interaction types in INO. Using all PubMed documents published as of 12/31/2013, approximately 266,000 vaccine-associated documents were identified, and a total of 6,116 gene-pairs were associated with at least one INO term. Out of 78 INO interaction terms associated with at least five gene-pairs of the vaccine-associated sub-network, 14 terms were significantly over-represented (i.e., more frequently used) and 17 under-represented based on our modified Fisher's exact test. These over-represented and under-represented terms share some common top-level terms but are distinct at the bottom levels of the INO hierarchy. The analysis of these

  6. Gene-based interaction analysis shows GABAergic genes interacting with parenting in adolescent depressive symptoms.

    PubMed

    Van Assche, Evelien; Moons, Tim; Cinar, Ozan; Viechtbauer, Wolfgang; Oldehinkel, Albertine J; Van Leeuwen, Karla; Verschueren, Karine; Colpin, Hilde; Lambrechts, Diether; Van den Noortgate, Wim; Goossens, Luc; Claes, Stephan; van Winkel, Ruud

    2017-12-01

    Most gene-environment interaction studies (G × E) have focused on single candidate genes. This approach is criticized for its expectations of large effect sizes and occurrence of spurious results. We describe an approach that accounts for the polygenic nature of most psychiatric phenotypes and reduces the risk of false-positive findings. We apply this method focusing on the role of perceived parental support, psychological control, and harsh punishment in depressive symptoms in adolescence. Analyses were conducted on 982 adolescents of Caucasian origin (M age (SD) = 13.78 (.94) years) genotyped for 4,947 SNPs in 263 genes, selected based on a literature survey. The Leuven Adolescent Perceived Parenting Scale (LAPPS) and the Parental Behavior Scale (PBS) were used to assess perceived parental psychological control, harsh punishment, and support. The Center for Epidemiologic Studies Depression Scale (CES-D) was the outcome. We used gene-based testing taking into account linkage disequilibrium to identify genes containing SNPs exhibiting an interaction with environmental factors yielding a p-value per single gene. Significant results at the corrected p-value of p < 1.90 × 10^-4 were examined in an independent replication sample of Dutch adolescents (N = 1354). Two genes showed evidence for interaction with perceived support: GABRR1 (p = 4.62 × 10^-5) and GABRR2 (p = 9.05 × 10^-6). No genes interacted significantly with psychological control or harsh punishment. Gene-based analysis was unable to confirm the interaction of GABRR1 or GABRR2 with support in the replication sample. However, for GABRR2, but not GABRR1, the correlation of the estimates between the two datasets was significant (r(46) = .32; p = .027) and a gene-based analysis of the combined datasets supported GABRR2 × support interaction (p = 1.63 × 10^-4). We present a gene-based method for gene-environment interactions in a polygenic context and show that genes

  7. Identification of Human Disease Genes from Interactome Network Using Graphlet Interaction

    PubMed Central

    Yang, Lun; Wei, Dong-Qing; Qi, Ying-Xin; Jiang, Zong-Lai

    2014-01-01

    Identifying genes related to human diseases, such as cancer and cardiovascular disease, is an important task in biomedical research because of its applications in disease diagnosis and treatment. Interactome networks, especially protein-protein interaction networks, have been used for disease gene identification based on the hypothesis that strong candidate genes tend to be closely related to each other by some measure on the network. We proposed a new measure to analyze the relationship between network nodes, called graphlet interaction. The graphlet interaction contained 28 different isomers. The results showed that the numbers of graphlet interaction isomers between disease genes in interactome networks were significantly larger than those between randomly picked genes, while graphlet signatures were not. Then, we designed a new type of score, based on the network properties, to identify disease genes using graphlet interaction. The genes with higher scores were more likely to be disease genes, and all candidate genes were ranked according to their scores. The approach was then evaluated by leave-one-out cross-validation. The precision of the current approach reached 90% at about 10% recall, which was apparently higher than that of three previous predominant algorithms: random walk, Endeavour, and the neighborhood-based method. Finally, the approach was applied to predict new disease genes related to 4 common diseases, most of which were identified by other independent experimental research. In conclusion, we demonstrate that the graphlet interaction is an effective tool to analyze the network properties of disease genes, and that the scores calculated by graphlet interaction are more precise in identifying disease genes. PMID:24465923
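
    The "precision at a given recall" figure quoted in this abstract comes from cutting a ranked candidate list at some depth. A minimal sketch of that evaluation step follows; it does not implement the graphlet interaction score, and the gene labels and score values are made up for illustration.

```python
def precision_recall_at_k(scores, true_genes, k):
    """Precision and recall when the k highest-scoring candidates are called."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    hits = sum(1 for gene in ranked[:k] if gene in true_genes)
    return hits / k, hits / len(true_genes)


# Hypothetical candidate scores and known disease genes.
scores = {"A": 0.9, "B": 0.8, "C": 0.4, "D": 0.1}
known = {"A", "C", "D"}
precision, recall = precision_recall_at_k(scores, known, k=2)
print(precision, recall)
```

    Sweeping k from 1 to the number of candidates traces out the precision-recall trade-off for a ranking method.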

  8. Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries

    PubMed Central

    Guo, Xiuqing; Franceschini, Nora; Cheng, Ching-Yu; Sim, Xueling; Vojinovic, Dina; Marten, Jonathan; Musani, Solomon K.; Li, Changwei; Schwander, Karen; Richard, Melissa A.; Noordam, Raymond; Aschard, Hugues; Bartz, Traci M.; Bielak, Lawrence F.; Dorajoo, Rajkumar; Fisher, Virginia; Hartwig, Fernando P.; Horimoto, Andrea R. V. R.; Lohman, Kurt K.; Manning, Alisa K.; Rankinen, Tuomo; Smith, Albert V.; Wojczynski, Mary K.; Alver, Maris; Boissel, Mathilde; Cai, Qiuyin; Divers, Jasmin; Gao, Chuan; Goel, Anuj; Harris, Sarah E.; He, Meian; Hsu, Fang-Chi; Jackson, Anne U.; Kähönen, Mika; Kasturiratne, Anuradhani; Komulainen, Pirjo; Kühnel, Brigitte; Laguzzi, Federica; Luan, Jian'an; Nolte, Ilja M.; Padmanabhan, Sandosh; Robino, Antonietta; Scott, Robert A.; Sofer, Tamar; Stančáková, Alena; Takeuchi, Fumihiko; Tayo, Bamidele O.; Varga, Tibor V.; Vitart, Veronique; Wang, Yajuan; Warren, Helen R.; Wen, Wanqing; Yanek, Lisa R.; Zhang, Weihua; Zhao, Jing Hua; Afaq, Saima; Amin, Najaf; Arking, Dan E.; Aung, Tin; Boerwinkle, Eric; Borecki, Ingrid; Broeckel, Ulrich; Brown, Morris; Brumat, Marco; Burke, Gregory L.; Chakravarti, Aravinda; Charumathi, Sabanayagam; Ida Chen, Yii-Der; Connell, John M.; Correa, Adolfo; de las Fuentes, Lisa; de Mutsert, Renée; de Silva, H. Janaka; Deng, Xuan; Ding, Jingzhong; Duan, Qing; Eaton, Charles B.; Ehret, Georg; Eppinga, Ruben N.; Faul, Jessica D.; Felix, Stephan B.; Forouhi, Nita G.; Forrester, Terrence; Franco, Oscar H.; Friedlander, Yechiel; Gandin, Ilaria; Gao, He; Ghanbari, Mohsen; Gigante, Bruna; Gu, C. Charles; Gu, Dongfeng; Hagenaars, Saskia P.; Hallmans, Göran; Harris, Tamara B.; He, Jiang; Heng, Chew-Kiat; Hirata, Makoto; Howard, Barbara V.; Ikram, M. Arfan; John, Ulrich; Katsuya, Tomohiro; Khor, Chiea Chuen; Kilpeläinen, Tuomas O.; Koh, Woon-Puay; Krieger, José E.; Kritchevsky, Stephen B.; Kubo, Michiaki; Kuusisto, Johanna; Lakka, Timo A.; Langefeld, Carl D.; Langenberg, Claudia; Launer, Lenore J.; Lehne, Benjamin; Lewis, Cora E.; Li, Yize; Lin, Shiow; Liu, Jianjun; Liu, Jingmin; Loh, Marie; Louie, Tin; Mägi, Reedik; McKenzie, Colin A.; Meitinger, Thomas; Milaneschi, Yuri; Milani, Lili; Mohlke, Karen L.; Momozawa, Yukihide; Nalls, Mike A.; Nelson, Christopher P.; Sotoodehnia, Nona; Norris, Jill M.; O'Connell, Jeff R.; Palmer, Nicholette D.; Perls, Thomas; Pedersen, Nancy L.; Peters, Annette; Peyser, Patricia A.; Poulter, Neil; Raffel, Leslie J.; Raitakari, Olli T.; Roll, Kathryn; Rose, Lynda M.; Rosendaal, Frits R.; Rotter, Jerome I.; Schmidt, Carsten O.; Schreiner, Pamela J.; Schupf, Nicole; Scott, William R.; Shi, Yuan; Sidney, Stephen; Sims, Mario; Sitlani, Colleen M.; Smith, Jennifer A.; Snieder, Harold; Starr, John M.; Strauch, Konstantin; Stringham, Heather M.; Tan, Nicholas Y. Q.; Tang, Hua; Taylor, Kent D.; Teo, Yik Ying; Tham, Yih Chung; Turner, Stephen T.; Uitterlinden, André G.; Vollenweider, Peter; Waldenberger, Melanie; Wang, Lihua; Wang, Ya Xing; Wei, Wen Bin; Williams, Christine; Yao, Jie; Yu, Caizheng; Yuan, Jian-Min; Zhao, Wei; Zonderman, Alan B.; Becker, Diane M.; Boehnke, Michael; Bowden, Donald W.; Chambers, John C.; Deary, Ian J.; Esko, Tõnu; Farrall, Martin; Franks, Paul W.; Freedman, Barry I.; Froguel, Philippe; Gasparini, Paolo; Gieger, Christian; Kamatani, Yoichiro; Kato, Norihiro; Kooner, Jaspal S.; Kutalik, Zoltán; Laakso, Markku; Laurie, Cathy C.; Leander, Karin; Lehtimäki, Terho; Study, Lifelines Cohort; Magnusson, Patrik K. E.; Oldehinkel, Albertine J.; Penninx, Brenda W. J. H.; Polasek, Ozren; Porteous, David J.; Rauramaa, Rainer; Samani, Nilesh J.; Scott, James; Shu, Xiao-Ou; van der Harst, Pim; Wagenknecht, Lynne E.; Watkins, Hugh; Weir, David R.; Wickremasinghe, Ananda R.; Wu, Tangchun; Zheng, Wei; Bouchard, Claude; Christensen, Kaare; Evans, Michele K.; Gudnason, Vilmundur; Horta, Bernardo L.; Kardia, Sharon L. R.; Liu, Yongmei; Pereira, Alexandre C.; Psaty, Bruce M.; Ridker, Paul M.; van Dam, Rob M.; Gauderman, W. James; Zhu, Xiaofeng; Mook-Kanamori, Dennis O.; Fornage, Myriam; Rotimi, Charles N.; Cupples, L. Adrienne; Kelly, Tanika N.; Fox, Ervin R.; Hayward, Caroline; van Duijn, Cornelia M.; Tai, E Shyong; Wong, Tien Yin; Kooperberg, Charles; Palmas, Walter; Morrison, Alanna C.; Caulfield, Mark J.; Munroe, Patricia B.; Rao, Dabeeru C.; Province, Michael A.; Levy, Daniel

    2018-01-01

    Heavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impacts blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 × 10^-5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 × 10^-8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 × 10^-8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension. PMID:29912962

  9. Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries.

    PubMed

    Feitosa, Mary F; Kraja, Aldi T; Chasman, Daniel I; Sung, Yun J; Winkler, Thomas W; Ntalla, Ioanna; Guo, Xiuqing; Franceschini, Nora; Cheng, Ching-Yu; Sim, Xueling; Vojinovic, Dina; Marten, Jonathan; Musani, Solomon K; Li, Changwei; Bentley, Amy R; Brown, Michael R; Schwander, Karen; Richard, Melissa A; Noordam, Raymond; Aschard, Hugues; Bartz, Traci M; Bielak, Lawrence F; Dorajoo, Rajkumar; Fisher, Virginia; Hartwig, Fernando P; Horimoto, Andrea R V R; Lohman, Kurt K; Manning, Alisa K; Rankinen, Tuomo; Smith, Albert V; Tajuddin, Salman M; Wojczynski, Mary K; Alver, Maris; Boissel, Mathilde; Cai, Qiuyin; Campbell, Archie; Chai, Jin Fang; Chen, Xu; Divers, Jasmin; Gao, Chuan; Goel, Anuj; Hagemeijer, Yanick; Harris, Sarah E; He, Meian; Hsu, Fang-Chi; Jackson, Anne U; Kähönen, Mika; Kasturiratne, Anuradhani; Komulainen, Pirjo; Kühnel, Brigitte; Laguzzi, Federica; Luan, Jian'an; Matoba, Nana; Nolte, Ilja M; Padmanabhan, Sandosh; Riaz, Muhammad; Rueedi, Rico; Robino, Antonietta; Said, M Abdullah; Scott, Robert A; Sofer, Tamar; Stančáková, Alena; Takeuchi, Fumihiko; Tayo, Bamidele O; van der Most, Peter J; Varga, Tibor V; Vitart, Veronique; Wang, Yajuan; Ware, Erin B; Warren, Helen R; Weiss, Stefan; Wen, Wanqing; Yanek, Lisa R; Zhang, Weihua; Zhao, Jing Hua; Afaq, Saima; Amin, Najaf; Amini, Marzyeh; Arking, Dan E; Aung, Tin; Boerwinkle, Eric; Borecki, Ingrid; Broeckel, Ulrich; Brown, Morris; Brumat, Marco; Burke, Gregory L; Canouil, Mickaël; Chakravarti, Aravinda; Charumathi, Sabanayagam; Ida Chen, Yii-Der; Connell, John M; Correa, Adolfo; de Las Fuentes, Lisa; de Mutsert, Renée; de Silva, H Janaka; Deng, Xuan; Ding, Jingzhong; Duan, Qing; Eaton, Charles B; Ehret, Georg; Eppinga, Ruben N; Evangelou, Evangelos; Faul, Jessica D; Felix, Stephan B; Forouhi, Nita G; Forrester, Terrence; Franco, Oscar H; Friedlander, Yechiel; Gandin, Ilaria; Gao, He; Ghanbari, Mohsen; Gigante, Bruna; Gu, C Charles; Gu, Dongfeng; Hagenaars, Saskia P; Hallmans, Göran; Harris, Tamara B; He, Jiang; Heikkinen, Sami; Heng, Chew-Kiat; Hirata, Makoto; Howard, Barbara V; Ikram, M Arfan; John, Ulrich; Katsuya, Tomohiro; Khor, Chiea Chuen; Kilpeläinen, Tuomas O; Koh, Woon-Puay; Krieger, José E; Kritchevsky, Stephen B; Kubo, Michiaki; Kuusisto, Johanna; Lakka, Timo A; Langefeld, Carl D; Langenberg, Claudia; Launer, Lenore J; Lehne, Benjamin; Lewis, Cora E; Li, Yize; Lin, Shiow; Liu, Jianjun; Liu, Jingmin; Loh, Marie; Louie, Tin; Mägi, Reedik; McKenzie, Colin A; Meitinger, Thomas; Metspalu, Andres; Milaneschi, Yuri; Milani, Lili; Mohlke, Karen L; Momozawa, Yukihide; Nalls, Mike A; Nelson, Christopher P; Sotoodehnia, Nona; Norris, Jill M; O'Connell, Jeff R; Palmer, Nicholette D; Perls, Thomas; Pedersen, Nancy L; Peters, Annette; Peyser, Patricia A; Poulter, Neil; Raffel, Leslie J; Raitakari, Olli T; Roll, Kathryn; Rose, Lynda M; Rosendaal, Frits R; Rotter, Jerome I; Schmidt, Carsten O; Schreiner, Pamela J; Schupf, Nicole; Scott, William R; Sever, Peter S; Shi, Yuan; Sidney, Stephen; Sims, Mario; Sitlani, Colleen M; Smith, Jennifer A; Snieder, Harold; Starr, John M; Strauch, Konstantin; Stringham, Heather M; Tan, Nicholas Y Q; Tang, Hua; Taylor, Kent D; Teo, Yik Ying; Tham, Yih Chung; Turner, Stephen T; Uitterlinden, André G; Vollenweider, Peter; Waldenberger, Melanie; Wang, Lihua; Wang, Ya Xing; Wei, Wen Bin; Williams, Christine; Yao, Jie; Yu, Caizheng; Yuan, Jian-Min; Zhao, Wei; Zonderman, Alan B; Becker, Diane M; Boehnke, Michael; Bowden, Donald W; Chambers, John C; Deary, Ian J; Esko, Tõnu; Farrall, Martin; Franks, Paul W; Freedman, Barry I; Froguel, Philippe; Gasparini, Paolo; Gieger, Christian; Jonas, Jost Bruno; Kamatani, Yoichiro; Kato, Norihiro; Kooner, Jaspal S; Kutalik, Zoltán; Laakso, Markku; Laurie, Cathy C; Leander, Karin; Lehtimäki, Terho; Study, Lifelines Cohort; Magnusson, Patrik K E; Oldehinkel, Albertine J; Penninx, Brenda W J H; Polasek, Ozren; Porteous, David J; Rauramaa, Rainer; Samani, Nilesh J; Scott, James; Shu, Xiao-Ou; van der Harst, Pim; Wagenknecht, Lynne E; Wareham, Nicholas J; Watkins, Hugh; Weir, David R; Wickremasinghe, Ananda R; Wu, Tangchun; Zheng, Wei; Bouchard, Claude; Christensen, Kaare; Evans, Michele K; Gudnason, Vilmundur; Horta, Bernardo L; Kardia, Sharon L R; Liu, Yongmei; Pereira, Alexandre C; Psaty, Bruce M; Ridker, Paul M; van Dam, Rob M; Gauderman, W James; Zhu, Xiaofeng; Mook-Kanamori, Dennis O; Fornage, Myriam; Rotimi, Charles N; Cupples, L Adrienne; Kelly, Tanika N; Fox, Ervin R; Hayward, Caroline; van Duijn, Cornelia M; Tai, E Shyong; Wong, Tien Yin; Kooperberg, Charles; Palmas, Walter; Rice, Kenneth; Morrison, Alanna C; Elliott, Paul; Caulfield, Mark J; Munroe, Patricia B; Rao, Dabeeru C; Province, Michael A; Levy, Daniel

    2018-01-01

    Heavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impacts blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 × 10^-5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 × 10^-8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 × 10^-8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension.

  10. Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

    PubMed

    Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

    2015-06-01

    To explore molecular mechanisms of bladder cancer (BC), a network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using the empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. The regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through the molecular complex detection (MCODE) algorithm. Centrality analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were the top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deeper studies are needed to elucidate the pathogenesis of BC.

  11. Gene-Based Testing of Interactions in Association Studies of Quantitative Traits

    PubMed Central

    Ma, Li; Clark, Andrew G.; Keinan, Alon

    2013-01-01

    Various methods have been developed for identifying gene–gene interactions in genome-wide association studies (GWAS). However, most methods focus on individual markers as the testing unit, and the large number of such tests drastically erodes statistical power. In this study, we propose novel interaction tests of quantitative traits that are gene-based and that confer advantage in both statistical power and biological interpretation. The framework of gene-based gene–gene interaction (GGG) tests combines marker-based interaction tests between all pairs of markers in two genes to produce a gene-level test for interaction between the two. The tests are based on an analytical formula we derive for the correlation between marker-based interaction tests due to linkage disequilibrium. We propose four GGG tests that extend the following P value combining methods: minimum P value, extended Simes procedure, truncated tail strength, and truncated P value product. Extensive simulations point to correct type I error rates of all tests and show that the two truncated tests are more powerful than the other tests in cases of markers involved in the underlying interaction not being directly genotyped and in cases of multiple underlying interactions. We applied our tests to pairs of genes that exhibit a protein–protein interaction to test for gene-level interactions underlying lipid levels using genotype data from the Atherosclerosis Risk in Communities study. We identified five novel interactions that are not evident from marker-based interaction testing and successfully replicated one of these interactions, between SMAD3 and NEDD9, in an independent sample from the Multi-Ethnic Study of Atherosclerosis. We conclude that our GGG tests show improved power to identify gene-level interactions in existing, as well as emerging, association studies. PMID:23468652
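
    Two of the P value combining ideas named in this abstract can be sketched in their textbook forms. The versions below are the standard minimum-P (with a Šidák correction) and Simes combinations, which assume independent tests; they are not the GGG extensions from the paper, which additionally account for correlation due to linkage disequilibrium. The function names and the example P values are hypothetical.

```python
def min_p_sidak(pvalues):
    """Gene-level P value from the smallest marker-pair P value,
    Sidak-corrected for the number of tests (assumes independence)."""
    m = len(pvalues)
    return 1.0 - (1.0 - min(pvalues)) ** m


def simes(pvalues):
    """Standard Simes combination: min over ranks i of m * p_(i) / i."""
    m = len(pvalues)
    ranked = sorted(pvalues)
    return min(m * p / rank for rank, p in enumerate(ranked, start=1))


# Hypothetical marker-pair interaction P values for one gene pair.
marker_pair_p = [0.01, 0.04, 0.30, 0.50]
print(min_p_sidak(marker_pair_p))
print(simes(marker_pair_p))
```

    Both functions collapse many marker-level tests into a single gene-level P value, which is the step that reduces the multiple-testing burden described in the abstract.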

  12. A genomic approach to identify hybrid incompatibility genes.

    PubMed

    Cooper, Jacob C; Phadnis, Nitin

    2016-07-02

    Uncovering the genetic and molecular basis of barriers to gene flow between populations is key to understanding how new species are born. Intrinsic postzygotic reproductive barriers such as hybrid sterility and hybrid inviability are caused by deleterious genetic interactions known as hybrid incompatibilities. The difficulty in identifying these hybrid incompatibility genes remains a rate-limiting step in our understanding of the molecular basis of speciation. We recently described how whole genome sequencing can be applied to identify hybrid incompatibility genes, even from genetically terminal hybrids. Using this approach, we discovered a new hybrid incompatibility gene, gfzf, between Drosophila melanogaster and Drosophila simulans, and found that it plays an essential role in cell cycle regulation. Here, we discuss the history of the hunt for incompatibility genes between these species, discuss the molecular roles of gfzf in cell cycle regulation, and explore how intragenomic conflict drives the evolution of fundamental cellular mechanisms that lead to the developmental arrest of hybrids.

  13. A genomic approach to identify hybrid incompatibility genes

    PubMed Central

    Cooper, Jacob C.; Phadnis, Nitin

    2016-01-01

    Uncovering the genetic and molecular basis of barriers to gene flow between populations is key to understanding how new species are born. Intrinsic postzygotic reproductive barriers such as hybrid sterility and hybrid inviability are caused by deleterious genetic interactions known as hybrid incompatibilities. The difficulty in identifying these hybrid incompatibility genes remains a rate-limiting step in our understanding of the molecular basis of speciation. We recently described how whole genome sequencing can be applied to identify hybrid incompatibility genes, even from genetically terminal hybrids. Using this approach, we discovered a new hybrid incompatibility gene, gfzf, between Drosophila melanogaster and Drosophila simulans, and found that it plays an essential role in cell cycle regulation. Here, we discuss the history of the hunt for incompatibility genes between these species, discuss the molecular roles of gfzf in cell cycle regulation, and explore how intragenomic conflict drives the evolution of fundamental cellular mechanisms that lead to the developmental arrest of hybrids. PMID:27230814

  14. Chemical-Gene Interactions from ToxCast Bioactivity Data ...

    EPA Pesticide Factsheets

    Characterizing the effects of chemicals in biological systems is often summarized by chemical-gene interactions, which have sparse coverage in the literature. The ToxCast chemical screening program has produced bioactivity data for nearly 2000 chemicals and over 450 gene targets. To evaluate the information gained from the ToxCast project, a ToxCast bioactivity network was created comprising ToxCast chemical-gene interactions based on assay data and compared to a chemical-gene association network from literature. The literature network was compiled from PubMed articles, excluding ToxCast publications, mapped to genes and chemicals. Genes were identified by curated associations available from NCBI while chemicals were identified by PubChem submissions. The frequencies of chemical-gene associations from the literature network were log-scaled and then compared to the ToxCast bioactivity network. In total, 140 times more chemical-gene associations were present in the ToxCast network in comparison to the literature-derived network highlighting the vast increase in chemical-gene interactions putatively elucidated by the ToxCast research program. There were 165 associations found in the literature network that were reproduced by ToxCast bioactivity data, and 336 associations in the literature network were not reproduced by the ToxCast bioactivity network. The literature network relies on the assumption that chemical-gene associations represent a true chemical-gene inte

  15. Next-generation analysis of cataracts: determining knowledge driven gene-gene interactions using Biofilter, and gene-environment interactions using the PhenX Toolkit.

    PubMed

    Pendergrass, Sarah A; Verma, Shefali S; Holzinger, Emily R; Moore, Carrie B; Wallace, John; Dudek, Scott M; Huggins, Wayne; Kitchner, Terrie; Waudby, Carol; Berg, Richard; McCarty, Catherine A; Ritchie, Marylyn D

    2013-01-01

    Investigating the association between biobank derived genomic data and the information of linked electronic health records (EHRs) is an emerging area of research for dissecting the architecture of complex human traits, where cases and controls for study are defined through the use of electronic phenotyping algorithms deployed in large EHR systems. For our study, 2580 cataract cases and 1367 controls were identified within the Marshfield Personalized Medicine Research Project (PMRP) Biobank and linked EHR, which is a member of the NHGRI-funded electronic Medical Records and Genomics (eMERGE) Network. Our goal was to explore potential gene-gene and gene-environment interactions within these data for 529,431 single nucleotide polymorphisms (SNPs) with minor allele frequency > 1%, in order to explore higher level associations with cataract risk beyond investigations of single SNP-phenotype associations. To build our SNP-SNP interaction models we utilized a prior-knowledge driven filtering method called Biofilter to minimize the multiple testing burden of exploring the vast array of interaction models possible from our extensive number of SNPs. Using the Biofilter, we developed 57,376 prior-knowledge directed SNP-SNP models to test for association with cataract status. We selected models that required 6 sources of external domain knowledge. We identified 5 statistically significant models with an interaction term with p-value < 0.05, as well as an overall model with p-value < 0.05 associated with cataract status. We also conducted gene-environment interaction analyses for all GWAS SNPs and a set of environmental factors from the PhenX Toolkit: smoking, UV exposure, and alcohol use; these environmental factors have been previously associated with the formation of cataracts. We found a total of 288 models that exhibit an interaction term with a p-value ≤ 1 × 10^-4 associated with cataract status. Our results show these approaches enable advanced searches for epistasis

  16. Systematic Search for Gene-Gene Interaction Effect on Prostate Cancer Risk

    DTIC Science & Technology

    2011-07-01

    ...identify SNPs in the genome that interact to have stronger effects on PCa risk in the CGEMS GWAS data, 2) confirm the gene-gene interaction effect on PCa...for pairs of SNPs implicated in Aim 2 among the remaining 1,893 cases and 781 controls in CAPS, and 4) fine map the genomic regions where SNPs have

  17. Learning contextual gene set interaction networks of cancer with condition specificity

    PubMed Central

    2013-01-01

    Background Identifying similarities and differences in the molecular constitutions of various types of cancer is one of the key challenges in cancer research. The appearances of a cancer depend on complex molecular interactions, including gene regulatory networks and gene-environment interactions. This complexity makes it challenging to decipher the molecular origin of the cancer. In recent years, many studies reported methods to uncover heterogeneous depictions of complex cancers, which are often categorized into different subtypes. The challenge is to identify diverse molecular contexts within a cancer, to relate them to different subtypes, and to learn underlying molecular interactions specific to molecular contexts so that we can recommend context-specific treatment to patients. Results In this study, we describe a novel method to discern molecular interactions specific to certain molecular contexts. Unlike conventional approaches that build modular networks of individual genes, our focus is to identify cancer-generic and subtype-specific interactions between contextual gene sets, where a contextual gene set is a set of genes that share coherent transcriptional patterns across a subset of samples. We then apply a novel formulation for quantitating the effect of the samples from each subtype on the calculated strength of interactions observed. Two cancer data sets were analyzed to support the validity of condition-specificity of identified interactions. When compared to an existing approach, the proposed method was much more sensitive in identifying condition-specific interactions even in a heterogeneous data set. The results also revealed that network components specific to different types of cancer are related to different biological functions than cancer-generic network components. We found not only results that are consistent with previous studies, but also new hypotheses on the biological mechanisms specific to certain cancer types that warrant further

  18. Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions.

    PubMed

    Singh, Anuradha; Mantri, Shrikant; Sharma, Monica; Chaudhury, Ashok; Tuli, Rakesh; Roy, Joy

    2014-01-16

    The cultivated bread wheat (Triticum aestivum L.) possesses unique flour quality, which can be processed into many end-use food products such as bread, pasta, chapatti (unleavened flat bread), biscuit, etc. The present wheat varieties require improvement in processing quality to meet the increasing demand of better quality food products. However, processing quality is very complex and controlled by many genes, which have not been completely explored. To identify the candidate genes whose expressions changed due to variation in processing quality and interaction (quality x development), genome-wide transcriptome studies were performed in two sets of diverse Indian wheat varieties differing for chapatti quality. It is also important to understand the temporal and spatial distributions of their expressions for designing tissue and growth specific functional genomics experiments. Gene-specific two-way ANOVA analysis of expression of about 55 K transcripts in two diverse sets of Indian wheat varieties for chapatti quality at three seed developmental stages identified 236 differentially expressed probe sets (10-fold). Out of 236, 110 probe sets were identified for chapatti quality. Many processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, proteases, were identified, and many other candidate genes related to cellular and molecular functions were also identified. The ANOVA analysis revealed that the expression of 56 of 110 probe sets was involved in interaction (quality x development). The majority of the probe sets showed differential expression at an early stage of seed development, i.e. temporal expression. Meta-analysis revealed that the majority of the genes were expressed in one or a few growth stages, indicating spatial distribution of their expressions. The differential expressions of a few candidate genes such as pre-alpha/beta-gliadin and gamma gliadin were validated by RT-PCR. Therefore, this study

  19. Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions

    PubMed Central

    2014-01-01

    Background The cultivated bread wheat (Triticum aestivum L.) possesses unique flour quality, which can be processed into many end-use food products such as bread, pasta, chapatti (unleavened flat bread), biscuit, etc. The present wheat varieties require improvement in processing quality to meet the increasing demand of better quality food products. However, processing quality is very complex and controlled by many genes, which have not been completely explored. To identify the candidate genes whose expressions changed due to variation in processing quality and interaction (quality x development), genome-wide transcriptome studies were performed in two sets of diverse Indian wheat varieties differing for chapatti quality. It is also important to understand the temporal and spatial distributions of their expressions for designing tissue and growth specific functional genomics experiments. Results Gene-specific two-way ANOVA analysis of expression of about 55 K transcripts in two diverse sets of Indian wheat varieties for chapatti quality at three seed developmental stages identified 236 differentially expressed probe sets (10-fold). Out of 236, 110 probe sets were identified for chapatti quality. Many processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, proteases, were identified, and many other candidate genes related to cellular and molecular functions were also identified. The ANOVA analysis revealed that the expression of 56 of 110 probe sets was involved in interaction (quality x development). The majority of the probe sets showed differential expression at an early stage of seed development, i.e. temporal expression. Meta-analysis revealed that the majority of the genes were expressed in one or a few growth stages, indicating spatial distribution of their expressions. The differential expressions of a few candidate genes such as pre-alpha/beta-gliadin and gamma gliadin were validated by RT

  20. Genome-wide approach identifies a novel gene-maternal pre-pregnancy BMI interaction on preterm birth

    PubMed Central

    Hong, Xiumei; Hao, Ke; Ji, Hongkai; Peng, Shouneng; Sherwood, Ben; Di Narzo, Antonio; Tsai, Hui-Ju; Liu, Xin; Burd, Irina; Wang, Guoying; Ji, Yuelong; Caruso, Deanna; Mao, Guangyun; Bartell, Tami R.; Zhang, Zhongyang; Pearson, Colleen; Heffner, Linda; Cerda, Sandra; Beaty, Terri H.; Fallin, M. Daniele; Lee-Parritz, Aviva; Zuckerman, Barry; Weeks, Daniel E.; Wang, Xiaobin

    2017-01-01

    Preterm birth (PTB) contributes significantly to infant mortality and morbidity with lifelong impact. Few robust genetic factors of PTB have been identified. Such 'missing heritability' may be partly due to gene × environment interactions (G × E), which are largely unexplored. Here we conduct genome-wide G × E analyses of PTB in 1,733 African-American women (698 mothers of PTB; 1,035 of term birth) from the Boston Birth Cohort. We show that maternal COL24A1 variants have a significant genome-wide interaction with maternal pre-pregnancy overweight/obesity on PTB risk, with rs11161721 (P_G×E = 1.8 × 10^-8; empirical P_G×E = 1.2 × 10^-8) as the top hit. This interaction is replicated in African-American mothers (P_G×E = 0.01) from an independent cohort and in meta-analysis (P_G×E = 3.6 × 10^-9), but is not replicated in Caucasians. In adipose tissue, rs11161721 is significantly associated with altered COL24A1 expression. Our findings may provide new insight into the aetiology of PTB and improve our ability to predict and prevent PTB. PMID:28598419

  1. Computational methods for identifying miRNA sponge interactions.

    PubMed

    Le, Thuc Duy; Zhang, Junpeng; Liu, Lin; Li, Jiuyong

    2017-07-01

    Recent findings show that coding genes are not the only targets that miRNAs interact with. In fact, there is a pool of different RNAs competing with each other to attract miRNAs for interactions, thus acting as competing endogenous RNAs (ceRNAs). The ceRNAs indirectly regulate each other via the titration mechanism, i.e. the increasing concentration of a ceRNA will decrease the number of miRNAs that are available for interacting with other targets. The cross-talks between ceRNAs, i.e. their interactions mediated by miRNAs, have been identified as the drivers in many disease conditions, including cancers. In recent years, some computational methods have emerged for identifying ceRNA-ceRNA interactions. However, there remain great challenges and opportunities for developing computational methods to provide new insights into ceRNA regulatory mechanisms. In this paper, we review the publicly available databases of ceRNA-ceRNA interactions and the computational methods for identifying ceRNA-ceRNA interactions (also known as miRNA sponge interactions). We also conduct a comparison study of the methods with a breast cancer dataset. Our aim is to provide a current snapshot of the advances of the computational methods in identifying miRNA sponge interactions and to discuss the remaining challenges. © The Author 2016. Published by Oxford University Press. All rights reserved.

  2. Robust and Comprehensive Analysis of 20 Osteoporosis Candidate Genes by Very High-Density Single-Nucleotide Polymorphism Screen Among 405 White Nuclear Families Identified Significant Association and Gene–Gene Interaction

    PubMed Central

    Xiong, Dong-Hai; Shen, Hui; Zhao, Lan-Juan; Xiao, Peng; Yang, Tie-Lin; Guo, Yan; Wang, Wei; Guo, Yan-Fang; Liu, Yong-Jun; Recker, Robert R; Deng, Hong-Wen

    2007-01-01

    Many “novel” osteoporosis candidate genes have been proposed in recent years. To advance our knowledge of their roles in osteoporosis, we screened 20 such genes using a set of high-density SNPs in a large family-based study. Our efforts led to the prioritization of those osteoporosis genes and the detection of gene–gene interactions. Introduction We performed large-scale family-based association analyses of 20 novel osteoporosis candidate genes using 277 single nucleotide polymorphisms (SNPs) for the quantitative trait BMD variation and the qualitative trait osteoporosis (OP) at three clinically important skeletal sites: spine, hip, and ultradistal radius (UD). Materials and Methods One thousand eight hundred seventy-three subjects from 405 white nuclear families were genotyped and analyzed with an average density of one SNP per 4 kb across the 20 genes. We conducted association analyses by SNP- and haplotype-based family-based association test (FBAT) and performed gene–gene interaction analyses using multianalytic approaches such as multifactor-dimensionality reduction (MDR) and conditional logistic regression. Results and Conclusions We detected four genes (DBP, LRP5, CYP17, and RANK) that showed highly suggestive associations (10,000-permutation derived empirical global p ≤ 0.01) with spine BMD/OP; four genes (CYP19, RANK, RANKL, and CYP17) highly suggestive for hip BMD/OP; and four genes (CYP19, BMP2, RANK, and TNFR2) highly suggestive for UD BMD/OP. The associations between BMP2 with UD BMD and those between RANK with OP at the spine, hip, and UD also met the experiment-wide stringent criterion (empirical global p ≤ 0.0007). Sex-stratified analyses further showed that some of the significant associations in the total sample were driven by either male or female subjects. In addition, we identified and validated a two-locus gene–gene interaction model involving GCR and ESR2, for which prior biological evidence exists. Our results suggested the

  3. A Functional Genomics Approach to Identify Novel Breast Cancer Gene Targets in Yeast

    DTIC Science & Technology

    2004-05-01

    Award Number: DAMD17-03-1-0232. Author: Craig Bennett, Ph.D. ...We are using the yeast Saccharomyces cerevisiae to identify new cancer gene targets that interact with the

  4. Discovery of gene-gene interactions across multiple independent data sets of late onset Alzheimer disease from the Alzheimer Disease Genetics Consortium.

    PubMed

    Hohman, Timothy J; Bush, William S; Jiang, Lan; Brown-Gentry, Kristin D; Torstenson, Eric S; Dudek, Scott M; Mukherjee, Shubhabrata; Naj, Adam; Kunkle, Brian W; Ritchie, Marylyn D; Martin, Eden R; Schellenberg, Gerard D; Mayeux, Richard; Farrer, Lindsay A; Pericak-Vance, Margaret A; Haines, Jonathan L; Thornton-Wells, Tricia A

    2016-02-01

    Late-onset Alzheimer disease (AD) has a complex genetic etiology, involving locus heterogeneity, polygenic inheritance, and gene-gene interactions; however, the investigation of interactions in recent genome-wide association studies has been limited. We used a biological knowledge-driven approach to evaluate gene-gene interactions for consistency across 13 data sets from the Alzheimer Disease Genetics Consortium. Fifteen single nucleotide polymorphism (SNP)-SNP pairs within 3 gene-gene combinations were identified: SIRT1 × ABCB1, PSAP × PEBP4, and GRIN2B × ADRA1A. In addition, we extend a previously identified interaction from an endophenotype analysis between RYR3 × CACNA1C. Finally, post hoc gene expression analyses of the implicated SNPs further implicate SIRT1 and ABCB1, and implicate CDH23 which was most recently identified as an AD risk locus in an epigenetic analysis of AD. The observed interactions in this article highlight ways in which genotypic variation related to disease may depend on the genetic context in which it occurs. Further, our results highlight the utility of evaluating genetic interactions to explain additional variance in AD risk and identify novel molecular mechanisms of AD pathogenesis. Copyright © 2016 Elsevier Inc. All rights reserved.

  5. Common Marker Genes Identified from Various Sample Types for Systemic Lupus Erythematosus.

    PubMed

    Bing, Peng-Fei; Xia, Wei; Wang, Lan; Zhang, Yong-Hong; Lei, Shu-Feng; Deng, Fei-Yan

    2016-01-01

    Systemic lupus erythematosus (SLE) is a complex auto-immune disease. Gene expression studies have been conducted to identify SLE-related genes in various types of samples. It is unknown whether there are common marker genes significant for SLE but independent of sample types, which may have potential for follow-up translational research. The aim of this study is to identify common marker genes across various sample types for SLE. Based on four public microarray gene expression datasets for SLE covering three representative types of blood-borne samples (monocyte; peripheral blood mononuclear cell, PBMC; whole blood), we utilized three statistics (fold-change, FC; t-test p value; false discovery rate adjusted p value) to scrutinize genes simultaneously regulated with SLE across various sample types. For common marker genes, we conducted the Gene Ontology enrichment analysis and Protein-Protein Interaction analysis to gain insights into their functions. We identified 10 common marker genes associated with SLE (IFI6, IFI27, IFI44L, OAS1, OAS2, EIF2AK2, PLSCR1, STAT1, RNASE2, and GSTO1). Significant up-regulation of IFI6, IFI27, and IFI44L with SLE was observed in all the studied sample types, though the FC was most striking in monocyte, compared with PBMC and whole blood (8.82-251.66 vs. 3.73-74.05 vs. 1.19-1.87). Eight of the above 10 genes, except RNASE2 and GSTO1, interact with each other and with known SLE susceptibility genes, participate in immune response, RNA and protein catabolism, and cell death. Our data suggest that there exist common marker genes across various sample types for SLE. The 10 common marker genes, identified herein, deserve follow-up studies to dissect their potential as diagnostic or therapeutic markers to predict SLE or treatment response.
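
Two of the three statistics mentioned in this abstract, fold-change and the false discovery rate adjusted p value, can be sketched in a few lines. The example below is an illustrative sketch only: it assumes positive, linear-scale expression values and uses the Benjamini-Hochberg procedure for the adjustment, which the abstract does not specify. The function names and input numbers are hypothetical.

```python
import math

def log2_fold_change(case_values, control_values):
    """log2 ratio of mean expression (assumes positive, linear-scale values)."""
    mean_case = sum(case_values) / len(case_values)
    mean_control = sum(control_values) / len(control_values)
    return math.log2(mean_case / mean_control)

def benjamini_hochberg(pvals):
    """Benjamini-Hochberg adjusted p values, returned in the input order."""
    n = len(pvals)
    order = sorted(range(n), key=lambda i: pvals[i])  # indices by ascending p
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvals[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted

# Hypothetical per-gene t-test p values.
adjusted_pvals = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

A gene would then be retained as a candidate marker only if it passes the chosen fold-change and adjusted p value cutoffs in every sample type.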

  6. A kernel regression approach to gene-gene interaction detection for case-control studies.

    PubMed

    Larson, Nicholas B; Schaid, Daniel J

    2013-11-01

    Gene-gene interactions are increasingly being addressed as a potentially important contributor to the variability of complex traits. Consequently, attention has moved beyond single locus analysis of association to more complex genetic models. Although several single-marker approaches toward interaction analysis have been developed, such methods suffer from very high testing dimensionality and do not take advantage of existing information, notably the definition of genes as functional units. Here, we propose a comprehensive family of gene-level score tests for identifying genetic elements of disease risk, in particular pairwise gene-gene interactions. Using kernel machine methods, we devise score-based variance component tests under a generalized linear mixed model framework. We conducted simulations based upon coalescent genetic models to evaluate the performance of our approach under a variety of disease models. These simulations indicate that our methods are generally higher powered than alternative gene-level approaches and at worst competitive with exhaustive SNP-level (where SNP is single-nucleotide polymorphism) analyses. Furthermore, we observe that simulated epistatic effects resulted in significant marginal testing results for the involved genes regardless of whether or not true main effects were present. We detail the benefits of our methods and discuss potential genome-wide analysis strategies for gene-gene interaction analysis in a case-control study design. © 2013 WILEY PERIODICALS, INC.

  7. Predicting hepatocellular carcinoma through cross-talk genes identified by risk pathways

    PubMed Central

    Shao, Zhuo; Huo, Diwei; Zhang, Denan; Xie, Hongbo; Yang, Jingbo; Liu, Qiuqi; Chen, Xiujie

    2018-01-01

    Hepatocellular carcinoma (HCC) is the most frequent type of liver cancer with poor survival rate and high mortality. Despite efforts on the mechanism of HCC, new molecular markers are needed for exact diagnosis, evaluation and treatment. Here, we combined transcriptome of HCC with networks and pathways to identify reliable molecular markers. Through integrating 249 differentially expressed genes with syncretic protein interaction networks, we constructed a HCC-specific network, from which we further extracted 480 pivotal genes. Based on the cross-talk between the enriched pathways of the pivotal genes, we finally identified a HCC signature of 45 genes, which could accurately distinguish HCC patients from normal individuals and reveal the prognosis of HCC patients. Among these 45 genes, 15 showed dysregulated expression patterns and some have been reported to be associated with HCC and/or other cancers. These findings suggested that our identified 45 gene signature could provide potentially valuable molecular markers for diagnosis and evaluation of HCC. PMID:29765536

  8. Genome-Wide Analysis of Gene-Gene and Gene-Environment Interactions Using Closed-Form Wald Tests.

    PubMed

    Yu, Zhaoxia; Demetriou, Michael; Gillen, Daniel L

    2015-09-01

    Despite the successful discovery of hundreds of variants for complex human traits using genome-wide association studies, the degree to which genes and environmental risk factors jointly affect disease risk is largely unknown. One obstacle toward this goal is that the computational effort required for testing gene-gene and gene-environment interactions is enormous. As a result, numerous computationally efficient tests were recently proposed. However, the validity of these methods often relies on unrealistic assumptions such as additive main effects, main effects at only one variable, no linkage disequilibrium between the two single-nucleotide polymorphisms (SNPs) in a pair or gene-environment independence. Here, we derive closed-form and consistent estimates for interaction parameters and propose to use Wald tests for testing interactions. The Wald tests are asymptotically equivalent to the likelihood ratio tests (LRTs), largely considered to be the gold standard tests but generally too computationally demanding for genome-wide interaction analysis. Simulation studies show that the proposed Wald tests perform very similarly to the LRTs but are much more computationally efficient. Applying the proposed tests to a genome-wide study of multiple sclerosis, we identify interactions within the major histocompatibility complex region. In this application, we find that (1) focusing on pairs where both SNPs are marginally significant leads to more significant interactions when compared to focusing on pairs where at least one SNP is marginally significant; and (2) parsimonious parameterization of interaction effects might decrease, rather than increase, statistical power. © 2015 WILEY PERIODICALS, INC.
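
To make the idea of a closed-form Wald test for interaction concrete, the sketch below handles the simplest case: a binary genetic factor and a binary environmental factor in a case-control sample. In that saturated setting the interaction estimate is the difference of the two stratum-specific log odds ratios, and its variance is estimated by the sum of the reciprocal cell counts. This is a standard textbook construction offered as an assumption-laden illustration; it is not the estimator derived in the paper above, and the function name and counts are hypothetical.

```python
import math

def wald_interaction_test(stratum_e0, stratum_e1):
    """Closed-form Wald test for a G x E interaction on the log-odds scale.

    Each stratum is (cases_g1, cases_g0, controls_g1, controls_g0), the
    case/control counts for carriers (g1) and non-carriers (g0) within one
    level of the binary exposure. All counts must be positive.
    Returns (beta, standard_error, two_sided_p).
    """
    def log_or_and_var(cells):
        a, b, c, d = cells
        # Log odds ratio and its Woolf variance estimate.
        return math.log((a * d) / (b * c)), 1 / a + 1 / b + 1 / c + 1 / d

    log_or0, var0 = log_or_and_var(stratum_e0)
    log_or1, var1 = log_or_and_var(stratum_e1)
    beta = log_or1 - log_or0            # interaction = log ratio of odds ratios
    se = math.sqrt(var0 + var1)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return beta, se, p

# Hypothetical counts: odds ratio 1 among unexposed, 4 among exposed.
beta, se, p = wald_interaction_test((10, 20, 10, 20), (20, 10, 10, 20))
```

Because the estimate and its standard error are available without iterative model fitting, a test of this form can be evaluated quickly for very large numbers of pairs.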

  9. A Combinatorial Approach to Detecting Gene-Gene and Gene-Environment Interactions in Family Studies

    PubMed Central

    Lou, Xiang-Yang; Chen, Guo-Bo; Yan, Lei; Ma, Jennie Z.; Mangold, Jamie E.; Zhu, Jun; Elston, Robert C.; Li, Ming D.

    2008-01-01

    Widespread multifactor interactions present a significant challenge in determining risk factors of complex diseases. Several combinatorial approaches, such as the multifactor dimensionality reduction (MDR) method, have emerged as promising tools for better detecting gene-gene (G × G) and gene-environment (G × E) interactions. We recently developed a general combinatorial approach, namely the generalized multifactor dimensionality reduction (GMDR) method, which can entertain both qualitative and quantitative phenotypes and allows for both discrete and continuous covariates to detect G × G and G × E interactions in a sample of unrelated individuals. In this article, we report the development of an algorithm that can be used to study G × G and G × E interactions for family-based designs, called pedigree-based GMDR (PGMDR). Compared to the available method, our proposed method has several major improvements, including allowing for covariate adjustments and being applicable to arbitrary phenotypes, arbitrary pedigree structures, and arbitrary patterns of missing marker genotypes. Our Monte Carlo simulations provide evidence that the PGMDR method is superior to the MDR-pedigree disequilibrium test (PDT) in identifying epistatic loci. Finally, we applied our proposed approach to a genetic data set on tobacco dependence and found a significant interaction between two taste receptor genes (i.e., TAS2R16 and TAS2R38) in affecting nicotine dependence. PMID:18834969
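
The core step of the basic MDR method referred to in this abstract can be sketched as follows: each two-locus genotype combination is labelled high-risk when its case-to-control ratio exceeds the ratio in the whole sample, and the resulting binary label is scored as a classifier. This is a minimal sketch for unrelated individuals; it omits the cross-validation used in practice as well as the covariate and pedigree handling of GMDR and PGMDR. The function name and the toy data are hypothetical.

```python
def mdr_two_locus(genotypes_a, genotypes_b, is_case):
    """Label two-locus genotype cells as high-risk and score the labelling.

    Returns (high_risk_cells, accuracy). Requires at least one control.
    """
    # Count cases and controls in every observed genotype combination.
    counts = {}
    for ga, gb, case in zip(genotypes_a, genotypes_b, is_case):
        cell = counts.setdefault((ga, gb), [0, 0])
        cell[0 if case else 1] += 1

    n_cases = sum(1 for case in is_case if case)
    n_controls = len(is_case) - n_cases
    threshold = n_cases / n_controls  # overall case:control ratio

    # A cell with no controls contains only cases, so it is high-risk.
    high_risk = {
        key for key, (cases, controls) in counts.items()
        if controls == 0 or cases / controls > threshold
    }

    # Training accuracy of "high-risk cell predicts case".
    correct = sum(
        1 for ga, gb, case in zip(genotypes_a, genotypes_b, is_case)
        if ((ga, gb) in high_risk) == bool(case)
    )
    return high_risk, correct / len(is_case)

# Toy data with a purely epistatic (XOR-like) pattern and no main effects.
locus_a = [0, 0, 1, 1, 0, 0, 1, 1]
locus_b = [0, 1, 0, 1, 0, 1, 0, 1]
status = [0, 1, 1, 0, 0, 1, 1, 0]
cells, accuracy = mdr_two_locus(locus_a, locus_b, status)
```

In the toy data neither locus is associated with status on its own, yet the pooled high-risk label separates cases from controls, which is the kind of signal combinatorial methods are designed to capture.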

  10. A gene-trap strategy identifies quiescence-induced genes in synchronized myoblasts.

    PubMed

    Sambasivan, Ramkumar; Pavlath, Grace K; Dhawan, Jyotsna

    2008-03-01

    Cellular quiescence is characterized not only by reduced mitotic and metabolic activity but also by altered gene expression. Growing evidence suggests that quiescence is not merely a basal state but is regulated by active mechanisms. To understand the molecular programme that governs reversible cell cycle exit, we focused on quiescence-related gene expression in a culture model of myogenic cell arrest and activation. Here we report the identification of quiescence-induced genes using a gene-trap strategy. Using a retroviral vector, we generated a library of gene traps in C2C12 myoblasts that were screened for arrest-induced insertions by live cell sorting (FACS-gal). Several independent gene-trap lines revealed arrest-dependent induction of betagal activity, confirming the efficacy of the FACS screen. The locus of integration was identified in 15 lines. In three lines, insertion occurred in genes previously implicated in the control of quiescence, i.e. EMSY (a BRCA2-interacting protein), p8/com1 (a p300HAT-binding protein) and MLL5 (a SET domain protein). Our results demonstrate that expression of chromatin modulatory genes is induced in G0, providing support to the notion that this reversibly arrested state is actively regulated.

  11. Identifying key genes associated with acute myocardial infarction.

    PubMed

    Cheng, Ming; An, Shoukuan; Li, Junquan

    2017-10-01

    This study aimed to identify key genes associated with acute myocardial infarction (AMI) by reanalyzing microarray data. Three gene expression profile datasets GSE66360, GSE34198, and GSE48060 were downloaded from GEO database. After data preprocessing, genes without heterogeneity across different platforms were subjected to differential expression analysis between the AMI group and the control group using metaDE package. P < .05 was used as the cutoff for a differentially expressed gene (DEG). The expression data matrices of DEGs were imported in ReactomeFIViz to construct a gene functional interaction (FI) network. Then, DEGs in each module were subjected to pathway enrichment analysis using DAVID. MiRNAs and transcription factors predicted to regulate target DEGs were identified. Quantitative real-time polymerase chain reaction (RT-PCR) was applied to verify the expression of genes. A total of 913 upregulated genes and 1060 downregulated genes were identified in the AMI group. A FI network consists of 21 modules and DEGs in 12 modules were significantly enriched in pathways. The transcription factor-miRNA-gene network contains 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p. RT-PCR validations showed that expression levels of FOXO3 and MYBL2 were significantly increased in AMI, and expression levels of hsa-miR-21-5p and hsa-miR-30c-5p were obviously decreased in AMI. A total of 41 DEGs, such as SOCS3, VAPA, and COL5A2, are speculated to have roles in the pathogenesis of AMI; 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p may be involved in the regulation of the expression of these DEGs.

  12. Genome-Wide Gene-Environment Study Identifies Glutamate Receptor Gene GRIN2A as a Parkinson's Disease Modifier Gene via Interaction with Coffee

    PubMed Central

    Hamza, Taye H.; Chen, Honglei; Hill-Burns, Erin M.; Rhodes, Shannon L.; Montimurro, Jennifer; Kay, Denise M.; Tenesa, Albert; Kusel, Victoria I.; Sheehan, Patricia; Eaaswarkhanth, Muthukrishnan; Yearout, Dora; Samii, Ali; Roberts, John W.; Agarwal, Pinky; Bordelon, Yvette; Park, Yikyung; Wang, Liyong; Gao, Jianjun; Vance, Jeffery M.; Kendler, Kenneth S.; Bacanu, Silviu-Alin; Scott, William K.; Ritz, Beate; Nutt, John; Factor, Stewart A.; Zabetian, Cyrus P.; Payami, Haydeh

    2011-01-01

    Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal components. We then stratified subjects as heavy or light coffee-drinkers and performed genome-wide association study (GWAS) in each group. We replicated the most significant SNP. Finally, we imputed the NGRC dataset, increasing genomic coverage to examine the region of interest in detail. The primary analyses (GWAIS, GWAS, Replication) were performed using genotyped data. In GWAIS, the most significant signal came from rs4998386 and the neighboring SNPs in GRIN2A. GRIN2A encodes an NMDA-glutamate-receptor subunit and regulates excitatory neurotransmission in the brain. Achieving P_2df = 10^−6, GRIN2A surpassed all known PD susceptibility genes in significance in the GWAIS. In stratified GWAS, the GRIN2A signal was present in heavy coffee-drinkers (OR = 0.43; P = 6×10^−7) but not in light coffee-drinkers. The a priori Replication hypothesis that “Among heavy coffee-drinkers, rs4998386_T carriers have lower PD risk than rs4998386_CC carriers” was confirmed: OR_Replication = 0.59, P_Replication = 10^−3; OR_Pooled = 0.51, P_Pooled = 7×10^−8. Compared to light coffee-drinkers with rs4998386_CC genotype, heavy coffee-drinkers with rs4998386_CC genotype had 18% lower risk (P = 3×10^−3), whereas heavy coffee-drinkers with rs4998386_TC genotype had 59% lower risk (P = 6×10^−13). Imputation revealed a block of SNPs that achieved P_2df < 5×10^−8 in GWAIS, and OR = 0.41, P = 3×10^−8 in heavy coffee-drinkers. This study is proof of concept

  13. Genotype-based association models of complex diseases to detect gene-gene and gene-environment interactions.

    PubMed

    Lobach, Iryna; Fan, Ruzong; Manga, Prashiela

    A central problem in genetic epidemiology is to identify and rank genetic markers involved in a disease. Complex diseases, such as cancer, hypertension, diabetes, are thought to be caused by an interaction of a panel of genetic factors, that can be identified by markers, which modulate environmental factors. Moreover, the effect of each genetic marker may be small. Hence, the association signal may be missed unless a large sample is considered, or a priori biomedical data are used. Recent advances generated a vast variety of a priori information, including linkage maps and information about gene regulatory dependence assembled into curated pathway databases. We propose a genotype-based approach that takes into account linkage disequilibrium (LD) information between genetic markers that are in moderate LD while modeling gene-gene and gene-environment interactions. A major advantage of our method is that the observed genetic information enters a model directly thus eliminating the need to estimate haplotype-phase. Our approach results in an algorithm that is inexpensive computationally and does not suffer from bias induced by haplotype-phase ambiguity. We investigated our model in a series of simulation experiments and demonstrated that the proposed approach results in estimates that are nearly unbiased and have small variability. We applied our method to the analysis of data from a melanoma case-control study and investigated interaction between a set of pigmentation genes and environmental factors defined by age and gender. Furthermore, an application of our method is demonstrated using a study of Alcohol Dependence.

  14. Gene-Gene-Environment Interactions of Serotonin Transporter, Monoamine Oxidase A and Childhood Maltreatment Predict Aggressive Behavior in Chinese Adolescents

    PubMed Central

    Zhang, Yun; Ming, Qing-sen; Yi, Jin-yao; Wang, Xiang; Chai, Qiao-lian; Yao, Shu-qiao

    2017-01-01

    Gene-environment interactions that moderate aggressive behavior have been identified independently in the serotonin transporter (5-HTT) gene and monoamine oxidase A gene (MAOA). The aim of the present study was to investigate epistatic interactions between MAOA-variable number tandem repeat (VNTR), 5-HTT-linked polymorphism (LPR) and child abuse and the effects of these on aggressive tendencies in a group of otherwise healthy adolescents. A group of 546 Chinese male adolescents completed the Child Trauma Questionnaire and Youth self-report of the Child Behavior Checklist. Buccal cells were collected for DNA analysis. The effects of childhood abuse, MAOA-VNTR, 5-HTTLPR genotypes and their interactive gene-gene-environmental effects on aggressive behavior were analyzed using a linear regression model. The effect of child maltreatment was significant, and a three-way interaction among MAOA-VNTR, 5-HTTLPR and sexual abuse (SA) relating to aggressive behaviors was identified. Chinese male adolescents with high expression of the MAOA-VNTR allele and 5-HTTLPR “SS” genotype exhibited the highest aggression tendencies with an increase in SA during childhood. The findings reported support aggression being a complex behavior involving the synergistic effects of gene-gene-environment interactions. PMID:28203149

  15. m6A-Driver: Identifying Context-Specific mRNA m6A Methylation-Driven Gene Interaction Networks

    PubMed Central

    Zhang, Song-Yao; Zhang, Shao-Wu; Liu, Lian; Huang, Yufei

    2016-01-01

    As the most prevalent mammalian mRNA epigenetic modification, N6-methyladenosine (m6A) has been shown to possess important post-transcriptional regulatory functions. However, the regulatory mechanisms and functional circuits of m6A are still largely elusive. To help unveil the regulatory circuitry mediated by mRNA m6A methylation, we develop here m6A-Driver, an algorithm for predicting m6A-driven genes and associated networks, whose functional interactions are likely to be actively modulated by m6A methylation under a specific condition. Specifically, m6A-Driver integrates the PPI network and the predicted differential m6A methylation sites from methylated RNA immunoprecipitation sequencing (MeRIP-Seq) data using a Random Walk with Restart (RWR) algorithm and then builds a consensus m6A-driven network of m6A-driven genes. To evaluate the performance, we applied m6A-Driver to build the context-specific m6A-driven networks for 4 known m6A (de)methylases, i.e., FTO, METTL3, METTL14 and WTAP. Our results suggest that m6A-Driver can robustly and efficiently identify m6A-driven genes that are functionally more enriched and associated with higher degree of differential expression than differential m6A methylated genes. Pathway analysis of the constructed context-specific m6A-driven gene networks further revealed the regulatory circuitry underlying the dynamic interplays between the methyltransferases and demethylase at the epitranscriptomic layer of gene regulation. PMID:28027310
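
    A Random Walk with Restart on an interaction network can be sketched as below. This is a generic illustration of the RWR technique named in the abstract, not the m6A-Driver implementation; the node labels, the restart probability of 0.5, the convergence tolerance and the handling of isolated nodes are all assumptions chosen for the example.

```python
def random_walk_with_restart(adjacency, seeds, restart=0.5, tol=1e-10, max_iter=1000):
    """Return steady-state visiting probabilities for every node.

    adjacency: dict mapping node -> list of neighbours (undirected, hypothetical network).
    seeds:     set of nodes the walker restarts from (e.g. differentially methylated genes).
    """
    nodes = sorted(adjacency)
    # Restart distribution: uniform over the seed nodes.
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        nxt = {n: restart * p0[n] for n in nodes}
        for n in nodes:
            share = (1.0 - restart) * p[n]
            neighbours = adjacency[n]
            if neighbours:
                # Spread the walking mass evenly over the neighbours.
                for m in neighbours:
                    nxt[m] += share / len(neighbours)
            else:
                # Assumed convention: an isolated node sends its mass back to the seeds.
                for s in nodes:
                    nxt[s] += share * p0[s]
        change = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if change < tol:
            break
    return p

# Toy path network geneA - geneB - geneC - geneD with a single seed.
toy_network = {
    "geneA": ["geneB"],
    "geneB": ["geneA", "geneC"],
    "geneC": ["geneB", "geneD"],
    "geneD": ["geneC"],
}
rwr_scores = random_walk_with_restart(toy_network, seeds={"geneA"})
```

    Nodes closer to the seed receive higher scores, which is the property that lets a seed set of methylation sites prioritize nearby genes in the network.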

  16. Interaction of Proteins Identified in Human Thyroid Cells

    PubMed Central

    Pietsch, Jessica; Riwaldt, Stefan; Bauer, Johann; Sickmann, Albert; Weber, Gerhard; Grosse, Jirka; Infanger, Manfred; Eilles, Christoph; Grimm, Daniela

    2013-01-01

    Influence of gravity forces on the regulation of protein expression by healthy and malignant thyroid cells was studied with the aim to identify protein interactions. Western blot analyses of a limited number of proteins suggested a time-dependent regulation of protein expression by simulated microgravity. After applying free flow isoelectric focusing and mass spectrometry to search for differently expressed proteins by thyroid cells exposed to simulated microgravity for three days, a considerable number of candidates for gravi-sensitive proteins were detected. In order to show how proteins sensitive to microgravity could directly influence other proteins, we investigated all polypeptide chains identified with Mascot scores above 100, looking for groups of interacting proteins. Hence, UniProtKB entry numbers of all detected proteins were entered into the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and processed. The program indicated that we had detected various groups of interacting proteins in each of the three cell lines studied. The major groups of interacting proteins play a role in pathways of carbohydrate and protein metabolism, regulation of cell growth and cell membrane structuring. Analyzing these groups, networks of interaction could be established which show how a punctual influence of simulated microgravity may propagate via various members of interaction chains. PMID:23303277

  17. Coexpression network based on natural variation in human gene expression reveals gene interactions and functions

    PubMed Central

    Nayak, Renuka R.; Kearns, Michael; Spielman, Richard S.; Cheung, Vivian G.

    2009-01-01

    Genes interact in networks to orchestrate cellular processes. Analysis of these networks provides insights into gene interactions and functions. Here, we took advantage of normal variation in human gene expression to infer gene networks, which we constructed using correlations in expression levels of more than 8.5 million gene pairs in immortalized B cells from three independent samples. The resulting networks allowed us to identify biological processes and gene functions. Among the biological pathways, we found processes such as translation and glycolysis that co-occur in the same subnetworks. We predicted the functions of poorly characterized genes, including CHCHD2 and TMEM111, and provided experimental evidence that TMEM111 is part of the endoplasmic reticulum-associated secretory pathway. We also found that IFIH1, a susceptibility gene of type 1 diabetes, interacts with YES1, which plays a role in glucose transport. Furthermore, genes that predispose to the same diseases are clustered nonrandomly in the coexpression network, suggesting that networks can provide candidate genes that influence disease susceptibility. Therefore, our analysis of gene coexpression networks offers information on the role of human genes in normal and disease processes. PMID:19797678
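
    Building a coexpression network from pairwise correlations can be sketched as below. This is a minimal illustration, assuming Pearson correlation and an arbitrary cutoff of 0.8 on its absolute value; the abstract does not state the correlation measure or threshold used, and the gene labels and expression values are made up for the example.

```python
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def coexpression_edges(expression, threshold=0.8):
    """Return (gene1, gene2, r) for every gene pair with |r| >= threshold.

    expression: dict mapping gene -> list of expression levels across individuals.
    """
    edges = []
    for g1, g2 in combinations(sorted(expression), 2):
        r = pearson(expression[g1], expression[g2])
        if abs(r) >= threshold:
            edges.append((g1, g2, r))
    return edges

# Hypothetical expression levels for three genes measured in five individuals.
toy_expression = {
    "gene1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "gene2": [2.1, 3.9, 6.2, 8.0, 9.8],
    "gene3": [5.0, 1.0, 4.0, 2.0, 3.0],
}
toy_edges = coexpression_edges(toy_expression)
```

    With millions of gene pairs, as in the study, the pairwise loop would normally be replaced by a vectorized correlation matrix, and edges would be required to replicate across independent samples.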

  18. Identifying differentially expressed genes in cancer patients using a non-parameter Ising model.

    PubMed

    Li, Xumeng; Feltus, Frank A; Sun, Xiaoqian; Wang, James Z; Luo, Feng

    2011-10-01

    Identification of genes and pathways involved in diseases and physiological conditions is a major task in systems biology. In this study, we developed a novel non-parameter Ising model to integrate protein-protein interaction network and microarray data for identifying differentially expressed (DE) genes. We also proposed a simulated annealing algorithm to find the optimal configuration of the Ising model. The Ising model was applied to two breast cancer microarray data sets. The results showed that more cancer-related DE sub-networks and genes were identified by the Ising model than those by the Markov random field model. Furthermore, cross-validation experiments showed that DE genes identified by Ising model can improve classification performance compared with DE genes identified by Markov random field model. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. Comparative genomics of free-living Gammaproteobacteria: pathogenesis-related genes or interaction-related genes?

    PubMed

    Vázquez-Rosas-Landa, Mirna; Ponce-Soto, Gabriel Yaxal; Eguiarte, Luis E; Souza, V

    2017-07-31

    Bacteria have numerous strategies to interact with themselves and with their environment, but genes associated with these interactions are usually cataloged as pathogenic. To understand the role that these genes have not only in pathogenesis but also in bacterial interactions, we compared the genomes of eight bacteria from human-impacted environments with those of free-living bacteria from the Cuatro Ciénegas Basin (CCB), a relatively pristine oligotrophic site. Fifty-one genomes from CCB bacteria, including Pseudomonas, Vibrio, Photobacterium and Aeromonas, were analyzed. We found that the CCB strains had several virulence-related genes, 15 of which were common to all strains and were related to flagella and chemotaxis. We also identified the presence of Type III and VI secretion systems, which leads us to propose that these systems play an important role in interactions among bacterial communities beyond pathogenesis. None of the CCB strains had pathogenicity islands, despite having genes associated with antibiotics. Integrons were rare, while CRISPR elements were common. The idea that pathogenicity-related genes in many cases form part of a wider strategy used by bacteria to interact with other organisms could help us to understand the role of pathogenicity-related elements in an ecological and evolutionary framework leading toward a more inclusive One Health concept. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  20. Identifying key genes associated with acute myocardial infarction

    PubMed Central

    Cheng, Ming; An, Shoukuan; Li, Junquan

    2017-01-01

    Background: This study aimed to identify key genes associated with acute myocardial infarction (AMI) by reanalyzing microarray data. Methods: Three gene expression profile datasets GSE66360, GSE34198, and GSE48060 were downloaded from GEO database. After data preprocessing, genes without heterogeneity across different platforms were subjected to differential expression analysis between the AMI group and the control group using metaDE package. P < .05 was used as the cutoff for a differentially expressed gene (DEG). The expression data matrices of DEGs were imported in ReactomeFIViz to construct a gene functional interaction (FI) network. Then, DEGs in each module were subjected to pathway enrichment analysis using DAVID. MiRNAs and transcription factors predicted to regulate target DEGs were identified. Quantitative real-time polymerase chain reaction (RT-PCR) was applied to verify the expression of genes. Results: A total of 913 upregulated genes and 1060 downregulated genes were identified in the AMI group. A FI network consists of 21 modules and DEGs in 12 modules were significantly enriched in pathways. The transcription factor-miRNA-gene network contains 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p. RT-PCR validations showed that expression levels of FOXO3 and MYBL2 were significantly increased in AMI, and expression levels of hsa-miR-21-5p and hsa-miR-30c-5p were obviously decreased in AMI. Conclusion: A total of 41 DEGs, such as SOCS3, VAPA, and COL5A2, are speculated to have roles in the pathogenesis of AMI; 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p may be involved in the regulation of the expression of these DEGs. PMID:29049183

  1. Direct protein interaction underlies gene-for-gene specificity and coevolution of the flax resistance genes and flax rust avirulence genes

    PubMed Central

    Dodds, Peter N.; Lawrence, Gregory J.; Catanzariti, Ann-Maree; Teh, Trazel; Wang, Ching-I. A.; Ayliffe, Michael A.; Kobe, Bostjan; Ellis, Jeffrey G.

    2006-01-01

    Plant resistance proteins (R proteins) recognize corresponding pathogen avirulence (Avr) proteins either indirectly through detection of changes in their host protein targets or through direct R–Avr protein interaction. Although indirect recognition imposes selection against Avr effector function, pathogen effector molecules recognized through direct interaction may overcome resistance through sequence diversification rather than loss of function. Here we show that the flax rust fungus AvrL567 genes, whose products are recognized by the L5, L6, and L7 R proteins of flax, are highly diverse, with 12 sequence variants identified from six rust strains. Seven AvrL567 variants derived from Avr alleles induce necrotic responses when expressed in flax plants containing corresponding resistance genes (R genes), whereas five variants from avr alleles do not. Differences in recognition specificity between AvrL567 variants and evidence for diversifying selection acting on these genes suggest they have been involved in a gene-specific arms race with the corresponding flax R genes. Yeast two-hybrid assays indicate that recognition is based on direct R–Avr protein interaction and recapitulate the interaction specificity observed in planta. Biochemical analysis of Escherichia coli-produced AvrL567 proteins shows that variants that escape recognition nevertheless maintain a conserved structure and stability, suggesting that the amino acid sequence differences directly affect the R–Avr protein interaction. We suggest that direct recognition associated with high genetic diversity at corresponding R and Avr gene loci represents an alternative outcome of plant–pathogen coevolution to indirect recognition associated with simple balanced polymorphisms for functional and nonfunctional R and Avr genes. PMID:16731621

  2. Influence of SNPs in nutrient-sensitive candidate genes and gene-diet interactions on blood lipids: the DiOGenes study.

    PubMed

    Brahe, Lena K; Ängquist, Lars; Larsen, Lesli H; Vimaleswaran, Karani S; Hager, Jörg; Viguerie, Nathalie; Loos, Ruth J F; Handjieva-Darlenska, Teodora; Jebb, Susan A; Hlavaty, Petr; Larsen, Thomas M; Martinez, J Alfredo; Papadaki, Angeliki; Pfeiffer, Andreas F H; van Baak, Marleen A; Sørensen, Thorkild I A; Holst, Claus; Langin, Dominique; Astrup, Arne; Saris, Wim H M

    2013-09-14

    Blood lipid response to a given dietary intervention could be determined by the effect of diet, gene variants or gene-diet interactions. The objective of the present study was to investigate whether variants in presumed nutrient-sensitive genes involved in lipid metabolism modified lipid profile after weight loss and in response to a given diet, among overweight European adults participating in the Diet Obesity and Genes study. By multiple linear regressions, 240 SNPs in twenty-four candidate genes were investigated for SNP main and SNP-diet interaction effects on total cholesterol, LDL-cholesterol, HDL-cholesterol and TAG after an 8-week low-energy diet (only main effect), and a 6-month ad libitum weight maintenance diet, with different contents of dietary protein or glycaemic index. After adjusting for multiple testing, a SNP-dietary protein interaction effect on TAG was identified for lipin 1 (LPIN1) rs4315495, with a change in TAG of −0.26 mmol/l per A-allele/protein unit (95% CI −0.38, −0.14, P=0.000043). In conclusion, we investigated SNP-diet interactions for blood lipid profiles for 240 SNPs in twenty-four candidate genes, selected for their involvement in lipid metabolism pathways, and identified one significant interaction between LPIN1 rs4315495 and dietary protein for TAG concentration.

  3. [Gene-gene interaction on central obesity in school-aged children in China].

    PubMed

    Fu, L W; Zhang, M X; Wu, L J; Gao, L W; Mi, J

    2017-07-10

    Objective: To investigate the possible contribution of 6 obesity-associated SNPs to central obesity and to examine whether the 6 SNPs interact in causing central obesity in school-aged children in China. Methods: A total of 3,502 school-aged children who were included in the Beijing Child and Adolescent Metabolic Syndrome (BCAMS) Study were selected, and based on the age- and sex-specific waist circumference (WC) standards in the BCAMS study, 1,196 central obese cases and 2,306 controls were identified. Genomic DNA was extracted from peripheral blood white cells using the salt fractionation method. A total of 6 single nucleotide polymorphisms (FTO rs9939609, MC4R rs17782313, BDNF rs6265, PCSK1 rs6235, SH2B1 rs4788102, and CSK rs1378942) were genotyped by TaqMan allelic discrimination assays with the GeneAmp 7900 sequence detection system (Applied Biosystems, Foster City, CA, USA). A logistic regression model was used to investigate the association between the 6 SNPs and central obesity. Gene-gene interactions among the 6 polymorphic loci were analyzed by using the Generalized Multifactor Dimensionality Reduction (GMDR) method, and then a logistic regression model was constructed to confirm the best combination of loci identified in the GMDR. Results: After adjusting for gender, age, Tanner stage, physical activity and family history of obesity, the FTO rs9939609-A, MC4R rs17782313-C and BDNF rs6265-G alleles were associated with central obesity under an additive genetic model (OR = 1.24, 95% CI: 1.06-1.45, P = 0.008; OR = 1.26, 95% CI: 1.11-1.43, P = 2.98×10^-4; OR = 1.18, 95% CI: 1.06-1.32, P = 0.003). GMDR analysis showed a significant gene-gene interaction between MC4R rs17782313 and BDNF rs6265 (P = 0.001). The best two-locus combination showed the cross-validation consistency of 10/10 and testing accuracy of 0.539. This interaction showed the maximum consistency and minimum prediction error among all gene-gene interaction models evaluated. Moreover, the

  4. Genome-wide assessment of gene-by-smoking interactions in COPD.

    PubMed

    Park, Boram; Koo, So-My; An, Jaehoon; Lee, MoonGyu; Kang, Hae Yeon; Qiao, Dandi; Cho, Michael H; Sung, Joohon; Silverman, Edwin K; Yang, Hyeon-Jong; Won, Sungho

    2018-06-18

    Cigarette smoke exposure is a major risk factor in chronic obstructive pulmonary disease (COPD) and its interactions with genetic variants could affect lung function. However, few gene-smoking interactions have been reported. In this report, we evaluated the effects of gene-smoking interactions on lung function using Korea Associated Resource (KARE) data with the spirometric variable forced expiratory volume in 1 s (FEV1). We found that the variation in FEV1 differed by smoking status. Thus, we considered a linear mixed model for association analysis under heteroscedasticity according to smoking status. We found a previously identified locus near SOX9 on chromosome 17 to be the most significant based on a joint test of the main and interaction effects of smoking. Smoking interactions were replicated with Gene-Environment of Interaction and phenotype (GENIE), Multi-Ethnic Study of Atherosclerosis-Lung (MESA-Lung), and COPDGene studies. We found that individuals with minor alleles, rs17765644, rs17178251, rs11870732, and rs4793541, tended to have lower FEV1 values, and lung function decreased much faster with age for smokers. There have been very few reports to replicate a common variant gene-smoking interaction, and our results revealed that statistical models for gene-smoking interaction analyses should be carefully selected.

  5. Three Approaches to Modeling Gene-Environment Interactions in Longitudinal Family Data: Gene-Smoking Interactions in Blood Pressure.

    PubMed

    Basson, Jacob; Sung, Yun Ju; de Las Fuentes, Lisa; Schwander, Karen L; Vazquez, Ana; Rao, Dabeeru C

    2016-01-01

    Blood pressure (BP) has been shown to be substantially heritable, yet identified genetic variants explain only a small fraction of the heritability. Gene-smoking interactions have detected novel BP loci in cross-sectional family data. Longitudinal family data are available and have additional promise to identify BP loci. However, this type of data presents unique analysis challenges. Although several methods for analyzing longitudinal family data are available, which method is the most appropriate and under what conditions has not been fully studied. Using data from three clinic visits from the Framingham Heart Study, we performed association analysis accounting for gene-smoking interactions in BP at 31,203 markers on chromosome 22. We evaluated three different modeling frameworks: generalized estimating equations (GEE), hierarchical linear modeling, and pedigree-based mixed modeling. The three models performed somewhat comparably, with multiple overlaps in the most strongly associated loci from each model. Loci with the greatest significance were more strongly supported in the longitudinal analyses than in any of the component single-visit analyses. The pedigree-based mixed model was more conservative, with less inflation in the variant main effect and greater deflation in the gene-smoking interactions. The GEE, but not the other two models, resulted in substantial inflation in the tail of the distribution when variants with minor allele frequency <1% were included in the analysis. The choice of analysis method should depend on the model and the structure and complexity of the familial and longitudinal data. © 2015 WILEY PERIODICALS, INC.

  6. Behavioral science and the study of gene-nutrition and gene-physical activity interactions in obesity research.

    PubMed

    Faith, Myles S

    2008-12-01

    This report summarizes emerging opportunities for behavioral science to help advance the field of gene-environment and gene-behavior interactions, based on presentations at The National Cancer Institute (NCI) Workshop, "Gene-Nutrition and Gene-Physical Activity Interactions in the Etiology of Obesity." Three opportunities are highlighted: (i) designing potent behavioral "challenges" in experiments, (ii) determining viable behavioral phenotypes for genetics studies, and (iii) identifying specific measures of the environment or environmental exposures. Additional points are underscored, including the need to incorporate novel findings from neuroimaging studies regarding motivation and drive for eating and physical activity. Advances in behavioral science theory and methods can play an important role in advancing understanding of gene-brain-behavior relationships in obesity onset.

  7. Promoter-enhancer interactions identified from Hi-C data using probabilistic models and hierarchical topological domains.

    PubMed

    Ron, Gil; Globerson, Yuval; Moran, Dror; Kaplan, Tommy

    2017-12-21

    Proximity-ligation methods such as Hi-C allow us to map physical DNA-DNA interactions along the genome, and reveal its organization into topologically associating domains (TADs). As the Hi-C data accumulate, computational methods were developed for identifying domain borders in multiple cell types and organisms. Here, we present PSYCHIC, a computational approach for analyzing Hi-C data and identifying promoter-enhancer interactions. We use a unified probabilistic model to segment the genome into domains, which we then merge hierarchically and fit using a local background model, allowing us to identify over-represented DNA-DNA interactions across the genome. By analyzing the published Hi-C data sets in human and mouse, we identify hundreds of thousands of putative enhancers and their target genes, and compile an extensive genome-wide catalog of gene regulation in human and mouse. As we show, our predictions are highly enriched for ChIP-seq and DNA accessibility data, evolutionary conservation, eQTLs and other DNA-DNA interaction data.

  8. A vitamin D pathway gene-gene interaction affects low-density lipoprotein cholesterol levels.

    PubMed

    Grave, Nathália; Tovo-Rodrigues, Luciana; da Silveira, Janaína; Rovaris, Diego Luiz; Dal Bosco, Simone Morelo; Contini, Verônica; Genro, Júlia Pasqualini

    2016-12-01

    Much evidence suggests an association between vitamin D deficiency and chronic diseases such as obesity and dyslipidemia. Although genetic factors play an important role in the etiology of these diseases, only a few studies have investigated the relationship between vitamin D-related genes and anthropometric and lipid profiles. The aim of this study was to investigate the association of three vitamin D-related genes with anthropometric and lipid parameters in 542 adult individuals. We analyzed the rs2228570 polymorphism in the vitamin D receptor gene (VDR), rs2134095 in the retinoid X receptor gamma gene (RXRG) and rs7041 in the vitamin D-binding protein gene (GC). Polymorphisms were genotyped by TaqMan allelic discrimination. Gene-gene interactions were evaluated by the general linear model. The functionality of the polymorphisms was investigated using the following predictors and databases: SIFT (Sorting Intolerant from Tolerant), PolyPhen-2 (Polymorphism Phenotyping v2) and Human Splicing Finder 3. We identified a significant effect of the interaction between RXRG (rs2134095) and GC (rs7041) on low-density lipoprotein cholesterol (LDL-c) levels (P=.005). Furthermore, our in silico analysis suggested a functional role for both variants in the regulation of the gene products. Our results suggest that the vitamin D-related genes RXRG and GC affect LDL-c levels. These findings are in agreement with other studies that consistently associate vitamin D and lipid profile. Together, our results corroborate the idea that analyzing gene-gene interaction would be helpful to clarify the genetic component of lipid profile. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Incorporating gene-environment interaction in testing for association with rare genetic variants.

    PubMed

    Chen, Han; Meigs, James B; Dupuis, Josée

    2014-01-01

    The incorporation of gene-environment interactions could improve the ability to detect genetic associations with complex traits. For common genetic variants, single-marker interaction tests and joint tests of genetic main effects and gene-environment interaction have been well-established and used to identify novel association loci for complex diseases and continuous traits. For rare genetic variants, however, single-marker tests are severely underpowered due to the low minor allele frequency, and only a few gene-environment interaction tests have been developed. We aimed at developing powerful and computationally efficient tests for gene-environment interaction with rare variants. In this paper, we propose interaction and joint tests for testing gene-environment interaction of rare genetic variants. Our approach is a generalization of existing gene-environment interaction tests for multiple genetic variants under certain conditions. We show in our simulation studies that our interaction and joint tests have correct type I errors, and that the joint test is a powerful approach for testing genetic association, allowing for gene-environment interaction. We also illustrate our approach in a real data example from the Framingham Heart Study. Our approach can be applied to both binary and continuous traits; it is powerful and computationally efficient.

  10. Allelic-based gene-gene interaction associated with quantitative traits.

    PubMed

    Jung, Jeesun; Sun, Bin; Kwon, Deukwoo; Koller, Daniel L; Foroud, Tatiana M

    2009-05-01

    Recent studies have shown that quantitative phenotypes may be influenced not only by multiple single nucleotide polymorphisms (SNPs) within a gene but also by the interaction between SNPs at unlinked genes. We propose a new statistical approach that can detect gene-gene interactions at the allelic level which contribute to the phenotypic variation in a quantitative trait. By testing for the association of allelic combinations at multiple unlinked loci with a quantitative trait, we can detect the SNP allelic interaction whether or not it can be detected as a main effect. Our proposed method assigns a score to unrelated subjects according to their allelic combination inferred from observed genotypes at two or more unlinked SNPs, and then tests for the association of the allelic score with a quantitative trait. To investigate the statistical properties of the proposed method, we performed a simulation study to estimate type I error rates and power and demonstrated that this allelic approach achieves greater power than the more commonly used genotypic approach to test for gene-gene interaction. As an example, the proposed method was applied to data obtained as part of a candidate gene study of sodium retention by the kidney. We found that this method detects an interaction between the calcium-sensing receptor gene (CaSR), the chloride channel gene (CLCNKB) and the Na, K, 2Cl cotransporter gene (CLC12A1) that contributes to variation in diastolic blood pressure.

  11. A Nonlinear Model for Gene-Based Gene-Environment Interaction.

    PubMed

    Sa, Jian; Liu, Xu; He, Tao; Liu, Guifen; Cui, Yuehua

    2016-06-04

    A vast amount of literature has confirmed the role of gene-environment (G×E) interaction in the etiology of complex human diseases. Traditional methods are predominantly focused on the analysis of interaction between a single nucleotide polymorphism (SNP) and an environmental variable. Given that genes are the functional units, it is crucial to understand how gene effects (rather than single SNP effects) are influenced by an environmental variable to affect disease risk. Motivated by the increasing awareness of the power of gene-based association analysis over single-variant-based approaches, in this work we proposed a sparse principal component regression (sPCR) model to understand the gene-based G×E interaction effect on complex disease. We first extracted the sparse principal components for SNPs in a gene, then the effect of each principal component was modeled by a varying-coefficient (VC) model. The model can jointly model variants in a gene whose effects are nonlinearly influenced by an environmental variable. In addition, the varying-coefficient sPCR (VC-sPCR) model is easy to interpret, since the sparsity of the principal component loadings indicates the relative importance of the corresponding SNPs in each component. We applied our method to a human birth weight dataset in a Thai population. We analyzed 12,005 genes across 22 chromosomes and found one significant interaction effect using the Bonferroni correction method and one suggestive interaction. The model performance was further evaluated through simulation studies. Our model provides a system approach to evaluate gene-based G×E interaction.

  12. A Critical Look at Entropy-Based Gene-Gene Interaction Measures.

    PubMed

    Lee, Woojoo; Sjölander, Arvid; Pawitan, Yudi

    2016-07-01

    Several entropy-based measures for detecting gene-gene interaction have been proposed recently. It has been argued that the entropy-based measures are preferred because entropy can better capture the nonlinear relationships between genotypes and traits, so they can be useful to detect gene-gene interactions for complex diseases. These suggested measures look reasonable at an intuitive level, but so far there has been no detailed characterization of the interactions captured by them. Here we study analytically, and in detail, the properties of some entropy-based measures for detecting gene-gene interactions. The relationship between interactions captured by the entropy-based measures and those of logistic regression models is clarified. In general we find that the entropy-based measures can suffer from a lack of specificity in terms of target parameters, i.e., they can detect uninteresting signals as interactions. Numerical studies are carried out to confirm theoretical findings. © 2016 WILEY PERIODICALS, INC.
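
    To make the idea of an entropy-based interaction measure concrete, here is a minimal sketch of one widely used form, the interaction information (information gain) IG(A;B;Y) = I(A,B;Y) - I(A;Y) - I(B;Y), computed from empirical frequencies. The abstract does not say which measures the authors analyze, so the choice of this measure, the function names, and the toy data are assumptions made for illustration.

```python
from collections import Counter
from math import log2


def entropy(samples):
    """Shannon entropy in bits of the empirical distribution of `samples`."""
    n = len(samples)
    return -sum((c / n) * log2(c / n) for c in Counter(samples).values())


def mutual_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y), estimated from paired samples."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


def interaction_information(a, b, y):
    """IG(A;B;Y): information about y carried jointly by (a, b) beyond
    what a and b carry individually."""
    joint = list(zip(a, b))
    return (mutual_information(joint, y)
            - mutual_information(a, y)
            - mutual_information(b, y))


# Toy example: the trait is the XOR of two binary genotypes, so neither
# locus is informative alone but the pair determines the trait.
locus_a = [0, 0, 1, 1]
locus_b = [0, 1, 0, 1]
trait = [0, 1, 1, 0]
ig_xor = interaction_information(locus_a, locus_b, trait)
```

    In the XOR example all of the information sits in the joint term. The abstract's caution is that a nonzero value of such a measure need not correspond to an interaction parameter of interest in a logistic regression model.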

  13. Identification of fever and vaccine-associated gene interaction networks using ontology-based literature mining

    PubMed Central

    2012-01-01

    Background Fever is one of the most common adverse events of vaccines. The detailed mechanisms of fever and vaccine-associated gene interaction networks are not fully understood. In the present study, we employed a genome-wide, Centrality and Ontology-based Network Discovery using Literature data (CONDL) approach to analyse the genes and gene interaction networks associated with fever or vaccine-related fever responses. Results Over 170,000 fever-related articles from PubMed abstracts and titles were retrieved and analysed at the sentence level using natural language processing techniques to identify genes and vaccines (including 186 Vaccine Ontology terms) as well as their interactions. This resulted in a generic fever network consisting of 403 genes and 577 gene interactions. A vaccine-specific fever sub-network consisting of 29 genes and 28 gene interactions was extracted from articles that are related to both fever and vaccines. In addition, gene-vaccine interactions were identified. Vaccines (including 4 specific vaccine names) were found to directly interact with 26 genes. Gene set enrichment analysis was performed using the genes in the generated interaction networks. Moreover, the genes in these networks were prioritized using network centrality metrics. Making scientific discoveries and generating new hypotheses were possible by using network centrality and gene set enrichment analyses. For example, our study found that the genes in the generic fever network were more enriched in cell death and responses to wounding, and the vaccine sub-network had more gene enrichment in leukocyte activation and phosphorylation regulation. The most central genes in the vaccine-specific fever network are predicted to be highly relevant to vaccine-induced fever, whereas genes that are central only in the generic fever network are likely to be highly relevant to generic fever responses. 
Interestingly, no Toll-like receptors (TLRs) were found in the gene-vaccine interaction

  14. NIH Researchers Identify OCD Risk Gene

    MedlinePlus

    Summer 2006

    ... and Alcoholism (NIAAA) have identified a previously unknown gene variant that doubles an individual's risk for obsessive- ...

  15. Systems approach identifies an organic nitrogen-responsive gene network that is regulated by the master clock control gene CCA1.

    PubMed

    Gutiérrez, Rodrigo A; Stokes, Trevor L; Thum, Karen; Xu, Xiaodong; Obertello, Mariana; Katari, Manpreet S; Tanurdzic, Milos; Dean, Alexis; Nero, Damion C; McClung, C Robertson; Coruzzi, Gloria M

    2008-03-25

    Understanding how nutrients affect gene expression will help us to understand the mechanisms controlling plant growth and development as a function of nutrient availability. Nitrate has been shown to serve as a signal for the control of gene expression in Arabidopsis. There is also evidence, on a gene-by-gene basis, that downstream products of nitrogen (N) assimilation such as glutamate (Glu) or glutamine (Gln) might serve as signals of organic N status that in turn regulate gene expression. To identify genome-wide responses to such organic N signals, Arabidopsis seedlings were transiently treated with ammonium nitrate in the presence or absence of MSX, an inhibitor of glutamine synthetase, resulting in a block of Glu/Gln synthesis. Genes that responded to organic N were identified as those whose response to ammonium nitrate treatment was blocked in the presence of MSX. We showed that some genes previously identified to be regulated by nitrate are under the control of an organic N-metabolite. Using an integrated network model of molecular interactions, we uncovered a subnetwork regulated by organic N that included CCA1 and target genes involved in N-assimilation. We validated some of the predicted interactions and showed that regulation of the master clock control gene CCA1 by Glu or a Glu-derived metabolite in turn regulates the expression of key N-assimilatory genes. Phase response curve analysis shows that distinct N-metabolites can advance or delay the CCA1 phase. Regulation of CCA1 by organic N signals may represent a novel input mechanism for N-nutrients to affect plant circadian clock function.

  16. Learning Petri net models of non-linear gene interactions.

    PubMed

    Mayo, Michael

    2005-10-01

    Understanding how an individual's genetic make-up influences their risk of disease is a problem of paramount importance. Although machine-learning techniques are able to uncover the relationships between genotype and disease, the problem of automatically building the best biochemical model or "explanation" of the relationship has received less attention. In this paper, I describe a method based on random hill climbing that automatically builds Petri net models of non-linear (or multi-factorial) disease-causing gene-gene interactions. Petri nets are a suitable formalism for this problem, because they are used to model concurrent, dynamic processes analogous to biochemical reaction networks. I show that this method is routinely able to identify perfect Petri net models for three disease-causing gene-gene interactions recently reported in the literature.

  17. Gene-environment interactions in atherosclerosis.

    PubMed

    Hegele, R A

    1991-06-01

    It is becoming clear that genetic and environmental factors can interact to varying degrees in a given individual. In some cases, genetically determined resistance to CAD (eg, genetic hyperalpha- or hypobetalipoproteinemia), or genetically determined susceptibility to CAD (eg, high Lp[a] levels) may not be significantly modulated by a prudent lifestyle. Estimates of the prevalence in the general population of these genetic extremes average around 5% (4). In the remaining 95% of cases, nature and nurture interact. For example, a genetic flaw that is usually expressed phenotypically as premature death due to CAD (eg, some cases of FH) can be ameliorated by a prudent diet. There is little doubt that an individual's responsiveness to environmental factors can be determined by many different genes. The exact candidate genes and the nature of most of the genetic changes affecting response to diet still need to be determined. Once identified, they may one day form the basis for early diagnosis of metabolic problems and individually tailored diet and drug treatment programs.

  18. Semantic integration to identify overlapping functional modules in protein interaction networks

    PubMed Central

    Cho, Young-Rae; Hwang, Woochang; Ramanathan, Murali; Zhang, Aidong

    2007-01-01

    Background The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms. Results We have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO) annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches. Conclusion The integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification. PMID:17650343

  19. Identifying the genes of unconventional high temperature superconductors.

    PubMed

    Hu, Jiangping

    We elucidate a recently emergent framework for unifying the two families of high temperature (high Tc) superconductors, cuprates and iron-based superconductors. The unification suggests that the latter is simply the counterpart of the former to realize robust extended s-wave pairing symmetries in a square lattice. The unification identifies that the key ingredient (gene) of high Tc superconductors is a quasi two dimensional electronic environment in which the d-orbitals of cations that participate in strong in-plane couplings to the p-orbitals of anions are isolated near Fermi energy. With this gene, the superexchange magnetic interactions mediated by anions could maximize their contributions to superconductivity. Creating the gene requires special arrangements between local electronic structures and crystal lattice structures. The speciality explains why high Tc superconductors are so rare. An explicit prediction is made to realize high Tc superconductivity in Co/Ni-based materials with a quasi two dimensional hexagonal lattice structure formed by trigonal bipyramidal complexes.

  20. Coalitional game theory as a promising approach to identify candidate autism genes.

    PubMed

    Gupta, Anika; Sun, Min Woo; Paskov, Kelley Marie; Stockham, Nate Tyler; Jung, Jae-Yoon; Wall, Dennis Paul

    2018-01-01

    Despite mounting evidence for the strong role of genetics in the phenotypic manifestation of Autism Spectrum Disorder (ASD), the specific genes responsible for the variable forms of ASD remain undefined. ASD may be best explained by a combinatorial genetic model with varying epistatic interactions across many small effect mutations. Coalitional or cooperative game theory is a technique that studies the combined effects of groups of players, known as coalitions, seeking to identify players who tend to improve the performance--the relationship to a specific disease phenotype--of any coalition they join. This method has been previously shown to boost biologically informative signal in gene expression data but to-date has not been applied to the search for cooperative mutations among putative ASD genes. We describe our approach to highlight genes relevant to ASD using coalitional game theory on alteration data of 1,965 fully sequenced genomes from 756 multiplex families. Alterations were encoded into binary matrices for ASD (case) and unaffected (control) samples, indicating likely gene-disrupting, inherited mutations in altered genes. To determine individual gene contributions given an ASD phenotype, a "player" metric, referred to as the Shapley value, was calculated for each gene in the case and control cohorts. Sixty seven genes were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Using network and cross-study analysis, we found that these genes are involved in biological pathways known to be affected in the autism cases and that a subset directly interact with several genes known to have strong associations to autism. These findings suggest that coalitional game theory can be applied to large-scale genomic data to identify hidden yet influential players in complex polygenic disorders such as autism.
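
    The Shapley value mentioned above can be sketched for a very small game by averaging each player's marginal contribution over all orderings. This is only an illustration of the definition, not the authors' pipeline: exact enumeration is feasible only for a handful of players, and the gene labels and the characteristic function `toy_value` are hypothetical.

```python
from itertools import permutations


def shapley_values(players, value):
    """Exact Shapley values by enumerating every ordering of the players.

    `value` maps a frozenset of players (a coalition) to a number.
    The cost grows factorially, so this is for toy examples only.
    """
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            # Marginal contribution of p when joining the current coalition.
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    return {p: total / len(orderings) for p, total in totals.items()}


def toy_value(coalition):
    """Hypothetical game: a coalition scores 1 only if it contains both
    geneA and geneB (a purely cooperative effect); geneC adds nothing."""
    return 1.0 if {"geneA", "geneB"} <= coalition else 0.0


scores = shapley_values(["geneA", "geneB", "geneC"], toy_value)
```

    In this toy game the two cooperating genes split the credit equally and the third gene receives none, which is the sense in which the value highlights players that improve the coalitions they join.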

  1. Gene network biological validity based on gene-gene interaction relevance.

    PubMed

    Gómez-Vela, Francisco; Díaz-Díaz, Norberto

    2014-01-01

    In recent years, gene networks have become one of the most useful tools for modeling biological processes. Many gene network inference algorithms have been developed as techniques for extracting knowledge from gene expression data. Ensuring the reliability of the inferred gene relationships is a crucial task in any study in order to prove that the algorithms used are precise. Usually, this validation process can be carried out using prior biological knowledge. The metabolic pathways stored in KEGG are one of the most widely used knowledge sources for analyzing relationships between genes. This paper introduces a new methodology, GeneNetVal, to assess the biological validity of gene networks based on the relevance of the gene-gene interactions stored in KEGG metabolic pathways. Hence, a complete KEGG pathway conversion into a gene association network and a new matching distance based on gene-gene interaction relevance are proposed. The performance of GeneNetVal was established with three different experiments. Firstly, our proposal is tested in a comparative ROC analysis. Secondly, a randomness study is presented to show the behavior of GeneNetVal when the noise is increased in the input network. Finally, the ability of GeneNetVal to detect biological functionality of the network is shown.

  2. Transposon tagging of genes for cell-cell interactions in Myxococcus xanthus.

    PubMed Central

    Kalos, M; Zissler, J

    1990-01-01

    The prokaryote Myxococcus xanthus is a model for cell interactions important in multicellular behavior. We used the transposon TnphoA to specifically identify genes for cell-surface factors involved in cell interactions. From a library of 10,700 insertions of TnphoA, we isolated 36 that produced alkaline phosphatase activity. Three TnphoA insertions tagged cell motility genes, called cgl, which control the adventurous movement of cells. The products of the tagged cgl genes could function in trans upon other cells and were localized primarily in the cell envelope and extracellular space, consistent with TnphoA tagging genes for extracellular factors controlling motility. PMID:2172982

  3. Antioxidant Defense Enzyme Genes and Asthma Susceptibility: Gender-Specific Effects and Heterogeneity in Gene-Gene Interactions between Pathogenetic Variants of the Disease

    PubMed Central

    Polonikov, Alexey V.; Ivanov, Vladimir P.; Bogomazov, Alexey D.; Freidin, Maxim B.; Illig, Thomas; Solodilova, Maria A.

    2014-01-01

    Oxidative stress resulting from an increased amount of reactive oxygen species and an imbalance between oxidants and antioxidants plays an important role in the pathogenesis of asthma. The present study tested the hypothesis that genetic susceptibility to allergic and nonallergic variants of asthma is determined by complex interactions between genes encoding antioxidant defense enzymes (ADE). We carried out a comprehensive analysis of the associations between adult asthma and 46 single nucleotide polymorphisms of 34 ADE genes and 12 other candidate genes of asthma in a Russian population using set association analysis and multifactor dimensionality reduction approaches. We found for the first time epistatic interactions between ADE genes underlying asthma susceptibility and the genetic heterogeneity between allergic and nonallergic variants of the disease. We identified GSR (glutathione reductase) and PON2 (paraoxonase 2) as novel candidate genes for asthma susceptibility. We observed gender-specific effects of ADE genes on the risk of asthma. The results of the study demonstrate the complexity and diversity of interactions between genes involved in oxidative stress underlying susceptibility to allergic and nonallergic asthma. PMID:24895604

  4. Using the principle of entropy maximization to infer genetic interaction networks from gene expression patterns

    PubMed Central

    Lezon, Timothy R.; Banavar, Jayanth R.; Cieplak, Marek; Maritan, Amos; Fedoroff, Nina V.

    2006-01-01

    We describe a method based on the principle of entropy maximization to identify the gene interaction network with the highest probability of giving rise to experimentally observed transcript profiles. In its simplest form, the method yields the pairwise gene interaction network, but it can also be extended to deduce higher-order interactions. Analysis of microarray data from genes in Saccharomyces cerevisiae chemostat cultures exhibiting energy metabolic oscillations identifies a gene interaction network that reflects the intracellular communication pathways that adjust cellular metabolic activity and cell division to the limiting nutrient conditions that trigger metabolic oscillations. The success of the present approach in extracting meaningful genetic connections suggests that the maximum entropy principle is a useful concept for understanding living systems, as it is for other complex, nonequilibrium systems. PMID:17138668

  5. Using the principle of entropy maximization to infer genetic interaction networks from gene expression patterns.

    PubMed

    Lezon, Timothy R; Banavar, Jayanth R; Cieplak, Marek; Maritan, Amos; Fedoroff, Nina V

    2006-12-12

    We describe a method based on the principle of entropy maximization to identify the gene interaction network with the highest probability of giving rise to experimentally observed transcript profiles. In its simplest form, the method yields the pairwise gene interaction network, but it can also be extended to deduce higher-order interactions. Analysis of microarray data from genes in Saccharomyces cerevisiae chemostat cultures exhibiting energy metabolic oscillations identifies a gene interaction network that reflects the intracellular communication pathways that adjust cellular metabolic activity and cell division to the limiting nutrient conditions that trigger metabolic oscillations. The success of the present approach in extracting meaningful genetic connections suggests that the maximum entropy principle is a useful concept for understanding living systems, as it is for other complex, nonequilibrium systems.

  6. Why study gene-environment interactions?

    USDA-ARS's Scientific Manuscript database

    PURPOSE OF REVIEW: We examine the reasons for investigating gene-environment interactions and address recent reports evaluating interactions between genes and environmental modulators in relation to cardiovascular disease and its common risk factors. RECENT FINDINGS: Studies focusing on smoking, phy...

  7. Detecting recurrent gene mutation in interaction network context using multi-scale graph diffusion.

    PubMed

    Babaei, Sepideh; Hulsman, Marc; Reinders, Marcel; de Ridder, Jeroen

    2013-01-23

    Delineating the molecular drivers of cancer, i.e. determining cancer genes and the pathways which they deregulate, is an important challenge in cancer research. In this study, we aim to identify pathways of frequently mutated genes by exploiting their network neighborhood encoded in the protein-protein interaction network. To this end, we introduce a multi-scale diffusion kernel and apply it to a large collection of murine retroviral insertional mutagenesis data. The diffusion strength plays the role of scale parameter, determining the size of the network neighborhood that is taken into account. As a result, in addition to detecting genes with frequent mutations in their genomic vicinity, we find genes that harbor frequent mutations in their interaction network context. We identify densely connected components of known and putatively novel cancer genes and demonstrate that they are strongly enriched for cancer related pathways across the diffusion scales. Moreover, the mutations in the clusters exhibit a significant pattern of mutual exclusion, supporting the conjecture that such genes are functionally linked. Using multi-scale diffusion kernel, various infrequently mutated genes are found to harbor significant numbers of mutations in their interaction network neighborhood. Many of them are well-known cancer genes. The results demonstrate the importance of defining recurrent mutations while taking into account the interaction network context. Importantly, the putative cancer genes and networks detected in this study are found to be significant at different diffusion scales, confirming the necessity of a multi-scale analysis.
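
    The role of the diffusion strength as a scale parameter can be sketched with a standard graph diffusion kernel, K = exp(-beta * L), where L is the graph Laplacian. The abstract does not give the exact form of the authors' multi-scale kernel, so this formula, the Taylor-series evaluation, the function names, and the toy network are assumptions made for illustration.

```python
def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def diffusion_kernel(adjacency, beta, terms=30):
    """Approximate exp(-beta * L) with a truncated Taylor series.

    `adjacency` is a symmetric 0/1 matrix without self-loops and L is the
    combinatorial Laplacian (degree minus adjacency). Small beta keeps the
    signal local; larger beta spreads it over a wider neighborhood.
    The truncated series is adequate only for small graphs and modest beta.
    """
    n = len(adjacency)
    m = [[-beta * ((sum(adjacency[i]) if i == j else 0) - adjacency[i][j])
          for j in range(n)] for i in range(n)]
    identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    result = [row[:] for row in identity]
    term = identity
    for k in range(1, terms + 1):
        # term becomes m^k / k!
        term = [[x / k for x in row] for row in matmul(term, m)]
        result = [[r + t for r, t in zip(r_row, t_row)]
                  for r_row, t_row in zip(result, term)]
    return result


def diffuse(kernel, counts):
    """Smooth per-gene mutation counts over the network."""
    return [sum(k * c for k, c in zip(row, counts)) for row in kernel]


# Toy path network gene0 - gene1 - gene2, with mutations only in gene0.
toy_adjacency = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
local_scores = diffuse(diffusion_kernel(toy_adjacency, 0.1), [10, 0, 0])
broad_scores = diffuse(diffusion_kernel(toy_adjacency, 1.0), [10, 0, 0])
```

    Comparing `local_scores` with `broad_scores` shows the multi-scale idea: at a larger diffusion strength, more of the mutation signal reaches genes further away in the network, while the total signal is conserved.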

  8. ReliefSeq: A Gene-Wise Adaptive-K Nearest-Neighbor Feature Selection Tool for Finding Gene-Gene Interactions and Main Effects in mRNA-Seq Gene Expression Data

    PubMed Central

    McKinney, Brett A.; White, Bill C.; Grill, Diane E.; Li, Peter W.; Kennedy, Richard B.; Poland, Gregory A.; Oberg, Ann L.

    2013-01-01

    Relief-F is a nonparametric, nearest-neighbor machine learning method that has been successfully used to identify relevant variables that may interact in complex multivariate models to explain phenotypic variation. While several tools have been developed for assessing differential expression in sequence-based transcriptomics, the detection of statistical interactions between transcripts has received less attention in the area of RNA-seq analysis. We describe a new extension and assessment of Relief-F for feature selection in RNA-seq data. The ReliefSeq implementation adapts the number of nearest neighbors (k) for each gene to optimize the Relief-F test statistics (importance scores) for finding both main effects and interactions. We compare this gene-wise adaptive-k (gwak) Relief-F method with standard RNA-seq feature selection tools, such as DESeq and edgeR, and with the popular machine learning method Random Forests. We demonstrate performance on a panel of simulated data that have a range of distributional properties reflected in real mRNA-seq data including multiple transcripts with varying sizes of main effects and interaction effects. For simulated main effects, gwak-Relief-F feature selection performs comparably to standard tools DESeq and edgeR for ranking relevant transcripts. For gene-gene interactions, gwak-Relief-F outperforms all comparison methods at ranking relevant genes in all but the highest fold change/highest signal situations where it performs similarly. The gwak-Relief-F algorithm outperforms Random Forests for detecting relevant genes in all simulation experiments. In addition, Relief-F is comparable to the other methods based on computational time. We also apply ReliefSeq to an RNA-Seq study of smallpox vaccine to identify gene expression changes between vaccinia virus-stimulated and unstimulated samples. 
ReliefSeq is an attractive tool for inclusion in the suite of tools used for analysis of mRNA-Seq data; it has power to detect both main

  9. Gene function prediction with gene interaction networks: a context graph kernel approach.

    PubMed

    Li, Xin; Chen, Hsinchun; Li, Jiexun; Zhang, Zhu

    2010-01-01

    Predicting gene functions is a challenge for biologists in the postgenomic era. Interactions among genes and their products compose networks that can be used to infer gene functions. Most previous studies adopt a linkage assumption, i.e., they assume that gene interactions indicate functional similarities between connected genes. In this study, we propose to use a gene's context graph, i.e., the gene interaction network associated with the focal gene, to infer its functions. In a kernel-based machine-learning framework, we design a context graph kernel to capture the information in context graphs. Our experimental study on a testbed of p53-related genes demonstrates the advantage of using indirect gene interactions and shows the empirical superiority of the proposed approach over linkage-assumption-based methods, such as the algorithm to minimize inconsistent connected genes and diffusion kernels.

  10. Identifying genes involved in the interaction of Aggregatibacter actinomycetemcomitans with Maillard reaction products (MRP)

    NASA Astrophysics Data System (ADS)

    Jaha, Raniah Abdulmohsen

    Aggregatibacter (Actinobacillus) actinomycetemcomitans is a gram-negative bacterium and a facultative anaerobe that can grow in either aerobic or anaerobic conditions. The bacteria cause localized aggressive periodontitis, which can result in the loss of teeth, and endocarditis, an infection of the heart valves. A rich medium is an essential requirement for their growth. There are some difficulties associated with growing the bacteria, as they easily switch from the rough to the smooth phenotype under no specific conditions. The bacteria start to lose viability after about 19 hours of growth in broth or about three days on plates. Colonies in the dense part of the streak on plates die earlier. It was shown that acid secreted by the colonies is responsible for the loss of viability, as the bacteria are extremely sensitive to low pH. Autoclaving the growth medium for A. actinomycetemcomitans causes the bacteria to grow slowly because of the formation of Maillard reaction products (MRPs). A method has been developed to make the A. actinomycetemcomitans growth medium using the microwave instead of the autoclave. This method produces much less of the inhibitory product since the heating time is only six minutes, compared to more than an hour when using the autoclave. Two approaches were sought in this research. The first approach was the identification of genes responsible for the interaction between the MRP and A. actinomycetemcomitans. The gene responsible for this interaction was found to be a Lys M protein which is found in many genes responsible for the cell wall integrity. The second approach was to develop a new drug made of glucose and lysine with a minimum inhibitory concentration of 75 mM.

  11. Leveraging Gene-Environment Interactions and Endotypes for Asthma Gene Discovery

    PubMed Central

    Bønnelykke, Klaus; Ober, Carole

    2016-01-01

    Asthma is a heterogeneous clinical syndrome that includes subtypes of disease with different underlying causes and disease mechanisms. Asthma is caused by a complex interaction between genes and environmental exposures; early-life exposures in particular play an important role. Asthma is also heritable, and a number of susceptibility variants have been discovered in genome-wide association studies, although the known risk alleles explain only a small proportion of the heritability. In this review, we present evidence supporting the hypothesis that focusing on more specific asthma phenotypes, such as childhood asthma with severe exacerbations, and on relevant exposures that are involved in gene-environment interactions (GEIs), such as rhinovirus infections, will improve detection of asthma genes and our understanding of the underlying mechanisms. We will discuss the challenges of considering GEIs and the advantages of studying responses to asthma-associated exposures in clinical birth cohorts, as well as in cell models of GEIs, to dissect the context-specific nature of genotypic risks, to prioritize variants in genome-wide association studies, and to identify pathways involved in pathogenesis in subgroups of patients. We propose that such approaches, in spite of their many challenges, present great opportunities for better understanding of asthma pathogenesis and heterogeneity and, ultimately, for improving prevention and treatment of disease. PMID:26947980

  12. Gene-environment interaction and suicidal behavior.

    PubMed

    Roy, Alec; Sarchiopone, Marco; Carli, Vladimir

    2009-07-01

    Studies have increasingly shown that gene-environment interactions are important in psychiatry. Suicidal behavior is a major public health problem. Suicide is generally considered to be a multi-determined act involving various areas of proximal and distal risk. Genetic risk factors are estimated to account for approximately 30% to 40% of the variance in suicidal behavior. In this article, the authors review relevant studies concerning the interaction between the serotonin transporter gene and environmental variables as a model of gene-environment interactions that may have an impact on suicidal behavior. The findings reviewed here suggest that there may be meaningful interactions between distal and proximal suicide risk factors that may amplify the risk of suicidal behavior. Future studies of suicidal behavior should examine both genetic and environmental variables and examine for gene-environment interactions.

  13. Boosting for detection of gene-environment interactions.

    PubMed

    Pashova, H; LeBlanc, M; Kooperberg, C

    2013-01-30

    In genetic association studies, it is typically thought that genetic variants and environmental variables jointly will explain more of the inheritance of a phenotype than either of these two components separately. Traditional methods to identify gene-environment interactions typically consider only one measured environmental variable at a time. However, in practice, multiple environmental factors may each be imprecise surrogates for the underlying physiological process that actually interacts with the genetic factors. In this paper, we develop a variant of L(2) boosting that is specifically designed to identify combinations of environmental variables that jointly modify the effect of a gene on a phenotype. Because the effect modifiers might have a small signal compared with the main effects, working in a space that is orthogonal to the main predictors allows us to focus on the interaction space. In a simulation study that investigates some plausible underlying model assumptions, our method outperforms the least absolute shrinkage and selection operator and the Akaike Information Criterion and Bayesian Information Criterion model selection procedures, having the lowest test error. In an example for the Women's Health Initiative-Population Architecture using Genomics and Epidemiology study, the dedicated boosting method was able to pick out two single-nucleotide polymorphisms for which effect modification appears present. The performance was evaluated on an independent test set, and the results are promising. Copyright © 2012 John Wiley & Sons, Ltd.
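
    The idea of working in a space orthogonal to the main predictors can be illustrated with a minimal sketch. This is not the authors' boosting procedure; it only shows, on simulated data, how a gene-by-environment product term can be stripped of everything explained by an intercept and the two main effects. All names (`project_out`, `orthogonal_basis`, the simulated `g` and `e`) are hypothetical.

```python
import random

def dot(u, v):
    """Plain dot product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))

def project_out(v, basis):
    """Remove from v its components along each vector of an orthogonal basis."""
    out = list(v)
    for b in basis:
        scale = dot(out, b) / dot(b, b)
        out = [x - scale * y for x, y in zip(out, b)]
    return out

def orthogonal_basis(columns):
    """Gram-Schmidt: turn the given columns into mutually orthogonal vectors."""
    basis = []
    for col in columns:
        basis.append(project_out(col, basis))
    return basis

# Simulated data (assumption): genotype coded 0/1/2, continuous exposure.
random.seed(0)
n = 200
ones = [1.0] * n
g = [float(random.randint(0, 2)) for _ in range(n)]
e = [random.gauss(0.0, 1.0) for _ in range(n)]
ge = [gi * ei for gi, ei in zip(g, e)]  # raw interaction term

# Interaction term with intercept and main effects projected out.
ge_perp = project_out(ge, orthogonal_basis([ones, g, e]))
```

    Any signal fitted against `ge_perp` cannot be attributed to the main effects, which is the motivation the abstract gives for focusing on the interaction space.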

  14. Gene × Environment Interactions in Schizophrenia: Evidence from Genetic Mouse Models

    PubMed Central

    Marr, Julia; Bock, Gavin; Desbonnet, Lieve; Waddington, John

    2016-01-01

    The study of gene × environment, as well as epistatic interactions in schizophrenia, has provided important insight into the complex etiopathologic basis of schizophrenia. It has also increased our understanding of the role of susceptibility genes in the disorder and is an important consideration as we seek to translate genetic advances into novel antipsychotic treatment targets. This review summarises data arising from research involving the modelling of gene × environment interactions in schizophrenia using preclinical genetic models. Evidence for synergistic effects on the expression of schizophrenia-relevant endophenotypes will be discussed. It is proposed that valid and multifactorial preclinical models are important tools for identifying critical areas, as well as underlying mechanisms, of convergence of genetic and environmental risk factors, and their interaction in schizophrenia. PMID:27725886

  15. Assessment of Gene-by-Sex Interaction Effect on Bone Mineral Density

    PubMed Central

    Liu, Ching-Ti; Estrada, Karol; Yerges-Armstrong, Laura M.; Amin, Najaf; Evangelou, Evangelos; Li, Guo; Minster, Ryan L.; Carless, Melanie A.; Kammerer, Candace M.; Oei, Ling; Zhou, Yanhua; Alonso, Nerea; Dailiana, Zoe; Eriksson, Joel; García-Giralt, Natalia; Giroux, Sylvie; Husted, Lise Bjerre; Khusainova, Rita I.; Koromila, Theodora; Kung, Annie WaiChee; Lewis, Joshua R.; Masi, Laura; Mencej-Bedrac, Simona; Nogues, Xavier; Patel, Millan S.; Prezelj, Janez; Richards, J Brent; Sham, Pak Chung; Spector, Timothy; Vandenput, Liesbeth; Xiao, Su-Mei; Zheng, Hou-Feng; Zhu, Kun; Balcells, Susana; Brandi, Maria Luisa; Frost, Morten; Goltzman, David; González-Macías, Jesús; Karlsson, Magnus; Khusnutdinova, Elza K.; Kollia, Panagoula; Langdahl, Bente Lomholt; Ljunggren, Östen; Lorentzon, Mattias; Marc, Janja; Mellström, Dan; Ohlsson, Claes; Olmos, José M.; Ralston, Stuart H.; Riancho, José A.; Rousseau, François; Urreizti, Roser; Van Hul, Wim; Zarrabeitia, María T.; Castano-Betancourt, Martha; Demissie, Serkalem; Grundberg, Elin; Herrera, Lizbeth; Kwan, Tony; Medina-Gómez, Carolina; Pastinen, Tomi; Sigurdsson, Gunnar; Thorleifsson, Gudmar; vanMeurs, Joyce B.J.; Blangero, John; Hofman, Albert; Liu, Yongmei; Mitchell, Braxton D.; O’Connell, Jeffrey R.; Oostra, Ben A.; Rotter, Jerome I; Stefansson, Kari; Streeten, Elizabeth A.; Styrkarsdottir, Unnur; Thorsteinsdottir, Unnur; Tylavsky, Frances A.; Uitterlinden, Andre; Cauley, Jane A.; Harris, Tamara B.; Ioannidis, John P.A.; Psaty, Bruce M.; Robbins, John A; Zillikens, M. Carola; vanDuijn, Cornelia M.; Prince, Richard L.; Karasik, David; Rivadeneira, Fernando; Kiel, Douglas P.; Cupples, L. Adrienne; Hsu, Yi-Hsiang

    2012-01-01

    Background: Sexual dimorphism in various bone phenotypes, including bone mineral density (BMD), is widely observed; however the extent to which genes explain these sex differences is unclear. To identify variants with different effects by sex, we examined gene-by-sex autosomal interactions genome-wide, and performed eQTL analysis and bioinformatics network analysis. Methods: We conducted an autosomal genome-wide meta-analysis of gene-by-sex interaction on lumbar spine (LS-) and femoral neck (FN-) BMD, in 25,353 individuals from eight cohorts. In a second stage, we followed up the 12 top SNPs (P<1×10^−5) in an additional set of 24,763 individuals. Gene-by-sex interaction and sex-specific effects were examined in these 12 SNPs. Results: We detected one novel genome-wide significant interaction associated with LS-BMD at the Chr3p26.1-p25.1 locus, near the GRM7 gene (male effect = 0.02 & p-value = 3.0×10^−5; female effect = −0.007 & p-value = 3.3×10^−2) and eleven suggestive loci associated with either FN- or LS-BMD in discovery cohorts. However, there was no evidence for genome-wide significant (P<5×10^−8) gene-by-sex interaction in the joint analysis of discovery and replication cohorts. Conclusion: Despite the large collaborative effort, no genome-wide significant evidence for gene-by-sex interaction was found influencing BMD variation in this screen of autosomal markers. If they exist, gene-by-sex interactions for BMD probably have weak effects, accounting for less than 0.08% of the variation in these traits per implicated SNP. PMID:22692763

  16. Gene-environment interactions in mental disorders

    PubMed Central

    Tsuang, Ming T; Bar, Jessica L; Stone, William S; Faraone, Stephen V

    2004-01-01

    Research clearly shows that both nature and nurture play important roles in the genesis of psychopathology. In this paper, we focus on 'gene-environment interaction' in mental disorders, using genetic control of sensitivity to the environment as our definition of that term. We begin with an examination of methodological issues involving gene-environment interactions, with examples concerning psychiatric and neurological conditions. Then we review the interactions in psychiatric disorders using twin, adoption and association designs. Finally, we consider gene-environment interactions in selected neurodevelopmental disorders (autism and schizophrenia). PMID:16633461

  17. Identifying candidate driver genes by integrative ovarian cancer genomics data

    NASA Astrophysics Data System (ADS)

    Lu, Xinguo; Lu, Jibo

    2017-08-01

    Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, copy number variations (CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.

  18. Genes involved in host-parasite interactions can be revealed by their correlated expression.

    PubMed

    Reid, Adam James; Berriman, Matthew

    2013-02-01

    Molecular interactions between a parasite and its host are key to the ability of the parasite to enter the host and persist. Our understanding of the genes and proteins involved in these interactions is limited. To better understand these processes it would be advantageous to have a range of methods to predict pairs of genes involved in such interactions. Correlated gene expression profiles can be used to identify molecular interactions within a species. Here we have extended the concept to different species, showing that genes with correlated expression are more likely to encode proteins, which directly or indirectly participate in host-parasite interaction. We go on to examine our predictions of molecular interactions between the malaria parasite and both its mammalian host and insect vector. Our approach could be applied to study any interaction between species, for example, between a host and its parasites or pathogens, but also symbiotic and commensal pairings.

  19. Environmental confounding in gene-environment interaction studies.

    PubMed

    Vanderweele, Tyler J; Ko, Yi-An; Mukherjee, Bhramar

    2013-07-01

    We show that, in the presence of uncontrolled environmental confounding, joint tests for the presence of a main genetic effect and gene-environment interaction will be biased if the genetic and environmental factors are correlated, even if there is no effect of either the genetic factor or the environmental factor on the disease. When environmental confounding is ignored, such tests will in fact reject the joint null of no genetic effect with a probability that tends to 1 as the sample size increases. This problem with the joint test vanishes under gene-environment independence, but it still persists if estimating the gene-environment interaction parameter itself is of interest. Uncontrolled environmental confounding will bias estimates of gene-environment interaction parameters even under gene-environment independence, but it will not do so if the unmeasured confounding variable itself does not interact with the genetic factor. Under gene-environment independence, if the interaction parameter without controlling for the environmental confounder is nonzero, then there is gene-environment interaction either between the genetic factor and the environmental factor of interest or between the genetic factor and the unmeasured environmental confounder. We evaluate several recently proposed joint tests in a simulation study and discuss the implications of these results for the conduct of gene-environment interaction studies.

  20. Testing Gene-Gene Interactions in the Case-Parents Design

    PubMed Central

    Yu, Zhaoxia

    2011-01-01

    The case-parents design has been widely used to detect genetic associations as it can prevent spurious association that could occur in population-based designs. When examining the effect of an individual genetic locus on a disease, logistic regressions developed by conditioning on parental genotypes provide complete protection from spurious association caused by population stratification. However, when testing gene-gene interactions, it is unknown whether conditional logistic regressions are still robust. Here we evaluate the robustness and efficiency of several gene-gene interaction tests that are derived from conditional logistic regressions. We found that in the presence of SNP genotype correlation due to population stratification or linkage disequilibrium, tests with incorrectly specified main-genetic-effect models can lead to inflated type I error rates. We also found that a test with fully flexible main genetic effects always maintains correct test size and its robustness can be achieved with negligible sacrifice of its power. When testing gene-gene interactions is the focus, the test allowing fully flexible main effects is recommended to be used. PMID:21778736

  1. A combination test for detection of gene-environment interaction in cohort studies.

    PubMed

    Coombes, Brandon; Basu, Saonli; McGue, Matt

    2017-07-01

    Identifying gene-environment (G-E) interactions can contribute to a better understanding of disease etiology, which may help researchers develop disease prevention strategies and interventions. One big criticism of studying G-E interaction is the lack of power due to sample size. Studies often restrict the interaction search to the top few hundred hits from a genome-wide association study or focus on potential candidate genes. In this paper, we test interactions between a candidate gene and an environmental factor to improve power by analyzing multiple variants within a gene. We extend recently developed score statistic based genetic association testing approaches to the G-E interaction testing problem. We also propose tests for interaction using gene-based summary measures that pool variants together. Although it has recently been shown that these summary measures can be biased and may lead to inflated type I error, we show that under several realistic scenarios, we can still provide valid tests of interaction. These tests use significantly less degrees of freedom and thus can have much higher power to detect interaction. Additionally, we demonstrate that the iSeq-aSum-min test, which combines a gene-based summary measure test, iSeq-aSum-G, and an interaction-based summary measure test, iSeq-aSum-I, provides a powerful alternative to test G-E interaction. We demonstrate the performance of these approaches using simulation studies and illustrate their performance to study interaction between the SNPs in several candidate genes and family climate environment on alcohol consumption using the Minnesota Center for Twin and Family Research dataset. © 2017 WILEY PERIODICALS, INC.

  2. Gene-gene interactions and gene polymorphisms of VEGFA and EG-VEGF gene systems in recurrent pregnancy loss.

    PubMed

    Su, Mei-Tsz; Lin, Sheng-Hsiang; Chen, Yi-Chi; Kuo, Pao-Lin

    2014-06-01

    Both vascular endothelial growth factor A (VEGFA) and endocrine gland-derived vascular endothelial growth factor (EG-VEGF) systems play major roles in angiogenesis. A body of evidence suggests VEGFs regulate critical processes during pregnancy and have been associated with recurrent pregnancy loss (RPL). However, little information is available regarding the interaction of these two major angiogenesis-related systems in early human pregnancy. This study was conducted to investigate the association of gene polymorphisms and gene-gene interaction among genes in VEGFA and EG-VEGF systems and idiopathic RPL. A total of 98 women with a history of idiopathic RPL and 142 controls were included, and 5 functional SNPs selected from VEGFA, KDR, EG-VEGF (PROK1), PROKR1 and PROKR2 were genotyped. We used multifactor dimensionality reduction (MDR) analysis to choose a best model and evaluate gene-gene interactions. Ingenuity pathways analysis (IPA) was introduced to explore possible complex interactions. Two receptor gene polymorphisms [KDR (Q472H) and PROKR2 (V331M)] were significantly associated with idiopathic RPL (P<0.01). The MDR test revealed that the KDR (Q472H) polymorphism was the best locus to be associated with RPL (P=0.02). IPA revealed EG-VEGF and VEGFA systems shared several canonical signaling pathways that may contribute to gene-gene interactions, including the Akt, IL-8, EGFR, MAPK, SRC, VHL, HIF-1A and STAT3 signaling pathways. Two receptor gene polymorphisms [KDR (Q472H) and PROKR2 (V331M)] were significantly associated with idiopathic RPL. EG-VEGF and VEGFA systems shared several canonical signaling pathways that may contribute to gene-gene interactions, including the Akt, IL-8, EGFR, MAPK, SRC, VHL, HIF-1A and STAT3.

  3. Analysis of Multiple Association Studies Provides Evidence of an Expression QTL Hub in Gene-Gene Interaction Network Affecting HDL Cholesterol Levels

    PubMed Central

    Ma, Li; Ballantyne, Christie; Brautbar, Ariel; Keinan, Alon

    2014-01-01

    Epistasis has been suggested to underlie part of the missing heritability in genome-wide association studies. In this study, we first report an analysis of gene-gene interactions affecting HDL cholesterol (HDL-C) levels in a candidate gene study of 2,091 individuals with mixed dyslipidemia from a clinical trial. Two additional studies, the Atherosclerosis Risk in Communities study (ARIC; n = 9,713) and the Multi-Ethnic Study of Atherosclerosis (MESA; n = 2,685), were considered for replication. We identified a gene-gene interaction between rs1532085 and rs12980554 (P = 7.1×10^−7) in their effect on HDL-C levels, which is significant after Bonferroni correction (Pc = 0.017) for the number of SNP pairs tested. The interaction successfully replicated in the ARIC study (P = 7.0×10^−4; Pc = 0.02). Rs1532085, an expression QTL (eQTL) of LIPC, is one of the two SNPs involved in another, well-replicated gene-gene interaction underlying HDL-C levels. To further investigate the role of this eQTL SNP in gene-gene interactions affecting HDL-C, we tested in the ARIC study for interaction between this SNP and any other SNP genome-wide. We found the eQTL to be involved in a few suggestive interactions, one of which significantly replicated in MESA. Importantly, these gene-gene interactions, involving only rs1532085, explain an additional 1.4% variation of HDL-C, on top of the 0.65% explained by rs1532085 alone. LIPC plays a key role in the lipid metabolism pathway and it, and rs1532085 in particular, has been associated with HDL-C and other lipid levels. Collectively, we discovered several novel gene-gene interactions, all involving an eQTL of LIPC, thus suggesting a hub role of LIPC in the gene-gene interaction network that regulates HDL-C levels, which in turn raises the hypothesis that LIPC's contribution is largely via interactions with other lipid metabolism related genes. PMID:24651390
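
    The Bonferroni correction mentioned above multiplies a raw p-value by the number of tests performed. The sketch below shows the arithmetic only. The abstract does not state how many SNP pairs were tested, so `N_PAIRS` is a hypothetical figure chosen so that the result lands near the reported corrected value of 0.017; it should not be read as the study's actual test count.

```python
def bonferroni(p_value, n_tests):
    """Bonferroni-adjusted p-value: raw p times number of tests, capped at 1."""
    return min(1.0, p_value * n_tests)

# Hypothetical number of SNP pairs tested (not given in the abstract).
N_PAIRS = 24_000

# Raw p-value reported for the rs1532085 x rs12980554 interaction.
p_raw = 7.1e-7
p_corrected = bonferroni(p_raw, N_PAIRS)
print(f"corrected p = {p_corrected:.3f}")
```

    Under this assumed test count the corrected value stays below 0.05, which is what "significant after Bonferroni correction" means in practice.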

  4. Utilizing Gene Tree Variation to Identify Candidate Effector Genes in Zymoseptoria tritici

    PubMed Central

    McDonald, Megan C.; McGinness, Lachlan; Hane, James K.; Williams, Angela H.; Milgate, Andrew; Solomon, Peter S.

    2016-01-01

    Zymoseptoria tritici is a host-specific, necrotrophic pathogen of wheat. Infection by Z. tritici is characterized by its extended latent period, which typically lasts 2 wks, and is followed by extensive host cell death, and rapid proliferation of fungal biomass. This work characterizes the level of genomic variation in 13 isolates, for which we have measured virulence on 11 wheat cultivars with differential resistance genes. Between the reference isolate, IPO323, and the 13 Australian isolates we identified over 800,000 single nucleotide polymorphisms, of which ∼10% had an effect on the coding regions of the genome. Furthermore, we identified over 1700 probable presence/absence polymorphisms in genes across the Australian isolates using de novo assembly. Finally, we developed a gene tree sorting method that quickly identifies groups of isolates within a single gene alignment whose sequence haplotypes correspond with virulence scores on a single wheat cultivar. Using this method, we have identified < 100 candidate effector genes whose gene sequence correlates with virulence toward a wheat cultivar carrying a major resistance gene. PMID:26837952

  5. Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods.

    PubMed

    Tuo, Youlin; An, Ning; Zhang, Ming

    2018-03-01

    The aim of the present study was to investigate the feature genes in metastatic breast cancer samples. A total of 5 expression profiles of metastatic breast cancer samples were downloaded from the Gene Expression Omnibus database, which were then analyzed using the MetaQC and MetaDE packages in R language. The feature genes between metastasis and non‑metastasis samples were screened under the threshold of P<0.05. Based on the protein‑protein interactions (PPIs) in the Biological General Repository for Interaction Datasets, Human Protein Reference Database and Biomolecular Interaction Network Database, the PPI network of the feature genes was constructed. The feature genes identified by topological characteristics were then used for support vector machine (SVM) classifier training and verification. The accuracy of the SVM classifier was then evaluated using another independent dataset from The Cancer Genome Atlas database. Finally, function and pathway enrichment analyses for genes in the SVM classifier were performed. A total of 541 feature genes were identified between metastatic and non‑metastatic samples. The top 10 genes with the highest betweenness centrality values in the PPI network of feature genes were Nuclear RNA Export Factor 1, cyclin‑dependent kinase 2 (CDK2), myelocytomatosis proto‑oncogene protein (MYC), Cullin 5, SHC Adaptor Protein 1, Clathrin heavy chain, Nucleolin, WD repeat domain 1, proteasome 26S subunit non‑ATPase 2 and telomeric repeat binding factor 2. The cyclin‑dependent kinase inhibitor 1A (CDKN1A), E2F transcription factor 1 (E2F1), and MYC interacted with CDK2. The SVM classifier constructed by the top 30 feature genes was able to distinguish metastatic samples from non‑metastatic samples [correct rate, specificity, positive predictive value and negative predictive value >0.89; sensitivity >0.84; area under the receiver operating characteristic curve (AUROC) >0.96]. The verification of the SVM classifier in an

  6. Assessment of gene-by-sex interaction effect on bone mineral density.

    PubMed

    Liu, Ching-Ti; Estrada, Karol; Yerges-Armstrong, Laura M; Amin, Najaf; Evangelou, Evangelos; Li, Guo; Minster, Ryan L; Carless, Melanie A; Kammerer, Candace M; Oei, Ling; Zhou, Yanhua; Alonso, Nerea; Dailiana, Zoe; Eriksson, Joel; García-Giralt, Natalia; Giroux, Sylvie; Husted, Lise Bjerre; Khusainova, Rita I; Koromila, Theodora; Kung, Annie Waichee; Lewis, Joshua R; Masi, Laura; Mencej-Bedrac, Simona; Nogues, Xavier; Patel, Millan S; Prezelj, Janez; Richards, J Brent; Sham, Pak Chung; Spector, Timothy; Vandenput, Liesbeth; Xiao, Su-Mei; Zheng, Hou-Feng; Zhu, Kun; Balcells, Susana; Brandi, Maria Luisa; Frost, Morten; Goltzman, David; González-Macías, Jesús; Karlsson, Magnus; Khusnutdinova, Elza K; Kollia, Panagoula; Langdahl, Bente Lomholt; Ljunggren, Osten; Lorentzon, Mattias; Marc, Janja; Mellström, Dan; Ohlsson, Claes; Olmos, José M; Ralston, Stuart H; Riancho, José A; Rousseau, François; Urreizti, Roser; Van Hul, Wim; Zarrabeitia, María T; Castano-Betancourt, Martha; Demissie, Serkalem; Grundberg, Elin; Herrera, Lizbeth; Kwan, Tony; Medina-Gómez, Carolina; Pastinen, Tomi; Sigurdsson, Gunnar; Thorleifsson, Gudmar; Vanmeurs, Joyce Bj; Blangero, John; Hofman, Albert; Liu, Yongmei; Mitchell, Braxton D; O'Connell, Jeffrey R; Oostra, Ben A; Rotter, Jerome I; Stefansson, Kari; Streeten, Elizabeth A; Styrkarsdottir, Unnur; Thorsteinsdottir, Unnur; Tylavsky, Frances A; Uitterlinden, Andre; Cauley, Jane A; Harris, Tamara B; Ioannidis, John Pa; Psaty, Bruce M; Robbins, John A; Zillikens, M Carola; Vanduijn, Cornelia M; Prince, Richard L; Karasik, David; Rivadeneira, Fernando; Kiel, Douglas P; Cupples, L Adrienne; Hsu, Yi-Hsiang

    2012-10-01

    Sexual dimorphism in various bone phenotypes, including bone mineral density (BMD), is widely observed; however, the extent to which genes explain these sex differences is unclear. To identify variants with different effects by sex, we examined gene-by-sex autosomal interactions genome-wide, and performed expression quantitative trait loci (eQTL) analysis and bioinformatics network analysis. We conducted an autosomal genome-wide meta-analysis of gene-by-sex interaction on lumbar spine (LS) and femoral neck (FN) BMD in 25,353 individuals from 8 cohorts. In a second stage, we followed up the 12 top single-nucleotide polymorphisms (SNPs; p < 1 × 10^-5) in an additional set of 24,763 individuals. Gene-by-sex interaction and sex-specific effects were examined in these 12 SNPs. We detected one novel genome-wide significant interaction associated with LS-BMD at the Chr3p26.1-p25.1 locus, near the GRM7 gene (male effect = 0.02 and p = 3.0 × 10^-5; female effect = -0.007 and p = 3.3 × 10^-2), and 11 suggestive loci associated with either FN- or LS-BMD in discovery cohorts. However, there was no evidence for genome-wide significant (p < 5 × 10^-8) gene-by-sex interaction in the joint analysis of discovery and replication cohorts. Despite the large collaborative effort, no genome-wide significant evidence for gene-by-sex interaction was found to influence BMD variation in this screen of autosomal markers. If they exist, gene-by-sex interactions for BMD probably have weak effects, accounting for less than 0.08% of the variation in these traits per implicated SNP. © 2012 American Society for Bone and Mineral Research.

  7. Co-expression network analysis identified six hub genes in association with metastasis risk and prognosis in hepatocellular carcinoma

    PubMed Central

    Feng, Juerong; Zhou, Rui; Chang, Ying; Liu, Jing; Zhao, Qiu

    2017-01-01

    Hepatocellular carcinoma (HCC) has a high incidence and mortality worldwide, and its carcinogenesis and progression are influenced by a complex network of gene interactions. A weighted gene co-expression network was constructed to identify gene modules associated with the clinical traits in HCC (n = 214). Among the 13 modules, high correlation was only found between the red module and metastasis risk (classified by the HCC metastasis gene signature) (R2 = −0.74). Moreover, in the red module, 34 network hub genes for metastasis risk were identified, six of which (ABAT, AGXT, ALDH6A1, CYP4A11, DAO and EHHADH) were also hub nodes in the protein-protein interaction network of the module genes. Thus, a total of six hub genes were identified. In validation, all hub genes showed a negative correlation with the four-stage HCC progression (P for trend < 0.05) in the test set. Furthermore, in the training set, HCC samples with any hub gene lowly expressed demonstrated a higher recurrence rate and poorer survival rate (hazard ratios with 95% confidence intervals > 1). RNA-sequencing data of 142 HCC samples showed consistent results in the prognosis. Gene set enrichment analysis (GSEA) demonstrated that in the samples with any hub gene highly expressed, a total of 24 functional gene sets were enriched, most of which focused on amino acid metabolism and oxidation. In conclusion, co-expression network analysis identified six hub genes in association with HCC metastasis risk and prognosis, which might improve the prognosis by influencing amino acid metabolism and oxidation. PMID:28430663

  8. From Genes to Networks: Characterizing Gene-Regulatory Interactions in Plants.

    PubMed

    Kaufmann, Kerstin; Chen, Dijun

    2017-01-01

    Plants, like other eukaryotes, have evolved complex mechanisms to coordinate gene expression during development, environmental response, and cellular homeostasis. Transcription factors (TFs), accompanied by basic cofactors and posttranscriptional regulators, are key players in gene-regulatory networks (GRNs). The coordinated control of gene activity is achieved by the interplay of these factors and by physical interactions between TFs and DNA. Here, we will briefly outline recent technological progress made to elucidate GRNs in plants. We will focus on techniques that allow us to characterize physical interactions in GRNs in plants and to analyze their regulatory consequences. Targeted manipulation allows us to test the relevance of specific gene-regulatory interactions. The combination of genome-wide experimental approaches with mathematical modeling allows us to get deeper insights into key-regulatory interactions and combinatorial control of important processes in plants.

  9. Genotype by watering regime interaction in cultivated tomato: lessons from linkage mapping and gene expression.

    PubMed

    Albert, Elise; Gricourt, Justine; Bertin, Nadia; Bonnefoi, Julien; Pateyron, Stéphanie; Tamby, Jean-Philippe; Bitton, Frédérique; Causse, Mathilde

    2016-02-01

    In tomato, genotype by watering interaction resulted from genotype re-ranking more than scale changes. Interactive QTLs according to watering regime were detected. Differentially expressed genes were identified in some intervals. As a result of climate change, drought will increasingly limit crop production in the future. Studying genotype by watering regime interactions is necessary to improve plant adaptation to low water availability. In cultivated tomato (Solanum lycopersicum L.), extensively grown in dry areas, well-mastered water deficits can stimulate metabolite production, increasing plant defenses and concentration of compounds involved in fruit quality, at the same time. However, few tomato Quantitative Trait Loci (QTLs) and genes involved in response to drought are identified or only in wild species. In this study, we phenotyped a population of 119 recombinant inbred lines derived from a cross between a cherry tomato and a large fruit tomato, grown in greenhouse under two watering regimes, in two locations. A large genetic variability was measured for 19 plant and fruit traits, under the two watering treatments. Highly significant genotype by watering regime interactions were detected and resulted from re-ranking more than scale changes. The population was genotyped for 679 SNP markers to develop a genetic map. In total, 56 QTLs were identified among which 11 were interactive between watering regimes. These latter mainly exhibited antagonist effects according to watering treatment. Variation in gene expression in leaves of parental accessions revealed 2259 differentially expressed genes, among which candidate genes presenting sequence polymorphisms were identified under two main interactive QTLs. Our results provide knowledge about the genetic control of genotype by watering regime interactions in cultivated tomato and the possible use of deficit irrigation to improve tomato quality.

  10. Choline Metabolites: Gene by Diet Interactions

    PubMed Central

    Smallwood, Tangi; Allayee, Hooman; Bennett, Brian J.

    2015-01-01

    Purpose of review: This review highlights recent advances in our understanding of the interactions between genetic polymorphisms in genes that metabolize choline and the dietary requirements of choline and how these interactions relate to human health and disease. Recent findings: The importance of choline as an essential nutrient has been well established but our appreciation of the interaction between our underlying genetic architecture and dietary choline requirements is only beginning. It has been shown in both human and animal studies that choline deficiencies contribute to diseases such as non-alcoholic fatty liver disease and various neurodegenerative diseases. An adequate supply of dietary choline is important for optimum development, highlighted by the increased maternal requirements during fetal development and in breast-fed infants. We discuss recent studies investigating variants in PEMT and MTHFR1 that are associated with a variety of birth defects. In addition to genetic interactions, we discuss several recent studies that uncover changes in fetal global methylation patterns in response to maternal dietary choline intake that result in changes in gene expression in the offspring. In contrast to the developmental role of adequate choline, there is now an appreciation of the role choline has in cardiovascular disease through the gut microbiota-mediated metabolite trimethylamine N-oxide. This pathway highlights some of our understanding of how the microbiome affects nutrient processing and bioavailability. Finally, in order to better characterize the genetic architecture regulating choline requirements, we discuss recent results focused on identifying polymorphisms that regulate choline and its derivative products. Summary: Here we discuss recent studies that have advanced our understanding of how specific alleles in key choline metabolism genes are related to dietary choline requirements and human disease. PMID:26655287

  11. GSNO Reductase and β2 Adrenergic Receptor Gene-gene Interaction: Bronchodilator Responsiveness to Albuterol

    PubMed Central

    Choudhry, Shweta; Que, Loretta G.; Yang, Zhonghui; Liu, Limin; Eng, Celeste; Kim, Sung O.; Kumar, Gunjan; Thyne, Shannon; Chapela, Rocio; Rodriguez-Santana, Jose R.; Rodriguez-Cintron, William; Avila, Pedro C.; Stamler, Jonathan S.; Burchard, Esteban G.

    2010-01-01

    Background Short-acting inhaled β2-agonists such as albuterol are used for bronchodilation and are the mainstay of asthma treatment worldwide. There is significant variation in bronchodilator responsiveness to albuterol not only between individuals but also across racial/ethnic groups. The β2-adrenergic receptor (β2AR) is the target for β2-agonist drugs. The enzyme S-nitrosoglutathione reductase (GSNOR), which regulates levels of the endogenous bronchodilator S-nitrosoglutathione, has been shown to modulate the response to β2-agonists. Objective We hypothesized that there are pharmacogenetic interactions between GSNOR and β2AR gene variants which are associated with variable response to albuterol. Methods We performed family-based analyses to test for association between GSNOR gene variants and asthma and related phenotypes in 609 Puerto Rican and Mexican families with asthma. In addition, we tested these subjects for pharmacogenetic interaction between GSNOR and β2AR gene variants and responsiveness to albuterol using linear regression. Cell transfection experiments were performed to test the potential effect of the GSNOR gene variants. Results Among Puerto Ricans, several GSNOR SNPs and a haplotype in the 3′UTR were significantly associated with increased risk for asthma and lower bronchodilator responsiveness (p = 0.04 to 0.007). The GSNOR risk haplotype affects expression of GSNOR mRNA and protein, suggesting a gain of function. Furthermore, gene-gene interaction analysis provided evidence of pharmacogenetic interaction between GSNOR and β2AR gene variants and the response to albuterol in Puerto Rican (p = 0.03), Mexican (p = 0.15) and combined Puerto Rican and Mexican asthmatics (p = 0.003). Specifically, GSNOR+17059*β2AR+46 genotype combinations (TG+GG*AG and TG+GG*GG) were associated with lower bronchodilator response. Conclusion Genotyping of GSNOR and β2AR genes may be useful in identifying Latino subjects, who might benefit from adjuvant

  12. DNMT1-interacting RNAs block gene specific DNA methylation

    PubMed Central

    Di Ruscio, Annalisa; Ebralidze, Alexander K.; Benoukraf, Touati; Amabile, Giovanni; Goff, Loyal A.; Terragni, Joylon; Figueroa, Maria Eugenia; De Figureido Pontes, Lorena Lobo; Alberich-Jorda, Meritxell; Zhang, Pu; Wu, Mengchu; D’Alò, Francesco; Melnick, Ari; Leone, Giuseppe; Ebralidze, Konstantin K.; Pradhan, Sriharsa; Rinn, John L.; Tenen, Daniel G.

    2013-01-01

    Summary DNA methylation was described almost a century ago. However, the rules governing its establishment and maintenance remain elusive. Here, we present data demonstrating that active transcription regulates levels of genomic methylation. We identified a novel RNA arising from the CEBPA gene locus critical in regulating the local DNA methylation profile. This RNA binds to DNMT1 and prevents CEBPA gene locus methylation. Deep sequencing of transcripts associated with DNMT1 combined with genome-scale methylation and expression profiling extended the generality of this finding to numerous gene loci. Collectively, these results delineate the nature of DNMT1-RNA interactions and suggest strategies for gene selective demethylation of therapeutic targets in disease. PMID:24107992

  13. The Impact of Gene-Environment Dependence and Misclassification in Genetic Association Studies Incorporating Gene-Environment Interactions

    PubMed Central

    Lindström, Sara; Yen, Yu-Chun; Spiegelman, Donna; Kraft, Peter

    2009-01-01

    The possibility of gene-environment interaction can be exploited to identify genetic variants associated with disease using a joint test of genetic main effect and gene-environment interaction. We consider how exposure misclassification and dependence between the true exposure E and the tested genetic variant G affect this joint test in absolute terms and relative to three other tests: the marginal test (G), the standard test for multiplicative gene-environment interaction (GE), and the case-only test for interaction (GE-CO). All tests can have inflated Type I error rate when E and G are correlated in the underlying population. For the GE and G-GE tests this inflation is only noticeable when the gene-environment dependence is unusually strong; the inflation can be large for the GE-CO test even for modest correlation. The joint G-GE test has greater power than the GE test generally, and greater power than the G test when there is no genetic main effect and the measurement error is small to moderate. The joint G-GE test is an attractive test for assessing genetic association when there is limited knowledge about causal mechanisms a priori, even in the presence of misclassification in environmental exposure measurement and correlation between exposure and genetic variants. PMID:19521099

  14. A Kinome RNAi Screen in Drosophila Identifies Novel Genes Interacting with Lgl, aPKC, and Crb Cell Polarity Genes in Epithelial Tissues.

    PubMed

    Parsons, Linda M; Grzeschik, Nicola A; Amaratunga, Kasun; Burke, Peter; Quinn, Leonie M; Richardson, Helena E

    2017-08-07

    In both Drosophila melanogaster and mammalian systems, epithelial structure and underlying cell polarity are essential for proper tissue morphogenesis and organ growth. Cell polarity interfaces with multiple cellular processes that are regulated by the phosphorylation status of large protein networks. To gain insight into the molecular mechanisms that coordinate cell polarity with tissue growth, we screened a boutique collection of RNAi stocks targeting the kinome for their capacity to modify Drosophila "cell polarity" eye and wing phenotypes. Initially, we identified kinase or phosphatase genes whose depletion modified adult eye phenotypes associated with the manipulation of cell polarity complexes (via overexpression of Crb or aPKC). We next conducted a secondary screen to test whether these cell polarity modifiers altered tissue overgrowth associated with depletion of Lgl in the wing. These screens identified Hippo, Jun kinase (JNK), and Notch signaling pathways, previously linked to cell polarity regulation of tissue growth. Furthermore, novel pathways not previously connected to cell polarity regulation of tissue growth were identified, including Wingless (Wg/Wnt), Ras, and lipid/Phospho-inositol-3-kinase (PI3K) signaling pathways. Additionally, we demonstrated that the "nutrient sensing" kinases Salt Inducible Kinase 2 and 3 (SIK2 and 3) are potent modifiers of cell polarity phenotypes and regulators of tissue growth. Overall, our screen has revealed novel cell polarity-interacting kinases and phosphatases that affect tissue growth, providing a platform for investigating molecular mechanisms coordinating cell polarity and tissue growth during development. Copyright © 2017 Parsons et al.

  15. Benchmarking pathway interaction network for colorectal cancer to identify dysregulated pathways.

    PubMed

    Wang, Q; Shi, C-J; Lv, S-H

    2017-03-30

    Different pathways act synergistically to participate in many biological processes. Thus, the purpose of our study was to extract dysregulated pathways to investigate the pathogenesis of colorectal cancer (CRC) based on the functional dependency among pathways. Protein-protein interaction (PPI) information and pathway data were retrieved from STRING and Reactome databases, respectively. After genes were aligned to the pathways, each pathway activity was calculated using the principal component analysis (PCA) method, and the seed pathway was discovered. Subsequently, we constructed the pathway interaction network (PIN), where each node represented a biological pathway based on gene expression profile, PPI data, as well as pathways. Dysregulated pathways were then selected from the PIN according to classification performance and seed pathway. A PIN including 11,960 interactions was constructed to identify dysregulated pathways. Interestingly, the interaction of mRNA splicing and mRNA splicing-major pathway had the highest score of 719.8167. Maximum change of the activity score between CRC and normal samples appeared in the pathway of DNA replication, which was selected as the seed pathway. Starting with this seed pathway, a pathway set containing 30 dysregulated pathways was obtained with an area under the curve score of 0.8598. The pathway of mRNA splicing, mRNA splicing-major pathway, and RNA polymerase I contained the largest number of genes (107). Moreover, we found that these 30 pathways showed crosstalk with each other. The results suggest that these dysregulated pathways might be used as biomarkers to diagnose CRC.

  16. Gene-Based Genome-Wide Association Analysis in European and Asian Populations Identified Novel Genes for Rheumatoid Arthritis.

    PubMed

    Zhu, Hong; Xia, Wei; Mo, Xing-Bo; Lin, Xiang; Qiu, Ying-Hua; Yi, Neng-Jun; Zhang, Yong-Hong; Deng, Fei-Yan; Lei, Shu-Feng

    2016-01-01

    Rheumatoid arthritis (RA) is a complex autoimmune disease. Using a gene-based association research strategy, the present study aims to detect unknown susceptibility to RA and to address the ethnic differences in genetic susceptibility to RA between European and Asian populations. Gene-based association analyses were performed with KGG 2.5 by using publicly available large RA datasets (14,361 RA cases and 43,923 controls of European subjects, 4,873 RA cases and 17,642 controls of Asian subjects). For the newly identified RA-associated genes, gene set enrichment analyses and protein-protein interactions analyses were carried out with DAVID and STRING version 10.0, respectively. Differential expression verification was conducted using 4 GEO datasets. The expression levels of three selected 'highly verified' genes were measured by ELISA among our in-house RA cases and controls. A total of 221 RA-associated genes were newly identified by gene-based association study, including 71 'overlapped', 76 'European-specific' and 74 'Asian-specific' genes. Among them, 105 genes had significant differential expression between RA patients and healthy controls in at least one dataset, especially for 20 genes including 11 'overlapped' (ABCF1, FLOT1, HLA-F, IER3, TUBB, ZKSCAN4, BTN3A3, HSP90AB1, CUTA, BRD2, HLA-DMA), 5 'European-specific' (PHTF1, RPS18, BAK1, TNFRSF14, SUOX) and 4 'Asian-specific' (RNASET2, HFE, BTN2A2, MAPK13) genes whose differential expression was significant in at least three datasets. The protein expressions of two selected genes FLOT1 (P value = 1.70E-02) and HLA-DMA (P value = 4.70E-02) in plasma were significantly different in our in-house samples. Our study identified 221 novel RA-associated genes and especially highlighted the importance of 20 candidate genes on RA. The results addressed ethnic genetic background differences for RA susceptibility between European and Asian populations and detected a long list of overlapped or ethnic specific RA genes.

  17. Single gene and gene interaction effects on fertilization and embryonic survival rates in cattle.

    PubMed

    Khatib, H; Huang, W; Wang, X; Tran, A H; Bindrim, A B; Schutzkus, V; Monson, R L; Yandell, B S

    2009-05-01

    Decrease in fertility and conception rates is a major cause of economic loss and cow culling in dairy herds. Conception rate is the product of fertilization rate and embryonic survival rate. Identification of genetic factors that cause the death of embryos is the first step in eliminating this problem from the population and thereby increasing reproductive efficiency. A candidate pathway approach was used to identify candidate genes affecting fertilization and embryo survival rates using an in vitro fertilization experimental system. A total of 7,413 in vitro fertilizations were performed using oocytes from 504 ovaries and semen samples from 10 different bulls. Fertilization rate was calculated as the number of cleaved embryos 48 h postfertilization out of the total number of oocytes exposed to sperm. Survival rate of embryos was calculated as the number of blastocysts on d 7 of development out of the number of total embryos cultured. All ovaries were genotyped for 8 genes in the POU1F1 signaling pathway. Single-gene analysis revealed significant associations of GHR, PRLR, STAT5A, and UTMP with survival rate and of POU1F1, GHR, STAT5A, and OPN with fertilization rate. To further characterize the contribution of the entire integrated POU1F1 pathway to fertilization and early embryonic survival, a model selection procedure was applied. Comparisons among the different models showed that interactions between adjacent genes in the pathway revealed a significant contribution to the variation in fertility traits compared with other models that analyzed only bull information or only genes without interactions. Moreover, some genes that were not significant in the single-gene analysis showed significant effects in the interaction analysis. Thus, we propose that single genes as well as an entire pathway can be used in selection programs to improve reproduction performance in dairy cattle.
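    The rate definitions in this abstract reduce to simple ratios. The sketch below restates them in Python; the function names and the example counts are hypothetical and are not taken from the study.

```python
def fertilization_rate(cleaved_embryos, oocytes_exposed):
    """Cleaved embryos at 48 h divided by oocytes exposed to sperm."""
    if oocytes_exposed <= 0:
        raise ValueError("oocytes_exposed must be positive")
    return cleaved_embryos / oocytes_exposed


def survival_rate(blastocysts_day7, embryos_cultured):
    """Blastocysts on day 7 divided by the total embryos cultured."""
    if embryos_cultured <= 0:
        raise ValueError("embryos_cultured must be positive")
    return blastocysts_day7 / embryos_cultured


def conception_rate(fert_rate, surv_rate):
    """Conception rate as the product of the two component rates."""
    return fert_rate * surv_rate


# Hypothetical counts for one bull: 100 oocytes, 70 cleaved, 28 blastocysts.
fert = fertilization_rate(70, 100)
surv = survival_rate(28, 70)
conc = conception_rate(fert, surv)
print(f"fertilization={fert:.2f} survival={surv:.2f} conception={conc:.2f}")
```

    When every cleaved embryo is cultured, the product equals blastocysts divided by oocytes, which is why a gene can affect conception through either component.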

  18. Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors

    PubMed Central

    Bii, Victor M.; Trobridge, Grant D.

    2016-01-01

    Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types. PMID:27792127

  19. Gene-Diet Interactions in Childhood Obesity

    PubMed Central

    Garver, William S

    2011-01-01

    Childhood overweight and obesity have reached epidemic proportions worldwide, and the increase in weight-associated co-morbidities including premature type 2 diabetes mellitus (T2DM) and atherosclerotic cardiovascular disease will soon become major healthcare and economic problems. A number of studies now indicate that the childhood obesity epidemic which has emerged during the past 30 years is a complex multi-factorial disease resulting from interaction of susceptibility genes with an obesogenic environment. This review will focus on gene-diet interactions suspected of having a prominent role in promoting childhood obesity. In particular, the specific genes that will be presented (FTO, MC4R, and NPC1) have recently been associated with childhood obesity through a genome-wide association study (GWAS) and were shown to interact with nutritional components to increase weight gain. Although a fourth gene (APOA2) has not yet been associated with childhood obesity, this review will also present information on what now represents the best characterized gene-diet interaction in promoting weight gain. PMID:22043166

  20. Dissecting the Gene Network of Dietary Restriction to Identify Evolutionarily Conserved Pathways and New Functional Genes

    PubMed Central

    Wuttke, Daniel; Connor, Richard; Vora, Chintan; Craig, Thomas; Li, Yang; Wood, Shona; Vasieva, Olga; Shmookler Reis, Robert; Tang, Fusheng; de Magalhães, João Pedro

    2012-01-01

    Dietary restriction (DR), limiting nutrient intake from diet without causing malnutrition, delays the aging process and extends lifespan in multiple organisms. The conserved life-extending effect of DR suggests the involvement of fundamental mechanisms, although these remain a subject of debate. To help decipher the life-extending mechanisms of DR, we first compiled a list of genes that if genetically altered disrupt or prevent the life-extending effects of DR. We called these DR–essential genes and identified more than 100 in model organisms such as yeast, worms, flies, and mice. In order for other researchers to benefit from this first curated list of genes essential for DR, we established an online database called GenDR (http://genomics.senescence.info/diet/). To dissect the interactions of DR–essential genes and discover the underlying lifespan-extending mechanisms, we then used a variety of network and systems biology approaches to analyze the gene network of DR. We show that DR–essential genes are more conserved at the molecular level and have more molecular interactions than expected by chance. Furthermore, we employed a guilt-by-association method to predict novel DR–essential genes. In budding yeast, we predicted nine genes related to vacuolar functions; we show experimentally that mutations deleting eight of those genes prevent the life-extending effects of DR. Three of these mutants (OPT2, FRE6, and RCR2) had extended lifespan under ad libitum, indicating that the lack of further longevity under DR is not caused by a general compromise of fitness. These results demonstrate how network analyses of DR using GenDR can be used to make phenotypically relevant predictions. Moreover, gene-regulatory circuits reveal that the DR–induced transcriptional signature in yeast involves nutrient-sensing, stress responses and meiotic transcription factors. Finally, comparing the influence of gene expression changes during DR on the interactomes of multiple

  1. Using protein-protein interactions for refining gene networks estimated from microarray data by Bayesian networks.

    PubMed

    Nariai, N; Kim, S; Imoto, S; Miyano, S

    2004-01-01

    We propose a statistical method to estimate gene networks from DNA microarray data and protein-protein interactions. Because physical interactions between proteins or multiprotein complexes are likely to regulate biological processes, using only mRNA expression data is not sufficient for estimating a gene network accurately. Our method adds knowledge about protein-protein interactions to the estimation method of gene networks under a Bayesian statistical framework. In the estimated gene network, a protein complex is modeled as a virtual node based on principal component analysis. We show the effectiveness of the proposed method through the analysis of Saccharomyces cerevisiae cell cycle data. The proposed method improves the accuracy of the estimated gene networks, and successfully identifies some biological facts.

  2. The Rice Tungro Bacilliform Virus Gene II Product Interacts with the Coat Protein Domain of the Viral Gene III Polyprotein

    PubMed Central

    Herzog, Etienne; Guerra-Peraza, Orlene; Hohn, Thomas

    2000-01-01

    Rice tungro bacilliform virus (RTBV) is a plant pararetrovirus whose DNA genome contains four genes encoding three proteins and a large polyprotein. The function of most of the viral proteins is still unknown. To investigate the role of the gene II product (P2), we searched for interactions between this protein and other RTBV proteins. P2 was shown to interact with the coat protein (CP) domain of the viral gene III polyprotein (P3) both in the yeast two-hybrid system and in vitro. Domains involved in the P2-CP association have been identified and mapped on both proteins. To determine the importance of this interaction for viral multiplication, the infectivity of RTBV gene II mutants was investigated by agroinoculation of rice plants. The results showed that virus viability correlates with the ability of P2 to interact with the CP domain of P3. This study suggests that P2 could participate in RTBV capsid assembly. PMID:10666237

  3. The role of gene-gene interaction in the prediction of criminal behavior.

    PubMed

    Boutwell, Brian B; Menard, Scott; Barnes, J C; Beaver, Kevin M; Armstrong, Todd A; Boisvert, Danielle

    2014-04-01

    A host of research has examined the possibility that environmental risk factors might condition the influence of genes on various outcomes. Less research, however, has been aimed at exploring the possibility that genetic factors might interact to impact the emergence of human traits. Even fewer studies exist examining the interaction of genes in the prediction of behavioral outcomes. The current study expands this body of research by testing the interaction between genes involved in neural transmission. Our findings suggest that certain dopamine genes interact to increase the odds of criminogenic outcomes in a national sample of Americans. Copyright © 2014 Elsevier Inc. All rights reserved.

  4. Gene-nutrient interaction markedly influences yeast chronological lifespan.

    PubMed

    Smith, Daniel L; Maharrey, Crystal H; Carey, Christopher R; White, Richard A; Hartman, John L

    2016-12-15

    Research into the genetic mechanisms of aging has expanded rapidly over the past two decades. This has in part been the result of the use of model organisms (particularly yeast, worms and flies) and high-throughput technologies, combined with a growing interest in aging research. Despite this progress, widespread consensus regarding the pathways that are fundamental to the modulation of cellular aging and lifespan for all organisms has been limited due to discrepancies between different studies. We have compared results from published genome-wide, chronological lifespan (CLS) screens of individual gene deletion strains in Saccharomyces cerevisiae in order to identify gene deletion strains with consistent influences on longevity as possible indicators of fundamental aging processes from this single-celled, eukaryotic model organism. Three previous reports have described genetic modifiers of chronological aging in the budding yeast (S. cerevisiae) using the yeast gene deletion strain collection. We performed a comparison among the data sets using correlation and decile distribution analysis to describe concordance between screens and identify strains that consistently increased or decreased CLS. We used gene enrichment analysis in an effort to understand the biology underlying genes identified in multiple studies. We attempted to replicate the different experimental conditions employed by the screens to identify potential sources of variability in CLS worth further investigating. Among 3209 strains present in all three screens, nine deletions strains were in common in the longest-lived decile (2.80%) and thirteen were in common in the shortest-lived decile (4.05%) of all three screens. Similarly, pairwise overlap between screens was low. When the same comparison was extended to three deciles to include more mutants studied in common between the three screens, enrichment of cellular processes based on gene ontology analysis in the long-lived strains remained very

  5. Gene-Nutrient Interaction Markedly Influences Yeast Chronological Lifespan

    PubMed Central

    Smith, Daniel L.; Maharrey, Crystal H.; Carey, Christopher R.; White, Richard A.; Hartman, John L.

    2016-01-01

    Purpose Research into the genetic mechanisms of aging has expanded rapidly over the past two decades. This has in part been the result of the use of model organisms (particularly yeast, worms and flies) and high-throughput technologies, combined with a growing interest in aging research. Despite this progress, widespread consensus regarding the pathways that are fundamental to the modulation of cellular aging and lifespan for all organisms has been limited due to discrepancies between different studies. We have compared results from published genome-wide, chronological lifespan (CLS) screens of individual gene deletion strains in S. cerevisiae in order to identify gene deletion strains with consistent influences on longevity as possible indicators of fundamental aging processes from this single-celled, eukaryotic model organism. Methods Three previous reports have described genetic modifiers of chronological aging in the budding yeast (S. cerevisiae) using the yeast gene deletion strain collection. We performed a comparison among the data sets using correlation and decile distribution analysis to describe concordance between screens and identify strains that consistently increased or decreased CLS. We used gene enrichment analysis in an effort to understand the biology underlying genes identified in multiple studies. We attempted to replicate the different experimental conditions employed by the screens to identify potential sources of variability in CLS worth further investigating. Results Among 3209 strains present in all three screens, nine (2.80%) deletions strains were in common in the longest-lived decile and thirteen (4.05%) were in common in the shortest-lived decile for all three screens. Similarly, pairwise overlap between screens was low. When the same comparison was extended to three deciles to include more mutants studied in common between the three screens, enrichment of cellular processes based on gene ontology analysis in the long-lived strains
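    The decile comparison described in this abstract can be sketched as follows. This is a minimal illustration, assuming each screen is a mapping from strain name to a lifespan score over the same set of strains; the function names and the toy data are hypothetical, and only the figures 3209, 9 and 13 come from the abstract.

```python
def decile_set(scores, top=True, n_deciles=1):
    """Names of the strains in the highest (top=True) or lowest n_deciles deciles."""
    ranked = sorted(scores, key=scores.get, reverse=top)
    size = round(len(ranked) * n_deciles / 10)
    return set(ranked[:size])


def shared_extremes(screens, top=True, n_deciles=1):
    """Strains in the extreme decile(s) of every screen, and their share of that decile."""
    sets = [decile_set(s, top, n_deciles) for s in screens]
    shared = set.intersection(*sets)
    return shared, len(shared) / len(sets[0])


def expected_by_chance(n_strains, n_screens, n_deciles=1):
    """Expected overlap if the screens ranked strains independently of each other."""
    return n_strains * (n_deciles / 10) ** n_screens


# Hypothetical toy data: three screens scoring the same 20 strains.
screen_a = {f"s{i}": float(i) for i in range(20)}
screen_b = dict(screen_a, s17=18.5)  # s17 overtakes s18 in this screen
screen_c = dict(screen_a)

shared, share = shared_extremes([screen_a, screen_b, screen_c])
print(shared, share)

# Figures quoted in the abstract: 3209 strains present in all three screens.
decile_size = round(3209 / 10)
print(decile_size, round(100 * 9 / decile_size, 2), round(100 * 13 / decile_size, 2))
print(expected_by_chance(3209, 3))
```

    Under the independence assumption the expected three-way overlap in a single decile is roughly 3 strains, which gives a rough baseline against which the reported overlaps of 9 and 13 strains can be read.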

  6. Genetic Association and Gene-Gene Interaction Analyses in African American Dialysis Patients With Nondiabetic Nephropathy

    PubMed Central

    Bostrom, Meredith A.; Kao, W.H. Linda; Li, Man; Abboud, Hanna E.; Adler, Sharon G.; Iyengar, Sudha K.; Kimmel, Paul L.; Hanson, Robert L.; Nicholas, Susanne B.; Rasooly, Rebekah S.; Sedor, John R.; Coresh, Josef; Kohn, Orly F.; Leehey, David J.; Thornley-Brown, Denyse; Bottinger, Erwin P.; Lipkowitz, Michael S.; Meoni, Lucy A.; Klag, Michael J.; Lu, Lingyi; Hicks, Pamela J.; Langefeld, Carl D.; Parekh, Rulan S.; Bowden, Donald W.; Freedman, Barry I.

    2011-01-01

    Background African Americans (AAs) have increased susceptibility to non-diabetic nephropathy relative to European Americans. Study Design Follow-up of a pooled genome-wide association study (GWAS) in AA dialysis patients with nondiabetic nephropathy; novel gene-gene interaction analyses. Setting & Participants Wake Forest sample: 962 AA nondiabetic nephropathy cases; 931 non-nephropathy controls. Replication sample: 668 Family Investigation of Nephropathy and Diabetes (FIND) AA nondiabetic nephropathy cases; 804 non-nephropathy controls. Predictors Individual genotyping of top 1420 pooled GWAS-associated single nucleotide polymorphisms (SNPs) and 54 SNPs in six nephropathy susceptibility genes. Outcomes APOL1 genetic association and additional candidate susceptibility loci interacting with, or independently from, APOL1. Results The strongest GWAS associations included two non-coding APOL1 SNPs, rs2239785 (odds ratio [OR], 0.33; dominant; p = 5.9 × 10^−24) and rs136148 (OR, 0.54; additive; p = 1.1 × 10^−7) with replication in FIND (p = 5.0 × 10^−21 and 1.9 × 10^−05, respectively). Rs2239785 remained significantly associated after controlling for the APOL1 G1 and G2 coding variants. Additional top hits included a CFH SNP (OR from meta-analysis in above 3367 AA cases and controls, 0.81; additive; p = 6.8 × 10^−4). The 1420 SNPs were tested for interaction with APOL1 G1 and G2 variants. Several interactive SNPs were detected, the most significant was rs16854341 in the podocin gene (NPHS2) (p = 0.0001). Limitations Non-pooled GWAS have not been performed in AA nondiabetic nephropathy. Conclusions This follow-up of a pooled GWAS provides additional and independent evidence that APOL1 variants contribute to nondiabetic nephropathy in AAs and identified additional associated and interactive non-diabetic nephropathy susceptibility genes. PMID:22119407

  7. Identifying Novel Transcriptional and Epigenetic Features of Nuclear Lamina-associated Genes.

    PubMed

    Wu, Feinan; Yao, Jie

    2017-03-07

    Because a large portion of the mammalian genome is associated with the nuclear lamina (NL), it is interesting to study how native genes residing there are transcribed and regulated. In this study, we report unique transcriptional and epigenetic features of nearly 3,500 NL-associated genes (NL genes). Promoter regions of active NL genes are often excluded from NL-association, suggesting that NL-promoter interactions may repress transcription. Active NL genes with higher RNA polymerase II (Pol II) recruitment levels tend to display Pol II promoter-proximal pausing, while Pol II recruitment and Pol II pausing are not correlated among non-NL genes. At the genome-wide scale, NL-association and H3K27me3 distinguish two large gene classes with low transcriptional activities. Notably, NL-association is anti-correlated with both transcription and active histone mark levels among genes not significantly enriched with H3K9me3 or H3K27me3, suggesting that NL-association may represent a novel gene repression pathway. Interestingly, an NL gene subgroup is not significantly enriched with H3K9me3 or H3K27me3 and is transcribed at higher levels than the rest of NL genes. Furthermore, we identified distal enhancers associated with active NL genes and reported their epigenetic features.

  8. The case-only test for gene-environment interaction is not uniformly powerful: an empirical example

    PubMed Central

    Wu, Chen; Chang, Jiang; Ma, Baoshan; Miao, Xiaoping; Zhou, Yifeng; Liu, Yu; Li, Yun; Wu, Tangchun; Hu, Zhibin; Shen, Hongbing; Jia, Weihua; Zeng, Yixin; Lin, Dongxin; Kraft, Peter

    2016-01-01

    The case-only test has been proposed as a more powerful approach to detect gene-environment (G×E) interactions. This approach assumes that the genetic and environmental factors are independent. While it is well known that Type I error rate will increase if this assumption is violated, it is less widely appreciated that gene-environment correlation can also lead to power loss. We illustrate this phenomenon by comparing the performance of the case-only test to other approaches to detect G×E interactions in a genome-wide association study of esophageal squamous carcinoma (ESCC) in Chinese populations. Some of these approaches do not use information on the correlation between exposure and genotype (standard logistic regression), while others seek to use this information in a robust fashion to boost power without increasing Type I error (two-step, empirical Bayes and cocktail methods). G×E interactions were identified involving drinking status and two regions containing genes in the alcohol metabolism pathway, 4q23 and 12q24. Although the case-only test yielded the most significant tests of G×E interaction in the 4q23 region, the case-only test failed to identify significant interactions in the 12q24 region which were readily identified using other approaches. The low power of the case-only test in the 12q24 region is likely due to the strong inverse association between the SNPs in this region and drinking status. This example underscores the need to consider multiple approaches to detect gene-environment interactions, as different tests are more or less sensitive to different alternative hypotheses and violations of the gene-environment independence assumption. PMID:23595356
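    The dependence of the case-only test on gene-environment independence can be made concrete with a small calculation. The sketch below assumes a binary genotype G, a binary exposure E and a log-linear (multiplicative) risk model; under that assumption the G-E odds ratio among cases equals the population G-E odds ratio multiplied by the interaction term. All probabilities and relative risks are hypothetical, and this is not the analysis used in the study.

```python
def odds_ratio(table):
    """Odds ratio of a 2x2 table indexed as table[g][e]."""
    return (table[1][1] * table[0][0]) / (table[1][0] * table[0][1])


def case_table(pop, base, rr_g, rr_e, rr_ge):
    """Joint G-E distribution among cases under a log-linear risk model.

    Risk(g, e) = base * rr_g**g * rr_e**e * rr_ge**(g * e)  (assumed model).
    """
    weights = [
        [pop[g][e] * base * rr_g**g * rr_e**e * rr_ge ** (g * e) for e in (0, 1)]
        for g in (0, 1)
    ]
    total = sum(sum(row) for row in weights)
    return [[w / total for w in row] for row in weights]


# Hypothetical population tables pop[g][e]; each sums to 1.
independent = [[0.42, 0.28], [0.18, 0.12]]  # G-E odds ratio 1
positive = [[0.45, 0.25], [0.15, 0.15]]     # G-E odds ratio 1.8
inverse = [[0.40, 0.30], [0.20, 0.10]]      # G-E odds ratio 2/3

# Independence holds: the case-only odds ratio recovers the interaction (1.5).
valid = odds_ratio(case_table(independent, 0.01, 1.2, 1.3, 1.5))
# No interaction, but G and E are correlated: a spurious signal of 1.8.
spurious = odds_ratio(case_table(positive, 0.01, 1.2, 1.3, 1.0))
# True interaction of 1.5 cancelled by an inverse G-E association.
masked = odds_ratio(case_table(inverse, 0.01, 1.2, 1.3, 1.5))
print(valid, spurious, masked)
```

    The third case mirrors the situation described for the 12q24 region: an inverse association between genotype and exposure can offset a real interaction, leaving the case-only odds ratio near 1.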

  9. Candidate genes for panhypopituitarism identified by gene expression profiling

    PubMed Central

    Mortensen, Amanda H.; MacDonald, James W.; Ghosh, Debashis

    2011-01-01

    Mutations in the transcription factors PROP1 and PIT1 (POU1F1) lead to pituitary hormone deficiency and hypopituitarism in mice and humans. The dysmorphology of developing Prop1 mutant pituitaries readily distinguishes them from those of Pit1 mutants and normal mice. This and other features suggest that Prop1 controls the expression of genes besides Pit1 that are important for pituitary cell migration, survival, and differentiation. To identify genes involved in these processes we used microarray analysis of gene expression to compare pituitary RNA from newborn Prop1 and Pit1 mutants and wild-type littermates. Significant differences in gene expression were noted between each mutant and their normal littermates, as well as between Prop1 and Pit1 mutants. Otx2, a gene critical for normal eye and pituitary development in humans and mice, exhibited elevated expression specifically in Prop1 mutant pituitaries. We report the spatial and temporal regulation of Otx2 in normal mice and Prop1 mutants, and the results suggest Otx2 could influence pituitary development by affecting signaling from the ventral diencephalon and regulation of gene expression in Rathke's pouch. The discovery that Otx2 expression is affected by Prop1 deficiency provides support for our hypothesis that identifying molecular differences in mutants will contribute to understanding the molecular mechanisms that control pituitary organogenesis and lead to human pituitary disease. PMID:21828248

  10. A deeper look at two concepts of measuring gene-gene interactions: logistic regression and interaction information revisited.

    PubMed

    Mielniczuk, Jan; Teisseyre, Paweł

    2018-03-01

    Detection of gene-gene interactions is one of the most important challenges in genome-wide case-control studies. Besides traditional logistic regression analysis, entropy-based methods have recently attracted significant attention. Among entropy-based methods, interaction information is one of the most promising measures, having many desirable properties. Although both logistic regression and interaction information have been used in several genome-wide association studies, the relationship between them has not been thoroughly investigated theoretically. The present paper attempts to fill this gap. We show that although certain connections between the two methods exist, in general they refer to two different concepts of dependence, and looking for interactions in those two senses leads to different approaches to interaction detection. We introduce an ordering between interaction measures and specify conditions for independent and dependent genes under which interaction information is a more discriminative measure than logistic regression. Moreover, we show that for so-called perfect distributions those measures are equivalent. The numerical experiments illustrate the theoretical findings, indicating that interaction information and its modified version are more universal tools for detecting various types of interaction than logistic regression and linkage disequilibrium measures. © 2017 WILEY PERIODICALS, INC.
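
    For orientation, interaction information for two loci X1, X2 and a phenotype Y is commonly defined as II(X1; X2; Y) = I((X1, X2); Y) - I(X1; Y) - I(X2; Y), where I denotes mutual information. The sketch below estimates it from empirical frequencies; the helper names and the toy data are assumptions made for illustration, not the authors' implementation or their modified version.

```python
import math
from collections import Counter

def entropy(samples):
    """Empirical Shannon entropy (in bits) of a list of hashable values."""
    n = len(samples)
    return -sum((c / n) * math.log2(c / n) for c in Counter(samples).values())

def mutual_information(xs, ys):
    """Empirical mutual information I(X; Y) = H(X) + H(Y) - H(X, Y)."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))

def interaction_information(x1, x2, y):
    """II(X1; X2; Y) = I((X1, X2); Y) - I(X1; Y) - I(X2; Y)."""
    joint = list(zip(x1, x2))
    return (mutual_information(joint, y)
            - mutual_information(x1, y)
            - mutual_information(x2, y))

# Toy XOR pattern: neither locus is informative alone, the pair determines y.
x1 = [0, 0, 1, 1]
x2 = [0, 1, 0, 1]
y = [0, 1, 1, 0]
ii = interaction_information(x1, x2, y)
```

    In this toy case the marginal mutual informations are zero while the joint one is one bit, so the whole signal is attributed to the interaction term.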

  11. Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network.

    PubMed

    Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

    2016-05-05

    Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer.

  12. Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network

    PubMed Central

    Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

    2016-01-01

    Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer. PMID:27149165

  13. Patterns of HIV-1 Protein Interaction Identify Perturbed Host-Cellular Subsystems

    PubMed Central

    MacPherson, Jamie I.; Dickerson, Jonathan E.; Pinney, John W.; Robertson, David L.

    2010-01-01

    Human immunodeficiency virus type 1 (HIV-1) exploits a diverse array of host cell functions in order to replicate. This is mediated through a network of virus-host interactions. A variety of recent studies have catalogued this information. In particular the HIV-1, Human Protein Interaction Database (HHPID) has provided a unique depth of protein interaction detail. However, as a map of HIV-1 infection, the HHPID is problematic, as it contains curation error and redundancy; in addition, it is based on a heterogeneous set of experimental methods. Based on identifying shared patterns of HIV-host interaction, we have developed a novel methodology to delimit the core set of host-cellular functions and their associated perturbation from the HHPID. Initially, using biclustering, we identify 279 significant sets of host proteins that undergo the same types of interaction. The functional cohesiveness of these protein sets was validated using a human protein-protein interaction network, gene ontology annotation and sequence similarity. Next, using a distance measure, we group host protein sets and identify 37 distinct higher-level subsystems. We further demonstrate the biological significance of these subsystems by cross-referencing with global siRNA screens that have been used to detect host factors necessary for HIV-1 replication, and investigate the seemingly small intersect between these data sets. Our results highlight significant host-cell subsystems that are perturbed during the course of HIV-1 infection. Moreover, we characterise the patterns of interaction that contribute to these perturbations. Thus, our work disentangles the complex set of HIV-1-host protein interactions in the HHPID, reconciles these with siRNA screens and provides an accessible and interpretable map of infection. PMID:20686668

  14. Overexpression screens identify conserved dosage chromosome instability genes in yeast and human cancer

    PubMed Central

    Duffy, Supipi; Fam, Hok Khim; Wang, Yi Kan; Styles, Erin B.; Kim, Jung-Hyun; Ang, J. Sidney; Singh, Tejomayee; Larionov, Vladimir; Shah, Sohrab P.; Andrews, Brenda; Boerkoel, Cornelius F.; Hieter, Philip

    2016-01-01

    Somatic copy number amplification and gene overexpression are common features of many cancers. To determine the role of gene overexpression on chromosome instability (CIN), we performed genome-wide screens in the budding yeast for yeast genes that cause CIN when overexpressed, a phenotype we refer to as dosage CIN (dCIN), and identified 245 dCIN genes. This catalog of genes reveals human orthologs known to be recurrently overexpressed and/or amplified in tumors. We show that two genes, TDP1, a tyrosyl-DNA-phosphodiesterase, and TAF12, an RNA polymerase II TATA-box binding factor, cause CIN when overexpressed in human cells. Rhabdomyosarcoma lines with elevated human Tdp1 levels also exhibit CIN that can be partially rescued by siRNA-mediated knockdown of TDP1. Overexpression of dCIN genes represents a genetic vulnerability that could be leveraged for selective killing of cancer cells through targeting of an unlinked synthetic dosage lethal (SDL) partner. Using SDL screens in yeast, we identified a set of genes that when deleted specifically kill cells with high levels of Tdp1. One gene was the histone deacetylase RPD3, for which there are known inhibitors. Both HT1080 cells overexpressing hTDP1 and rhabdomyosarcoma cells with elevated levels of hTdp1 were more sensitive to histone deacetylase inhibitors valproic acid (VPA) and trichostatin A (TSA), recapitulating the SDL interaction in human cells and suggesting VPA and TSA as potential therapeutic agents for tumors with elevated levels of hTdp1. The catalog of dCIN genes presented here provides a candidate list to identify genes that cause CIN when overexpressed in cancer, which can then be leveraged through SDL to selectively target tumors. PMID:27551064

  15. An analysis of the gene interaction networks identifying the role of PARP1 in metastasis of non-small cell lung cancer.

    PubMed

    Chen, Kai; Li, Yajie; Xu, Hui; Zhang, Chunfeng; Li, Zhiqiang; Wang, Wei; Wang, Baofeng

    2017-10-20

    Although many studies have examined non-small cell lung cancer (NSCLC), oncogenes and their mechanisms in these tumors have rarely been fully characterized. Here, we started from known NSCLC oncogenes to find a new oncogene and explore its mechanism of action in NSCLC. We first built an NSCLC genetic interaction network using bioinformatics methods and then combined a shortest path algorithm with a significance test to identify core genes closely connected to the given genes; real-time qPCR was used to compare expression levels between patients with NSCLC and healthy controls; additionally, PARP1's role in migration and invasion was assessed by transwell and wound-healing assays. The gene interaction network showed that core genes such as PARP1, EGFR and ALK interact directly. The TCGA database showed that PARP1 was strongly expressed in NSCLC and that its expression was significantly higher in metastatic than in non-metastatic NSCLC. In the scratch test, NSCLC cell migration was suppressed by PARP1 silencing and noticeably stimulated by PARP1 overexpression. According to Kaplan-Meier survival curves, higher PARP1 expression was associated with poorer patient survival and prognosis; thus, PARP1 expression was negatively correlated with patient survival and prognosis. The new oncogene PARP1 was identified from known NSCLC oncogenes through the gene interaction network, and the results demonstrate its impact on NSCLC cell migration.

  16. A Comparative Study on Multifactor Dimensionality Reduction Methods for Detecting Gene-Gene Interactions with the Survival Phenotype

    PubMed Central

    Lee, Seungyeoun; Kim, Yongkang; Kwon, Min-Seok; Park, Taesung

    2015-01-01

    Genome-wide association studies (GWAS) have extensively analyzed single SNP effects on a wide variety of common and complex diseases and found many genetic variants associated with diseases. However, there is still a large portion of the genetic variants left unexplained. This missing heritability problem might be due to the analytical strategy that limits analyses to only single SNPs. One possible approach to the missing heritability problem is to identify multi-SNP effects or gene-gene interactions. The multifactor dimensionality reduction (MDR) method has been widely used to detect gene-gene interactions; it is based on constructive induction, classifying high-dimensional genotype combinations into a one-dimensional variable with two attributes, high risk and low risk, for the case-control study. Many modifications of MDR have been proposed, and the method has also been extended to the survival phenotype. In this study, we propose several extensions of MDR for the survival phenotype and compare the proposed extensions with earlier MDR through comprehensive simulation studies. PMID:26339630
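
    The core pooling step of case-control MDR described above can be sketched as follows. This is a simplified illustration that assumes two loci, a binary status, and a threshold equal to the overall case-to-control ratio; it omits cross-validation and the survival extensions the study proposes. The function name and data are hypothetical.

```python
from collections import defaultdict

def mdr_labels(genotypes, status, threshold=None):
    """Label each multilocus genotype cell as 'high' or 'low' risk.

    genotypes: list of tuples, one genotype combination per subject.
    status:    list of 1 (case) / 0 (control), same length.
    threshold: case/control ratio above which a cell is high risk;
               defaults to the overall case/control ratio in the data.
    """
    cases = defaultdict(int)
    controls = defaultdict(int)
    for g, s in zip(genotypes, status):
        if s == 1:
            cases[g] += 1
        else:
            controls[g] += 1
    if threshold is None:
        n_cases = sum(status)
        threshold = n_cases / (len(status) - n_cases)
    labels = {}
    for g in set(cases) | set(controls):
        # Equivalent to cases/controls > threshold, but safe when controls == 0.
        labels[g] = "high" if cases[g] > threshold * controls[g] else "low"
    return labels

# Hypothetical two-locus data with an XOR-like risk pattern.
genotypes = [(0, 0), (0, 0), (0, 1), (0, 1), (1, 0), (1, 0), (1, 1), (1, 1)]
status = [0, 0, 1, 1, 1, 1, 0, 0]
labels = mdr_labels(genotypes, status)
```

    The resulting high/low label is the one-dimensional variable that is then evaluated as a predictor of the phenotype.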

  17. Computational analysis of gene-gene interactions using multifactor dimensionality reduction.

    PubMed

    Moore, Jason H

    2004-11-01

    Understanding the relationship between DNA sequence variations and biologic traits is expected to improve the diagnosis, prevention and treatment of common human diseases. Success in characterizing genetic architecture will depend on our ability to address nonlinearities in the genotype-to-phenotype mapping relationship as a result of gene-gene interactions, or epistasis. This review addresses the challenges associated with the detection and characterization of epistasis. A novel strategy known as multifactor dimensionality reduction that was specifically designed for the identification of multilocus genetic effects is presented. Several case studies that demonstrate the detection of gene-gene interactions in common diseases such as atrial fibrillation, Type II diabetes and essential hypertension are also discussed.

  18. Identifying potential maternal genes of Bombyx mori using digital gene expression profiling

    PubMed Central

    Xu, Pingzhen

    2018-01-01

    Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to track changes in gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these maternal genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160

  19. Gene-gene and gene-environment interactions defining lipid-related traits.

    PubMed

    Ordovás, José M; Robertson, Ruairi; Cléirigh, Ellen Ní

    2011-04-01

    Steps towards reducing chronic disease progression are continuously being taken through genomic research. Studies over the last year have highlighted more and more polymorphisms, pathways and interactions responsible for metabolic disorders such as cardiovascular disease, obesity and dyslipidemia. Many of these chronic illnesses can be partially attributed to altered lipid metabolism, combined with individual genetic components. Critical evaluation and comparison of these recent studies is essential in order to comprehend the results, conclusions and future prospects in the field of genomics as a whole. Recent literature elucidates significant gene-diet and gene-environment interactions resulting in altered lipid metabolism, inflammation and other metabolic imbalances leading to cardiovascular disease and obesity. Epigenetic and epistatic interactions are now becoming more significantly associated with such disorders, as genomic research digs deeper into the complex nature of genetic individuality and heritability. The vast array of data collected from genome-wide association studies must now be empowered and explored through more complex interaction studies, using standardized methods and larger sample sizes. In doing so the etiology of chronic disease progression will be further understood.

  20. LGscore: A method to identify disease-related genes using biological literature and Google data.

    PubMed

    Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun

    2015-04-01

    Since the genome project in the 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes.

    PubMed

    Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki

    2016-05-26

    Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes.

  2. Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes

    PubMed Central

    Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki

    2016-01-01

    Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414

  3. Identification of genes related to proliferative diabetic retinopathy through RWR algorithm based on protein-protein interaction network.

    PubMed

    Zhang, Jian; Suo, Yan; Liu, Min; Xu, Xun

    2018-06-01

    Proliferative diabetic retinopathy (PDR) is one of the most common complications of diabetes and can lead to blindness. Proteomic studies have provided insight into the pathogenesis of PDR, and a series of PDR-related genes have been identified but are far from fully characterized because the experimental methods are expensive and time consuming. In our previous study, we successfully identified 35 candidate PDR-related genes through the shortest-path algorithm. In the current study, we developed a computational method using the random walk with restart (RWR) algorithm and the protein-protein interaction (PPI) network to identify potential PDR-related genes. After some possible genes were obtained by the RWR algorithm, a three-stage filtration strategy, which includes the permutation test, interaction test and enrichment test, was applied to exclude potential false positives caused by the structure of the PPI network, poor interaction strength, and limited similarity in gene ontology (GO) terms and biological pathways. As a result, 36 candidate genes were discovered by the method, and these were different from the 35 genes reported in our previous study. A literature review showed that 21 of these 36 genes are supported by previous experiments. These findings suggest the robustness and complementary effects of both our efforts using different computational methods, thus providing an alternative method to study PDR pathogenesis. Copyright © 2017 Elsevier B.V. All rights reserved.
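
    The RWR step named above can be sketched in a few lines. This is a generic illustration of random walk with restart on an unweighted, undirected network, iterating p_next = (1 - r) * W * p + r * p0 until convergence; the restart probability, tolerance, node names and toy network are assumptions, and the three-stage filtration is not included.

```python
def random_walk_with_restart(adj, seeds, restart=0.8, tol=1e-10, max_iter=1000):
    """Return steady-state visiting probabilities for every node.

    adj:     dict mapping each node to a list of its neighbours (undirected).
    seeds:   set of seed nodes; the walker restarts uniformly among them.
    restart: probability of jumping back to a seed at each step (assumed value).
    """
    nodes = sorted(adj)
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        nxt = {n: restart * p0[n] for n in nodes}
        for u in nodes:
            if not adj[u]:
                continue  # isolated node: its walking mass is dropped
            share = (1 - restart) * p[u] / len(adj[u])
            for v in adj[u]:
                nxt[v] += share
        diff = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if diff < tol:
            break
    return p

# Hypothetical chain network A - B - C - D with A as the only seed.
network = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C"]}
scores = random_walk_with_restart(network, seeds={"A"})
```

    Nodes are then ranked by score, so non-seed nodes close to the seeds (here B, then C, then D) come out as the strongest candidates before any filtering.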

  4. Identifying New Candidate Genes and Chemicals Related to Prostate Cancer Using a Hybrid Network and Shortest Path Approach

    PubMed Central

    Wang, Meng; Wu, Kai; Lu, Changhong; Kong, Xiangyin

    2015-01-01

    Prostate cancer is a type of cancer that occurs in the male prostate, a gland in the male reproductive system. Because prostate cancer cells may spread to other parts of the body and can influence human reproduction, understanding the mechanisms underlying this disease is critical for designing effective treatments. The identification of as many genes and chemicals related to prostate cancer as possible will enhance our understanding of this disease. In this study, we proposed a computational method to identify new candidate genes and chemicals based on currently known genes and chemicals related to prostate cancer by applying a shortest path approach in a hybrid network. The hybrid network was constructed according to information concerning chemical-chemical interactions, chemical-protein interactions, and protein-protein interactions. Many of the obtained genes and chemicals are associated with prostate cancer. PMID:26504486
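
    The shortest path approach mentioned above can be sketched as follows. This is a simplified illustration that assumes an unweighted hybrid network and uses breadth-first search; the study may have used weighted edges, and all node names below are hypothetical. Interior nodes on shortest paths between known nodes are collected as candidates.

```python
from collections import deque
from itertools import combinations

def shortest_path(graph, source, target):
    """Breadth-first search; returns one shortest path as a list, or None."""
    prev = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            path = []
            while node is not None:
                path.append(node)
                node = prev[node]
            return path[::-1]
        for nb in graph.get(node, ()):
            if nb not in prev:
                prev[nb] = node
                queue.append(nb)
    return None

def candidates_on_paths(graph, known):
    """Collect interior nodes of shortest paths between all pairs of known nodes."""
    found = set()
    for a, b in combinations(sorted(known), 2):
        path = shortest_path(graph, a, b)
        if path:
            found.update(path[1:-1])
    return found - set(known)

# Hypothetical hybrid network mixing gene and chemical nodes.
graph = {
    "gene_K1": ["chem_X", "gene_Y"],
    "chem_X": ["gene_K1", "gene_K2"],
    "gene_K2": ["chem_X", "gene_Z"],
    "gene_Y": ["gene_K1", "gene_Z"],
    "gene_Z": ["gene_Y", "gene_K2"],
}
candidates = candidates_on_paths(graph, known={"gene_K1", "gene_K2"})
```

    Because genes and chemicals share one network, a chemical can appear as a candidate simply by lying on a short route between two known genes.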

  5. A Partial Least Square Approach for Modeling Gene-gene and Gene-environment Interactions When Multiple Markers Are Genotyped

    PubMed Central

    Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C.

    2008-01-01

    Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense SNPs in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches: the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey’s 1-df model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women’s Health Initiative (WHI), this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with BMI. PMID:18615621

  6. A partial least-square approach for modeling gene-gene and gene-environment interactions when multiple markers are genotyped.

    PubMed

    Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C

    2009-01-01

    Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense single nucleotide polymorphisms (SNPs) in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches: the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey's one-degree-of-freedom model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women's Health Initiative, this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with body mass index.

  7. Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions.

    PubMed

    Chatterjee, Nilanjan; Kalaylioglu, Zeynep; Moslehi, Roxana; Peters, Ulrike; Wacholder, Sholom

    2006-12-01

    In modern genetic epidemiology studies, the association between the disease and a genomic region, such as a candidate gene, is often investigated using multiple SNPs. We propose a multilocus test of genetic association that can account for genetic effects that might be modified by variants in other genes or by environmental factors. We consider use of the venerable and parsimonious Tukey's 1-degree-of-freedom model of interaction, which is natural when individual SNPs within a gene are associated with disease through a common biological mechanism; in contrast, many standard regression models are designed as if each SNP has unique functional significance. On the basis of Tukey's model, we propose a novel but computationally simple generalized test of association that can simultaneously capture both the main effects of the variants within a genomic region and their interactions with the variants in another region or with an environmental exposure. We compared performance of our method with that of two standard tests of association, one ignoring gene-gene/gene-environment interactions and the other based on a saturated model of interactions. We demonstrate major power advantages of our method both in analysis of data from a case-control study of the association between colorectal adenoma and DNA variants in the NAT2 genomic region, which are well known to be related to a common biological phenotype, and under different models of gene-gene interactions with use of simulated data.
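
    For readers unfamiliar with it, one common way to write a Tukey-type 1-degree-of-freedom interaction model for a logistic regression is shown below. The notation is an assumption made for illustration (G_i are SNP genotypes in one region, E_j are variants in another region or environmental exposures, D is disease status) and may differ from the paper's own parameterization.

```latex
\operatorname{logit}\Pr(D = 1 \mid G, E)
  = \alpha
  + \sum_{i} \beta_i G_i
  + \sum_{j} \gamma_j E_j
  + \theta \sum_{i} \sum_{j} \beta_i \gamma_j \, G_i E_j
```

    All pairwise interaction terms are tied to the main effects through the single parameter \theta, so \theta = 0 corresponds to no interaction, whereas a saturated model would need a separate coefficient for every (i, j) pair.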

  8. Key genes and pathways in measles and their interaction with environmental chemicals

    PubMed Central

    Zhang, Rongqiang; Jiang, Hualin; Li, Fengying; Su, Ning; Ding, Yi; Mao, Xiang; Ren, Dan; Wang, Jing

    2018-01-01

    The aim of the present study was to explore key genes that may have a role in the pathology of measles virus infection and to clarify the interaction networks between environmental factors and differentially expressed genes (DEGs). After screening the database of the Gene Expression Omnibus of the National Center for Biotechnology Information, the dataset GSE5808 was downloaded and analyzed. A global normalization method was performed to minimize data inconsistencies and heterogeneity. DEGs during different stages of measles virus infection were explored using R software (v3.4.0). Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of the DEGs were performed using Cytoscape 3.4.0 software. A protein-protein interaction (PPI) network of the DEGs was obtained from the STRING database v9.05. A total of 43 DEGs were obtained from four analyzed sample groups, including 10 highly expressed genes and 33 genes with decreased expression. The most enriched pathways based on KEGG analysis were fatty acid elongation, cytokine-cytokine receptor interaction and RNA degradation. The genes mentioned in the PPI network were mainly associated with protein binding and chemokine activity. A total of 219 chemicals were identified that may, jointly or on their own, interact with the 6 DEGs between the control group and patients with measles (at hospital entry), including benzo(a)pyrene (BaP) and tetrachlorodibenzodioxin (TCDD). In conclusion, the present study revealed that chemokines and environmental chemicals, e.g. BaP and TCDD, may affect the development of measles. PMID:29805511

  9. Key genes and pathways in measles and their interaction with environmental chemicals.

    PubMed

    Zhang, Rongqiang; Jiang, Hualin; Li, Fengying; Su, Ning; Ding, Yi; Mao, Xiang; Ren, Dan; Wang, Jing

    2018-06-01

    The aim of the present study was to explore key genes that may have a role in the pathology of measles virus infection and to clarify the interaction networks between environmental factors and differentially expressed genes (DEGs). After screening the database of the Gene Expression Omnibus of the National Center for Biotechnology Information, the dataset GSE5808 was downloaded and analyzed. A global normalization method was performed to minimize data inconsistencies and heterogeneity. DEGs during different stages of measles virus infection were explored using R software (v3.4.0). Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of the DEGs were performed using Cytoscape 3.4.0 software. A protein-protein interaction (PPI) network of the DEGs was obtained from the STRING database v9.05. A total of 43 DEGs were obtained from four analyzed sample groups, including 10 highly expressed genes and 33 genes with decreased expression. The most enriched pathways based on KEGG analysis were fatty acid elongation, cytokine-cytokine receptor interaction and RNA degradation. The genes mentioned in the PPI network were mainly associated with protein binding and chemokine activity. A total of 219 chemicals were identified that may, jointly or on their own, interact with the 6 DEGs between the control group and patients with measles (at hospital entry), including benzo(a)pyrene (BaP) and tetrachlorodibenzodioxin (TCDD). In conclusion, the present study revealed that chemokines and environmental chemicals, e.g. BaP and TCDD, may affect the development of measles.

  10. Differential reconstructed gene interaction networks for deriving toxicity threshold in chemical risk assessment.

    PubMed

    Yang, Yi; Maxwell, Andrew; Zhang, Xiaowei; Wang, Nan; Perkins, Edward J; Zhang, Chaoyang; Gong, Ping

    2013-01-01

    Pathway alterations reflected as changes in gene expression regulation and gene interaction can result from cellular exposure to toxicants. Such information is often used to elucidate toxicological modes of action. From a risk assessment perspective, alterations in biological pathways are a rich resource for setting toxicant thresholds, which may be more sensitive and mechanism-informed than traditional toxicity endpoints. Here we developed a novel differential networks (DNs) approach to connect pathway perturbation with toxicity threshold setting. Our DNs approach consists of 6 steps: time-series gene expression data collection, identification of altered genes, gene interaction network reconstruction, differential edge inference, mapping of genes with differential edges to pathways, and establishment of causal relationships between chemical concentration and perturbed pathways. A one-sample Gaussian process model and a linear regression model were used to identify genes that exhibited significant profile changes across an entire time course and between treatments, respectively. Interaction networks of differentially expressed (DE) genes were reconstructed for different treatments using a state space model and then compared to infer differential edges/interactions. DE genes possessing differential edges were mapped to biological pathways in databases such as KEGG pathways. Using the DNs approach, we analyzed a time-series Escherichia coli live cell gene expression dataset consisting of 4 treatments (control, 10, 100, 1000 mg/L naphthenic acids, NAs) and 18 time points. Through comparison of reconstructed networks and construction of differential networks, 80 genes were identified as DE genes with a significant number of differential edges, and 22 KEGG pathways were altered in a concentration-dependent manner. Some of these pathways were perturbed to a degree as high as 70% even at the lowest exposure concentration, implying a high sensitivity of our DNs approach

  11. Genome-nuclear lamina interactions and gene regulation.

    PubMed

    Kind, Jop; van Steensel, Bas

    2010-06-01

    The nuclear lamina, a filamentous protein network that coats the inner nuclear membrane, has long been thought to interact with specific genomic loci and regulate their expression. Molecular mapping studies have now identified large genomic domains that are in contact with the lamina. Genes in these domains are typically repressed, and artificial tethering experiments indicate that the lamina can actively contribute to this repression. Furthermore, the lamina indirectly controls gene expression in the nuclear interior by sequestration of certain transcription factors. A variety of DNA-binding and chromatin proteins may anchor specific loci to the lamina, while histone-modifying enzymes partly mediate the local repressive effect of the lamina. Experimental tools are now available to begin to unravel the underlying molecular mechanisms. Copyright 2010 Elsevier Ltd. All rights reserved.

  12. Gene-Environment Interactions in Schizophrenia: Review of Epidemiological Findings and Future Directions

    PubMed Central

    van Os, Jim; Rutten, Bart PF; Poulton, Richie

    2008-01-01

    Concern is building about high rates of schizophrenia in large cities, and among immigrants, cannabis users, and traumatized individuals, some of which likely reflects the causal influence of environmental exposures. This, in combination with very slow progress in the area of molecular genetics, has generated interest in more complicated models of schizophrenia etiology that explicitly posit gene-environment interactions (EU-GEI. European Network of Schizophrenia Networks for the Study of Gene Environment Interactions. Schizophrenia aetiology: do gene-environment interactions hold the key? [published online ahead of print April 25, 2008] Schizophr Res; S0920-9964(08) 00170–9). Although findings of epidemiological gene-environment interaction (G × E) studies are suggestive of widespread gene-environment interactions in the etiology of schizophrenia, numerous challenges remain. For example, attempts to identify gene-environment interactions cannot be equated with molecular genetic studies with a few putative environmental variables “thrown in”: G × E is a multidisciplinary exercise involving epidemiology, psychology, psychiatry, neuroscience, neuroimaging, pharmacology, biostatistics, and genetics. Epidemiological G × E studies using indirect measures of genetic risk in genetically sensitive designs have the advantage that they are able to model the net, albeit nonspecific, genetic load. In studies using direct molecular measures of genetic variation, a hypothesis-driven approach postulating synergistic effects between genes and environment impacting on a final common pathway, such as “sensitization” of mesolimbic dopamine neurotransmission, while simplistic, may provide initial focus and protection against the numerous false-positive and false-negative results that these investigations engender. Experimental ecogenetic approaches with randomized assignment may help to overcome some of the limitations of observational studies and allow for the additional

  13. Gene-gene, gene-environment, gene-nutrient interactions and single nucleotide polymorphisms of inflammatory cytokines.

    PubMed

    Nadeem, Amina; Mumtaz, Sadaf; Naveed, Abdul Khaliq; Aslam, Muhammad; Siddiqui, Arif; Lodhi, Ghulam Mustafa; Ahmad, Tausif

    2015-05-15

    Inflammation plays a significant role in the etiology of type 2 diabetes mellitus (T2DM). The rise in the pro-inflammatory cytokines is the essential step in glucotoxicity- and lipotoxicity-induced mitochondrial injury, oxidative stress and beta cell apoptosis in T2DM. Among the recognized markers are interleukin (IL)-6, IL-1, IL-10, IL-18, tumor necrosis factor-alpha (TNF-α), C-reactive protein, resistin, adiponectin, tissue plasminogen activator, fibrinogen and haptoglobins. Diabetes mellitus has a firm genetic and very strong environmental influence, exhibiting a polygenic mode of inheritance. Many single nucleotide polymorphisms (SNPs) in various genes, including those of pro- and anti-inflammatory cytokines, have been reported as a risk for T2DM. Not all the SNPs have been confirmed by unifying results in different studies, and wide variations have been reported in various ethnic groups. The inter-ethnic variations can be explained by the fact that gene expression may be regulated by gene-gene, gene-environment and gene-nutrient interactions. This review highlights the impact of these interactions on determining the role of single nucleotide polymorphism of IL-6, TNF-α, resistin and adiponectin in pathogenesis of T2DM.

  14. Detecting regulatory gene-environment interactions with unmeasured environmental factors.

    PubMed

    Fusi, Nicoló; Lippert, Christoph; Borgwardt, Karsten; Lawrence, Neil D; Stegle, Oliver

    2013-06-01

    Genomic studies have revealed a substantial heritable component of the transcriptional state of the cell. To fully understand the genetic regulation of gene expression variability, it is important to study the effect of genotype in the context of external factors such as alternative environmental conditions. In model systems, explicit environmental perturbations have been considered for this purpose, making it possible to directly test for environment-specific genetic effects. However, such experiments are limited to species that can be profiled in controlled environments, hampering their use in important systems such as human. Moreover, even in seemingly tightly regulated experimental conditions, subtle environmental perturbations cannot be ruled out, and hence unknown environmental influences are frequent. Here, we propose a model-based approach to simultaneously infer unmeasured environmental factors from gene expression profiles and use them in genetic analyses, identifying environment-specific associations between polymorphic loci and individual gene expression traits. In extensive simulation studies, we show that our method is able to accurately reconstruct environmental factors and their interactions with genotype in a variety of settings. We further illustrate the use of our model in a real-world dataset in which one environmental factor has been explicitly experimentally controlled. Our method is able to accurately reconstruct the true underlying environmental factor even if it is not given as an input, making it possible to detect genuine genotype-environment interactions. In addition to the known environmental factor, we find unmeasured factors involved in novel genotype-environment interactions. Our results suggest that interactions with both known and unknown environmental factors significantly contribute to gene expression variability. Availability and implementation: Software available at http://pmbio.github.io/envGPLVM/. Supplementary data are available at Bioinformatics online.

  15. Functional modules by relating protein interaction networks and gene expression.

    PubMed

    Tornow, Sabine; Mewes, H W

    2003-11-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.

  16. Functional modules by relating protein interaction networks and gene expression

    PubMed Central

    Tornow, Sabine; Mewes, H. W.

    2003-01-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships. PMID:14576317

  17. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    PubMed

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets, GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls), were downloaded from the Gene Expression Omnibus database. Characteristic genes were identified using the metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using the Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in a total of 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs, sanguinarine and papaverine, were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
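
    The module significance described above (the average gene significance over a module's genes) can be sketched as follows. This is a minimal illustration, not the WGCNA package itself: it assumes gene significance is the absolute Pearson correlation between a gene's expression and a 0/1 disease status, which the abstract does not state, and the names `pearson`, `module_significance`, `expression` and `trait` are hypothetical.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences.

    Assumes neither sequence is constant (otherwise the denominator is zero).
    """
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def module_significance(expression, trait, module_genes):
    """Average gene significance over the genes of one module.

    Assumption: gene significance = |Pearson r| between a gene's
    expression profile and the sample trait (e.g. 1 = RA, 0 = control).
    """
    return mean(abs(pearson(expression[g], trait)) for g in module_genes)


if __name__ == "__main__":
    # Toy data: two genes measured in four samples (two controls, two cases).
    expression = {"g1": [1, 2, 3, 4], "g2": [4, 3, 2, 1]}
    trait = [0, 0, 1, 1]
    print(module_significance(expression, trait, ["g1", "g2"]))
```

    Taking the absolute value means that strongly up-regulated and strongly down-regulated genes both raise the module's score.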

  18. Identifying gene networks underlying the neurobiology of ethanol and alcoholism.

    PubMed

    Wolen, Aaron R; Miles, Michael F

    2012-01-01

    For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.

  19. DEIVA: a web application for interactive visual analysis of differential gene expression profiles.

    PubMed

    Harshbarger, Jayson; Kratz, Anton; Carninci, Piero

    2017-01-07

    Differential gene expression (DGE) analysis is a technique to identify statistically significant differences in RNA abundance for genes or arbitrary features between different biological states. The result of a DGE test is typically further analyzed using statistical software, spreadsheets or custom ad hoc algorithms. We identified a need for a web-based system to share DGE statistical test results, and locate and identify genes in DGE statistical test results with a very low barrier of entry. We have developed DEIVA, a free and open source, browser-based single page application (SPA) with a strong emphasis on being user friendly that enables locating and identifying single or multiple genes in an immediate, interactive, and intuitive manner. By design, DEIVA scales with very large numbers of users and datasets. Compared to existing software, DEIVA offers a unique combination of design decisions that enable inspection and analysis of DGE statistical test results with an emphasis on ease of use.

  20. Systematic Search for Gene-Gene Interaction Effect on Prostate Cancer Risk

    DTIC Science & Technology

    2013-07-01

    Systematic Search for Gene-Gene Interaction 5a. CONTRACT NUMBER Effect on Prostate Cancer Risk 5b. GRANT NUMBER W81XWH-09-1-0488 5c. PROGRAM...Supported by this grant ) 1. Tao S, Wang Z, Feng J, Hsu FC, Jin G, Kin ST, Zhang Z, Gronberg H, Zheng, SL, Isaacs WB, XU J, Sun J. A Genome-Wide Search for...order interactions among estrogen- metabolism genes in sporadic breast cancer. Am J Hum Genet, 69, 138-47. 48. Marchini, J., Donnelly, P. and Cardon

  1. Pathway Interaction Network Analysis Identifies Dysregulated Pathways in Human Monocytes Infected by Listeria monocytogenes.

    PubMed

    Fan, Wufeng; Zhou, Yuhan; Li, Hao

    2017-01-01

    In our study, we aimed to extract dysregulated pathways in human monocytes infected by Listeria monocytogenes (LM) based on pathway interaction network (PIN) which presented the functional dependency between pathways. After genes were aligned to the pathways, principal component analysis (PCA) was used to calculate the pathway activity for each pathway, followed by detecting seed pathway. A PIN was constructed based on gene expression profile, protein-protein interactions (PPIs), and cellular pathways. Identifying dysregulated pathways from the PIN was performed relying on seed pathway and classification accuracy. To evaluate whether the PIN method was feasible or not, we compared the introduced method with standard network centrality measures. The pathway of RNA polymerase II pretranscription events was selected as the seed pathway. Taking this seed pathway as start, one pathway set (9 dysregulated pathways) with AUC score of 1.00 was identified. Among the 5 hub pathways obtained using standard network centrality measures, 4 pathways were the common ones between the two methods. RNA polymerase II transcription and DNA replication owned a higher number of pathway genes and DEGs. These dysregulated pathways work together to influence the progression of LM infection, and they will be available as biomarkers to diagnose LM infection.

  2. Identification of T1D susceptibility genes within the MHC region by combining protein interaction networks and SNP genotyping data

    PubMed Central

    Brorsson, C.; Hansen, N. T.; Lage, K.; Bergholdt, R.; Brunak, S.; Pociot, F.

    2009-01-01

    Aim: To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1 genes. Methods: We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein–protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein modules were statistically evaluated using permutation. Results: A total of 151 genes could be mapped to nodes within the protein interaction network and their interaction partners were identified. Five protein interaction modules reached statistical significance using this approach. The identified proteins are well known in the pathogenesis of T1D, but the modules also contain additional candidates that have been implicated in β-cell development and diabetic complications. Conclusions: The extensive LD within the MHC region makes it important to develop new methods for analysing genotyping data for identification of additional risk genes for T1D. Combining genetic data with knowledge about functional pathways provides new insight into mechanisms underlying T1D. PMID:19143816

  3. Activation-dependent intrachromosomal interactions formed by the TNF gene promoter and two distal enhancers

    PubMed Central

    Tsytsykova, Alla V.; Rajsbaum, Ricardo; Falvo, James V.; Ligeiro, Filipa; Neely, Simon R.; Goldfeld, Anne E.

    2007-01-01

    Here we provide a mechanism for specific, efficient transcription of the TNF gene and, potentially, other genes residing within multigene loci. We identify and characterize highly conserved noncoding elements flanking the TNF gene, which undergo activation-dependent intrachromosomal interactions. These elements, hypersensitive site (HSS)−9 and HSS+3 (9 kb upstream and 3 kb downstream of the TNF gene, respectively), contain DNase I hypersensitive sites in naive, T helper 1, and T helper 2 primary T cells. Both HSS-9 and HSS+3 inducibly associate with acetylated histones, indicative of chromatin remodeling, bind the transcription factor nuclear factor of activated T cells (NFAT)p in vitro and in vivo, and function as enhancers of NFAT-dependent transactivation mediated by the TNF promoter. Using the chromosome conformation capture assay, we demonstrate that upon T cell activation intrachromosomal looping occurs in the TNF locus. HSS-9 and HSS+3 each associate with the TNF promoter and with each other, circularizing the TNF gene and bringing NFAT-containing nucleoprotein complexes into close proximity. TNF gene regulation thus reveals a mode of intrachromosomal interaction that combines a looped gene topology with interactions between enhancers and a gene promoter. PMID:17940009

  4. Gene-Environment Interactions in Cancer Epidemiology: A National Cancer Institute Think Tank Report

    PubMed Central

    Hutter, Carolyn M.; Mechanic, Leah E.; Chatterjee, Nilanjan; Kraft, Peter; Gillander, Elizabeth M.

    2014-01-01

    Cancer risk is determined by a complex interplay of genetic and environmental factors. Genome-wide association studies (GWAS) have identified hundreds of common (minor allele frequency [MAF]>0.05) and less common (0.01genes and environment, including gene-environment interactions, into epidemiologic studies of cancer. To help address these questions, and to better inform research priorities and allocation of resources, the National Cancer Institute sponsored a “Gene-Environment Think Tank” on January 10th–11th, 2012. The objective of the Think Tank was to facilitate discussions on: 1) the state of the science; 2) the goals of gene-environment interaction studies in cancer epidemiology; and 3) opportunities for developing novel study designs and analysis tools. This report summarizes the Think Tank discussion, with a focus on contemporary approaches to the analysis of gene-environment interactions. Selecting the appropriate methods requires first identifying the relevant scientific question and rationale, with an important distinction made between analyses aiming to characterize the joint effects of putative or established genetic and environmental factors and analyses aiming to discover novel risk factors or novel interaction effects. Other discussion items include measurement error, statistical power, significance and replication. Additional designs, exposure assessments, and analytical approaches need to be considered as we move from the current small number of success stories to a fuller understanding of the interplay of genetic and environmental factors. PMID:24123198

  5. Multifactor-Dimensionality Reduction Reveals High-Order Interactions among Estrogen-Metabolism Genes in Sporadic Breast Cancer

    PubMed Central

    Ritchie, Marylyn D.; Hahn, Lance W.; Roodi, Nady; Bailey, L. Renee; Dupont, William D.; Parl, Fritz F.; Moore, Jason H.

    2001-01-01

    One of the greatest challenges facing human geneticists is the identification and characterization of susceptibility genes for common complex multifactorial human diseases. This challenge is partly due to the limitations of parametric-statistical methods for detection of gene effects that are dependent solely or partially on interactions with other genes and with environmental exposures. We introduce multifactor-dimensionality reduction (MDR) as a method for reducing the dimensionality of multilocus information, to improve the identification of polymorphism combinations associated with disease risk. The MDR method is nonparametric (i.e., no hypothesis about the value of a statistical parameter is made), is model-free (i.e., it assumes no particular inheritance model), and is directly applicable to case-control and discordant-sib-pair studies. Using simulated case-control data, we demonstrate that MDR has reasonable power to identify interactions among two or more loci in relatively small samples. When it was applied to a sporadic breast cancer case-control data set, in the absence of any statistically significant independent main effects, MDR identified a statistically significant high-order interaction among four polymorphisms from three different estrogen-metabolism genes. To our knowledge, this is the first report of a four-locus interaction associated with a common complex multifactorial disease. PMID:11404819
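
    The core MDR step described above, pooling multilocus genotype cells into high-risk and low-risk groups, can be sketched as below. This is a simplified illustration under stated assumptions, not the authors' software: it scans only pairs of loci, labels a cell high-risk when its case count exceeds `threshold` times its control count, scores each pair by training accuracy, and omits the cross-validation used in practice. The function and parameter names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def mdr_best_pair(genotypes, status, threshold=1.0):
    """Return (accuracy, (i, j)) for the best-classifying pair of loci.

    genotypes: one tuple of genotype codes per subject (e.g. 0/1/2 per locus)
    status:    1 for case, 0 for control, in the same subject order
    """
    n_loci = len(genotypes[0])
    best = None
    for i, j in combinations(range(n_loci), 2):
        # Count cases and controls in every two-locus genotype cell.
        cases, controls = Counter(), Counter()
        for g, s in zip(genotypes, status):
            (cases if s == 1 else controls)[(g[i], g[j])] += 1
        # A cell is high-risk when its case:control ratio exceeds the threshold.
        cells = set(cases) | set(controls)
        high_risk = {c for c in cells if cases[c] > threshold * controls[c]}
        # Predict "case" for subjects in high-risk cells; score by accuracy.
        correct = sum(
            ((g[i], g[j]) in high_risk) == (s == 1)
            for g, s in zip(genotypes, status)
        )
        accuracy = correct / len(status)
        if best is None or accuracy > best[0]:
            best = (accuracy, (i, j))
    return best


if __name__ == "__main__":
    # Toy data: disease follows an XOR pattern of loci 0 and 1; locus 2 is noise.
    genotypes = [(0, 0, 0), (0, 1, 1), (1, 0, 0), (1, 1, 1),
                 (0, 0, 1), (0, 1, 0), (1, 0, 1), (1, 1, 0)]
    status = [0, 1, 1, 0, 0, 1, 1, 0]
    print(mdr_best_pair(genotypes, status))
```

    In the toy data neither locus 0 nor locus 1 has a main effect on its own, which is the situation the abstract describes: the signal appears only when the two loci are considered jointly.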

  6. Identification of susceptible genes for complex chronic diseases based on disease risk functional SNPs and interaction networks.

    PubMed

    Li, Wan; Zhu, Lina; Huang, Hao; He, Yuehan; Lv, Junjie; Li, Weimin; Chen, Lina; He, Weiming

    2017-10-01

    Complex chronic diseases are caused by the effects of genetic and environmental factors. Single nucleotide polymorphisms (SNPs), one common type of genetic variations, played vital roles in diseases. We hypothesized that disease risk functional SNPs in coding regions and protein interaction network modules were more likely to contribute to the identification of disease susceptible genes for complex chronic diseases. This could help to further reveal the pathogenesis of complex chronic diseases. Disease risk SNPs were first recognized from public SNP data for coronary heart disease (CHD), hypertension (HT) and type 2 diabetes (T2D). SNPs in coding regions that were classified into nonsense and missense by integrating several SNP functional annotation databases were treated as functional SNPs. Then, regions significantly associated with each disease were screened using random permutations for disease risk functional SNPs. Corresponding to these regions, 155, 169 and 173 potential disease susceptible genes were identified for CHD, HT and T2D, respectively. A disease-related gene product interaction network in environmental context was constructed for interacting gene products of both disease genes and potential disease susceptible genes for these diseases. After functional enrichment analysis for disease associated modules, 5 CHD susceptible genes, 7 HT susceptible genes and 3 T2D susceptible genes were finally identified, some of which had pleiotropic effects. Most of these genes were verified to be related to these diseases in literature. This was similar for disease genes identified from another method proposed by Lee et al. from a different aspect. This research could provide novel perspectives for diagnosis and treatment of complex chronic diseases and susceptible genes identification for other diseases. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. Identifying key genes in glaucoma based on a benchmarked dataset and the gene regulatory network.

    PubMed

    Chen, Xi; Wang, Qiao-Ling; Zhang, Meng-Hui

    2017-10-01

    The current study aimed to identify key genes in glaucoma based on a benchmarked dataset and gene regulatory network (GRN). Local and global noise was added to the gene expression dataset to produce a benchmarked dataset. Differentially-expressed genes (DEGs) between patients with glaucoma and normal controls were identified utilizing the Linear Models for Microarray Data (Limma) package based on the benchmarked dataset. A total of 5 GRN inference methods, including Zscore, GeneNet, context likelihood of relatedness (CLR) algorithm, Partial Correlation coefficient with Information Theory (PCIT) and GEne Network Inference with Ensemble of Trees (Genie3), were evaluated using receiver operating characteristic (ROC) and precision and recall (PR) curves. The inference method with the best performance was selected to construct the GRN. Subsequently, topological centrality analysis (degree, closeness and betweenness) was conducted to identify key genes in the GRN of glaucoma. Finally, the key genes were validated by performing reverse transcription-quantitative polymerase chain reaction (RT-qPCR). A total of 176 DEGs were detected from the benchmarked dataset. The ROC and PR curves of the 5 methods were analyzed and it was determined that Genie3 had a clear advantage over the other methods; thus, Genie3 was used to construct the GRN. Following topological centrality analysis, 14 key genes for glaucoma were identified, including IL6, EPHA2 and GSTT1, and 5 of these 14 key genes were validated by RT-qPCR. Therefore, the current study identified 14 key genes in glaucoma, which may be potential biomarkers to use in the diagnosis of glaucoma and aid in identifying the molecular mechanism of this disease.

  8. Chemical-gene interaction networks and causal reasoning for ...

    EPA Pesticide Factsheets

    Evaluating the potential human health and ecological risks associated with exposures to complex chemical mixtures in the environment is one of the main challenges of chemical safety assessment and environmental protection. There is a need for approaches that can help to integrate chemical monitoring and biological effects data to evaluate risks associated with chemicals present in the environment. Here, we used prior knowledge about chemical-gene interactions to develop a knowledge assembly model for detected chemicals at five locations near the North Branch and Chisago wastewater treatment plants (WWTP) in the St. Croix River Basin, MN and WI. The assembly model was used to generate hypotheses about the biological impacts of the chemicals at each location. The hypotheses were tested using empirical hepatic gene expression data from fathead minnows exposed for 12 d at each location. Empirical gene expression data were also mapped to the assembly models to evaluate the likelihood of a chemical contributing to the observed biological responses using richness and concordance statistics. The prior knowledge approach was able to predict the observed biological pathways impacted at one site but not the other. Atrazine was identified as a potential contributor to the observed gene expression responses at a location upstream of the North Branch WWTP. Four chemicals were identified as contributors to the observed biological responses at the effluent and downstream o

  9. Network-Based Method for Identifying Co-Regeneration Genes in Bone, Dentin, Nerve and Vessel Tissues

    PubMed Central

    Pan, Hongying; Zhang, Yu-Hang; Feng, Kaiyan; Kong, XiangYin; Cai, Yu-Dong

    2017-01-01

    Bone and dental diseases are serious public health problems. Most current clinical treatments for these diseases can produce side effects. Regeneration is a promising therapy for bone and dental diseases, yielding natural tissue recovery with few side effects. Because soft tissues inside the bone and dentin are densely populated with nerves and vessels, the study of bone and dentin regeneration should also consider the co-regeneration of nerves and vessels. In this study, a network-based method to identify co-regeneration genes for bone, dentin, nerve and vessel was constructed based on an extensive network of protein–protein interactions. Three procedures were applied in the network-based method. The first procedure, searching, sought the shortest paths connecting regeneration genes of one tissue type with regeneration genes of other tissues, thereby extracting possible co-regeneration genes. The second procedure, testing, employed a permutation test to evaluate whether possible genes were false discoveries; these genes were excluded by the testing procedure. The last procedure, screening, employed two rules, the betweenness ratio rule and interaction score rule, to select the most essential genes. A total of seventeen genes were inferred by the method, which were deemed to contribute to co-regeneration of at least two tissues. All these seventeen genes were extensively discussed to validate the utility of the method. PMID:28974058
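
    The "searching" procedure described above can be sketched as follows. This is a simplified illustration under stated assumptions: the network is unweighted, only one shortest path is taken per seed pair (the study may consider all of them), and the gene labels and the helper names `shortest_path` and `candidate_genes` are hypothetical. The permutation test and screening rules are not reproduced.

```python
from collections import deque

def shortest_path(adj, start, goal):
    """Return one shortest path from start to goal as a list, or None."""
    parent = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for nb in adj.get(node, ()):
            if nb not in parent:
                parent[nb] = node
                queue.append(nb)
    return None

def candidate_genes(adj, seeds_a, seeds_b):
    """Collect interior genes on shortest paths linking two seed sets."""
    known = set(seeds_a) | set(seeds_b)
    candidates = set()
    for a in seeds_a:
        for b in seeds_b:
            path = shortest_path(adj, a, b)
            if path:
                # Drop the endpoints and any gene that is already a seed.
                candidates.update(g for g in path[1:-1] if g not in known)
    return candidates

# Hypothetical toy network: "x" bridges a bone seed and a nerve seed.
toy_ppi = {
    "bone1": ["x"],
    "x": ["bone1", "nerve1", "y"],
    "y": ["x"],
    "nerve1": ["x"],
}
found = candidate_genes(toy_ppi, ["bone1"], ["nerve1"])
```

    Here `y` is adjacent to the bridge but lies on no shortest path between the seeds, so it is not reported as a candidate.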

  10. Network-Based Method for Identifying Co-Regeneration Genes in Bone, Dentin, Nerve and Vessel Tissues.

    PubMed

    Chen, Lei; Pan, Hongying; Zhang, Yu-Hang; Feng, Kaiyan; Kong, XiangYin; Huang, Tao; Cai, Yu-Dong

    2017-10-02

    Bone and dental diseases are serious public health problems. Most current clinical treatments for these diseases can produce side effects. Regeneration is a promising therapy for bone and dental diseases, yielding natural tissue recovery with few side effects. Because soft tissues inside the bone and dentin are densely populated with nerves and vessels, the study of bone and dentin regeneration should also consider the co-regeneration of nerves and vessels. In this study, a network-based method to identify co-regeneration genes for bone, dentin, nerve and vessel was constructed based on an extensive network of protein-protein interactions. Three procedures were applied in the network-based method. The first procedure, searching, sought the shortest paths connecting regeneration genes of one tissue type with regeneration genes of other tissues, thereby extracting possible co-regeneration genes. The second procedure, testing, employed a permutation test to evaluate whether possible genes were false discoveries; these genes were excluded by the testing procedure. The last procedure, screening, employed two rules, the betweenness ratio rule and interaction score rule, to select the most essential genes. A total of seventeen genes were inferred by the method, which were deemed to contribute to co-regeneration of at least two tissues. All these seventeen genes were extensively discussed to validate the utility of the method.

  11. A Sparse Reconstruction Approach for Identifying Gene Regulatory Networks Using Steady-State Experiment Data

    PubMed Central

    Zhang, Wanhong; Zhou, Tong

    2015-01-01

    Motivation: Identifying gene regulatory networks (GRNs) which consist of a large number of interacting units has become a problem of paramount importance in systems biology. Situations exist extensively in which causal interacting relationships among these units are required to be reconstructed from measured expression data and other a priori information. Though numerous classical methods have been developed to unravel the interactions of GRNs, these methods have either higher computational complexity or lower estimation accuracy. Note that great similarities exist between identification of genes that directly regulate a specific gene and a sparse vector reconstruction, which often relates to the determination of the number, location and magnitude of nonzero entries of an unknown vector by solving an underdetermined system of linear equations y = Φx. Based on these similarities, we propose a novel framework of sparse reconstruction to identify the structure of a GRN, so as to increase accuracy of causal regulation estimations, as well as to reduce their computational complexity. Results: In this paper, a sparse reconstruction framework is proposed on the basis of steady-state experiment data to identify GRN structure. Unlike traditional methods, this approach is well suited to large-scale underdetermined problems in inferring a sparse vector. We investigate how to combine the noisy steady-state experiment data and a sparse reconstruction algorithm to identify causal relationships. Efficiency of this method is tested by an artificial linear network, a mitogen-activated protein kinase (MAPK) pathway network and the in silico networks of the DREAM challenges. The performance of the suggested approach is compared with two state-of-the-art algorithms, the widely adopted total least-squares (TLS) method and those available results on the DREAM project. Actual results show that, with a lower computational cost, the proposed method can
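
    For orientation, the underdetermined system y = Φx mentioned in this abstract is commonly attacked with an ℓ1-regularized formulation. The statement below is a generic textbook form given as an assumption, not necessarily the algorithm the authors use; the noise tolerance ε and the dimensions m and n are symbols introduced here for illustration.

```latex
% Generic sparse-recovery problem (assumed form, not taken from the paper):
% y \in \mathbb{R}^{m} holds the measurements, \Phi \in \mathbb{R}^{m \times n}
% with m < n is the sensing matrix, and x \in \mathbb{R}^{n} is the sparse
% vector of regulatory strengths acting on one target gene.
\hat{x} \;=\; \arg\min_{x \in \mathbb{R}^{n}} \; \lVert x \rVert_{1}
\quad \text{subject to} \quad
\lVert y - \Phi x \rVert_{2} \le \varepsilon
```

    Under this reading, the nonzero entries of the estimate indicate which genes are inferred to regulate the target directly, and their magnitudes indicate regulation strength.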

  12. Gene-environment interactions in the aetiology of systemic lupus erythematosus.

    PubMed

    Jönsen, Andreas; Bengtsson, Anders A; Nived, Ola; Truedsson, Lennart; Sturfelt, Gunnar

    2007-12-01

    Systemic lupus erythematosus (SLE) is a disease that displays a multitude of symptoms and a vast array of autoantibodies. The disease course may vary substantially between patients. The current understanding of SLE aetiology includes environmental factors acting on a genetically prone individual during an undetermined time period resulting in autoimmunity and finally surpassing that individual's disease threshold. Genetic differences and environmental factors may interact specifically in the pathogenetic processes and may influence disease development and modify the disease course. Identification of these factors and their interactions in the pathogenesis of SLE is vital in understanding the disease and may contribute to identifying new treatment targets and perhaps also aid in disease prevention. However, there are several problems that need to be overcome, such as the protracted time frame of environmental influence, time dependent epigenetic alterations and the possibility that different pathogenetic pathways may result in a similar disease phenotype. This is mirrored by the relatively few studies that suggest specific gene-environment interactions. These include an association between SLE diagnosis and glutathione S-transferase gene variants combined with occupational sun exposure as well as variants of the N-acetyl transferase gene in combination with either aromatic amine exposure or hydralazine. With increased knowledge on SLE pathogenesis, the role of environmental factors and their genetic interactions may be further elucidated.

  13. Diametrical clustering for identifying anti-correlated gene clusters.

    PubMed

    Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

    2003-09-01

    Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive: genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteasome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
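
    The two alternating steps named in this abstract can be sketched in plain Python. The sketch assumes that profiles are already mean-centred and nonzero, that genes are assigned to the prototype with the largest squared dot product (so a profile and its negation land together), and that the dominant singular vector is approximated by power iteration. The toy profiles, the initial prototypes and all function names are hypothetical.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]

def dominant_direction(vectors, iters=100):
    """Power iteration for the dominant eigenvector of sum_i x_i x_i^T."""
    v = normalize(vectors[0])
    for _ in range(iters):
        w = [0.0] * len(v)
        for x in vectors:
            c = dot(x, v)
            for k, xk in enumerate(x):
                w[k] += c * xk
        v = normalize(w)
    return v

def diametrical_clustering(profiles, prototypes, rounds=10):
    """Alternate between re-partitioning genes and updating prototypes."""
    data = [normalize(p) for p in profiles]
    protos = [normalize(p) for p in prototypes]
    labels = []
    for _ in range(rounds):
        # Step (i): assign each gene by squared similarity, ignoring sign.
        labels = [
            max(range(len(protos)), key=lambda j: dot(x, protos[j]) ** 2)
            for x in data
        ]
        # Step (ii): recompute each prototype from its current members.
        for j in range(len(protos)):
            members = [x for x, lab in zip(data, labels) if lab == j]
            if members:
                protos[j] = dominant_direction(members)
    return labels

# Hypothetical centred expression profiles over three conditions.
profiles = [
    [1.0, 0.0, -1.0],    # pattern P
    [-1.0, 0.0, 1.0],    # exact opposite of P
    [2.0, 0.1, -2.1],    # close to P
    [1.0, -2.0, 1.0],    # pattern Q
    [-1.0, 2.0, -1.0],   # exact opposite of Q
]
labels = diametrical_clustering(profiles, [profiles[0], profiles[3]])
```

    The squared similarity is what lets anti-correlated profiles share a cluster; a conventional correlation-based clustering would separate each pattern from its opposite.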

  14. Association of SNPs in dopamine and serotonin pathway genes and their interacting genes with temperament traits in Charolais cows.

    PubMed

    Garza-Brenner, E; Sifuentes-Rincón, A M; Randel, R D; Paredes-Sánchez, F A; Parra-Bracamonte, G M; Arellano Vera, W; Rodríguez Almeida, F A; Segura Cabrera, A

    2017-08-01

    Cattle temperament is a complex trait, and molecular studies aimed at defining this trait are scarce. We used an interaction networks approach to identify new genes (interacting genes) and to estimate their effects and those of 19 dopamine- and serotonin-related genes on the temperament traits of Charolais cattle. The genes proopiomelanocortin (POMC), neuropeptide Y (NPY), solute carrier family 18, member 2 (SLC18A2) and FBJ murine osteosarcoma viral oncogene homologue (FOSFBJ) were identified as new candidates. Their potential to be associated with temperament was estimated according to their reported biological activities, which included interactions with neural activity, receptor function, targeting or synthesis of neurotransmitters and association with behaviour. Pen score (PS) and exit velocity (EV) measures were determined from 412 Charolais cows to calculate their temperament score (TS). Based on the TS, calm (n = 55; TS, 1.09 ± 0.33) and temperamental (n = 58; TS, 2.27 ± 0.639) cows were selected and genotyped using a 248 single-nucleotide variation (SNV) panel. Of the 248 variations in the panel, only 151 were confirmed to be polymorphic (single-nucleotide polymorphisms; SNPs) in the tested population. Single-marker association analyses between genotypes and temperament measures (EV, PS and/or TS) indicated significant associations of six SNPs from four candidate genes. The markers rs109576799 and rs43696138, located in the DRD3 and HTR2A genes, respectively, were significantly associated with both EV and TS traits. Four markers, rs110365063 and rs137756569 from the POMC gene and rs110365063 and rs135155082 located in SLC18A2 and DRD2, respectively, were associated with PS. The variant rs110365063 located in bovine SLC18A2 causes a change in the amino acid sequence from Ala to Thr. Further studies are needed to confirm the association of genetic profile with cattle temperament; however, our study represents important progress in

  15. Gene Network for Identifying the Entropy Changes of Different Modules in Pediatric Sepsis.

    PubMed

    Yang, Jing; Zhang, Pingli; Wang, Lumin

    2016-01-01

    Pediatric sepsis is a disease that threatens life of children. The incidence of pediatric sepsis is higher in developing countries due to various reasons, such as insufficient immunization and nutrition, water and air pollution, etc. Exploring the potential genes via different methods is of significance for the prevention and treatment of pediatric sepsis. This study aimed to identify potential genes associated with pediatric sepsis utilizing analysis of gene network and entropy. The mRNA expression in the blood samples collected from 20 septic children and 30 healthy controls was quantified by using Affymetrix HG-U133A microarray. Two condition-specific protein-protein interaction networks (PINs), one for the healthy control and the other one for the children with sepsis, were deduced by combining the fundamental human PINs with gene expression profiles in the two phenotypes. Subsequently, distinct modules from the two conditional networks were extracted by adopting a maximal clique-merging approach. Delta entropy (ΔS) was calculated between sepsis and control modules. Then, key genes displaying changes in gene composition were identified by matching the control and sepsis modules. Two objective modules were obtained, in which ribosomal protein RPL4 and RPL9 as well as TOP2A were probably considered as the key genes differentiating sepsis from healthy controls. According to previous reports and this work, TOP2A is the potential gene therapy target for pediatric sepsis. The relationship between pediatric sepsis and RPL4 and RPL9 needs further investigation. © 2016 The Author(s) Published by S. Karger AG, Basel.

  16. Gene by Environment Interaction and Resilience: Effects of Child Maltreatment and Serotonin, Corticotropin Releasing Hormone, Dopamine, and Oxytocin Genes

    PubMed Central

    Cicchetti, Dante; Rogosch, Fred A.

    2013-01-01

    In this investigation, gene-environment interaction effects in predicting resilience in adaptive functioning among maltreated and nonmaltreated low-income children (N = 595) were examined. A multi-component index of resilient functioning was derived and levels of resilient functioning were identified. Variants in four genes, 5-HTTLPR, CRHR1, DRD4 -521C/T, and OXTR, were investigated. In a series of ANCOVAs, child maltreatment demonstrated a strong negative main effect on children’s resilient functioning, whereas no main effects for any of the genotypes of the respective genes were found. However, gene-environment interactions involving genotypes of each of the respective genes and maltreatment status were obtained. For each respective gene, among children with a specific genotype, the relative advantage in resilient functioning of nonmaltreated compared to maltreated children was stronger than was the case for nonmaltreated and maltreated children with other genotypes of the respective gene. Across the four genes, a composite of the genotypes that more strongly differentiated resilient functioning between nonmaltreated and maltreated children provided further evidence of genetic variations influencing resilient functioning in nonmaltreated children, whereas genetic variation had a negligible effect on promoting resilience among maltreated children. Additional effects were observed for children based on the number of subtypes of maltreatment children experienced, as well as for abuse and neglect subgroups. Finally, maltreated and nonmaltreated children with high levels of resilience differed in their average number of differentiating genotypes. These results suggest that differential resilient outcomes are based on the interaction between genes and developmental experiences. PMID:22559122

  17. Interaction of two photoreceptors in the regulation of bacterial photosynthesis genes

    PubMed Central

    Metz, Sebastian; Haberzettl, Kerstin; Frühwirth, Sebastian; Teich, Kristin; Hasewinkel, Christian; Klug, Gabriele

    2012-01-01

    The expression of photosynthesis genes in the facultatively photosynthetic bacterium Rhodobacter sphaeroides is controlled by the oxygen tension and by light quantity. Two photoreceptor proteins, AppA and CryB, have been identified in the past, which are involved in this regulation. AppA senses light by its N-terminal BLUF domain, its C-terminal part binds heme and is redox-responsive. Through its interaction to the transcriptional repressor PpsR the AppA photoreceptor controls expression of photosynthesis genes. The cryptochrome-like protein CryB was shown to affect regulation of photosynthesis genes, but the underlying signal chain remained unknown. Here we show that CryB interacts with the C-terminal domain of AppA and modulates the binding of AppA to the transcriptional repressor PpsR in a light-dependent manner. Consequently, binding of the transcription factor PpsR to its DNA target is affected by CryB. In agreement with this, all genes of the PpsR regulon showed altered expression levels in a CryB deletion strain after blue-light illumination. These results elucidate for the first time how a bacterial cryptochrome affects gene expression. PMID:22434878

  18. Interaction of two photoreceptors in the regulation of bacterial photosynthesis genes.

    PubMed

    Metz, Sebastian; Haberzettl, Kerstin; Frühwirth, Sebastian; Teich, Kristin; Hasewinkel, Christian; Klug, Gabriele

    2012-07-01

    The expression of photosynthesis genes in the facultatively photosynthetic bacterium Rhodobacter sphaeroides is controlled by the oxygen tension and by light quantity. Two photoreceptor proteins, AppA and CryB, have been identified in the past, which are involved in this regulation. AppA senses light by its N-terminal BLUF domain, its C-terminal part binds heme and is redox-responsive. Through its interaction to the transcriptional repressor PpsR the AppA photoreceptor controls expression of photosynthesis genes. The cryptochrome-like protein CryB was shown to affect regulation of photosynthesis genes, but the underlying signal chain remained unknown. Here we show that CryB interacts with the C-terminal domain of AppA and modulates the binding of AppA to the transcriptional repressor PpsR in a light-dependent manner. Consequently, binding of the transcription factor PpsR to its DNA target is affected by CryB. In agreement with this, all genes of the PpsR regulon showed altered expression levels in a CryB deletion strain after blue-light illumination. These results elucidate for the first time how a bacterial cryptochrome affects gene expression.

  19. ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

    PubMed Central

    Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

    2012-01-01

    Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617

  20. Clustering gene expression data based on predicted differential effects of GV interaction.

    PubMed

    Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

    2005-02-01

    Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.

  1. Identification of Genes That Interact With Drosophila liquid facets

    PubMed Central

    Eun, Suk Ho; Lea, Kristi; Overstreet, Erin; Stevens, Samuel; Lee, Ji-Hoon; Fischer, Janice A.

    2007-01-01

    We have performed mutagenesis screens of the Drosophila X chromosome and the autosomes for dominant enhancers of the rough eye resulting from overexpression of liquid facets. The liquid facets gene encodes the homolog of vertebrate endocytic Epsin, an endocytic adapter protein. In Drosophila, Liquid facets is a core component of the Notch signaling pathway required in the signaling cells for ligand endocytosis and signaling. Why ligand internalization by the signaling cells is essential for signaling is a mystery. The requirement for Liquid facets is a hint at the answer, and the genes identified in this screen provide further clues. Mutant alleles of clathrin heavy chain, Rala, split ends, and auxilin were identified as enhancers. We describe the mutant alleles and mutant phenotypes of Rala and aux. We discuss the relevance of all of these genetic interactions to the function of Liquid facets in Notch signaling. PMID:17179082

  2. Rice-arsenate interactions in hydroponics: a three-gene model for tolerance.

    PubMed

    Norton, Gareth J; Nigar, Meher; Williams, Paul N; Dasgupta, Tapash; Meharg, Andrew A; Price, Adam H

    2008-01-01

    In this study, the genetic mapping of the tolerance of root growth to 13.3 μM arsenate [As(V)] using the Bala×Azucena population is improved, and candidate genes for further study are identified. A remarkable three-gene model of tolerance is advanced, which appears to involve epistatic interaction between three major genes, two on chromosome 6 and one on chromosome 10. Any combination of two of these genes inherited from the tolerant parent leads to the plant having tolerance. Lists of potential positional candidate genes are presented. These are then refined using whole genome transcriptomics data and bioinformatics. Physiological evidence is also provided that genes related to phosphate transport are unlikely to be behind the genetic loci conferring tolerance. These results offer testable hypotheses for genes related to As(V) tolerance that might offer strategies for mitigating arsenic (As) accumulation in consumed rice.

  3. Rice–arsenate interactions in hydroponics: a three-gene model for tolerance

    PubMed Central

    Norton, Gareth J.; Nigar, Meher; Dasgupta, Tapash; Meharg, Andrew A.; Price, Adam H.

    2008-01-01

    In this study, the genetic mapping of the tolerance of root growth to 13.3 μM arsenate [As(V)] using the Bala×Azucena population is improved, and candidate genes for further study are identified. A remarkable three-gene model of tolerance is advanced, which appears to involve epistatic interaction between three major genes, two on chromosome 6 and one on chromosome 10. Any combination of two of these genes inherited from the tolerant parent leads to the plant having tolerance. Lists of potential positional candidate genes are presented. These are then refined using whole genome transcriptomics data and bioinformatics. Physiological evidence is also provided that genes related to phosphate transport are unlikely to be behind the genetic loci conferring tolerance. These results offer testable hypotheses for genes related to As(V) tolerance that might offer strategies for mitigating arsenic (As) accumulation in consumed rice. PMID:18453529
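
    The epistatic rule stated in this abstract (any two of the three loci inherited from the tolerant parent confer tolerance) can be written as a small truth table. The sketch below is a simplification: each locus is reduced to a boolean, heterozygosity is ignored, and treating all three tolerant-parent alleles as tolerant is an assumption that the abstract does not state explicitly.

```python
from itertools import product

def is_tolerant(loci):
    """loci: three booleans, True if that locus carries the tolerant-parent allele."""
    return sum(loci) >= 2

# Enumerate all 2**3 allele combinations across the three loci
# (two on chromosome 6, one on chromosome 10).
table = {g: is_tolerant(g) for g in product([False, True], repeat=3)}
tolerant_count = sum(table.values())
```

    Under these assumptions, four of the eight combinations are tolerant, and no single locus is sufficient on its own, which is what makes the model epistatic rather than additive.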

  4. Genome-wide gene-asbestos exposure interaction association study identifies a common susceptibility variant on 22q13.31 associated with lung cancer risk

    PubMed Central

    Liu, Chen-yu; Stücker, Isabelle; Chen, Chu; Goodman, Gary; McHugh, Michelle K.; D’Amelio, Anthony M.; Etzel, Carol J.; Li, Su; Lin, Xihong; Christiani, David C.

    2015-01-01

    Background Occupational asbestos exposure has been found to increase lung cancer risk in epidemiological studies. Methods We conducted an asbestos exposure-gene interaction analyses among several Caucasian populations who were current or ex-smokers. The discovery phase included 833 Caucasian cases and 739 Caucasian controls, and used a genome-wide association study (GWAS) to identify single nucleotide polymorphisms (SNPs) with gene-asbestos interaction effects. The top ranked SNPs from the discovery phase were replicated within the International Lung and Cancer Consortium (ILCCO). First, in silico replication was conducted in those groups that had GWAS and asbestos exposure data, including 1,548 cases and 1,527 controls. This step was followed by de novo genotyping to replicate the results from the in silico replication, and included 1,539 cases and 1,761 controls. Multiple logistic regression was used to assess the SNP-asbestos exposure interaction effects on lung cancer risk. Results We observed significantly increased lung cancer risk among MIRLET7BHG (MIRLET7B host gene located at 22q13.31) polymorphisms rs13053856, rs11090910, rs11703832, and rs12170325 heterozygous and homozygous variant allele(s) carriers [p<5×10−7 by likelihood ratio test; df=1]. Among the heterozygous and homozygous variant allele(s) carriers of polymorphisms rs13053856, rs11090910, rs11703832, and rs12170325, each unit increase in the natural log-transformed asbestos exposure score was associated with age-, sex-, smoking status- and center-adjusted ORs of 1.34 (95%CI=1.18–1.51), 1.24 (95%CI=1.14–1.35), 1.28 (95%CI=1.17–1.40), and 1.26 (95%CI=1.15–1.38), respectively for lung cancer risk. Conclusion Our findings suggest that MIRLET7BHG polymorphisms may be important predictive markers for asbestos exposure-related lung cancer. Impact To our knowledge, our study is the first report using a systematic genome-wide analysis in combination with detailed asbestos exposure data and

  5. A Novel Test for Gene-Ancestry Interactions in Genome-Wide Association Data

    PubMed Central

    Dunlop, Malcolm G.; Houlston, Richard S.; Tomlinson, Ian P.; Holmes, Chris C.

    2012-01-01

    Genome-wide association study (GWAS) data on a disease are increasingly available from multiple related populations. In this scenario, meta-analyses can improve power to detect homogeneous genetic associations, but if there exist ancestry-specific effects, via interactions on genetic background or with a causal effect that co-varies with genetic background, then these will typically be obscured. To address this issue, we have developed a robust statistical method for detecting susceptibility gene-ancestry interactions in multi-cohort GWAS based on closely-related populations. We use the leading principal components of the empirical genotype matrix to cluster individuals into “ancestry groups” and then look for evidence of heterogeneous genetic associations with disease or other trait across these clusters. Robustness is improved when there are multiple cohorts, as the signal from true gene-ancestry interactions can then be distinguished from gene-collection artefacts by comparing the observed interaction effect sizes in collection groups relative to ancestry groups. When applied to colorectal cancer, we identified a missense polymorphism in iron-absorption gene CYBRD1 that associated with disease in individuals of English, but not Scottish, ancestry. The association replicated in two additional, independently-collected data sets. Our method can be used to detect associations between genetic variants and disease that have been obscured by population genetic heterogeneity. It can be readily extended to the identification of genetic interactions on other covariates such as measured environmental exposures. We envisage our methodology being of particular interest to researchers with existing GWAS data, as ancestry groups can be easily defined and thus tested for interactions. PMID:23236349

  6. Gene-diet interaction effects on BMI levels in the Singapore Chinese population.

    PubMed

    Chang, Xuling; Dorajoo, Rajkumar; Sun, Ye; Han, Yi; Wang, Ling; Khor, Chiea-Chuen; Sim, Xueling; Tai, E-Shyong; Liu, Jianjun; Yuan, Jian-Min; Koh, Woon-Puay; van Dam, Rob M; Friedlander, Yechiel; Heng, Chew-Kiat

    2018-02-24

    Recent genome-wide association studies (GWAS) have identified 97 body-mass index (BMI) associated loci. We aimed to evaluate if dietary intake modifies BMI associations at these loci in the Singapore Chinese population. We utilized GWAS information from six data subsets from two adult Chinese populations (N = 7817). Seventy-eight genotyped or imputed index BMI single nucleotide polymorphisms (SNPs) that passed quality control procedures were available in all datasets. Alternative Healthy Eating Index (AHEI)-2010 score and ten nutrient variables were evaluated. Linear regression analyses between z score transformed BMI (Z-BMI) and dietary factors were performed. Interaction analyses were performed by introducing the interaction term (diet x SNP) in the same regression model. Analysis was carried out in each cohort individually and subsequently meta-analyzed using the inverse-variance weighted method. Analyses were also evaluated with a weighted gene-risk score (wGRS) constructed from BMI index SNPs from recent large-scale GWAS studies. Nominal associations between Z-BMI and AHEI-2010 and some dietary factors were identified (P = 0.047-0.010). The BMI wGRS was robustly associated with Z-BMI (P = 1.55 × 10−15) but not with any dietary variables. Dietary variables did not significantly interact with the wGRS to modify BMI associations. When interaction analyses were repeated using individual SNPs, a significant association between cholesterol intake and rs4740619 (CCDC171) was identified (β = 0.077, adjP interaction = 0.043). The CCDC171 gene locus may interact with cholesterol intake to increase BMI in the Singaporean Chinese population; however, most known obesity risk loci were not associated with dietary intake and did not interact with diet to modify BMI levels.
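
    The inverse-variance weighted pooling mentioned in this abstract can be sketched as below. The sketch assumes a fixed-effect model, in which each cohort's interaction coefficient is weighted by the reciprocal of its squared standard error; the cohort values are hypothetical and are not taken from the study.

```python
import math

def inverse_variance_meta(estimates):
    """Pool (beta, standard_error) pairs with fixed-effect inverse-variance weights."""
    weights = [1.0 / se ** 2 for _, se in estimates]
    total = sum(weights)
    beta = sum(w * b for w, (b, _) in zip(weights, estimates)) / total
    se = math.sqrt(1.0 / total)
    return beta, se

# Hypothetical per-cohort estimates of a diet x SNP interaction coefficient.
cohorts = [(0.10, 0.05), (0.06, 0.04), (0.08, 0.10)]
pooled_beta, pooled_se = inverse_variance_meta(cohorts)
z_score = pooled_beta / pooled_se  # test statistic for the pooled effect
```

    Cohorts with smaller standard errors dominate the pooled estimate, and the pooled standard error is always smaller than that of any single cohort.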

  7. GENOME-WIDE GENE-SODIUM INTERACTION ANALYSES ON BLOOD PRESSURE: THE GENSALT STUDY

    PubMed Central

    Li, Changwei; He, Jiang; Chen, Jing; Zhao, Jinying; Gu, Dongfeng; Hixson, James E.; Rao, Dabeeru C.; Jaquish, Cashell E.; Gu, Charles C.; Chen, Jichun; Huang, Jianfeng; Chen, Shufeng; Kelly, Tanika N.

    2016-01-01

    We performed genome-wide analyses to identify genomic loci that interact with sodium to influence blood pressure (BP) using single marker (one and two degree-of-freedom joint tests) and gene-based tests among 1,876 Chinese participants of the Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study. Among GenSalt participants, the average of three urine samples was used to estimate sodium excretion. Nine BP measurements were taken using a random-zero-sphygmomanometer. A total of 2.05 million SNPs were imputed using Affymetrix 6.0 genotype data and the Chinese Han of Beijing and Japanese of Tokyo HapMap reference panel. Promising findings (P<1.00×10−4) from GenSalt were evaluated for replication among 775 Chinese participants of the Multi-ethnic Study of Atherosclerosis (MESA). SNP and gene-based results were meta-analyzed across the GenSalt and MESA studies to determine genome-wide significance. The one degree-of-freedom tests identified interactions for UST rs13211840 on diastolic BP (P=3.13×10−9). The two degree-of-freedom tests additionally identified associations for CLGN rs2567241 (P=3.90×10−12) and LOC105369882 rs11104632 (P=4.51×10−8) with systolic BP. The CLGN variant rs2567241 was also associated with diastolic BP (P=3.11×10−22) and mean arterial pressure (P=2.86×10−15). Genome-wide gene-based analysis identified MKNK1 (P=6.70×10−7), C2orf80 (P<1.00×10−12), EPHA6 (P=2.88×10−7), SCOC-AS1 (P=4.35×10−14), SCOC (P=6.46×10−11), CLGN (P=3.68×10−13), MGAT4D (P=4.73×10−11), ARHGAP42 (P<1.00×10−12), CASP4 (P=1.31×10−8), and LINC01478 (P=6.75×10−10) that were associated with at least one BP phenotype. In summary, we identified 8 novel and 1 previously reported BP loci through the examination of SNP and gene-based interactions with sodium. PMID:27271309

  8. Gene essentiality and the topology of protein interaction networks

    PubMed Central

    Coulomb, Stéphane; Bauer, Michel; Bernard, Denis; Marsolier-Kergoat, Marie-Claude

    2005-01-01

    The mechanistic bases for gene essentiality and for cell mutational resistance have long been disputed. The recent availability of large protein interaction databases has fuelled the analysis of protein interaction networks and several authors have proposed that gene dispensability could be strongly related to some topological parameters of these networks. However, many results were based on protein interaction data whose biases were not taken into account. In this article, we show that the essentiality of a gene in yeast is poorly related to the number of interactants (or degree) of the corresponding protein and that the physiological consequences of gene deletions are unrelated to several other properties of proteins in the interaction networks, such as the average degrees of their nearest neighbours, their clustering coefficients or their relative distances. We also found that yeast protein interaction networks lack degree correlation, i.e. a propensity for their vertices to associate according to their degrees. Gene essentiality and more generally cell resistance against mutations thus seem largely unrelated to many parameters of protein network topology. PMID:16087428
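
    The topological parameters mentioned in this abstract (degree, clustering coefficient, average degree of nearest neighbours) can be computed from an edge list as in the following minimal sketch. The function name and the toy edge list in the usage note are illustrative assumptions, not data or code from the study.

```python
def network_topology(edges):
    """Per-node degree, clustering coefficient and mean neighbour degree.

    edges: iterable of (protein_a, protein_b) pairs for an undirected network.
    Self-interactions are ignored; node labels must be mutually comparable.
    """
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    stats = {}
    for node, nbrs in adj.items():
        k = len(nbrs)
        # Count edges among the neighbours of this node (each pair once).
        links = sum(1 for a in nbrs for b in nbrs if a < b and b in adj[a])
        clustering = 2.0 * links / (k * (k - 1)) if k > 1 else 0.0
        nn_degree = sum(len(adj[n]) for n in nbrs) / k
        stats[node] = {"degree": k,
                       "clustering": clustering,
                       "nn_degree": nn_degree}
    return stats
```

    For example, `network_topology([("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")])` describes a triangle with one pendant node. Relating such values to essentiality labels would require a separate statistical comparison, which is not shown.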

  9. Comparative transcriptome analysis of stylar canal cells identifies novel candidate genes implicated in the self-incompatibility response of Citrus clementina

    PubMed Central

    2012-01-01

    Background Reproductive biology in citrus is still poorly understood. Although in recent years several efforts have been made to study pollen-pistil interaction and self-incompatibility, little information is available about the molecular mechanisms regulating these processes. Here we report the identification of candidate genes involved in pollen-pistil interaction and self-incompatibility in clementine (Citrus clementina Hort. ex Tan.). These genes have been identified by comparing the transcriptomes of laser-microdissected stylar canal cells (SCC) isolated from two genotypes differing in self-incompatibility response ('Comune', a self-incompatible cultivar, and 'Monreal', a self-compatible mutation of 'Comune'). Results The transcriptome profiling of SCC indicated that the differential regulation of a few specific, mostly uncharacterized transcripts is associated with the breakdown of self-incompatibility in 'Monreal'. Among them, a novel F-box gene showed a drastic up-regulation both in laser-microdissected stylar canal cells and in self-pollinated whole styles with stigmas of 'Comune' in concomitance with the arrest of pollen tube growth. Moreover, we identify a non-characterized gene family as closely associated with the self-incompatibility genetic program activated in 'Comune'. Three different aspartic-acid rich (Asp-rich) protein genes, located in tandem in the clementine genome, were over-represented in the transcriptome of 'Comune'. These genes are tightly linked to a DELLA gene, previously found to be up-regulated in the self-incompatible genotype during pollen-pistil interaction. Conclusion The highly specific transcriptome survey of the stylar canal cells identified novel genes which have not been previously associated with self-pollen rejection in citrus and in other plant species. Bioinformatic and transcriptional analyses suggested that the mutation leading to self-compatibility in 'Monreal' affected the expression of non-homologous genes located in a

  10. Multi-omics approach identifies molecular mechanisms of plant-fungus mycorrhizal interaction

    DOE PAGES

    Larsen, Peter E.; Sreedasyam, Avinash; Trivedi, Geetika; ...

    2016-01-19

    In mycorrhizal symbiosis, plant roots form close, mutually beneficial interactions with soil fungi. Before this mycorrhizal interaction can be established, however, plant roots must be capable of detecting potential beneficial fungal partners and initiating the gene expression patterns necessary to begin symbiosis. To predict plant root – mycorrhizal fungi sensor systems, we analyzed in vitro experiments of Populus tremuloides (aspen tree) and Laccaria bicolor (mycorrhizal fungi) interaction and leveraged over 200 previously published transcriptomic experimental data sets, 159 experimentally validated plant transcription factor binding motifs, and more than 120-thousand experimentally validated protein-protein interactions to generate models of pre-mycorrhizal sensor systems in aspen root. These sensor mechanisms link extracellular signaling molecules with gene regulation through a network comprised of membrane receptors, signal cascade proteins, transcription factors, and transcription factor binding DNA motifs. Modeling predicted four pre-mycorrhizal sensor complexes in aspen that interact with fifteen transcription factors to regulate the expression of 1184 genes in response to extracellular signals synthesized by Laccaria. Predicted extracellular signaling molecules include common signaling molecules such as phenylpropanoids, salicylate, and jasmonic acid. Lastly, this multi-omic computational modeling approach for predicting the complex sensory networks yielded specific, testable biological hypotheses for mycorrhizal interaction signaling compounds, sensor complexes, and mechanisms of gene regulation.

  11. Multi-omics approach identifies molecular mechanisms of plant-fungus mycorrhizal interaction

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Larsen, Peter E.; Sreedasyam, Avinash; Trivedi, Geetika

    In mycorrhizal symbiosis, plant roots form close, mutually beneficial interactions with soil fungi. Before this mycorrhizal interaction can be established, however, plant roots must be capable of detecting potential beneficial fungal partners and initiating the gene expression patterns necessary to begin symbiosis. To predict plant root – mycorrhizal fungi sensor systems, we analyzed in vitro experiments of Populus tremuloides (aspen tree) and Laccaria bicolor (mycorrhizal fungi) interaction and leveraged over 200 previously published transcriptomic experimental data sets, 159 experimentally validated plant transcription factor binding motifs, and more than 120-thousand experimentally validated protein-protein interactions to generate models of pre-mycorrhizal sensor systems in aspen root. These sensor mechanisms link extracellular signaling molecules with gene regulation through a network comprised of membrane receptors, signal cascade proteins, transcription factors, and transcription factor binding DNA motifs. Modeling predicted four pre-mycorrhizal sensor complexes in aspen that interact with fifteen transcription factors to regulate the expression of 1184 genes in response to extracellular signals synthesized by Laccaria. Predicted extracellular signaling molecules include common signaling molecules such as phenylpropanoids, salicylate, and jasmonic acid. Lastly, this multi-omic computational modeling approach for predicting the complex sensory networks yielded specific, testable biological hypotheses for mycorrhizal interaction signaling compounds, sensor complexes, and mechanisms of gene regulation.

  12. Enhancer connectome in primary human cells identifies target genes of disease-associated DNA elements

    PubMed Central

    Mumbach, Maxwell R; Satpathy, Ansuman T; Boyle, Evan A; Dai, Chao; Gowen, Benjamin G; Cho, Seung Woo; Nguyen, Michelle L; Rubin, Adam J; Granja, Jeffrey M; Kazane, Katelynn R; Wei, Yuning; Nguyen, Trieu; Greenside, Peyton G; Corces, M Ryan; Tycko, Josh; Simeonov, Dimitre R; Suliman, Nabeela; Li, Rui; Xu, Jin; Flynn, Ryan A; Kundaje, Anshul; Khavari, Paul A; Marson, Alexander; Corn, Jacob E; Quertermous, Thomas; Greenleaf, William J; Chang, Howard Y

    2018-01-01

    The challenge of linking intergenic mutations to target genes has limited molecular understanding of human diseases. Here we show that H3K27ac HiChIP generates high-resolution contact maps of active enhancers and target genes in rare primary human T cell subtypes and coronary artery smooth muscle cells. Differentiation of naive T cells into T helper 17 cells or regulatory T cells creates subtype-specific enhancer–promoter interactions, specifically at regions of shared DNA accessibility. These data provide a principled means of assigning molecular functions to autoimmune and cardiovascular disease risk variants, linking hundreds of noncoding variants to putative gene targets. Target genes identified with HiChIP are further supported by CRISPR interference and activation at linked enhancers, by the presence of expression quantitative trait loci, and by allele-specific enhancer loops in patient-derived primary cells. The majority of disease-associated enhancers contact genes beyond the nearest gene in the linear genome, leading to a fourfold increase in the number of potential target genes for autoimmune and cardiovascular diseases. PMID:28945252

  13. Physical Interactions and Expression Quantitative Traits Loci Identify Regulatory Connections for Obesity and Type 2 Diabetes Associated SNPs

    PubMed Central

    Fadason, Tayaza; Ekblad, Cameron; Ingram, John R.; Schierding, William S.; O'Sullivan, Justin M.

    2017-01-01

    The mechanisms that underlie the association between obesity and type 2 diabetes are not fully understood. Here, we investigated the role of the 3D genome organization in the pathogeneses of obesity and type-2 diabetes. We interpreted the combined and differential impacts of 196 diabetes and 390 obesity associated single nucleotide polymorphisms (SNPs) by integrating data on the genes with which they physically interact (as captured by Hi-C) and the functional [i.e., expression quantitative trait loci (eQTL)] outcomes associated with these interactions. We identified 861 spatially regulated genes (e.g., AP3S2, ELP5, SVIP, IRS1, FADS2, WFS1, RBM6, HORMAD1, PYROXD2), which are enriched in tissues (e.g., adipose, skeletal muscle, pancreas) and biological processes and canonical pathways (e.g., lipid metabolism, leptin, and glucose-insulin signaling pathways) that are important for the pathogenesis of type 2 diabetes and obesity. Our discovery-based approach also identifies enrichment for eQTL SNP-gene interactions in tissues that are not classically associated with diabetes or obesity. We propose that the combinatorial action of active obesity and diabetes spatial eQTL SNPs on their gene pairs within different tissues reduces the ability of these tissues to contribute to the maintenance of a healthy energy metabolism. PMID:29081791

  14. Gene-environment interaction study for BMI reveals interactions between genetic factors and physical activity, alcohol consumption and socioeconomic status

    PubMed Central

    Karlsson, Torgny; Ek, Weronica E.

    2017-01-01

    Previous genome-wide association studies (GWAS) have identified hundreds of genetic loci to be associated with body mass index (BMI) and risk of obesity. Genetic effects can differ between individuals depending on lifestyle or environmental factors due to gene-environment interactions. In this study, we examine gene-environment interactions in 362,496 unrelated participants with Caucasian ancestry from the UK Biobank resource. A total of 94 BMI-associated SNPs, selected from a previous GWAS on BMI, were used to construct weighted genetic scores for BMI (GSBMI). Linear regression modeling was used to estimate the effect of gene-environment interactions on BMI for 131 lifestyle factors related to: dietary habits, smoking and alcohol consumption, physical activity, socioeconomic status, mental health, sleeping patterns, as well as female-specific factors such as menopause and childbirth. In total, 15 lifestyle factors were observed to interact with GSBMI, of which alcohol intake frequency, usual walking pace, and Townsend deprivation index, a measure of socioeconomic status, were all highly significant (p = 1.45*10−29, p = 3.83*10−26, p = 4.66*10−11, respectively). Interestingly, the frequency of alcohol consumption, rather than the total weekly amount resulted in a significant interaction. The FTO locus was the strongest single locus interacting with any of the lifestyle factors. However, 13 significant interactions were also observed after omitting the FTO locus from the genetic score. Our analyses indicate that many lifestyle factors modify the genetic effects on BMI with some groups of individuals having more than double the effect of the genetic score. However, the underlying causal mechanisms of gene-environmental interactions are difficult to deduce from cross-sectional data alone and controlled experiments are required to fully characterise the causal factors. PMID:28873402
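
    The weighted genetic score described in this abstract, and the idea that its effect on BMI differs between lifestyle groups, can be illustrated with a minimal sketch. The abstract reports linear regression with interaction terms; the version below instead compares the simple regression slope of BMI on the score within two exposure strata, which is a simplified stand-in. All function names are hypothetical and no study data are used.

```python
def weighted_genetic_score(dosages, weights):
    """Sum of risk-allele dosages (0..2), each weighted by its per-allele effect."""
    return sum(w * d for w, d in zip(weights, dosages))

def slope(x, y):
    """Least-squares slope of y on x (simple linear regression)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    return sxy / sxx

def stratified_score_effects(scores, bmi, exposed):
    """Slope of BMI on the genetic score within unexposed/exposed groups.

    exposed: booleans marking a binary lifestyle factor (an assumption;
    continuous factors would need a regression with an interaction term).
    """
    effects = {}
    for level in (False, True):
        xs = [s for s, f in zip(scores, exposed) if bool(f) == level]
        ys = [b for b, f in zip(bmi, exposed) if bool(f) == level]
        effects[level] = slope(xs, ys)
    return effects
```

    A ratio `effects[True] / effects[False]` well above 1 would correspond to the kind of effect modification the abstract describes, although a formal test still requires the interaction model.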

  15. Cry-Bt identifier: a biological database for PCR detection of Cry genes present in transgenic plants.

    PubMed

    Singh, Vinay Kumar; Ambwani, Sonu; Marla, Soma; Kumar, Anil

    2009-10-23

    We describe the development of a user-friendly tool that assists in the retrieval of information relating to Cry genes in transgenic crops. The tool also helps in the detection of transformed Cry genes from Bacillus thuringiensis present in transgenic plants by providing suitably designed primers for PCR identification of these genes. The tool, designed on a relational database model, enables easy retrieval of information from the database with simple user queries. The tool also enables users to access related information about Cry genes present in various databases by interacting with different sources (nucleotide sequences, protein sequence, sequence comparison tools, published literature, conserved domains, evolutionary and structural data). http://insilicogenomics.in/Cry-btIdentifier/welcome.html.

  16. Discovery of new candidate genes related to brain development using protein interaction information.

    PubMed

    Chen, Lei; Chu, Chen; Kong, Xiangyin; Huang, Tao; Cai, Yu-Dong

    2015-01-01

    Human brain development is a dramatic process composed of a series of complex and fine-tuned spatiotemporal gene expressions. A good comprehension of this process can assist us in developing the potential of our brain. However, we have only limited knowledge about the genes and gene functions that are involved in this biological process. Therefore, a substantial demand remains to discover new brain development-related genes and identify their biological functions. In this study, we aimed to discover new brain-development related genes by building a computational method. We referred to a series of computational methods used to discover new disease-related genes and developed a similar method. In this method, the shortest path algorithm was executed on a weighted graph that was constructed using protein-protein interactions. New candidate genes fell on at least one of the shortest paths connecting two known genes that are related to brain development. A randomization test was then adopted to filter positive discoveries. Of the final identified genes, several have been reported to be associated with brain development, indicating the effectiveness of the method, whereas several of the others may have potential roles in brain development.
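
    The shortest-path step in this abstract can be sketched as follows: run Dijkstra's algorithm between every pair of known genes on a weighted protein-protein interaction graph and collect the intermediate nodes. This is a minimal illustration under assumptions: the graph is a dict of dicts with non-negative edge costs, only one shortest path per pair is traced, the function names are hypothetical, and the randomization filter mentioned in the abstract is not implemented.

```python
import heapq
from itertools import combinations

def shortest_path(graph, source, target):
    """Dijkstra's algorithm; returns one minimum-cost path as a list of nodes.

    graph: {node: {neighbour: non-negative edge cost}} (undirected graphs
    must list each edge in both directions). Returns [] if unreachable.
    """
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    done = set()
    while heap:
        d, node = heapq.heappop(heap)
        if node in done:
            continue
        done.add(node)
        if node == target:
            break
        for nbr, cost in graph.get(node, {}).items():
            nd = d + cost
            if nd < dist.get(nbr, float("inf")):
                dist[nbr] = nd
                prev[nbr] = node
                heapq.heappush(heap, (nd, nbr))
    if target not in dist:
        return []
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return path[::-1]

def candidate_genes(graph, known):
    """Count how often each non-known gene lies inside a shortest path
    connecting two known genes."""
    counts = {}
    for a, b in combinations(sorted(known), 2):
        for node in shortest_path(graph, a, b)[1:-1]:
            if node not in known:
                counts[node] = counts.get(node, 0) + 1
    return counts
```

    Genes with a count of at least one are candidates in the sense of the abstract. A permutation step would then compare each count with counts obtained from randomly chosen gene sets of the same size.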

  17. Gene-Environment Interactions in Cardiovascular Disease

    PubMed Central

    Flowers, Elena; Froelicher, Erika Sivarajan; Aouizerat, Bradley E.

    2011-01-01

    Background Historically, models to describe disease were exclusively nature-based or nurture-based. Current theoretical models for complex conditions such as cardiovascular disease acknowledge the importance of both biologic and non-biologic contributors to disease. A critical feature is the occurrence of interactions between numerous risk factors for disease. The interaction between genetic (i.e. biologic, nature) and environmental (i.e. non-biologic, nurture) causes of disease is an important mechanism for understanding both the etiology and public health impact of cardiovascular disease. Objectives The purpose of this paper is to describe theoretical underpinnings of gene-environment interactions, models of interaction, methods for studying gene-environment interactions, and the related concept of interactions between epigenetic mechanisms and the environment. Discussion Advances in methods for measurement of genetic predictors of disease have enabled an increasingly comprehensive understanding of the causes of disease. In order to fully describe the effects of genetic predictors of disease, it is necessary to place genetic predictors within the context of known environmental risk factors. The additive or multiplicative effect of the interaction between genetic and environmental risk factors is often greater than the contribution of either risk factor alone. PMID:21684212

  18. Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression

    PubMed Central

    Poole, William; Leinonen, Kalle; Shmulevich, Ilya

    2017-01-01

    Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390

  19. Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression.

    PubMed

    Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady

    2017-02-01

    Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.

  20. A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

    PubMed

    Morton, Nicholas M; Nelson, Yvonne B; Michailidou, Zoi; Di Rollo, Emma M; Ramage, Lynne; Hadoke, Patrick W F; Seckl, Jonathan R; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J; Dunbar, Donald R

    2011-01-01

    Obesity and metabolic syndrome result from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome-wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non-adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1 and Npr3) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding, which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria, were characterised, revealing novel functional effects consistent with a contribution to obesity. A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.

  1. Review of the Gene-Environment Interaction Literature in Cancer: What Do We Know?

    PubMed

    Simonds, Naoko I; Ghazarian, Armen A; Pimentel, Camilla B; Schully, Sheri D; Ellison, Gary L; Gillanders, Elizabeth M; Mechanic, Leah E

    2016-07-01

    Risk of cancer is determined by a complex interplay of genetic and environmental factors. Although the study of gene-environment interactions (G×E) has been an active area of research, little is reported about the known findings in the literature. To examine the state of the science in G×E research in cancer, we performed a systematic review of published literature using gene-environment or pharmacogenomic flags from two curated databases of genetic association studies, the Human Genome Epidemiology (HuGE) literature finder and Cancer Genome-Wide Association and Meta Analyses Database (CancerGAMAdb), from January 1, 2001, to January 31, 2011. A supplemental search using HuGE was conducted for articles published from February 1, 2011, to April 11, 2013. A 25% sample of the supplemental publications was reviewed. A total of 3,019 articles were identified in the original search. From these articles, 243 articles were determined to be relevant based on inclusion criteria (more than 3,500 interactions). From the supplemental search (1,400 articles identified), 29 additional relevant articles (1,370 interactions) were included. The majority of publications in both searches examined G×E in colon, rectal, or colorectal; breast; or lung cancer. Specific interactions examined most frequently included environmental factors categorized as energy balance (e.g., body mass index, diet), exogenous (e.g., oral contraceptives) and endogenous hormones (e.g., menopausal status), chemical environment (e.g., grilled meats), and lifestyle (e.g., smoking, alcohol intake). In both searches, the majority of interactions examined were using loci from candidate genes studies and none of the studies were genome-wide interaction studies (GEWIS). The most commonly reported measure was the interaction P-value, of which a sizable number of P-values were considered statistically significant (i.e., <0.05). In addition, the magnitude of interactions reported was modest. Observations of published

  2. Review of the Gene-Environment Interaction Literature in Cancer: What do we know?

    PubMed Central

    Simonds, Naoko I.; Ghazarian, Armen A.; Pimentel, Camilla B.; Schully, Sheri D.; Ellison, Gary L.; Gillanders, Elizabeth M.; Mechanic, Leah E.

    2016-01-01

    Background Risk of cancer is determined by a complex interplay of genetic and environmental factors. Although the study of gene-environment (GxE) interactions has been an active area of research, little is reported about the known findings in the literature. Methods To examine the state of the science in GxE research in cancer, we performed a systematic review of published literature using gene-environment or pharmacogenomic flags from two curated databases of genetic association studies, the Human Genome Epidemiology (HuGE) literature finder and Cancer Genome-Wide Association and Meta Analyses Database (CancerGAMAdb), from January 1, 2001, to January 31, 2011. A supplemental search using HuGE was conducted for articles published February 1, 2011, to April 11, 2013. A 25% sample of the supplemental publications was reviewed. Results A total of 3,019 articles were identified in the original search. From these articles, 243 articles were determined to be relevant based on inclusion criteria (more than 3,500 interactions). From the supplemental search (1,400 articles identified), 29 additional relevant articles (1,370 interactions) were included. The majority of publications in both searches examined GxE in colon, rectal, or colorectal cancer types; breast; or lung cancer. Specific interactions examined most frequently included environmental factors categorized as energy balance (e.g., body mass index (BMI), diet), exogenous (e.g., oral contraceptives) and endogenous hormones (e.g., menopausal status), chemical environment (e.g., grilled meats), and lifestyle (e.g., smoking, alcohol intake). In both searches, the majority of interactions examined were using loci from candidate genes studies and none of the studies were genome-wide interaction studies (GEWIS). The most commonly reported measure was the interaction p-value, of which a sizable number of p-values were considered statistically significant (i.e., < 0.05). In addition, the magnitudes of interactions reported

  3. Inverse gene-for-gene interactions contribute additively to tan spot susceptibility in wheat.

    PubMed

    Liu, Zhaohui; Zurn, Jason D; Kariyawasam, Gayan; Faris, Justin D; Shi, Gongjun; Hansen, Jana; Rasmussen, Jack B; Acevedo, Maricelis

    2017-06-01

    Tan spot susceptibility is conferred by multiple interactions of necrotrophic effector and host sensitivity genes. Tan spot of wheat, caused by Pyrenophora tritici-repentis, is an important disease in almost all wheat-growing areas of the world. The disease system is known to involve at least three fungal-produced necrotrophic effectors (NEs) that interact with the corresponding host sensitivity (S) genes in an inverse gene-for-gene manner to induce disease. However, it is unknown if the effects of these NE-S gene interactions contribute additively to the development of tan spot. In this work, we conducted disease evaluations using different races and quantitative trait loci (QTL) analysis in a wheat recombinant inbred line (RIL) population derived from a cross between two susceptible genotypes, LMPG-6 and PI 626573. The two parental lines each harbored a single known NE sensitivity gene, with LMPG-6 having the Ptr ToxC sensitivity gene Tsc1 and PI 626573 having the Ptr ToxA sensitivity gene Tsn1. Transgressive segregation was observed in the population for all races. QTL mapping revealed that both loci (Tsn1 and Tsc1) were significantly associated with susceptibility to race 1 isolates, which produce both Ptr ToxA and Ptr ToxC, and the two genes contributed additively to tan spot susceptibility. For isolates of races 2 and 3, which produce only Ptr ToxA and only Ptr ToxC, respectively, only Tsn1 and Tsc1, respectively, were associated with tan spot susceptibility. This work clearly demonstrates that tan spot susceptibility in this population is due primarily to two NE-S interactions. Breeders should remove both sensitivity genes from wheat lines to obtain high levels of tan spot resistance.

  4. Identifying Dynamic Protein Complexes Based on Gene Expression Profiles and PPI Networks

    PubMed Central

    Li, Min; Chen, Weijie; Wang, Jianxin; Pan, Yi

    2014-01-01

    Identification of protein complexes from protein-protein interaction networks has become a key problem for understanding cellular life in the postgenomic era. Many computational methods have been proposed for identifying protein complexes. Up to now, the existing computational methods have mostly been applied to static PPI networks. However, proteins and their interactions are dynamic in reality. Identifying dynamic protein complexes is more meaningful and challenging. In this paper, a novel algorithm, named DPC, is proposed to identify dynamic protein complexes by integrating PPI data and gene expression profiles. According to the Core-Attachment assumption, proteins that are always active in the molecular cycle are regarded as core proteins. The protein-complex cores are identified from these always-active proteins by detecting dense subgraphs. Final protein complexes are extended from the protein-complex cores by adding attachments based on a topological character of “closeness” and dynamic meaning. The protein complexes produced by our algorithm DPC contain two parts: a static core expressed throughout the molecular cycle and short-lived dynamic attachments. The proposed algorithm DPC was applied to the data of Saccharomyces cerevisiae, and the experimental results show that DPC outperforms CMC, MCL, SPICi, HC-PIN, COACH, and Core-Attachment based on the validation of matching with known complexes and hF-measures. PMID:24963481

  5. Identifying a gene expression signature of cluster headache in blood

    PubMed Central

    Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

    2017-01-01

    Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859

  6. Genome wide approaches to identify protein-DNA interactions.

    PubMed

    Ma, Tao; Ye, Zhenqing; Wang, Liguo

    2018-05-29

    Transcription factors are DNA-binding proteins that play key roles in many fundamental biological processes. Unraveling their interactions with DNA is essential to identify their target genes and understand the regulatory network. Genome-wide identification of their binding sites became feasible thanks to recent progress in experimental and computational approaches. ChIP-chip, ChIP-seq, and ChIP-exo are three widely used techniques to demarcate genome-wide transcription factor binding sites. This review aims to provide an overview of these three techniques, including their experimental procedures, computational approaches, and popular analytic tools. ChIP-chip, ChIP-seq, and ChIP-exo have been the major techniques to study genome-wide in vivo protein-DNA interaction. Due to the rapid development of next-generation sequencing technology, array-based ChIP-chip is deprecated and ChIP-seq has become the most widely used technique to identify transcription factor binding sites genome-wide. The newly developed ChIP-exo further improves the spatial resolution to a single nucleotide. Numerous tools have been developed to analyze ChIP-chip, ChIP-seq and ChIP-exo data. However, different programs may employ different mechanisms or underlying algorithms, so each will inherently carry its own set of statistical assumptions and biases. Choosing the most appropriate analytic program for a given experiment therefore needs careful consideration. Moreover, most programs only have a command-line interface, so their installation and usage require basic computational expertise in Unix/Linux. Copyright © Bentham Science Publishers.

  7. Gene-for-genes interactions between cotton R genes and Xanthomonas campestris pv. malvacearum avr genes.

    PubMed

    De Feyter, R; Yang, Y; Gabriel, D W

    1993-01-01

    Six plasmid-borne avirulence (avr) genes were previously cloned from strain XcmH of the cotton pathogen, Xanthomonas campestris pv. malvacearum. We have now localized all six avr genes on the cloned fragments by subcloning and Tn5-gusA insertional mutagenesis. None of these avr genes appeared to exhibit exclusively gene-for-gene patterns of interactions with cotton R genes, and avrB4 was demonstrated to confer avr gene-for-R genes (plural) avirulence to X. c. pv. malvacearum on congenic cotton lines carrying either of two different resistance loci, B1 or B4. Furthermore, the B1 locus appeared to confer R gene-for-avr genes resistance to cotton against isogenic X. c. pv. malvacearum strains carrying any one of three avr genes: avrB4, avrb6, or avrB102. Restriction enzyme, Southern blot hybridization, and DNA sequence analyses showed that the XcmH avr genes are all highly similar to each other, to avrBs3 and avrBsP from the pepper pathogen X. c. pv. vesicatoria, and to the host-specific virulence gene pthA from the citrus pathogen X. citri. The XcmH avr genes differed primarily in the multiplicity of a tandemly repeated 102-base pair motif within the central portions of the genes, repeated from 14 to 23 times in members of this gene family. The complete nucleotide sequence of avrb6 revealed that it is 97% identical in DNA sequence to avrB4, avrBs3, avrBsP, and pthA and that 62-bp inverted terminal repeats mark the boundaries of homology between avrb6 and all members of this Xanthomonas virulence/avirulence gene family sequenced to date. The terminal 38 bp of both inverted repeats are highly similar to the 38-bp consensus terminal sequence of the Tn3 family of transposons. Up to 11 members of the avr gene family appear to be present in North American strains of X. c. pv. malvacearum, including XcmH. The high level of homology observed among these avr genes and their presence in multiple copies may explain the gene-for-genes interactions and also the observed high

  8. Integrating genome-wide association study summaries and element-gene interaction datasets identified multiple associations between elements and complex diseases.

    PubMed

    He, Awen; Wang, Wenyu; Prakash, N Tejo; Tinkov, Alexey A; Skalny, Anatoly V; Wen, Yan; Hao, Jingcan; Guo, Xiong; Zhang, Feng

    2018-03-01

    Chemical elements are closely related to human health. Extensive genomic profile data of complex diseases offer us a good opportunity to systemically investigate the relationships between elements and complex diseases/traits. In this study, we applied a gene set enrichment analysis (GSEA) approach to detect the associations between elements and complex diseases/traits through integrating element-gene interaction datasets and genome-wide association study (GWAS) data of complex diseases/traits. To illustrate the performance of GSEA, the element-gene interaction datasets of 24 elements were extracted from the comparative toxicogenomics database (CTD). GWAS summary datasets of 24 complex diseases or traits were downloaded from the dbGaP or GEFOS websites. We observed significant associations between 7 elements and 13 complex diseases or traits (all false discovery rate (FDR) < 0.05), including reported relationships such as aluminum vs. Alzheimer's disease (FDR = 0.042), calcium vs. bone mineral density (FDR = 0.031), magnesium vs. systemic lupus erythematosus (FDR = 0.012) as well as novel associations, such as nickel vs. hypertriglyceridemia (FDR = 0.002) and bipolar disorder (FDR = 0.027). Our study results are consistent with previous biological studies, supporting the good performance of GSEA. Our analysis results based on the GSEA framework provide novel clues for discovering causal relationships between elements and complex diseases. © 2017 WILEY PERIODICALS, INC.

  9. Identifying interactions between chemical entities in biomedical text.

    PubMed

    Lamurias, Andre; Ferreira, João D; Couto, Francisco M

    2014-10-23

    Interactions between chemical compounds described in biomedical text can be of great importance to drug discovery and design, as well as pharmacovigilance. We developed a novel system, "Identifying Interactions between Chemical Entities" (IICE), to identify chemical interactions described in text. Kernel-based Support Vector Machines first identify the interactions and then an ensemble classifier validates and classifies the type of each interaction. This relation extraction module was evaluated with the corpus released for the DDI Extraction task of SemEval 2013, obtaining results comparable to state-of-the-art methods for this type of task. We integrated this module with our chemical named entity recognition module and made the whole system available as a web tool at www.lasige.di.fc.ul.pt/webtools/iice.

  10. Identifying interactions between chemical entities in biomedical text.

    PubMed

    Lamurias, Andre; Ferreira, João D; Couto, Francisco M

    2014-12-01

    Interactions between chemical compounds described in biomedical text can be of great importance to drug discovery and design, as well as pharmacovigilance. We developed a novel system, "Identifying Interactions between Chemical Entities" (IICE), to identify chemical interactions described in text. Kernel-based Support Vector Machines first identify the interactions and then an ensemble classifier validates and classifies the type of each interaction. This relation extraction module was evaluated with the corpus released for the DDI Extraction task of SemEval 2013, obtaining results comparable to state-of-the-art methods for this type of task. We integrated this module with our chemical named entity recognition module and made the whole system available as a web tool at www.lasige.di.fc.ul.pt/webtools/iice.

  11. Gene-based rare allele analysis identified a risk gene of Alzheimer's disease.

    PubMed

    Kim, Jong Hun; Song, Pamela; Lim, Hyunsun; Lee, Jae-Hyung; Lee, Jun Hong; Park, Sun Ah

    2014-01-01

    Alzheimer's disease (AD) has a strong propensity to run in families. However, the known risk genes excluding APOE are not clinically useful. In various complex diseases, gene studies have targeted rare alleles for unsolved heritability. Our study aims to elucidate previously unknown risk genes for AD by targeting rare alleles. We used data from five publicly available genetic studies from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and the database of Genotypes and Phenotypes (dbGaP). A total of 4,171 cases and 9,358 controls were included. The genotype information of rare alleles was imputed using 1,000 genomes. We performed gene-based analysis of rare alleles (minor allele frequency ≤ 3%). The genome-wide significance level was defined as meta P < 1.8×10⁻⁶ (0.05/number of genes in human genome = 0.05/28,517). ZNF628, which is located at chromosome 19q13.42, showed a genome-wide significant association with AD. The association of ZNF628 with AD was not dependent on APOE ε4. APOE and TREM2 were also significantly associated with AD, although not at genome-wide significance levels. Other genes identified by targeting common alleles could not be replicated in our gene-based rare allele analysis. We identified that rare variants in ZNF628 are associated with AD. The protein encoded by ZNF628 is known as a transcription factor. Furthermore, the associations of APOE and TREM2 with AD were highly significant, even in gene-based rare allele analysis, which implies that further deep sequencing of these genes is required in AD heritability studies.
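
    The genome-wide significance level quoted in this abstract is a Bonferroni-style division of the family-wise error rate by the number of genes tested. A minimal sketch of that arithmetic follows; the function name is hypothetical, and only the two numbers (0.05 and 28,517) come from the abstract.

```python
# Bonferroni-style threshold: family-wise alpha divided by the number of tests.
# ALPHA and N_GENES are the values quoted in the abstract; the function name
# is an illustrative assumption, not taken from the study.
ALPHA = 0.05
N_GENES = 28_517


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Return the per-test significance threshold for n_tests comparisons."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


threshold = bonferroni_threshold(ALPHA, N_GENES)
print(f"per-gene threshold: {threshold:.2e}")
```

    The unrounded quotient is slightly below the rounded figure reported in the abstract, so a gene whose meta P value lies between the two would be classified differently depending on which is used.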

  12. Transcriptome profiling identified differentially expressed genes and pathways associated with tamoxifen resistance in human breast cancer

    PubMed Central

    Men, Xin; Ma, Jun; Wu, Tong; Pu, Junyi; Wen, Shaojia; Shen, Jianfeng; Wang, Xun; Wang, Yamin; Chen, Chao; Dai, Penggao

    2018-01-01

    Tamoxifen (TAM) resistance is an important clinical problem in the treatment of breast cancer. In order to identify the mechanism of TAM resistance for estrogen receptor (ER)-positive breast cancer, we screened the transcriptome using RNA-seq and compared the gene expression profiles between the MCF-7 mamma carcinoma cell line and the TAM-resistant cell line TAMR/MCF-7. Fifty-two significantly differentially expressed genes (DEGs) were identified, including SLIT2, ROBO, LHX, KLF, VEGFC, BAMBI, LAMA1, FLT4, PNMT, DHRS2, MAOA and ALDH. The DEGs were annotated in the GO, COG and KEGG databases. Annotation of the function of the DEGs in the KEGG database revealed the top three pathways enriched with the most DEGs: pathways in cancer, the PI3K-AKT pathway, and focal adhesion. Then we compared the gene expression profiles between clinical progressive disease (PD) and complete response (CR) samples from The Cancer Genome Atlas (TCGA). Ten common DEGs were identified by combining the clinical and cellular analysis results. A protein-protein interaction network was applied to analyze the association of the ER signal pathway with the 10 DEGs. Three significant genes (GFRA3, NPY1R and PTPRN2) were closely related to ER-related pathways. These significant DEGs regulated many biological activities such as cell proliferation and survival, motility and migration, and tumor cell invasion. The interactions between these DEGs and the drug resistance phenomenon need to be further elucidated at a functional level in further studies. Based on our findings, we believe that these DEGs could be therapeutic targets, which can be explored to develop new treatment options. PMID:29423105

  13. Gene expression analysis uncovers novel Hedgehog interacting protein (HHIP) effects in human bronchial epithelial cells

    PubMed Central

    Zhou, Xiaobo; Qiu, Weiliang; Sathirapongsasuti, J. Fah.; Cho, Michael H.; Mancini, John D.; Lao, Taotao; Thibault, Derek M.; Litonjua, Gus; Bakke, Per S.; Gulsvik, Amund; Lomas, David A.; Beaty, Terri H.; Hersh, Craig P.; Anderson, Christopher; Geigenmuller, Ute; Raby, Benjamin A.; Rennard, Stephen I.; Perrella, Mark A.; Choi, Augustine M.K.; Quackenbush, John; Silverman, Edwin K.

    2013-01-01

    Hedgehog Interacting Protein (HHIP) was implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS). However, it remains unclear how HHIP contributes to COPD pathogenesis. To identify genes regulated by HHIP, we performed gene expression microarray analysis in a human bronchial epithelial cell line (Beas-2B) stably infected with HHIP shRNAs. HHIP silencing led to differential expression of 296 genes; enrichment for variants nominally associated with COPD was found. Eighteen of the differentially expressed genes were validated by real-time PCR in Beas-2B cells. Seven of 11 validated genes tested in human COPD and control lung tissues demonstrated significant gene expression differences. Functional annotation indicated enrichment for extracellular matrix and cell growth genes. Network modeling demonstrated that the extracellular matrix and cell proliferation genes influenced by HHIP tended to be interconnected. Thus, we identified potential HHIP targets in human bronchial epithelial cells that may contribute to COPD pathogenesis. PMID:23459001

  14. Gene-environment interaction and male reproductive function

    PubMed Central

    Axelsson, Jonatan; Bonde, Jens Peter; Giwercman, Yvonne L.; Rylander, Lars; Giwercman, Aleksander

    2010-01-01

    As genetic factors can hardly explain the changes taking place during short time spans, environmental and lifestyle-related factors have been suggested as the causes of time-related deterioration of male reproductive function. However, considering the strong heterogeneity of male fecundity between and within populations, genetic variants might be important determinants of the individual susceptibility to the adverse effects of environment or lifestyle. Although the possible mechanisms of such interplay in relation to the reproductive system are largely unknown, some recent studies have indicated that specific genotypes may confer a larger risk of male reproductive disorders following certain exposures. This paper presents a critical review of animal and human evidence on how genes may modify environmental effects on male reproductive function. Some examples have been found that support this mechanism, but the number of studies is still limited. This type of interaction study may improve our understanding of normal physiology and help us to identify the risk factors for male reproductive malfunction. We also briefly discuss other aspects of gene-environment interaction specifically associated with the issue of reproduction, namely environmental and lifestyle factors as the cause of sperm DNA damage. It remains to be investigated to what extent such genetic changes, by natural conception or through the use of assisted reproductive techniques, are transmitted to the next generation, thereby causing increased morbidity in the offspring. PMID:20348940

  15. The Genome-Wide Interaction Network of Nutrient Stress Genes in Escherichia coli.

    PubMed

    Côté, Jean-Philippe; French, Shawn; Gehrke, Sebastian S; MacNair, Craig R; Mangat, Chand S; Bharat, Amrita; Brown, Eric D

    2016-11-22

    Conventional efforts to describe essential genes in bacteria have typically emphasized nutrient-rich growth conditions. Of note, however, are the set of genes that become essential when bacteria are grown under nutrient stress. For example, more than 100 genes become indispensable when the model bacterium Escherichia coli is grown on nutrient-limited media, and many of these nutrient stress genes have also been shown to be important for the growth of various bacterial pathogens in vivo. To better understand the genetic network that underpins nutrient stress in E. coli, we performed a genome-scale cross of strains harboring deletions in some 82 nutrient stress genes with the entire E. coli gene deletion collection (Keio) to create 315,400 double deletion mutants. An analysis of the growth of the resulting strains on rich microbiological media revealed an average of 23 synthetic sick or lethal genetic interactions for each nutrient stress gene, suggesting that the network defining nutrient stress is surprisingly complex. A vast majority of these interactions involved genes of unknown function or genes of unrelated pathways. The most profound synthetic lethal interactions were between nutrient acquisition and biosynthesis. Further, the interaction map reveals remarkable metabolic robustness in E. coli through pathway redundancies. In all, the genetic interaction network provides a powerful tool to mine and identify missing links in nutrient synthesis and to further characterize genes of unknown function in E. coli. Moreover, understanding of bacterial growth under nutrient stress could aid in the development of novel antibiotic discovery platforms. With the rise of antibiotic drug resistance, there is an urgent need for new antibacterial drugs. Here, we studied a group of genes that are essential for the growth of Escherichia coli under nutrient limitation, culture conditions that arguably better represent nutrient availability during an infection than rich

  16. Multivariate generalized multifactor dimensionality reduction to detect gene-gene interactions

    PubMed Central

    2013-01-01

    Background: Recently, one of the greatest challenges in genome-wide association studies is to detect gene-gene and/or gene-environment interactions for common complex human diseases. Ritchie et al. (2001) proposed multifactor dimensionality reduction (MDR) method for interaction analysis. MDR is a combinatorial approach to reduce multi-locus genotypes into high-risk and low-risk groups. Although MDR has been widely used for case-control studies with binary phenotypes, several extensions have been proposed. One of these methods, a generalized MDR (GMDR) proposed by Lou et al. (2007), allows adjusting for covariates and applying to both dichotomous and continuous phenotypes. GMDR uses the residual score of a generalized linear model of phenotypes to assign either high-risk or low-risk group, while MDR uses the ratio of cases to controls. Methods: In this study, we propose multivariate GMDR, an extension of GMDR for multivariate phenotypes. Jointly analysing correlated multivariate phenotypes may have more power to detect susceptible genes and gene-gene interactions. We construct generalized estimating equations (GEE) with multivariate phenotypes to extend generalized linear models. Using the score vectors from GEE we discriminate high-risk from low-risk groups. We applied the multivariate GMDR method to the blood pressure data of the 7,546 subjects from the Korean Association Resource study: systolic blood pressure (SBP) and diastolic blood pressure (DBP). We compare the results of multivariate GMDR for SBP and DBP to the results from separate univariate GMDR for SBP and DBP, respectively. We also applied the multivariate GMDR method to the repeatedly measured hypertension status from 5,466 subjects and compared its result with those of univariate GMDR at each time point. Results: Results from the univariate GMDR and multivariate GMDR in two-locus model with both blood pressures and hypertension phenotypes indicate best combinations of SNPs whose interaction has
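
    The classic MDR step summarized in this record (pooling multi-locus genotype cells into high-risk and low-risk groups by their case-to-control ratio) can be sketched as follows. This is only an illustrative reading of that description, not the GMDR or multivariate GMDR method of the paper: the toy genotypes, the default threshold (the overall case:control ratio), the handling of cells with no controls, and all function names are assumptions.

```python
from collections import Counter


def mdr_high_risk_cells(genotypes, status, threshold=None):
    """Label two-locus genotype cells as high-risk by case:control ratio.

    genotypes: list of (snp1, snp2) genotype tuples, one per subject.
    status:    list of 1 (case) or 0 (control), same length.
    threshold: ratio cutoff; defaults to the overall case:control ratio
               (an assumption made for this sketch).
    """
    cases, controls = Counter(), Counter()
    for cell, y in zip(genotypes, status):
        (cases if y == 1 else controls)[cell] += 1

    if threshold is None:
        n_controls = sum(controls.values())
        if n_controls == 0:
            raise ValueError("need at least one control")
        threshold = sum(cases.values()) / n_controls

    high_risk = set()
    for cell in set(cases) | set(controls):
        # A cell with cases but no controls is treated as high-risk.
        ratio = float("inf") if controls[cell] == 0 else cases[cell] / controls[cell]
        if ratio >= threshold:
            high_risk.add(cell)
    return high_risk


def balanced_accuracy(genotypes, status, high_risk):
    """Mean of sensitivity and specificity of the high/low-risk labelling."""
    tp = sum(1 for g, y in zip(genotypes, status) if y == 1 and g in high_risk)
    tn = sum(1 for g, y in zip(genotypes, status) if y == 0 and g not in high_risk)
    n_cases = sum(status)
    n_controls = len(status) - n_cases
    return (tp / n_cases + tn / n_controls) / 2


# Hypothetical toy data: genotypes coded 0/1/2 at two SNPs.
genotypes = [(0, 0), (0, 0), (0, 1), (1, 1), (1, 1), (1, 1), (2, 0), (2, 0)]
status = [0, 0, 0, 1, 1, 0, 1, 0]

high = mdr_high_risk_cells(genotypes, status)
ba = balanced_accuracy(genotypes, status, high)
print(sorted(high), round(ba, 3))
```

    In practice the labelling is evaluated by cross-validation rather than on the training data as done here; the sketch omits that step for brevity.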

  17. Gene expression meta-analysis identifies chromosomal regions and candidate genes involved in breast cancer metastasis.

    PubMed

    Thomassen, Mads; Tan, Qihua; Kruse, Torben A

    2009-01-01

    Breast cancer cells exhibit complex karyotypic alterations causing deregulation of numerous genes. Some of these genes are probably causal for cancer formation and local growth whereas others are causal for the various steps of metastasis. In a fraction of tumors deregulation of the same genes might be caused by epigenetic modulations, point mutations or the influence of other genes. We have investigated the relation of gene expression and chromosomal position, using eight datasets including more than 1200 breast tumors, to identify chromosomal regions and candidate genes possibly causal for breast cancer metastasis. By use of "Gene Set Enrichment Analysis" we have ranked chromosomal regions according to their relation to metastasis. Overrepresentation analysis identified regions with increased expression for chromosome 1q41-42, 8q24, 12q14, 16q22, 16q24, 17q12-21.2, 17q21-23, 17q25, 20q11, and 20q13 among metastasizing tumors and reduced gene expression at 1p31-21, 8p22-21, and 14q24. By analysis of genes with extremely imbalanced expression in these regions we identified DIRAS3 at 1p31, PSD3, LPL, EPHX2 at 8p21-22, and FOS at 14q24 as candidate metastasis suppressor genes. Potential metastasis promoting genes include RECQL4 at 8q24, PRMT7 at 16q22, GINS2 at 16q24, and AURKA at 20q13.
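
    The abstract does not say which test underlies its overrepresentation analysis. One common choice, shown here purely as an assumption, is the upper tail of the hypergeometric distribution: the probability of seeing at least k genes from a chromosomal region among n selected genes when the region holds K of the N genes measured. The counts in the example are invented for illustration.

```python
from math import comb


def hypergeom_upper_tail(n_total, n_in_region, n_selected, n_overlap):
    """P(X >= n_overlap) for a hypergeometric draw without replacement.

    n_total:     all genes measured (N)
    n_in_region: genes located in the chromosomal region (K)
    n_selected:  genes in the up- or down-regulated list (n)
    n_overlap:   selected genes that fall in the region (k)
    """
    upper = min(n_in_region, n_selected)
    favourable = sum(
        comb(n_in_region, i) * comb(n_total - n_in_region, n_selected - i)
        for i in range(n_overlap, upper + 1)
    )
    return favourable / comb(n_total, n_selected)


# Hypothetical counts: 20 genes measured, 5 in the region,
# 4 genes selected, 3 of which lie in the region.
p_value = hypergeom_upper_tail(20, 5, 4, 3)
print(f"{p_value:.4f}")
```

    Exact integer binomial coefficients keep the calculation simple for small gene counts; for genome-scale inputs a library routine would be the more usual choice.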

  18. Systemic virus-induced gene silencing allows functional characterization of maize genes during biotrophic interaction with Ustilago maydis.

    PubMed

    van der Linde, Karina; Kastner, Christine; Kumlehn, Jochen; Kahmann, Regine; Doehlemann, Gunther

    2011-01-01

    Infection of maize (Zea mays) plants with the corn smut fungus Ustilago maydis leads to the formation of large tumors on the stem, leaves and inflorescences. In this biotrophic interaction, plant defense responses are actively suppressed by the pathogen, and previous transcriptome analyses of infected maize plants showed massive and stage-specific changes in host gene expression during disease progression. To identify maize genes that are functionally involved in the interaction with U. maydis, we adapted a virus-induced gene silencing (VIGS) system based on the brome mosaic virus (BMV) for maize. Conditions were established that allowed successful U. maydis infection of BMV-preinfected maize plants. This set-up enabled quantification of VIGS and its impact on U. maydis infection using a quantitative real-time PCR (qRT-PCR)-based readout. In proof-of-principle experiments, a U. maydis-induced terpene synthase was shown to negatively regulate disease development while a protein involved in cell death inhibition was required for full virulence of U. maydis. The results suggest that this system is a versatile tool for the rapid identification of maize genes that determine compatibility with U. maydis. © (2010) Max Planck Society. Journal compilation © New Phytologist Trust (2010).

  19. Interaction between LRP5 and periostin gene polymorphisms on serum periostin levels and cortical bone microstructure.

    PubMed

    Pepe, J; Bonnet, N; Herrmann, F R; Biver, E; Rizzoli, R; Chevalley, T; Ferrari, S L

    2018-02-01

    We investigated the interaction between periostin SNPs and the SNPs of the genes assumed to modulate serum periostin levels and bone microstructure in a cohort of postmenopausal women. We identified an interaction between LRP5 SNP rs648438 and periostin SNP rs9547970 on serum periostin levels and on radial cortical porosity. The purpose of this study is to investigate the interaction between periostin gene polymorphisms (SNPs) and other genes potentially responsible for modulating serum periostin levels and bone microstructure in a cohort of postmenopausal women. In 648 postmenopausal women from the Geneva Retirees Cohort, we analyzed 6 periostin SNPs and another 149 SNPs in 14 genes, namely BMP2, CTNNB1, ESR1, ESR2, LRP5, LRP6, PTH, SPTBN1, SOST, TGFb1, TNFRSF11A, TNFSF11, TNFRSF11B and WNT16. Volumetric BMD and bone microstructure were measured by high-resolution peripheral quantitative computed tomography at the distal radius and tibia. Serum periostin levels were associated with radial cortical porosity, including after adjustment for age, BMI, and years since menopause (p = 0.036). Sixteen SNPs in the ESR1, LRP5, TNFRSF11A, SOST, SPTBN1, TNFRSF11B and TNFSF11 genes were associated with serum periostin levels (p range 0.03-0.001) whereas 26 SNPs in 9 genes were associated with cortical porosity at the radius and/or at the tibia. WNT16 was the gene with the highest number of SNPs associated with both trabecular and cortical microstructure. The periostin SNP rs9547970 was also associated with cortical porosity (p = 0.04). In particular, SNPs in LRP5, ESR1 and near the TNFRSF11A gene were associated with both cortical porosity and serum periostin levels. Finally, we identified an interaction between LRP5 SNP rs648438 and periostin SNP rs9547970 on serum periostin levels (interaction p = 0.01) and on radial cortical porosity (interaction p = 0.005). These results suggest that periostin expression is genetically modulated, particularly by polymorphisms

  20. Identifying Functional Mechanisms of Gene and Protein Regulatory Networks in Response to a Broader Range of Environmental Stresses

    PubMed Central

    Li, Cheng-Wei; Chen, Bor-Sen

    2010-01-01

    Cellular responses to sudden environmental stresses or physiological changes provide living organisms with the opportunity for final survival and further development. Therefore, it is an important topic to understand protective mechanisms against environmental stresses from the viewpoint of gene and protein networks. We propose two coupled nonlinear stochastic dynamic models to reconstruct stress-activated gene and protein regulatory networks via microarray data in response to environmental stresses. According to the reconstructed gene/protein networks, some possible mutual interactions, feedforward and feedback loops are found for accelerating response and filtering noises in these signaling pathways. A bow-tie core network is also identified to coordinate mutual interactions and feedforward loops, feedback inhibitions, feedback activations, and cross talks to cope efficiently with a broader range of environmental stresses with limited proteins and pathways. PMID:20454442

  1. Gene Polymorphism Association with Type 2 Diabetes and Related Gene-Gene and Gene-Environment Interactions in a Uyghur Population

    PubMed Central

    Xiao, Shan; Zeng, Xiaoyun; Fan, Yong; Su, Yinxia; Ma, Qi; Zhu, Jun; Yao, Hua

    2016-01-01

    Background: We investigated the association between 8 single-nucleotide polymorphisms (SNPs) at 3 genetic loci (CDKAL1, CDKN2A/2B and FTO) with type 2 diabetes (T2D) in a Uyghur population. Material/Methods: A case-control study of 879 Uyghur patients with T2D and 895 non-diabetic Uyghur controls was conducted at the Hospital of Xinjiang Medical University between 2010 and 2013. Eight SNPs in CDKAL1, CDKN2A/2B and FTO were analyzed using Sequenom MassARRAY® SNP genotyping. Factors associated with T2D were assessed by logistic regression analyses. Gene-gene and gene-environment interactions were analyzed by generalized multifactor dimensionality reduction. Results: Genotype distributions of rs10811661 (CDKN2A/2B), rs7195539, rs8050136, and rs9939609 (FTO) and allele frequencies of rs8050136 and rs9939609 differed significantly between diabetes and control groups (all P<0.05). While rs10811661, rs8050136, and rs9939609 were eliminated after adjusting for covariates (P>0.05), rs7195539 distribution differed significantly in co-dominant and dominant models (P<0.05). In gene-gene interaction analysis, after adjusting for covariates the two-locus rs10811661-rs7195539 interaction model had a cross-validation consistency of 10/10 and the highest balanced accuracy of 0.5483 (P=0.014). In gene-environment interaction analysis, the 3-locus interaction model TG-HDL-family history of diabetes had a cross-validation consistency of 10/10 and the highest balanced accuracy of 0.7072 (P<0.001). The 4-locus interaction model, rs7195539-TG-HDL-family history of diabetes had a cross-validation consistency of 8/10 (P<0.001). Conclusions: Polymorphisms in CDKN2A/2B and FTO, but not CDKAL1, may be associated with T2D, and alleles rs8050136 and rs9939609 are likely risk alleles for T2D in this population. There were potential interactions among CDKN2A/2B (rs10811661) – FTO (rs7195539) or FTO (rs7195539)-TG-HDL-family history of diabetes in the pathogenesis of T2D in a Uyghur population. PMID

  2. Evidence for gene-gene epistatic interactions among susceptibility loci for systemic lupus erythematosus.

    PubMed

    Hughes, Travis; Adler, Adam; Kelly, Jennifer A; Kaufman, Kenneth M; Williams, Adrienne H; Langefeld, Carl D; Brown, Elizabeth E; Alarcón, Graciela S; Kimberly, Robert P; Edberg, Jeffrey C; Ramsey-Goldman, Rosalind; Petri, Michelle; Boackle, Susan A; Stevens, Anne M; Reveille, John D; Sanchez, Elena; Martín, Javier; Niewold, Timothy B; Vilá, Luis M; Scofield, R Hal; Gilkeson, Gary S; Gaffney, Patrick M; Criswell, Lindsey A; Moser, Kathy L; Merrill, Joan T; Jacob, Chaim O; Tsao, Betty P; James, Judith A; Vyse, Timothy J; Alarcón-Riquelme, Marta E; Harley, John B; Richardson, Bruce C; Sawalha, Amr H

    2012-02-01

    Several confirmed genetic susceptibility loci for lupus have been described. To date, no clear evidence for genetic epistasis in lupus has been established. The aim of this study was to test for gene-gene interactions in a number of known lupus susceptibility loci. Eighteen single-nucleotide polymorphisms tagging independent and confirmed lupus susceptibility loci were genotyped in a set of 4,248 patients with lupus and 3,818 normal healthy control subjects of European descent. Epistasis was tested by a 2-step approach using both parametric and nonparametric methods. The false discovery rate (FDR) method was used to correct for multiple testing. We detected and confirmed gene-gene interactions between the HLA region and CTLA4, IRF5, and ITGAM and between PDCD1 and IL21 in patients with lupus. The most significant interaction detected by parametric analysis was between rs3131379 in the HLA region and rs231775 in CTLA4 (interaction odds ratio 1.19, Z = 3.95, P = 7.8 × 10⁻⁵ [FDR ≤0.05], P for multifactor dimensionality reduction = 5.9 × 10⁻⁴⁵). Importantly, our data suggest that in patients with lupus, the presence of the HLA lupus risk alleles in rs1270942 and rs3131379 increases the odds of also carrying the lupus risk allele in IRF5 (rs2070197) by 17% and 16%, respectively (P = 0.0028 and P = 0.0047, respectively). We provide evidence for gene-gene epistasis in systemic lupus erythematosus. These findings support a role for genetic interaction contributing to the complexity of lupus heritability. Copyright © 2012 by the American College of Rheumatology.
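
    The abstract states only that "the false discovery rate (FDR) method" was used for multiple-testing correction. The sketch below assumes the widely used Benjamini-Hochberg step-up procedure, which may or may not be the exact variant the authors applied; the input P values are hypothetical and not taken from the study.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order.

    Assumption for this sketch: the standard step-up procedure, where the
    i-th smallest P value is scaled by m / i and monotonicity is enforced
    from the largest P value downwards.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical interaction-test P values.
raw = [0.001, 0.008, 0.039, 0.041, 0.60]
fdr = benjamini_hochberg(raw)
significant = [p <= 0.05 for p in fdr]
print(fdr, significant)
```

    With these toy inputs, two tests with raw P below 0.05 no longer pass after adjustment, which illustrates why a raw P value and an FDR statement are reported side by side.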

  3. Bayesian logistic regression in detection of gene-steroid interaction for cancer at PDLIM5 locus.

    PubMed

    Wang, Ke-Sheng; Owusu, Daniel; Pan, Yue; Xie, Changchun

    2016-06-01

    The PDZ and LIM domain 5 (PDLIM5) gene may play a role in cancer, bipolar disorder, major depression, alcohol dependence and schizophrenia; however, little is known about the interaction effect of steroid and PDLIM5 gene on cancer. This study examined 47 single-nucleotide polymorphisms (SNPs) within the PDLIM5 gene in the Marshfield sample with 716 cancer patients (any diagnosed cancer, excluding minor skin cancer) and 2848 noncancer controls. Multiple logistic regression model in PLINK software was used to examine the association of each SNP with cancer. Bayesian logistic regression in PROC GENMOD in SAS statistical software, ver. 9.4 was used to detect gene-steroid interactions influencing cancer. Single marker analysis using PLINK identified 12 SNPs associated with cancer (P < 0.05); especially, SNP rs6532496 revealed the strongest association with cancer (P = 6.84 × 10⁻³); while the next best signal was rs951613 (P = 7.46 × 10⁻³). Classic logistic regression in PROC GENMOD showed that both rs6532496 and rs951613 revealed strong gene-steroid interaction effects (OR=2.18, 95% CI=1.31-3.63 with P = 2.9 × 10⁻³ for rs6532496 and OR=2.07, 95% CI=1.24-3.45 with P = 5.43 × 10⁻³ for rs951613, respectively). Results from Bayesian logistic regression showed stronger interaction effects (OR=2.26, 95% CI=1.2-3.38 for rs6532496 and OR=2.14, 95% CI=1.14-3.2 for rs951613, respectively). All the 12 SNPs associated with cancer revealed significant gene-steroid interaction effects (P < 0.05); whereas 13 SNPs showed gene-steroid interaction effects without main effect on cancer. SNP rs4634230 revealed the strongest gene-steroid interaction effect (OR=2.49, 95% CI=1.5-4.13 with P = 4.0 × 10⁻⁴ based on the classic logistic regression and OR=2.59, 95% CI=1.4-3.97 from Bayesian logistic regression; respectively). This study provides evidence of common genetic variants within the PDLIM5 gene and interactions between PDLIM5 gene polymorphisms and steroid use
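
    The odds ratios, confidence intervals and P values reported for the classic logistic regression are linked through the log-odds coefficient. The sketch below assumes a Wald interval that is symmetric on the log scale (not stated in the abstract) and back-calculates the coefficient, its standard error and an approximate two-sided P value from the reported OR and 95% CI for rs6532496. Because the published figures are rounded, the result only approximates the reported P value, and the approach does not apply to the Bayesian credible intervals. The function name is hypothetical.

```python
import math

Z_95 = 1.959964  # standard normal quantile for a two-sided 95% interval


def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=Z_95):
    """Recover (beta, se, z, p) from an odds ratio and its Wald CI.

    Assumes the interval was built as exp(beta +/- z_crit * se).
    """
    beta = math.log(odds_ratio)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = beta / se
    # Two-sided P value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return beta, se, z, p


# OR and 95% CI for the rs6532496 gene-steroid interaction, as reported above.
beta, se, z, p = wald_from_or_ci(2.18, 1.31, 3.63)
print(f"beta={beta:.3f} se={se:.3f} z={z:.2f} p={p:.2e}")
```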

  4. Integrating Genetic and Gene Co-expression Analysis Identifies Gene Networks Involved in Alcohol and Stress Responses

    PubMed Central

    Luo, Jie; Xu, Pei; Cao, Peijian; Wan, Hongjian; Lv, Xiaonan; Xu, Shengchun; Wang, Gangjun; Cook, Melloni N.; Jones, Byron C.; Lu, Lu; Wang, Xusheng

    2018-01-01

    Although the link between stress and alcohol is well recognized, the underlying mechanisms of how they interplay at the molecular level remain unclear. The purpose of this study is to identify molecular networks underlying the effects of alcohol and stress responses, as well as their interaction on anxiety behaviors in the hippocampus of mice using a systems genetics approach. Here, we applied a gene co-expression network approach to transcriptomes of 41 BXD mouse strains under four conditions: stress, alcohol, stress-induced alcohol and control. The co-expression analysis identified 14 modules and characterized four expression patterns across the four conditions. The four expression patterns include up-regulation in no restraint stress and given an ethanol injection (NOE) but restoration in restraint stress followed by an ethanol injection (RSE; pattern 1), down-regulation in NOE but rescue in RSE (pattern 2), up-regulation in both restraint stress followed by a saline injection (RSS) and NOE, and further amplification in RSE (pattern 3), and up-regulation in RSS but reduction in both NOE and RSE (pattern 4). We further identified four functional subnetworks by superimposing protein-protein interactions (PPIs) to the 14 co-expression modules, including γ-aminobutyric acid receptor (GABA) signaling, glutamate signaling, neuropeptide signaling, cAMP-dependent signaling. We further performed module specificity analysis to identify modules that are specific to stress, alcohol, or stress-induced alcohol responses. Finally, we conducted causality analysis to link genetic variation to these identified modules, and anxiety behaviors after stress and alcohol treatments. This study underscores the importance of integrative analysis and offers new insights into the molecular networks underlying stress and alcohol responses. PMID:29674951

  5. A functional cancer genomics screen identifies a druggable synthetic lethal interaction between MSH3 and PRKDC.

    PubMed

    Dietlein, Felix; Thelen, Lisa; Jokic, Mladen; Jachimowicz, Ron D; Ivan, Laura; Knittel, Gero; Leeser, Uschi; van Oers, Johanna; Edelmann, Winfried; Heukamp, Lukas C; Reinhardt, H Christian

    2014-05-01

    Here, we use a large-scale cell line-based approach to identify cancer cell-specific mutations that are associated with DNA-dependent protein kinase catalytic subunit (DNA-PKcs) dependence. For this purpose, we profiled the mutational landscape across 1,319 cancer-associated genes of 67 distinct cell lines and identified numerous genes involved in homologous recombination-mediated DNA repair, including BRCA1, BRCA2, ATM, PAXIP, and RAD50, as being associated with non-oncogene addiction to DNA-PKcs. Mutations in the mismatch repair gene MSH3, which have been reported to occur recurrently in numerous human cancer entities, emerged as the most significant predictors of DNA-PKcs addiction. Concordantly, DNA-PKcs inhibition robustly induced apoptosis in MSH3-mutant cell lines in vitro and displayed remarkable single-agent efficacy against MSH3-mutant tumors in vivo. Thus, we here identify a therapeutically actionable synthetic lethal interaction between MSH3 and the non-homologous end joining kinase DNA-PKcs. Our observations recommend DNA-PKcs inhibition as a therapeutic concept for the treatment of human cancers displaying homologous recombination defects.

  6. Blood lead levels, iron metabolism gene polymorphisms and homocysteine: a gene-environment interaction study.

    PubMed

    Kim, Kyoung-Nam; Lee, Mee-Ri; Lim, Youn-Hee; Hong, Yun-Chul

    2017-12-01

    Homocysteine has been causally associated with various adverse health outcomes. Evidence supporting the relationship between lead and homocysteine levels has been accumulating, but most prior studies have not focused on the interaction with genetic polymorphisms. From a community-based prospective cohort, we analysed 386 participants (aged 41-71 years) with information regarding blood lead and plasma homocysteine levels. Blood lead levels were measured between 2001 and 2003, and plasma homocysteine levels were measured in 2007. Interactions of lead levels with 42 genotyped single-nucleotide polymorphisms (SNPs) in five genes (TF, HFE, CBS, BHMT and MTR) were assessed via a 2-degree of freedom (df) joint test and a 1-df interaction test. In secondary analyses using imputation, we further assessed 58 imputed SNPs in the TF and MTHFR genes. Blood lead concentrations were positively associated with plasma homocysteine levels (p=0.0276). Six SNPs in the TF and MTR genes were screened using the 2-df joint test, and among them, three SNPs in the TF gene showed interactions with lead with respect to homocysteine levels through the 1-df interaction test (p<0.0083). Seven SNPs in the MTHFR gene were associated with homocysteine levels at an α-level of 0.05, but the associations did not persist after Bonferroni correction. These SNPs did not show interactions with lead levels. Blood lead levels were positively associated with plasma homocysteine levels measured 4-6 years later, and three SNPs in the TF gene modified the association.
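
    The cutoff p<0.0083 in this record matches a Bonferroni correction of α = 0.05 over the six SNPs that passed the 2-df screen (0.05 / 6). That reading is an inference from the numbers in the abstract, not something the abstract states. A minimal sketch of the arithmetic follows; the function names and the example p-values are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold that keeps the family-wise error rate at alpha."""
    return alpha / n_tests

def passes(p_values, alpha=0.05):
    """Return, for each p-value, whether it survives Bonferroni correction."""
    cutoff = bonferroni_threshold(alpha, len(p_values))
    return [p < cutoff for p in p_values]

# Six SNPs screened by the 2-df joint test (count taken from the abstract)
threshold = bonferroni_threshold(0.05, 6)
print(f"{threshold:.4f}")

# Hypothetical p-values for six SNPs (illustrative only, not from the study)
example = [0.001, 0.004, 0.007, 0.02, 0.03, 0.2]
print(passes(example))
```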

  7. Evolution of Genes Involved in Gamete Interaction: Evidence for Positive Selection, Duplications and Losses in Vertebrates

    PubMed Central

    Callebaut, Isabelle; Laurin, Michel; Pascal, Géraldine; Poupon, Anne; Goudet, Ghylène; Monget, Philippe

    2012-01-01

    Genes encoding proteins involved in sperm-egg interaction and fertilization exhibit a particularly fast evolution and may participate in prezygotic species isolation [1], [2]. Some of them (ZP3, ADAM1, ADAM2, ACR and CD9) have individually been shown to evolve under positive selection [3], [4], suggesting a role of positive Darwinian selection on sperm-egg interaction. However, the genes involved in this biological function have not been systematically and exhaustively studied with an evolutionary perspective, in particular across vertebrates with internal and external fertilization. Here we show that 33 genes among the 69 that have been experimentally shown to be involved in fertilization in at least one taxon in vertebrates are under positive selection. Moreover, we identified 17 pseudogenes and 39 genes that have at least one duplicate in one species. For 15 genes, we found neither positive selection, nor gene copies or pseudogenes. Genes of teleosts, especially genes involved in sperm-oolemma fusion, appear to be more frequently under positive selection than genes of birds and eutherians. In contrast, pseudogenization, gene loss and gene gain are more frequent in eutherians. Thus, each of the 19 studied vertebrate species exhibits a unique signature characterized by gene gain and loss, as well as position of amino acids under positive selection. Reflecting these clade-specific signatures, teleosts and eutherian mammals are recovered as clades in a parsimony analysis. Interestingly the same analysis places Xenopus apart from teleosts, with which it shares the primitive external fertilization, and locates it along with amniotes (which share internal fertilization), suggesting that external or internal environmental conditions of germ cell interaction may not be the unique factors that drive the evolution of fertilization genes. 
    Our work should improve our understanding of the fertilization process and of the establishment of reproductive barriers, for example by

  8. Genetics of Addiction: Future Focus on Gene × Environment Interaction?

    PubMed

    Vink, Jacqueline M

    2016-09-01

    The heritability of substance use is moderate to high. Successful efforts to find genetic variants associated with substance use (smoking, alcohol, cannabis) have been undertaken by large consortia. However, the proportion of phenotypic variance explained by the identified genetic variants is small. Interestingly, there is overlap between the genetic variants that influence different substances. Moreover, there are sets of "substance-specific" genes and sets of genes contributing to a "vulnerability for addictive behavior" in general. It is important to recognize that genes alone do not determine addiction phenotypes: Environmental factors such as parental monitoring, peer pressure, or socioeconomic status also play an important role. Despite a rich epidemiologic literature focused on the social determinants of substance use, few studies have examined the moderation of genetic influences like gene-environment (G × E) interactions. Understanding this balance may hold the key to understanding the individual differences in substance use, abuse, and addictive behavior. Recommendations for future research are described in this commentary and include increasing the power of G × E studies by using state-of-the-art methods such as polygenic risk scores instead of single genetic variants and taking genetic overlap between substances into account. Future genetic studies should also investigate environmental risk factors for addictive behavior more extensively to unravel the interaction between nature and nurture. Focusing on G × E interactions not only will give insight into the underlying biological mechanism but will also characterize subgroups (based on environmental factors) at high risk for addictive behaviors. With this information, we could bridge the gap between fundamental research and applications for society.

  9. Topology association analysis in weighted protein interaction network for gene prioritization

    NASA Astrophysics Data System (ADS)

    Wu, Shunyao; Shao, Fengjing; Zhang, Qi; Ji, Jun; Xu, Shaojie; Sun, Rencheng; Sun, Gengxin; Du, Xiangjun; Sui, Yi

    2016-11-01

    Although many algorithms for disease gene prediction have been proposed, the weights of edges are rarely taken into account. In this paper, the strengths of topology associations between disease and essential genes are analyzed in a weighted protein interaction network. Empirical analysis demonstrates that, compared to other genes, disease genes are weakly connected with essential genes in the protein interaction network. Based on this finding, a novel global distance measurement for gene prioritization with a weighted protein interaction network is proposed. Positive and negative flow is allocated to disease and essential genes, respectively. Additionally, the network propagation model is extended to weighted networks. Experimental results on 110 diseases verify the effectiveness and potential of the proposed measurement. Moreover, weak links play a more important role than strong links in gene prioritization, which is meaningful for a deeper understanding of the protein interaction network.

  10. Whole Wiskott‑Aldrich syndrome protein gene deletion identified by high throughput sequencing.

    PubMed

    He, Xiangling; Zou, Runying; Zhang, Bing; You, Yalan; Yang, Yang; Tian, Xin

    2017-11-01

    Wiskott‑Aldrich syndrome (WAS) is a rare X‑linked recessive immunodeficiency disorder, characterized by thrombocytopenia, small platelets, eczema and recurrent infections associated with increased risk of autoimmunity and malignancy disorders. Mutations in the WAS protein (WASP) gene are responsible for WAS. To date, WASP mutations, including missense/nonsense, splicing, small deletions, small insertions, gross deletions, and gross insertions have been identified in patients with WAS. In addition, WASP‑interacting proteins are suspected in patients with clinical features of WAS, in whom the WASP gene sequence and mRNA levels are normal. The present study aimed to investigate the application of next generation sequencing in definitive diagnosis and clinical therapy for WAS. A 5 month‑old child with WAS who displayed symptoms of thrombocytopenia was examined. Whole exome sequence analysis of genomic DNA showed that the coverage and depth of WASP were extremely low. Quantitative polymerase chain reaction indicated total WASP gene deletion in the proband. In conclusion, high throughput sequencing is useful for the verification of WAS on the genetic profile, and has implications for family planning guidance and establishment of clinical programs.

  11. Genetic and Physical Interaction of the B-Cell SLE-Associated Genes BANK1 and BLK

    PubMed Central

    Castillejo-López, Casimiro; Delgado-Vega, Angélica M.; Wojcik, Jerome; Kozyrev, Sergey V.; Thavathiru, Elangovan; Wu, Ying-Yu; Sánchez, Elena; Pöllmann, David; López-Egido, Juan R.; Fineschi, Serena; Domínguez, Nicolás; Lu, Rufei; James, Judith A.; Merrill, Joan T.; Kelly, Jennifer A.; Kaufman, Kenneth M.; Moser, Kathy; Gilkeson, Gary; Frostegård, Johan; Pons-Estel, Bernardo A.; D’Alfonso, Sandra; Witte, Torsten; Callejas, José Luis; Harley, John B.; Gaffney, Patrick; Martin, Javier; Guthridge, Joel M.; Alarcón-Riquelme, Marta E.

    2012-01-01

    Objectives Altered signaling in B-cells is a predominant feature of systemic lupus erythematosus (SLE). The genes BANK1 and BLK were recently described as associated with SLE. BANK1 codes for a B-cell-specific cytoplasmic protein involved in B-cell receptor signaling and BLK codes for an Src tyrosine kinase with important roles in B-cell development. To characterize the role of BANK1 and BLK in SLE, we performed a genetic interaction analysis hypothesizing that genetic interactions could reveal functional pathways relevant to disease pathogenesis. Methods We used the method GPAT16 to analyze the gene-gene interactions of BANK1 and BLK. Confocal microscopy was used to investigate co-localization, and immunoprecipitation was used to verify the physical interaction of BANK1 and BLK. Results Epistatic interactions between BANK1 and BLK polymorphisms associated with SLE were observed in a discovery set of 279 patients and 515 controls from Northern Europe. A meta-analysis with 4399 European individuals confirmed the genetic interactions between BANK1 and BLK. As BANK1 was identified as a binding partner of the Src tyrosine kinase LYN, we tested the possibility that BANK1 and BLK could also show a protein-protein interaction. We demonstrated co-immunoprecipitation and co-localization of BLK and BANK1. In a Daudi cell line and primary naïve B-cells the endogenous binding was enhanced upon B-cell receptor stimulation using anti-IgM antibodies. Conclusions Here, we show a genetic interaction between BANK1 and BLK, and demonstrate that these molecules interact physically. Our results have important consequences for the understanding of SLE and other autoimmune diseases and identify a potential new signaling pathway. PMID:21978998

  12. Interactions in the microbiome: communities of organisms and communities of genes

    PubMed Central

    Boon, Eva; Meehan, Conor J; Whidden, Chris; Wong, Dennis H-J; Langille, Morgan GI; Beiko, Robert G

    2014-01-01

    A central challenge in microbial community ecology is the delineation of appropriate units of biodiversity, which can be taxonomic, phylogenetic, or functional in nature. The term ‘community’ is applied ambiguously; in some cases, the term refers simply to a set of observed entities, while in other cases, it requires that these entities interact with one another. Microorganisms can rapidly gain and lose genes, potentially decoupling community roles from taxonomic and phylogenetic groupings. Trait-based approaches offer a useful alternative, but many traits can be defined based on gene functions, metabolic modules, and genomic properties, and the optimal set of traits to choose is often not obvious. An analysis that considers taxon assignment and traits in concert may be ideal, with the strengths of each approach offsetting the weaknesses of the other. Individual genes also merit consideration as entities in an ecological analysis, with characteristics such as diversity, turnover, and interactions modeled using genes rather than organisms as entities. We identify some promising avenues of research that are likely to yield a deeper understanding of microbial communities that shift from observation-based questions of ‘Who is there?’ and ‘What are they doing?’ to the mechanistically driven question of ‘How will they respond?’ PMID:23909933

  13. Impact of Maspin Polymorphism rs2289520 G/C and Its Interaction with Gene to Gene, Alcohol Consumption Increase Susceptibility to Oral Cancer Occurrence.

    PubMed

    Yang, Po-Yu; Miao, Nae-Fang; Lin, Chiao-Wen; Chou, Ying-Erh; Yang, Shun-Fa; Huang, Hui-Chuan; Chang, Hsiu-Ju; Tsai, Hsiu-Ting

    2016-01-01

    The purpose of this study was to identify gene polymorphisms of mammary serine protease inhibitor (Maspin) specific to patients with oral cancer susceptibility and clinicopathological status. Three single-nucleotide polymorphisms (SNPs) of the Maspin gene from 741 patients with oral cancer and 601 non-cancer controls were analyzed by real-time PCR. The participants with G/G homozygotes or with G/C heterozygotes of Maspin rs2289520 polymorphism had a 2.07-fold (p = 0.01) and a 2.01-fold (p = 0.02) risk of developing oral cancer compared to those with C/C homozygotes. Moreover, gene-gene interaction increased the risk of oral cancer susceptibility among subjects exposed to oral cancer-related risk factors, including areca, alcohol, and tobacco consumption. The G allele of the Maspin rs2289520 polymorphism may be a factor that increases the susceptibility to oral cancer. The interactions of gene to oral cancer-related environmental risk factors have a synergetic effect that can further enhance oral cancer development.

  14. A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes

    PubMed Central

    Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong

    2015-01-01

    In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data. PMID:26201006

  15. A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes.

    PubMed

    Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong

    2015-01-01

    In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data.

  16. Gene-environment interaction between adiponectin gene polymorphisms and environmental factors on the risk of diabetic retinopathy.

    PubMed

    Li, Yuan; Wu, Qun Hong; Jiao, Ming Li; Fan, Xiao Hong; Hu, Quan; Hao, Yan Hua; Liu, Ruo Hong; Zhang, Wei; Cui, Yu; Han, Li Yuan

    2015-01-01

    To evaluate whether the adiponectin gene is associated with diabetic retinopathy (DR) risk and interaction with environmental factors modifies the DR risk, and to investigate the relationship between serum adiponectin levels and DR. Four adiponectin polymorphisms were evaluated in 372 DR cases and 145 controls. Differences in environmental factors between cases and controls were evaluated by unconditional logistic regression analysis. The model-free multifactor dimensionality reduction method and traditional multiple regression models were applied to explore interactions between the polymorphisms and environmental factors. Using the Bonferroni method, we found no significant associations between four adiponectin polymorphisms and DR susceptibility. Multivariate logistic regression found that physical activity played a protective role in the progress of DR, whereas family history of diabetes (odds ratio 1.75) and insulin therapy (odds ratio 1.78) were associated with an increased risk for DR. The interaction between the C-11377 G (rs266729) polymorphism and insulin therapy might be associated with DR risk. Family history of diabetes combined with insulin therapy also increased the risk of DR. No adiponectin gene polymorphisms influenced the serum adiponectin levels. Serum adiponectin levels did not differ between the DR group and non-DR group. No significant association was identified between four adiponectin polymorphisms and DR susceptibility after stringent Bonferroni correction. The interaction between C-11377G (rs266729) polymorphism and insulin therapy, as well as the interaction between family history of diabetes and insulin therapy, might be associated with DR susceptibility.

  17. A Stratified Transcriptomics Analysis of Polygenic Fat and Lean Mouse Adipose Tissues Identifies Novel Candidate Obesity Genes

    PubMed Central

    Morton, Nicholas M.; Nelson, Yvonne B.; Michailidou, Zoi; Di Rollo, Emma M.; Ramage, Lynne; Hadoke, Patrick W. F.; Seckl, Jonathan R.; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J.; Dunbar, Donald R.

    2011-01-01

    Background Obesity and metabolic syndrome result from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. Results To enrich for adipose tissue obesity genes a ‘snap-shot’ pooled-sample transcriptome comparison of key fat depots and non-adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1 and Npr3) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding, which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria, were characterised, revealing novel functional effects consistent with a contribution to obesity. Conclusions A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes

  18. Interactions between Bmp-4 and Msx-1 act to restrict gene expression to odontogenic mesenchyme.

    PubMed

    Tucker, A S; Al Khamis, A; Sharpe, P T

    1998-08-01

    Tooth development is regulated by a reciprocal series of epithelial-mesenchymal interactions. Bmp4 has been identified as a candidate signalling molecule in these interactions, initially as an epithelial signal and then later at the bud stage as a mesenchymal signal (Vainio et al. [1993] Cell 75:45-58). A target gene for Bmp4 signalling is the homeobox gene Msx-1, identified by the ability of recombinant Bmp4 protein to induce expression in mesenchyme. There is, however, no evidence that Bmp4 is the endogenous inducer of Msx-1 expression. Msx-1 and Bmp-4 show dynamic, interactive patterns of expression in oral epithelium and ectomesenchyme during the early stages of tooth development. In this study, we compare the temporal and spatial expression of these two genes to determine whether the changing expression patterns of these genes are consistent with interactions between the two molecules. We show that changes in Bmp-4 expression precede changes in Msx-1 expression. At embryonic day (E)10.5-E11.0, expression patterns are consistent with BMP4 from the epithelium, inducing or maintaining Msx-1 in underlying mesenchyme. At E11.5, Bmp-4 expression shifts from epithelium to mesenchyme and is rapidly followed by localised up-regulation of Msx-1 expression at the sites of Bmp-4 expression. Using cultured explants of developing mandibles, we confirm that exogenous BMP4 is capable of replacing the endogenous source in epithelium and inducing Msx-1 gene expression in mesenchyme. By using noggin, a BMP inhibitor, we show that endogenous Msx-1 expression can be inhibited at E10.5 and E11.5, providing the first evidence that endogenous Bmp-4 from the epithelium is responsible for regulating the early spatial expression of Msx-1. We also show that the mesenchymal shift in Bmp-4 is responsible for up-regulating Msx-1 specifically at the sites of future tooth formation. 
    Thus, we establish that a reciprocal series of interactions act to restrict expression of both genes to future

  19. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    PubMed

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.
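
    The random walk with restart (RWR) named in this record can be sketched in a few lines. This is a generic textbook formulation, not the authors' implementation: the update rule p ← (1 − r)·W·p + r·p₀, the restart probability of 0.3, the toy network and the gene labels G1 to G4 are all assumptions made for illustration.

```python
def random_walk_with_restart(adj, seeds, restart=0.3, tol=1e-10, max_iter=1000):
    """Iterate p <- (1 - r) * W p + r * p0 until the L1 change falls below tol.

    adj   : dict mapping each node to a dict {neighbour: edge weight}
    seeds : known disease genes; the walker restarts from these uniformly
    """
    nodes = sorted(adj)
    # Total outgoing weight of each node, used to turn weights into step probabilities
    out_weight = {u: sum(adj[u].values()) for u in nodes}
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        nxt = {n: restart * p0[n] for n in nodes}
        for u in nodes:
            for v, w in adj[u].items():
                nxt[v] += (1.0 - restart) * p[u] * w / out_weight[u]
        change = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if change < tol:
            break
    return p

# Hypothetical undirected toy network with unit weights (not real interaction data)
toy = {
    "G1": {"G2": 1.0, "G3": 1.0},
    "G2": {"G1": 1.0, "G3": 1.0},
    "G3": {"G1": 1.0, "G2": 1.0, "G4": 1.0},
    "G4": {"G3": 1.0},
}
scores = random_walk_with_restart(toy, seeds={"G1"})
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

    Candidate genes are then ranked by their steady-state score. Integrating two networks, as the record describes, would amount to merging their edge sets into `adj` before the walk; how the authors weight that merge is not stated in the abstract.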

  20. Gene-Diet Interactions in Type 2 Diabetes: The Chicken and Egg Debate

    PubMed Central

    Ortega, Ángeles; Berná, Genoveva; Rojas, Anabel; Martín, Franz; Soria, Bernat

    2017-01-01

    Consistent evidence from both experimental and human studies indicates that Type 2 diabetes mellitus (T2DM) is a complex disease resulting from the interaction of genetic, epigenetic, environmental, and lifestyle factors. Nutrients and dietary patterns are important environmental factors to consider in the prevention, development and treatment of this disease. Nutritional genomics focuses on the interaction between bioactive food components and the genome and includes studies of nutrigenetics, nutrigenomics and epigenetic modifications caused by nutrients. There is evidence supporting the existence of nutrient-gene and T2DM interactions coming from animal studies and family-based intervention studies. Moreover, many case-control, cohort, cross-sectional cohort studies and clinical trials have identified relationships between individual genetic load, diet and T2DM. Some of these studies were on a large scale. In addition, studies with animal models and human observational studies, in different countries over periods of time, support a causative relationship between adverse nutritional conditions during in utero development, persistent epigenetic changes and T2DM. This review provides comprehensive information on the current state of nutrient-gene interactions and their role in T2DM pathogenesis, the relationship between individual genetic load and diet, and the importance of epigenetic factors in influencing gene expression and defining the individual risk of T2DM. PMID:28574454

  1. A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

    PubMed

    Ishikawa, Akira

    2017-11-27

    Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.

  2. A robust multifactor dimensionality reduction method for detecting gene-gene interactions with application to the genetic analysis of bladder cancer susceptibility

    PubMed Central

    Gui, Jiang; Andrew, Angeline S.; Andrews, Peter; Nelson, Heather M.; Kelsey, Karl T.; Karagas, Margaret R.; Moore, Jason H.

    2010-01-01

    A central goal of human genetics is to identify and characterize susceptibility genes for common complex human diseases. An important challenge in this endeavor is the modeling of gene-gene interaction or epistasis that can result in non-additivity of genetic effects. The multifactor dimensionality reduction (MDR) method was developed as a machine learning alternative to parametric logistic regression for detecting interactions in the absence of significant marginal effects. The goal of MDR is to reduce the dimensionality inherent in modeling combinations of polymorphisms using a computational approach called constructive induction. Here, we propose a Robust Multifactor Dimensionality Reduction (RMDR) method that performs constructive induction using a Fisher’s Exact Test rather than a predetermined threshold. The advantage of this approach is that only those genotype combinations that are determined to be statistically significant are considered in the MDR analysis. We use two simulation studies to demonstrate that this approach will increase the success rate of MDR when there are only a few genotype combinations that are significantly associated with case-control status. We show that there is no loss of success rate when this is not the case. We then apply the RMDR method to the detection of gene-gene interactions in genotype data from a population-based study of bladder cancer in New Hampshire. PMID:21091664
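
    The contrast this record draws between threshold-based and Fisher-based constructive induction can be illustrated as follows. This is a simplified sketch, not the published RMDR code: the "unknown" label for non-significant cells, the choice of comparing each cell against all remaining samples, the α of 0.05 and the genotype counts are assumptions made for the example.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d

    def prob(x):
        # Hypergeometric probability of x in the top-left cell given fixed margins
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    observed = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= observed * (1 + 1e-9))

def label_cells(cells, alpha=None):
    """Label each genotype combination as 'high', 'low', or 'unknown'.

    cells : dict mapping a genotype combination to (cases, controls)
    alpha : None gives threshold-only labelling; a number adds the Fisher filter
    """
    total_cases = sum(ca for ca, _ in cells.values())
    total_controls = sum(co for _, co in cells.values())
    threshold = total_cases / total_controls
    labels = {}
    for combo, (ca, co) in cells.items():
        if alpha is not None:
            p = fisher_exact_two_sided(ca, co, total_cases - ca, total_controls - co)
            if p >= alpha:
                labels[combo] = "unknown"  # not significant: left out of the model
                continue
        # Multiply instead of dividing so that cells with zero controls are safe
        labels[combo] = "high" if ca > threshold * co else "low"
    return labels

# Hypothetical (cases, controls) counts for four two-SNP genotype combinations
cells = {
    ("AA", "BB"): (40, 10),
    ("AA", "Bb"): (12, 10),
    ("Aa", "BB"): (9, 11),
    ("Aa", "Bb"): (10, 40),
}
classic = label_cells(cells)
robust = label_cells(cells, alpha=0.05)
print(classic)
print(robust)
```

    With the threshold alone, every cell is forced into a high-risk or low-risk group even when its case-control ratio barely departs from the overall ratio; with the Fisher filter, only cells whose imbalance is statistically significant keep a risk label.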

  3. Identification of New Genetic Susceptibility Loci for Breast Cancer Through Consideration of Gene-Environment Interactions

    PubMed Central

    Schoeps, Anja; Rudolph, Anja; Seibold, Petra; Dunning, Alison M.; Milne, Roger L.; Bojesen, Stig E.; Swerdlow, Anthony; Andrulis, Irene; Brenner, Hermann; Behrens, Sabine; Orr, Nicholas; Jones, Michael; Ashworth, Alan; Li, Jingmei; Cramp, Helen; Connley, Dan; Czene, Kamila; Darabi, Hatef; Chanock, Stephen J.; Lissowska, Jolanta; Figueroa, Jonine D.; Knight, Julia; Glendon, Gord; Mulligan, Anna M.; Dumont, Martine; Severi, Gianluca; Baglietto, Laura; Olson, Janet; Vachon, Celine; Purrington, Kristen; Moisse, Matthieu; Neven, Patrick; Wildiers, Hans; Spurdle, Amanda; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana M.; Hamann, Ute; Ko, Yon-Dschun; Dieffenbach, Aida K.; Arndt, Volker; Stegmaier, Christa; Malats, Núria; Arias Perez, José I.; Benítez, Javier; Flyger, Henrik; Nordestgaard, Børge G.; Truong, Thérèse; Cordina-Duverger, Emilie; Menegaux, Florence; Silva, Isabel dos Santos; Fletcher, Olivia; Johnson, Nichola; Häberle, Lothar; Beckmann, Matthias W.; Ekici, Arif B.; Braaf, Linde; Atsma, Femke; van den Broek, Alexandra J.; Makalic, Enes; Schmidt, Daniel F.; Southey, Melissa C.; Cox, Angela; Simard, Jacques; Giles, Graham G.; Lambrechts, Diether; Mannermaa, Arto; Brauch, Hiltrud; Guénel, Pascal; Peto, Julian; Fasching, Peter A.; Hopper, John; Flesch-Janys, Dieter; Couch, Fergus; Chenevix-Trench, Georgia; Pharoah, Paul D. P.; Garcia-Closas, Montserrat; Schmidt, Marjanka K.; Hall, Per; Easton, Douglas F.; Chang-Claude, Jenny

    2014-01-01

    Genes that alter disease risk only in combination with certain environmental exposures may not be detected in genetic association analysis. By using methods accounting for gene-environment (G × E) interaction, we aimed to identify novel genetic loci associated with breast cancer risk. Up to 34,475 cases and 34,786 controls of European ancestry from up to 23 studies in the Breast Cancer Association Consortium were included. Overall, 71,527 single nucleotide polymorphisms (SNPs), enriched for association with breast cancer, were tested for interaction with 10 environmental risk factors using three recently proposed hybrid methods and a joint test of association and interaction. Analyses were adjusted for age, study, population stratification, and confounding factors as applicable. Three SNPs in two independent loci showed statistically significant association: SNPs rs10483028 and rs2242714 in perfect linkage disequilibrium on chromosome 21 and rs12197388 in ARID1B on chromosome 6. While rs12197388 was identified using the joint test with parity and with age at menarche (P-values = 3 × 10⁻⁷), the variants on chromosome 21q22.12, which showed interaction with adult body mass index (BMI) in 8,891 postmenopausal women, were identified by all methods applied. SNP rs10483028 was associated with breast cancer in women with a BMI below 25 kg/m² (OR = 1.26, 95% CI 1.15–1.38) but not in women with a BMI of 30 kg/m² or higher (OR = 0.89, 95% CI 0.72–1.11, P for interaction = 3.2 × 10⁻⁵). Our findings confirm comparable power of the recent methods for detecting G × E interaction and the utility of using G × E interaction analyses to identify new susceptibility loci. PMID:24248812
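
    The idea of a SNP effect that differs between exposure strata, as with the BMI-specific odds ratios above, can be illustrated with a simple ratio-of-odds-ratios test. The sketch below is an assumption-laden illustration and not one of the hybrid methods or the joint test used in the study: it uses unadjusted 2 × 2 counts, Woolf standard errors, and a normal approximation, and all counts and names are hypothetical.

```python
from math import erfc, exp, log, sqrt


def log_odds_ratio(case_carrier, case_noncarrier, ctrl_carrier, ctrl_noncarrier):
    """Log odds ratio and its Woolf standard error from one 2x2 table."""
    log_or = log((case_carrier * ctrl_noncarrier) / (case_noncarrier * ctrl_carrier))
    se = sqrt(1 / case_carrier + 1 / case_noncarrier
              + 1 / ctrl_carrier + 1 / ctrl_noncarrier)
    return log_or, se


def interaction_test(stratum_a, stratum_b):
    """Compare SNP odds ratios between two exposure strata (multiplicative scale)."""
    lor_a, se_a = log_odds_ratio(*stratum_a)
    lor_b, se_b = log_odds_ratio(*stratum_b)
    diff = lor_a - lor_b
    z = diff / sqrt(se_a ** 2 + se_b ** 2)
    p_two_sided = erfc(abs(z) / sqrt(2))  # normal approximation
    return exp(diff), z, p_two_sided


# Hypothetical counts: (case carriers, case non-carriers,
#                       control carriers, control non-carriers).
lean = (300, 700, 250, 750)
obese = (100, 300, 110, 290)
ratio_of_ors, z, p_int = interaction_test(lean, obese)
print(f"ratio of ORs = {ratio_of_ors:.2f}, z = {z:.2f}, p = {p_int:.3f}")
```

    A ratio of odds ratios far from 1 indicates that the SNP association differs between strata. A full analysis would adjust for covariates such as age and study, as the abstract describes.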

  4. Investigation of gene-environment interactions between 47 newly identified breast cancer susceptibility loci and environmental risk factors.

    PubMed

    Rudolph, Anja; Milne, Roger L; Truong, Thérèse; Knight, Julia A; Seibold, Petra; Flesch-Janys, Dieter; Behrens, Sabine; Eilber, Ursula; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Dunning, Alison M; Shah, Mitul; Munday, Hannah R; Darabi, Hatef; Eriksson, Mikael; Brand, Judith S; Olson, Janet; Vachon, Celine M; Hallberg, Emily; Castelao, J Esteban; Carracedo, Angel; Torres, Maria; Li, Jingmei; Humphreys, Keith; Cordina-Duverger, Emilie; Menegaux, Florence; Flyger, Henrik; Nordestgaard, Børge G; Nielsen, Sune F; Yesilyurt, Betul T; Floris, Giuseppe; Leunen, Karin; Engelhardt, Ellen G; Broeks, Annegien; Rutgers, Emiel J; Glendon, Gord; Mulligan, Anna Marie; Cross, Simon; Reed, Malcolm; Gonzalez-Neira, Anna; Arias Perez, José Ignacio; Provenzano, Elena; Apicella, Carmel; Southey, Melissa C; Spurdle, Amanda; Häberle, Lothar; Beckmann, Matthias W; Ekici, Arif B; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; McLean, Catriona; Baglietto, Laura; Chanock, Stephen J; Lissowska, Jolanta; Sherman, Mark E; Brüning, Thomas; Hamann, Ute; Ko, Yon-Dschun; Orr, Nick; Schoemaker, Minouk; Ashworth, Alan; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana M; Mannermaa, Arto; Swerdlow, Anthony; Giles, Graham G; Brenner, Hermann; Fasching, Peter A; Chenevix-Trench, Georgia; Hopper, John; Benítez, Javier; Cox, Angela; Andrulis, Irene L; Lambrechts, Diether; Gago-Dominguez, Manuela; Couch, Fergus; Czene, Kamila; Bojesen, Stig E; Easton, Doug F; Schmidt, Marjanka K; Guénel, Pascal; Hall, Per; Pharoah, Paul D P; Garcia-Closas, Montserrat; Chang-Claude, Jenny

    2015-03-15

    A large genotyping project within the Breast Cancer Association Consortium (BCAC) recently identified 41 associations between single nucleotide polymorphisms (SNPs) and overall breast cancer (BC) risk. We investigated whether the effects of these 41 SNPs, as well as six SNPs associated with estrogen receptor (ER) negative BC risk, are modified by 13 environmental risk factors for BC. Data from 22 studies participating in BCAC were pooled, comprising up to 26,633 cases and 30,119 controls. Interactions between SNPs and environmental factors were evaluated using an empirical Bayes-type shrinkage estimator. Six SNPs showed interactions with associated p-values (p_int) < 1.1 × 10⁻³. None of the observed interactions was significant after accounting for multiple testing. The Bayesian False Discovery Probability was used to rank the findings, which indicated three interactions as being noteworthy at 1% prior probability of interaction. SNP rs6828523 was associated with increased ER-negative BC risk in women ≥170 cm (OR = 1.22, p = 0.017), but inversely associated with ER-negative BC risk in women <160 cm (OR = 0.83, p = 0.039, p_int = 1.9 × 10⁻⁴). The inverse association between rs4808801 and overall BC risk was stronger for women who had had four or more pregnancies (OR = 0.85, p = 2.0 × 10⁻⁴), and absent in women who had had just one (OR = 0.96, p = 0.19, p_int = 6.1 × 10⁻⁴). SNP rs11242675 was inversely associated with overall BC risk in never/former smokers (OR = 0.93, p = 2.8 × 10⁻⁵), but no association was observed in current smokers (OR = 1.07, p = 0.14, p_int = 3.4 × 10⁻⁴). In conclusion, recently identified BC susceptibility loci are not strongly modified by established risk factors and the observed potential interactions require confirmation in independent studies. © 2014 UICC.
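
    The statement that no interaction survived multiple-testing correction can be illustrated with a quick calculation. The sketch below assumes a plain Bonferroni correction at a family-wise level of 0.05 over every pairing of the 47 SNPs with the 13 risk factors. The abstract does not say which correction was applied or how many tests were actually run, so both are assumptions for illustration only.

```python
# Assumed for illustration: Bonferroni correction at alpha = 0.05 over every
# SNP-by-factor pair. The abstract does not state the correction actually used.
N_SNPS = 41 + 6   # overall-BC SNPs plus ER-negative SNPs
N_FACTORS = 13    # environmental risk factors
ALPHA = 0.05

n_tests = N_SNPS * N_FACTORS
bonferroni_threshold = ALPHA / n_tests

# Interaction p-values quoted in the abstract.
reported_p_int = {
    "rs6828523 x height": 1.9e-4,
    "rs4808801 x number of pregnancies": 6.1e-4,
    "rs11242675 x smoking": 3.4e-4,
}

for label, p in reported_p_int.items():
    verdict = "passes" if p < bonferroni_threshold else "does not pass"
    print(f"{label}: p_int = {p:.1e} {verdict} the threshold {bonferroni_threshold:.1e}")
```

    Under these assumptions each quoted interaction p-value exceeds the threshold, which is consistent with the abstract's conclusion.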

  5. Gene-environment interactions in cancer epidemiology: a National Cancer Institute Think Tank report.

    PubMed

    Hutter, Carolyn M; Mechanic, Leah E; Chatterjee, Nilanjan; Kraft, Peter; Gillanders, Elizabeth M

    2013-11-01

    Cancer risk is determined by a complex interplay of genetic and environmental factors. Genome-wide association studies (GWAS) have identified hundreds of common (minor allele frequency [MAF] > 0.05) and less common (0.01 < MAF < 0.05) genetic variants associated with cancer. The marginal effects of most of these variants have been small (odds ratios: 1.1-1.4). There remain unanswered questions on how best to incorporate the joint effects of genes and environment, including gene-environment (G × E) interactions, into epidemiologic studies of cancer. To help address these questions, and to better inform research priorities and allocation of resources, the National Cancer Institute sponsored a "Gene-Environment Think Tank" on January 10-11, 2012. The objective of the Think Tank was to facilitate discussions on (1) the state of the science, (2) the goals of G × E interaction studies in cancer epidemiology, and (3) opportunities for developing novel study designs and analysis tools. This report summarizes the Think Tank discussion, with a focus on contemporary approaches to the analysis of G × E interactions. Selecting the appropriate methods requires first identifying the relevant scientific question and rationale, with an important distinction made between analyses aiming to characterize the joint effects of putative or established genetic and environmental factors and analyses aiming to discover novel risk factors or novel interaction effects. Other discussion items include measurement error, statistical power, significance, and replication. Additional designs, exposure assessments, and analytical approaches need to be considered as we move from the current small number of success stories to a fuller understanding of the interplay of genetic and environmental factors. © 2013 WILEY PERIODICALS, INC.

  6. Gene-Environment Interactions in Asthma: Genetic and Epigenetic Effects.

    PubMed

    Lee, Jong-Uk; Kim, Jeong Dong; Park, Choon-Sik

    2015-07-01

    Over the past three decades, a large number of genetic studies have been aimed at finding genetic variants associated with the risk of asthma, applying various genetic and genomic approaches including linkage analysis, candidate gene polymorphism studies, and genome-wide association studies (GWAS). However, contrary to general expectation, even single nucleotide polymorphisms (SNPs) discovered by GWAS failed to fully explain the heritability of asthma. Thus, application of rare allele polymorphisms in well defined phenotypes and clarification of environmental factors have been suggested to overcome the problem of 'missing' heritability. Such factors include allergens, cigarette smoke, air pollutants, and infectious agents during pre- and post-natal periods. The first and simplest interaction between a gene and the environment is a candidate interaction of both a well known gene and environmental factor in a direct physical or chemical interaction such as between CD14 and endotoxin or between HLA and allergens. Several GWAS have found environmental interactions with occupational asthma, aspirin exacerbated respiratory disease, tobacco smoke-related airway dysfunction, and farm-related atopic diseases. As one of the mechanisms behind gene-environment interaction is epigenetics, a few studies on DNA CpG methylation have been reported on subphenotypes of asthma, pitching the exciting idea that it may be possible to intervene at the junction between the genome and the environment. Epigenetic studies are starting to include data from clinical samples, which will make them another powerful tool for research on gene-environment interactions in asthma.

  7. The interaction of BDNF and NTRK2 gene increases the susceptibility of paranoid schizophrenia.

    PubMed

    Lin, Zheng; Su, Yousong; Zhang, Chengfang; Xing, Mengjuan; Ding, Wenhua; Liao, Liwei; Guan, Yangtai; Li, Zezhi; Cui, Donghong

    2013-01-01

    The association between the BDNF gene functional Val66Met polymorphism rs6265 and schizophrenia is far from being consistent. In addition to the heterogeneity of schizophrenia per se leading to the inconsistent results, the interaction among multiple genes, rather than a single gene, probably plays the main role in the pathogenesis of schizophrenia. Neurotrophic tyrosine kinase receptor 2 (NTRK2) is the high-affinity receptor of BDNF, and was reported to be associated with mood disorders, though no literature reported the association with schizophrenia. Thus, in the present study, a total of 402 patients with paranoid schizophrenia (the most common subtype of schizophrenia) and 406 matched healthy controls were recruited to investigate the role of rs6265 in BDNF, three polymorphisms in the NTRK2 gene (rs1387923, rs2769605 and rs1565445) and their interaction in the susceptibility to paranoid schizophrenia in a Chinese Han population. We did not observe significant differences in allele and genotype frequencies between patients and healthy controls for all four polymorphisms separately. The haplotype analysis also showed no association between haplotypes of the NTRK2 gene (rs1387923, rs2769605, and rs1565445) and paranoid schizophrenia. However, we found an association between the interaction of BDNF and NTRK2 and paranoid schizophrenia by using the MDR method followed by conventional statistical analysis. The best gene-gene interaction model was a three-locus model (BDNF rs6265, NTRK2 rs1387923 and NTRK2 rs2769605), in which one low-risk and three high-risk four-locus genotype combinations were identified. Our findings implied that the single polymorphisms rs6265, rs1387923, rs2769605, and rs1565445 in BDNF and NTRK2 were not associated with the development of paranoid schizophrenia in a Han population; however, the interaction of BDNF and NTRK2 gene polymorphisms (BDNF-rs6265, NTRK2-rs1387923 and NTRK2-rs2769605) may be involved in the susceptibility to paranoid schizophrenia.

  8. The Interaction of BDNF and NTRK2 Gene Increases the Susceptibility of Paranoid Schizophrenia

    PubMed Central

    Zhang, Chengfang; Xing, Mengjuan; Ding, Wenhua; Liao, Liwei; Guan, Yangtai; Li, Zezhi; Cui, Donghong

    2013-01-01

    The association between the BDNF gene functional Val66Met polymorphism rs6265 and schizophrenia is far from being consistent. In addition to the heterogeneity of schizophrenia per se leading to the inconsistent results, the interaction among multiple genes, rather than a single gene, probably plays the main role in the pathogenesis of schizophrenia. Neurotrophic tyrosine kinase receptor 2 (NTRK2) is the high-affinity receptor of BDNF, and was reported to be associated with mood disorders, though no literature reported the association with schizophrenia. Thus, in the present study, a total of 402 patients with paranoid schizophrenia (the most common subtype of schizophrenia) and 406 matched healthy controls were recruited to investigate the role of rs6265 in BDNF, three polymorphisms in the NTRK2 gene (rs1387923, rs2769605 and rs1565445) and their interaction in the susceptibility to paranoid schizophrenia in a Chinese Han population. We did not observe significant differences in allele and genotype frequencies between patients and healthy controls for all four polymorphisms separately. The haplotype analysis also showed no association between haplotypes of the NTRK2 gene (rs1387923, rs2769605, and rs1565445) and paranoid schizophrenia. However, we found an association between the interaction of BDNF and NTRK2 and paranoid schizophrenia by using the MDR method followed by conventional statistical analysis. The best gene-gene interaction model was a three-locus model (BDNF rs6265, NTRK2 rs1387923 and NTRK2 rs2769605), in which one low-risk and three high-risk four-locus genotype combinations were identified. Our findings implied that the single polymorphisms rs6265, rs1387923, rs2769605, and rs1565445 in BDNF and NTRK2 were not associated with the development of paranoid schizophrenia in a Han population; however, the interaction of BDNF and NTRK2 gene polymorphisms (BDNF-rs6265, NTRK2-rs1387923 and NTRK2-rs2769605) may be involved in the susceptibility to paranoid schizophrenia.

  9. Specification, testing, and interpretation of gene-by-measured-environment interaction models in the presence of gene-environment correlation

    PubMed Central

    Rathouz, Paul J.; Van Hulle, Carol A.; Lee Rodgers, Joseph; Waldman, Irwin D.; Lahey, Benjamin B.

    2009-01-01

    Purcell (2002) proposed a bivariate biometric model for testing and quantifying the interaction between latent genetic influences and measured environments in the presence of gene-environment correlation. Purcell’s model extends the Cholesky model to include gene-environment interaction. We examine a number of closely-related alternative models that do not involve gene-environment interaction but which may fit the data as well as Purcell’s model. Because failure to consider these alternatives could lead to spurious detection of gene-environment interaction, we propose alternative models for testing gene-environment interaction in the presence of gene-environment correlation, including one based on the correlated factors model. In addition, we note mathematical errors in the calculation of effect size via variance components in Purcell’s model. We propose a statistical method for deriving and interpreting variance decompositions that are true to the fitted model. PMID:18293078

  10. Microbe–microbe interactions trigger Mn(II)-oxidizing gene expression

    PubMed Central

    Liang, Jinsong; Bai, Yaohui; Men, Yujie; Qu, Jiuhui

    2017-01-01

    Manganese (Mn) is an important metal in geochemical cycles. Some microorganisms can oxidize Mn(II) to Mn oxides, which can, in turn, affect the global cycles of other elements by strong sorption and oxidation effects. Microbe–microbe interactions have important roles in a number of biological processes. However, how microbial interactions affect Mn(II) oxidation still remains unknown. Here, we investigated the interactions between two bacteria (Arthrobacter sp. and Sphingopyxis sp.) in a co-culture, which exhibited Mn(II)-oxidizing activity, although neither was able to oxidize Mn(II) in isolation. We demonstrated that the Mn(II)-oxidizing activity in co-culture was most likely induced via contact-dependent interactions. The expressed Mn(II)-oxidizing protein in the co-culture was purified and identified as a bilirubin oxidase belonging to strain Arthrobacter. Full sequencing of the bilirubin oxidase-encoding gene (boxA) was performed. The Mn(II)-oxidizing protein and the transcripts of boxA were detected in the co-culture, but not in either of the isolated cultures. This indicates that boxA was silent in Arthrobacter monoculture, and was activated in response to the presence of Sphingopyxis in the co-culture. Further, transcriptomic analysis by RNA-Seq, extracellular superoxide detection and cell density quantification by flow cytometry indicate that induction of boxA gene expression in Arthrobacter was coincident with a stress response triggered by co-cultivation with Sphingopyxis. Our findings suggest the potential roles of microbial physiological responses to stress induced by other microbes in Mn(II) oxidation and extracellular superoxide production. PMID:27518809

  11. A novel method to identify pathways associated with renal cell carcinoma based on a gene co-expression network

    PubMed Central

    Ruan, Xiyun; Li, Hongyun; Liu, Bo; Chen, Jie; Zhang, Shibao; Sun, Zeqiang; Liu, Shuangqing; Sun, Fahai; Liu, Qingyong

    2015-01-01

    The aim of the present study was to develop a novel method for identifying pathways associated with renal cell carcinoma (RCC) based on a gene co-expression network. A framework was established where a co-expression network was derived from the database as well as various co-expression approaches. First, the backbone of the network based on differentially expressed (DE) genes between RCC patients and normal controls was constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database. The differentially co-expressed links were detected by Pearson’s correlation, the empirical Bayesian (EB) approach and Weighted Gene Co-expression Network Analysis (WGCNA). The co-expressed gene pairs were merged by a rank-based algorithm. We obtained 842, 371, 2,883 and 1,595 co-expressed gene pairs from the co-expression networks of the STRING database, Pearson’s correlation, the EB method and WGCNA, respectively. Two hundred and eighty-one differentially co-expressed (DC) gene pairs were obtained from the merged network using this novel method. Pathway enrichment analysis based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and the network enrichment analysis (NEA) method were performed to verify feasibility of the merged method. Results of the KEGG and NEA pathway analyses showed that the network was associated with RCC. The suggested method was computationally efficient to identify pathways associated with RCC and has been identified as a useful complement to traditional co-expression analysis. PMID:26058425

  12. Gene-Trap Mutagenesis Identifies Mammalian Genes Contributing to Intoxication by Clostridium perfringens ε-Toxin

    PubMed Central

    Ivie, Susan E.; Fennessey, Christine M.; Sheng, Jinsong; Rubin, Donald H.; McClain, Mark S.

    2011-01-01

    The Clostridium perfringens ε-toxin is an extremely potent toxin associated with lethal toxemias in domesticated ruminants and may be toxic to humans. Intoxication results in fluid accumulation in various tissues, most notably in the brain and kidneys. Previous studies suggest that the toxin is a pore-forming toxin, leading to dysregulated ion homeostasis and ultimately cell death. However, mammalian host factors that likely contribute to ε-toxin-induced cytotoxicity are poorly understood. A library of insertional mutant Madin Darby canine kidney (MDCK) cells, which are highly susceptible to the lethal effects of ε-toxin, was used to select clones of cells resistant to ε-toxin-induced cytotoxicity. The genes mutated in 9 surviving resistant cell clones were identified. We focused additional experiments on one of the identified genes as a means of validating the experimental approach. Gene expression microarray analysis revealed that one of the identified genes, hepatitis A virus cellular receptor 1 (HAVCR1, KIM-1, TIM1), is more abundantly expressed in human kidney cell lines than it is expressed in human cells known to be resistant to ε-toxin. One human kidney cell line, ACHN, was found to be sensitive to the toxin and expresses a larger isoform of the HAVCR1 protein than the HAVCR1 protein expressed by other, toxin-resistant human kidney cell lines. RNA interference studies in MDCK and in ACHN cells confirmed that HAVCR1 contributes to ε-toxin-induced cytotoxicity. Additionally, ε-toxin was shown to bind to HAVCR1 in vitro. The results of this study indicate that HAVCR1 and the other genes identified through the use of gene-trap mutagenesis and RNA interference strategies represent important targets for investigation of the process by which ε-toxin induces cell death and new targets for potential therapeutic intervention. PMID:21412435

  13. Gene-trap mutagenesis identifies mammalian genes contributing to intoxication by Clostridium perfringens ε-toxin.

    PubMed

    Ivie, Susan E; Fennessey, Christine M; Sheng, Jinsong; Rubin, Donald H; McClain, Mark S

    2011-03-11

    The Clostridium perfringens ε-toxin is an extremely potent toxin associated with lethal toxemias in domesticated ruminants and may be toxic to humans. Intoxication results in fluid accumulation in various tissues, most notably in the brain and kidneys. Previous studies suggest that the toxin is a pore-forming toxin, leading to dysregulated ion homeostasis and ultimately cell death. However, mammalian host factors that likely contribute to ε-toxin-induced cytotoxicity are poorly understood. A library of insertional mutant Madin Darby canine kidney (MDCK) cells, which are highly susceptible to the lethal effects of ε-toxin, was used to select clones of cells resistant to ε-toxin-induced cytotoxicity. The genes mutated in 9 surviving resistant cell clones were identified. We focused additional experiments on one of the identified genes as a means of validating the experimental approach. Gene expression microarray analysis revealed that one of the identified genes, hepatitis A virus cellular receptor 1 (HAVCR1, KIM-1, TIM1), is more abundantly expressed in human kidney cell lines than it is expressed in human cells known to be resistant to ε-toxin. One human kidney cell line, ACHN, was found to be sensitive to the toxin and expresses a larger isoform of the HAVCR1 protein than the HAVCR1 protein expressed by other, toxin-resistant human kidney cell lines. RNA interference studies in MDCK and in ACHN cells confirmed that HAVCR1 contributes to ε-toxin-induced cytotoxicity. Additionally, ε-toxin was shown to bind to HAVCR1 in vitro. The results of this study indicate that HAVCR1 and the other genes identified through the use of gene-trap mutagenesis and RNA interference strategies represent important targets for investigation of the process by which ε-toxin induces cell death and new targets for potential therapeutic intervention.

  14. Gene-gene interactions among genetic variants from obesity candidate genes for nonobese and obese populations in type 2 diabetes.

    PubMed

    Lin, Eugene; Pei, Dee; Huang, Yi-Jen; Hsieh, Chang-Hsun; Wu, Lawrence Shih-Hsin

    2009-08-01

    Recent studies indicate that obesity may play a key role in modulating genetic predispositions to type 2 diabetes (T2D). This study examines the main effects of both single-locus and multilocus interactions among genetic variants in Taiwanese obese and nonobese individuals to test the hypothesis that obesity-related genes may contribute to the etiology of T2D independently and/or through such complex interactions. We genotyped 11 single nucleotide polymorphisms for 10 obesity candidate genes including adrenergic beta-2-receptor surface, adrenergic beta-3-receptor surface, angiotensinogen, fat mass and obesity associated gene, guanine nucleotide binding protein beta polypeptide 3 (GNB3), interleukin 6 receptor, proprotein convertase subtilisin/kexin type 1 (PCSK1), uncoupling protein 1, uncoupling protein 2, and uncoupling protein 3. There were 389 patients diagnosed with T2D and 186 age- and sex-matched controls. Single-locus analyses showed significant main effects of the GNB3 and PCSK1 genes on the risk of T2D among the nonobese group (p = 0.002 and 0.047, respectively). Further, interactions involving GNB3 and PCSK1 were suggested among the nonobese population using the generalized multifactor dimensionality reduction method (p = 0.001). In addition, interactions among angiotensinogen, fat mass and obesity associated gene, GNB3, and uncoupling protein 3 genes were found in a significant four-locus generalized multifactor dimensionality reduction model among the obese population (p = 0.001). The results suggest that the single nucleotide polymorphisms from the obesity candidate genes may contribute to the risk of T2D independently and/or in an interactive manner according to the presence or absence of obesity.

  15. Interactions of HIPPI, a molecular partner of Huntingtin interacting protein HIP1, with the specific motif present at the putative promoter sequence of the caspase-1, caspase-8 and caspase-10 genes.

    PubMed

    Majumder, P; Choudhury, A; Banerjee, M; Lahiri, A; Bhattacharyya, N P

    2007-08-01

    To investigate the mechanism of increased expression of caspase-1 caused by exogenous Hippi, observed earlier in HeLa and Neuro2A cells, in this work we identified a specific motif AAAGACATG (-101 to -93) at the caspase-1 gene upstream sequence where HIPPI could bind. Various mutations in this specific sequence compromised the interaction, showing the specificity of the interactions. In the luciferase reporter assay, when the reporter gene was driven by caspase-1 gene upstream sequences (-151 to -92) with the mutation G to T at position -98, luciferase activity was decreased significantly in green fluorescent protein-Hippi-expressing HeLa cells in comparison to that obtained with the wild-type caspase-1 gene 60 bp upstream sequence, indicating the biological significance of such binding. It was observed that the C-terminal 'pseudo' death effector domain of HIPPI interacted with the 60 bp (-151 to -92) upstream sequence of the caspase-1 gene containing the motif. We further observed that expression of caspase-8 and caspase-10 was increased in green fluorescent protein-Hippi-expressing HeLa cells. In addition, HIPPI interacted in vitro with putative promoter sequences of these genes, containing a similar motif. In summary, we identified a novel function of HIPPI; it binds to specific upstream sequences of the caspase-1, caspase-8 and caspase-10 genes and alters the expression of the genes. This result showed the motif-specific interaction of HIPPI with DNA, and indicates that it could act as a transcription regulator.

  16. Cross-species microarray hybridization to identify developmentally regulated genes in the filamentous fungus Sordaria macrospora.

    PubMed

    Nowrousian, Minou; Ringelberg, Carol; Dunlap, Jay C; Loros, Jennifer J; Kück, Ulrich

    2005-04-01

    The filamentous fungus Sordaria macrospora forms complex three-dimensional fruiting bodies that protect the developing ascospores and ensure their proper discharge. Several regulatory genes essential for fruiting body development were previously isolated by complementation of the sterile mutants pro1, pro11 and pro22. To establish the genetic relationships between these genes and to identify downstream targets, we have conducted cross-species microarray hybridizations using cDNA arrays derived from the closely related fungus Neurospora crassa and RNA probes prepared from wild-type S. macrospora and the three developmental mutants. Of the 1,420 genes which gave a signal with the probes from all the strains used, 172 (12%) were regulated differently in at least one of the three mutants compared to the wild type, and 17 (1.2%) were regulated differently in all three mutant strains. Microarray data were verified by Northern analysis or quantitative real time PCR. Among the genes that are up- or down-regulated in the mutant strains are genes encoding the pheromone precursors, enzymes involved in melanin biosynthesis and a lectin-like protein. Analysis of gene expression in double mutants revealed a complex network of interaction between the pro gene products.

  17. The Interaction of TXNIP and AF1q Genes Increases the Susceptibility of Schizophrenia.

    PubMed

    Su, Yousong; Ding, Wenhua; Xing, Mengjuan; Qi, Dake; Li, Zezhi; Cui, Donghong

    2017-08-01

    Although previous studies showed a reduced risk of cancer in patients with schizophrenia, whether patients with schizophrenia possess genetic factors that also contribute to tumor suppression is still unknown. In the present study, based on our previous microarray data, we focused on the tumor suppressor genes TXNIP and AF1q, which are differentially expressed in patients with schizophrenia. A total of 413 patients and 578 healthy controls were recruited. We found no significant differences in genotype, allele, or haplotype frequencies at the selected five single nucleotide polymorphisms (SNPs) (rs2236566 and rs7211 in the TXNIP gene; rs10749659, rs2140709, and rs3738481 in the AF1q gene) between patients with schizophrenia and controls. However, we found an association between the interaction of TXNIP and AF1q and schizophrenia by using the MDR method followed by traditional statistical analysis. The best gene-gene interaction model identified was a three-locus model TXNIP (rs2236566, rs7211)-AF1q (rs2140709). After traditional statistical analysis, we found the high-risk genotype combination was rs2236566 (GG)-rs7211(CC)-rs2140709(CC) (OR = 1.35 [1.03-1.76]). The low-risk genotype combination was rs2236566 (GT)-rs7211(CC)-rs2140709(CC) (OR = 0.67 [0.49-0.91]). Our finding suggested a statistically significant role of the interaction of TXNIP and AF1q polymorphisms (TXNIP-rs2236566, TXNIP-rs7211, and AF1q-rs2140709) in schizophrenia susceptibility.

  18. RNA-Seq Meta-analysis identifies genes in skeletal muscle associated with gain and intake across a multi-season study of crossbred beef steers.

    PubMed

    Keel, Brittney N; Zarek, Christina M; Keele, John W; Kuehn, Larry A; Snelling, Warren M; Oliver, William T; Freetly, Harvey C; Lindholm-Perry, Amanda K

    2018-06-04

    Feed intake and body weight gain are economically important inputs and outputs of beef production systems. The purpose of this study was to discover differentially expressed genes that will be robust for feed intake and gain across a large segment of the cattle industry. Transcriptomic studies often suffer from issues with reproducibility and cross-validation. One way to improve reproducibility is by integrating multiple datasets via meta-analysis. RNA sequencing (RNA-Seq) was performed on longissimus dorsi muscle from 80 steers (5 cohorts, each with 16 animals) selected from the outside fringe of a bivariate gain and feed intake distribution to understand the genes and pathways involved in feed efficiency. In each cohort, 16 steers were selected from one of four gain and feed intake phenotypes (n = 4 per phenotype) in a 2 × 2 factorial arrangement with gain and feed intake as main effect variables. Each cohort was analyzed as a single experiment using a generalized linear model and results from the 5 cohort analyses were combined in a meta-analysis to identify differentially expressed genes (DEG) across the cohorts. A total of 51 genes were differentially expressed for the main effect of gain, 109 genes for the intake main effect, and 11 genes for the gain × intake interaction (corrected P < 0.05). A jackknife sensitivity analysis showed that, in general, the meta-analysis produced robust DEGs for the two main effects and their interaction. Pathways identified from over-represented genes included mitochondrial energy production and oxidative stress pathways for the main effect of gain due to DEG including GPD1, NDUFA6, UQCRQ, ACTC1, and MGST3. For intake, metabolic pathways including amino acid biosynthesis and degradation were identified, and for the interaction analysis the pathways identified included GADD45, pyridoxal 5'phosphate salvage, and caveolar mediated endocytosis signaling. Variation among DEG identified by cohort suggests that

  19. APOE Modulates the Correlation Between Triglycerides, Cholesterol, and CHD Through Pleiotropy, and Gene-by-Gene Interactions

    PubMed Central

    Maxwell, Taylor J.; Ballantyne, Christie M.; Cheverud, James M.; Guild, Cameron S.; Ndumele, Chiadi E.; Boerwinkle, Eric

    2013-01-01

    Relationship loci (rQTL) exist when the correlation between multiple traits varies by genotype. rQTL often occur due to gene-by-gene (G × G) or gene-by-environmental interactions, making them a powerful tool for detecting G × G. Here we present an empirical analysis of apolipoprotein E (APOE) with respect to lipid traits and incident CHD leading to the discovery of loci that interact with APOE to affect these traits. We found that the relationship between total cholesterol (TC) and triglycerides (ln TG) varies by APOE isoform genotype in African-American (AA) and European-American (EA) populations. The e2 allele is associated with strong correlation between ln TG and TC while the e4 allele leads to little or no correlation. This led to a priori hypotheses that APOE genotypes affect the relationship of TC and/or ln TG with incident CHD. We found that APOE*TC was significant (P = 0.016) for AA but not EA while APOE*ln TG was significant for EA (P = 0.027) but not AA. In both cases, e2e2 and e2e3 had strong relationships between TC and ln TG with CHD while e2e4 and e4e4 resulted in little or no relationship between TC and ln TG with CHD. Using ARIC GWAS data, scans for loci that significantly interact with APOE produced four loci for African Americans (one CHD, one TC, and two HDL). These interactions contribute to the rQTL pattern. rQTL are a powerful tool to identify loci that modify the relationship between risk factors and disease and substantially increase statistical power for detecting G × G. PMID:24097412
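
    The rQTL idea in the record above, that the correlation between two traits differs by genotype, can be illustrated with a small calculation. The following is a minimal sketch, not the authors' analysis: it groups observations by genotype and computes a Pearson correlation within each group. The function names and the toy data are hypothetical and were made up for illustration only.

```python
from collections import defaultdict
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def correlation_by_genotype(records):
    """Map each genotype to the trait-trait correlation among its carriers.

    `records` is an iterable of (genotype, trait_a, trait_b) tuples.
    """
    groups = defaultdict(lambda: ([], []))
    for genotype, trait_a, trait_b in records:
        groups[genotype][0].append(trait_a)
        groups[genotype][1].append(trait_b)
    return {g: pearson(a, b) for g, (a, b) in groups.items()}


# Made-up toy data: (genotype, trait A, trait B). Not real measurements.
toy_records = [
    ("e2e3", 1.0, 2.0), ("e2e3", 2.0, 4.0), ("e2e3", 3.0, 6.0),
    ("e4e4", 1.0, 2.0), ("e4e4", 2.0, 1.0), ("e4e4", 3.0, 2.0),
]
by_genotype = correlation_by_genotype(toy_records)
```

    A locus would behave like an rQTL when these per-genotype correlations differ by more than sampling error allows; a formal test of that difference is outside the scope of this sketch.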

  20. Shame and Guilt-Proneness in Adolescents: Gene-Environment Interactions.

    PubMed

    Szentágotai-Tătar, Aurora; Chiș, Adina; Vulturar, Romana; Dobrean, Anca; Cândea, Diana Mirela; Miu, Andrei C

    2015-01-01

    Rooted in people's preoccupation with how they are perceived and evaluated, shame and guilt are self-conscious emotions that play adaptive roles in social behavior, but can also contribute to psychopathology when dysregulated. Shame and guilt-proneness develop during childhood and adolescence, and are influenced by genetic and environmental factors that are little known to date. This study investigated the effects of early traumatic events and functional polymorphisms in the brain-derived neurotrophic factor (BDNF) gene and the serotonin transporter gene promoter (5-HTTLPR) on shame and guilt in adolescents. A sample of N = 271 healthy adolescents between 14 and 17 years of age filled in measures of early traumatic events and proneness to shame and guilt, and were genotyped for the BDNF Val66Met and 5-HTTLPR polymorphisms. Results of moderator analyses indicated that trauma intensity was positively associated with guilt-proneness only in carriers of the low-expressing Met allele of BDNF Val66Met. This is the first study that identifies a gene-environment interaction that significantly contributes to guilt proneness in adolescents, with potential implications for developmental psychopathology.

  1. Use of Phage Display to Identify Novel Mineralocorticoid Receptor-Interacting Proteins

    PubMed Central

    Yang, Jun; Fuller, Peter J.; Morgan, James; Shibata, Hirotaka; McDonnell, Donald P.; Clyne, Colin D.

    2014-01-01

    The mineralocorticoid receptor (MR) plays a central role in salt and water homeostasis via the kidney; however, inappropriate activation of the MR in the heart can lead to heart failure. A selective MR modulator that antagonizes MR signaling in the heart but not the kidney would provide the cardiovascular protection of current MR antagonists but allow for normal electrolyte balance. The development of such a pharmaceutical requires an understanding of coregulators and their tissue-selective interactions with the MR, which is currently limited by the small repertoire of MR coregulators described in the literature. To identify potential novel MR coregulators, we used T7 phage display to screen tissue-selective cDNA libraries for MR-interacting proteins. Thirty MR binding peptides were identified, from which three were chosen for further characterization based on their nuclear localization and their interaction with other MR-interacting proteins or, in the case of x-ray repair cross-complementing protein 6, its known status as an androgen receptor coregulator. Eukaryotic elongation factor 1A1, structure-specific recognition protein 1, and x-ray repair cross-complementing protein 6 modulated MR-mediated transcription in a ligand-, cell- and/or promoter-specific manner and colocalized with the MR upon agonist treatment when imaged using immunofluorescence microscopy. These results highlight the utility of phage display for rapid and sensitive screening of MR binding proteins and suggest that eukaryotic elongation factor 1A1, structure-specific recognition protein 1, and x-ray repair cross-complementing protein 6 may be potential MR coactivators whose activity is dependent on the ligand, cellular context, and target gene promoter. PMID:25000480

  2. Inferring Gene Family Histories in Yeast Identifies Lineage Specific Expansions

    PubMed Central

    Ames, Ryan M.; Money, Daniel; Lovell, Simon C.

    2014-01-01

    The complement of genes found in the genome is a balance between gene gain and gene loss. Knowledge of the specific genes that are gained and lost over evolutionary time allows an understanding of the evolution of biological functions. Here we use new evolutionary models to infer gene family histories across complete yeast genomes; these models allow us to estimate the relative genome-wide rates of gene birth, death, innovation and extinction (loss of an entire family) for the first time. We show that the rates of gene family evolution vary both between gene families and between species. We are also able to identify those families that have experienced rapid lineage specific expansion/contraction and show that these families are enriched for specific functions. Moreover, we find that families with specific functions are repeatedly expanded in multiple species, suggesting the presence of common adaptations and that these family expansions/contractions are not random. Additionally, we identify potential specialisations, unique to specific species, in the functions of lineage specific expanded families. These results suggest that an important mechanism in the evolution of genome content is the presence of lineage-specific gene family changes. PMID:24921666

  3. Genexpi: a toolset for identifying regulons and validating gene regulatory networks using time-course expression data.

    PubMed

    Modrák, Martin; Vohradský, Jiří

    2018-04-13

    Identifying regulons of sigma factors is a vital subtask of gene network inference. Integrating multiple sources of data is essential for correct identification of regulons and complete gene regulatory networks. Time series of expression data measured with microarrays or RNA-seq combined with static binding experiments (e.g., ChIP-seq) or literature mining may be used for inference of sigma factor regulatory networks. We introduce Genexpi: a tool to identify sigma factors by combining candidates obtained from ChIP experiments or literature mining with time-course gene expression data. While Genexpi can be used to infer other types of regulatory interactions, it was designed and validated on real biological data from bacterial regulons. In this paper, we put primary focus on CyGenexpi: a plugin integrating Genexpi with the Cytoscape software for ease of use. As a part of this effort, a plugin for handling time series data in Cytoscape called CyDataseries has been developed and made available. Genexpi is also available as a standalone command line tool and an R package. Genexpi is a useful part of the gene network inference toolbox. It provides meaningful information about the composition of regulons and delivers biologically interpretable results.

  4. You've gotta be lucky: Coverage and the elusive gene-gene interaction.

    PubMed

    Reimherr, Matthew; Nicolae, Dan L

    2011-01-01

    Genome-wide association studies (GWAS) have led to a large number of single-SNP association findings, but there has been, so far, no investigation resulting in the discovery of a replicable gene-gene interaction. In this paper, we examine some of the possible explanations for the lack of findings, and argue that coverage of causal variation not only has a large effect on the loss in power, but that the effect is larger than in the single-SNP analyses. We show that the product of linkage disequilibrium measures, r², between causal and tested SNPs offers a good approximation to the loss in efficiency as defined by the ratio of sample sizes that lead to similar power. We also demonstrate that, in addition to the huge search space, the loss in power due to coverage when using commercially available platforms makes the search for gene-gene interactions daunting.
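
    The approximation stated in the record above lends itself to a short worked example. The sketch below assumes, following the abstract's wording, that the relative efficiency of a two-locus interaction test is roughly the product of the two r² values between each causal SNP and the SNP actually tested, so that the sample size needed for similar power grows by the reciprocal of that product. The function names and the example values are hypothetical and are not taken from the paper.

```python
def interaction_efficiency(r2_first: float, r2_second: float) -> float:
    """Approximate relative efficiency of testing tag SNPs instead of causal SNPs."""
    for r2 in (r2_first, r2_second):
        if not 0.0 < r2 <= 1.0:
            raise ValueError("r2 values must lie in (0, 1]")
    return r2_first * r2_second


def required_sample_size(n_causal: float, r2_first: float, r2_second: float) -> float:
    """Sample size needed with tag SNPs to roughly match the power reached
    with n_causal samples when the causal SNPs are typed directly."""
    return n_causal / interaction_efficiency(r2_first, r2_second)


# Illustrative values only: both causal SNPs tagged at r2 = 0.8.
efficiency = interaction_efficiency(0.8, 0.8)
n_needed = required_sample_size(1000, 0.8, 0.8)
```

    With these illustrative inputs the efficiency is about 0.64, compared with 0.8 for a single-SNP test at the same r², which is the sense in which incomplete coverage costs more for interactions than for single-SNP analyses.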

  5. Virus-Plus-Susceptibility Gene Interaction Determines Crohn’s Disease Gene Atg16L1 Phenotypes in Intestine

    PubMed Central

    Cadwell, Ken; Patel, Khushbu K.; Maloney, Nicole S.; Liu, Ta-Chiang; Ng, Aylwin C.Y.; Storer, Chad E.; Head, Richard D.; Xavier, Ramnik; Stappenbeck, Thaddeus S.; Virgin, Herbert W.

    2010-01-01

    It is unclear why disease occurs in only a small proportion of persons carrying common risk alleles of disease susceptibility genes. Here we demonstrate that an interaction between a specific virus infection and a mutation in the Crohn’s disease susceptibility gene Atg16L1 induces intestinal pathologies in mice. This virus-plus-susceptibility gene interaction generated abnormalities in granule packaging and unique patterns of gene expression in Paneth cells. Further, the response to injury induced by the toxic substance dextran sodium sulfate was fundamentally altered to include pathologies resembling aspects of Crohn’s disease. These pathologies triggered by virus-plus-susceptibility gene interaction were dependent on TNFα and IFNγ and were prevented by treatment with broad spectrum antibiotics. Thus, we provide a specific example of how a virus-plus-susceptibility gene interaction can, in combination with additional environmental factors and commensal bacteria, determine the phenotype of hosts carrying common risk alleles for inflammatory disease. PMID:20602997

  6. Gene Environment Interactions and Predictors of Colorectal Cancer in Family-Based, Multi-Ethnic Groups.

    PubMed

    Shiao, S Pamela K; Grayson, James; Yu, Chong Ho; Wasek, Brandi; Bottiglieri, Teodoro

    2018-02-16

    For the personalization of polygenic/omics-based health care, the purpose of this study was to examine the gene-environment interactions and predictors of colorectal cancer (CRC) by including five key genes in the one-carbon metabolism pathways. In this proof-of-concept study, we included a total of 54 families and 108 participants, 54 CRC cases and 54 matched family friends representing four major racial ethnic groups in southern California (White, Asian, Hispanics, and Black). We used three phases of data analytics, including exploratory, family-based analyses adjusting for the dependence within the family for sharing genetic heritage, the ensemble method, and generalized regression models for predictive modeling with a machine learning validation procedure to validate the results for enhanced prediction and reproducibility. The results revealed that despite the family members sharing genetic heritage, the CRC group had greater combined gene polymorphism rates than the family controls (p < 0.05), on MTHFR C677T, MTR A2756G, MTRR A66G, and DHFR 19 bp except MTHFR A1298C. Four racial groups presented different polymorphism rates for four genes (all p < 0.05) except MTHFR A1298C. Following the ensemble method, the most influential factors were identified, and the best predictive models were generated by using the generalized regression models, with Akaike's information criterion and leave-one-out cross validation methods. Body mass index (BMI) and gender were consistent predictors of CRC for both models when individual genes versus total polymorphism counts were used, and alcohol use was interactive with BMI status. Body mass index status was also interactive with both gender and MTHFR C677T gene polymorphism, and the exposure to environmental pollutants was an additional predictor. These results point to the important roles of environmental and modifiable factors in relation to gene-environment interactions in the prevention of CRC.

  7. Dissecting gene-environment interactions: A penalized robust approach accounting for hierarchical structures.

    PubMed

    Wu, Cen; Jiang, Yu; Ren, Jie; Cui, Yuehua; Ma, Shuangge

    2018-02-10

    Identification of gene-environment (G × E) interactions associated with disease phenotypes has posed a great challenge in high-throughput cancer studies. The existing marginal identification methods have suffered from not being able to accommodate the joint effects of a large number of genetic variants, while some of the joint-effect methods have been limited by failing to respect the "main effects, interactions" hierarchy, by ignoring data contamination, and by using inefficient selection techniques under complex structural sparsity. In this article, we develop an effective penalization approach to identify important G × E interactions and main effects, which can account for the hierarchical structures of the 2 types of effects. Possible data contamination is accommodated by adopting the least absolute deviation loss function. The advantage of the proposed approach over the alternatives is convincingly demonstrated in both simulation and a case study on lung cancer prognosis with gene expression measurements and clinical covariates under the accelerated failure time model.
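
    To make the role of the least absolute deviation (LAD) loss in the record above concrete, the sketch below evaluates a penalized LAD objective for a linear model. It is a simplified illustration under stated assumptions: the penalty here is a plain L1 (lasso-type) term standing in for the authors' hierarchy-respecting penalty, whose exact form the abstract does not give, and the function name, arguments, and toy numbers are hypothetical.

```python
def penalized_lad_objective(X, y, beta, lam):
    """Mean absolute residual plus an L1 penalty on the coefficients.

    X    : list of rows, each a list of covariate values
    y    : list of responses
    beta : list of coefficients, one per column of X
    lam  : non-negative tuning parameter (assumed plain L1 penalty)
    """
    residuals = [
        y_i - sum(x_ij * b_j for x_ij, b_j in zip(x_i, beta))
        for x_i, y_i in zip(X, y)
    ]
    # Absolute residuals grow linearly, so one contaminated observation
    # cannot dominate the loss the way it does under squared error.
    lad_loss = sum(abs(r) for r in residuals) / len(y)
    penalty = lam * sum(abs(b) for b in beta)
    return lad_loss + penalty


# Made-up toy inputs for illustration only.
objective_value = penalized_lad_objective(
    X=[[1.0, 0.0], [0.0, 1.0]], y=[1.0, 2.0], beta=[1.0, 0.0], lam=0.5
)
```

    Minimizing such an objective over beta would require an optimizer suited to non-smooth functions; only the objective itself is shown here.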

  8. The genetics of alcoholism: identifying specific genes through family studies.

    PubMed

    Edenberg, Howard J; Foroud, Tatiana

    2006-09-01

    Alcoholism is a complex disorder with both genetic and environmental risk factors. Studies in humans have begun to elucidate the genetic underpinnings of the risk for alcoholism. Here we briefly review strategies for identifying individual genes in which variations affect the risk for alcoholism and related phenotypes, in the context of one large study that has successfully identified such genes. The Collaborative Study on the Genetics of Alcoholism (COGA) is a family-based study that has collected detailed phenotypic data on individuals in families with multiple alcoholic members. A genome-wide linkage approach led to the identification of chromosomal regions containing genes that influenced alcoholism risk and related phenotypes. Subsequently, single nucleotide polymorphisms (SNPs) were genotyped in positional candidate genes located within the linked chromosomal regions, and analyzed for association with these phenotypes. Using this sequential approach, COGA has detected association with GABRA2, CHRM2 and ADH4; these associations have all been replicated by other researchers. COGA has detected association to additional genes including GABRG3, TAS2R16, SNCA, OPRK1 and PDYN, results that are awaiting confirmation. These successes demonstrate that genes contributing to the risk for alcoholism can be reliably identified using human subjects.

  9. NRIP enhances HPV gene expression via interaction with either GR or E2

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chang, Szu-Wei; Lu, Pei-Yu; Guo, Jih-Huong

    We previously identified a gene, nuclear receptor-interaction protein (NRIP), which functions as a transcription cofactor in glucocorticoid receptor (GR) and human papillomavirus E2 (HPV E2)-driven gene expression. Here, we comprehensively evaluated the role of NRIP in HPV-16 gene expression. NRIP acts as a transcription cofactor to enhance GR-regulated HPV-16 gene expression in the presence of hormone. NRIP also can form complex with E2 that caused NRIP-induced HPV gene expression via E2-binding sites in a hormone-independent manner. Furthermore, NRIP can associate with GR and E2 to form tri-protein complex to activate HPV gene expression via GRE, not the E2-binding site, in a hormone-dependent manner. These results indicate that NRIP and GR are viral E2-binding proteins and that NRIP regulates HPV gene expression via GRE and/or E2 binding site in the HPV promoter in a hormone-dependent or independent manner, respectively.

  10. A Heterogeneous Network Based Method for Identifying GBM-Related Genes by Integrating Multi-Dimensional Data.

    PubMed

    Chen Peng; Ao Li

    2017-01-01

    The emergence of multi-dimensional data offers opportunities for more comprehensive analysis of the molecular characteristics of human diseases and therefore improving diagnosis, treatment, and prevention. In this study, we proposed a heterogeneous network based method by integrating multi-dimensional data (HNMD) to identify GBM-related genes. The novelty of the method lies in that the multi-dimensional data of GBM from TCGA dataset that provide comprehensive information of genes, are combined with protein-protein interactions to construct a weighted heterogeneous network, which reflects both the general and disease-specific relationships between genes. In addition, a propagation algorithm with resistance is introduced to precisely score and rank GBM-related genes. The results of comprehensive performance evaluation show that the proposed method significantly outperforms the network based methods with single-dimensional data and other existing approaches. Subsequent analysis of the top ranked genes suggests they may be functionally implicated in GBM, which further corroborates the superiority of the proposed method. The source code and the results of HNMD can be downloaded from the following URL: http://bioinformatics.ustc.edu.cn/hnmd/ .

  11. IRS-PCR-based genetic mapping of the huntingtin interacting protein gene (HIP1) on mouse chromosome 5.

    PubMed

    Himmelbauer, H; Wedemeyer, N; Haaf, T; Wanker, E E; Schalkwyk, L C; Lehrach, H

    1998-01-01

    Huntington's disease (HD) is a devastating central nervous system disorder. Even though the gene responsible has been positionally cloned recently, its etiology has remained largely unclear. To investigate potential disease mechanisms, we conducted a search for binding partners of the HD-protein huntingtin. With the yeast two-hybrid system, one such interacting factor, the huntingtin interacting protein-1 (HIP-1), was identified (Wanker et al. 1997; Kalchman et al. 1997) and the human gene mapped to 7q11.2. In this paper we demonstrate the localization of the HIP1 mouse homologue (Hip1) into a previously identified region of human-mouse synteny on distal mouse Chromosome (Chr) 5, both employing an IRS-PCR-based mapping strategy and traditional fluorescent in situ hybridization (FISH) mapping.

  12. Gene expression patterns combined with bioinformatics analysis identify genes associated with cholangiocarcinoma.

    PubMed

    Li, Chen; Shen, Weixing; Shen, Sheng; Ai, Zhilong

    2013-12-01

    To explore the molecular mechanisms of cholangiocarcinoma (CC), microarray technology was used to find biomarkers for early detection and diagnosis. The gene expression profiles from 6 patients with CC and 5 normal controls were downloaded from Gene Expression Omnibus and compared. As a result, 204 differentially co-expressed genes (DCGs) in CC patients compared to normal controls were identified using a computational bioinformatics analysis. These genes were mainly involved in coenzyme metabolic process, peptidase activity and oxidation reduction. A regulatory network was constructed by mapping the DCGs to known regulation data. Four transcription factors, FOXC1, ZIC2, NKX2-2 and GCGR, were hub nodes in the network. In conclusion, this study provides a set of targets useful for future investigations into molecular biomarker studies.

  13. Identifying functional cancer-specific miRNA-mRNA interactions in testicular germ cell tumor.

    PubMed

    Sedaghat, Nafiseh; Fathy, Mahmood; Modarressi, Mohammad Hossein; Shojaie, Ali

    2016-09-07

    Testicular cancer is the most common cancer in men aged between 15 and 35, and more than 90% of testicular neoplasms originate in germ cells. Recent research has shown the impact of microRNAs (miRNAs) in different types of cancer, including testicular germ cell tumor (TGCT). MicroRNAs are small non-coding RNAs which affect the development and progression of cancer cells by binding to mRNAs and regulating their expression. The identification of functional miRNA-mRNA interactions in cancers, i.e. those that alter the expression of genes in cancer cells, can help delineate post-regulatory mechanisms and may lead to new treatments to control the progression of cancer. A number of sequence-based methods have been developed to predict miRNA-mRNA interactions based on the complementarity of sequences. While necessary, sequence complementarity is, however, not sufficient for presence of functional interactions. Alternative methods have thus been developed to refine the sequence-based interactions using concurrent expression profiles of miRNAs and mRNAs. This study aims to find functional cancer-specific miRNA-mRNA interactions in TGCT. To this end, the sequence-based predicted interactions are first refined using an ensemble learning method, based on two well-known methods of learning miRNA-mRNA interactions, namely, TaLasso and GenMiR++. Additional functional analyses were then used to identify a subset of interactions to be most likely functional and specific to TGCT. The final list of 13 miRNA-mRNA interactions can be potential targets for identifying TGCT-specific interactions and future laboratory experiments to develop new therapies.

  14. Genetic and physical interaction of the B-cell systemic lupus erythematosus-associated genes BANK1 and BLK.

    PubMed

    Castillejo-López, Casimiro; Delgado-Vega, Angélica M; Wojcik, Jerome; Kozyrev, Sergey V; Thavathiru, Elangovan; Wu, Ying-Yu; Sánchez, Elena; Pöllmann, David; López-Egido, Juan R; Fineschi, Serena; Domínguez, Nicolás; Lu, Rufei; James, Judith A; Merrill, Joan T; Kelly, Jennifer A; Kaufman, Kenneth M; Moser, Kathy L; Gilkeson, Gary; Frostegård, Johan; Pons-Estel, Bernardo A; D'Alfonso, Sandra; Witte, Torsten; Callejas, José Luis; Harley, John B; Gaffney, Patrick M; Martin, Javier; Guthridge, Joel M; Alarcón-Riquelme, Marta E

    2012-01-01

    Altered signalling in B cells is a predominant feature of systemic lupus erythematosus (SLE). The genes BANK1 and BLK were recently described as associated with SLE. BANK1 codes for a B-cell-specific cytoplasmic protein involved in B-cell receptor signalling and BLK codes for an Src tyrosine kinase with important roles in B-cell development. To characterise the role of BANK1 and BLK in SLE, a genetic interaction analysis was performed hypothesising that genetic interactions could reveal functional pathways relevant to disease pathogenesis. The GPAT16 method was used to analyse the gene-gene interactions of BANK1 and BLK. Confocal microscopy was used to investigate co-localisation, and immunoprecipitation was used to verify the physical interaction of BANK1 and BLK. Epistatic interactions between BANK1 and BLK polymorphisms associated with SLE were observed in a discovery set of 279 patients and 515 controls from northern Europe. A meta-analysis with 4399 European individuals confirmed the genetic interactions between BANK1 and BLK. As BANK1 was identified as a binding partner of the Src tyrosine kinase LYN, the possibility that BANK1 and BLK could also show a protein-protein interaction was tested. The co-immunoprecipitation and co-localisation of BLK and BANK1 were demonstrated. In a Daudi cell line and primary naive B cells endogenous binding was enhanced upon B-cell receptor stimulation using anti-IgM antibodies. This study shows a genetic interaction between BANK1 and BLK, and demonstrates that these molecules interact physically. The results have important consequences for the understanding of SLE and other autoimmune diseases and identify a potential new signalling pathway.

  15. Transcriptomic meta-analysis identifies gene expression characteristics in various samples of HIV-infected patients with nonprogressive disease.

    PubMed

    Zhang, Le-Le; Zhang, Zi-Ning; Wu, Xian; Jiang, Yong-Jun; Fu, Ya-Jing; Shang, Hong

    2017-09-12

    A small proportion of HIV-infected patients remain clinically and/or immunologically stable for years, including elite controllers (ECs) who have undetectable viremia (<50 copies/ml) and long-term nonprogressors (LTNPs) who maintain normal CD4+ T cell counts for prolonged periods (>10 years). However, the mechanism of nonprogression needs to be further resolved. In this study, a transcriptome meta-analysis was performed on nonprogressor and progressor microarray data to identify differential transcriptome pathways and potential biomarkers. Using the INMEX (integrative meta-analysis of expression data) program, we performed the meta-analysis to identify consistently differentially expressed genes (DEGs) in nonprogressors and further performed functional interpretation (gene ontology analysis and pathway analysis) of the DEGs identified in the meta-analysis. Five microarray datasets (81 cases and 98 controls in total), including whole blood, CD4+ and CD8+ T cells, were collected for meta-analysis. We determined that nonprogressors have reduced expression of important interferon-stimulated genes (ISGs), CD38, lymphocyte activation gene 3 (LAG-3) in whole blood, CD4+ and CD8+ T cells. Gene ontology (GO) analysis showed a significant enrichment in DEGs that function in the type I interferon signaling pathway. Upregulated pathways, including the PI3K-Akt signaling pathway in whole blood, cytokine-cytokine receptor interaction in CD4+ T cells and the MAPK signaling pathway in CD8+ T cells, were identified in nonprogressors compared with progressors. In each metabolic functional category, the number of downregulated DEGs was more than the upregulated DEGs, and almost all genes were downregulated DEGs in the oxidative phosphorylation (OXPHOS) and tricarboxylic acid (TCA) cycle in the three types of samples. Our transcriptomic meta-analysis provides a comprehensive evaluation of the gene expression profiles in major blood types of nonprogressors, providing new

  16. Protein Interaction Networks Reveal Novel Autism Risk Genes within GWAS Statistical Noise

    PubMed Central

    Correia, Catarina; Oliveira, Guiomar; Vicente, Astrid M.

    2014-01-01

    Genome-wide association studies (GWAS) for Autism Spectrum Disorder (ASD) thus far met limited success in the identification of common risk variants, consistent with the notion that variants with small individual effects cannot be detected individually in single SNP analysis. To further capture disease risk gene information from ASD association studies, we applied a network-based strategy to the Autism Genome Project (AGP) and the Autism Genetics Resource Exchange GWAS datasets, combining family-based association data with Human Protein-Protein interaction (PPI) data. Our analysis showed that autism-associated proteins at higher than conventional levels of significance (P<0.1) directly interact more than random expectation and are involved in a limited number of interconnected biological processes, indicating that they are functionally related. The functionally coherent networks generated by this approach contain ASD-relevant disease biology, as demonstrated by an improved positive predictive value and sensitivity in retrieving known ASD candidate genes relative to the top associated genes from either GWAS, as well as a higher gene overlap between the two ASD datasets. Analysis of the intersection between the networks obtained from the two ASD GWAS and six unrelated disease datasets identified fourteen genes exclusively present in the ASD networks. These are mostly novel genes involved in abnormal nervous system phenotypes in animal models, and in fundamental biological processes previously implicated in ASD, such as axon guidance, cell adhesion or cytoskeleton organization. Overall, our results highlighted novel susceptibility genes previously hidden within GWAS statistical “noise” that warrant further analysis for causal variants. PMID:25409314

  17. Protein interaction networks reveal novel autism risk genes within GWAS statistical noise.

    PubMed

    Correia, Catarina; Oliveira, Guiomar; Vicente, Astrid M

    2014-01-01

    Genome-wide association studies (GWAS) for Autism Spectrum Disorder (ASD) thus far met limited success in the identification of common risk variants, consistent with the notion that variants with small individual effects cannot be detected individually in single SNP analysis. To further capture disease risk gene information from ASD association studies, we applied a network-based strategy to the Autism Genome Project (AGP) and the Autism Genetics Resource Exchange GWAS datasets, combining family-based association data with Human Protein-Protein interaction (PPI) data. Our analysis showed that autism-associated proteins at higher than conventional levels of significance (P<0.1) directly interact more than random expectation and are involved in a limited number of interconnected biological processes, indicating that they are functionally related. The functionally coherent networks generated by this approach contain ASD-relevant disease biology, as demonstrated by an improved positive predictive value and sensitivity in retrieving known ASD candidate genes relative to the top associated genes from either GWAS, as well as a higher gene overlap between the two ASD datasets. Analysis of the intersection between the networks obtained from the two ASD GWAS and six unrelated disease datasets identified fourteen genes exclusively present in the ASD networks. These are mostly novel genes involved in abnormal nervous system phenotypes in animal models, and in fundamental biological processes previously implicated in ASD, such as axon guidance, cell adhesion or cytoskeleton organization. Overall, our results highlighted novel susceptibility genes previously hidden within GWAS statistical "noise" that warrant further analysis for causal variants.

  18. A CRISPR Cas9-based gene drive platform for genetic interaction analysis in Candida albicans

    PubMed Central

    Shapiro, Rebecca S.; Chavez, Alejandro; Porter, Caroline B. M.; Hamblin, Meagan; Kaas, Christian S.; DiCarlo, James E.; Zeng, Guisheng; Xu, Xiaoli; Revtovich, Alexey V.; Kirienko, Natalia V.; Wang, Yue; Church, George M.; Collins, James J.

    2018-01-01

    Candida albicans is the leading cause of fungal infections; yet, complex genetic interaction analysis remains cumbersome in this diploid pathogen. Here, we developed a CRISPR-Cas9-based ‘gene drive array’ (GDA) platform to facilitate efficient genetic analysis in C. albicans. In our system, a modified DNA donor molecule acts as a selfish genetic element, replaces the targeted site, and propagates to replace additional wild-type loci. Using mating-competent C. albicans haploids, each carrying a different gene drive disabling a gene of interest, we are able to create diploid strains that are homozygous double-deletion mutants. We generate double-gene deletion libraries to demonstrate this technology, targeting antifungal efflux and biofilm adhesion factors. We screen these libraries to identify virulence regulators and determine how genetic networks shift under diverse conditions. This platform transforms our ability to perform genetic interaction analysis in C. albicans and is readily extended to other fungal pathogens. PMID:29062088

  19. Genes Interacting with Occupational Exposures to Low Molecular Weight Agents and Irritants on Adult-Onset Asthma in Three European Studies

    PubMed Central

    Rava, Marta; Ahmed, Ismail; Kogevinas, Manolis; Le Moual, Nicole; Bouzigon, Emmanuelle; Curjuric, Ivan; Dizier, Marie-Hélène; Dumas, Orianne; Gonzalez, Juan R.; Imboden, Medea; Mehta, Amar J.; Tubert-Bitter, Pascale; Zock, Jan-Paul; Jarvis, Deborah; Probst-Hensch, Nicole M.; Demenais, Florence; Nadif, Rachel

    2016-01-01

    Background: The biological mechanisms by which cleaning products and disinfectants—an emerging risk factor—affect respiratory health remain incompletely evaluated. Studying genes by environment interactions (G × E) may help identify new genes related to adult-onset asthma. Objectives: We identified interactions between genetic polymorphisms of a large set of genes involved in the response to oxidative stress and occupational exposures to low molecular weight (LMW) agents or irritants on adult-onset asthma. Methods: Our data came from three large European cohorts: Epidemiological Family-based Study of the Genetics and Environment of Asthma (EGEA), Swiss Cohort Study on Air Pollution and Lung and Heart Disease in Adults (SAPALDIA), and European Community Respiratory Health Survey in Adults (ECRHS). A candidate pathway–based strategy identified 163 genes involved in the response to oxidative stress and potentially related to exposures to LMW agents/irritants. Occupational exposures were evaluated using an asthma job-exposure matrix and job-specific questionnaires for cleaners and healthcare workers. Logistic regression models were used to detect G × E interactions, adjusted for age, sex, and population ancestry, in 2,599 adults (mean age, 47 years; 60% women, 36% exposed, 18% asthmatics). p-Values were corrected for multiple comparisons. Results: Ever exposure to LMW agents/irritants was associated with current adult-onset asthma [OR = 1.28 (95% CI: 1.04, 1.58)]. Eight single nucleotide polymorphism (SNP) by exposure interactions at five loci were found at p < 0.005: PLA2G4A (rs932476, chromosome 1), near PLA2R1 (rs2667026, chromosome 2), near RELA (rs931127, rs7949980, chromosome 11), PRKD1 (rs1958980, rs11847351, rs1958987, chromosome 14), and PRKCA (rs6504453, chromosome 17). Results were consistent across the three studies and after accounting for smoking. Conclusions: Using a pathway-based selection process, we identified novel genes potentially involved

  20. An Integrative Genetics Approach to Identify Candidate Genes Regulating BMD: Combining Linkage, Gene Expression, and Association

    PubMed Central

    Farber, Charles R; van Nas, Atila; Ghazalpour, Anatole; Aten, Jason E; Doss, Sudheer; Sos, Brandon; Schadt, Eric E; Ingram-Drake, Leslie; Davis, Richard C; Horvath, Steve; Smith, Desmond J; Drake, Thomas A; Lusis, Aldons J

    2009-01-01

    Numerous quantitative trait loci (QTLs) affecting bone traits have been identified in the mouse; however, few of the underlying genes have been discovered. To improve the process of transitioning from QTL to gene, we describe an integrative genetics approach, which combines linkage analysis, expression QTL (eQTL) mapping, causality modeling, and genetic association in outbred mice. In C57BL/6J × C3H/HeJ (BXH) F2 mice, nine QTLs regulating femoral BMD were identified. To select candidate genes from within each QTL region, microarray gene expression profiles from individual F2 mice were used to identify 148 genes whose expression was correlated with BMD and regulated by local eQTLs. Many of the genes that were the most highly correlated with BMD have been previously shown to modulate bone mass or skeletal development. Candidates were further prioritized by determining whether their expression was predicted to underlie variation in BMD. Using network edge orienting (NEO), a causality modeling algorithm, 18 of the 148 candidates were predicted to be causally related to differences in BMD. To fine-map QTLs, markers in outbred MF1 mice were tested for association with BMD. Three chromosome 11 SNPs were identified that were associated with BMD within the Bmd11 QTL. Finally, our approach provides strong support for Wnt9a, Rasd1, or both underlying Bmd11. Integration of multiple genetic and genomic data sets can substantially improve the efficiency of QTL fine-mapping and candidate gene identification. PMID:18767929

  1. NFI Transcription Factors Interact with FOXA1 to Regulate Prostate-Specific Gene Expression

    PubMed Central

    Elliott, Amicia D.; DeGraff, David J.; Anderson, Philip D.; Anumanthan, Govindaraj; Yamashita, Hironobu; Sun, Qian; Friedman, David B.; Hachey, David L.; Yu, Xiuping; Sheehan, Jonathan H.; Ahn, Jung-Mo; Raj, Ganesh V.; Piston, David W.; Gronostajski, Richard M.; Matusik, Robert J.

    2014-01-01

    Androgen receptor (AR) action throughout prostate development and in maintenance of the prostatic epithelium is partly controlled by interactions between AR and forkhead box (FOX) transcription factors, particularly FOXA1. We sought to identify additional FOXA1 binding partners that may mediate prostate-specific gene expression. Here we identify the nuclear factor I (NFI) family of transcription factors as novel FOXA1 binding proteins. All four family members (NFIA, NFIB, NFIC, and NFIX) can interact with FOXA1, and knockdown studies in androgen-dependent LNCaP cells determined that modulating expression of NFI family members results in changes in AR target gene expression. This effect is probably mediated by binding of NFI family members to AR target gene promoters, because chromatin immunoprecipitation (ChIP) studies found that NFIB bound to the prostate-specific antigen enhancer. Förster resonance energy transfer studies revealed that FOXA1 is capable of bringing AR and NFIX into proximity, indicating that FOXA1 facilitates the AR and NFI interaction by bridging the complex. To determine the extent to which NFI family members regulate AR/FOXA1 target genes, motif analysis of publicly available data for ChIP followed by sequencing was undertaken. This analysis revealed that 34.4% of peaks bound by AR and FOXA1 contain NFI binding sites. Validation of 8 of these peaks by ChIP revealed that NFI family members can bind 6 of these predicted genomic elements, and 4 of the 8 associated genes undergo gene expression changes as a result of individual NFI knockdown. These observations suggest that NFI regulation of FOXA1/AR action is a frequent event, with individual family members playing distinct roles in AR target gene expression. PMID:24801505

  2. Bioinformatics, interaction network analysis, and neural networks to characterize gene expression of radicular cyst and periapical granuloma.

    PubMed

    Poswar, Fabiano de Oliveira; Farias, Lucyana Conceição; Fraga, Carlos Alberto de Carvalho; Bambirra, Wilson; Brito-Júnior, Manoel; Sousa-Neto, Manoel Damião; Santos, Sérgio Henrique Souza; de Paula, Alfredo Maurício Batista; D'Angelo, Marcos Flávio Silveira Vasconcelos; Guimarães, André Luiz Sena

    2015-06-01

    Bioinformatics has emerged as an important tool to analyze the large amount of data generated by research in different diseases. In this study, gene expression for radicular cysts (RCs) and periapical granulomas (PGs) was characterized based on a leader gene approach. A validated bioinformatics algorithm was applied to identify leader genes for RCs and PGs. Genes related to RCs and PGs were first identified in PubMed, GenBank, GeneAtlas, and GeneCards databases. The Web-available STRING software (The European Molecular Biology Laboratory [EMBL], Heidelberg, Baden-Württemberg, Germany) was used to build the interaction map among the identified genes by a significance score named weighted number of links. Based on the weighted number of links, genes were clustered using k-means. The genes in the highest cluster were considered leader genes. Multilayer perceptron neural network analysis was used as a complement for gene classification. For RCs, the suggested leader genes were TP53 and EP300, whereas PGs were associated with IL2RG, CCL2, CCL4, CCL5, CCR1, CCR3, and CCR5 genes. Our data revealed different gene expression for RCs and PGs, suggesting that not only the inflammatory nature but also other biological processes might differentiate RCs and PGs. Copyright © 2015 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
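
    The leader gene step (cluster genes by weighted number of links with k-means, then keep the highest cluster) can be sketched as follows. This is an illustrative one-dimensional k-means, not the validated algorithm the abstract refers to. The function names, the evenly spaced starting centroids, and the default of three clusters are assumptions, and any scores passed in would be hypothetical.

```python
def kmeans_1d(values, k, n_iter=100):
    """Lloyd's algorithm on scalar values (k >= 2).

    Centroids start evenly spaced between the minimum and maximum,
    which keeps the result deterministic for a given input.
    """
    lo, hi = min(values), max(values)
    centroids = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    labels = [0] * len(values)
    for _ in range(n_iter):
        # Assign every value to its nearest centroid.
        labels = [min(range(k), key=lambda c: abs(v - centroids[c])) for v in values]
        # Recompute each centroid as the mean of its members.
        updated = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            updated.append(sum(members) / len(members) if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return labels, centroids

def leader_genes(weighted_links, k=3):
    """Return the genes that fall in the cluster with the highest centroid."""
    genes = list(weighted_links)
    labels, centroids = kmeans_1d([weighted_links[g] for g in genes], k)
    top = max(range(k), key=lambda c: centroids[c])
    return sorted(g for g, lab in zip(genes, labels) if lab == top)
```

    The input is a dictionary mapping gene names to their weighted number of links; the choice of k changes which genes end up in the top cluster, so it would need to be justified for real data.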

  3. Identifying conserved gene clusters in the presence of homology families.

    PubMed

    He, Xin; Goldwasser, Michael H

    2005-01-01

    The study of conserved gene clusters is important for understanding the forces behind genome organization and evolution, as well as the function of individual genes or gene groups. In this paper, we present a new model and algorithm for identifying conserved gene clusters from pairwise genome comparison. This generalizes a recent model called "gene teams." A gene team is a set of genes that appear homologously in two or more species, possibly in a different order yet with the distance of adjacent genes in the team for each chromosome always no more than a certain threshold. We remove the constraint in the original model that each gene must have a unique occurrence in each chromosome and thus allow the analysis on complex prokaryotic or eukaryotic genomes with extensive paralogs. Our algorithm analyzes a pair of chromosomes in O(mn) time and uses O(m+n) space, where m and n are the number of genes in the respective chromosomes. We demonstrate the utility of our methods by studying two bacterial genomes, E. coli K-12 and B. subtilis. Many of the teams identified by our algorithm correlate with documented E. coli operons, while several others match predicted operons, previously suggested by computational techniques. Our implementation and data are publicly available at euler.slu.edu/~goldwasser/homologyteams/.
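
    The distance condition in the gene team definition can be checked directly. The sketch below assumes the original model, in which each gene occurs once per chromosome, and it only verifies the adjacency threshold for a given gene set. It does not search for maximal teams and is not the O(mn) algorithm described in the paper. The function names and the position dictionaries are hypothetical.

```python
def is_delta_chain(genes, positions, delta):
    """True if, on one chromosome, neighbouring genes of the set are at most `delta` apart."""
    coords = sorted(positions[g] for g in genes)
    return all(b - a <= delta for a, b in zip(coords, coords[1:]))

def satisfies_team_condition(genes, pos_a, pos_b, delta):
    """Check the threshold on both chromosomes; gene order may differ between them."""
    return is_delta_chain(genes, pos_a, delta) and is_delta_chain(genes, pos_b, delta)
```

    A set that passes this check is a candidate only; a team in the strict sense is additionally maximal, meaning no further gene can be added without breaking the threshold on one of the chromosomes.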

  4. Gene-diet interactions and aging in C. elegans

    PubMed Central

    Yen, Chia An; Curran, Sean P.

    2016-01-01

    Diet is the most variable aspect of life history, as most individuals have a large diversity of food choices, varying in the type and amount that they ingest. In the short-term, diet can affect metabolism and energy levels. However, in the long run, the net deficiency or excess of calories from diet can influence the progression and severity of age-related diseases. An old and yet still debated question is: how do specific dietary choices impact health- and lifespan? It is clear that genetics can play a critical role — perhaps just as important as diet choices. For example, poor diet in combination with genetic susceptibility can lead to metabolic disorders, such as obesity and type 2 diabetes. Recent work in Caenorhabditis elegans has identified the existence of diet-gene pairs, where the consequence of mutating a specific gene is only realized on specific diets. Many core metabolic pathways are conserved from worm to human. Although only a handful of these diet-gene pairs has been characterized, there are potentially hundreds, if not thousands, of such interactions, which may explain the variability in the rates of aging in humans and the incidence and severity of age-related diseases. PMID:26924670

  5. Interaction between the sbcC Gene of Escherichia coli and the gam Gene of Phage λ

    PubMed Central

    Kulkarni, S. K.; Stahl, F. W.

    1989-01-01

    gam mutants of phage λ carrying long palindromes fail to form plaques on wild-type Escherichia coli but do grow on strains that are mutant in the sbcC gene. gam(+) λ carrying the same palindrome grow on both hosts and on a host deleted for the recB, C and D genes. These results suggest that the Gam protein of λ, known to interact also with E. coli's recBCD protein, can interact with the product of the sbcC gene. PMID:2531105

  6. Transcriptome profiling of two maize inbreds with distinct responses to Gibberella ear rot disease to identify candidate resistance genes.

    PubMed

    Kebede, Aida Z; Johnston, Anne; Schneiderman, Danielle; Bosnich, Whynn; Harris, Linda J

    2018-02-09

    Gibberella ear rot (GER) is one of the most economically important fungal diseases of maize in the temperate zone due to moldy grain contaminated with health-threatening mycotoxins. To develop resistant genotypes and control the disease, understanding the host-pathogen interaction is essential. RNA-Seq-derived transcriptome profiles of fungal- and mock-inoculated developing kernel tissues of two maize inbred lines were used to identify differentially expressed transcripts and propose candidate genes mapping within GER resistance quantitative trait loci (QTL). A total of 1255 transcripts were significantly (P ≤ 0.05) upregulated due to fungal infection in both susceptible and resistant inbreds. A greater number of transcripts were upregulated in the former (1174) than the latter (497) and increased as the infection progressed from 1 to 2 days after inoculation. Focusing on differentially expressed genes located within QTL regions for GER resistance, we identified 81 genes involved in membrane transport, hormone regulation, cell wall modification, cell detoxification, and biosynthesis of pathogenesis-related proteins and phytoalexins as candidate genes contributing to resistance. Applying droplet digital PCR, we validated the expression profiles of a subset of these candidate genes from QTL regions contributed by the resistant inbred on chromosomes 1, 2 and 9. By screening global gene expression profiles for differentially expressed genes mapping within resistance QTL regions, we have identified candidate genes for Gibberella ear rot resistance on several maize chromosomes which could potentially lead to a better understanding of Fusarium resistance mechanisms.

  7. Functional Logistic Regression Approach to Detecting Gene by Longitudinal Environmental Exposure Interaction in a Case-Control Study

    PubMed Central

    Wei, Peng; Tang, Hongwei; Li, Donghui

    2014-01-01

    Most complex human diseases are likely the consequence of the joint actions of genetic and environmental factors. Identification of gene-environment (GxE) interactions not only contributes to a better understanding of the disease mechanisms, but also improves disease risk prediction and targeted intervention. In contrast to the large number of genetic susceptibility loci discovered by genome-wide association studies, there have been very few successes in identifying GxE interactions which may be partly due to limited statistical power and inaccurately measured exposures. While existing statistical methods only consider interactions between genes and static environmental exposures, many environmental/lifestyle factors, such as air pollution and diet, change over time, and cannot be accurately captured at one measurement time point or by simply categorizing into static exposure categories. There is a dearth of statistical methods for detecting gene by time-varying environmental exposure interactions. Here we propose a powerful functional logistic regression (FLR) approach to model the time-varying effect of longitudinal environmental exposure and its interaction with genetic factors on disease risk. Capitalizing on the powerful functional data analysis framework, our proposed FLR model is capable of accommodating longitudinal exposures measured at irregular time points and contaminated by measurement errors, commonly encountered in observational studies. We use extensive simulations to show that the proposed method can control the Type I error and is more powerful than alternative ad hoc methods. We demonstrate the utility of this new method using data from a case-control study of pancreatic cancer to identify the windows of vulnerability of lifetime body mass index on the risk of pancreatic cancer as well as genes which may modify this association. PMID:25219575

  8. DGIdb 3.0: a redesign and expansion of the drug-gene interaction database.

    PubMed

    Cotto, Kelsy C; Wagner, Alex H; Feng, Yang-Yang; Kiwala, Susanna; Coffman, Adam C; Spies, Gregory; Wollam, Alex; Spies, Nicholas C; Griffith, Obi L; Griffith, Malachi

    2018-01-04

    The drug-gene interaction database (DGIdb, www.dgidb.org) consolidates, organizes and presents drug-gene interactions and gene druggability information from papers, databases and web resources. DGIdb normalizes content from 30 disparate sources and allows for user-friendly advanced browsing, searching and filtering for ease of access through an intuitive web user interface, application programming interface (API) and public cloud-based server image. DGIdb v3.0 represents a major update of the database. Nine of the previously included 24 sources were updated. Six new resources were added, bringing the total number of sources to 30. These updates and additions of sources have cumulatively resulted in 56 309 interaction claims. This has also substantially expanded the comprehensive catalogue of druggable genes and anti-neoplastic drug-gene interactions included in the DGIdb. Along with these content updates, v3.0 has received a major overhaul of its codebase, including an updated user interface, preset interaction search filters, consolidation of interaction information into interaction groups, greatly improved search response times and upgrading the underlying web application framework. In addition, the expanded API features new endpoints which allow users to extract more detailed information about queried drugs, genes and drug-gene interactions, including listings of PubMed IDs, interaction type and other interaction metadata.

  9. Evolution of a Novel Antiviral Immune-Signaling Interaction by Partial-Gene Duplication

    PubMed Central

    Korithoski, Bryan; Kolaczkowski, Oralia; Mukherjee, Krishanu; Kola, Reema; Earl, Chandra; Kolaczkowski, Bryan

    2015-01-01

    The RIG-like receptors (RLRs) are related proteins that identify viral RNA in the cytoplasm and activate cellular immune responses, primarily through direct protein-protein interactions with the signal transducer, IPS1. Although it has been well established that the RLRs, RIG-I and MDA5, activate IPS1 through binding between the twin caspase activation and recruitment domains (CARDs) on the RLR and a homologous CARD on IPS1, it is less clear which specific RLR CARD(s) are required for this interaction, and almost nothing is known about how the RLR-IPS1 interaction evolved. In contrast to what has been observed in the presence of immune-modulating K63-linked polyubiquitin, here we show that—in the absence of ubiquitin—it is the first CARD domain of human RIG-I and MDA5 (CARD1) that binds directly to IPS1 CARD, and not the second (CARD2). Although the RLRs originated in the earliest animals, both the IPS1 gene and the twin-CARD domain architecture of RIG-I and MDA5 arose much later in the deuterostome lineage, probably through a series of tandem partial-gene duplication events facilitated by tight clustering of RLRs and IPS1 in the ancestral deuterostome genome. Functional differentiation of RIG-I CARD1 and CARD2 appears to have occurred early during this proliferation of RLR and related CARDs, potentially driven by adaptive coevolution between RIG-I CARD domains and IPS1 CARD. However, functional differentiation of MDA5 CARD1 and CARD2 occurred later. These results fit a general model in which duplications of protein-protein interaction domains into novel gene contexts could facilitate the expansion of signaling networks and suggest a potentially important role for functionally-linked gene clusters in generating novel immune-signaling pathways. PMID:26356745

  10. C-State: an interactive web app for simultaneous multi-gene visualization and comparative epigenetic pattern search.

    PubMed

    Sowpati, Divya Tej; Srivastava, Surabhi; Dhawan, Jyotsna; Mishra, Rakesh K

    2017-09-13

    Comparative epigenomic analysis across multiple genes presents a bottleneck for bench biologists working with NGS data. Despite the development of standardized peak analysis algorithms, the identification of novel epigenetic patterns and their visualization across gene subsets remains a challenge. We developed a fast and interactive web app, C-State (Chromatin-State), to query and plot chromatin landscapes across multiple loci and cell types. C-State has an interactive, JavaScript-based graphical user interface and runs locally in modern web browsers that are pre-installed on all computers, thus eliminating the need for cumbersome data transfer, pre-processing and prior programming knowledge. C-State is unique in its ability to extract and analyze multi-gene epigenetic information. It allows for powerful GUI-based pattern searching and visualization. We include a case study to demonstrate its potential for identifying user-defined epigenetic trends in context of gene expression profiles.

  11. Exploring Plant Co-Expression and Gene-Gene Interactions with CORNET 3.0.

    PubMed

    Van Bel, Michiel; Coppens, Frederik

    2017-01-01

    Selecting and filtering a reference expression and interaction dataset when studying specific pathways and regulatory interactions can be a very time-consuming and error-prone task. In order to reduce the duplicated efforts required to amass such datasets, we have created the CORNET (CORrelation NETworks) platform which allows for easy access to a wide variety of data types: coexpression data, protein-protein interactions, regulatory interactions, and functional annotations. The CORNET platform outputs its results in either text format or through the Cytoscape framework, which is automatically launched by the CORNET website. CORNET 3.0 is the third iteration of the web platform designed for the user exploration of the coexpression space of plant genomes, with a focus on the model species Arabidopsis thaliana. Here we describe the platform: the tools, data, and best practices when using the platform. We indicate how the platform can be used to infer networks from a set of input genes, such as upregulated genes from an expression experiment. By exploring the network, new target and regulator genes can be discovered, allowing for follow-up experiments and more in-depth study. We also indicate how to avoid common pitfalls when evaluating the networks and how to avoid overinterpretation of the results. All CORNET versions are available at http://bioinformatics.psb.ugent.be/cornet/ .

  12. Gene Signature in Sessile Serrated Polyps Identifies Colon Cancer Subtype

    PubMed Central

    Kanth, Priyanka; Bronner, Mary P.; Boucher, Kenneth M.; Burt, Randall W.; Neklason, Deborah W.; Hagedorn, Curt H.; Delker, Don A.

    2016-01-01

    Sessile serrated colon adenoma/polyps (SSA/Ps) are found during routine screening colonoscopy and may account for 20–30% of colon cancers. However, differentiating SSA/Ps from hyperplastic polyps (HP) with little risk of cancer is challenging and complementary molecular markers are needed. Additionally, the molecular mechanisms of colon cancer development from SSA/Ps are poorly understood. RNA sequencing was performed on 21 SSA/Ps, 10 HPs, 10 adenomas, 21 uninvolved colon and 20 control colon specimens. Differential expression and leave-one-out cross validation methods were used to define a unique gene signature of SSA/Ps. Our SSA/P gene signature was evaluated in colon cancer RNA-Seq data from The Cancer Genome Atlas (TCGA) to identify a subtype of colon cancers that may develop from SSA/Ps. A total of 1422 differentially expressed genes were found in SSA/Ps relative to controls. Serrated polyposis syndrome (n=12) and sporadic SSA/Ps (n=9) exhibited almost complete (96%) gene overlap. A 51-gene panel in SSA/P showed similar expression in a subset of TCGA colon cancers with high microsatellite instability (MSI-H). A smaller seven-gene panel showed high sensitivity and specificity in identifying BRAF mutant, CpG island methylator phenotype high (CIMP-H) and MLH1 silenced colon cancers. We describe a unique gene signature in SSA/Ps that identifies a subset of colon cancers likely to develop through the serrated pathway. These gene panels may be utilized for improved differentiation of SSA/Ps from HPs and provide insights into novel molecular pathways altered in colon cancer arising from the serrated pathway. PMID:27026680

  13. Association of methylenetetrahydrofolate reductase gene-gene interaction and haplotype with susceptibility to acute lymphoblastic leukemia in Chinese children.

    PubMed

    Xia, Xiaojun; Duan, Yun; Cui, Jie; Jiang, Junfeng; Lin, Li; Peng, Xiaojuan; Wang, YuHong; Guo, Bingtao; Liu, Shouhai; Lei, Xudong

    2017-08-01

    The aim of this study was to investigate the association of methylenetetrahydrofolate reductase (MTHFR) gene polymorphisms and their gene-gene interaction with acute lymphoblastic leukemia (ALL) risk. Logistic regression was performed to investigate the association between two single nucleotide polymorphisms (SNPs) within the MTHFR gene and ALL risk, as well as the gene-gene interaction between rs1801133 and rs1801131. The minor alleles of rs1801133 and rs1801131 were associated with decreased ALL risk; the ORs (95% CI) were 0.61 (0.38-0.89) and 0.68 (0.50-0.96), respectively. We also found a significant interaction between the two SNPs: participants with the rs1801133 CT or TT and rs1801131 AC or CC genotypes had the lowest ALL risk compared with participants with the rs1801133 CC and rs1801131 AA genotypes, with an OR (95% CI) of 0.32 (0.12-0.63). We did not find any haplotype of rs1801133 and rs1801131 associated with ALL risk. rs1801133 and rs1801131 within the MTHFR gene and their interaction were both associated with ALL risk in Chinese children.
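
    For readers unfamiliar with how an odds ratio and its 95% confidence interval are obtained, the sketch below computes both from a 2x2 table using the standard log-odds (Woolf) approximation. The study itself used logistic regression, which can also adjust for covariates; this unadjusted calculation is only an illustration, and the function name and argument layout are assumptions.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and confidence interval from a 2x2 table.

    a: cases carrying the genotype of interest
    b: controls carrying the genotype of interest
    c: cases with the reference genotype
    d: controls with the reference genotype
    All four counts must be positive.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper
```

    An interval that lies entirely below 1, as reported for the combined genotype in this abstract, indicates lower odds of disease in carriers than in the reference group.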

  14. To Control False Positives in Gene-Gene Interaction Analysis: Two Novel Conditional Entropy-Based Approaches

    PubMed Central

    Lin, Meihua; Li, Haoli; Zhao, Xiaolei; Qin, Jiheng

    2013-01-01

    Genome-wide analysis of gene-gene interactions has been recognized as a powerful avenue to identify the missing genetic components that cannot be detected by using current single-point association analysis. Recently, several model-free methods (e.g. the commonly used information based metrics and several logistic regression-based metrics) were developed for detecting non-linear dependence between genetic loci, but they are potentially at risk of inflated false positive error, in particular when the main effects at one or both loci are salient. In this study, we proposed two conditional entropy-based metrics to address this limitation. Extensive simulations demonstrated that the two proposed metrics, provided the disease is rare, could maintain consistently correct false positive rate. In the scenarios for a common disease, our proposed metrics achieved better or comparable control of false positive error, compared to four previously proposed model-free metrics. In terms of power, our methods outperformed several competing metrics in a range of common disease models. Furthermore, in real data analyses, both metrics succeeded in detecting interactions and were competitive with the originally reported results or the logistic regression approaches. In conclusion, the proposed conditional entropy-based metrics are promising as alternatives to current model-based approaches for detecting genuine epistatic effects. PMID:24339984
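
    The abstract does not define its two metrics, so the sketch below shows only the generic building block they rest on: the conditional entropy of disease status given genotypes at one or two loci, estimated from observed frequencies. The function names are assumptions, and this is not the authors' statistic.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy in bits of the empirical distribution of `labels`."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def conditional_entropy(y, x):
    """H(Y | X) = H(X, Y) - H(X), with X and Y given as parallel sequences."""
    return entropy(list(zip(x, y))) - entropy(list(x))

def joint_conditional_entropy(y, x1, x2):
    """H(Y | X1, X2): uncertainty left in Y once both loci are known."""
    return conditional_entropy(y, list(zip(x1, x2)))
```

    A purely epistatic pattern shows up as a large drop from H(Y | X1) to H(Y | X1, X2) even though neither locus reduces the uncertainty alone. With strong main effects the drop is harder to interpret, which is the false positive problem the abstract raises.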

  15. GBOOST: a GPU-based tool for detecting gene-gene interactions in genome-wide case control studies.

    PubMed

    Yung, Ling Sing; Yang, Can; Wan, Xiang; Yu, Weichuan

    2011-05-01

    Collecting millions of genetic variations is feasible with advanced genotyping technology. With a huge amount of genetic variation data in hand, developing efficient algorithms to carry out the gene-gene interaction analysis in a timely manner has become one of the key problems in genome-wide association studies (GWAS). Boolean operation-based screening and testing (BOOST), a recent work in GWAS, completes gene-gene interaction analysis in 2.5 days on a desktop computer. Compared with central processing units (CPUs), graphic processing units (GPUs) are highly parallel hardware and provide massive computing resources. We are, therefore, motivated to use GPUs to further speed up the analysis of gene-gene interactions. We implement the BOOST method based on a GPU framework and name it GBOOST. GBOOST achieves a 40-fold speedup compared with BOOST. It completes the analysis of Wellcome Trust Case Control Consortium Type 2 Diabetes (WTCCC T2D) genome data within 1.34 h on a desktop computer equipped with an Nvidia GeForce GTX 285 display card. GBOOST code is available at http://bioinformatics.ust.hk/BOOST.html#GBOOST.
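
    The idea behind a Boolean encoding of genotypes is that a pairwise contingency table can be filled with bitwise AND and bit counting instead of a loop over samples. The sketch below illustrates that general idea in plain Python. It is not the BOOST or GBOOST code, and the encoding layout (one bitmask per genotype value, one bit per sample) and function names are assumptions.

```python
def encode(genotypes):
    """Build one bitmask per genotype value (0, 1, 2).

    Bit i of masks[g] is set when sample i carries genotype g.
    """
    masks = [0, 0, 0]
    for i, g in enumerate(genotypes):
        masks[g] |= 1 << i
    return masks

def contingency_table(snp_a, snp_b):
    """3x3 table of joint genotype counts for two SNPs over the same samples."""
    masks_a, masks_b = encode(snp_a), encode(snp_b)
    # AND selects samples with genotype i at SNP A and j at SNP B;
    # counting the set bits gives the cell count.
    return [[bin(masks_a[i] & masks_b[j]).count("1") for j in range(3)]
            for i in range(3)]
```

    In a case-control setting the same table would be built separately for cases and controls. Because each cell is independent of the others and each SNP pair is independent of every other pair, the computation parallelizes naturally, which is what makes a GPU port attractive.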

  16. Host susceptibility to malaria in human and mice: compatible approaches to identify potential resistant genes.

    PubMed

    Hernandez-Valladares, Maria; Rihet, Pascal; Iraqi, Fuad A

    2014-01-01

    There is growing evidence for human genetic factors controlling the outcome of malaria infection, while the molecular basis of this genetic control is still poorly understood. Case-control and family-based studies have been carried out to identify genes underlying host susceptibility to malarial infection. Parasitemia and mild malaria have been genetically linked to human chromosomes 5q31-q33 and 6p21.3, and several immune genes located within those regions have been associated with malaria-related phenotypes. Association and linkage studies of resistance to malaria are not easy to carry out in human populations, because of the difficulty in surveying a significant number of families. Murine models have proven to be an excellent genetic tool for studying host response to malaria; their use allowed mapping 14 resistance loci, eight of them controlling parasitic levels and six controlling cerebral malaria. Once quantitative trait loci or genes have been identified, the human ortholog may then be identified. Comparative mapping studies showed that humans and mice might share similar genetically controlled mechanisms of resistance. In this way, char8, which controls parasitemia, was mapped on chromosome 11; char8 corresponds to human chromosome 5q31-q33 and contains immune genes, such as Il3, Il4, Il5, Il12b, Il13, Irf1, and Csf2. Nevertheless, some of the genetic factors controlling malaria traits might differ between the two hosts because of specific host-pathogen interactions. Finally, novel genetic tools including animal models were recently developed and will offer new opportunities for identifying genetic factors underlying host phenotypic response to malaria, which will help in developing better therapeutic strategies including vaccine and drug development.

  17. Shame and Guilt-Proneness in Adolescents: Gene-Environment Interactions

    PubMed Central

    Szentágotai-Tătar, Aurora; Chiș, Adina; Vulturar, Romana; Dobrean, Anca; Cândea, Diana Mirela; Miu, Andrei C.

    2015-01-01

    Rooted in people’s preoccupation with how they are perceived and evaluated, shame and guilt are self-conscious emotions that play adaptive roles in social behavior, but can also contribute to psychopathology when dysregulated. Shame and guilt-proneness develop during childhood and adolescence, and are influenced by genetic and environmental factors that are little known to date. This study investigated the effects of early traumatic events and functional polymorphisms in the brain-derived neurotrophic factor (BDNF) gene and the serotonin transporter gene promoter (5-HTTLPR) on shame and guilt in adolescents. A sample of N = 271 healthy adolescents between 14 and 17 years of age filled in measures of early traumatic events and proneness to shame and guilt, and were genotyped for the BDNF Val66Met and 5-HTTLPR polymorphisms. Results of moderator analyses indicated that trauma intensity was positively associated with guilt-proneness only in carriers of the low-expressing Met allele of BDNF Val66Met. This is the first study that identifies a gene-environment interaction that significantly contributes to guilt proneness in adolescents, with potential implications for developmental psychopathology. PMID:26230319

  18. Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts

    PubMed Central

    Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni

    2013-01-01

    Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. 
    Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune…

  19. Pathways-driven sparse regression identifies pathways and genes associated with high-density lipoprotein cholesterol in two Asian cohorts.

    PubMed

    Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni

    2013-11-01

    Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. 
    Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune…

  20. Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes

    PubMed Central

    Edmunds, Richard C.; Su, Baofeng; Balhoff, James P.; Eames, B. Frank; Dahdul, Wasila M.; Lapp, Hilmar; Lundberg, John G.; Vision, Todd J.; Dunham, Rex A.; Mabee, Paula M.; Westerfield, Monte

    2016-01-01

    Phenotypes resulting from mutations in genetic model organisms can help reveal candidate genes for evolutionarily important phenotypic changes in related taxa. Although testing candidate gene hypotheses experimentally in nonmodel organisms is typically difficult, ontology-driven information systems can help generate testable hypotheses about developmental processes in experimentally tractable organisms. Here, we tested candidate gene hypotheses suggested by expert use of the Phenoscape Knowledgebase, specifically looking for genes that are candidates responsible for evolutionarily interesting phenotypes in the ostariophysan fishes that bear resemblance to mutant phenotypes in zebrafish. For this, we searched ZFIN for genetic perturbations that result in either loss of basihyal element or loss of scales phenotypes, because these are the ancestral phenotypes observed in catfishes (Siluriformes). We tested the identified candidate genes by examining their endogenous expression patterns in the channel catfish, Ictalurus punctatus. The experimental results were consistent with the hypotheses that these features evolved through disruption in developmental pathways at, or upstream of, brpf1 and eda/edar for the ancestral losses of basihyal element and scales, respectively. These results demonstrate that ontological annotations of the phenotypic effects of genetic alterations in model organisms, when aggregated within a knowledgebase, can be used effectively to generate testable, and useful, hypotheses about evolutionary changes in morphology. PMID:26500251

  1. Robustness of meta-analyses in finding gene × environment interactions

    PubMed Central

    Shi, Gang; Nehorai, Arye

    2017-01-01

    Meta-analyses that synthesize statistical evidence across studies have become important analytical tools for genetic studies. Inspired by the success of genome-wide association studies of the genetic main effect, researchers are searching for gene × environment interactions. Confounders are routinely included in the genome-wide gene × environment interaction analysis as covariates; however, this does not control for any confounding effects on the results if covariate × environment interactions are present. We carried out simulation studies to evaluate the robustness to the covariate × environment confounder for meta-regression and joint meta-analysis, which are two commonly used meta-analysis methods for testing the gene × environment interaction or the genetic main effect and interaction jointly. Here we show that meta-regression is robust to the covariate × environment confounder while joint meta-analysis is subject to the confounding effect with inflated type I error rates. Given vast sample sizes employed in genome-wide gene × environment interaction studies, non-significant covariate × environment interactions at the study level could substantially elevate the type I error rate at the consortium level. When covariate × environment confounders are present, type I errors can be controlled in joint meta-analysis by including the covariate × environment terms in the analysis at the study level. Alternatively, meta-regression can be applied, which is robust to potential covariate × environment confounders. PMID:28362796

  2. Type 2 diabetes mellitus disease risk genes identified by genome wide copy number variation scan in normal populations.

    PubMed

    Prabhanjan, Manasa; Suresh, Raviraj V; Murthy, Megha N; Ramachandra, Nallur B

    2016-03-01

    To identify the role of copy number variations (CNVs) in disease risk genes and their effect on disease phenotypes in type 2 diabetes mellitus (T2DM) in 12 random populations using high throughput arrays. CNV analysis was carried out on a total of 1715 individuals from 12 populations, from ArrayExpress Archive of the European Bioinformatics Institute along with our subjects using Affymetrix Genome Wide SNP 6.0 array. The effects of CNVs on T2DM genes were analyzed using several bioinformatics tools and a molecular protein interaction network was constructed to identify the disease mechanism altered by the CNVs. Analysis showed 34.4% of the total population to be under CNV burden for T2DM, with 83 disease causal and associated genes being under CNV influence. Hotspots were identified on chromosomes 22, 12, 6, 19 and 11. Overlap studies with case cohorts revealed significant disease risk genes such as EGFR, E2F1, PPP1R3A, HLA and TSPAN8. CNVs play a significant role in predisposing T2DM in normal cohorts and contribute to the phenotypic effects. Thus, CNVs should be considered as one of the major contributors in predisposition of the disease. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  3. LHX3 interacts with inhibitor of histone acetyltransferase complex subunits LANP and TAF-1β to modulate pituitary gene regulation.

    PubMed

    Hunter, Chad S; Malik, Raleigh E; Witzmann, Frank A; Rhodes, Simon J

    2013-01-01

    LIM-homeodomain 3 (LHX3) is a transcription factor required for mammalian pituitary gland and nervous system development. Human patients and animal models with LHX3 gene mutations present with severe pediatric syndromes that feature hormone deficiencies and symptoms associated with nervous system dysfunction. The carboxyl terminus of the LHX3 protein is required for pituitary gene regulation, but the mechanism by which this domain operates is unknown. In order to better understand LHX3-dependent pituitary hormone gene transcription, we used biochemical and mass spectrometry approaches to identify and characterize proteins that interact with the LHX3 carboxyl terminus. This approach identified the LANP/pp32 and TAF-1β/SET proteins, which are components of the inhibitor of histone acetyltransferase (INHAT) multi-subunit complex that serves as a multifunctional repressor to inhibit histone acetylation and modulate chromatin structure. The protein domains of LANP and TAF-1β that interact with LHX3 were mapped using biochemical techniques. Chromatin immunoprecipitation experiments demonstrated that LANP and TAF-1β are associated with LHX3 target genes in pituitary cells, and experimental alterations of LANP and TAF-1β levels affected LHX3-mediated pituitary gene regulation. Together, these data suggest that transcriptional regulation of pituitary genes by LHX3 involves regulated interactions with the INHAT complex.

  4. LHX3 Interacts with Inhibitor of Histone Acetyltransferase Complex Subunits LANP and TAF-1β to Modulate Pituitary Gene Regulation

    PubMed Central

    Witzmann, Frank A.; Rhodes, Simon J.

    2013-01-01

    LIM-homeodomain 3 (LHX3) is a transcription factor required for mammalian pituitary gland and nervous system development. Human patients and animal models with LHX3 gene mutations present with severe pediatric syndromes that feature hormone deficiencies and symptoms associated with nervous system dysfunction. The carboxyl terminus of the LHX3 protein is required for pituitary gene regulation, but the mechanism by which this domain operates is unknown. In order to better understand LHX3-dependent pituitary hormone gene transcription, we used biochemical and mass spectrometry approaches to identify and characterize proteins that interact with the LHX3 carboxyl terminus. This approach identified the LANP/pp32 and TAF-1β/SET proteins, which are components of the inhibitor of histone acetyltransferase (INHAT) multi-subunit complex that serves as a multifunctional repressor to inhibit histone acetylation and modulate chromatin structure. The protein domains of LANP and TAF-1β that interact with LHX3 were mapped using biochemical techniques. Chromatin immunoprecipitation experiments demonstrated that LANP and TAF-1β are associated with LHX3 target genes in pituitary cells, and experimental alterations of LANP and TAF-1β levels affected LHX3-mediated pituitary gene regulation. Together, these data suggest that transcriptional regulation of pituitary genes by LHX3 involves regulated interactions with the INHAT complex. PMID:23861948

  5. Novel genes identified in a high-density genome wide association study for nicotine dependence.

    PubMed

    Bierut, Laura Jean; Madden, Pamela A F; Breslau, Naomi; Johnson, Eric O; Hatsukami, Dorothy; Pomerleau, Ovide F; Swan, Gary E; Rutter, Joni; Bertelsen, Sarah; Fox, Louis; Fugman, Douglas; Goate, Alison M; Hinrichs, Anthony L; Konvicka, Karel; Martin, Nicholas G; Montgomery, Grant W; Saccone, Nancy L; Saccone, Scott F; Wang, Jen C; Chase, Gary A; Rice, John P; Ballinger, Dennis G

    2007-01-01

    Tobacco use is a leading contributor to disability and death worldwide, and genetic factors contribute in part to the development of nicotine dependence. To identify novel genes for which natural variation contributes to the development of nicotine dependence, we performed a comprehensive genome wide association study using nicotine dependent smokers as cases and non-dependent smokers as controls. To allow the efficient, rapid, and cost effective screen of the genome, the study was carried out using a two-stage design. In the first stage, genotyping of over 2.4 million single nucleotide polymorphisms (SNPs) was completed in case and control pools. In the second stage, we selected SNPs for individual genotyping based on the most significant allele frequency differences between cases and controls from the pooled results. Individual genotyping was performed in 1050 cases and 879 controls using 31 960 selected SNPs. The primary analysis, a logistic regression model with covariates of age, gender, genotype and gender by genotype interaction, identified 35 SNPs with P-values less than 10⁻⁴ (minimum P-value 1.53 × 10⁻⁶). Although none of the individual findings is statistically significant after correcting for multiple tests, additional statistical analyses support the existence of true findings in this group. Our study nominates several novel genes, such as Neurexin 1 (NRXN1), in the development of nicotine dependence while also identifying a known candidate gene, the beta3 nicotinic cholinergic receptor. This work anticipates the future directions of large-scale genome wide association studies with state-of-the-art methodological approaches and sharing of data with the scientific community.
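
    To make the structure of the primary analysis concrete, the sketch below evaluates a logistic model with a gender-by-genotype interaction term. It is illustrative only: the coding of gender (0/1) and genotype (allele count 0/1/2), the coefficient values, and the function names are assumptions, not details from the study.

```python
import math

def predicted_probability(age, gender, genotype, coef):
    """Case probability under a logistic model with a gender x genotype term."""
    linear_predictor = (
        coef["intercept"]
        + coef["age"] * age
        + coef["gender"] * gender
        + coef["genotype"] * genotype
        + coef["gender_x_genotype"] * gender * genotype
    )
    return 1.0 / (1.0 + math.exp(-linear_predictor))

def log_odds(p):
    """Convert a probability to log-odds."""
    return math.log(p / (1.0 - p))

# Hypothetical coefficients, chosen only for illustration.
coef = {
    "intercept": -1.0,
    "age": 0.01,
    "gender": 0.2,
    "genotype": 0.3,
    "gender_x_genotype": 0.25,
}

# Per-allele change in log-odds within each gender group, at age 40.
effect_gender0 = (log_odds(predicted_probability(40, 0, 1, coef))
                  - log_odds(predicted_probability(40, 0, 0, coef)))
effect_gender1 = (log_odds(predicted_probability(40, 1, 1, coef))
                  - log_odds(predicted_probability(40, 1, 0, coef)))
```

    With a non-zero interaction coefficient, the per-allele effect on the log-odds scale differs between the two gender groups (here 0.3 versus 0.3 + 0.25 = 0.55); setting the interaction coefficient to zero makes the two effects equal.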

  6. Diet-Gene Interactions and PUFA Metabolism: A Potential Contributor to Health Disparities and Human Diseases

    PubMed Central

    Chilton, Floyd H.; Murphy, Robert C.; Wilson, Bryan A.; Sergeant, Susan; Ainsworth, Hannah; Seeds, Michael C.; Mathias, Rasika A.

    2014-01-01

    The “modern western” diet (MWD) has increased the onset and progression of chronic human diseases as qualitatively and quantitatively maladaptive dietary components give rise to obesity and destructive gene-diet interactions. There has been a three-fold increase in dietary levels of the omega-6 (n-6) 18 carbon (C18), polyunsaturated fatty acid (PUFA) linoleic acid (LA; 18:2n-6), with the addition of cooking oils and processed foods to the MWD. Intense debate has emerged regarding the impact of this increase on human health. Recent studies have uncovered population-related genetic variation in the LCPUFA biosynthetic pathway (especially within the fatty acid desaturase gene (FADS) cluster) that is associated with levels of circulating and tissue PUFAs and several biomarkers and clinical endpoints of cardiovascular disease (CVD). Importantly, populations of African descent have higher frequencies of variants associated with elevated levels of arachidonic acid (ARA), CVD biomarkers and disease endpoints. Additionally, nutrigenomic interactions between dietary n-6 PUFAs and variants in genes that encode for enzymes that mobilize and metabolize ARA to eicosanoids have been identified. These observations raise important questions of whether gene-PUFA interactions are differentially driving the risk of cardiovascular and other diseases in diverse populations, and contributing to health disparities, especially in African American populations. PMID:24853887

  7. An Efficient Test for Gene-Environment Interaction in Generalized Linear Mixed Models with Family Data.

    PubMed

    Mazo Lopera, Mauricio A; Coombes, Brandon J; de Andrade, Mariza

    2017-09-27

    Gene-environment (GE) interaction has important implications in the etiology of complex diseases that are caused by a combination of genetic factors and environment variables. Several authors have developed GE analysis in the context of independent subjects or longitudinal data using a gene-set. In this paper, we propose to analyze GE interaction for discrete and continuous phenotypes in family studies by incorporating the relatedness among the relatives for each family into a generalized linear mixed model (GLMM) and by using a gene-based variance component test. In addition, we deal with collinearity problems arising from linkage disequilibrium among single nucleotide polymorphisms (SNPs) by considering their coefficients as random effects under the null model estimation. We show that the best linear unbiased predictor (BLUP) of such random effects in the GLMM is equivalent to the ridge regression estimator. This equivalence provides a simple method to estimate the ridge penalty parameter in comparison to other computationally-demanding estimation approaches based on cross-validation schemes. We evaluated the proposed test using simulation studies and applied it to real data from the Baependi Heart Study consisting of 76 families. Using our approach, we identified an interaction between BMI and the Peroxisome Proliferator Activated Receptor Gamma (PPARG) gene associated with diabetes.
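
    For intuition, the BLUP and ridge equivalence is easiest to see in the ordinary linear mixed model. The notation below is an assumption made for illustration and is not taken from the paper, which works with the more general GLMM.

```latex
% Linear mixed model with fixed effects \beta and random SNP effects b
y = X\beta + Zb + e, \qquad b \sim N(0, \sigma_b^2 I), \qquad e \sim N(0, \sigma_e^2 I)

% BLUP of b (second row of Henderson's mixed model equations)
\hat{b}_{\mathrm{BLUP}} = \left(Z^\top Z + \lambda I\right)^{-1} Z^\top \left(y - X\hat{\beta}\right),
\qquad \lambda = \frac{\sigma_e^2}{\sigma_b^2}

% Ridge estimator with the same penalty
\hat{b}_{\mathrm{ridge}} = \arg\min_{b} \; \lVert y - X\hat{\beta} - Zb \rVert^2 + \lambda \lVert b \rVert^2
= \left(Z^\top Z + \lambda I\right)^{-1} Z^\top \left(y - X\hat{\beta}\right)
```

    In this linear setting the ridge penalty is the ratio of the two variance components, so once those are estimated (for example by restricted maximum likelihood) the penalty follows without a cross-validation search.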

  8. Identifying novel members of the Wntless interactome through genetic and candidate gene approaches.

    PubMed

    Petko, Jessica; Tranchina, Trevor; Patel, Goral; Levenson, Robert; Justice-Bitner, Stephanie

    2018-04-01

    Wnt signaling is an important pathway that regulates several aspects of embryogenesis, stem cell maintenance, and neural connectivity. We have recently determined that opioids decrease Wnt secretion, presumably by inhibiting the recycling of the Wnt trafficking protein Wntless (Wls). This effect appears to be mediated by protein-protein interaction between Wls and the mu-opioid receptor (MOR), the primary cellular target of opioid drugs. The goal of this study was to identify novel protein interactors of Wls that are expressed in the brain and may also play a role in reward or addiction. Using genetic and candidate gene approaches, we show that among a variety of proteins, Wls interacts with the dopamine transporter (target of cocaine), cannabinoid receptors (target of THC), Adenosine A2A receptor (target of caffeine), and SGIP1 (endocytic regulator of cannabinoid receptors). Our study shows that aside from opioid receptors, Wntless interacts with additional proteins involved in reward and/or addiction. Future studies will determine whether Wntless and WNT signaling play a more universal role in these processes. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. Identifying User Interaction Patterns in E-Textbooks.

    PubMed

    Saarinen, Santeri; Heimonen, Tomi; Turunen, Markku; Mikkilä-Erdmann, Mirjamaija; Raisamo, Roope; Erdmann, Norbert; Yrjänäinen, Sari; Keskinen, Tuuli

    2015-01-01

    We introduce a new architecture for e-textbooks which contains two navigational aids: an index and a concept map. We report results from an evaluation in a university setting with 99 students. The interaction sequences of the users were captured during the user study. We found several clusters of user interaction types in our data. Three separate user types were identified based on the interaction sequences: passive user, term clicker, and concept map user. We also discovered that with the concept map interface users started to interact with the application significantly sooner than with the index interface. Overall, our findings suggest that analysis of interaction patterns allows deeper insights into the use of e-textbooks than is afforded by summative evaluation.

  10. Identifying User Interaction Patterns in E-Textbooks

    PubMed Central

    Saarinen, Santeri; Turunen, Markku; Mikkilä-Erdmann, Mirjamaija; Erdmann, Norbert; Yrjänäinen, Sari; Keskinen, Tuuli

    2015-01-01

    We introduce a new architecture for e-textbooks which contains two navigational aids: an index and a concept map. We report results from an evaluation in a university setting with 99 students. The interaction sequences of the users were captured during the user study. We found several clusters of user interaction types in our data. Three separate user types were identified based on the interaction sequences: passive user, term clicker, and concept map user. We also discovered that with the concept map interface users started to interact with the application significantly sooner than with the index interface. Overall, our findings suggest that analysis of interaction patterns allows deeper insights into the use of e-textbooks than is afforded by summative evaluation. PMID:26605377

  11. Bioinformatic prediction of leader genes in human periodontitis.

    PubMed

    Covani, Ugo; Marconcini, Simone; Giacomelli, Luca; Sivozhelevov, Victor; Barone, Antonio; Nicolini, Claudio

    2008-10-01

    Genes involved in different biologic processes form complex interaction networks. However, only a few have a high number of interactions with the other genes in the network. In previous bioinformatics and experimental studies concerning the T lymphocyte cell cycle, these genes were identified and termed "leader genes." In this work, genes involved in human periodontitis were tentatively identified and ranked according to their number of interactions to obtain a preliminary, broader view of molecular mechanisms of periodontitis and plan targeted experimentation. Genes were identified with interrelated queries of several databases. The interactions among these genes were mapped and given a significance score. The weighted number of links (weighted sum of scores for every interaction in which the given gene is involved) was calculated for each gene. Genes were clustered according to this parameter. The genes in the highest cluster were termed leader genes. Sixty-one genes involved or potentially involved in periodontitis were identified. Only five were identified as leader genes, whereas 12 others were ranked in an immediately lower cluster. For 10 of 17 genes there is evidence of involvement in periodontitis; seven new genes that are potentially involved in this disease were identified. The involvement in periodontitis has been completely established for only two leader genes. We applied a validated bioinformatics algorithm to increase our knowledge of molecular mechanisms of periodontitis. Even with the limitations of this ab initio analysis, this theoretical study can suggest ad hoc experimentation targeted on significant genes and, therefore, simpler than mass-scale molecular genomics. Moreover, the identification of leader genes might suggest new potential risk factors and therapeutic targets.
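
    The weighted-number-of-links ranking described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the gene names, interaction scores, and function name are hypothetical, and the subsequent clustering step that separates leader genes from the rest is omitted.

```python
from collections import defaultdict

def weighted_number_of_links(interactions):
    """Sum the scores of every interaction in which each gene is involved."""
    wnl = defaultdict(float)
    for gene_a, gene_b, score in interactions:
        # Each scored link counts toward both of its endpoints.
        wnl[gene_a] += score
        wnl[gene_b] += score
    return dict(wnl)

# Hypothetical scored interactions: (gene, gene, significance score).
interactions = [
    ("GENE_A", "GENE_B", 0.9),
    ("GENE_A", "GENE_C", 0.8),
    ("GENE_A", "GENE_D", 0.7),
    ("GENE_B", "GENE_C", 0.4),
]

wnl = weighted_number_of_links(interactions)

# Rank genes from the highest to the lowest weighted number of links.
ranking = sorted(wnl, key=wnl.get, reverse=True)
```

    In this toy network GENE_A has by far the largest weighted number of links and would be the natural candidate for the top cluster; how many genes fall into that cluster depends on the clustering method, which is not specified here.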

  12. Functional logistic regression approach to detecting gene by longitudinal environmental exposure interaction in a case-control study.

    PubMed

    Wei, Peng; Tang, Hongwei; Li, Donghui

    2014-11-01

    Most complex human diseases are likely the consequence of the joint actions of genetic and environmental factors. Identification of gene-environment (G × E) interactions not only contributes to a better understanding of the disease mechanisms, but also improves disease risk prediction and targeted intervention. In contrast to the large number of genetic susceptibility loci discovered by genome-wide association studies, there have been very few successes in identifying G × E interactions, which may be partly due to limited statistical power and inaccurately measured exposures. Although existing statistical methods only consider interactions between genes and static environmental exposures, many environmental/lifestyle factors, such as air pollution and diet, change over time, and cannot be accurately captured at one measurement time point or by simply categorizing into static exposure categories. There is a dearth of statistical methods for detecting gene by time-varying environmental exposure interactions. Here, we propose a powerful functional logistic regression (FLR) approach to model the time-varying effect of longitudinal environmental exposure and its interaction with genetic factors on disease risk. Capitalizing on the powerful functional data analysis framework, our proposed FLR model is capable of accommodating longitudinal exposures measured at irregular time points and contaminated by measurement errors, commonly encountered in observational studies. We use extensive simulations to show that the proposed method can control the Type I error and is more powerful than alternative ad hoc methods. We demonstrate the utility of this new method using data from a case-control study of pancreatic cancer to identify the windows of vulnerability of lifetime body mass index on the risk of pancreatic cancer as well as genes that may modify this association. © 2014 Wiley Periodicals, Inc.

  13. Identification and comprehensive evaluation of reference genes for RT-qPCR analysis of host gene-expression in Brassica juncea-aphid interaction using microarray data.

    PubMed

    Ram, Chet; Koramutla, Murali Krishna; Bhattacharya, Ramcharan

    2017-07-01

    Brassica juncea is a chief oil yielding crop in many parts of the world including India. With the advancement of molecular techniques, RT-qPCR based study of gene-expression has become an integral part of experimentation in crop breeding. In RT-qPCR, use of appropriate reference gene(s) is pivotal. The virtue of the reference genes, being constant in expression throughout the experimental treatments, needs to be validated case by case. Appropriate reference gene(s) for normalization of gene-expression data in B. juncea during the biotic stress of aphid infestation are not known. In the present investigation, 11 reference genes identified from microarray database of Arabidopsis-aphid interaction at a cut off FDR ≤0.1, along with two known reference genes of B. juncea, were analyzed for their expression stability upon aphid infestation. These included 6 frequently used and 5 newly identified reference genes. Ranking orders of the reference genes in terms of expression stability were calculated using advanced statistical approaches such as geNorm, NormFinder, delta Ct and BestKeeper. The analysis suggested CAC, TUA and DUF179 as the most suitable reference genes. Further, normalizing the gene-expression data of STP4 and PR1 by the most and the least stable reference genes, respectively, demonstrated the importance and applicability of the recommended reference genes in aphid-infested samples of B. juncea. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  14. Genetic background effects in quantitative genetics: gene-by-system interactions.

    PubMed

    Sardi, Maria; Gasch, Audrey P

    2018-04-11

    Proper cell function depends on networks of proteins that interact physically and functionally to carry out physiological processes. Thus, it seems logical that the impact of sequence variation in one protein could be significantly influenced by genetic variants at other loci in a genome. Nonetheless, the importance of such genetic interactions, known as epistasis, in explaining phenotypic variation remains a matter of debate in genetics. Recent work from our lab revealed that genes implicated from an association study of toxin tolerance in Saccharomyces cerevisiae show extensive interactions with the genetic background: most implicated genes, regardless of allele, are important for toxin tolerance in only one of two tested strains. The prevalence of background effects in our study adds to other reports of widespread genetic-background interactions in model organisms. We suggest that these effects represent many-way interactions with myriad features of the cellular system that vary across classes of individuals. Such gene-by-system interactions may influence diverse traits and require new modeling approaches to accurately represent genotype-phenotype relationships across individuals.

  15. Gene × Smoking Interactions on Human Brain Gene Expression: Finding Common Mechanisms in Adolescents and Adults

    ERIC Educational Resources Information Center

    Wolock, Samuel L.; Yates, Andrew; Petrill, Stephen A.; Bohland, Jason W.; Blair, Clancy; Li, Ning; Machiraju, Raghu; Huang, Kun; Bartlett, Christopher W.

    2013-01-01

    Background: Numerous studies have examined gene × environment interactions (G × E) in cognitive and behavioral domains. However, these studies have been limited in that they have not been able to directly assess differential patterns of gene expression in the human brain. Here, we assessed G × E interactions using two publicly available datasets…

  16. A cross-species bi-clustering approach to identifying conserved co-regulated genes.

    PubMed

    Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo

    2016-06-15

    A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regularized genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach is ineffective to deal with the noise in the expression data introduced by the complicated procedures in quantifying gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, thus difficult to detect conserved co-regulated genes. We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes that are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages that the clustered genes co-regulate. Efficient optimization algorithm has been developed with convergence analysis. 
This approach was first validated on synthetic data and compared
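
    The joint sparse rank-one factorization described in this abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a simple alternating least-squares update with soft-thresholding on the shared gene vector, and the function names, the penalty `lam`, and the toy data are all hypothetical.

```python
import random


def soft_threshold(x, lam):
    """Shrink x toward zero by lam; values within [-lam, lam] become 0."""
    if x > lam:
        return x - lam
    if x < -lam:
        return x + lam
    return 0.0


def stage_loadings(matrices, u):
    """For each species matrix X_k (genes x stages), compute v_k = X_k^T u."""
    return [[sum(X[i][j] * u[i] for i in range(len(u))) for j in range(len(X[0]))]
            for X in matrices]


def joint_sparse_rank_one(matrices, lam=0.1, n_iter=30):
    """Approximate every X_k by u v_k^T with one sparse gene vector u shared by all species."""
    n_genes = len(matrices[0])
    u = [1.0 / n_genes ** 0.5] * n_genes  # unit-norm starting point
    for _ in range(n_iter):
        vs = stage_loadings(matrices, u)
        denom = sum(x * x for v in vs for x in v)
        if denom == 0.0:
            break
        # Least-squares update of u given all v_k, followed by sparsification.
        z = [sum(sum(X[i][j] * v[j] for j in range(len(v)))
                 for X, v in zip(matrices, vs)) / denom
             for i in range(n_genes)]
        u_new = [soft_threshold(zi, lam) for zi in z]
        norm = sum(x * x for x in u_new) ** 0.5
        if norm == 0.0:
            break
        u = [x / norm for x in u_new]
    return u, stage_loadings(matrices, u)


# Toy data (hypothetical): genes 0-4 follow a stage pattern in both species, the rest are noise.
rng = random.Random(0)


def make_species(pattern, n_genes=20, n_cluster=5, noise=0.1):
    rows = []
    for i in range(n_genes):
        signal = pattern if i < n_cluster else [0.0] * len(pattern)
        rows.append([s + rng.gauss(0, noise) for s in signal])
    return rows


species_a = make_species([1, 2, 3, 3, 2, 1])
species_b = make_species([3, 1, 1, 3])
u, vs = joint_sparse_rank_one([species_a, species_b])
cluster = [i for i, x in enumerate(u) if x != 0.0]
print(cluster)
```

    The nonzero entries of `u` mark the genes assigned to the shared cluster, while each `vs[k]` gives the stage profile for one species; the two species may have different numbers of stages because only the gene dimension is shared.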

  17. Epistatic interaction between the monoamine oxidase A and serotonin transporter genes in anorexia nervosa.

    PubMed

    Urwin, Ruth Elizabeth; Nunn, Kenneth Patrick

    2005-03-01

    The serotonin (5-HT) and norepinephrine (NE) systems are likely involved in the aetiology of anorexia nervosa (AN) as sufferers are premorbidly anxious. Specifically, we hypothesize that genes encoding proteins, which clear 5-HT and NE from the synapse, are prime candidates for affecting susceptibility to AN. Supporting our hypothesis, we earlier showed that the NE transporter (NET) and monoamine oxidase A (MAOA) genes appear to contribute additively to increased risk of developing restricting AN (AN-R). With regard to the MAOA gene, a sequence variant that increases MAOA activity, and has a suggested association with the anxiety condition panic disorder, was preferentially transmitted from parents to affected children. Here we provide evidence in support of interaction between the MAOA and serotonin transporter (SERT) genes in 114 AN nuclear families (patient with AN plus biological parents). A SERT gene genotype with no apparent individual effect on risk and known to be associated with anxiety is preferentially transmitted to children with AN (chi2 trend=9.457, 1 df, P=0.0021) and AN-R alone (chi2 trend=7.477, 1 df, P=0.0063) when the 'more active' MAOA gene variant is also transmitted. The increased risk of developing the disorder is up to eight times greater than the risk imposed by the MAOA gene variant alone--an example of synergistic epistatic interaction. If independently replicated, our findings to date suggest that we may have identified three genes affecting susceptibility to AN, particularly AN-R: the MAOA, SERT, and NET genes.
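
    As a worked check of the two test statistics quoted in this abstract, the sketch below converts a chi-square value with 1 degree of freedom into its upper-tail P value. It relies on the standard identity P = erfc(sqrt(x/2)) for 1 df; the function name is hypothetical and nothing here reproduces the authors' trend test itself.

```python
import math


def chi2_sf_1df(x):
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2.0))


p_an = chi2_sf_1df(9.457)    # statistic reported for AN
p_an_r = chi2_sf_1df(7.477)  # statistic reported for AN-R alone
print(p_an, p_an_r)
```

    Both values should agree with the reported P=0.0021 and P=0.0063 to within rounding.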

  18. Novel Genes Affecting the Interaction between the Cabbage Whitefly and Arabidopsis Uncovered by Genome-Wide Association Mapping

    PubMed Central

    Broekgaarden, Colette; Bucher, Johan; Bac-Molenaar, Johanna; Keurentjes, Joost J. B.; Kruijer, Willem; Voorrips, Roeland E.; Vosman, Ben

    2015-01-01

    Plants have evolved a variety of ways to defend themselves against biotic attackers. This has resulted in the presence of substantial variation in defense mechanisms among plants, even within a species. Genome-wide association (GWA) mapping is a useful tool to study the genetic architecture of traits, but has so far only had limited exploitation in studies of plant defense. Here, we study the genetic architecture of defense against the phloem-feeding insect cabbage whitefly (Aleyrodes proletella) in Arabidopsis thaliana. We determined whitefly performance, i.e. the survival and reproduction of whitefly females, on 360 worldwide selected natural accessions and subsequently performed GWA mapping using 214,051 SNPs. Substantial variation for whitefly adult survival and oviposition rate (number of eggs laid per female per day) was observed between the accessions. We identified 39 candidate SNPs for either whitefly adult survival or oviposition rate, all with relatively small effects, underpinning the complex architecture of defense traits. Among the corresponding candidate genes, i.e. genes in linkage disequilibrium (LD) with candidate SNPs, none have previously been identified as a gene playing a role in the interaction between plants and phloem-feeding insects. Whitefly performance on knock-out mutants of a number of candidate genes was significantly affected, validating the potential of GWA mapping for novel gene discovery in plant-insect interactions. Our results show that GWA analysis is a very useful tool to gain insight into the genetic architecture of plant defense against herbivorous insects, i.e. we identified and validated several genes affecting whitefly performance that have not previously been related to plant defense against herbivorous insects. PMID:26699853

  19. Identifying RNA splicing factors using IFT genes in Chlamydomonas reinhardtii.

    PubMed

    Lin, Huawen; Zhang, Zhengyan; Iomini, Carlo; Dutcher, Susan K

    2018-03-01

    Intraflagellar transport moves proteins in and out of flagella/cilia and it is essential for the assembly of these organelles. Using whole-genome sequencing, we identified splice site mutations in two IFT genes, IFT81 (fla9) and IFT121 (ift121-2), which lead to flagellar assembly defects in the unicellular green alga Chlamydomonas reinhardtii. The splicing defects in these ift mutants are partially corrected by mutations in two conserved spliceosome proteins, DGR14 and FRA10. We identified a dgr14 deletion mutant, which suppresses the 3' splice site mutation in IFT81, and a frameshift mutant of FRA10, which suppresses the 5' splice site mutation in IFT121. Surprisingly, we found dgr14-1 and fra10 mutations suppress both splice site mutations. We suggest these two proteins are involved in facilitating splice site recognition/interaction; in their absence some splice site mutations are tolerated. Nonsense mutations in SMG1, which is involved in nonsense-mediated decay, lead to accumulation of aberrant transcripts and partial restoration of flagellar assembly in the ift mutants. The high density of introns and the conservation of noncore splicing factors, together with the ease of scoring the ift mutant phenotype, make Chlamydomonas an attractive organism to identify new proteins involved in splicing through suppressor screening. © 2018 The Authors.

  20. Robust Tests for Additive Gene-Environment Interaction in Case-Control Studies Using Gene-Environment Independence.

    PubMed

    Liu, Gang; Mukherjee, Bhramar; Lee, Seunggeun; Lee, Alice W; Wu, Anna H; Bandera, Elisa V; Jensen, Allan; Rossing, Mary Anne; Moysich, Kirsten B; Chang-Claude, Jenny; Doherty, Jennifer A; Gentry-Maharaj, Aleksandra; Kiemeney, Lambertus; Gayther, Simon A; Modugno, Francesmary; Massuger, Leon; Goode, Ellen L; Fridley, Brooke L; Terry, Kathryn L; Cramer, Daniel W; Ramus, Susan J; Anton-Culver, Hoda; Ziogas, Argyrios; Tyrer, Jonathan P; Schildkraut, Joellen M; Kjaer, Susanne K; Webb, Penelope M; Ness, Roberta B; Menon, Usha; Berchuck, Andrew; Pharoah, Paul D; Risch, Harvey; Pearce, Celeste Leigh

    2018-02-01

    There have been recent proposals advocating the use of additive gene-environment interaction instead of the widely used multiplicative scale, as a more relevant public health measure. Using gene-environment independence enhances statistical power for testing multiplicative interaction in case-control studies. However, under departure from this assumption, substantial bias in the estimates and inflated type I error in the corresponding tests can occur. In this paper, we extend the empirical Bayes (EB) approach previously developed for multiplicative interaction, which trades off between bias and efficiency in a data-adaptive way, to the additive scale. An EB estimator of the relative excess risk due to interaction is derived, and the corresponding Wald test is proposed with a general regression setting under a retrospective likelihood framework. We study the impact of gene-environment association on the resultant test with case-control data. Our simulation studies suggest that the EB approach uses the gene-environment independence assumption in a data-adaptive way and provides a gain in power compared with the standard logistic regression analysis and better control of type I error when compared with the analysis assuming gene-environment independence. We illustrate the methods with data from the Ovarian Cancer Association Consortium. © The Author(s) 2017. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health.
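
    For readers unfamiliar with the additive-scale measure named in this abstract, the sketch below computes the relative excess risk due to interaction (RERI) from logistic-regression coefficients, using the usual definition RERI = OR_GE - OR_G - OR_E + 1 with odds ratios standing in for relative risks. This is the plain plug-in quantity, not the empirical Bayes estimator the paper derives, and the function name and coefficient values are hypothetical.

```python
import math


def reri_from_log_odds(beta_g, beta_e, beta_ge):
    """Plug-in RERI from the coefficients of a logistic model with terms G, E and G*E."""
    or_g = math.exp(beta_g)                         # gene only
    or_e = math.exp(beta_e)                         # exposure only
    or_joint = math.exp(beta_g + beta_e + beta_ge)  # both present
    return or_joint - or_g - or_e + 1.0


# Odds ratios of 2 and 3 with no multiplicative interaction (joint OR = 6).
print(reri_from_log_odds(math.log(2), math.log(3), 0.0))
```

    A model with no multiplicative interaction can still show additive interaction: here the joint odds ratio of 6 exceeds the additive expectation of 4, giving a RERI of 2, which is one reason the two scales are tested separately.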

  1. ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

    PubMed

    Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

    2015-01-01

    Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.

  2. Virus-Induced Gene Silencing Using Tobacco Rattle Virus as a Tool to Study the Interaction between Nicotiana attenuata and Rhizophagus irregularis.

    PubMed

    Groten, Karin; Pahari, Nabin T; Xu, Shuqing; Miloradovic van Doorn, Maja; Baldwin, Ian T

    2015-01-01

    Most land plants live in a symbiotic association with arbuscular mycorrhizal fungi (AMF) that belong to the phylum Glomeromycota. Although a number of plant genes involved in the plant-AMF interactions have been identified by analyzing mutants, the ability to rapidly manipulate gene expression to study the potential functions of new candidate genes remains unrealized. We analyzed changes in gene expression of wild tobacco roots (Nicotiana attenuata) after infection with mycorrhizal fungi (Rhizophagus irregularis) by serial analysis of gene expression (SuperSAGE) combined with next generation sequencing, and established a virus-induced gene-silencing protocol to study the function of candidate genes in the interaction. From 92,434 SuperSAGE Tag sequences, 32,808 (35%) matched with our in-house Nicotiana attenuata transcriptome database and 3,698 (4%) matched to Rhizophagus genes. In total, 11,194 Tags showed a significant change in expression (p<0.05, >2-fold change) after infection. When comparing the functions of highly up-regulated annotated Tags in this study with those of two previous large-scale gene expression studies, 18 gene functions were found to be up-regulated in all three studies mainly playing roles related to phytohormone metabolism, catabolism and defense. To validate the function of identified candidate genes, we used the technique of virus-induced gene silencing (VIGS) to silence the expression of three putative N. attenuata genes: germin-like protein, indole-3-acetic acid-amido synthetase GH3.9 and, as a proof-of-principle, calcium and calmodulin-dependent protein kinase (CCaMK). The silencing of the three plant genes in roots was successful, but only CCaMK silencing had a significant effect on the interaction with R. irregularis. Interestingly, when a highly activated inoculum was used for plant inoculation, the effect of CCaMK silencing on fungal colonization was masked, probably due to trans-complementation. 
This study demonstrates that large

  3. An investigation of gene-environment interactions between 47 newly identified breast cancer susceptibility loci and environmental risk factors

    PubMed Central

    Rudolph, Anja; Milne, Roger L.; Truong, Thérèse; Knight, Julia A.; Seibold, Petra; Flesch-Janys, Dieter; Behrens, Sabine; Eilber, Ursula; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Dunning, Alison M.; Shah, Mitul; Munday, Hannah R.; Darabi, Hatef; Eriksson, Mikael; Brand, Judith S.; Olson, Janet; Vachon, Celine M.; Hallberg, Emily; Castelao, J. Esteban; Carracedo, Angel; Torres, Maria; Li, Jingmei; Humphreys, Keith; Cordina-Duverger, Emilie; Menegaux, Florence; Flyger, Henrik; Nordestgaard, Børge G.; Nielsen, Sune F.; Yesilyurt, Betul T.; Floris, Giuseppe; Leunen, Karin; Engelhardt, Ellen G.; Broeks, Annegien; Rutgers, Emiel J.; Glendon, Gord; Mulligan, Anna Marie; Cross, Simon; Reed, Malcolm; Gonzalez-Neira, Anna; Perez, José Ignacio Arias; Provenzano, Elena; Apicella, Carmel; Southey, Melissa C.; Spurdle, Amanda; Investigators, kConFab; Group, AOCS; Häberle, Lothar; Beckmann, Matthias W.; Ekici, Arif B.; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; McLean, Catriona; Baglietto, Laura; Chanock, Stephen J.; Lissowska, Jolanta; Sherman, Mark E.; Brüning, Thomas; Hamann, Ute; Ko, Yon-Dschun; Orr, Nick; Schoemaker, Minouk; Ashworth, Alan; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana M.; Mannermaa, Arto; Swerdlow, Anthony; Giles, Graham G.; Brenner, Hermann; Fasching, Peter A.; Chenevix-Trench, Georgia; Hopper, John; Benítez, Javier; Cox, Angela; Andrulis, Irene L.; Lambrechts, Diether; Gago-Dominguez, Manuela; Couch, Fergus; Czene, Kamila; Bojesen, Stig E.; Easton, Doug F.; Schmidt, Marjanka K.; Guénel, Pascal; Hall, Per; Pharoah, Paul D. P.; Garcia-Closas, Montserrat; Chang-Claude, Jenny

    2014-01-01

    A large genotyping project within the Breast Cancer Association Consortium (BCAC) recently identified 41 associations between single nucleotide polymorphisms (SNPs) and overall breast cancer (BC) risk. We investigated whether the effects of these 41 SNPs, as well as six SNPs associated with estrogen receptor (ER) negative BC risk are modified by 13 environmental risk factors for BC. Data from 22 studies participating in BCAC were pooled, comprising up to 26,633 cases and 30,119 controls. Interactions between SNPs and environmental factors were evaluated using an empirical Bayes-type shrinkage estimator. Six SNPs showed interactions with associated p-values (pint) <1.1×10^−3. None of the observed interactions was significant after accounting for multiple testing. The Bayesian False Discovery Probability was used to rank the findings, which indicated three interactions as being noteworthy at 1% prior probability of interaction. SNP rs6828523 was associated with increased ER-negative BC risk in women ≥170cm (OR=1.22, p=0.017), but inversely associated with ER-negative BC risk in women <160cm (OR=0.83, p=0.039, pint=1.9×10^−4). The inverse association between rs4808801 and overall BC risk was stronger for women who had had four or more pregnancies (OR=0.85, p=2.0×10^−4), and absent in women who had had just one (OR=0.96, p=0.19, pint=6.1×10^−4). SNP rs11242675 was inversely associated with overall BC risk in never/former smokers (OR=0.93, p=2.8×10^−5), but no association was observed in current smokers (OR=1.07, p=0.14, pint=3.4×10^−4). In conclusion, recently identified breast cancer susceptibility loci are not strongly modified by established risk factors and the observed potential interactions require confirmation in independent studies. PMID:25227710

  4. A recellularized human colon model identifies cancer driver genes

    PubMed Central

    Chen, Huanhuan Joyce; Wei, Zhubo; Sun, Jian; Bhattacharya, Asmita; Savage, David J; Serda, Rita; Mackeyev, Yuri; Curley, Steven A.; Bu, Pengcheng; Wang, Lihua; Chen, Shuibing; Cohen-Gould, Leona; Huang, Emina; Shen, Xiling; Lipkin, Steven M.; Copeland, Neal G.; Jenkins, Nancy A.; Shuler, Michael L.

    2016-01-01

    Refined cancer models are needed to bridge the gap between cell-line, animal and clinical research. Here we describe the engineering of an organotypic colon cancer model by recellularization of a native human matrix that contains cell-populated mucosa and an intact muscularis mucosa layer. This ex vivo system recapitulates the pathophysiological progression from APC-mutant neoplasia to submucosal invasive tumor. We used it to perform a Sleeping Beauty transposon mutagenesis screen to identify genes that cooperate with mutant APC in driving invasive neoplasia. 38 candidate invasion driver genes were identified, 17 of which have been previously implicated in colorectal cancer progression, including TCF7L2, TWIST2, MSH2, DCC and EPHB1/2. Six invasion driver genes that to our knowledge have not been previously described were validated in vitro using cell proliferation, migration and invasion assays, and ex vivo using recellularized human colon. These results demonstrate the utility of our organoid model for studying cancer biology. PMID:27398792

  5. Testing for gene-environment interaction under exposure misspecification.

    PubMed

    Sun, Ryan; Carroll, Raymond J; Christiani, David C; Lin, Xihong

    2017-11-09

    Complex interplay between genetic and environmental factors characterizes the etiology of many diseases. Modeling gene-environment (GxE) interactions is often challenged by the unknown functional form of the environment term in the true data-generating mechanism. We study the impact of misspecification of the environmental exposure effect on inference for the GxE interaction term in linear and logistic regression models. We first examine the asymptotic bias of the GxE interaction regression coefficient, allowing for confounders as well as arbitrary misspecification of the exposure and confounder effects. For linear regression, we show that under gene-environment independence and some confounder-dependent conditions, when the environment effect is misspecified, the regression coefficient of the GxE interaction can be unbiased. However, inference on the GxE interaction is still often incorrect. In logistic regression, we show that the regression coefficient is generally biased if the genetic factor is associated with the outcome directly or indirectly. Further, we show that the standard robust sandwich variance estimator for the GxE interaction does not perform well in practical GxE studies, and we provide an alternative testing procedure that has better finite sample properties. © 2017, The International Biometric Society.
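
    The linear-regression claim in this abstract can be illustrated with a small simulation. The sketch below is an assumption-laden toy, not the authors' analysis: the outcome depends on the exposure through E squared, the fitted model wrongly uses a linear E term, and G and E are generated independently. All names and parameter values are hypothetical, and ordinary least squares is solved with a hand-written Gaussian elimination so that only the standard library is needed.

```python
import random


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def ols(rows, y):
    """Ordinary least squares via the normal equations (X^T X) beta = X^T y."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve_linear(xtx, xty)


def simulate_and_fit(n=20000, beta_ge=0.5, seed=0):
    """Generate Y = G + E^2 + beta_ge*G*E + noise, then fit Y ~ 1 + G + E + G*E."""
    rng = random.Random(seed)
    rows, y = [], []
    for _ in range(n):
        g = int(rng.random() < 0.3) + int(rng.random() < 0.3)  # genotype 0/1/2
        e = rng.gauss(0, 1)                                    # exposure, independent of g
        y.append(1.0 * g + e * e + beta_ge * g * e + rng.gauss(0, 1))
        rows.append([1.0, float(g), e, g * e])                 # misspecified: linear E only
    return ols(rows, y)


coef = simulate_and_fit()
print("estimated GxE coefficient:", coef[3])
```

    In this particular setup the omitted quadratic term is uncorrelated with the interaction regressor, so the interaction estimate should land close to the true value of 0.5 even though the exposure effect is misspecified. The sketch says nothing about the variance estimation and testing issues that the abstract also raises.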

  6. Gene-by-Socioeconomic Status Interaction on School Readiness

    PubMed Central

    Rhemtulla, Mijke; Tucker-Drob, Elliot M.

    2017-01-01

    In previous work with a nationally representative sample of over 1,400 monozygotic and dizygotic twins born in the United States, Tucker-Drob, Rhemtulla, Harden, Turkheimer, and Fask (2011; Psychological Science, 22, 125–133) uncovered a gene × environment interaction on scores on the Bayley Short Form test of mental ability at 2 years of age—higher socioeconomic status (SES) was associated not only with higher mental ability, but also with larger genetic contributions to individual differences in mental ability. The current study examined gene × SES interactions in mathematics skill and reading skill at 4 years of age (preschool age) in the same sample of twins, and further examined whether interactions detected at 4 years could be attributed to the persistence of the interaction previously observed at 2 years. For early mathematics skill but not early reading skill, genetic influences were more pronounced at higher levels of SES. This interaction was not accounted for by the interaction observed at 2 years. These findings indicate that SES moderates the etiological influences on certain cognitive functions at multiple stages of child development. PMID:22350185

  7. Gene expression profiling combined with bioinformatics analysis identify biomarkers for Parkinson disease.

    PubMed

    Diao, Hongyu; Li, Xinxing; Hu, Sheng; Liu, Yunhui

    2012-01-01

    Parkinson disease (PD) progresses relentlessly and affects approximately 4% of the population aged over 80 years. It is difficult to diagnose in its early stages. The purpose of our study is to identify molecular biomarkers for PD initiation using a computational bioinformatics analysis of gene expression. We downloaded the gene expression profile of PD from Gene Expression Omnibus and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in PD patients compared to controls. In addition, we built a regulatory network by mapping the DCGs to known regulatory data between transcription factors (TFs) and target genes and calculated the regulatory impact factor of each transcription factor. As a result, a total of 1004 genes associated with PD initiation were identified. Pathway enrichment of these genes suggests that biological processes of protein turnover were impaired in PD. In the regulatory network, HLF, E2F1 and STAT4 were found to have altered expression levels in PD patients. The expression levels of other transcription factors, NKX3-1, TAL1, RFX1 and EGR3, were not found to be altered. However, they regulated differentially expressed genes. In conclusion, we suggest that HLF, E2F1 and STAT4 may be used as molecular biomarkers for PD; however, more work is needed to validate our result.

  8. Gene Expression Profiling Combined with Bioinformatics Analysis Identify Biomarkers for Parkinson Disease

    PubMed Central

    Diao, Hongyu; Li, Xinxing; Hu, Sheng; Liu, Yunhui

    2012-01-01

    Parkinson disease (PD) progresses relentlessly and affects approximately 4% of the population aged over 80 years. It is difficult to diagnose in its early stages. The purpose of our study is to identify molecular biomarkers for PD initiation using a computational bioinformatics analysis of gene expression. We downloaded the gene expression profile of PD from Gene Expression Omnibus and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in PD patients compared to controls. In addition, we built a regulatory network by mapping the DCGs to known regulatory data between transcription factors (TFs) and target genes and calculated the regulatory impact factor of each transcription factor. As a result, a total of 1004 genes associated with PD initiation were identified. Pathway enrichment of these genes suggests that biological processes of protein turnover were impaired in PD. In the regulatory network, HLF, E2F1 and STAT4 were found to have altered expression levels in PD patients. The expression levels of other transcription factors, NKX3-1, TAL1, RFX1 and EGR3, were not found to be altered. However, they regulated differentially expressed genes. In conclusion, we suggest that HLF, E2F1 and STAT4 may be used as molecular biomarkers for PD; however, more work is needed to validate our result. PMID:23284986

  9. Axon Regeneration Genes Identified by RNAi Screening in C. elegans

    PubMed Central

    Nix, Paola; Hammarlund, Marc; Hauth, Linda; Lachnit, Martina; Jorgensen, Erik M.

    2014-01-01

    Axons of the mammalian CNS lose the ability to regenerate soon after development due to both an inhibitory CNS environment and the loss of cell-intrinsic factors necessary for regeneration. The complex molecular events required for robust regeneration of mature neurons are not fully understood, particularly in vivo. To identify genes affecting axon regeneration in Caenorhabditis elegans, we performed both an RNAi-based screen for defective motor axon regeneration in unc-70/β-spectrin mutants and a candidate gene screen. From these screens, we identified at least 50 conserved genes with growth-promoting or growth-inhibiting functions. Through our analysis of mutants, we shed new light on certain aspects of regeneration, including the role of β-spectrin and membrane dynamics, the antagonistic activity of MAP kinase signaling pathways, and the role of stress in promoting axon regeneration. Many gene candidates had not previously been associated with axon regeneration and implicate new pathways of interest for therapeutic intervention. PMID:24403161

  10. A gene co-expression network model identifies yield-related vicinity networks in Jatropha curcas shoot system.

    PubMed

    Govender, Nisha; Senan, Siju; Mohamed-Hussein, Zeti-Azura; Wickneswari, Ratnam

    2018-06-15

    The plant shoot system consists of reproductive organs such as inflorescences, buds and fruits, and the vegetative leaves and stems. In this study, the reproductive part of the Jatropha curcas shoot system, which includes the aerial shoots, shoots bearing the inflorescence and the inflorescence, was investigated with regard to gene-to-gene interactions underpinning yield-related biological processes. RNA-seq based sequencing of shoot tissues performed on an Illumina HiSeq 2500 platform generated 18 transcriptomes. Using the reference genome-based mapping approach, a total of 64 361 genes were identified in all samples and the data were annotated against the non-redundant database using the BLAST2GO Pro suite. After removing the outlier genes and samples, a total of 12 734 genes across 17 samples were subjected to gene co-expression network construction using petal, an R library. A gene co-expression network model built with scale-free and small-world properties extracted four vicinity networks (VNs) with putative involvement in yield-related biological processes as follows: heat stress tolerance, floral and shoot meristem differentiation, biosynthesis of chlorophyll molecules and laticifers, cell wall metabolism and epigenetic regulations. Our VNs revealed putative key players that could be adapted in breeding strategies for J. curcas shoot system improvements.

  11. Gene expression profiles analysis identifies key genes for acute lung injury in patients with sepsis.

    PubMed

    Guo, Zhiqiang; Zhao, Chuncheng; Wang, Zheng

    2014-09-26

    To identify critical genes and biological pathways in acute lung injury (ALI), a comparative analysis of gene expression profiles of patients with ALI + sepsis compared with patients with sepsis alone was performed with bioinformatic tools. GSE10474 was downloaded from Gene Expression Omnibus, including a collection of 13 whole blood samples with ALI + sepsis and 21 whole blood samples with sepsis alone. After pre-treatment with the robust multichip averaging (RMA) method, differential analysis was conducted using the simpleaffy package based upon t-test and fold change. Hierarchical clustering was also performed using the function hclust from the package stats. Besides, functional enrichment analysis was conducted using iGepros. Moreover, the gene regulatory network was constructed with information from Kyoto Encyclopedia of Genes and Genomes (KEGG) and then visualized by Cytoscape. A total of 128 differentially expressed genes (DEGs) were identified, including 47 up- and 81 down-regulated genes. The significantly enriched functions included negative regulation of cell proliferation, regulation of response to stimulus and cellular component morphogenesis. A total of 27 DEGs were significantly enriched in 16 KEGG pathways, such as protein digestion and absorption, fatty acid metabolism, amoebiasis, etc. Furthermore, the regulatory network of these 27 DEGs was constructed, which involved several key genes, including protein tyrosine kinase 2 (PTK2), v-src avian sarcoma (SRC) and Caveolin 2 (CAV2). PTK2, SRC and CAV2 may be potential markers for diagnosis and treatment of ALI. The virtual slide(s) for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/5865162912987143.

  12. A novel approach to simulate gene-environment interactions in complex diseases.

    PubMed

    Amato, Roberto; Pinelli, Michele; D'Andrea, Daniel; Miele, Gennaro; Nicodemi, Mario; Raiconi, Giancarlo; Cocozza, Sergio

    2010-01-05

    Complex diseases are multifactorial traits caused by both genetic and environmental factors. They represent the major part of human diseases and include those with largest prevalence and mortality (cancer, heart disease, obesity, etc.). Despite a large amount of information that has been collected about both genetic and environmental risk factors, there are few examples of studies on their interactions in epidemiological literature. One reason can be the incomplete knowledge of the power of statistical methods designed to search for risk factors and their interactions in these data sets. An improvement in this direction would lead to a better understanding and description of gene-environment interactions. To this aim, a possible strategy is to challenge the different statistical methods against data sets where the underlying phenomenon is completely known and fully controllable, for example simulated ones. We present a mathematical approach that models gene-environment interactions. By this method it is possible to generate simulated populations having gene-environment interactions of any form, involving any number of genetic and environmental factors and also allowing non-linear interactions as epistasis. In particular, we implemented a simple version of this model in a Gene-Environment iNteraction Simulator (GENS), a tool designed to simulate case-control data sets where a one gene-one environment interaction influences the disease risk. The main aim has been to allow the input of population characteristics by using standard epidemiological measures and to implement constraints to make the simulator behaviour biologically meaningful. By the multi-logistic model implemented in GENS it is possible to simulate case-control samples of complex disease where gene-environment interactions influence the disease risk. The user has full control of the main characteristics of the simulated population and a Monte Carlo process allows random variability. A knowledge
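
    The kind of simulation this abstract describes, a case-control data set in which one gene and one environmental factor interact to influence disease risk, can be sketched as follows. This is a generic logistic-risk simulation written for illustration and is not the GENS implementation or its multi-logistic model; every function name, parameter name and default value is an assumption.

```python
import math
import random


def simulate_population(n, maf=0.3, beta0=-3.0, beta_g=0.2, beta_e=0.3,
                        beta_ge=0.5, seed=1):
    """Draw (genotype, exposure, disease) triples from a logistic risk model with a G*E term."""
    rng = random.Random(seed)
    people = []
    for _ in range(n):
        # Genotype = number of risk alleles (0, 1 or 2), assuming Hardy-Weinberg proportions.
        g = int(rng.random() < maf) + int(rng.random() < maf)
        e = rng.gauss(0, 1)  # standardized continuous exposure
        logit = beta0 + beta_g * g + beta_e * e + beta_ge * g * e
        risk = 1.0 / (1.0 + math.exp(-logit))
        d = int(rng.random() < risk)  # Monte Carlo draw of disease status
        people.append((g, e, d))
    return people


def sample_case_control(people, n_cases, n_controls, seed=2):
    """Retrospectively sample fixed numbers of cases and controls from the population."""
    rng = random.Random(seed)
    cases = [p for p in people if p[2] == 1]
    controls = [p for p in people if p[2] == 0]
    return rng.sample(cases, n_cases) + rng.sample(controls, n_controls)


population = simulate_population(20000)
study = sample_case_control(population, 500, 500)
print(len(study), sum(d for _, _, d in study))
```

    Because the true coefficients are known, a data set like `study` can be passed to any interaction-detection method to see how often the planted G*E effect is recovered, which is the benchmarking use the abstract motivates.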

  13. Systems biology approach to late-onset Alzheimer's disease genome-wide association study identifies novel candidate genes validated using brain expression data and Caenorhabditis elegans experiments.

    PubMed

    Mukherjee, Shubhabrata; Russell, Joshua C; Carr, Daniel T; Burgess, Jeremy D; Allen, Mariet; Serie, Daniel J; Boehme, Kevin L; Kauwe, John S K; Naj, Adam C; Fardo, David W; Dickson, Dennis W; Montine, Thomas J; Ertekin-Taner, Nilufer; Kaeberlein, Matt R; Crane, Paul K

    2017-10-01

    We sought to determine whether a systems biology approach may identify novel late-onset Alzheimer's disease (LOAD) loci. We performed gene-wide association analyses and integrated results with human protein-protein interaction data using network analyses. We performed functional validation on novel genes using a transgenic Caenorhabditis elegans Aβ proteotoxicity model and evaluated novel genes using brain expression data from people with LOAD and other neurodegenerative conditions. We identified 13 novel candidate LOAD genes outside chromosome 19. Of those, RNA interference knockdowns of the C. elegans orthologs of UBC, NDUFS3, EGR1, and ATP5H were associated with Aβ toxicity, and NDUFS3, SLC25A11, ATP5H, and APP were differentially expressed in the temporal cortex. Network analyses identified novel LOAD candidate genes. We demonstrated a functional role for four of these in a C. elegans model and found enrichment of differentially expressed genes in the temporal cortex. Copyright © 2017 the Alzheimer's Association. Published by Elsevier Inc. All rights reserved.

  14. Combining Genome-Scale Experimental and Computational Methods To Identify Essential Genes in Rhodobacter sphaeroides

    DOE PAGES

    Burger, Brian T.; Imam, Saheed; Scarborough, Matthew J.; ...

    2017-06-06

    Rhodobacter sphaeroides is one of the best-studied alphaproteobacteria from biochemical, genetic, and genomic perspectives. To gain a better systems-level understanding of this organism, we generated a large transposon mutant library and used transposon sequencing (Tn-seq) to identify genes that are essential under several growth conditions. Using newly developed Tn-seq analysis software (TSAS), we identified 493 genes as essential for aerobic growth on a rich medium. We then used the mutant library to identify conditionally essential genes under two laboratory growth conditions, identifying 85 additional genes required for aerobic growth in a minimal medium and 31 additional genes required for photosynthetic growth. In all instances, our analyses confirmed essentiality for many known genes and identified genes not previously considered to be essential. We used the resulting Tn-seq data to refine and improve a genome-scale metabolic network model (GEM) for R. sphaeroides. Together, we demonstrate how genetic, genomic, and computational approaches can be combined to obtain a systems-level understanding of the genetic framework underlying metabolic diversity in bacterial species.

  15. Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.

    PubMed

    Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming

    2017-01-01

    Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapidly evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included 7 additional previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in secretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76. We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst. The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst, which will enable us to understand molecular mechanisms

  16. Yeast Phenomics: An Experimental Approach for Modeling Gene Interaction Networks that Buffer Disease

    PubMed Central

    Hartman, John L.; Stisher, Chandler; Outlaw, Darryl A.; Guo, Jingyu; Shah, Najaf A.; Tian, Dehua; Santos, Sean M.; Rodgers, John W.; White, Richard A.

    2015-01-01

    The genome project increased appreciation of genetic complexity underlying disease phenotypes: many genes contribute to each phenotype and each gene contributes to multiple phenotypes. The aspiration of predicting common disease in individuals has evolved from seeking primary loci to marginal risk assignments based on many genes. Genetic interaction, defined as contributions to a phenotype that are dependent upon particular digenic allele combinations, could improve prediction of phenotype from complex genotype, but it is difficult to study in human populations. High-throughput, systematic analysis of S. cerevisiae gene knockouts or knockdowns in the context of disease-relevant phenotypic perturbations provides a tractable experimental approach to derive gene interaction networks, in order to deduce by cross-species gene homology how phenotype is buffered against disease-risk genotypes. Yeast gene interaction network analysis to date has revealed biology more complex than previously imagined. This has motivated the development of more powerful yeast cell array phenotyping methods to globally model the role of gene interaction networks in modulating phenotypes (which we call yeast phenomic analysis). The article illustrates yeast phenomic technology, which is applied here to quantify gene X media interaction at higher resolution and supports use of human-like media for future applications of yeast phenomics for modeling human disease. PMID:25668739

  17. ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes

    PubMed Central

    Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

    2015-01-01

    Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614
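
    The area-under-the-curve comparison reported in this abstract (ICan: 0.8179, CNAmet: 0.5183) can be illustrated with a minimal sketch of how a ROC AUC is computed from gene scores and known labels. The function name, scores and labels below are hypothetical and are not taken from the study; the abstract does not describe how its AUC values were calculated.

```python
def roc_auc(scores, labels):
    """Rank-based ROC AUC.

    Equals the probability that a randomly chosen positive item
    (e.g. a known cancer gene) scores higher than a randomly chosen
    negative item; ties count as one half.
    """
    positives = [s for s, y in zip(scores, labels) if y]
    negatives = [s for s, y in zip(scores, labels) if not y]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical example: four genes, two of them labelled as known cancer genes.
example_auc = roc_auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
```

    An AUC near 0.5 means the ranking is no better than chance, which is why the gap between the two reported values is read as a substantial difference.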

  18. MMTV insertional mutagenesis identifies genes, gene families and pathways involved in mammary cancer.

    PubMed

    Theodorou, Vassiliki; Kimm, Melanie A; Boer, Mandy; Wessels, Lodewyk; Theelen, Wendy; Jonkers, Jos; Hilkens, John

    2007-06-01

    We performed a high-throughput retroviral insertional mutagenesis screen in mouse mammary tumor virus (MMTV)-induced mammary tumors and identified 33 common insertion sites, of which 17 genes were previously not known to be associated with mammary cancer and 13 had not previously been linked to cancer in general. Although members of the Wnt and fibroblast growth factors (Fgf) families were frequently tagged, our exhaustive screening for MMTV insertion sites uncovered a new repertoire of candidate breast cancer oncogenes. We validated one of these genes, Rspo3, as an oncogene by overexpression in a p53-deficient mammary epithelial cell line. The human orthologs of the candidate oncogenes were frequently deregulated in human breast cancers and associated with several tumor parameters. Computational analysis of all MMTV-tagged genes uncovered specific gene families not previously associated with cancer and showed a significant overrepresentation of protein domains and signaling pathways mainly associated with development and growth factor signaling. Comparison of all tagged genes in MMTV and Moloney murine leukemia virus-induced malignancies showed that both viruses target mostly different genes that act predominantly in distinct pathways.

  19. nana plant2 Encodes a Maize Ortholog of the Arabidopsis Brassinosteroid Biosynthesis Gene DWARF1, Identifying Developmental Interactions between Brassinosteroids and Gibberellins

    PubMed Central

    Budka, Josh; Fujioka, Shozo; Johal, Gurmukh

    2016-01-01

    A small number of phytohormones dictate the pattern of plant form affecting fitness via reproductive architecture and the plant’s ability to forage for light, water, and nutrients. Individual phytohormone contributions to plant architecture have been studied extensively, often following a single component of plant architecture, such as plant height or branching. Both brassinosteroid (BR) and gibberellin (GA) affect plant height, branching, and sexual organ development in maize (Zea mays). We identified the molecular basis of the nana plant2 (na2) phenotype as a loss-of-function mutation in one of the two maize paralogs of the Arabidopsis (Arabidopsis thaliana) BR biosynthetic gene DWARF1 (DWF1). These mutants accumulate the DWF1 substrate 24-methylenecholesterol and exhibit decreased levels of downstream BR metabolites. We utilized this mutant and known GA biosynthetic mutants to investigate the genetic interactions between BR and GA. Double mutants exhibited additivity for some phenotypes and epistasis for others with no unifying pattern, indicating that BR and GA interact to affect development but in a context-dependent manner. Similar results were observed in double mutant analyses using additional BR and GA biosynthetic mutant loci. Thus, the BR and GA interactions were neither locus nor allele specific. Exogenous application of GA3 to na2 and d5, a GA biosynthetic mutant, also resulted in a diverse pattern of growth responses, including BR-dependent GA responses. These findings demonstrate that BR and GA do not interact via a single inclusive pathway in maize but rather suggest that differential signal transduction and downstream responses are affected dependent upon the developmental context. PMID:27288361

  20. Identifying Mendelian disease genes with the Variant Effect Scoring Tool

    PubMed Central

    2013-01-01

    Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. VEST is
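
    The idea of aggregating variant-level p-values into a gene-level score, as described in this abstract, can be sketched as follows. The abstract does not say which combination rule VEST uses, so this sketch assumes Fisher's method, a standard way to combine independent p-values; the function names, gene names and p-values are hypothetical.

```python
import math


def fisher_combined_pvalue(pvalues):
    """Combine independent p-values with Fisher's method (an assumption here).

    The statistic -2 * sum(ln p) follows a chi-square distribution with
    2k degrees of freedom under the null, and for even degrees of freedom
    the survival function has a closed form, so no external library is needed.
    """
    if not pvalues:
        raise ValueError("need at least one p-value")
    if any(p <= 0.0 or p > 1.0 for p in pvalues):
        raise ValueError("p-values must lie in (0, 1]")

    k = len(pvalues)
    half_stat = -sum(math.log(p) for p in pvalues)  # statistic / 2

    # P(chi2_{2k} >= x) = exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total


def rank_genes(variant_pvalues_by_gene):
    """Return (gene, combined p-value) pairs, most significant first."""
    scored = {
        gene: fisher_combined_pvalue(pvalues)
        for gene, pvalues in variant_pvalues_by_gene.items()
    }
    return sorted(scored.items(), key=lambda item: item[1])


# Hypothetical variant p-values observed across several disease exomes.
ranking = rank_genes({
    "GENE_A": [0.001, 0.01, 0.02],
    "GENE_B": [0.5, 0.4],
})
```

    Fisher's method assumes the combined p-values are independent; variants in the same gene from related cases may violate that assumption, so the combined value is best read as a ranking score.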

  1. Gene-environment interactions in geriatric depression.

    PubMed

    Lotrich, Francis E

    2011-06-01

    In older adults, several environmental challenges can potentially trigger the onset of an episode of major depression. Vulnerability to these challenges can be influenced by genetics. There is accumulating evidence for an interaction between stress and a serotonin transporter polymorphism, though there is also heterogeneity among studies. Other relevant genes include those encoding for the neuroendocrine stress axis, growth factors, and other monoaminergic systems. Each of these may interact with either predisposing traumas in early childhood or precipitating events later in life. Copyright © 2011 Elsevier Inc. All rights reserved.

  2. Systematic analysis of microarray datasets to identify Parkinson's disease‑associated pathways and genes.

    PubMed

    Feng, Yinling; Wang, Xuefeng

    2017-03-01

    In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.
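
    The hypergeometric distribution-based test mentioned in this abstract can be sketched as a one-sided overlap test: given a background of genes, how surprising is the number of genes shared by two sets? The abstract gives no parameter values, so the function name, argument names and numbers below are hypothetical.

```python
from math import comb


def hypergeom_overlap_pvalue(population, set_a_size, set_b_size, overlap):
    """Upper-tail hypergeometric p-value P(X >= overlap).

    X is the number of genes from set A that appear when set_b_size genes
    are drawn without replacement from a background of `population` genes.
    """
    if not 0 <= set_a_size <= population:
        raise ValueError("set_a_size must be between 0 and population")
    if not 0 <= set_b_size <= population:
        raise ValueError("set_b_size must be between 0 and population")

    lowest = max(overlap, 0)
    highest = min(set_a_size, set_b_size)
    total_draws = comb(population, set_b_size)

    # math.comb returns 0 when k > n, so impossible overlaps contribute nothing.
    tail = sum(
        comb(set_a_size, i) * comb(population - set_a_size, set_b_size - i)
        for i in range(lowest, highest + 1)
    )
    return tail / total_draws


# Hypothetical example: two pathways of 40 and 30 genes sharing 8 genes,
# against a background of 736 candidate genes.
p_shared = hypergeom_overlap_pvalue(736, 40, 30, 8)
```

    A small p-value suggests the two gene sets share more members than expected by chance, which is one plausible basis for calling two pathways a pair.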

  3. Identifying Candidate Reprogramming Genes in Mouse Induced Pluripotent Stem Cells.

    PubMed

    Gao, Fang; Li, Jingyu; Zhang, Heng; Yang, Xu; An, Tiezhu

    2017-08-01

    Factor-based induced reprogramming approaches have tremendous potential for human regenerative medicine, but the efficiencies of these approaches are still low. In this study, we analyzed the global transcriptional profiles of mouse induced pluripotent stem cells (miPSCs) and mouse embryonic stem cells (mESCs) from seven different labs and present here the first successful clustering according to cell type, not by lab of origin. We identified 2131 differentially expressed genes (DEs) as candidate pluripotency-associated genes by comparing mESCs/miPSCs with somatic cells and 720 DEs between miPSCs and mESCs. Interestingly, there was a significant overlap between the two DE sets. Therefore, we defined the overlapping DEs as "consensus DEs", including 313 miPSC-specific genes expressed at a higher level in miPSCs versus mESCs and 184 mESC-specific genes in total, and reasoned that these may contribute to the differences in pluripotency between mESCs and miPSCs. A classification of "consensus DEs" according to their different expression levels between somatic cells and mESCs/miPSCs shows that 86% of the miPSC-specific genes are more highly expressed in somatic cells, while 73% of mESC-specific genes are highly expressed in mESCs/miPSCs, indicating that the miPSCs have not efficiently silenced the expression pattern of the somatic cells from which they are derived and have failed to completely induce the genes with high expression levels in mESCs. We further revealed a strong correlation between oocyte-enriched factors and insufficiently induced mESC-specific genes and identified 11 hub genes via network analysis. In light of these findings, we postulated that these key hub genes might not only drive somatic cell nuclear transfer (SCNT) reprogramming but also augment the efficiency and quality of miPSC reprogramming.

  4. Current Status and Challenges in Identifying Disease Resistance Genes in Brassica napus

    PubMed Central

    Neik, Ting Xiang; Barbetti, Martin J.; Batley, Jacqueline

    2017-01-01

    Brassica napus is an economically important crop across different continents including temperate and subtropical regions in Europe, Canada, South Asia, China and Australia. Its widespread cultivation also brings setbacks as it plays host to fungal, oomycete and chytrid pathogens that can lead to serious yield loss. For sustainable crop production, identification of resistance (R) genes in B. napus has become of critical importance. In this review, we discuss four key pathogens affecting Brassica crops: Clubroot (Plasmodiophora brassicae), Blackleg (Leptosphaeria maculans and L. biglobosa), Sclerotinia Stem Rot (Sclerotinia sclerotiorum), and Downy Mildew (Hyaloperonospora parasitica). We first review current studies covering prevalence of these pathogens on Brassica crops and highlight the R genes and QTL that have been identified from Brassica species against these pathogens. Insights into the relationships between the pathogen and its Brassica host, the unique host resistance mechanisms and how these affect resistance outcomes is also presented. We discuss challenges in identification and deployment of R genes in B. napus in relation to highly specific genetic interactions between host subpopulations and pathogen pathotypes and emphasize the need for common or shared techniques and research materials or tighter collaboration between researchers to reconcile the inconsistencies in the research outcomes. Using current genomics tools, we provide examples of how characterization and cloning of R genes in B. napus can be carried out more effectively. Lastly, we put forward strategies to breed resistant cultivars through introgressions supported by genomic approaches and suggest prospects that can be implemented in the future for a better, pathogen-resistant B. napus. PMID:29163558

  5. Exome Sequencing Identifies Three Novel Candidate Genes Implicated in Intellectual Disability

    PubMed Central

    Azam, Maleeha; Ayub, Humaira; Vissers, Lisenka E. L. M.; Gilissen, Christian; Ali, Syeda Hafiza Benish; Riaz, Moeen; Veltman, Joris A.; Pfundt, Rolph; van Bokhoven, Hans; Qamar, Raheel

    2014-01-01

    Intellectual disability (ID) is a major health problem, mostly with an unknown etiology. Recently, exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore, the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. The whole exomes of three ID probands were sequenced. Missense variations in two plausible novel genes implicated in autosomal recessive ID were identified: lysine (K)-specific methyltransferase 2B (KMT2B) and zinc finger protein 589 (ZNF589), as well as hedgehog acyltransferase (HHAT) with a de novo mutation with autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of a recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that a large number of ID genes converge on a limited number of common networks: ZNF589 belongs to KRAB-domain zinc-finger proteins previously implicated in ID, HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID, and KMT2B associated with syndromic ID fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID. PMID:25405613

  6. Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits.

    PubMed

    Mancuso, Nicholas; Shi, Huwenbo; Goddard, Pagé; Kichaev, Gleb; Gusev, Alexander; Pasaniuc, Bogdan

    2017-03-02

    Although genome-wide association studies (GWASs) have identified thousands of risk loci for many complex traits and diseases, the causal variants and genes at these loci remain largely unknown. Here, we introduce a method for estimating the local genetic correlation between gene expression and a complex trait and utilize it to estimate the genetic correlation due to predicted expression between pairs of traits. We integrated gene expression measurements from 45 expression panels with summary GWAS data to perform 30 multi-tissue transcriptome-wide association studies (TWASs). We identified 1,196 genes whose expression is associated with these traits; of these, 168 reside more than 0.5 Mb away from any previously reported GWAS significant variant. We then used our approach to find 43 pairs of traits with significant genetic correlation at the level of predicted expression; of these, eight were not found through genetic correlation at the SNP level. Finally, we used bi-directional regression to find evidence that BMI causally influences triglyceride levels and that triglyceride levels causally influence low-density lipoprotein. Together, our results provide insight into the role of gene expression in the susceptibility of complex traits and diseases. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  7. Synergistic interactions of biotic and abiotic environmental stressors on gene expression.

    PubMed

    Altshuler, Ianina; McLeod, Anne M; Colbourne, John K; Yan, Norman D; Cristescu, Melania E

    2015-03-01

    Understanding the response of organisms to multiple stressors is critical for predicting if populations can adapt to rapid environmental change. Natural and anthropogenic stressors often interact, complicating general predictions. In this study, we examined the interactive and cumulative effects of two common environmental stressors, lowered calcium concentration, an anthropogenic stressor, and predator presence, a natural stressor, on the water flea Daphnia pulex. We analyzed expression changes of five genes involved in calcium homeostasis - cuticle proteins (Cutie, Icp2), calbindin (Calb), and calcium pump and channel (Serca and Ip3R) - using real-time quantitative PCR (RT-qPCR) in a full factorial experiment. We observed strong synergistic interactions between low calcium concentration and predator presence. While the Ip3R gene was not affected by the stressors, the other four genes were affected in their transcriptional levels by the combination of the stressors. Transcriptional patterns of genes that code for cuticle proteins (Cutie and Icp2) and a sarcoplasmic calcium pump (Serca) only responded to the combination of stressors, changing their relative expression levels in a synergistic response, while a calcium-binding protein (Calb) responded to low calcium stress and the combination of both stressors. The expression patterns of these genes (Cutie, Icp2, and Serca) were nonlinear, yet they were dose dependent across the calcium gradient. Multiple stressors can have complex, often unexpected effects on ecosystems. This study demonstrates that the dominant interaction for the set of tested genes appears to be synergism. We argue that gene expression patterns can be used to understand and predict the type of interaction expected when organisms are exposed simultaneously to natural and anthropogenic stressors.

  8. Genome-wide gene–gene interaction analysis for next-generation sequencing

    PubMed Central

    Zhao, Jinying; Zhu, Yun; Xiong, Momiao

    2016-01-01

    The critical barrier in interaction analysis for next-generation sequencing (NGS) data is that the traditional pairwise interaction analysis that is suitable for common variants is difficult to apply to rare variants because of their prohibitive computational time, large number of tests and low power. The great challenges for successful detection of interactions with NGS data are (1) the demands in the paradigm of changes in interaction analysis; (2) severe multiple testing; and (3) heavy computations. To meet these challenges, we shift the paradigm of interaction analysis between two SNPs to interaction analysis between two genomic regions. In other words, we take a gene as a unit of analysis and use functional data analysis techniques as dimensional reduction tools to develop a novel statistic to collectively test interaction between all possible pairs of SNPs within two genome regions. By intensive simulations, we demonstrate that the functional logistic regression for interaction analysis has the correct type 1 error rates and higher power to detect interaction than the currently used methods. The proposed method was applied to a coronary artery disease dataset from the Wellcome Trust Case Control Consortium (WTCCC) study and the Framingham Heart Study (FHS) dataset, and the early-onset myocardial infarction (EOMI) exome sequence datasets with European origin from the NHLBI's Exome Sequencing Project. We discovered that 6 of 27 pairs of significantly interacted genes in the FHS were replicated in the independent WTCCC study and 24 pairs of significantly interacted genes after applying Bonferroni correction in the EOMI study. PMID:26173972
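
    The multiple-testing burden and the Bonferroni correction mentioned in this abstract can be made concrete with a short sketch. Treating each gene pair as one test, the number of tests grows quadratically with the number of genes. The function names, gene names and p-values are hypothetical, and the abstract does not state how many genes or tests the study actually involved.

```python
def bonferroni_pairwise_threshold(n_genes, alpha=0.05):
    """Return (number of gene-pair tests, Bonferroni per-test threshold)."""
    if n_genes < 2:
        raise ValueError("need at least two genes to form a pair")
    n_tests = n_genes * (n_genes - 1) // 2  # unordered gene pairs
    return n_tests, alpha / n_tests


def significant_pairs(pair_pvalues, n_genes, alpha=0.05):
    """Keep the gene pairs whose p-value passes the Bonferroni threshold."""
    _, threshold = bonferroni_pairwise_threshold(n_genes, alpha)
    return sorted(pair for pair, p in pair_pvalues.items() if p <= threshold)


# Hypothetical example: 100 genes give 4950 pairwise tests.
n_tests, threshold = bonferroni_pairwise_threshold(100)
hits = significant_pairs(
    {("GENE_A", "GENE_B"): 1e-7, ("GENE_A", "GENE_C"): 1e-3},
    n_genes=100,
)
```

    Testing gene pairs rather than SNP pairs keeps the number of tests, and therefore the severity of the correction, far smaller, which matches the motivation given in the abstract.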

  9. An interactional network of genes involved in chitin synthesis in Saccharomyces cerevisiae.

    PubMed

    Lesage, Guillaume; Shapiro, Jesse; Specht, Charles A; Sdicu, Anne-Marie; Ménard, Patrice; Hussein, Shamiza; Tong, Amy Hin Yan; Boone, Charles; Bussey, Howard

    2005-02-16

    In S. cerevisiae the beta-1,4-linked N-acetylglucosamine polymer, chitin, is synthesized by a family of 3 specialized but interacting chitin synthases encoded by CHS1, CHS2 and CHS3. Chs2p makes chitin in the primary septum, while Chs3p makes chitin in the lateral cell wall and in the bud neck, and can partially compensate for the lack of Chs2p. Chs3p requires a pathway of Bni4p, Chs4p, Chs5p, Chs6p and Chs7p for its localization and activity. Chs1p is thought to have a septum repair function after cell separation. To further explore interactions in the chitin synthase family and to find processes buffering chitin synthesis, we compiled a genetic interaction network of genes showing synthetic interactions with CHS1, CHS3 and genes involved in Chs3p localization and function and made a phenotypic analysis of their mutants. Using deletion mutants in CHS1, CHS3, CHS4, CHS5, CHS6, CHS7 and BNI4 in a synthetic genetic array analysis we assembled a network of 316 interactions among 163 genes. The interaction network with CHS3, CHS4, CHS5, CHS6, CHS7 or BNI4 forms a dense neighborhood, with many genes functioning in cell wall assembly or polarized secretion. Chitin levels were altered in 54 of the mutants in individually deleted genes, indicating a functional relationship between them and chitin synthesis. 32 of these mutants triggered the chitin stress response, with elevated chitin levels and a dependence on CHS3. A large fraction of the CHS1-interaction set was distinct from that of the CHS3 network, indicating broad roles for Chs1p in buffering both Chs2p function and more global cell wall robustness. Based on their interaction patterns and chitin levels we group interacting mutants into functional categories. Genes interacting with CHS3 are involved in the amelioration of cell wall defects and in septum or bud neck chitin synthesis, and we newly assign a number of genes to these functions. Our genetic analysis of genes not interacting with CHS3 indicate expanded

  10. An interactional network of genes involved in chitin synthesis in Saccharomyces cerevisiae

    PubMed Central

    Lesage, Guillaume; Shapiro, Jesse; Specht, Charles A; Sdicu, Anne-Marie; Ménard, Patrice; Hussein, Shamiza; Tong, Amy Hin Yan; Boone, Charles; Bussey, Howard

    2005-01-01

    Background In S. cerevisiae the β-1,4-linked N-acetylglucosamine polymer, chitin, is synthesized by a family of 3 specialized but interacting chitin synthases encoded by CHS1, CHS2 and CHS3. Chs2p makes chitin in the primary septum, while Chs3p makes chitin in the lateral cell wall and in the bud neck, and can partially compensate for the lack of Chs2p. Chs3p requires a pathway of Bni4p, Chs4p, Chs5p, Chs6p and Chs7p for its localization and activity. Chs1p is thought to have a septum repair function after cell separation. To further explore interactions in the chitin synthase family and to find processes buffering chitin synthesis, we compiled a genetic interaction network of genes showing synthetic interactions with CHS1, CHS3 and genes involved in Chs3p localization and function and made a phenotypic analysis of their mutants. Results Using deletion mutants in CHS1, CHS3, CHS4, CHS5, CHS6, CHS7 and BNI4 in a synthetic genetic array analysis we assembled a network of 316 interactions among 163 genes. The interaction network with CHS3, CHS4, CHS5, CHS6, CHS7 or BNI4 forms a dense neighborhood, with many genes functioning in cell wall assembly or polarized secretion. Chitin levels were altered in 54 of the mutants in individually deleted genes, indicating a functional relationship between them and chitin synthesis. 32 of these mutants triggered the chitin stress response, with elevated chitin levels and a dependence on CHS3. A large fraction of the CHS1-interaction set was distinct from that of the CHS3 network, indicating broad roles for Chs1p in buffering both Chs2p function and more global cell wall robustness. Conclusion Based on their interaction patterns and chitin levels we group interacting mutants into functional categories. Genes interacting with CHS3 are involved in the amelioration of cell wall defects and in septum or bud neck chitin synthesis, and we newly assign a number of genes to these functions. Our genetic analysis of genes not interacting with

  11. The heritable basis of gene-environment interactions in cardiometabolic traits.

    PubMed

    Poveda, Alaitz; Chen, Yan; Brändström, Anders; Engberg, Elisabeth; Hallmans, Göran; Johansson, Ingegerd; Renström, Frida; Kurbasic, Azra; Franks, Paul W

    2017-03-01

    Little is known about the heritable basis of gene-environment interactions in humans. We therefore screened multiple cardiometabolic traits to assess the probability that they are influenced by genotype-environment interactions. Fourteen established environmental risk exposures and 11 cardiometabolic traits were analysed in the VIKING study, a cohort of 16,430 Swedish adults from 1682 extended pedigrees with available detailed genealogical, phenotypic and demographic information, using a maximum likelihood variance decomposition method in Sequential Oligogenic Linkage Analysis Routines software. All cardiometabolic traits had statistically significant heritability estimates, with narrow-sense heritabilities (h²) ranging from 24% to 47%. Genotype-environment interactions were detected for age and sex (for the majority of traits), physical activity (for triacylglycerols, 2 h glucose and diastolic BP), smoking (for weight), alcohol intake (for weight, BMI and 2 h glucose) and diet pattern (for weight, BMI, glycaemic traits and systolic BP). Genotype-age interactions for weight and systolic BP, genotype-sex interactions for BMI and triacylglycerols and genotype-alcohol intake interactions for weight remained significant after multiple test correction. Age, sex and alcohol intake are likely to be major modifiers of genetic effects for a range of cardiometabolic traits. This information may prove valuable for studies that seek to identify specific loci that modify the effects of lifestyle in cardiometabolic disease.

  12. What Makes Sports Fans Interactive? Identifying Factors Affecting Chat Interactions in Online Sports Viewing

    PubMed Central

    Yeo, Jaeryong; Lee, Juyeong

    2016-01-01

    Sports fans are able to watch games from many locations using TV services while interacting with other fans online. In this paper, we identify the factors that affect sports viewers’ online interactions. Using a large-scale dataset of more than 25 million chat messages from a popular social TV site for baseball, we extract various game-related factors, and investigate the relationships between these factors and fans’ interactions using a series of multiple regression analyses. As a result, we identify several factors that are significantly related to viewer interactions. In addition, we determine that the influence of these factors varies according to the user group; i.e., active vs. less active users, and loyal vs. non-loyal users. PMID:26849568

  13. An integrative framework for Bayesian variable selection with informative priors for identifying genes and pathways.

    PubMed

    Peng, Bin; Zhu, Dianwen; Ander, Bradley P; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses have difficulty incorporating correlational, structural, or functional structures among the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with 'large p, small n' problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point of view of systems biology, iBVS enables users to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilistic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed.

  14. cDNA-AFLP analysis reveals differential gene expression in compatible interaction of wheat challenged with Puccinia striiformis f. sp. tritici

    PubMed Central

    Wang, Xiaojie; Tang, Chunlei; Zhang, Gang; Li, Yingchun; Wang, Chenfang; Liu, Bo; Qu, Zhipeng; Zhao, Jie; Han, Qingmei; Huang, Lili; Chen, Xianming; Kang, Zhensheng

    2009-01-01

    Background Puccinia striiformis f. sp. tritici is a fungal pathogen causing stripe rust, one of the most important wheat diseases worldwide. The fungus is strictly biotrophic and thus completely dependent on living host cells for its reproduction, which makes it difficult to study genes of the pathogen. In spite of its economic importance, little is known about the molecular basis of the compatible interaction between the pathogen and its wheat host. In this study, we identified wheat and P. striiformis genes associated with the infection process by conducting a large-scale transcriptomic analysis using cDNA-AFLP. Results Of the total 54,912 transcript derived fragments (TDFs) obtained using cDNA-AFLP with 64 primer pairs, 2,306 (4.2%) displayed altered expression patterns after inoculation, of which 966 were up-regulated and 1,340 down-regulated. Of 208 TDFs selected for sequencing, 186 produced reliable sequences, of which 74 (40%) had known functions through BLAST searching the GenBank database. The majority of the latter group had predicted gene products involved in energy (13%), signal transduction (5.4%), disease/defense (5.9%) and metabolism (5% of the sequenced TDFs). BLAST searching of the wheat stem rust fungus genome database identified 18 TDFs possibly from the stripe rust pathogen, of which 9 were validated as being of pathogen origin using PCR-based assays followed by sequencing confirmation. Of the 186 reliable TDFs, 29 homologous to genes known to play a role in disease/defense, signal transduction or uncharacterized genes were further selected for validation of cDNA-AFLP expression patterns using qRT-PCR analyses. Results confirmed the altered expression patterns of 28 (96.5%) genes revealed by the cDNA-AFLP technique. Conclusion The results show that cDNA-AFLP is a reliable technique for studying expression patterns of genes involved in the wheat-stripe rust interactions. 
Genes involved in compatible interactions between wheat and the stripe rust pathogen

  15. Using SCOPE to identify potential regulatory motifs in coregulated genes.

    PubMed

    Martyanov, Viktor; Gross, Robert H

    2011-05-31

    SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data. In this article, we utilize a web version of SCOPE to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs and has been used in other studies. The three algorithms that comprise SCOPE are BEAM, which finds non-degenerate motifs (ACCGGT), PRISM, which finds degenerate motifs (ASCGWT), and SPACER, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well. Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor. Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site, which also includes a "Sample Search" button that allows the user to perform a trial run. SCOPE has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. 
These can be entered in browser text fields, or read from
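
    The three motif types named in the abstract above (non-degenerate ACCGGT, degenerate ASCGWT, bipartite ACCnnnnnnnnGGT) can all be read as IUPAC nucleotide patterns. The following is a minimal sketch of matching such patterns against a sequence with regular expressions. It is not SCOPE's algorithm, which scores motifs by over-representation and position preference; the function names `motif_to_regex` and `find_motif` and the test sequences are hypothetical and chosen only for illustration.

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def motif_to_regex(motif):
    """Compile an IUPAC motif into a regex (hypothetical helper).

    The pattern is wrapped in a lookahead so overlapping hits are reported.
    """
    body = "".join(IUPAC[ch] for ch in motif.upper())
    return re.compile("(?=(" + body + "))")


def find_motif(motif, sequence):
    """Return 0-based start positions of every match of motif in sequence."""
    pattern = motif_to_regex(motif)
    return [m.start() for m in pattern.finditer(sequence.upper())]


# Non-degenerate motif: every position is a fixed base.
print(find_motif("ACCGGT", "TTACCGGTAA"))
# Degenerate motif: S is C or G, W is A or T.
print(find_motif("ASCGWT", "GGAGCGTTCC"))
# Bipartite motif: two fixed halves separated by eight arbitrary bases.
print(find_motif("ACCnnnnnnnnGGT", "ACCAAAAAAAAGGT"))
```

    Matching only locates occurrences; deciding whether a motif is significant in a gene set would need a background model, which this sketch does not attempt.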

  16. [Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

    PubMed

    Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

    2012-07-01

    To screen important genes from large microarray data sets after nerve injury, we combined gene ontology (GO) analysis with computer pattern recognition to find key genes responding to nerve injury, and then verified one of the screened-out genes. Data mining and gene ontology analysis of gene chip data set GSE26350 were carried out in MATLAB. Cd44 was selected from the screened-out key gene molecular spectrum by comparing the genes' GO terms and their positions on the principal component score map. Functional interference was used to disrupt the normal binding of Cd44 to one of its ligands, chondroitin sulfate C (CSC), and neurite extension was observed. Gene ontology analysis showed that the first genes on the score map (marked by red *) were mainly distributed in molecular function GO terms such as molecular transducer activity, receptor activity and protein binding. Cd44 is one of six effector protein genes and drew our attention because of its functional diversity. After different reagents were added to the medium to interfere with the normal binding of CSC and Cd44, varying degrees of relief of CSC's inhibition of neurite extension were observed. CSC can inhibit neurite extension by binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.

  17. A Systems Biology Framework Identifies Molecular Underpinnings of Coronary Heart Disease

    PubMed Central

    Huan, Tianxiao; Zhang, Bin; Wang, Zhi; Joehanes, Roby; Zhu, Jun; Johnson, Andrew D.; Ying, Saixia; Munson, Peter J.; Raghavachari, Nalini; Wang, Richard; Liu, Poching; Courchesne, Paul; Hwang, Shih-Jen; Assimes, Themistocles L.; McPherson, Ruth; Samani, Nilesh J.; Schunkert, Heribert; Meng, Qingying; Suver, Christine; O'Donnell, Christopher J.; Derry, Jonathan; Yang, Xia; Levy, Daniel

    2013-01-01

    Objective Genetic approaches have identified numerous loci associated with coronary heart disease (CHD). The molecular mechanisms underlying CHD gene-disease associations, however, remain unclear. We hypothesized that genetic variants with both strong and subtle effects drive gene subnetworks that in turn affect CHD. Approach and Results We surveyed CHD-associated molecular interactions by constructing coexpression networks using whole blood gene expression profiles from 188 CHD cases and 188 age- and sex-matched controls. 24 coexpression modules were identified including one case-specific and one control-specific differential module (DM). The DMs were enriched for genes involved in B-cell activation, immune response, and ion transport. By integrating the DMs with altered gene expression associated SNPs (eSNPs) and with results of GWAS of CHD and its risk factors, the control-specific DM was implicated as CHD-causal based on its significant enrichment for both CHD and lipid eSNPs. This causal DM was further integrated with tissue-specific Bayesian networks and protein-protein interaction networks to identify regulatory key driver (KD) genes. Multi-tissue KDs (SPIB and TNFRSF13C) and tissue-specific KDs (e.g. EBF1) were identified. Conclusions Our network-driven integrative analysis not only identified CHD-related genes, but also defined network structure that sheds light on the molecular interactions of genes associated with CHD risk. PMID:23539213

  18. [From stone-carved genes to Michelangelo: significance and different aspects of gene-environment interaction].

    PubMed

    Lazary, Judit

    2017-12-01

    Although genetic studies have improved considerably in recent years, their significance is sometimes devalued when they lack clinical relevance. A review of the major milestones of psychogenomics suggests that a breakthrough is just a question of time. Investigations of the direct effect of genetic variants on phenotypes have not yielded positive findings. However, an important step was taken by adopting the gene-environment interaction model. In this model, genetic vulnerability took the place of "stone-carved" pathology. Further progress came when studies of environmental factors were combined with genetic function (epigenetics). This model made it possible to investigate therapeutic interventions as environmental factors, and it was shown that effective treatments exert a modifying effect on gene expression. Moreover, recent developments focus on therapeutic manipulation of gene function (e.g. chemogenetics). Instead of "stone-carved" genes, dynamically interacting gene function has become the basis of psychogenomics, in which correction of gene expression is a potential therapeutic tool. Keeping in mind these trends and developments, there is no doubt that genetics will be a fundamental part of daily clinical routine in the future.

  19. Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks

    PubMed Central

    Dalman, Mark; Haddad, Joseph; Duan, Zhong-Hui

    2017-01-01

    The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show that our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS-Bcl-2 and RAS-AKT, and finds significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways. PMID:29049295

  20. Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.

    PubMed

    Deeter, Anthony; Dalman, Mark; Haddad, Joseph; Duan, Zhong-Hui

    2017-01-01

    The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show that our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS-Bcl-2 and RAS-AKT, and finds significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways.
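
    One simple way to picture a consensus over many inferred networks is to keep only the edges that recur in a sufficient fraction of them. The sketch below assumes that reading; the abstract does not define "network resolution", so treating it as a frequency threshold is an assumption, and the function name `consensus_edges` and the toy edge lists are hypothetical rather than taken from the authors' method.

```python
from collections import Counter


def consensus_edges(networks, threshold):
    """Keep directed edges present in at least `threshold` of the networks.

    networks:  list of edge collections, each edge a (source, target) tuple.
    threshold: fraction in [0, 1]; an ASSUMED stand-in for network resolution.
    """
    # Count each edge at most once per network.
    counts = Counter(edge for net in networks for edge in set(net))
    total = len(networks)
    return {edge for edge, seen in counts.items() if seen / total >= threshold}


# Toy input: three small networks, illustrative only.
toy_networks = [
    {("JAK", "STAT")},
    {("JAK", "STAT"), ("RAS", "AKT")},
    {("JAK", "STAT")},
]

# A strict threshold keeps only the edge found in every network.
print(consensus_edges(toy_networks, 0.5))
# A looser threshold also admits the edge seen in one of three networks.
print(consensus_edges(toy_networks, 0.3))
```

    Raising the threshold trades recall for confidence: fewer edges survive, but each is supported by more of the individual networks.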

  1. Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development

    PubMed Central

    Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.

    2016-01-01

    Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499

  2. Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development.

    PubMed

    Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G

    2016-04-05

    Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.

  3. Integration of mouse and human genome-wide association data identifies KCNIP4 as an asthma gene.

    PubMed

    Himes, Blanca E; Sheppard, Keith; Berndt, Annerose; Leme, Adriana S; Myers, Rachel A; Gignoux, Christopher R; Levin, Albert M; Gauderman, W James; Yang, James J; Mathias, Rasika A; Romieu, Isabelle; Torgerson, Dara G; Roth, Lindsey A; Huntsman, Scott; Eng, Celeste; Klanderman, Barbara; Ziniti, John; Senter-Sylvia, Jody; Szefler, Stanley J; Lemanske, Robert F; Zeiger, Robert S; Strunk, Robert C; Martinez, Fernando D; Boushey, Homer; Chinchilli, Vernon M; Israel, Elliot; Mauger, David; Koppelman, Gerard H; Postma, Dirkje S; Nieuwenhuis, Maartje A E; Vonk, Judith M; Lima, John J; Irvin, Charles G; Peters, Stephen P; Kubo, Michiaki; Tamari, Mayumi; Nakamura, Yusuke; Litonjua, Augusto A; Tantisira, Kelan G; Raby, Benjamin A; Bleecker, Eugene R; Meyers, Deborah A; London, Stephanie J; Barnes, Kathleen C; Gilliland, Frank D; Williams, L Keoki; Burchard, Esteban G; Nicolae, Dan L; Ober, Carole; DeMeo, Dawn L; Silverman, Edwin K; Paigen, Beverly; Churchill, Gary; Shapiro, Steve D; Weiss, Scott T

    2013-01-01

    Asthma is a common chronic respiratory disease characterized by airway hyperresponsiveness (AHR). The genetics of asthma have been widely studied in mouse and human, and homologous genomic regions have been associated with mouse AHR and human asthma-related phenotypes. Our goal was to identify asthma-related genes by integrating AHR associations in mouse with human genome-wide association study (GWAS) data. We used Efficient Mixed Model Association (EMMA) analysis to conduct a GWAS of baseline AHR measures from males and females of 31 mouse strains. Genes near or containing SNPs with EMMA p-values <0.001 were selected for further study in human GWAS. The results of the previously reported EVE consortium asthma GWAS meta-analysis consisting of 12,958 diverse North American subjects from 9 study centers were used to select a subset of homologous genes with evidence of association with asthma in humans. Following validation attempts in three human asthma GWAS (i.e., Sepracor/LOCCS/LODO/Illumina, GABRIEL, DAG) and two human AHR GWAS (i.e., SHARP, DAG), the Kv channel interacting protein 4 (KCNIP4) gene was identified as nominally associated with both asthma and AHR at a gene- and SNP-level. In EVE, the smallest KCNIP4 association was at rs6833065 (P-value 2.9e-04), while the strongest associations for Sepracor/LOCCS/LODO/Illumina, GABRIEL, DAG were 1.5e-03, 1.0e-03, 3.1e-03 at rs7664617, rs4697177, rs4696975, respectively. At a SNP level, the strongest association across all asthma GWAS was at rs4697177 (P-value 1.1e-04). The smallest P-values for association with AHR were 2.3e-03 at rs11947661 in SHARP and 2.1e-03 at rs402802 in DAG. Functional studies are required to validate the potential involvement of KCNIP4 in modulating asthma susceptibility and/or AHR. Our results suggest that a useful approach to identify genes associated with human asthma is to leverage mouse AHR association data.

  4. Novel Myopia Genes and Pathways Identified From Syndromic Forms of Myopia

    PubMed Central

    Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.

    2018-01-01

    Purpose To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia was the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. 
Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1

  5. Interaction between Social/Psychosocial Factors and Genetic Variants on Body Mass Index: A Gene-Environment Interaction Analysis in a Longitudinal Setting.

    PubMed

    Zhao, Wei; Ware, Erin B; He, Zihuai; Kardia, Sharon L R; Faul, Jessica D; Smith, Jennifer A

    2017-09-29

    Obesity, which develops over time, is one of the leading causes of chronic diseases such as cardiovascular disease. However, hundreds of BMI (body mass index)-associated genetic loci identified through large-scale genome-wide association studies (GWAS) only explain about 2.7% of BMI variation. Most common human traits are believed to be influenced by both genetic and environmental factors. Past studies suggest a variety of environmental features that are associated with obesity, including socioeconomic status and psychosocial factors. This study combines both gene/regions and environmental factors to explore whether social/psychosocial factors (childhood and adult socioeconomic status, social support, anger, chronic burden, stressful life events, and depressive symptoms) modify the effect of sets of genetic variants on BMI in European American and African American participants in the Health and Retirement Study (HRS). In order to incorporate longitudinal phenotype data collected in the HRS and investigate entire sets of single nucleotide polymorphisms (SNPs) within gene/region simultaneously, we applied a novel set-based test for gene-environment interaction in longitudinal studies (LGEWIS). Childhood socioeconomic status (parental education) was found to modify the genetic effect in the gene/region around SNP rs9540493 on BMI in European Americans in the HRS. The most significant SNP (rs9540488) by childhood socioeconomic status interaction within the rs9540493 gene/region was suggestively replicated in the Multi-Ethnic Study of Atherosclerosis (MESA) (p = 0.07).

  6. Interaction between Social/Psychosocial Factors and Genetic Variants on Body Mass Index: A Gene-Environment Interaction Analysis in a Longitudinal Setting

    PubMed Central

    Zhao, Wei; He, Zihuai; Kardia, Sharon L. R.; Faul, Jessica D.

    2017-01-01

    Obesity, which develops over time, is one of the leading causes of chronic diseases such as cardiovascular disease. However, hundreds of BMI (body mass index)-associated genetic loci identified through large-scale genome-wide association studies (GWAS) only explain about 2.7% of BMI variation. Most common human traits are believed to be influenced by both genetic and environmental factors. Past studies suggest a variety of environmental features that are associated with obesity, including socioeconomic status and psychosocial factors. This study combines both gene/regions and environmental factors to explore whether social/psychosocial factors (childhood and adult socioeconomic status, social support, anger, chronic burden, stressful life events, and depressive symptoms) modify the effect of sets of genetic variants on BMI in European American and African American participants in the Health and Retirement Study (HRS). In order to incorporate longitudinal phenotype data collected in the HRS and investigate entire sets of single nucleotide polymorphisms (SNPs) within gene/region simultaneously, we applied a novel set-based test for gene-environment interaction in longitudinal studies (LGEWIS). Childhood socioeconomic status (parental education) was found to modify the genetic effect in the gene/region around SNP rs9540493 on BMI in European Americans in the HRS. The most significant SNP (rs9540488) by childhood socioeconomic status interaction within the rs9540493 gene/region was suggestively replicated in the Multi-Ethnic Study of Atherosclerosis (MESA) (p = 0.07). PMID:28961216

  7. Expression profiling identifies novel Hh/Gli regulated genes in developing zebrafish embryos.

    PubMed Central

    Bergeron, Sadie A.; Milla, Luis A.; Villegas, Rosario; Shen, Meng-Chieh; Burgess, Shawn M.; Allende, Miguel L.; Karlstrom, Rolf O.; Palma, Verónica

    2008-01-01

    The Hedgehog (Hh) signaling pathway plays critical instructional roles during embryonic development. Mis-regulation of Hh/Gli signaling is a major causative factor in human congenital disorders and in a variety of cancers. The zebrafish is a powerful genetic model for the study of Hh signaling during embryogenesis, as a large number of mutants have been identified affecting different components of the Hh/Gli signaling system. By performing global profiling of gene expression in different Hh/Gli gain- and loss-of-function scenarios we identified several known (e.g. ptc1 and nkx2.2a) as well as a large number of novel Hh regulated genes that are differentially expressed in embryos with altered Hh/Gli signaling function. By uncovering changes in tissue specific gene expression, we revealed new embryological processes that are influenced by Hh signaling. We thus provide a comprehensive survey of Hh/Gli regulated genes during embryogenesis and we identify new Hh-regulated genes that may be targets of mis-regulation during tumorigenesis. PMID:18055165

  8. On meta- and mega-analyses for gene-environment interactions.

    PubMed

    Huang, Jing; Liu, Yulun; Vitale, Steve; Penning, Trevor M; Whitehead, Alexander S; Blair, Ian A; Vachani, Anil; Clapper, Margie L; Muscat, Joshua E; Lazarus, Philip; Scheet, Paul; Moore, Jason H; Chen, Yong

    2017-12-01

    Gene-by-environment (G × E) interactions are important in explaining the missing heritability and understanding the causation of complex diseases, but a single, moderately sized study often has limited statistical power to detect such interactions. With the increasing need for integrating data and reporting results from multiple collaborative studies or sites, debate over the choice between mega- and meta-analysis continues. In principle, data from different sites can be integrated at the individual level into a "mega" data set, which can be fit by a joint "mega-analysis." Alternatively, analyses can be done at each site, and results across sites can be combined through a "meta-analysis" procedure without integrating individual level data across sites. Although mega-analysis has been advocated in several recent initiatives, meta-analysis has the advantages of simplicity and feasibility, and has recently led to several important findings in identifying main genetic effects. In this paper, we conducted empirical and simulation studies, using data from a G × E study of lung cancer, to compare the mega- and meta-analyses in four commonly used G × E analyses under the scenario that the number of studies is small and sample sizes of individual studies are relatively large. We compared the two data integration approaches in the context of fixed effect models and random effects models separately. Our investigations provide valuable insights into the differences between mega- and meta-analyses in the practice of combining a small number of studies to identify G × E interactions.
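
    The meta-analysis route described above combines per-site results without pooling individual-level data. A minimal sketch of the standard fixed-effect, inverse-variance-weighted combination follows, assuming each site reports an interaction estimate and its standard error. The function name `fixed_effect_meta` and the numbers are hypothetical, and this is a textbook estimator rather than the specific procedures compared in the paper.

```python
import math


def fixed_effect_meta(estimates, std_errors):
    """Pool per-site estimates with inverse-variance weights.

    estimates:  per-site effect estimates (e.g. G x E interaction coefficients).
    std_errors: matching per-site standard errors.
    Returns (pooled_estimate, pooled_standard_error).
    """
    # Each site is weighted by the reciprocal of its sampling variance.
    weights = [1.0 / (se ** 2) for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled, pooled_se


# Two hypothetical sites with equal precision: the pooled value is their mean.
pooled, pooled_se = fixed_effect_meta([0.2, 0.4], [0.1, 0.1])
print(round(pooled, 4), round(pooled_se, 4))
```

    A random effects version would add a between-site variance term to each weight's denominator; that extension is omitted here to keep the sketch short.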

  9. Identifying Stress Transcription Factors Using Gene Expression and TF-Gene Association Data

    PubMed Central

    Wu, Wei-Sheng; Chen, Bor-Sen

    2007-01-01

    Unicellular organisms such as yeasts have evolved to survive environmental stresses by rapidly reorganizing the genomic expression program to meet the challenges of harsh environments. The complex adaptation mechanisms to stress remain to be elucidated. In this study, we developed the Stress Transcription Factor Identification Algorithm (STFIA), which integrates gene expression and TF-gene association data to identify the stress transcription factors (TFs) of six kinds of stresses. We identified some general stress TFs that respond to various stresses, and some specific stress TFs that respond to one specific stress. The biological significance of our findings is validated by the literature. We found that a small number of TFs may be sufficient to control a wide variety of expression patterns in yeast under different stresses. Two implications can be inferred from this observation. First, the adaptation mechanisms to different stresses may have a bow-tie structure. Second, there may exist extensive regulatory cross-talk among different stress responses. In conclusion, this study proposes a network of the regulators of stress responses and their mechanism of action. PMID:20066130

  10. GeneCOST: a novel scoring-based prioritization framework for identifying disease causing genes.

    PubMed

    Ozer, Bugra; Sağıroğlu, Mahmut; Demirci, Hüseyin

    2015-11-15

    Due to the big data produced by next-generation sequencing studies, there is an evident need for methods to extract the valuable information gathered from these experiments. In this work, we propose GeneCOST, a novel scoring-based method to evaluate every gene for its disease association. Without any prior filtering and any prior knowledge, we assign a disease likelihood score to each gene in correspondence with its variations. Then, we rank all genes based on frequency, conservation, pedigree and detailed variation information to find the cause of the disease state. We demonstrate the usage of GeneCOST with public and real life Mendelian disease cases including recessive, dominant, compound heterozygous and sporadic models. As a result, we were able to identify the cause behind the disease state in the top rankings of our list, showing that this novel prioritization framework provides a powerful environment for analysis in genetic disease studies as an alternative to filtering-based approaches. GeneCOST software is freely available at www.igbam.bilgem.tubitak.gov.tr/en/softwares/genecost-en/index.html. Contact: buozer@gmail.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved.

  11. Genome-Wide Gene-Sodium Interaction Analyses on Blood Pressure: The Genetic Epidemiology Network of Salt-Sensitivity Study.

    PubMed

    Li, Changwei; He, Jiang; Chen, Jing; Zhao, Jinying; Gu, Dongfeng; Hixson, James E; Rao, Dabeeru C; Jaquish, Cashell E; Gu, Charles C; Chen, Jichun; Huang, Jianfeng; Chen, Shufeng; Kelly, Tanika N

    2016-08-01

    We performed genome-wide analyses to identify genomic loci that interact with sodium to influence blood pressure (BP) using single-marker-based (1 and 2 df joint tests) and gene-based tests among 1876 Chinese participants of the Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study. Among GenSalt participants, the average of 3 urine samples was used to estimate sodium excretion. Nine BP measurements were taken using a random zero sphygmomanometer. A total of 2.05 million single-nucleotide polymorphisms were imputed using Affymetrix 6.0 genotype data and the Chinese Han of Beijing and Japanese of Tokyo HapMap reference panel. Promising findings (P<1.00×10^-4) from GenSalt were evaluated for replication among 775 Chinese participants of the Multi-Ethnic Study of Atherosclerosis (MESA). Single-nucleotide polymorphism and gene-based results were meta-analyzed across the GenSalt and MESA studies to determine genome-wide significance. The 1 df tests identified interactions for UST rs13211840 on diastolic BP (P=3.13×10^-9). The 2 df tests additionally identified associations for CLGN rs2567241 (P=3.90×10^-12) and LOC105369882 rs11104632 (P=4.51×10^-8) with systolic BP. The CLGN variant rs2567241 was also associated with diastolic BP (P=3.11×10^-22) and mean arterial pressure (P=2.86×10^-15). Genome-wide gene-based analysis identified MKNK1 (P=6.70×10^-7), C2orf80 (P<1.00×10^-12), EPHA6 (P=2.88×10^-7), SCOC-AS1 (P=4.35×10^-14), SCOC (P=6.46×10^-11), CLGN (P=3.68×10^-13), MGAT4D (P=4.73×10^-11), ARHGAP42 (P≤1.00×10^-12), CASP4 (P=1.31×10^-8), and LINC01478 (P=6.75×10^-10) that were associated with at least 1 BP phenotype. In summary, we identified 8 novel and 1 previously reported BP loci through the examination of single-nucleotide polymorphism and gene-based interactions with sodium. © 2016 American Heart Association, Inc.

  12. GeneNetFinder2: Improved Inference of Dynamic Gene Regulatory Relations with Multiple Regulators.

    PubMed

    Han, Kyungsook; Lee, Jeonghoon

    2016-01-01

    A gene involved in complex regulatory interactions may have multiple regulators since gene expression in such interactions is often controlled by more than one gene. Another thing that makes gene regulatory interactions complicated is that regulatory interactions are not static, but change over time during the cell cycle. Most research so far has focused on identifying gene regulatory relations between individual genes in a particular stage of the cell cycle. In this study we developed a method for identifying dynamic gene regulations of several types from the time-series gene expression data. The method can find gene regulations with multiple regulators that work in combination or individually as well as those with single regulators. The method has been implemented as the second version of GeneNetFinder (hereafter called GeneNetFinder2) and tested on several gene expression datasets. Experimental results with gene expression data revealed the existence of genes that are not regulated by individual genes but rather by a combination of several genes. Such gene regulatory relations cannot be found by conventional methods. Our method finds such regulatory relations as well as those with multiple, independent regulators or single regulators, and represents gene regulatory relations as a dynamic network in which different gene regulatory relations are shown in different stages of the cell cycle. GeneNetFinder2 is available at http://bclab.inha.ac.kr/GeneNetFinder and will be useful for modeling dynamic gene regulations with multiple regulators.

  13. Pathway-driven gene stability selection of two rheumatoid arthritis GWAS identifies and validates new susceptibility genes in receptor mediated signalling pathways.

    PubMed

    Eleftherohorinou, Hariklia; Hoggart, Clive J; Wright, Victoria J; Levin, Michael; Coin, Lachlan J M

    2011-09-01

    Rheumatoid arthritis (RA) is the commonest chronic, systemic, inflammatory disorder affecting ∼1% of the world population. It has a strong genetic component and a growing number of associated genes have been discovered in genome-wide association studies (GWAS), which nevertheless only account for 23% of the total genetic risk. We aimed to identify additional susceptibility loci through the analysis of GWAS in the context of biological function. We bridge the gap between pathway and gene-oriented analyses of GWAS, by introducing a pathway-driven gene stability-selection methodology that identifies potential causal genes in the top-associated disease pathways that may be driving the pathway association signals. We analysed the WTCCC and the NARAC studies of ∼5000 and ∼2000 subjects, respectively. We examined 700 pathways comprising ∼8000 genes. Ranking pathways by significance revealed that the NARAC top-ranked ∼6% lay within the top 10% of WTCCC. Gene selection on those pathways identified 58 genes in WTCCC and 61 in NARAC; 21 of those were common (P(overlap) < 10^-21), of which 16 were novel discoveries. Among the identified genes, we validated 10 known RA associations in WTCCC and 13 in NARAC, not discovered using single-SNP approaches on the same data. Gene ontology functional enrichment analysis on the identified genes showed significant over-representation of signalling activity (P < 10^-29) in both studies. Our findings suggest a novel model of RA genetic predisposition, which involves cell-membrane receptors and genes in second messenger signalling systems, in addition to genes that regulate immune responses, which have been the focus of interest previously.

  14. Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms.

    PubMed

    Speiser, Daniel I; Pankey, M Sabrina; Zaharoff, Alexander K; Battelle, Barbara A; Bracken-Grissom, Heather D; Breinholt, Jesse W; Bybee, Seth M; Cronin, Thomas W; Garm, Anders; Lindgren, Annie R; Patel, Nipam H; Porter, Megan L; Protas, Meredith E; Rivera, Ajna S; Serb, Jeanne M; Zigler, Kirk S; Crandall, Keith A; Oakley, Todd H

    2014-11-19

    trees for LIT genes will be a valuable resource for researchers studying the evolution of eyes or other light-interacting structures. We also introduce PIA, a high throughput method for using phylogenetic relationships to identify LIT genes in transcriptomes from non-model organisms. With simple modifications, our methods may be used to search for different sets of genes or to annotate data sets from taxa outside of Metazoa.

  15. Use of RNA-seq to identify cardiac genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy

    PubMed Central

    Friedenberg, Steven G.; Chdid, Lhoucine; Keene, Bruce; Sherry, Barbara; Motsinger-Reif, Alison; Meurs, Kathryn M.

    2017-01-01

    OBJECTIVE: To identify cardiac tissue genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy (DCM). ANIMALS: 8 dogs with and 5 dogs without DCM. PROCEDURES: Following euthanasia, samples of left ventricular myocardium were collected from each dog. Total RNA was extracted from tissue samples, and RNA sequencing was performed on each sample. Samples from dogs with and without DCM were grouped to identify genes that were differentially regulated between the 2 populations. Overrepresentation analysis was performed on upregulated and downregulated gene sets to identify altered molecular pathways in dogs with DCM. RESULTS: Genes involved in cellular energy metabolism, especially metabolism of carbohydrates and fats, were significantly downregulated in dogs with DCM. Expression of cardiac structural proteins was also altered in affected dogs. CONCLUSIONS AND CLINICAL RELEVANCE: Results suggested that RNA sequencing may provide important insights into the pathogenesis of DCM in dogs and highlight pathways that should be explored to identify causative mutations and develop novel therapeutic interventions. PMID:27347821

  16. Use of RNA-seq to identify cardiac genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy.

    PubMed

    Friedenberg, Steven G; Chdid, Lhoucine; Keene, Bruce; Sherry, Barbara; Motsinger-Reif, Alison; Meurs, Kathryn M

    2016-07-01

    OBJECTIVE: To identify cardiac tissue genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy (DCM). ANIMALS: 8 dogs with and 5 dogs without DCM. PROCEDURES: Following euthanasia, samples of left ventricular myocardium were collected from each dog. Total RNA was extracted from tissue samples, and RNA sequencing was performed on each sample. Samples from dogs with and without DCM were grouped to identify genes that were differentially regulated between the 2 populations. Overrepresentation analysis was performed on upregulated and downregulated gene sets to identify altered molecular pathways in dogs with DCM. RESULTS: Genes involved in cellular energy metabolism, especially metabolism of carbohydrates and fats, were significantly downregulated in dogs with DCM. Expression of cardiac structural proteins was also altered in affected dogs. CONCLUSIONS AND CLINICAL RELEVANCE: Results suggested that RNA sequencing may provide important insights into the pathogenesis of DCM in dogs and highlight pathways that should be explored to identify causative mutations and develop novel therapeutic interventions.

  17. Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

    PubMed

    Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

    2012-02-01

    The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild-type copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.

  18. Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes

    PubMed Central

    Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

    2012-01-01

    The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild-type copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404

  19. A general method for identifying major hybrid male sterility genes in Drosophila.

    PubMed

    Zeng, L W; Singh, R S

    1995-10-01

    The genes responsible for hybrid male sterility in species crosses are usually identified by introgressing chromosome segments, monitored by visible markers, between closely related species by continuous backcrosses. This commonly used method, however, suffers from two problems. First, it relies on the availability of markers to monitor the introgressed regions and so the portion of the genome examined is limited to the marked regions. Secondly, the introgressed regions are usually large and it is impossible to tell if the effects of the introgressed regions are the result of single (or few) major genes or many minor genes (polygenes). Here we introduce a simple and general method for identifying putative major hybrid male sterility genes which is free of these problems. In this method, the actual hybrid male sterility genes (rather than markers), or tightly linked gene complexes with large effects, are selectively introgressed from one species into the background of another species by repeated backcrosses. This is performed by selectively backcrossing heterozygous (for hybrid male sterility gene or genes) females producing fertile and sterile sons in roughly equal proportions to males of either parental species. As no marker gene is required for this procedure, this method can be used with any species pairs that produce unisexual sterility. With the application of this method, a small X chromosome region of Drosophila mauritiana which produces complete hybrid male sterility (aspermic testes) in the background of D. simulans was identified. Recombination analysis reveals that this region contains a second major hybrid male sterility gene linked to the forked locus located at either 62.7 +/- 0.66 map units or at the centromere region of the X chromosome of D. mauritiana.

  20. Epidermal growth factor gene is a newly identified candidate gene for gout.

    PubMed

    Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

    2016-08-10

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male gout patients and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67-0.88, Padjusted = 6.42 × 10^-3). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations.
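
    As a reading aid for the odds ratio and confidence interval quoted in this abstract, the sketch below computes an unadjusted allelic odds ratio with a Wald (log-scale) 95% interval from a 2×2 table of allele counts. This is a generic textbook calculation, not the authors' adjusted analysis; the function name is hypothetical and the abstract does not report the underlying counts.

```python
import math


def odds_ratio_ci(case_alt, case_ref, ctrl_alt, ctrl_ref, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    case_alt / case_ref: counts of the tested and reference allele in cases.
    ctrl_alt / ctrl_ref: the same counts in controls.
    All four counts must be positive.
    """
    odds_ratio = (case_alt * ctrl_ref) / (case_ref * ctrl_alt)
    # Standard error of log(OR) from the four cell counts (Woolf's method).
    se_log = math.sqrt(
        1.0 / case_alt + 1.0 / case_ref + 1.0 / ctrl_alt + 1.0 / ctrl_ref
    )
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper
```

    An odds ratio below 1 whose interval excludes 1, as reported for the T allele of rs2298999, indicates that the allele is less frequent in cases than in controls.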

  1. The association of multiple interacting genes with specific phenotypes in rice using gene coexpression networks.

    PubMed

    Ficklin, Stephen P; Luo, Feng; Feltus, F Alex

    2010-09-01

    Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes.
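
    The idea of a coexpression network and its modules can be sketched as follows. The abstract does not name the two network construction methodologies that were combined, so this example swaps in a simple, generic approach: connect two genes when the absolute Pearson correlation of their expression profiles passes a threshold, then report connected components as candidate modules. Gene names, the threshold, and the function names are hypothetical.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant profile carries no coexpression signal
    return cov / math.sqrt(var_x * var_y)


def coexpression_edges(expr, threshold=0.9):
    """Return (gene1, gene2, r) for every pair with |r| >= threshold."""
    edges = []
    for g1, g2 in combinations(sorted(expr), 2):
        r = pearson(expr[g1], expr[g2])
        if abs(r) >= threshold:
            edges.append((g1, g2, r))
    return edges


def modules(edges):
    """Group connected genes into modules (connected components)."""
    neighbours = {}
    for g1, g2, _ in edges:
        neighbours.setdefault(g1, set()).add(g2)
        neighbours.setdefault(g2, set()).add(g1)
    seen, found = set(), []
    for gene in sorted(neighbours):
        if gene in seen:
            continue
        stack, component = [gene], set()
        while stack:
            current = stack.pop()
            if current in component:
                continue
            component.add(current)
            stack.extend(neighbours[current] - component)
        seen |= component
        found.append(component)
    return found
```

    Functional enrichment, the next step mentioned in the abstract, would then test each module's gene list against annotation terms; that step is not shown here.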

  2. The host-pathogen interaction between wheat and yellow rust induces temporally coordinated waves of gene expression.

    PubMed

    Dobon, Albor; Bunting, Daniel C E; Cabrera-Quio, Luis Enrique; Uauy, Cristobal; Saunders, Diane G O

    2016-05-20

    Understanding how plants and pathogens modulate gene expression during the host-pathogen interaction is key to uncovering the molecular mechanisms that regulate disease progression. Recent advances in sequencing technologies have provided new opportunities to decode the complexity of such interactions. In this study, we used an RNA-based sequencing approach (RNA-seq) to assess the global expression profiles of the wheat yellow rust pathogen Puccinia striiformis f. sp. tritici (PST) and its host during infection. We performed a detailed RNA-seq time-course for a susceptible and a resistant wheat host infected with PST. This study (i) defined the global gene expression profiles for PST and its wheat host, (ii) substantially improved the gene models for PST, (iii) evaluated the utility of several programmes for quantification of global gene expression for PST and wheat, and (iv) identified clusters of differentially expressed genes in the host and pathogen. By focusing on components of the defence response in susceptible and resistant hosts, we were able to visualise the effect of PST infection on the expression of various defence components and host immune receptors. Our data showed sequential, temporally coordinated activation and suppression of expression of a suite of immune-response regulators that varied between compatible and incompatible interactions. These findings provide the framework for a better understanding of how PST causes disease and support the idea that PST can suppress the expression of defence components in wheat to successfully colonize a susceptible host.

  3. Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility.

    PubMed

    Bruse, Shannon; Moreau, Michael; Bromberg, Yana; Jang, Jun-Ho; Wang, Nan; Ha, Hongseok; Picchi, Maria; Lin, Yong; Langley, Raymond J; Qualls, Clifford; Klensney-Tait, Julia; Zabner, Joseph; Leng, Shuguang; Mao, Jenny; Belinsky, Steven A; Xing, Jinchuan; Nyunoya, Toru

    2016-01-07

    Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15-20% of smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.

  4. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    PubMed

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    Analysis methods are essential for making sense of the large amounts of gene expression data produced by microarrays. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weaknesses and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
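
    To make the point about user-selectable parameters concrete, here is a minimal sketch of K-means, one of the crisp clustering algorithms named in this abstract. It takes the initial centroids as an explicit argument, because the number of clusters and the starting values are exactly the choices the review says can change the result. The function names and the toy expression profiles in the usage note are hypothetical.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two expression profiles."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(profiles, initial_centroids, max_iter=100):
    """Plain K-means (Lloyd's algorithm).

    profiles: list of equal-length numeric lists, one per gene.
    initial_centroids: starting centroids; their count fixes K.
    Returns (labels, centroids).
    """
    centroids = [list(c) for c in initial_centroids]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each gene joins its nearest centroid.
        new_labels = [
            min(range(len(centroids)),
                key=lambda k: squared_distance(profile, centroids[k]))
            for profile in profiles
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(profiles, labels) if lab == k]
            if members:  # an empty cluster keeps its previous centroid
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids
```

    Calling `kmeans([[1, 1], [1.2, 0.9], [5, 5], [5.1, 4.8]], [[0, 0], [6, 6]])` separates the two low-expression profiles from the two high-expression ones. Rerunning with different starting centroids or a different K is a quick way to see how sensitive a clustering is to those choices.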

  5. Genes Important for Schizosaccharomyces pombe Meiosis Identified Through a Functional Genomics Screen

    PubMed Central

    Blyth, Julie; Makrantoni, Vasso; Barton, Rachael E.; Spanos, Christos; Rappsilber, Juri; Marston, Adele L.

    2018-01-01

    Meiosis is a specialized cell division that generates gametes, such as eggs and sperm. Errors in meiosis result in miscarriages and are the leading cause of birth defects; however, the molecular origins of these defects remain unknown. Studies in model organisms are beginning to identify the genes and pathways important for meiosis, but the parts list is still poorly defined. Here we present a comprehensive catalog of genes important for meiosis in the fission yeast, Schizosaccharomyces pombe. Our genome-wide functional screen surveyed all nonessential genes for roles in chromosome segregation and spore formation. Novel genes important at distinct stages of the meiotic chromosome segregation and differentiation program were identified. Preliminary characterization implicated three of these genes in centrosome/spindle pole body, centromere, and cohesion function. Our findings represent a near-complete parts list of genes important for meiosis in fission yeast, providing a valuable resource to advance our molecular understanding of meiosis. PMID:29259000

  6. Microarray and differential display identify genes involved in jasmonate-dependent anther development.

    PubMed

    Mandaokar, Ajin; Kumar, V Dinesh; Amway, Matt; Browse, John

    2003-07-01

    Jasmonate (JA) is a signaling compound essential for anther development and pollen fertility in Arabidopsis. Mutations that block the pathway of JA synthesis result in male sterility. To understand the processes of anther and pollen maturation, we used microarray and differential display approaches to compare gene expression patterns in anthers of wild-type Arabidopsis and the male-sterile mutant, opr3. Microarray experiments revealed 25 genes that were up-regulated more than 1.8-fold in wild-type anthers as compared to mutant anthers. Experiments based on differential display identified 13 additional genes up-regulated in wild-type anthers compared to opr3 for a total of 38 differentially expressed genes. Searches of the Arabidopsis and non-redundant databases disclosed known or likely functions for 28 of the 38 genes identified, while 10 genes encode proteins of unknown function. Northern blot analysis of eight representative clones as probes confirmed low expression in opr3 anthers compared with wild-type anthers. JA responsiveness of these same genes was also investigated by northern blot analysis of anther RNA isolated from wild-type and opr3 plants. In these experiments, four genes were induced in opr3 anthers within 0.5-1 h of JA treatment while the remaining genes were up-regulated only 1-8 h after JA application. None of these genes was induced by JA in anthers of the coi1 mutant that is deficient in JA responsiveness. The four early-induced genes in opr3 encode lipoxygenase, a putative bHLH transcription factor, epithiospecifier protein and an unknown protein. We propose that these and other early components may be involved in JA signaling and in the initiation of developmental processes. The four late genes encode an extensin-like protein, a peptide transporter and two unknown proteins, which may represent components required later in anther and pollen maturation. Transcript profiling has provided a successful approach to identify genes involved in

  7. Lentiviral vector-based insertional mutagenesis identifies genes associated with liver cancer

    PubMed Central

    Ranzani, Marco; Cesana, Daniela; Bartholomae, Cynthia C.; Sanvito, Francesca; Pala, Mauro; Benedicenti, Fabrizio; Gallina, Pierangela; Sergi, Lucia Sergi; Merella, Stefania; Bulfone, Alessandro; Doglioni, Claudio; von Kalle, Christof; Kim, Yoon Jun; Schmidt, Manfred; Tonon, Giovanni; Naldini, Luigi; Montini, Eugenio

    2013-01-01

    Transposons and γ-retroviruses have been efficiently used as insertional mutagens in different tissues to identify molecular culprits of cancer. However, these systems are characterized by recurring integrations that accumulate in tumor cells, hampering the identification of early cancer-driving events amongst bystander and progression-related events. We developed an insertional mutagenesis platform based on lentiviral vectors (LVV) by which we could efficiently induce hepatocellular carcinoma (HCC) in 3 different mouse models. By virtue of LVV’s replication-deficient nature and broad genome-wide integration pattern, LVV-based insertional mutagenesis allowed identification of 4 new liver cancer genes from a limited number of integrations. We validated the oncogenic potential of all the identified genes in vivo, with different levels of penetrance. Our newly identified cancer genes are likely to play a role in human disease, since they are upregulated and/or amplified/deleted in human HCCs and can predict clinical outcome of patients. PMID:23314173

  8. A novel approach to exploring potential interactions among single-nucleotide polymorphisms of inflammation genes in gliomagenesis: an exploratory case-only study.

    PubMed

    Amirian, E Susan; Scheurer, Michael E; Liu, Yanhong; D'Amelio, Anthony M; Houlston, Richard S; Etzel, Carol J; Shete, Sanjay; Swerdlow, Anthony J; Schoemaker, Minouk J; McKinney, Patricia A; Fleming, Sarah J; Muir, Kenneth R; Lophatananon, Artitaya; Bondy, Melissa L

    2011-08-01

    Despite extensive research on the topic, glioma etiology remains largely unknown. Exploration of potential interactions between single-nucleotide polymorphisms (SNP) of immune genes is a promising new area of glioma research. The case-only study design is a powerful and efficient design for exploring possible multiplicative interactions between factors that are independent of one another. The purpose of our study was to use this exploratory design to identify potential pairwise SNP-SNP interactions from genes involved in several different immune-related pathways for investigation in future studies. The study population consisted of two case groups: 1,224 histologically confirmed, non-Hispanic white glioma cases from the United States and a validation population of 634 glioma cases from the United Kingdom. Polytomous logistic regression, in which one SNP was coded as the outcome and the other SNP was included as the exposure, was utilized to calculate the ORs of the likelihood of cases simultaneously having the variant alleles of two different SNPs. Potential interactions were examined only between SNPs located in different genes or chromosomes. Using this data mining strategy, we found 396 significant SNP-SNP interactions among polymorphisms of immune-related genes that were present in both the U.S. and U.K. study populations. This exploratory study was conducted for the purpose of hypothesis generation, and thus has provided several new hypotheses that can be tested using traditional case-control study designs to obtain estimates of risk. This is the first study, to our knowledge, to take this novel approach to identifying SNP-SNP interactions relevant to glioma etiology. ©2011 AACR.
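
    The case-only idea described in this abstract can be illustrated with a minimal sketch. It is not the authors' polytomous logistic regression: it assumes each SNP is collapsed to carrier versus non-carrier, so that the odds ratio between the two SNPs among cases serves as a rough estimate of multiplicative interaction (valid only if the SNPs are independent in the source population). The function name, its parameters, and the counts in the usage note are hypothetical and chosen purely for illustration.

```python
import math


def case_only_interaction_or(both, first_only, second_only, neither, z=1.96):
    """Odds ratio between two binary factors measured in cases only.

    The four arguments are counts of cases carrying both variants, only the
    first, only the second, or neither (hypothetical 2x2 layout). Returns the
    odds ratio with a Woolf (log-scale) confidence interval; z=1.96 gives an
    approximate 95% interval.
    """
    odds_ratio = (both * neither) / (first_only * second_only)
    # Standard error of log(OR) from the four cell counts (Woolf's method).
    se_log_or = math.sqrt(1 / both + 1 / first_only + 1 / second_only + 1 / neither)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper
```

    For example, `case_only_interaction_or(40, 60, 30, 90)` uses made-up counts, not data from the study; an interval that excludes 1 would flag the SNP pair as a candidate interaction for follow-up in a case-control design.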

  9. Identifying candidate genes for Type 2 Diabetes Mellitus and obesity through gene expression profiling in multiple tissues or cells.

    PubMed

    Chen, Junhui; Meng, Yuhuan; Zhou, Jinghui; Zhuo, Min; Ling, Fei; Zhang, Yu; Du, Hongli; Wang, Xiaoning

    2013-01-01

    Type 2 Diabetes Mellitus (T2DM) and obesity have become increasingly prevalent in recent years. Recent studies have focused on identifying causal variations or candidate genes for obesity and T2DM via analysis of expression quantitative trait loci (eQTL) within a single tissue. T2DM and obesity are affected by comprehensive sets of genes in multiple tissues. In the current study, gene expression levels in multiple human tissues from GEO datasets were analyzed, and 21 candidate genes displaying high percentages of differential expression were filtered out. Specifically, DENND1B, LYN, MRPL30, POC1B, PRKCB, RP4-655J12.3, HIBADH, and TMBIM4 were identified from the T2DM-control study, and BCAT1, BMP2K, CSRNP2, MYNN, NCKAP5L, SAP30BP, SLC35B4, SP1, BAP1, GRB14, HSP90AB1, ITGA5, and TOMM5 were identified from the obesity-control study. The majority of these genes are known to be involved in T2DM and obesity. Therefore, analysis of gene expression in various tissues using GEO datasets may be an effective and feasible method to determine novel or causal genes associated with T2DM and obesity.

  10. Insulators form gene loops by interacting with promoters in Drosophila.

    PubMed

    Erokhin, Maksim; Davydova, Anna; Kyrchanova, Olga; Parshikov, Alexander; Georgiev, Pavel; Chetverina, Darya

    2011-09-01

    Chromatin insulators are regulatory elements involved in the modulation of enhancer-promoter communication. The 1A2 and Wari insulators are located immediately downstream of the Drosophila yellow and white genes, respectively. Using an assay based on the yeast GAL4 activator, we have found that both insulators are able to interact with their target promoters in transgenic lines, forming gene loops. The existence of an insulator-promoter loop is confirmed by the fact that insulator proteins could be detected on the promoter only in the presence of an insulator in the transgene. The upstream promoter regions, which are required for long-distance stimulation by enhancers, are not essential for promoter-insulator interactions. Both insulators support basal activity of the yellow and white promoters in eyes. Thus, the ability of insulators to interact with promoters might play an important role in the regulation of basal gene transcription.

  11. Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments.

    PubMed

    Duan, Fenghai; Xu, Ye

    2017-01-01

    To analyze a microarray experiment to identify the genes with expressions varying after the diagnosis of breast cancer. A total of 44 928 probe sets in an Affymetrix microarray dataset publicly available on Gene Expression Omnibus from 249 patients with breast cancer were analyzed by the nonparametric multivariate adaptive splines. Then, the identified genes with turning points were grouped by K-means clustering, and their network relationship was subsequently analyzed by the Ingenuity Pathway Analysis. In total, 1640 probe sets (genes) were reliably identified to have turning points along with the age at diagnosis in their expression profiling, of which 927 expressed lower after turning points and 713 expressed higher after the turning points. K-means clustered them into 3 groups with turning points centering at 54, 62.5, and 72, respectively. The pathway analysis showed that the identified genes were actively involved in various cancer-related functions or networks. In this article, we applied the nonparametric multivariate adaptive splines method to a publicly available gene expression dataset and successfully identified genes with expressions varying before and after breast cancer diagnosis.
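
    The clustering step mentioned in this abstract, grouping turning-point ages with K-means, can be sketched as follows. This is a plain one-dimensional Lloyd's algorithm written for illustration, not the authors' software or settings; the function name, the starting centers, and the ages in the usage note are hypothetical.

```python
def kmeans_1d(values, initial_centers, max_iter=100):
    """Cluster one-dimensional values (e.g. turning-point ages) with Lloyd's algorithm.

    initial_centers is a hypothetical starting guess, one value per cluster.
    Returns the final cluster centers and the cluster index of each value.
    """
    centers = list(initial_centers)
    for _ in range(max_iter):
        # Assignment step: collect each value under its nearest center.
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda k: abs(v - centers[k]))
            groups[nearest].append(v)
        # Update step: move each center to the mean of its group
        # (an empty group keeps its previous center).
        new_centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
        if new_centers == centers:
            break
        centers = new_centers
    labels = [min(range(len(centers)), key=lambda k: abs(v - centers[k])) for v in values]
    return centers, labels
```

    Called on made-up ages such as `kmeans_1d([52, 54, 55, 56, 61, 62, 63, 64, 70, 72, 73, 74], [50, 60, 75])`, the function returns three centers and a label per age. K-means results depend on the starting centers, so in practice several initializations are usually compared.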

  12. 4C-ker: A Method to Reproducibly Identify Genome-Wide Interactions Captured by 4C-Seq Experiments.

    PubMed

    Raviram, Ramya; Rocha, Pedro P; Müller, Christian L; Miraldi, Emily R; Badri, Sana; Fu, Yi; Swanzey, Emily; Proudhon, Charlotte; Snetkova, Valentina; Bonneau, Richard; Skok, Jane A

    2016-03-01

    4C-Seq has proven to be a powerful technique to identify genome-wide interactions with a single locus of interest (or "bait") that can be important for gene regulation. However, analysis of 4C-Seq data is complicated by the many biases inherent to the technique. An important consideration when dealing with 4C-Seq data is the differences in resolution of signal across the genome that result from differences in 3D distance separation from the bait. This leads to the highest signal in the region immediately surrounding the bait and increasingly lower signals in far-cis and trans. Another important aspect of 4C-Seq experiments is the resolution, which is greatly influenced by the choice of restriction enzyme and the frequency at which it can cut the genome. Thus, it is important that a 4C-Seq analysis method is flexible enough to analyze data generated using different enzymes and to identify interactions across the entire genome. Current methods for 4C-Seq analysis only identify interactions in regions near the bait or in regions located in far-cis and trans, but no method comprehensively analyzes 4C signals of different length scales. In addition, some methods also fail in experiments where chromatin fragments are generated using frequent cutter restriction enzymes. Here, we describe 4C-ker, a Hidden-Markov Model based pipeline that identifies regions throughout the genome that interact with the 4C bait locus. In addition, we incorporate methods for the identification of differential interactions in multiple 4C-seq datasets collected from different genotypes or experimental conditions. Adaptive window sizes are used to correct for differences in signal coverage in near-bait regions, far-cis and trans chromosomes. Using several datasets, we demonstrate that 4C-ker outperforms all existing 4C-Seq pipelines in its ability to reproducibly identify interaction domains at all genomic ranges with different resolution enzymes.

  13. 4C-ker: A Method to Reproducibly Identify Genome-Wide Interactions Captured by 4C-Seq Experiments

    PubMed Central

    Raviram, Ramya; Rocha, Pedro P.; Müller, Christian L.; Miraldi, Emily R.; Badri, Sana; Fu, Yi; Swanzey, Emily; Proudhon, Charlotte; Snetkova, Valentina

    2016-01-01

    4C-Seq has proven to be a powerful technique to identify genome-wide interactions with a single locus of interest (or “bait”) that can be important for gene regulation. However, analysis of 4C-Seq data is complicated by the many biases inherent to the technique. An important consideration when dealing with 4C-Seq data is the differences in resolution of signal across the genome that result from differences in 3D distance separation from the bait. This leads to the highest signal in the region immediately surrounding the bait and increasingly lower signals in far-cis and trans. Another important aspect of 4C-Seq experiments is the resolution, which is greatly influenced by the choice of restriction enzyme and the frequency at which it can cut the genome. Thus, it is important that a 4C-Seq analysis method is flexible enough to analyze data generated using different enzymes and to identify interactions across the entire genome. Current methods for 4C-Seq analysis only identify interactions in regions near the bait or in regions located in far-cis and trans, but no method comprehensively analyzes 4C signals of different length scales. In addition, some methods also fail in experiments where chromatin fragments are generated using frequent cutter restriction enzymes. Here, we describe 4C-ker, a Hidden-Markov Model based pipeline that identifies regions throughout the genome that interact with the 4C bait locus. In addition, we incorporate methods for the identification of differential interactions in multiple 4C-seq datasets collected from different genotypes or experimental conditions. Adaptive window sizes are used to correct for differences in signal coverage in near-bait regions, far-cis and trans chromosomes. Using several datasets, we demonstrate that 4C-ker outperforms all existing 4C-Seq pipelines in its ability to reproducibly identify interaction domains at all genomic ranges with different resolution enzymes. PMID:26938081

  14. Pollen Killer Gene S35 Function Requires Interaction with an Activator That Maps Close to S24, Another Pollen Killer Gene in Rice.

    PubMed

    Kubo, Takahiko; Yoshimura, Atsushi; Kurata, Nori

    2016-05-03

    Pollen killer genes disable noncarrier pollens, and are responsible for male sterility and segregation distortion in hybrid populations of distantly related plant species. The genetic networks and the molecular mechanisms underlying the pollen killer system remain largely unknown. Two pollen killer genes, S24 and S35, have been found in an intersubspecific cross of Oryza sativa ssp. indica and japonica. The effect of S24 is counteracted by an unlinked locus EFS. Additionally, S35 has been proposed to interact with S24 to induce pollen sterility. These genetic interactions are suggestive of a single S24-centric genetic pathway (EFS-S24-S35) for the pollen killer system. To examine this hypothetical genetic pathway, the S35 and the S24 regions were further characterized and genetically dissected in this study. Our results indicated that S35 causes pollen sterility independently of both the EFS and S24 genes, but is dependent on a novel gene close to the S24 locus, named incentive for killing pollen (INK). We confirmed the phenotypic effect of the INK gene separately from the S24 gene, and identified the INK locus within an interval of less than 0.6 Mb on rice chromosome 5. This study characterized the genetic effect of the two independent genetic pathways of INK-S35 and EFS-S24 in indica-japonica hybrid progeny. Our results provide clear evidence that hybrid male sterility in rice is caused by several pollen killer networks with multiple factors positively and negatively regulating pollen killer genes. Copyright © 2016 Kubo et al.

  15. Pollen Killer Gene S35 Function Requires Interaction with an Activator That Maps Close to S24, Another Pollen Killer Gene in Rice

    PubMed Central

    Kubo, Takahiko; Yoshimura, Atsushi; Kurata, Nori

    2016-01-01

    Pollen killer genes disable noncarrier pollens, and are responsible for male sterility and segregation distortion in hybrid populations of distantly related plant species. The genetic networks and the molecular mechanisms underlying the pollen killer system remain largely unknown. Two pollen killer genes, S24 and S35, have been found in an intersubspecific cross of Oryza sativa ssp. indica and japonica. The effect of S24 is counteracted by an unlinked locus EFS. Additionally, S35 has been proposed to interact with S24 to induce pollen sterility. These genetic interactions are suggestive of a single S24-centric genetic pathway (EFS–S24–S35) for the pollen killer system. To examine this hypothetical genetic pathway, the S35 and the S24 regions were further characterized and genetically dissected in this study. Our results indicated that S35 causes pollen sterility independently of both the EFS and S24 genes, but is dependent on a novel gene close to the S24 locus, named incentive for killing pollen (INK). We confirmed the phenotypic effect of the INK gene separately from the S24 gene, and identified the INK locus within an interval of less than 0.6 Mb on rice chromosome 5. This study characterized the genetic effect of the two independent genetic pathways of INK–S35 and EFS–S24 in indica–japonica hybrid progeny. Our results provide clear evidence that hybrid male sterility in rice is caused by several pollen killer networks with multiple factors positively and negatively regulating pollen killer genes. PMID:27172610

  16. Optimization of neural network architecture using genetic programming improves detection and modeling of gene-gene interactions in studies of human diseases

    PubMed Central

    Ritchie, Marylyn D; White, Bill C; Parker, Joel S; Hahn, Lance W; Moore, Jason H

    2003-01-01

    Background: Appropriate definition of neural network architecture prior to data analysis is crucial for successful data mining. This can be challenging when the underlying model of the data is unknown. The goal of this study was to determine whether optimizing neural network architecture using genetic programming as a machine learning strategy would improve the ability of neural networks to model and detect nonlinear interactions among genes in studies of common human diseases. Results: Using simulated data, we show that a genetic programming optimized neural network approach is able to model gene-gene interactions as well as a traditional back propagation neural network. Furthermore, the genetic programming optimized neural network is better than the traditional back propagation neural network approach in terms of predictive ability and power to detect gene-gene interactions when non-functional polymorphisms are present. Conclusion: This study suggests that a machine learning strategy for optimizing neural network architecture may be preferable to traditional trial-and-error approaches for the identification and characterization of gene-gene interactions in common, complex human diseases. PMID:12846935

  17. An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways

    PubMed Central

    Ander, Bradley P.; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R.; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses have difficulty incorporating correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point of view of systems biology, iBVS enables the user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilistic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. PMID:23844055

  18. Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability

    PubMed Central

    Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S

    2017-01-01

    Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1–3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal–parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID. PMID:27457812

  19. Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability.

    PubMed

    Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S

    2017-11-01

    Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1-3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal-parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID.

  20. Gene-by-Psychosocial Factor Interactions Influence Diastolic Blood Pressure in European and African Ancestry Populations: Meta-Analysis of Four Cohort Studies.

    PubMed

    Smith, Jennifer A; Zhao, Wei; Yasutake, Kalyn; August, Carmella; Ratliff, Scott M; Faul, Jessica D; Boerwinkle, Eric; Chakravarti, Aravinda; Diez Roux, Ana V; Gao, Yan; Griswold, Michael E; Heiss, Gerardo; Kardia, Sharon L R; Morrison, Alanna C; Musani, Solomon K; Mwasongwe, Stanford; North, Kari E; Rose, Kathryn M; Sims, Mario; Sun, Yan V; Weir, David R; Needham, Belinda L

    2017-12-18

    Inter-individual variability in blood pressure (BP) is influenced by both genetic and non-genetic factors including socioeconomic and psychosocial stressors. A deeper understanding of the gene-by-socioeconomic/psychosocial factor interactions on BP may help to identify individuals that are genetically susceptible to high BP in specific social contexts. In this study, we used a genomic region-based method for longitudinal analysis, Longitudinal Gene-Environment-Wide Interaction Studies (LGEWIS), to evaluate the effects of interactions between known socioeconomic/psychosocial and genetic risk factors on systolic and diastolic BP in four large epidemiologic cohorts of European and/or African ancestry. After correction for multiple testing, two interactions were significantly associated with diastolic BP. In European ancestry participants, outward/trait anger score had a significant interaction with the C10orf107 genomic region (p = 0.0019). In African ancestry participants, depressive symptom score had a significant interaction with the HFE genomic region (p = 0.0048). This study provides a foundation for using genomic region-based longitudinal analysis to identify subgroups of the population that may be at greater risk of elevated BP due to the combined influence of genetic and socioeconomic/psychosocial risk factors.

  1. Gene-environment studies: any advantage over environmental studies?

    PubMed

    Bermejo, Justo Lorenzo; Hemminki, Kari

    2007-07-01

    Gene-environment studies have been motivated by the likely existence of prevalent low-risk genes that interact with common environmental exposures. The present study assessed the statistical advantage of the simultaneous consideration of genes and environment to investigate the effect of environmental risk factors on disease. In particular, we contemplated the possibility that several genes modulate the environmental effect. Environmental exposures, genotypes and phenotypes were simulated according to a wide range of parameter settings. Different models of gene-gene-environment interaction were considered. For each parameter combination, we estimated the probability of detecting the main environmental effect, the power to identify the gene-environment interaction and the frequency of environmentally affected individuals at which environmental and gene-environment studies show the same statistical power. The proportion of cases in the population attributable to the modeled risk factors was also calculated. Our data indicate that environmental exposures with weak effects may account for a significant proportion of the population prevalence of the disease. A general result was that, if the environmental effect was restricted to rare genotypes, the power to detect the gene-environment interaction was higher than the power to identify the main environmental effect. In other words, when few individuals contribute to the overall environmental effect, individual contributions are large and result in easily identifiable gene-environment interactions. Moreover, when multiple genes interacted with the environment, the statistical benefit of gene-environment studies was limited to those studies that included major contributors to the gene-environment interaction. The advantage of gene-environment over plain environmental studies also depends on the inheritance mode of the involved genes, on the study design and, to some extent, on the disease prevalence.

  2. Computational gene network study on antibiotic resistance genes of Acinetobacter baumannii.

    PubMed

    Anitha, P; Anbarasu, Anand; Ramaiah, Sudha

    2014-05-01

    Multi Drug Resistance (MDR) in Acinetobacter baumannii is one of the major threats for emerging nosocomial infections in hospital environment. Multidrug-resistance in A. baumannii may be due to the implementation of multi-combination resistance mechanisms such as β-lactamase synthesis, Penicillin-Binding Proteins (PBPs) changes, alteration in porin proteins and in efflux pumps against various existing classes of antibiotics. Multiple antibiotic resistance genes are involved in MDR. These resistance genes are transferred through plasmids, which are responsible for the dissemination of antibiotic resistance among Acinetobacter spp. In addition, these resistance genes may also have a tendency to interact with each other or with their gene products. Therefore, it becomes necessary to understand the impact of these interactions in antibiotic resistance mechanism. Hence, our study focuses on protein and gene network analysis on various resistance genes, to elucidate the role of the interacting proteins and to study their functional contribution towards antibiotic resistance. From the search tool for the retrieval of interacting gene/protein (STRING), a total of 168 functional partners for 15 resistance genes were extracted based on the confidence scoring system. The network study was then followed up with functional clustering of associated partners using molecular complex detection (MCODE). Later, we selected eight efficient clusters based on score. Interestingly, the associated protein we identified from the network possessed greater functional similarity with known resistance genes. This network-based approach on resistance genes of A. baumannii could help in identifying new genes/proteins and provide clues on their association in antibiotic resistance. Copyright © 2014 Elsevier Ltd. All rights reserved.

  3. Epidermal growth factor gene is a newly identified candidate gene for gout

    PubMed Central

    Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

    2016-01-01

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male gout patients and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67–0.88, Padjusted = 6.42 × 10^−3). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations. PMID:27506295

  4. Combining suppressive subtractive hybridization and cDNA microarrays to identify dietary phosphorus-responsive genes of the rainbow trout (Oncorhynchus mykiss) kidney.

    PubMed

    Lake, Jennifer; Gravel, Catherine; Koko, Gabriel Koffi D; Robert, Claude; Vandenberg, Grant W

    2010-03-01

    Phosphorus (P)-responsive genes and how they regulate renal adaptation to phosphorus-deficient diets in animals, including fish, are not well understood. RNA abundance profiling using cDNA microarrays is an efficient approach to study nutrient-gene interactions and identify these dietary P-responsive genes. To test the hypothesis that dietary P-responsive genes are differentially expressed in fish fed varying P levels, rainbow trout were fed a practical high-P diet (R20: 0.96% P) or a low-P diet (R0: 0.38% P) for 7 weeks. The differentially-expressed genes between dietary groups were identified and compared from the kidney by combining suppressive subtractive hybridization (SSH) with cDNA microarray analysis. A number of genes were confirmed by real-time PCR, and correlated with plasma and bone P concentrations. Approximately 54 genes were identified as potential dietary P-responsive after 7 weeks on a diet deficient in P according to cDNA microarray analysis. Of 18 selected genes, 13 genes were confirmed to be P-responsive at 7 weeks by real-time PCR analysis, including: iNOS, cytochrome b, cytochrome c oxidase subunit II, alpha-globin I, beta-globin, ATP synthase, hyperosmotic protein 21, COL1A3, Nkef, NDPK, glucose phosphate isomerase 1, Na+/H+ exchange protein and GDP dissociation inhibitor 2. Many of these dietary P-responsive genes responded in a moderate way (R0/R20 ratio: <2-3 or >0.5) and in a transient manner to dietary P limitation. In summary, renal adaptation to dietary P deficiency in trout involves changes in the expression of several genes, suggesting a profile of metabolic stress, since many of these differentially-expressed candidates are associated with the cellular adaptive responses. Crown Copyright 2009. Published by Elsevier Inc. All rights reserved.

  5. The Proteomic Profile of Deleted in Breast Cancer 1 (DBC1) Interactions Points to a Multifaceted Regulation of Gene Expression*

    PubMed Central

    Giguère, Sophie S. B.; Guise, Amanda J.; Jean Beltran, Pierre M.; Joshi, Preeti M.; Greco, Todd M.; Quach, Olivia L.; Kong, Jeffery; Cristea, Ileana M.

    2016-01-01

    Deleted in breast cancer 1 (DBC1) has emerged as an important regulator of multiple cellular processes, ranging from gene expression to cell cycle progression. DBC1 has been linked to tumorigenesis both as an inhibitor of histone deacetylases, HDAC3 and sirtuin 1, and as a transcriptional cofactor for nuclear hormone receptors. However, despite mounting interest in DBC1, relatively little is known about the range of its interacting partners and the scope of its functions. Here, we carried out a functional proteomics-based investigation of DBC1 interactions in two relevant cell types, T cells and kidney cells. Microscopy, molecular biology, biochemistry, and mass spectrometry studies allowed us to assess DBC1 mRNA and protein levels, localization, phosphorylation status, and protein interaction networks. The comparison of DBC1 interactions in these cell types revealed conserved regulatory roles for DBC1 in gene expression, chromatin organization and modification, and cell cycle progression. Interestingly, we observe previously unrecognized DBC1 interactions with proteins encoded by cancer-associated genes. Among these interactions are five components of the SWI/SNF complex, the most frequently mutated chromatin remodeling complex in human cancers. Additionally, we identified a DBC1 interaction with TBL1XR1, a component of the NCoR complex, which we validated by reciprocal isolation. Strikingly, we discovered that DBC1 associates with proteins that regulate the circadian cycle, including DDX5, DHX9, and SFPQ. We validated this interaction by colocalization and reciprocal isolation. Functional assessment of this association demonstrated that DBC1 protein levels are important for regulating CLOCK and BMAL1 protein oscillations in synchronized T cells. Our results suggest that DBC1 is integral to the maintenance of the circadian molecular clock. Furthermore, the identified interactions provide a valuable resource for the exploration of pathways involved in DBC1

  6. Combining gene expression and genetic analyses to identify candidate genes involved in cold responses in pea.

    PubMed

    Legrand, Sylvain; Marque, Gilles; Blassiau, Christelle; Bluteau, Aurélie; Canoy, Anne-Sophie; Fontaine, Véronique; Jaminon, Odile; Bahrman, Nasser; Mautord, Julie; Morin, Julie; Petit, Aurélie; Baranger, Alain; Rivière, Nathalie; Wilmer, Jeroen; Delbreil, Bruno; Lejeune-Hénaut, Isabelle

    2013-09-01

    Cold stress affects plant growth and development. In order to better understand the responses to cold (chilling or freezing tolerance), we used two contrasted pea lines. Following a chilling period, the Champagne line becomes tolerant to frost whereas the Terese line remains sensitive. Four suppression subtractive hybridisation libraries were obtained using mRNAs isolated from pea genotypes Champagne and Terese. Using quantitative polymerase chain reaction (qPCR) performed on 159 genes, 43 and 54 genes were identified as differentially expressed at the initial time point and during the time course study, respectively. Molecular markers were developed from the differentially expressed genes and were genotyped on a population of 164 RILs derived from a cross between Champagne and Terese. We identified 5 candidate genes colocalizing with 3 different frost damage quantitative trait loci (QTL) intervals and a protein quantity locus (PQL) rich region previously reported. This investigation revealed the role of constitutive differences between both genotypes in the cold responses, in particular with genes related to glycine degradation pathway that could confer to Champagne a better frost tolerance. We showed that freezing tolerance involves a decrease of expression of genes related to photosynthesis and the expression of a gene involved in the production of cysteine and methionine that could act as cryoprotectant molecules. Although it remains to be confirmed, this study could also reveal the involvement of the jasmonate pathway in the cold responses, since we observed that two genes related to this pathway were mapped in a frost damage QTL interval and in a PQL rich region interval, respectively. Copyright © 2013 Elsevier GmbH. All rights reserved.

  7. GeneWiz browser: An Interactive Tool for Visualizing Sequenced Chromosomes.

    PubMed

    Hallin, Peter F; Stærfeldt, Hans-Henrik; Rotenberg, Eva; Binnewies, Tim T; Benham, Craig J; Ussery, David W

    2009-09-25

    We present an interactive web application for visualizing genomic data of prokaryotic chromosomes. The tool (GeneWiz browser) allows users to carry out various analyses such as mapping alignments of homologous genes to other genomes, mapping of short sequencing reads to a reference chromosome, and calculating DNA properties such as curvature or stacking energy along the chromosome. The GeneWiz browser produces an interactive graphic that enables zooming from a global scale down to single nucleotides, without changing the size of the plot. Its ability to disproportionally zoom provides optimal readability and increased functionality compared to other browsers. The tool allows the user to select the display of various genomic features, color setting and data ranges. Custom numerical data can be added to the plot allowing, for example, visualization of gene expression and regulation data. Further, standard atlases are pre-generated for all prokaryotic genomes available in GenBank, providing a fast overview of all available genomes, including recently deposited genome sequences. The tool is available online from http://www.cbs.dtu.dk/services/gwBrowser. Supplemental material including interactive atlases is available online at http://www.cbs.dtu.dk/services/gwBrowser/suppl/.

  8. Identifying and quantifying interactions in a laboratory swarm

    NASA Astrophysics Data System (ADS)

    Puckett, James; Kelley, Douglas; Ouellette, Nicholas

    2013-03-01

    Emergent collective behavior, such as in flocks of birds or swarms of bees, is exhibited throughout the animal kingdom. Many models have been developed to describe swarming and flocking behavior using systems of self-propelled particles obeying simple rules or interacting via various potentials. However, due to experimental difficulties and constraints, little empirical data exists for characterizing the exact form of the biological interactions. We study laboratory swarms of flying Chironomus riparius midges, using stereoimaging and particle tracking techniques to record three-dimensional trajectories for all the individuals in the swarm. We describe methods to identify and quantify interactions by examining these trajectories, and report results on interaction magnitude, frequency, and mutuality.

  9. Measured Gene-by-Environment Interaction in Relation to Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Nigg, Joel; Nikolas, Molly; Burt, S. Alexandra

    2010-01-01

    Objective: To summarize and evaluate the state of knowledge regarding the role of measured gene-by-environment interactions in relation to attention-deficit/hyperactivity disorder. Method: A selective review of methodologic issues was followed by a systematic search for relevant articles on measured gene-by-environment interactions; the search…

  10. ToxCast Data Expands Universe of Chemical-Gene Interactions (SOT)

    EPA Science Inventory

    Characterizing the effects of chemicals in biological systems is often summarized by chemical-gene interactions, which have sparse coverage in literature. The ToxCast chemical screening program has produced bioactivity data for nearly 2000 chemicals and over 450 gene targets. Thi...

  11. Interactions of OsMADS1 with Floral Homeotic Genes in Rice Flower Development.

    PubMed

    Hu, Yun; Liang, Wanqi; Yin, Changsong; Yang, Xuelian; Ping, Baozhe; Li, Anxue; Jia, Ru; Chen, Mingjiao; Luo, Zhijing; Cai, Qiang; Zhao, Xiangxiang; Zhang, Dabing; Yuan, Zheng

    2015-09-01

    During reproductive development, rice plants develop unique flower organs which determine the final grain yield. OsMADS1, one of SEPALLATA-like MADS-box genes, has been unraveled to play critical roles in rice floral organ identity specification and floral meristem determinacy. However, the molecular mechanisms underlying interactions of OsMADS1 with other floral homeotic genes in regulating flower development remains largely elusive. In this work, we studied the genetic interactions of OsMADS1 with B-, C-, and D-class genes along with physical interactions among their proteins. We show that the physical and genetic interactions between OsMADS1 and OsMADS3 are essential for floral meristem activity maintenance and organ identity specification; while OsMADS1 physically and genetically interacts with OsMADS58 in regulating floral meristem determinacy and suppressing spikelet meristem reversion. We provided important genetic evidence to support the neofunctionalization of two rice C-class genes (OsMADS3 and OsMADS58) during flower development. Gene expression profiling and quantitative RT-PCR analyses further revealed that OsMADS1 affects the expression of many genes involved in floral identity and hormone signaling, and chromatin immunoprecipitation (ChIP)-PCR assay further demonstrated that OsMADS17 is a direct target gene of OsMADS1. Taken together, these results reveal that OsMADS1 has diversified regulatory functions in specifying rice floral organ and meristem identity, probably through its genetic and physical interactions with different floral homeotic regulators. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.

  12. Microarray expression profiling identifies genes with altered expression in HDL-deficient mice

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Callow, Matthew J.; Dudoit, Sandrine; Gong, Elaine L.

    2000-05-05

    Based on the assumption that severe alterations in the expression of genes known to be involved in HDL metabolism may affect the expression of other genes we screened an array of over 5000 mouse expressed sequence tags (ESTs) for altered gene expression in the livers of two lines of mice with dramatic decreases in HDL plasma concentrations. Labeled cDNA from livers of apolipoprotein AI (apo AI) knockout mice, Scavenger Receptor BI (SR-BI) transgenic mice and control mice were co-hybridized to microarrays. Two-sample t-statistics were used to identify genes with altered expression levels in the knockout or transgenic mice compared with the control mice. In the SR-BI group we found 9 array elements representing at least 5 genes to be significantly altered on the basis of an adjusted p value of less than 0.05. In the apo AI knockout group 8 array elements representing 4 genes were altered compared with the control group (p < 0.05). Several of the genes identified in the SR-BI transgenic suggest altered sterol metabolism and oxidative processes. These studies illustrate the use of multiple-testing methods for the identification of genes with altered expression in replicated microarray experiments of apo AI knockout and SR-BI transgenic mice.
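
    The two-sample t-statistic and adjusted p-value idea in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions: the expression values are invented toy data, the p-values come from an exact permutation of group labels, and a plain Bonferroni adjustment is swapped in for simplicity. It is not presented as the procedure used in the study, and all function names are hypothetical.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Two-sample t-statistic allowing unequal variances."""
    return (mean(a) - mean(b)) / sqrt(variance(a) / len(a) + variance(b) / len(b))


def permutation_p(a, b):
    """Exact two-sided permutation p-value for |t| over all group relabelings."""
    pooled = list(a) + list(b)
    observed = abs(welch_t(a, b))
    hits = total = 0
    for idx in combinations(range(len(pooled)), len(a)):
        chosen = set(idx)
        grp_a = [pooled[i] for i in idx]
        grp_b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance so relabelings that tie with the observed split count.
        if abs(welch_t(grp_a, grp_b)) >= observed - 1e-12:
            hits += 1
    return hits / total


def bonferroni(pvals):
    """Bonferroni-adjusted p-values: multiply by the number of tests, cap at 1."""
    return [min(1.0, p * len(pvals)) for p in pvals]


# Toy log-ratios (hypothetical): gene -> (control replicates, treated replicates).
toy = {
    "gene_A": ([0.1, -0.2, 0.05, 0.0], [1.9, 2.1, 2.3, 2.0]),
    "gene_B": ([1.0, 1.2, 0.9, 1.1], [1.05, 0.95, 1.15, 1.3]),
}
raw = [permutation_p(ctrl, trt) for ctrl, trt in toy.values()]
for gene, p_raw, p_adj in zip(toy, raw, bonferroni(raw)):
    print(gene, round(p_raw, 4), round(p_adj, 4))
```

    With only four replicates per group the smallest attainable permutation p-value is 2/70, which shows why adjustment across thousands of array elements demands either more replicates or a less conservative adjustment than Bonferroni.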

  13. Exploring the key genes and pathways in enchondromas using a gene expression microarray.

    PubMed

    Shi, Zhongju; Zhou, Hengxing; Pan, Bin; Lu, Lu; Kang, Yi; Liu, Lu; Wei, Zhijian; Feng, Shiqing

    2017-07-04

    Enchondromas are the most common primary benign osseous neoplasms that occur in the medullary bone; they can undergo malignant transformation into chondrosarcoma. However, enchondromas are always undetected in patients, and the molecular mechanism is unclear. To identify key genes and pathways associated with the occurrence and development of enchondromas, we downloaded the gene expression dataset GSE22855 and obtained the differentially expressed genes (DEGs) by analyzing high-throughput gene expression in enchondromas. In total, 635 genes were identified as DEGs. Of these, 225 genes (35.43%) were up-regulated, and the remaining 410 genes (64.57%) were down-regulated. We identified the predominant gene ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways that were significantly over-represented in the enchondromas samples compared with the control samples. Subsequently the top 10 core genes were identified from the protein-protein interaction (PPI) network. The enrichment analyses of the genes mainly involved in two significant modules showed that the DEGs were principally related to ribosomes, protein digestion and absorption, ECM-receptor interaction, focal adhesion, amoebiasis and the PI3K-Akt signaling pathway. Together, these data elucidate the molecular mechanisms underlying the occurrence and development of enchondromas and provide promising candidates for therapeutic intervention and prognostic evaluation. However, further experimental studies are needed to confirm these results.

  14. Molecular characterization and analysis of the acrB gene of Aspergillus nidulans: a gene identified by genetic interaction as a component of the regulatory network that includes the CreB deubiquitination enzyme.

    PubMed Central

    Boase, Natasha A; Lockington, Robin A; Adams, Julian R J; Rodbourn, Louise; Kelly, Joan M

    2003-01-01

    Mutations in the acrB gene, which were originally selected through their resistance to acriflavine, also result in reduced growth on a range of sole carbon sources, including fructose, cellobiose, raffinose, and starch, and reduced utilization of omega-amino acids, including GABA and beta-alanine, as sole carbon and nitrogen sources. The acrB2 mutation suppresses the phenotypic effects of mutations in the creB gene that encodes a regulatory deubiquitinating enzyme, and in the creC gene that encodes a WD40-repeat-containing protein. Thus AcrB interacts with a regulatory network controlling carbon source utilization that involves ubiquitination and deubiquitination. The acrB gene was cloned and physically analyzed, and it encodes a novel protein that contains three putative transmembrane domains and a coiled-coil region. AcrB may play a role in the ubiquitination aspect of this regulatory network. PMID:12750323

  15. The REP2 Repeats of the Genome of Neisseria meningitidis Are Associated with Genes Coordinately Regulated during Bacterial Cell Interaction

    PubMed Central

    Morelle, Sandrine; Carbonnelle, Etienne; Nassif, Xavier

    2003-01-01

    Interaction with host cells is essential in meningococcal pathogenesis especially at the blood-brain barrier. This step is likely to involve a common regulatory pathway allowing coordinate regulation of genes necessary for the interaction with endothelial cells. The analysis of the genomic sequence of Neisseria meningitidis Z2491 revealed the presence of many repeats. One of these, designated REP2, contains a −24/−12 type promoter and a ribosome binding site 5 to 13 bp before an ATG. In addition most of these REP2 sequences are located immediately upstream of an ORF. Among these REP2-associated genes are pilC1 and crgA, described as being involved in steps essential for the interaction of N. meningitidis with host cells. Furthermore, the REP2 sequences located upstream of pilC1 and crgA correspond to the previously identified promoters known to be induced during the initial localized adhesion of N. meningitidis with human cells. This characteristic led us to hypothesize that at least some of the REP2-associated genes were upregulated under the same circumstances as pilC1 and crgA. Quantitative PCR in real time demonstrated that the expression of 14 out of 16 REP2-associated genes was upregulated during the initial localized adhesion of N. meningitidis. Taken together, these data suggest that these repeats control a set of genes necessary for the efficient interaction of this pathogen with host cells. Subsequent mutational analysis was performed to address the role of these genes during meningococcus-cell interaction. PMID:12670987

  16. Identifying spatially similar gene expression patterns in early stage fruit fly embryo images: binary feature versus invariant moment digital representations

    PubMed Central

    Gurunathan, Rajalakshmi; Van Emden, Bernard; Panchanathan, Sethuraman; Kumar, Sudhir

    2004-01-01

    Background: Modern developmental biology relies heavily on the analysis of embryonic gene expression patterns. Investigators manually inspect hundreds or thousands of expression patterns to identify those that are spatially similar and to ultimately infer potential gene interactions. However, the rapid accumulation of gene expression pattern data over the last two decades, facilitated by high-throughput techniques, has produced a need for the development of efficient approaches for direct comparison of images, rather than their textual descriptions, to identify spatially similar expression patterns. Results: The effectiveness of the Binary Feature Vector (BFV) and Invariant Moment Vector (IMV) based digital representations of the gene expression patterns in finding biologically meaningful patterns was compared for a small (226 images) and a large (1819 images) dataset. For each dataset, an ordered list of images, with respect to a query image, was generated to identify overlapping and similar gene expression patterns, in a manner comparable to what a developmental biologist might do. The results showed that the BFV representation consistently outperforms the IMV representation in finding biologically meaningful matches when spatial overlap of the gene expression pattern and the genes involved are considered. Furthermore, we explored the value of conducting image-content based searches in a dataset where individual expression components (or domains) of multi-domain expression patterns were also included separately. We found that this technique improves performance of both IMV and BFV based searches. Conclusions: We conclude that the BFV representation consistently produces a more extensive and better list of biologically useful patterns than the IMV representation. The high quality of results obtained scales well as the search database becomes larger, which encourages efforts to build automated image query and retrieval systems for spatial gene expression patterns

  17. Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.

    PubMed

    Allman, Elizabeth S; Degnan, James H; Rhodes, John A

    2011-06-01

    Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals-each with many genes-splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for 5 or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled.
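
    The four-species case can be illustrated with the standard quartet formula from coalescent theory, which is an assumption brought in here rather than something stated in the abstract: if t is the internal branch length of the unrooted species tree in coalescent units, the matching unrooted gene tree topology has probability 1 - (2/3)exp(-t) and each of the two other topologies has probability (1/3)exp(-t). The sketch below uses hypothetical function names and shows why the topology and this one length are recoverable from the distribution, while the root position is not.

```python
from math import exp, log


def unrooted_quartet_probs(t):
    """Probabilities of the three unrooted gene tree topologies on taxa a, b, c, d.

    Assumes the unrooted species tree is ab|cd with internal branch length t
    (coalescent units) and one gene sampled per species.
    """
    discordant = exp(-t) / 3.0
    return {"ab|cd": 1.0 - 2.0 * discordant, "ac|bd": discordant, "ad|bc": discordant}


def internal_length_from_concordance(p_match):
    """Invert p = 1 - (2/3) exp(-t) to recover t from the concordant frequency."""
    if not 1.0 / 3.0 <= p_match < 1.0:
        raise ValueError("concordant probability must lie in [1/3, 1)")
    return -log(1.5 * (1.0 - p_match))


probs = unrooted_quartet_probs(0.5)
most_likely = max(probs, key=probs.get)          # identifies the unrooted topology
t_hat = internal_length_from_concordance(probs[most_likely])
print(most_likely, round(t_hat, 6))
```

    Only the single quantity t enters these three probabilities, so any rooted species trees that share the same unrooted topology and the same t give identical distributions, consistent with the statement that the root is not identifiable for four species.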

  18. Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines

    PubMed Central

    Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

    2016-01-01

    Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours’ biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription–quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes. PMID:29263807

  19. Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines.

    PubMed

    Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

    2016-01-01

    Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.

  20. Interactions between collagen gene variants and risk of anterior cruciate ligament rupture.

    PubMed

    O'Connell, Kevin; Knight, Hayley; Ficek, Krzysztof; Leonska-Duniec, Agata; Maciejewska-Karlowska, Agnieszka; Sawczuk, Marek; Stepien-Slodkowska, Marta; O'Cuinneagain, Dion; van der Merwe, Willem; Posthumus, Michael; Cieszczyk, Pawel; Collins, Malcolm

    2015-01-01

    The COL5A1 and COL12A1 variants are independently associated with modulating the risk of anterior cruciate ligament (ACL) rupture in females. The objective of this study was to further investigate whether COL3A1 and COL6A1 variants independently, as well as collagen gene-gene interactions, modulate ACL rupture risk. Three hundred and thirty-three South African (SA, n = 242) and Polish (PL, n = 91) participants with diagnosed ACL ruptures and 378 controls (235 SA and 143 PL) were recruited. Participants were genotyped for COL3A1 rs1800255 (G/A), COL5A1 rs12722 (T/C), COL6A1 rs35796750 (T/C) and COL12A1 rs970547 (A/G). No significant associations were identified between COL6A1 rs35796750 and COL3A1 rs1800255 genotypes and risk of ACL rupture in the SA cohort. The COL3A1 AA genotype was, however, significantly (p = 0.036) over-represented in the PL ACL group (9.9%, n = 9) when compared to the PL control (CON) group (2.8%, n = 4). Although there were genotype distribution differences between the SA and PL cohorts, the T+A-inferred pseudo-haplotype constructed from COL5A1 and COL12A1 was significantly over-represented in the female ACL group when compared to the female CON group within the SA (T+A ACL 50.5%, T+A CON 38.1%, p = 0.022), PL (T+A ACL 56.3%, T+A CON 36.3%, p = 0.029) and combined (T+A ACL 51.8%, T+A CON 37.5%, p = 0.004) cohorts. In conclusion, the novel main finding of this study was a significant interaction between the COL5A1 rs12722 T/C and COL12A1 rs970547 A/G variants and risk of ACL injury. These results highlight the importance of investigating gene-gene interactions in the aetiology of ACL ruptures in multiple independent cohorts.

  1. Haplotype Analysis in Multiple Crosses to Identify a QTL Gene

    PubMed Central

    Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly

    2004-01-01

    Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P ≤ 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene. PMID:15310659

  2. Haplotype analysis in multiple crosses to identify a QTL gene.

    PubMed

    Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly

    2004-09-01

    Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P ≤ 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene.

  3. Gene-Gene Combination Effect and Interactions among ABCA1, APOA1, SR-B1, and CETP Polymorphisms for Serum High-Density Lipoprotein-Cholesterol in the Japanese Population

    PubMed Central

    Nakamura, Akihiko; Niimura, Hideshi; Kuwabara, Kazuyo; Takezaki, Toshiro; Morita, Emi; Wakai, Kenji; Hamajima, Nobuyuki; Nishida, Yuichiro; Turin, Tanvir Chowdhury; Suzuki, Sadao; Ohnaka, Keizo; Uemura, Hirokazu; Ozaki, Etsuko; Hosono, Satoyo; Mikami, Haruo; Kubo, Michiaki; Tanaka, Hideo

    2013-01-01

    Background/Objective: Gene-gene interactions in the reverse cholesterol transport system for high-density lipoprotein-cholesterol (HDL-C) are poorly understood. The present study observed gene-gene combination effect and interactions between single nucleotide polymorphisms (SNPs) in ABCA1, APOA1, SR-B1, and CETP in serum HDL-C from a cross-sectional study in the Japanese population. Methods: The study population comprised 1,535 men and 1,515 women aged 35–69 years who were enrolled in the Japan Multi-Institutional Collaborative Cohort (J-MICC) Study. We selected 13 SNPs in the ABCA1, APOA1, CETP, and SR-B1 genes in the reverse cholesterol transport system. The effects of genetic and environmental factors were assessed using general linear and logistic regression models after adjusting for age, sex, and region. Principal Findings: Alcohol consumption and daily activity were positively associated with HDL-C levels, whereas smoking had a negative relationship. The T allele of CETP, rs3764261, was correlated with higher HDL-C levels and had the highest coefficient (2.93 mg/dL/allele) among the 13 SNPs, which was statistically significant after applying the Bonferroni correction (p<0.001). Gene-gene combination analysis revealed that CETP rs3764261 was associated with high HDL-C levels with any combination of SNPs from ABCA1, APOA1, and SR-B1, although no gene-gene interaction was apparent. An increasing trend for serum HDL-C was also observed with an increasing number of alleles (p<0.001). Conclusions: The present study identified a multiplier effect from a polymorphism in CETP with ABCA1, APOA1, and SR-B1, as well as a dose-dependence according to the number of alleles present. PMID:24376512

  4. Differentially Coexpressed Disease Gene Identification Based on Gene Coexpression Network.

    PubMed

    Jiang, Xue; Zhang, Han; Quan, Xiongwen

    2016-01-01

    Screening disease-related genes by analyzing gene expression data has become a popular theme. Traditional disease-related gene selection methods always focus on identifying differentially expressed gene between case samples and a control group. These traditional methods may not fully consider the changes of interactions between genes at different cell states and the dynamic processes of gene expression levels during the disease progression. However, in order to understand the mechanism of disease, it is important to explore the dynamic changes of interactions between genes in biological networks at different cell states. In this study, we designed a novel framework to identify disease-related genes and developed a differentially coexpressed disease-related gene identification method based on gene coexpression network (DCGN) to screen differentially coexpressed genes. We firstly constructed phase-specific gene coexpression network using time-series gene expression data and defined the conception of differential coexpression of genes in coexpression network. Then, we designed two metrics to measure the value of gene differential coexpression according to the change of local topological structures between different phase-specific networks. Finally, we conducted meta-analysis of gene differential coexpression based on the rank-product method. Experimental results demonstrated the feasibility and effectiveness of DCGN and the superior performance of DCGN over other popular disease-related gene selection methods through real-world gene expression data sets.
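
    The rank-product step mentioned at the end of the abstract above can be sketched generically as the geometric mean of a gene's ranks across several score tables. The abstract does not define its two differential coexpression metrics, so the scores below are invented placeholders, the function names are hypothetical, and this is a sketch of the general rank-product idea rather than the DCGN implementation.

```python
from math import prod


def ranks_desc(scores):
    """Rank genes so that rank 1 is the largest score (ties broken by name)."""
    ordered = sorted(scores, key=lambda g: (-scores[g], g))
    return {gene: i + 1 for i, gene in enumerate(ordered)}


def rank_product(score_tables):
    """Geometric mean of each gene's rank across all score tables.

    Only genes present in every table are kept; a smaller rank product means
    the gene ranks consistently near the top.
    """
    genes = set.intersection(*(set(t) for t in score_tables))
    per_table = [ranks_desc({g: t[g] for g in genes}) for t in score_tables]
    k = len(per_table)
    return {g: prod(r[g] for r in per_table) ** (1.0 / k) for g in genes}


# Hypothetical differential coexpression scores from two metrics.
metric_1 = {"g1": 0.9, "g2": 0.5, "g3": 0.1}
metric_2 = {"g1": 0.7, "g2": 0.8, "g3": 0.2}
rp = rank_product([metric_1, metric_2])
for gene in sorted(rp, key=lambda g: (rp[g], g)):
    print(gene, round(rp[gene], 3))
```

    In practice the significance of a rank product is usually judged against permuted rankings, which this sketch leaves out.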

  5. Entropy Based Genetic Association Tests and Gene-Gene Interaction Tests

    PubMed Central

    de Andrade, Mariza; Wang, Xin

    2011-01-01

    In the past few years, several entropy-based tests have been proposed for testing either single SNP association or gene-gene interaction. These tests are mainly based on Shannon entropy and have higher statistical power when compared to standard χ² tests. In this paper, we extend some of these tests using a more generalized entropy definition, Rényi entropy, where Shannon entropy is a special case of order 1. The order λ (>0) of Rényi entropy weights the events (genotype/haplotype) according to their probabilities (frequencies). Higher λ places more emphasis on higher probability events while smaller λ (close to 0) tends to assign weights more equally. Thus, by properly choosing λ, one can potentially increase the power of the tests or the p-value level of significance. We conducted simulation as well as real data analyses to assess the impact of the order λ and the performance of these generalized tests. The results showed that for the dominant model the order 2 test was more powerful and for the multiplicative model the order 1 or 2 tests had similar power. The analyses indicate that the choice of λ depends on the underlying genetic model and Shannon entropy is not necessarily the most powerful entropy measure for constructing genetic association or interaction tests. PMID:23089811
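To make the role of the order λ concrete, here is a minimal sketch of the standard Rényi entropy of a discrete distribution, H_λ(p) = log(Σ p_i^λ) / (1 − λ), with the Shannon entropy used as the λ → 1 limit. This shows the entropy definition only, not the authors' test statistics. The function name `renyi_entropy` and the genotype frequencies are illustrative assumptions.

```python
import math

def renyi_entropy(probs, order):
    """Rényi entropy (natural log) of a discrete distribution, for order > 0."""
    if order <= 0:
        raise ValueError("order must be positive")
    p = [x for x in probs if x > 0]  # zero-probability events contribute nothing
    if abs(sum(p) - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    if math.isclose(order, 1.0):
        # Limit as order -> 1 is the Shannon entropy.
        return -sum(x * math.log(x) for x in p)
    return math.log(sum(x ** order for x in p)) / (1.0 - order)

# Hypothetical genotype frequencies (AA, Aa, aa).
genotype_freqs = [0.49, 0.42, 0.09]
for lam in (0.5, 1.0, 2.0):
    print(lam, round(renyi_entropy(genotype_freqs, lam), 4))
```

The entropy does not increase as λ grows, which matches the abstract's description: larger λ lets the most frequent genotypes dominate the sum, while λ near 0 treats all observed genotypes almost equally.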

  6. Nature versus nurture: A systematic approach to elucidate gene-environment interactions in the development of myopic refractive errors.

    PubMed

    Miraldi Utz, Virginia

    2017-01-01

    Myopia is the most common eye disorder and major cause of visual impairment worldwide. As the incidence of myopia continues to rise, the need to further understand the complex roles of molecular and environmental factors controlling variation in refractive error is of increasing importance. Tkatchenko and colleagues applied a systematic approach using a combination of gene set enrichment analysis, genome-wide association studies, and functional analysis of a murine model to identify a myopia susceptibility gene, APLP2. Differential expression of refractive error was associated with time spent reading for those with low frequency variants in this gene. This provides support for the longstanding hypothesis of gene-environment interactions in refractive error development.

  7. Interaction between genes and macronutrient intake on the risk of developing type 2 diabetes: systematic review and findings from European Prospective Investigation into Cancer (EPIC)-InterAct.

    PubMed

    Li, Sherly X; Imamura, Fumiaki; Ye, Zheng; Schulze, Matthias B; Zheng, Jusheng; Ardanaz, Eva; Arriola, Larraitz; Boeing, Heiner; Dow, Courtney; Fagherazzi, Guy; Franks, Paul W; Agudo, Antonio; Grioni, Sara; Kaaks, Rudolf; Katzke, Verena A; Key, Timothy J; Khaw, Kay Tee; Mancini, Francesca R; Navarro, Carmen; Nilsson, Peter M; Onland-Moret, N Charlotte; Overvad, Kim; Palli, Domenico; Panico, Salvatore; Quirós, J Ramón; Rolandsson, Olov; Sacerdote, Carlotta; Sánchez, María-José; Slimani, Nadia; Sluijs, Ivonne; Spijkerman, Annemieke Mw; Tjonneland, Anne; Tumino, Rosario; Sharp, Stephen J; Riboli, Elio; Langenberg, Claudia; Scott, Robert A; Forouhi, Nita G; Wareham, Nicholas J

    2017-07-01

    Background: Gene-diet interactions have been reported to contribute to the development of type 2 diabetes (T2D). However, to our knowledge, few examples have been consistently replicated to date. Objective: We aimed to identify existing evidence for gene-macronutrient interactions and T2D and to examine the reported interactions in a large-scale study. Design: We systematically reviewed studies reporting gene-macronutrient interactions and T2D. We searched the MEDLINE, Human Genome Epidemiology Network, and WHO International Clinical Trials Registry Platform electronic databases to identify studies published up to October 2015. Eligibility criteria included assessment of macronutrient quantity (e.g., total carbohydrate) or indicators of quality (e.g., dietary fiber) by use of self-report or objective biomarkers of intake. Interactions identified in the review were subsequently examined in the EPIC (European Prospective Investigation into Cancer)-InterAct case-cohort study ( n = 21,148, with 9403 T2D cases; 8 European countries). Prentice-weighted Cox regression was used to estimate country-specific HRs, 95% CIs, and P -interaction values, which were then pooled by random-effects meta-analysis. A primary model was fitted by using the same covariates as reported in the published studies, and a second model adjusted for additional covariates and estimated the effects of isocaloric macronutrient substitution. Results: Thirteen observational studies met the eligibility criteria ( n < 1700 cases). Eight unique interactions were reported to be significant between macronutrients [carbohydrate, fat, saturated fat, dietary fiber, and glycemic load derived from self-report of dietary intake and circulating n-3 (ω-3) polyunsaturated fatty acids] and genetic variants in or near transcription factor 7-like 2 ( TCF7L2 ), gastric inhibitory polypeptide receptor ( GIPR ), caveolin 2 ( CAV2 ), and peptidase D ( PEPD ) ( P -interaction < 0.05). We found no evidence of interaction

  8. A transposon-based genetic screen in mice identifies genes altered in colorectal cancer.

    PubMed

    Starr, Timothy K; Allaei, Raha; Silverstein, Kevin A T; Staggs, Rodney A; Sarver, Aaron L; Bergemann, Tracy L; Gupta, Mihir; O'Sullivan, M Gerard; Matise, Ilze; Dupuy, Adam J; Collier, Lara S; Powers, Scott; Oberg, Ann L; Asmann, Yan W; Thibodeau, Stephen N; Tessarollo, Lino; Copeland, Neal G; Jenkins, Nancy A; Cormier, Robert T; Largaespada, David A

    2009-03-27

    Human colorectal cancers (CRCs) display a large number of genetic and epigenetic alterations, some of which are causally involved in tumorigenesis (drivers) and others that have little functional impact (passengers). To help distinguish between these two classes of alterations, we used a transposon-based genetic screen in mice to identify candidate genes for CRC. Mice harboring mutagenic Sleeping Beauty (SB) transposons were crossed with mice expressing SB transposase in gastrointestinal tract epithelium. Most of the offspring developed intestinal lesions, including intraepithelial neoplasia, adenomas, and adenocarcinomas. Analysis of over 16,000 transposon insertions identified 77 candidate CRC genes, 60 of which are mutated and/or dysregulated in human CRC and thus are most likely to drive tumorigenesis. These genes include APC, PTEN, and SMAD4. The screen also identified 17 candidate genes that had not previously been implicated in CRC, including POLI, PTPRK, and RSPO2.

  9. Effect of occupational exposures on lung cancer susceptibility: a study of gene-environment interaction analysis.

    PubMed

    Malhotra, Jyoti; Sartori, Samantha; Brennan, Paul; Zaridze, David; Szeszenia-Dabrowska, Neonila; Świątkowska, Beata; Rudnai, Peter; Lissowska, Jolanta; Fabianova, Eleonora; Mates, Dana; Bencko, Vladimir; Gaborieau, Valerie; Stücker, Isabelle; Foretova, Lenka; Janout, Vladimir; Boffetta, Paolo

    2015-03-01

    Occupational exposures are known risk factors for lung cancer. The role of genetically determined host factors in occupational exposure-related lung cancer is unclear. We used genome-wide association (GWA) data from a case-control study conducted in 6 European countries from 1998 to 2002 to identify gene-occupation interactions and related pathways for lung cancer risk. GWA analysis was performed for each exposure using logistic regression, and an interaction term for genotype and exposure was included in this model. Both SNP-based and gene-based interaction P values were calculated. Pathway analysis was performed using three complementary methods, and analyses were adjusted for multiple comparisons. We analyzed 312,605 SNPs and occupational exposure to 70 agents from 1,802 lung cancer cases and 1,725 cancer-free controls. Mean age of study participants was 60.1 ± 9.1 years and 75% were male. The largest number of significant associations (P ≤ 1 × 10⁻⁵) at the SNP level was demonstrated for nickel, brick dust, concrete dust, and cement dust, and for brick dust and cement dust at the gene level (P ≤ 1 × 10⁻⁴). Approximately 14 occupational exposures showed significant gene-occupation interactions with pathways related to response to environmental information processing via signal transduction (P < 0.001 and FDR < 0.05). Other pathways that showed significant enrichment were related to immune processes and xenobiotic metabolism. Our findings suggest that pathways related to signal transduction, immune process, and xenobiotic metabolism may be involved in occupational exposure-related lung carcinogenesis. Our study exemplifies an integrative approach using pathway-based analysis to demonstrate the role of genetic variants in occupational exposure-related lung cancer susceptibility. Cancer Epidemiol Biomarkers Prev; 24(3); 570-9. ©2015 AACR.
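As a rough illustration of what an interaction term in a logistic model measures, consider the simplest unadjusted case: a binary genotype G and a binary exposure E. In a saturated model logit(p) = β0 + β1·G + β2·E + β3·G·E, exp(β3) equals the joint odds ratio divided by the product of the two single-factor odds ratios. The sketch below computes that ratio from a 2×2×2 table. The counts are invented for illustration and are not taken from the study, which also used covariate adjustment that this sketch omits.

```python
import math

def odds_ratio(cases, controls, ref_cases, ref_controls):
    """Odds ratio of one stratum relative to the reference stratum."""
    return (cases / controls) / (ref_cases / ref_controls)

def multiplicative_interaction(counts):
    """counts[(g, e)] = (cases, controls) for binary genotype g and exposure e.

    Returns exp(beta3) of the saturated logistic model with a G x E term.
    """
    ref = counts[(0, 0)]
    or_g = odds_ratio(*counts[(1, 0)], *ref)   # genotype only
    or_e = odds_ratio(*counts[(0, 1)], *ref)   # exposure only
    or_ge = odds_ratio(*counts[(1, 1)], *ref)  # both present
    return or_ge / (or_g * or_e)

# Hypothetical case/control counts per (genotype, exposure) stratum.
counts = {
    (0, 0): (100, 200),
    (1, 0): (60, 100),
    (0, 1): (80, 100),
    (1, 1): (90, 50),
}

interaction_or = multiplicative_interaction(counts)
print("interaction OR:", round(interaction_or, 3))
print("beta3 (log scale):", round(math.log(interaction_or), 3))
```

A value of 1 (β3 = 0) means the joint effect is exactly the product of the separate effects. Values above 1 indicate a more-than-multiplicative joint effect.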

  10. The Interaction Network Ontology-supported modeling and mining of complex interactions represented with multiple keywords in biomedical literature.

    PubMed

    Özgür, Arzucan; Hur, Junguk; He, Yongqun

    2016-01-01

    The Interaction Network Ontology (INO) logically represents biological interactions, pathways, and networks. INO has been demonstrated to be valuable in providing a set of structured ontological terms and associated keywords to support literature mining of gene-gene interactions from biomedical literature. However, previous work using INO focused on single keyword matching, while many interactions are represented with two or more interaction keywords used in combination. This paper reports our extension of INO to include combinatory patterns of two or more literature mining keywords co-existing in one sentence to represent specific INO interaction classes. Such keyword combinations and related INO interaction type information could be automatically obtained via SPARQL queries, formatted in Excel format, and used in an INO-supported SciMiner, an in-house literature mining program. We studied the gene interaction sentences from the commonly used benchmark Learning Logic in Language (LLL) dataset and one internally generated vaccine-related dataset to identify and analyze interaction types containing multiple keywords. Patterns obtained from the dependency parse trees of the sentences were used to identify the interaction keywords that are related to each other and collectively represent an interaction type. The INO ontology currently has 575 terms including 202 terms under the interaction branch. The relations between the INO interaction types and associated keywords are represented using the INO annotation relations: 'has literature mining keywords' and 'has keyword dependency pattern'. The keyword dependency patterns were generated via running the Stanford Parser to obtain dependency relation types. Out of the 107 interactions in the LLL dataset represented with two-keyword interaction types, 86 were identified by using the direct dependency relations. The LLL dataset contained 34 gene regulation interaction types, each of which was associated with multiple keywords. A

  11. Comparison of gene expression in segregating families identifies genes and genomic regions involved in a novel adaptation, zinc hyperaccumulation.

    PubMed

    Filatov, Victor; Dowdle, John; Smirnoff, Nicholas; Ford-Lloyd, Brian; Newbury, H John; Macnair, Mark R

    2006-09-01

    One of the challenges of comparative genomics is to identify specific genetic changes associated with the evolution of a novel adaptation or trait. We need to be able to disassociate the genes involved with a particular character from all the other genetic changes that take place as lineages diverge. Here we show that by comparing the transcriptional profile of segregating families with that of parent species differing in a novel trait, it is possible to narrow down substantially the list of potential target genes. In addition, by assuming synteny with a related model organism for which the complete genome sequence is available, it is possible to use the cosegregation of markers differing in transcription level to identify regions of the genome which probably contain quantitative trait loci (QTLs) for the character. This novel combination of genomics and classical genetics provides a very powerful tool to identify candidate genes. We use this methodology to investigate zinc hyperaccumulation in Arabidopsis halleri, the sister species to the model plant, Arabidopsis thaliana. We compare the transcriptional profile of A. halleri with that of its sister nonaccumulator species, Arabidopsis petraea, and between accumulator and nonaccumulator F(3)s derived from the cross between the two species. We identify eight genes which consistently show greater expression in accumulator phenotypes in both roots and shoots, including two metal transporter genes (NRAMP3 and ZIP6), and cytoplasmic aconitase, a gene involved in iron homeostasis in mammals. We also show that there appear to be two QTLs for zinc accumulation, on chromosomes 3 and 7.

  12. Identifying Drug-Target Interactions with Decision Templates.

    PubMed

    Yan, Xiao-Ying; Zhang, Shao-Wu

    2018-01-01

    During the development of new drugs, identification of drug-target interactions is a primary concern. However, chemical and biological experiments are limited in coverage and costly in both time and money. Based on drug similarity and target similarity, chemogenomic methods can predict potential drug-target interactions (DTIs) on a large scale and do not require target structures or ligand entries. Existing similarity metrics, however, may not reflect the cases in which drugs having variant structures interact with common targets and targets having dissimilar sequences interact with the same drugs. In addition, though several other similarity metrics have been developed to predict DTIs, the combination of multiple similarity metrics (especially heterogeneous similarities) is too naïve to sufficiently explore the multiple similarities. In this paper, based on Gene Ontology and pathway annotation, we introduce two novel target similarity metrics to address the above issues. More importantly, we propose a more effective strategy via decision template to integrate multiple classifiers designed with multiple similarity metrics. In the scenarios of predicting existing targets for new drugs and predicting approved drugs for new protein targets, the results on the DTI benchmark datasets show that our target similarity metrics are able to enhance the predictive accuracy in both scenarios. The elaborate fusion strategy of multiple classifiers also has better predictive power than the naïve combination of multiple similarity metrics. Compared with two other state-of-the-art approaches on the four popular benchmark datasets of binary drug-target interactions, our method achieves the best results in terms of AUC and AUPR for predicting available targets for new drugs (S2) and predicting approved drugs for new protein targets (S3). These results demonstrate that our method can effectively predict drug-target interactions. The software package can

  13. Identifying Mother-Child Interaction Styles Using a Person-Centered Approach.

    PubMed

    Nelson, Jackie A; O'Brien, Marion; Grimm, Kevin J; Leerkes, Esther M

    2014-05-01

    Parent-child conflict in the context of a supportive relationship has been discussed as a potentially constructive interaction pattern; the current study is the first to test this using a holistic analytic approach. Interaction styles, defined as mother-child conflict in the context of maternal sensitivity, were identified and described with demographic and stress-related characteristics of families. Longitudinal associations were tested between interaction styles and children's later social competence. Participants included 814 partnered mothers with a first-grade child. Latent profile analysis identified agreeable , dynamic , and disconnected interaction styles. Mothers' intimacy with a partner, depressive symptoms, and authoritarian childrearing beliefs, along with children's later conflict with a best friend and externalizing problems, were associated with group membership. Notably, the dynamic style, characterized by high sensitivity and high conflict, included families who experienced psychological and relational stressors. Findings are discussed with regard to how family stressors shape parent-child interaction patterns.

  14. Integrating genome-wide association study and expression quantitative trait loci data identifies multiple genes and gene set associated with neuroticism.

    PubMed

    Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng

    2017-08-01

    Neuroticism is a fundamental personality trait with a significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data from a genome-wide association study (GWAS) and an expression quantitative trait locus (eQTL) study. GWAS summary data were derived from published studies of neuroticism involving a total of 170,906 subjects. An eQTL dataset containing 927,753 eQTLs was obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted with the summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism-associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of the GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10⁻¹⁰), MGC57346 (p value=6.92×10⁻⁷), BLK (p value=1.01×10⁻⁶), XKR6 (p value=1.11×10⁻⁶), C17ORF69 (p value=1.12×10⁻⁶) and KIAA1267 (p value=4.00×10⁻⁶). Gene set enrichment analysis observed a significant association for the Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for studies of the genetic mechanism of neuroticism. Copyright © 2017. Published by Elsevier Inc.

  15. Genome-Wide Analysis in Three Fusarium Pathogens Identifies Rapidly Evolving Chromosomes and Genes Associated with Pathogenicity

    PubMed Central

    Sperschneider, Jana; Gardiner, Donald M.; Thatcher, Louise F.; Lyons, Rebecca; Singh, Karam B.; Manners, John M.; Taylor, Jennifer M.

    2015-01-01

    Pathogens and hosts are in an ongoing arms race and genes involved in host–pathogen interactions are likely to undergo diversifying selection. Fusarium plant pathogens have evolved diverse infection strategies, but how they interact with their hosts in the biotrophic infection stage remains puzzling. To address this, we analyzed the genomes of three Fusarium plant pathogens for genes that are under diversifying selection. We found a two-speed genome structure both on the chromosome and gene group level. Diversifying selection acts strongly on the dispensable chromosomes in Fusarium oxysporum f. sp. lycopersici and on distinct core chromosome regions in Fusarium graminearum, all of which have associations with virulence. Members of two gene groups evolve rapidly, namely those that encode proteins with an N-terminal [SG]-P-C-[KR]-P sequence motif and proteins that are conserved predominantly in pathogens. Specifically, 29 F. graminearum genes are rapidly evolving, in planta induced and encode secreted proteins, strongly pointing toward effector function. In summary, diversifying selection in Fusarium is strongly reflected as genomic footprints and can be used to predict a small gene set likely to be involved in host–pathogen interactions for experimental verification. PMID:25994930

  16. A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.

    PubMed

    Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C

    2017-10-01

    Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.

  17. Disentangling Gene-Environment Correlations and Interactions on Adolescent Depressive Symptoms

    ERIC Educational Resources Information Center

    Lau, Jennifer Y. F.; Eley, Thalia C.

    2008-01-01

    Background: Genetic risks for depression may be expressed through greater exposure towards environmental stressors (gene-environment correlation, rGE) and increased susceptibility to these stressors (gene-environment interaction, G x E). While these effects are often studied independently, evidence supports their co-occurrence on depression.…

  18. Identification of genes differentially expressed during interaction of resistant and susceptible apple cultivars (Malus × domestica) with Erwinia amylovora

    PubMed Central

    2010-01-01

    Background The necrogenic enterobacterium Erwinia amylovora is the causal agent of fire blight (FB) disease in many Rosaceae species, including apple and pear. During the infection process, the bacteria induce an oxidative stress response with kinetics similar to those induced in an incompatible bacteria-plant interaction. Although no resistance mechanism to E. amylovora in host plants has yet been characterized, recent work has identified some molecular events that occur in resistant and/or susceptible host interactions with E. amylovora. In order to understand the mechanisms that characterize responses to FB, differentially expressed genes were identified by cDNA-AFLP analysis in resistant and susceptible apple genotypes after inoculation with E. amylovora. Results cDNAs were isolated from M.26 (susceptible) and G.41 (resistant) apple tissues collected 2 h and 48 h after challenge with a virulent E. amylovora strain or mock (buffer) inoculation. To identify differentially expressed transcripts, electrophoretic banding patterns were obtained from cDNAs. In the AFLP experiments, M.26 and G.41 showed different patterns of expression, including genes specifically induced, not induced, or repressed by E. amylovora. In total, 190 ESTs differentially expressed between M.26 and G.41 were identified using 42 pairs of AFLP primers. cDNA-AFLP analysis of global EST expression in a resistant and a susceptible apple genotype identified different major classes of genes. EST sequencing data showed that genes linked to resistance, encoding proteins involved in recognition, signaling, defense and apoptosis, were modulated by E. amylovora in its host plant. The expression time course of some of these ESTs selected via a bioinformatic analysis has been characterized. Conclusion These data are being used to develop hypotheses of resistance or susceptibility mechanisms in Malus to E. amylovora and provide an initial categorization of genes possibly involved in recognition events, early

  19. Super-Enhancers and Broad H3K4me3 Domains Form Complex Gene Regulatory Circuits Involving Chromatin Interactions.

    PubMed

    Cao, Fan; Fang, Yiwen; Tan, Hong Kee; Goh, Yufen; Choy, Jocelyn Yeen Hui; Koh, Bryan Thean Howe; Hao Tan, Jiong; Bertin, Nicolas; Ramadass, Aroul; Hunter, Ewan; Green, Jayne; Salter, Matthew; Akoulitchev, Alexandre; Wang, Wilson; Chng, Wee Joo; Tenen, Daniel G; Fullwood, Melissa J

    2017-05-19

    Stretched histone regions, such as super-enhancers and broad H3K4me3 domains, are associated with maintenance of cell identity and cancer. We connected super-enhancers and broad H3K4me3 domains in the K562 chronic myelogenous leukemia cell line as well as the MCF-7 breast cancer cell line with chromatin interactions. Super-enhancers and broad H3K4me3 domains showed higher association with chromatin interactions than their typical counterparts. Interestingly, we identified a subset of super-enhancers that overlap with broad H3K4me3 domains and show high association with cancer-associated genes including tumor suppressor genes. Besides cell lines, we could observe chromatin interactions by a Chromosome Conformation Capture (3C)-based method, in primary human samples. Several chromatin interactions involving super-enhancers and broad H3K4me3 domains are constitutive and can be found in both cancer and normal samples. Taken together, these results reveal a new layer of complexity in gene regulation by super-enhancers and broad H3K4me3 domains.

  20. A large-scale RNA interference screen identifies genes that regulate autophagy at different stages.

    PubMed

    Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man; He, Bin; Zhang, Liqing; Varmark, Hanne; Green, Michael R; Sheng, Zhi

    2018-02-12

    Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed a large-scale RNA interference screen in K562 human chronic myeloid leukemia cells using monodansylcadaverine staining, an autophagy-detecting approach equivalent to immunoblotting of the autophagy marker LC3B or fluorescence microscopy of GFP-LC3B. By coupling monodansylcadaverine staining with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells where we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. Twenty genes have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays revealed that 57 autophagy-regulating genes suppressed autophagy initiation, whereas 21 candidates promoted autophagy maturation. Our RNA interference screen identified genes that regulate autophagy at different stages, which helps decode autophagy regulation in cancer and offers novel avenues to develop autophagy-related therapies for cancer.

  1. Identification of Genes Involved in Breast Cancer Metastasis by Integrating Protein-Protein Interaction Information with Expression Data.

    PubMed

    Tian, Xin; Xin, Mingyuan; Luo, Jian; Liu, Mingyao; Jiang, Zhenran

    2017-02-01

    The selection of relevant genes for breast cancer metastasis is critical for the treatment and prognosis of cancer patients. Although much effort has been devoted to gene selection procedures using different statistical analysis methods or computational techniques, the interpretation of the variables in the resulting survival models has been limited so far. This article proposes a new Random Forest (RF)-based algorithm to identify important variables highly related to breast cancer metastasis, which is based on the importance scores of two variable selection algorithms: the mean decrease Gini (MDG) criterion of Random Forest and the GeneRank algorithm with protein-protein interaction (PPI) information. The new gene selection algorithm is called PPIRF. The improved prediction accuracy illustrated the reliability and high interpretability of the gene list selected by the PPIRF approach.

  2. Transcriptional Profiling Identifies Functional Interactions of TGFβ and PPARβ/δ Signaling

    PubMed Central

    Kaddatz, Kerstin; Adhikary, Till; Finkernagel, Florian; Meissner, Wolfgang; Müller-Brüsselbach, Sabine; Müller, Rolf

    2010-01-01

    Peroxisome proliferator-activated receptors (PPARs) not only play a key role in regulating metabolic pathways but also modulate inflammatory processes, pointing to a functional interaction between PPAR and cytokine signaling pathways. In this study, we show by genome-wide transcriptional profiling that PPARβ/δ and transforming growth factor-β (TGFβ) pathways functionally interact in human myofibroblasts and that a subset of these genes is cooperatively activated by TGFβ and PPARβ/δ. Using the angiopoietin-like 4 (ANGPTL4) gene as a model, we demonstrate that two enhancer regions cooperate to mediate the observed synergistic response. A TGFβ-responsive enhancer located ∼8 kb upstream of the transcriptional start site is regulated by a mechanism involving SMAD3, ETS1, RUNX, and AP-1 transcription factors that interact with multiple contiguous binding sites. A second enhancer (PPAR-E) consisting of three juxtaposed PPAR response elements is located in the third intron ∼3.5 kb downstream of the transcriptional start site. The PPAR-E is strongly activated by all three PPAR subtypes, with a novel type of PPAR response element motif playing a central role. Although the PPAR-E is not regulated by TGFβ, it interacts with SMAD3, ETS1, RUNX2, and AP-1 in vivo, providing a possible mechanistic explanation for the observed synergism. PMID:20595396

  3. Mutation analysis of aryl hydrocarbon receptor interacting protein (AIP) gene in colorectal, breast, and prostate cancers

    PubMed Central

    Georgitsi, M; Karhu, A; Winqvist, R; Visakorpi, T; Waltering, K; Vahteristo, P; Launonen, V; Aaltonen, L A

    2007-01-01

    Germline mutations in the aryl hydrocarbon receptor interacting protein (AIP) gene were recently identified in individuals with pituitary adenoma predisposition (PAP). These patients have prolactin (PRL) or growth hormone (GH) oversecreting pituitary adenomas, the latter exhibiting acromegaly or gigantism. Loss-of-heterozygosity (LOH) analysis revealed that AIP is lost in PAP tumours, suggesting that it acts as a tumour-suppressor gene. Aryl hydrocarbon receptor interacting protein is involved in several pathways, but it is best characterised as a cytoplasmic partner of the aryl hydrocarbon receptor (AHR). To examine the possible role of AIP in the genesis of common cancers, we performed somatic mutation screening in a series of 373 colorectal cancers (CRCs), 82 breast cancers, and 44 prostate tumour samples. A missense R16H (47G>A) change was identified in two CRC samples, as well as in the respective normal tissues, but was absent in 209 healthy controls. The remaining findings were silent, previously unreported, changes of the coding, non-coding, or untranslated regions of AIP. These results suggest that somatic AIP mutations are not common in CRC, breast, and prostate cancers. PMID:17242703

  4. Integrating mean and variance heterogeneities to identify differentially expressed genes.

    PubMed

    Ouyang, Weiwei; An, Qiang; Zhao, Jinying; Qin, Huaizhen

    2016-12-06

    In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (aka, the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration; and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both mean and some higher-order characteristics of the distributions of expression levels of susceptible genes. In this report, we put forth a conception of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existent mean heterogeneity tests and variance heterogeneity tests. Based on the independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT well controlled type I error rates, and so did existent mean heterogeneity test (i.e., the Welch t test (WT), the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment

  5. Gender specific gene-environment interactions on laboratory-assessed aggression.

    PubMed

    Verona, Edelyn; Joiner, Thomas E; Johnson, Frank; Bender, Theodore W

    2006-01-01

    We examined gene-environment interactive effects on aggressive behavior among men and women genotyped (short versus long alleles) for the serotonin transporter gene. Aggressive behavior was indexed via a laboratory paradigm that measured the intensity and duration of shocks delivered to a putative "employee". Half of the participants were exposed to a physical stressor during the procedure (stress) and half were not (no-stress). Participants' physiological responses were gauged via acoustic startle eyeblink reactions (startle reactivity). Results were that men with the homozygous short (s/s) genotype showed increased aggression only under stress, whereas women and men carrying the long allele did not show differences in aggression in stress versus no-stress. However, although stress exposure produced increases in startle reactivity, there were no genotype or gender differences in physiology. These results replicate longitudinal research findings confirming the interactive effects of genes and environment on behavioral reactivity and on the development of externalizing psychopathological syndromes, at least in men.

  6. Coordinated Rates of Evolution between Interacting Plastid and Nuclear Genes in Geraniaceae

    PubMed Central

    Zhang, Jin; Ruhlman, Tracey A.; Sabir, Jamal; Blazier, J. Chris; Jansen, Robert K.

    2015-01-01

    Although gene coevolution has been widely observed within individuals and between different organisms, rarely has this phenomenon been investigated within a phylogenetic framework. The Geraniaceae is an attractive system in which to study plastid-nuclear genome coevolution due to the highly elevated evolutionary rates in plastid genomes. In plants, the plastid-encoded RNA polymerase (PEP) is a protein complex composed of subunits encoded by both plastid (rpoA, rpoB, rpoC1, and rpoC2) and nuclear genes (sig1-6). We used transcriptome and genomic data for 27 species of Geraniales in a systematic evaluation of coevolution between genes encoding subunits of the PEP holoenzyme. We detected strong correlations of dN (nonsynonymous substitutions) but not dS (synonymous substitutions) within rpoB/sig1 and rpoC2/sig2, but not for other plastid/nuclear gene pairs, and identified the correlation of dN/dS ratio between rpoB/C1/C2 and sig1/5/6, rpoC1/C2 and sig2, and rpoB/C2 and sig3 genes. Correlated rates between interacting plastid and nuclear sequences across the Geraniales could result from plastid-nuclear genome coevolution. Analyses of coevolved amino acid positions suggest that structurally mediated coevolution is not the major driver of plastid-nuclear coevolution. The detection of strong correlation of evolutionary rates between SIG and RNAP genes suggests a plausible explanation for plastome-genome incompatibility in Geraniaceae. PMID:25724640

  7. G20210A prothrombin gene mutation identified in patients with venous leg ulcers.

    PubMed

    Jebeleanu, G; Procopciuc, L

    2001-01-01

    The G20210A variant of the prothrombin gene is the second most frequent mutation identified in patients with deep venous thrombosis, after factor V Leiden. The risk of developing deep venous thrombosis is high in patients heterozygous for the G20210A mutation. To identify this polymorphism, a 345-bp fragment in the 3'-untranslated region of the prothrombin gene was amplified by polymerase chain reaction and digested with the restriction endonuclease HindIII. The amplification and digestion products were analyzed by agarose gel electrophoresis. We investigated 20 patients with venous leg ulcers and found 2 (10%) to be heterozygous for the G20210A mutation. None of the patients in the control group had the G20210A mutation. Our study confirms the presence of the G20210A mutation in the Romanian population. Our study also shows a link between venous leg ulcers and this polymorphism in the prothrombin gene.

  8. A Novel Yeast Genomics Method for Identifying New Breast Cancer Susceptibility Genes

    DTIC Science & Technology

    2007-05-01

    find new candidate genes for breast cancer susceptibility in women and identifying these human genes can further improve monitoring and treatment...breast cancer susceptibility genes in humans that are currently unknown and not deducible from current methodologies. It is a fundamental...template to faithfully repair the broken strand. In human cancer it is loss of HR, rather than NHEJ, that is more important in increasing cancer

  9. Identifying optimal reference genes for the normalization of microRNA expression in cucumber under viral stress

    PubMed Central

    Liang, Chaoqiong; Hao, Jianjun; Meng, Yan; Luo, Laixin; Li, Jianqiang

    2018-01-01

    Cucumber green mottle mosaic virus (CGMMV) is an economically important pathogen that significantly reduces both the yield and quality of cucumber (Cucumis sativus). Currently, there are no satisfactory strategies for controlling the disease. A better understanding of microRNA (miRNA) expression related to the regulation of plant-virus interactions and virus resistance would be of great assistance when developing control strategies for CGMMV. However, accurate expression analysis is highly dependent on robust and reliable reference genes used as internal controls for normalization of miRNA expression. The most commonly used reference genes in CGMMV-infected cucumber are not uniformly expressed across tissue types and stages of plant development. It is therefore crucial to identify suitable reference genes when investigating the role of miRNA expression. In this study, seven reference genes, including Actin, Tubulin, EF-1α, 18S rRNA, Ubiquitin, GAPDH and Cyclophilin, were evaluated for the most accurate results in analyses using reverse transcription-quantitative polymerase chain reaction (RT-qPCR). Gene expression was assayed on cucumber leaves, stems and roots that were collected at different days post inoculation with CGMMV. The expression data were analyzed using algorithms including delta-Ct, geNorm, NormFinder, and BestKeeper as well as the comparative tool RefFinder. The reference genes were subsequently validated using miR159. The results showed that EF-1α and GAPDH were the most reliable reference genes for normalizing miRNA expression in leaf, root and stem samples, while Ubiquitin and EF-1α were the most suitable combination overall. PMID:29543906
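
    The abstract above lists delta-Ct among the algorithms used to rank reference genes but does not spell it out. The following is a minimal sketch of the comparative delta-Ct idea as it is commonly described: for every pair of candidate genes, take the per-sample Ct difference and its standard deviation, then score each gene by the mean of those standard deviations (lower meaning more stable). The function name, gene labels, and Ct values are hypothetical and invented for illustration; they are not data from the study, and the study's exact procedure may differ.

```python
from itertools import combinations
from statistics import mean, stdev


def delta_ct_stability(ct):
    """Rank candidate reference genes with a comparative delta-Ct score.

    ct maps a gene name to its Ct values, one per sample, with all genes
    measured on the same samples in the same order.
    Returns (gene, score) pairs sorted from most to least stable.
    """
    genes = list(ct)
    pair_sd = {g: [] for g in genes}
    for a, b in combinations(genes, 2):
        # Per-sample Ct difference between the two genes
        diffs = [x - y for x, y in zip(ct[a], ct[b])]
        sd = stdev(diffs)
        pair_sd[a].append(sd)
        pair_sd[b].append(sd)
    # A gene's score is the mean SD over all of its pairings
    scores = {g: mean(sds) for g, sds in pair_sd.items()}
    return sorted(scores.items(), key=lambda kv: kv[1])


# Hypothetical Ct values (illustration only, not measured data)
example_ct = {
    "geneA": [20.0, 21.0, 22.0, 23.0],
    "geneB": [18.0, 19.0, 20.0, 21.0],  # moves in step with geneA
    "geneC": [25.0, 22.0, 27.0, 21.0],  # erratic across samples
}
ranking = delta_ct_stability(example_ct)
```

    In this toy input the erratic gene receives the largest score and is ranked last, which is the behaviour one would want from a stability measure.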

  10. Identifying Mother-Child Interaction Styles Using a Person-Centered Approach

    PubMed Central

    Nelson, Jackie A.; O’Brien, Marion; Grimm, Kevin J.; Leerkes, Esther M.

    2016-01-01

    Parent-child conflict in the context of a supportive relationship has been discussed as a potentially constructive interaction pattern; the current study is the first to test this using a holistic analytic approach. Interaction styles, defined as mother-child conflict in the context of maternal sensitivity, were identified and described with demographic and stress-related characteristics of families. Longitudinal associations were tested between interaction styles and children’s later social competence. Participants included 814 partnered mothers with a first-grade child. Latent profile analysis identified agreeable, dynamic, and disconnected interaction styles. Mothers’ intimacy with a partner, depressive symptoms, and authoritarian childrearing beliefs, along with children’s later conflict with a best friend and externalizing problems, were associated with group membership. Notably, the dynamic style, characterized by high sensitivity and high conflict, included families who experienced psychological and relational stressors. Findings are discussed with regard to how family stressors shape parent-child interaction patterns. PMID:28751818

  11. Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

    PubMed

    Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong

    2018-03-01

    A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.

  12. Combining Random Gene Fission and Rational Gene Fusion To Discover Near-Infrared Fluorescent Protein Fragments That Report on Protein–Protein Interactions

    PubMed Central

    2015-01-01

    Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein–protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein–protein interactions within whole animals. PMID:25265085

  13. Combining random gene fission and rational gene fusion to discover near-infrared fluorescent protein fragments that report on protein-protein interactions.

    PubMed

    Pandey, Naresh; Nobles, Christopher L; Zechiedrich, Lynn; Maresso, Anthony W; Silberg, Jonathan J

    2015-05-15

    Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein-protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein-protein interactions within whole animals.

  14. A 6-gene signature identifies four molecular subgroups of neuroblastoma

    PubMed Central

    2011-01-01

    Background: There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB): Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) were associated with unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results: The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples). Four distinct clusters were identified by Principal Components Analysis (PCA) in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples) using a set of 74 discriminative genes. The expression signature of six NB-associated genes, ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p < 0.05, one-way ANOVA test). PCA clusters p1, p2, and p3 were found to correspond well to the postulated subtypes 1, 2A, and 2B, respectively. Remarkably, a fourth novel cluster was detected in all three independent data sets. This cluster comprised mainly 11q-deleted MNA-negative tumours with low expression of ALK, BIRC5, and PHOX2B, and was significantly associated with higher tumour stage, poor outcome and poor survival compared to the Type 1-corresponding favourable group (INSS stage 4 and/or dead of disease, p < 0.05, Fisher's exact test). Conclusions: Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently being made to further investigate this group's specific characteristics. PMID:21492432

  15. Biomphalaria glabrata transcriptome: cDNA microarray profiling identifies resistant- and susceptible-specific gene expression in haemocytes from snail strains exposed to Schistosoma mansoni

    PubMed Central

    Lockyer, Anne E; Spinks, Jenny; Kane, Richard A; Hoffmann, Karl F; Fitzpatrick, Jennifer M; Rollinson, David; Noble, Leslie R; Jones, Catherine S

    2008-01-01

    Background: Biomphalaria glabrata is an intermediate snail host for Schistosoma mansoni, one of the important schistosomes infecting man. B. glabrata/S. mansoni provides a useful model system for investigating the intimate interactions between host and parasite. Examining differential gene expression between S. mansoni-exposed schistosome-resistant and susceptible snail lines will identify genes and pathways that may be involved in snail defences. Results: We have developed a 2053 element cDNA microarray for B. glabrata containing clones from ORESTES (Open Reading frame ESTs) libraries, suppression subtractive hybridization (SSH) libraries and clones identified in previous expression studies. Snail haemocyte RNA, extracted from parasite-challenged resistant and susceptible snails, 2 to 24 h post-exposure to S. mansoni, was hybridized to the custom made cDNA microarray and 98 differentially expressed genes or gene clusters were identified, 94 resistant-associated and 4 susceptible-associated. Quantitative PCR analysis verified the cDNA microarray results for representative transcripts. Differentially expressed genes were annotated and clustered using gene ontology (GO) terminology and Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathway analysis. 61% of the identified differentially expressed genes have no known function including the 4 susceptible strain-specific transcripts. Resistant strain-specific expression of genes implicated in innate immunity of invertebrates was identified, including hydrolytic enzymes such as cathepsin L, a cysteine proteinase involved in lysis of phagocytosed particles; metabolic enzymes such as ornithine decarboxylase, the rate-limiting enzyme in the production of polyamines, important in inflammation and infection processes, as well as scavenging damaging free radicals produced during production of reactive oxygen species; stress response genes such as HSP70; proteins involved in signalling, such as importin 7 and copine 1

  16. Biomphalaria glabrata transcriptome: cDNA microarray profiling identifies resistant- and susceptible-specific gene expression in haemocytes from snail strains exposed to Schistosoma mansoni.

    PubMed

    Lockyer, Anne E; Spinks, Jenny; Kane, Richard A; Hoffmann, Karl F; Fitzpatrick, Jennifer M; Rollinson, David; Noble, Leslie R; Jones, Catherine S

    2008-12-29

    Biomphalaria glabrata is an intermediate snail host for Schistosoma mansoni, one of the important schistosomes infecting man. B. glabrata/S. mansoni provides a useful model system for investigating the intimate interactions between host and parasite. Examining differential gene expression between S. mansoni-exposed schistosome-resistant and susceptible snail lines will identify genes and pathways that may be involved in snail defences. We have developed a 2053 element cDNA microarray for B. glabrata containing clones from ORESTES (Open Reading frame ESTs) libraries, suppression subtractive hybridization (SSH) libraries and clones identified in previous expression studies. Snail haemocyte RNA, extracted from parasite-challenged resistant and susceptible snails, 2 to 24 h post-exposure to S. mansoni, was hybridized to the custom made cDNA microarray and 98 differentially expressed genes or gene clusters were identified, 94 resistant-associated and 4 susceptible-associated. Quantitative PCR analysis verified the cDNA microarray results for representative transcripts. Differentially expressed genes were annotated and clustered using gene ontology (GO) terminology and Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathway analysis. 61% of the identified differentially expressed genes have no known function including the 4 susceptible strain-specific transcripts. Resistant strain-specific expression of genes implicated in innate immunity of invertebrates was identified, including hydrolytic enzymes such as cathepsin L, a cysteine proteinase involved in lysis of phagocytosed particles; metabolic enzymes such as ornithine decarboxylase, the rate-limiting enzyme in the production of polyamines, important in inflammation and infection processes, as well as scavenging damaging free radicals produced during production of reactive oxygen species; stress response genes such as HSP70; proteins involved in signalling, such as importin 7 and copine 1, cytoplasmic intermediate

  17. Exome sequencing of a large family identifies potential candidate genes contributing risk to bipolar disorder.

    PubMed

    Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P

    2018-03-01

    Bipolar disorder is a mental illness with lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1 (Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance.

  18. Associations and interactions between SNPs in the alcohol metabolizing genes and alcoholism phenotypes in European Americans.

    PubMed

    Sherva, Richard; Rice, John P; Neuman, Rosalind J; Rochberg, Nanette; Saccone, Nancy L; Bierut, Laura J

    2009-05-01

    Alcohol dependence is a major cause of morbidity and mortality worldwide and has a strong familial component. Several linkage and association studies have identified chromosomal regions and/or genes that affect alcohol consumption, notably in genes involved in the 2-stage pathway of alcohol metabolism. Here, we use multiple regression models to test for associations and interactions between 2 alcohol-related phenotypes and SNPs in 17 genes involved in alcohol metabolism in a sample of 1,588 European American subjects. The strongest evidence for association after correcting for multiple testing was between rs1229984, a nonsynonymous coding SNP in ADH1B, and DSM-IV symptom count (p = 0.0003). This SNP was also associated with maximum number of drinks in 24 hours (p = 0.0004). Each minor allele at this SNP predicts 45% fewer DSM-IV symptoms and 18% fewer max drinks. Another SNP in a splice site in ALDH1A1 (rs8187974) showed evidence for association with both phenotypes as well (p = 0.02 and 0.004, respectively), but neither association was significant after accounting for multiple testing. Minor alleles at this SNP predict greater alcohol consumption. In addition, pairwise interactions were observed between SNPs in several genes (p = 0.00002). We replicated the large effect of rs1229984 on alcohol behavior, and although not common (MAF = 4%), this polymorphism may be highly relevant from a public health perspective in European Americans. Another SNP, rs8187974, may also affect alcohol behavior but requires replication. Also, interactions between polymorphisms in genes involved in alcohol metabolism are likely determinants of the parameters that ultimately affect alcohol consumption.

  19. Axonal guidance signaling pathway interacting with smoking in modifying the risk of pancreatic cancer: a gene- and pathway-based interaction analysis of GWAS data.

    PubMed

    Tang, Hongwei; Wei, Peng; Duell, Eric J; Risch, Harvey A; Olson, Sara H; Bueno-de-Mesquita, H Bas; Gallinger, Steven; Holly, Elizabeth A; Petersen, Gloria; Bracci, Paige M; McWilliams, Robert R; Jenab, Mazda; Riboli, Elio; Tjønneland, Anne; Boutron-Ruault, Marie Christine; Kaaks, Rudolph; Trichopoulos, Dimitrios; Panico, Salvatore; Sund, Malin; Peeters, Petra H M; Khaw, Kay-Tee; Amos, Christopher I; Li, Donghui

    2014-05-01

    Cigarette smoking is the best established modifiable risk factor for pancreatic cancer. Genetic factors that underlie smoking-related pancreatic cancer have previously not been examined at the genome-wide level. Taking advantage of the existing genome-wide association study (GWAS) genotype and risk factor data from the Pancreatic Cancer Case Control Consortium, we conducted a discovery study in 2028 cases and 2109 controls to examine gene-smoking interactions at the pathway/gene/single nucleotide polymorphism (SNP) level. Using the likelihood ratio test nested in logistic regression models and ingenuity pathway analysis (IPA), we examined 172 KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways, 3 manually curated gene sets, 3 nicotine dependency gene ontology pathways, 17 912 genes and 468 114 SNPs. None of the individual pathways/genes/SNPs showed significant interaction with smoking after adjusting for multiple comparisons. Six KEGG pathways showed nominal interactions (P < 0.05) with smoking, and the top two are the pancreatic secretion and salivary secretion pathways (major contributing genes: RAB8A, PLCB and CTRB1). Nine genes, i.e. ZBED2, EXO1, PSG2, SLC36A1, CLSTN1, MTHFSD, FAT2, IL10RB and ATXN2, had P interaction < 0.0005. Five intergenic region SNPs and two SNPs of the EVC and KCNIP4 genes had P interaction < 0.00003. In IPA analysis of genes with nominal interactions with smoking, axonal guidance signaling ($P = 2.12 \times 10^{-7}$) and α-adrenergic signaling ($P = 2.52 \times 10^{-5}$) genes were significantly overrepresented canonical pathways. Genes contributing to the axon guidance signaling pathway included the SLIT/ROBO signaling genes that were frequently altered in pancreatic cancer. These observations need to be confirmed in additional data sets. Once confirmed, they will open a new avenue to unveiling the etiology of smoking-associated pancreatic cancer.
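
    The abstract above mentions a likelihood ratio test nested in logistic regression models. As a rough sketch of that kind of test, the snippet below compares a model without the interaction term against one that adds it, given the two maximized log-likelihoods. It assumes a single added interaction parameter (one degree of freedom), which may not match the study's actual models; the function name and the numbers in the usage lines are hypothetical.

```python
import math


def lrt_pvalue_df1(loglik_null, loglik_full):
    """Likelihood ratio test for one extra parameter (assumed df = 1).

    loglik_null: maximized log-likelihood of the model without the
                 interaction term.
    loglik_full: maximized log-likelihood of the model with it.
    Returns (test statistic, p-value).
    """
    # The statistic is twice the gain in log-likelihood; clamp tiny
    # negative values that can arise from numerical noise.
    stat = max(2.0 * (loglik_full - loglik_null), 0.0)
    # For a chi-square variable with 1 df, P(X > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


# Hypothetical log-likelihoods, chosen so the statistic sits at the
# conventional 5% critical value for 1 df (about 3.84).
stat, p_value = lrt_pvalue_df1(-100.0, -100.0 + 3.841458820694124 / 2.0)
```

    With hundreds of thousands of SNPs tested, a nominal p-value from such a test would still need the multiple-comparison adjustment the abstract describes.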

  20. Information-Theoretic Metrics for Visualizing Gene-Environment Interactions

    PubMed Central

    Chanda, Pritam; Zhang, Aidong; Brazeau, Daniel; Sucheston, Lara; Freudenheim, Jo L.; Ambrosone, Christine; Ramanathan, Murali

    2007-01-01

    The purpose of our work was to develop heuristics for visualizing and interpreting gene-environment interactions (GEIs) and to assess the dependence of candidate visualization metrics on biological and study-design factors. Two information-theoretic metrics, the k-way interaction information (KWII) and the total correlation information (TCI), were investigated. The effectiveness of the KWII and TCI to detect GEIs in a diverse range of simulated data sets and a Crohn disease data set was assessed. The sensitivity of the KWII and TCI spectra to biological and study-design variables was determined. Head-to-head comparisons with the relevance-chain, multifactor dimensionality reduction, and the pedigree disequilibrium test (PDT) methods were obtained. The KWII and TCI spectra, which are graphical summaries of the KWII and TCI for each subset of environmental and genotype variables, were found to detect each known GEI in the simulated data sets. The patterns in the KWII and TCI spectra were informative for factors such as case-control misassignment, locus heterogeneity, allele frequencies, and linkage disequilibrium. The KWII and TCI spectra were found to have excellent sensitivity for identifying the key disease-associated genetic variations in the Crohn disease data set. In head-to-head comparisons with the relevance-chain, multifactor dimensionality reduction, and PDT methods, the results from visual interpretation of the KWII and TCI spectra performed satisfactorily. The KWII and TCI are promising metrics for visualizing GEIs. They are capable of detecting interactions among numerous single-nucleotide polymorphisms and environmental variables for a diverse range of GEI models. PMID:17924337
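
    The two metrics named in the abstract above can be illustrated on a toy data set. The sketch below uses the usual entropy-based definitions: total correlation as the sum of the single-variable entropies minus the joint entropy, and the k-way interaction information as the negated alternating sum of entropies over all non-empty subsets of the variables. These definitions are stated here as assumptions, since the abstract does not give formulas; the function names and the XOR data are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import log2


def entropy(rows, cols):
    """Shannon entropy in bits of the joint distribution of the columns."""
    counts = Counter(tuple(r[c] for c in cols) for r in rows)
    n = len(rows)
    return -sum((k / n) * log2(k / n) for k in counts.values())


def tci(rows, cols):
    """Total correlation: sum of marginal entropies minus joint entropy."""
    return sum(entropy(rows, (c,)) for c in cols) - entropy(rows, cols)


def kwii(rows, cols):
    """k-way interaction information over the given columns.

    Negated alternating sum of entropies over all non-empty subsets;
    for two variables this reduces to their mutual information.
    """
    k = len(cols)
    total = 0.0
    for size in range(1, k + 1):
        sign = (-1) ** (k - size)
        for subset in combinations(cols, size):
            total += sign * entropy(rows, subset)
    return -total


# Toy data: two independent binary factors (think of one genotype and one
# exposure variable) and an outcome equal to their exclusive OR.
rows = [(a, b, a ^ b) for a in (0, 1) for b in (0, 1)]
pairwise = kwii(rows, (0, 1))        # the two factors alone share nothing
three_way = kwii(rows, (0, 1, 2))    # the outcome depends on both jointly
total_corr = tci(rows, (0, 1, 2))
```

    In this toy case the pairwise value is zero while the three-way value is positive, which is the pattern expected for a pure interaction with no marginal effects.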

  1. Avirulence Genes in Cereal Powdery Mildews: The Gene-for-Gene Hypothesis 2.0.

    PubMed

    Bourras, Salim; McNally, Kaitlin E; Müller, Marion C; Wicker, Thomas; Keller, Beat

    2016-01-01

    The gene-for-gene hypothesis states that for each gene controlling resistance in the host, there is a corresponding, specific gene controlling avirulence in the pathogen. Allelic series of the cereal mildew resistance genes Pm3 and Mla provide an excellent system for genetic and molecular analysis of resistance specificity. Despite this opportunity for molecular research, avirulence genes in mildews remain underexplored. Earlier work in barley powdery mildew (B.g. hordei) has shown that the reaction to some Mla resistance alleles is controlled by multiple genes. Similarly, several genes are involved in the specific interaction of wheat mildew (B.g. tritici) with the Pm3 allelic series. We found that two mildew genes control avirulence on Pm3f: one gene is involved in recognition by the resistance protein as demonstrated by functional studies in wheat and the heterologous host Nicotiana benthamiana. A second gene is a suppressor, and resistance is only observed in mildew genotypes combining the inactive suppressor and the recognized Avr. We propose that such suppressor/avirulence gene combinations provide the basis of specificity in mildews. Depending on the particular gene combinations in a mildew race, different genes will be genetically identified as the "avirulence" gene. Additionally, the observation of two LINE retrotransposon-encoded avirulence genes in B.g. hordei further suggests that the control of avirulence in mildew is more complex than a canonical gene-for-gene interaction. To fully understand the mildew-cereal interactions, more knowledge on avirulence determinants is needed and we propose ways how this can be achieved based on recent advances in the field.

  2. Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

    PubMed

    Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

    2017-11-13

    The therapeutic management of obesity is challenging; hence, further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, 32 differentially expressed genes (DEGs) showed a trend toward up-regulation in twins with higher BMI compared with their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within the GO database, and the NF-kappa B signaling pathway within the KEGG database. The DEGs NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the 20 distinct co-expression modules identified, the coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within the GO database for this module. Alcoholism and cell adhesion molecule pathways were also significantly enriched within the KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2, were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly

  3. Analysis of global gene expression profiles to identify differentially expressed genes critical for embryo development in Brassica rapa.

    PubMed

    Zhang, Yu; Peng, Lifang; Wu, Ya; Shen, Yanyue; Wu, Xiaoming; Wang, Jianbo

    2014-11-01

    Embryo development represents a crucial developmental period in the life cycle of flowering plants. To gain insights into the genetic programs that control embryo development in Brassica rapa L., RNA sequencing technology was used to perform transcriptome profiling analysis of B. rapa developing embryos. The results generated 42,906,229 sequence reads aligned with 32,941 genes. In total, 27,760, 28,871, 28,384, and 25,653 genes were identified from embryos at globular, heart, early cotyledon, and mature developmental stages, respectively, and analysis between stages revealed a subset of stage-specific genes. We next investigated 9,884 differentially expressed genes with more than fivefold changes in expression and false discovery rate ≤ 0.001 from three adjacent-stage comparisons; 1,514, 3,831, and 6,633 genes were detected between globular and heart stage embryo libraries, heart stage and early cotyledon stage, and early cotyledon and mature stage, respectively. Large numbers of genes related to cellular process, metabolism process, response to stimulus, and biological process were expressed during the early and middle stages of embryo development. Fatty acid biosynthesis, biosynthesis of secondary metabolites, and photosynthesis-related genes were expressed predominantly in embryos at the middle stage. Genes for lipid metabolism and storage proteins were highly expressed in the middle and late stages of embryo development. We also identified 911 transcription factor genes that show differential expression across embryo developmental stages. These results increase our understanding of the complex molecular and cellular events during embryo development in B. rapa and provide a foundation for future studies on other oilseed crops.
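
    The selection rule stated in this abstract (more than fivefold change, false discovery rate ≤ 0.001) can be sketched as a simple filter. The record type, field names, and example values are hypothetical, and treating a more than fivefold decrease the same as a more than fivefold increase is an assumption; the abstract does not say how direction was handled.

```python
from dataclasses import dataclass


@dataclass
class GeneResult:
    gene: str           # hypothetical gene identifier
    fold_change: float  # expression ratio between two adjacent stages
    fdr: float          # false discovery rate for the comparison


def is_differential(result, min_fold=5.0, max_fdr=0.001):
    """True if the gene changes more than `min_fold` in either direction
    (assumption) and its FDR is at or below `max_fdr`."""
    changed = (result.fold_change > min_fold
               or result.fold_change < 1.0 / min_fold)
    return changed and result.fdr <= max_fdr


def select_degs(results, min_fold=5.0, max_fdr=0.001):
    """Return the identifiers of genes passing both thresholds."""
    return [r.gene for r in results if is_differential(r, min_fold, max_fdr)]


# Hypothetical example values, not taken from the study.
example = [
    GeneResult("gene1", 8.0, 0.0001),   # strong increase, passes
    GeneResult("gene2", 0.1, 0.0005),   # tenfold decrease, passes
    GeneResult("gene3", 6.0, 0.01),     # FDR too high
    GeneResult("gene4", 2.0, 0.0001),   # change too small
]
selected = select_degs(example)
```

    Applying the same filter to each adjacent-stage comparison separately would give one gene list per comparison, which is how the three per-comparison counts in the abstract are reported.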

  4. A global interaction network maps a wiring diagram of cellular function

    PubMed Central

    Costanzo, Michael; VanderSluis, Benjamin; Koch, Elizabeth N.; Baryshnikova, Anastasia; Pons, Carles; Tan, Guihong; Wang, Wen; Usaj, Matej; Hanchard, Julia; Lee, Susan D.; Pelechano, Vicent; Styles, Erin B.; Billmann, Maximilian; van Leeuwen, Jolanda; van Dyk, Nydia; Lin, Zhen-Yuan; Kuzmin, Elena; Nelson, Justin; Piotrowski, Jeff S.; Srikumar, Tharan; Bahr, Sondra; Chen, Yiqun; Deshpande, Raamesh; Kurat, Christoph F.; Li, Sheena C.; Li, Zhijian; Usaj, Mojca Mattiazzi; Okada, Hiroki; Pascoe, Natasha; Luis, Bryan-Joseph San; Sharifpoor, Sara; Shuteriqi, Emira; Simpkins, Scott W.; Snider, Jamie; Suresh, Harsha Garadi; Tan, Yizhao; Zhu, Hongwei; Malod-Dognin, Noel; Janjic, Vuk; Przulj, Natasa; Troyanskaya, Olga G.; Stagljar, Igor; Xia, Tian; Ohya, Yoshikazu; Gingras, Anne-Claude; Raught, Brian; Boutros, Michael; Steinmetz, Lars M.; Moore, Claire L.; Rosebrock, Adam P.; Caudy, Amy A.; Myers, Chad L.; Andrews, Brenda; Boone, Charles

    2017-01-01

    We generated a global genetic interaction network for Saccharomyces cerevisiae, constructing over 23 million double mutants, identifying ~550,000 negative and ~350,000 positive genetic interactions. This comprehensive network maps genetic interactions for essential gene pairs, highlighting essential genes as densely connected hubs. Genetic interaction profiles enabled assembly of a hierarchical model of cell function, including modules corresponding to protein complexes and pathways, biological processes, and cellular compartments. Negative interactions connected functionally related genes, mapped core bioprocesses, and identified pleiotropic genes, whereas positive interactions often mapped general regulatory connections among gene pairs, rather than shared functionality. The global network illustrates how coherent sets of genetic interactions connect protein complex and pathway modules to map a functional wiring diagram of the cell. PMID:27708008

  5. Identification of genes differentially expressed during interaction of Mexican lime tree infected with "Candidatus Phytoplasma aurantifolia"

    PubMed Central

    2011-01-01

    Background: "Candidatus Phytoplasma aurantifolia" is the causative agent of witches' broom disease in Mexican lime trees (Citrus aurantifolia L.), and is responsible for major losses of Mexican lime trees in Southern Iran and Oman. The pathogen is strictly biotrophic, and thus is completely dependent on living host cells for its survival. The molecular basis of compatibility and disease development in this system is poorly understood. Therefore, we have applied a cDNA-amplified fragment length polymorphism (AFLP) approach to analyze gene expression in Mexican lime trees infected by "Ca. Phytoplasma aurantifolia". Results: We carried out cDNA-AFLP analysis on grafted infected Mexican lime trees of the susceptible cultivar at the representative symptoms stage. Selective amplifications with 43 primer combinations allowed the visualisation of 55 transcript-derived fragments that were expressed differentially between infected and non-infected leaves. We sequenced 51 fragments, 36 of which were identified as lime tree transcripts after homology searching. Of the 36 genes, 70.5% were down-regulated during infection and could be classified into various functional groups. We showed that Mexican lime tree genes that were homologous to known resistance genes tended to be repressed in response to infection. These included the genes for modifier of snc1 and autophagy protein 5. Furthermore, down-regulation of genes involved in metabolism, transcription, transport and cytoskeleton was observed, which included the genes for formin, importin β 3, transducin, L-asparaginase, glycerophosphoryl diester phosphodiesterase, and RNA polymerase β. In contrast, genes that encoded a proline-rich protein, ubiquitin-protein ligase, phosphatidyl glycerol specific phospholipase C-like, and serine/threonine-protein kinase were up-regulated during the infection. Conclusion: The present study identifies a number of candidate genes that might be involved in the interaction of Mexican lime trees

  6. Evolutionary Inference across Eukaryotes Identifies Specific Pressures Favoring Mitochondrial Gene Retention.

    PubMed

    Johnston, Iain G; Williams, Ben P

    2016-02-24

    Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modeling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondrial genomes, we inferred evolutionary trajectories of mtDNA gene loss across the eukaryotic tree of life. We find that proteins comprising the structural cores of the electron transport chain are preferentially encoded within mitochondrial genomes across eukaryotes. A combination of high GC content and high protein hydrophobicity is required to explain patterns of mtDNA gene retention; a model that accounts for these selective pressures can also predict the success of artificial gene transfer experiments in vivo. This work provides a general method for data-driven inference of the ordering of evolutionary and progressive events, here identifying the distinct features shaping mitochondrial genomes of present-day species. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Comprehensive Analysis of Gene Expression Profiles of Sepsis-Induced Multiorgan Failure Identified Its Valuable Biomarkers.

    PubMed

    Wang, Yumei; Yin, Xiaoling; Yang, Fang

    2018-02-01

    Sepsis is an inflammation-related disease, and severe sepsis can induce multiorgan dysfunction, which is the most common cause of death of patients in noncoronary intensive care units. Progress in novel therapeutic strategies has had little impact on the mortality of severe sepsis, and unfortunately, its mechanisms still remain poorly understood. In this study, we analyzed gene expression profiles of severe sepsis with failure of lung, kidney, and liver for the identification of potential biomarkers. We first downloaded the gene expression profiles from the Gene Expression Omnibus and performed preprocessing of raw microarray data sets and identification of differentially expressed genes (DEGs) through the R programming software; then, significantly enriched functions of DEGs in lung, kidney, and liver failure sepsis samples were obtained from the Database for Annotation, Visualization, and Integrated Discovery; finally, a protein-protein interaction network was constructed for DEGs based on the STRING database, and network modules were also obtained through the MCODE cluster method. As a result, lung failure sepsis had the highest number of DEGs (859), whereas kidney and liver failure sepsis samples had 178 and 175 DEGs, respectively. In addition, 17 overlaps were obtained among the three lists of DEGs. Biological processes related to immune and inflammatory response were found to be significantly enriched in DEGs. Network and module analysis identified four gene clusters in which all or most of the genes were upregulated. The expression changes of Icam1 and Socs3 were further validated through quantitative PCR analysis. This study should shed light on the development of sepsis and provide potential therapeutic targets for sepsis-induced multiorgan failure.

  8. Analyses of interactions among pair-rule genes and the gap gene Krüppel in Bombyx segmentation.

    PubMed

    Nakao, Hajime

    2015-09-01

    In the short-germ insect Tribolium, a pair-rule gene circuit consisting of the Tribolium homologs of even-skipped, runt, and odd-skipped (Tc-eve, Tc-run and Tc-odd, respectively) has been implicated in segment formation. To examine the application of the model to other taxa, I studied the expression and function of pair-rule genes in Bombyx mori, together with a Bombyx homolog of Krüppel (Bm-Kr), a known gap gene. Knockdown embryos of Bombyx homologs of eve, run and odd (Bm-eve, Bm-run and Bm-odd) exhibited asegmental phenotypes similar to those of Tribolium knockdowns. However, pair-rule gene interactions were similar to those of both Tribolium and Drosophila, which, unlike Tribolium, shows a hierarchical segmentation mode. Additionally, the Bm-odd expression pattern shares characteristics with those of Drosophila pair-rule genes that receive upstream regulatory input. On the other hand, Bm-Kr knockdowns exhibited a large posterior segment deletion as observed in short-germ insects. However, a detailed analysis of these embryos indicated that Bm-Kr modulates expression of pair-rule genes as in Drosophila, although the mechanisms appear to be different. This suggested hierarchical interactions between Bm-Kr and pair-rule genes. Based on these results, I concluded that the pair-rule gene circuit model that describes Tribolium development is not applicable to Bombyx. Copyright © 2015 Elsevier Inc. All rights reserved.

  9. Whole genome co-expression analysis of soybean cytochrome P450 genes identifies nodulation-specific P450 monooxygenases

    PubMed Central

    2010-01-01

    Background: Cytochrome P450 monooxygenases (P450s) catalyze oxidation of various substrates using oxygen and NAD(P)H. Plant P450s are involved in the biosynthesis of primary and secondary metabolites performing diverse biological functions. The recent availability of the soybean genome sequence allows us to identify and analyze soybean putative P450s at a genome scale. Co-expression analysis using an available soybean microarray and Illumina sequencing data provides clues for functional annotation of these enzymes. This approach is based on the assumption that genes that have similar expression patterns across a set of conditions may have a functional relationship. Results: We have identified a total of 332 full-length P450 genes and 378 pseudogenes from the soybean genome. From the full-length sequences, 195 genes belong to A-type, which could be further divided into 20 families. The remaining 137 genes belong to non-A type P450s and are classified into 28 families. A total of 178 probe sets were found to correspond to P450 genes on the Affymetrix soybean array. Out of these probe sets, 108 represented single genes. Using the 28 publicly available microarray libraries that contain organ-specific information, some tissue-specific P450s were identified. Similarly, stress responsive soybean P450s were retrieved from 99 microarray soybean libraries. We also utilized Illumina transcriptome sequencing technology to analyze the expressions of all 332 soybean P450 genes. This dataset contains total RNAs isolated from nodules, roots, root tips, leaves, flowers, green pods, apical meristem, mock-inoculated and Bradyrhizobium japonicum-infected root hair cells. The tissue-specific expression patterns of these P450 genes were analyzed and the expression of a representative set of genes was confirmed by qRT-PCR. We performed the co-expression analysis on many of the 108 P450 genes on the Affymetrix arrays. First we confirmed that CYP93C5 (an isoflavone synthase gene) is

  10. Protein-protein interaction network of gene expression in the hydrocortisone-treated keloid.

    PubMed

    Chen, Rui; Zhang, Zhiliang; Xue, Zhujia; Wang, Lin; Fu, Mingang; Lu, Yi; Bai, Ling; Zhang, Ping; Fan, Zhihong

    2015-01-01

    In order to explore the molecular mechanism of hydrocortisone in keloid tissue, the gene expression profiles of keloid samples treated with hydrocortisone were subjected to bioinformatics analysis. Firstly, the gene expression profiles (GSE7890) of five samples of keloid treated with hydrocortisone and five untreated keloid samples were downloaded from the Gene Expression Omnibus (GEO) database. Secondly, data were preprocessed using packages in R language and differentially expressed genes (DEGs) were screened using a significance analysis of microarrays (SAM) protocol. Thirdly, the DEGs were subjected to gene ontology (GO) function and KEGG pathway enrichment analysis. Finally, the interactions of DEGs in samples of keloid treated with hydrocortisone were explored in a human protein-protein interaction (PPI) network, and sub-modules of the DEGs interaction network were analyzed using Cytoscape software. Based on the analysis, 572 DEGs in the hydrocortisone-treated samples were screened; most of these were involved in the signal transduction and cell cycle. Furthermore, three critical genes in the module, including COL1A1, NID1, and PRELP, were screened in the PPI network analysis. These findings enhance understanding of the pathogenesis of the keloid and provide references for keloid therapy. © 2015 The International Society of Dermatology.

  11. Gene-environment interaction: Does fluoride influence the reproductive hormones in male farmers modified by ERα gene polymorphisms?

    PubMed

    Ma, Qiang; Huang, Hui; Sun, Long; Zhou, Tong; Zhu, Jingyuan; Cheng, Xuemin; Duan, Lijv; Li, Zhiyuan; Cui, Liuxin; Ba, Yue

    2017-12-01

    The occurrence of endemic fluorosis is derived from high fluoride levels in drinking water and industrial fumes or dust. Reproductive disruption is also a major harm caused by fluoride exposure besides dental and skeletal lesions. However, few studies focus on the mechanism of fluoride exposure on male reproductive function, especially the possible interaction of fluoride exposure and gene polymorphism on male reproductive hormones. Therefore, we conducted a cross-sectional study in rural areas of Henan province in China to explore the interaction between the estrogen receptor alpha (ERα) gene and fluoride exposure on reproductive hormone levels in male farmers living in the endemic fluorosis villages. The results showed that fluoride exposure significantly increased the serum level of estradiol in the hypothalamic-pituitary-testicular (HPT) axis in male farmers. Moreover, the observations indicated that fluoride exposure and genetic markers had an interaction on serum concentration of follicle-stimulating hormone and estradiol, and the interaction among different loci of the ERα gene could impact the serum testosterone level. Findings in the present work suggest that chronic fluoride exposure in drinking water could modulate the levels of reproductive hormones in males living in endemic fluorosis areas, and the interaction between fluoride exposure and ERα polymorphisms might affect the serum levels of hormones in the HPT axis in male farmers. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Gene-gene interactions of CYP2A6 and MAOA polymorphisms on smoking behavior in Chinese male population.

    PubMed

    Tang, Xun; Guo, Song; Sun, Hongqiang; Song, Xuemei; Jiang, Zuonin; Sheng, Lixiang; Zhou, Dongfeng; Hu, Yonghua; Chen, Dafang

    2009-05-01

    Nicotine is the major psychoactive ingredient in tobacco, and is responsible for dependence through the nicotine-stimulated reward pathway mediated by the central dopaminergic system. Consequently, genetic polymorphisms in both nicotine metabolism and dopamine catabolism genes may influence smoking behavior, and interact with each other resulting in risk modulation. In this study, we investigated the association and multilocus gene-gene interactions of cytochrome P450 2A6 (CYP2A6), dopamine beta-hydroxylase (DBH), catechol O-methyl transferase (COMT), and monoamine oxidase A (MAOA) polymorphisms with smoking behavior in a community-based Chinese male population. The polymorphisms were genotyped in 203 current smokers, 66 former smokers, and 102 never smokers. Multivariate logistic regression models and the multifactor dimensionality reduction method were used to analyze the association and multilocus gene-gene interactions. Statistically significant trends were shown for increased risk of smoking initiation in participants with CYP2A6*1B/CYP2A6*1B genotypes compared with those with CYP2A6*1A/CYP2A6*1A genotypes [odds ratio (OR)=3.5, 95% confidence interval (CI)=1.5-8.1], and participants with CYP2A6*1/CYP2A6*1 genotypes were at higher risk of smoking initiation (OR=2.4, 95% CI=1.2-4.5) and smoking persistence (OR=4.0, 95% CI=1.5-10.3) than those who have CYP2A6*4C genotypes. Moreover, the best model, characterized by the multifactor dimensionality reduction method, involved a gene-gene interaction between MAOA and CYP2A6 (64.11% accuracy, P<0.001) and indicated that carriers of the combined 1460 T/O genotype for MAOA EcoRV and CYP2A6*1/CYP2A6*1 genotypes were at higher risk of smoking (OR=15.4, 95% CI=4.5-52.5). These findings suggested a substantial influence of CYP2A6 polymorphism, as well as its interaction with MAOA, on risk modulation of smoking behavior in a Chinese male population.
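
    As a reading aid for the odds ratios and confidence intervals quoted in this abstract, the following sketch computes an unadjusted odds ratio with a Wald (Woolf) 95% interval from a 2x2 table. This is only an illustration: the study itself used multivariate logistic regression, so its estimates are adjusted, and the counts below are hypothetical rather than taken from the paper.

```python
from math import exp, log, sqrt


def odds_ratio_ci(exposed_cases, exposed_controls,
                  unexposed_cases, unexposed_controls, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    OR = (a * d) / (b * c); the interval is exp(ln OR +/- z * SE), where
    SE = sqrt(1/a + 1/b + 1/c + 1/d). All four counts must be positive.
    """
    a, b = exposed_cases, exposed_controls
    c, d = unexposed_cases, unexposed_controls
    if min(a, b, c, d) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    se = sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = exp(log(odds_ratio) - z * se)
    upper = exp(log(odds_ratio) + z * se)
    return odds_ratio, lower, upper


# Hypothetical counts: genotype carriers vs. non-carriers among
# smokers (cases) and never smokers (controls).
or_value, ci_low, ci_high = odds_ratio_ci(20, 10, 10, 20)
```

    An interval that excludes 1, as all the intervals reported in the abstract do, corresponds to a statistically significant association at the 5% level under this approximation.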

  13. The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

    PubMed

    Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

    2013-10-01

    The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
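
    The two-step strategy described in this abstract amounts to intersecting gene sets. A minimal sketch follows, with hypothetical function and gene identifiers; how each input list is actually derived (expression thresholds, mutant fitness cut-offs) is not given in the abstract and is left out here.

```python
def candidate_virulence_genes(unknown_function, differentially_expressed,
                              required_for_infection):
    """Genes of unknown function (FUN) that are both differentially expressed
    during infection and required for virulence in a mutagenesis screen."""
    return sorted(set(unknown_function)
                  & set(differentially_expressed)
                  & set(required_for_infection))


# Hypothetical identifiers for illustration only.
fun_genes = {"FUN_0001", "FUN_0002", "FUN_0003", "FUN_0004"}
transcriptomics_hits = {"FUN_0002", "FUN_0003", "known_0010"}
mutagenesis_hits = {"FUN_0003", "FUN_0004", "known_0010"}

candidates = candidate_virulence_genes(
    fun_genes, transcriptomics_hits, mutagenesis_hits)
```

    Only genes supported by both experiments survive, which is why the intersection is expected to be a small set.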

  14. Transposon mutagenesis identifies genes that cooperate with mutant Pten in breast cancer progression

    PubMed Central

    Rangel, Roberto; Lee, Song-Choon; Hon-Kim Ban, Kenneth; Guzman-Rojas, Liliana; Mann, Michael B.; Newberg, Justin Y.; McNoe, Leslie A.; Selvanesan, Luxmanan; Ward, Jerrold M.; Rust, Alistair G.; Chin, Kuan-Yew; Black, Michael A.; Jenkins, Nancy A.; Copeland, Neal G.

    2016-01-01

    Triple-negative breast cancer (TNBC) has the worst prognosis of any breast cancer subtype. To better understand the genetic forces driving TNBC, we performed a transposon mutagenesis screen in phosphatase and tensin homolog (Pten) mutant mice and identified 12 candidate trunk drivers and a much larger number of progression genes. Validation studies identified eight TNBC tumor suppressor genes, including the GATA-like transcriptional repressor TRPS1. Down-regulation of TRPS1 in TNBC cells promoted epithelial-to-mesenchymal transition (EMT) by deregulating multiple EMT pathway genes, in addition to increasing the expression of SERPINE1 and SERPINB2 and the subsequent migration, invasion, and metastasis of tumor cells. Transposon mutagenesis has thus provided a better understanding of the genetic forces driving TNBC and discovered genes with potential clinical importance in TNBC. PMID:27849608

  15. Two-Way Gene Interaction From Microarray Data Based on Correlation Methods.

    PubMed

    Alavi Majd, Hamid; Talebi, Atefeh; Gilany, Kambiz; Khayyer, Nasibeh

    2016-06-01

    Gene networks have generated a massive explosion in the development of high-throughput techniques for monitoring various aspects of gene activity. Networks offer a natural way to model interactions between genes, and extracting gene network information from high-throughput genomic data is an important and difficult task. The purpose of this study is to construct a two-way gene network based on parametric and nonparametric correlation coefficients. The first step in constructing a Gene Co-expression Network is to score all pairs of gene vectors. The second step is to select a score threshold and connect all gene pairs whose scores exceed this value. In the foundation-application study, we constructed two-way gene networks using nonparametric methods, such as Spearman's rank correlation coefficient and Blomqvist's measure, and compared them with Pearson's correlation coefficient. We surveyed six genes of venous thrombosis disease, made a matrix entry representing the score for the corresponding gene pair, and obtained two-way interactions using Pearson's correlation, Spearman's rank correlation, and Blomqvist's coefficient. Finally, these methods were compared with Cytoscape, based on BIND, and Gene Ontology, based on molecular function visual methods; R software version 3.2 and Bioconductor were used to perform these methods. Based on the Pearson and Spearman correlations, the results were the same and were confirmed by Cytoscape and GO visual methods; however, Blomqvist's coefficient was not confirmed by visual methods. Some results of the correlation coefficients did not agree with the visualization, possibly because of the small amount of data.
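
    The two construction steps named in this abstract (score all gene pairs, then connect pairs whose score exceeds a threshold) can be sketched as follows. The study used R and Bioconductor; this Python version is only an illustration. The function names and expression values are hypothetical, thresholding on the absolute value of the correlation is an assumption, and Blomqvist's coefficient is omitted.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def coexpression_edges(expression, score, threshold):
    """Step 1: score every gene pair. Step 2: keep pairs whose absolute
    score exceeds the threshold (absolute value is an assumption)."""
    edges = []
    for g1, g2 in combinations(sorted(expression), 2):
        s = score(expression[g1], expression[g2])
        if abs(s) > threshold:
            edges.append((g1, g2, s))
    return edges


# Hypothetical expression vectors (one value per sample).
expression = {
    "geneA": [1, 2, 3, 4, 5],
    "geneB": [2, 4, 6, 8, 10],
    "geneC": [5, 1, 4, 2, 3],
}
pearson_edges = coexpression_edges(expression, pearson, 0.8)
spearman_edges = coexpression_edges(expression, spearman, 0.8)
```

    With only a handful of samples per gene, as in the six-gene study described above, correlation estimates are unstable, so the choice of threshold has a large effect on which edges appear.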

  16. Using reporter gene assays to identify cis regulatory differences between humans and chimpanzees.

    PubMed

    Chabot, Adrien; Shrit, Ralla A; Blekhman, Ran; Gilad, Yoav

    2007-08-01

    Most phenotypic differences between human and chimpanzee are likely to result from differences in gene regulation, rather than changes to protein-coding regions. To date, however, only a handful of human-chimpanzee nucleotide differences leading to changes in gene regulation have been identified. To home in on differences in regulatory elements between human and chimpanzee, we focused on 10 genes that were previously found to be differentially expressed between the two species. We then designed reporter gene assays for the putative human and chimpanzee promoters of the 10 genes. Of seven promoters that we found to be active in human liver cell lines, human and chimpanzee promoters had significantly different activity in four cases, three of which recapitulated the gene expression difference seen in the microarray experiment. For these three genes, we were therefore able to demonstrate that a change in cis influences expression differences between humans and chimpanzees. Moreover, using site-directed mutagenesis on one construct, the promoter for the DDA3 gene, we were able to identify three nucleotides that together lead to a cis regulatory difference between the species. High-throughput application of this approach can provide a map of regulatory element differences between humans and our close evolutionary relatives.

  17. Genome-Wide Association Study Identifies Candidate Genes for Starch Content Regulation in Maize Kernels

    PubMed Central

    Liu, Na; Xue, Yadong; Guo, Zhanyong; Li, Weihua; Tang, Jihua

    2016-01-01

    Kernel starch content is an important trait in maize (Zea mays L.) as it accounts for 65–75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60 to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM) as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001), among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437) is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops. PMID:27512395

  18. Interactions between genetic variation and cellular environment in skeletal muscle gene expression.

    PubMed

    Taylor, D Leland; Knowles, David A; Scott, Laura J; Ramirez, Andrea H; Casale, Francesco Paolo; Wolford, Brooke N; Guan, Li; Varshney, Arushi; Albanus, Ricardo D'Oliveira; Parker, Stephen C J; Narisu, Narisu; Chines, Peter S; Erdos, Michael R; Welch, Ryan P; Kinnunen, Leena; Saramies, Jouko; Sundvall, Jouko; Lakka, Timo A; Laakso, Markku; Tuomilehto, Jaakko; Koistinen, Heikki A; Stegle, Oliver; Boehnke, Michael; Birney, Ewan; Collins, Francis S

    2018-01-01

    From whole organisms to individual cells, responses to environmental conditions are influenced by genetic makeup, where the effect of genetic variation on a trait depends on the environmental context. RNA-sequencing quantifies gene expression as a molecular trait, and is capable of capturing both genetic and environmental effects. In this study, we explore opportunities of using allele-specific expression (ASE) to discover cis-acting genotype-environment interactions (GxE), that is, genetic effects on gene expression that depend on an environmental condition. Treating 17 common, clinical traits as approximations of the cellular environment of 267 skeletal muscle biopsies, we identify 10 candidate environmental response expression quantitative trait loci (reQTLs) across 6 traits (12 unique gene-environment trait pairs; 10% FDR per trait) including sex, systolic blood pressure, and low-density lipoprotein cholesterol. Although using ASE is in principle a promising approach to detect GxE effects, replication of such signals can be challenging as validation requires harmonization of environmental traits across cohorts and a sufficient sampling of heterozygotes for a transcribed SNP. Comprehensive discovery and replication will require large human transcriptome datasets, or the integration of multiple transcribed SNPs, coupled with standardized clinical phenotyping.

  19. Linking Genes to Cardiovascular Diseases: Gene Action and Gene–Environment Interactions

    PubMed Central

    2016-01-01

    A unique myocardial characteristic is its ability to grow/remodel in order to adapt; this is determined partly by genes and partly by the environment and the milieu intérieur. In the “post-genomic” era, a need is emerging to elucidate the physiologic functions of myocardial genes, as well as potential adaptive and maladaptive modulations induced by environmental/epigenetic factors. Genome sequencing and analysis advances have become exponential lately, with escalation of our knowledge concerning sometimes controversial genetic underpinnings of cardiovascular diseases. Current technologies can identify candidate genes variously involved in diverse normal/abnormal morphomechanical phenotypes, and offer insights into multiple genetic factors implicated in complex cardiovascular syndromes. The expression profiles of thousands of genes are regularly ascertained under diverse conditions. Global analyses of gene expression levels are useful for cataloging genes and correlated phenotypes, and for elucidating the role of genes in maladies. Comparative expression of gene networks coupled to complex disorders can contribute insights as to how “modifier genes” influence the expressed phenotypes. Increasingly, a more comprehensive and detailed systematic understanding of genetic abnormalities underlying, for example, various genetic cardiomyopathies is emerging. Implementing genomic findings in cardiology practice may well lead directly to better diagnosing and therapeutics. There is currently evolving a strong appreciation for the value of studying gene anomalies, and doing so in a non-disjointed, cohesive manner. However, it is challenging for many—practitioners and investigators—to comprehend, interpret, and utilize the clinically increasingly accessible and affordable cardiovascular genomics studies. This survey addresses the need for fundamental understanding in this vital area. PMID:26545598

  20. GENE EXPRESSION PROFILING TO IDENTIFY MECHANISMS OF MALE REPRODUCTIVE TOXICITY

    EPA Science Inventory

    David J. Dix
    National Health and Environmental Effects Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27711, USA.
    Ab...

  1. A Knockout Screen of ApiAP2 Genes Reveals Networks of Interacting Transcriptional Regulators Controlling the Plasmodium Life Cycle.

    PubMed

    Modrzynska, Katarzyna; Pfander, Claudia; Chappell, Lia; Yu, Lu; Suarez, Catherine; Dundas, Kirsten; Gomes, Ana Rita; Goulding, David; Rayner, Julian C; Choudhary, Jyoti; Billker, Oliver

    2017-01-11

    A family of apicomplexa-specific proteins containing AP2 DNA-binding domains (ApiAP2s) was identified in malaria parasites. This family includes sequence-specific transcription factors that are key regulators of development. However, functions for the majority of ApiAP2 genes remain unknown. Here, a systematic knockout screen in Plasmodium berghei identified ten ApiAP2 genes that were essential for mosquito transmission: four were critical for the formation of infectious ookinetes, and three were required for sporogony. We describe non-essential functions for AP2-O and AP2-SP proteins in blood stages, and identify AP2-G2 as a repressor active in both asexual and sexual stages. Comparative transcriptomics across mutants and developmental stages revealed clusters of co-regulated genes with shared cis promoter elements, whose expression can be controlled positively or negatively by different ApiAP2 factors. We propose that stage-specific interactions between ApiAP2 proteins on partly overlapping sets of target genes generate the complex transcriptional network that controls the Plasmodium life cycle. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  2. High-Resolution Array CGH Profiling Identifies Na/K Transporting ATPase Interacting 2 (NKAIN2) as a Predisposing Candidate Gene in Neuroblastoma

    PubMed Central

    Romania, Paolo; Castellano, Aurora; Surace, Cecilia; Citti, Arianna; De Ioris, Maria Antonietta; Sirleto, Pietro; De Mariano, Marilena; Longo, Luca; Boldrini, Renata; Angioni, Adriano; Locatelli, Franco; Fruci, Doriana

    2013-01-01

    Neuroblastoma (NB), the most common solid cancer in early childhood, usually occurs sporadically, but familial occurrence is known in 1-2% of NB patients. Germline mutations in the ALK and PHOX2B genes have been found in a subset of familial NBs. However, because some individuals harbouring mutations in these genes do not develop this tumor, additional genetic alterations appear to be required for NB pathogenesis. Herein, we studied an Italian family with three NB patients, two siblings and a first cousin, carrying an ALK germline-activating mutation R1192P, that was inherited from their unaffected mothers and with no mutations in the PHOX2B gene. A comparison between somatic and germline DNA copy number changes in the two affected siblings by a high resolution array-based Comparative Genomic Hybridization (CGH) analysis revealed a germline gain at the NKAIN2 (Na/K transporting ATPase interacting 2) locus in one of the siblings, that was inherited from the parent who does not carry the ALK mutation. Surprisingly, NKAIN2 was expressed at high levels also in the affected sibling that lacks the genomic gain at this locus, clearly suggesting the existence of other regulatory mechanisms. High levels of NKAIN2 were detected in the MYCN-amplified NB cell lines and in the most aggressive NB lesions as well as in the peripheral blood of a large cohort of NB patients. Consistent with a role of NKAIN2 in NB development, NKAIN2 was down-regulated during all-trans retinoic acid differentiation in two NB cell lines. Taken together, these data indicate a potential role of the NKAIN2 gene in NB growth and differentiation. PMID:24205241

  3. High-resolution array CGH profiling identifies Na/K transporting ATPase interacting 2 (NKAIN2) as a predisposing candidate gene in neuroblastoma.

    PubMed

    Romania, Paolo; Castellano, Aurora; Surace, Cecilia; Citti, Arianna; De Ioris, Maria Antonietta; Sirleto, Pietro; De Mariano, Marilena; Longo, Luca; Boldrini, Renata; Angioni, Adriano; Locatelli, Franco; Fruci, Doriana

    2013-01-01

    Neuroblastoma (NB), the most common solid cancer in early childhood, usually occurs sporadically, but familial occurrence is known in 1-2% of NB patients. Germline mutations in the ALK and PHOX2B genes have been found in a subset of familial NBs. However, because some individuals harbouring mutations in these genes do not develop this tumor, additional genetic alterations appear to be required for NB pathogenesis. Herein, we studied an Italian family with three NB patients, two siblings and a first cousin, carrying an ALK germline-activating mutation R1192P, that was inherited from their unaffected mothers and with no mutations in the PHOX2B gene. A comparison between somatic and germline DNA copy number changes in the two affected siblings by a high resolution array-based Comparative Genomic Hybridization (CGH) analysis revealed a germline gain at the NKAIN2 (Na/K transporting ATPase interacting 2) locus in one of the siblings, that was inherited from the parent who does not carry the ALK mutation. Surprisingly, NKAIN2 was expressed at high levels also in the affected sibling that lacks the genomic gain at this locus, clearly suggesting the existence of other regulatory mechanisms. High levels of NKAIN2 were detected in the MYCN-amplified NB cell lines and in the most aggressive NB lesions as well as in the peripheral blood of a large cohort of NB patients. Consistent with a role of NKAIN2 in NB development, NKAIN2 was down-regulated during all-trans retinoic acid differentiation in two NB cell lines. Taken together, these data indicate a potential role of the NKAIN2 gene in NB growth and differentiation.

  4. Genes-environment interactions in obesity- and diabetes-associated pancreatic cancer: a GWAS data analysis.

    PubMed

    Tang, Hongwei; Wei, Peng; Duell, Eric J; Risch, Harvey A; Olson, Sara H; Bueno-de-Mesquita, H Bas; Gallinger, Steven; Holly, Elizabeth A; Petersen, Gloria M; Bracci, Paige M; McWilliams, Robert R; Jenab, Mazda; Riboli, Elio; Tjønneland, Anne; Boutron-Ruault, Marie Christine; Kaaks, Rudolf; Trichopoulos, Dimitrios; Panico, Salvatore; Sund, Malin; Peeters, Petra H M; Khaw, Kay-Tee; Amos, Christopher I; Li, Donghui

    2014-01-01

    Obesity and diabetes are potentially alterable risk factors for pancreatic cancer. Genetic factors that modify the associations of obesity and diabetes with pancreatic cancer have previously not been examined at the genome-wide level. Using genome-wide association studies (GWAS) genotype and risk factor data from the Pancreatic Cancer Case Control Consortium, we conducted a discovery study of 2,028 cases and 2,109 controls to examine gene-obesity and gene-diabetes interactions in relation to pancreatic cancer risk by using the likelihood-ratio test nested in logistic regression models and Ingenuity Pathway Analysis (IPA). After adjusting for multiple comparisons, a significant interaction of the chemokine signaling pathway with obesity (P = 3.29 × 10^-6) and a near significant interaction of the calcium signaling pathway with diabetes (P = 1.57 × 10^-4) in modifying the risk of pancreatic cancer were observed. These findings were supported by results from IPA analysis of the top genes with nominal interactions. The major contributing genes to the two top pathways include GNGT2, RELA, TIAM1, and GNAS. No individual gene or single-nucleotide polymorphism (SNP), except one SNP, remained significant after adjusting for multiple testing. Notably, SNP rs10818684 of the PTGS1 gene showed an interaction with diabetes (P = 7.91 × 10^-7) at a false discovery rate of 6%. Genetic variations in inflammatory response and insulin resistance may affect the risk of obesity- and diabetes-related pancreatic cancer. These observations should be replicated in additional large datasets. A gene-environment interaction analysis may provide new insights into the genetic susceptibility and molecular mechanisms of obesity- and diabetes-related pancreatic cancer.
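
    The likelihood-ratio test mentioned in this abstract can be illustrated with a toy case. The sketch below is not the consortium's analysis: it assumes a single binary genotype and a single binary exposure, ignores covariates, and uses made-up counts. It fits two nested logistic models (with and without the gene-by-exposure product term) by Newton-Raphson and compares their maximized log-likelihoods against a chi-square distribution with one degree of freedom. All function names are hypothetical.

```python
import math


def solve(a, b):
    """Solve the small linear system a x = b by Gaussian elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def max_loglik(cells, with_interaction):
    """Fit a logistic model by Newton-Raphson and return its maximized log-likelihood.

    cells: list of (genotype, exposure, n_cases, n_controls), genotype/exposure in {0, 1}.
    """
    def row(g, e):
        x = [1.0, float(g), float(e)]
        return x + [float(g * e)] if with_interaction else x

    def prob(beta, x):
        return 1.0 / (1.0 + math.exp(-sum(bi * xi for bi, xi in zip(beta, x))))

    k = 4 if with_interaction else 3
    beta = [0.0] * k
    for _ in range(100):
        grad = [0.0] * k                       # score vector
        info = [[0.0] * k for _ in range(k)]   # observed information matrix
        for g, e, cases, controls in cells:
            x = row(g, e)
            n = cases + controls
            p = prob(beta, x)
            for i in range(k):
                grad[i] += (cases - n * p) * x[i]
                for j in range(k):
                    info[i][j] += n * p * (1.0 - p) * x[i] * x[j]
        step = solve(info, grad)
        beta = [bi + si for bi, si in zip(beta, step)]
        if max(abs(si) for si in step) < 1e-10:
            break
    ll = 0.0
    for g, e, cases, controls in cells:
        p = prob(beta, row(g, e))
        ll += cases * math.log(p) + controls * math.log(1.0 - p)
    return ll


def interaction_lrt(cells):
    """Return (LRT statistic, p-value) for the gene-by-exposure product term."""
    stat = 2.0 * (max_loglik(cells, True) - max_loglik(cells, False))
    stat = max(stat, 0.0)  # guard against tiny negative rounding error
    # Survival function of a chi-square variable with 1 degree of freedom.
    return stat, math.erfc(math.sqrt(stat / 2.0))


# Made-up counts: odds multiply exactly in the first table, not in the second.
no_interaction = [(0, 0, 100, 100), (1, 0, 200, 100), (0, 1, 300, 100), (1, 1, 600, 100)]
with_interaction = [(0, 0, 100, 100), (1, 0, 200, 100), (0, 1, 300, 100), (1, 1, 1200, 100)]
stat_null, p_null = interaction_lrt(no_interaction)
stat_alt, p_alt = interaction_lrt(with_interaction)
```

    In a genome-wide setting this test would be repeated per SNP or per pathway, which is why the abstract reports results after adjustment for multiple comparisons.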

  5. Coordinated rates of evolution between interacting plastid and nuclear genes in Geraniaceae.

    PubMed

    Zhang, Jin; Ruhlman, Tracey A; Sabir, Jamal; Blazier, J Chris; Jansen, Robert K

    2015-03-01

    Although gene coevolution has been widely observed within individuals and between different organisms, rarely has this phenomenon been investigated within a phylogenetic framework. The Geraniaceae is an attractive system in which to study plastid-nuclear genome coevolution due to the highly elevated evolutionary rates in plastid genomes. In plants, the plastid-encoded RNA polymerase (PEP) is a protein complex composed of subunits encoded by both plastid (rpoA, rpoB, rpoC1, and rpoC2) and nuclear genes (sig1-6). We used transcriptome and genomic data for 27 species of Geraniales in a systematic evaluation of coevolution between genes encoding subunits of the PEP holoenzyme. We detected strong correlations of dN (nonsynonymous substitutions) but not dS (synonymous substitutions) within rpoB/sig1 and rpoC2/sig2, but not for other plastid/nuclear gene pairs, and identified the correlation of dN/dS ratio between rpoB/C1/C2 and sig1/5/6, rpoC1/C2 and sig2, and rpoB/C2 and sig3 genes. Correlated rates between interacting plastid and nuclear sequences across the Geraniales could result from plastid-nuclear genome coevolution. Analyses of coevolved amino acid positions suggest that structurally mediated coevolution is not the major driver of plastid-nuclear coevolution. The detection of strong correlation of evolutionary rates between SIG and RNAP genes suggests a plausible explanation for plastome-genome incompatibility in Geraniaceae. © 2015 American Society of Plant Biologists. All rights reserved.

  6. Inverse gene-for-gene interactions contribute additively to tan spot susceptibility in wheat

    USDA-ARS's Scientific Manuscript database

    Tan spot of wheat, caused by Pyrenophora tritici-repentis, is an important disease in almost all wheat-growing areas of the world. The disease system is known to involve at least three fungal-produced necrotrophic effectors (NEs) that interact with corresponding host sensitivity (S) genes in an inv...

  7. Global Mapping of the Yeast Genetic Interaction Network

    NASA Astrophysics Data System (ADS)

    Tong, Amy Hin Yan; Lesage, Guillaume; Bader, Gary D.; Ding, Huiming; Xu, Hong; Xin, Xiaofeng; Young, James; Berriz, Gabriel F.; Brost, Renee L.; Chang, Michael; Chen, YiQun; Cheng, Xin; Chua, Gordon; Friesen, Helena; Goldberg, Debra S.; Haynes, Jennifer; Humphries, Christine; He, Grace; Hussein, Shamiza; Ke, Lizhu; Krogan, Nevan; Li, Zhijian; Levinson, Joshua N.; Lu, Hong; Ménard, Patrice; Munyana, Christella; Parsons, Ainslie B.; Ryan, Owen; Tonikian, Raffi; Roberts, Tania; Sdicu, Anne-Marie; Shapiro, Jesse; Sheikh, Bilal; Suter, Bernhard; Wong, Sharyl L.; Zhang, Lan V.; Zhu, Hongwei; Burd, Christopher G.; Munro, Sean; Sander, Chris; Rine, Jasper; Greenblatt, Jack; Peter, Matthias; Bretscher, Anthony; Bell, Graham; Roth, Frederick P.; Brown, Grant W.; Andrews, Brenda; Bussey, Howard; Boone, Charles

    2004-02-01

    A genetic interaction network containing ~1000 genes and ~4000 interactions was mapped by crossing mutations in 132 different query genes into a set of ~4700 viable yeast gene deletion mutants and scoring the double mutant progeny for fitness defects. Network connectivity was predictive of function because interactions often occurred among functionally related genes, and similar patterns of interactions tended to identify components of the same pathway. The genetic network exhibited dense local neighborhoods; therefore, the position of a gene on a partially mapped network is predictive of other genetic interactions. Because digenic interactions are common in yeast, similar networks may underlie the complex genetics associated with inherited phenotypes in other organisms.
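
    The observation that similar patterns of interactions tend to identify components of the same pathway can be sketched as a profile comparison. The example below is only an illustration: the gene labels are hypothetical, and Jaccard similarity is an assumed choice of measure, not necessarily the one used in the study.

```python
from collections import defaultdict
from itertools import combinations


def interaction_profiles(pairs):
    """Map each gene to the set of genes it shows a genetic interaction with."""
    profiles = defaultdict(set)
    for a, b in pairs:
        profiles[a].add(b)
        profiles[b].add(a)
    return profiles


def jaccard(s, t):
    """Shared partners divided by all partners of either gene."""
    union = s | t
    return len(s & t) / len(union) if union else 0.0


def profile_similarity(pairs, genes):
    """Jaccard similarity of interaction profiles for every pair of the given genes."""
    profiles = interaction_profiles(pairs)
    return {
        (a, b): jaccard(profiles[a], profiles[b])
        for a, b in combinations(sorted(genes), 2)
    }


# Hypothetical query genes q1..q3 scored against hypothetical deletion mutants d1..d5.
pairs = [
    ("q1", "d1"), ("q1", "d2"), ("q1", "d3"),
    ("q2", "d1"), ("q2", "d2"), ("q2", "d4"),
    ("q3", "d4"), ("q3", "d5"),
]
similarity = profile_similarity(pairs, ["q1", "q2", "q3"])
```

    Here q1 and q2 share two of four partners, so they score highest and would be the first candidates for membership in a common pathway.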

  8. Gene-to-gene interactions regulate endogenous pain modulation in fibromyalgia patients and healthy controls—antagonistic effects between opioid and serotonin-related genes

    PubMed Central

    Tour, Jeanette; Löfgren, Monika; Mannerkorpi, Kaisa; Gerdle, Björn; Larsson, Anette; Palstam, Annie; Bileviciute-Ljungar, Indre; Bjersing, Jan; Martin, Ingvar; Ernberg, Malin; Schalling, Martin; Kosek, Eva

    2017-01-01

    Chronic pain is associated with dysfunctional endogenous pain modulation, involving both central opioid and serotonergic (5-HT) signaling. Fibromyalgia (FM) is a chronic pain syndrome, characterized by widespread musculoskeletal pain and reduced exercise-induced hypoalgesia (EIH). In this study, we assessed the effects of 3 functional genetic polymorphisms on EIH in 130 patients with FM and 132 healthy controls. Subjects were genotyped regarding the mu-opioid receptor (OPRM1) gene (rs1799971), the serotonin transporter (5-HTT) gene (5-HTTLPR/rs25531), and the serotonin-1a receptor (5-HT1a) gene (rs6296). The patients with FM had increased pain sensitivity and reduced EIH compared with healthy controls. None of the polymorphisms had an effect on EIH on their own. We found significant gene-to-gene interactions between OPRM1 x 5-HTT and OPRM1 x 5-HT1a regarding activation of EIH, with no statistically significant difference between groups. Better EIH was found in individuals with genetically inferred strong endogenous opioid signaling (OPRM1 G) in combination with weak 5-HT tone (5-HTT low/5-HT1a G), compared with strong 5-HT tone (5-HTT high/5-HT1a CC). Based on the proposed mechanisms of these genetic variants, the findings indicate antagonistic interactions between opioid and serotonergic mechanisms during EIH. Moreover, despite different baseline pain level, similar results were detected in FM and controls, not supporting an altered interaction between opioid and 5-HT mechanisms as the basis for dysfunction of EIH in patients with FM. In summary, our results suggest that, by genetic association, the mu-opioid receptor interacts with 2 major serotonergic structures involved in 5-HT reuptake and release, to modulate EIH. PMID:28282362

  9. Gene-to-gene interactions regulate endogenous pain modulation in fibromyalgia patients and healthy controls-antagonistic effects between opioid and serotonin-related genes.

    PubMed

    Tour, Jeanette; Löfgren, Monika; Mannerkorpi, Kaisa; Gerdle, Björn; Larsson, Anette; Palstam, Annie; Bileviciute-Ljungar, Indre; Bjersing, Jan; Martin, Ingvar; Ernberg, Malin; Schalling, Martin; Kosek, Eva

    2017-07-01

    Chronic pain is associated with dysfunctional endogenous pain modulation, involving both central opioid and serotonergic (5-HT) signaling. Fibromyalgia (FM) is a chronic pain syndrome, characterized by widespread musculoskeletal pain and reduced exercise-induced hypoalgesia (EIH). In this study, we assessed the effects of 3 functional genetic polymorphisms on EIH in 130 patients with FM and 132 healthy controls. Subjects were genotyped regarding the mu-opioid receptor (OPRM1) gene (rs1799971), the serotonin transporter (5-HTT) gene (5-HTTLPR/rs25531), and the serotonin-1a receptor (5-HT1a) gene (rs6296). The patients with FM had increased pain sensitivity and reduced EIH compared with healthy controls. None of the polymorphisms had an effect on EIH on their own. We found significant gene-to-gene interactions between OPRM1 x 5-HTT and OPRM1 x 5-HT1a regarding activation of EIH, with no statistically significant difference between groups. Better EIH was found in individuals with genetically inferred strong endogenous opioid signaling (OPRM1 G) in combination with weak 5-HT tone (5-HTT low/5-HT1a G), compared with strong 5-HT tone (5-HTT high/5-HT1a CC). Based on the proposed mechanisms of these genetic variants, the findings indicate antagonistic interactions between opioid and serotonergic mechanisms during EIH. Moreover, despite different baseline pain level, similar results were detected in FM and controls, not supporting an altered interaction between opioid and 5-HT mechanisms as the basis for dysfunction of EIH in patients with FM. In summary, our results suggest that, by genetic association, the mu-opioid receptor interacts with 2 major serotonergic structures involved in 5-HT reuptake and release, to modulate EIH.

  10. Novel β-catenin target genes identified in thalamic neurons encode modulators of neuronal excitability

    PubMed Central

    2012-01-01

    Background: LEF1/TCF transcription factors and their activator β-catenin are effectors of the canonical Wnt pathway. Although Wnt/β-catenin signaling has been implicated in neurodegenerative and psychiatric disorders, its possible role in the adult brain remains enigmatic. To address this issue, we sought to identify the genetic program activated by β-catenin in neurons. We recently showed that β-catenin accumulates specifically in thalamic neurons where it activates Cacna1g gene expression. In the present study, we combined bioinformatics and experimental approaches to find new β-catenin targets in the adult thalamus. Results: We first selected the genes with at least two conserved LEF/TCF motifs within the regulatory elements. The resulting list of 428 putative LEF1/TCF targets was significantly enriched in known Wnt targets, validating our approach. Functional annotation of the presumed targets also revealed a group of 41 genes, heretofore not associated with Wnt pathway activity, that encode proteins involved in neuronal signal transmission. Using custom polymerase chain reaction arrays, we profiled the expression of these genes in the rat forebrain. We found that nine of the analyzed genes were highly expressed in the thalamus compared with the cortex and hippocampus. Removal of nuclear β-catenin from thalamic neurons in vitro by introducing its negative regulator Axin2 reduced the expression of six of the nine genes. Immunoprecipitation of chromatin from the brain tissues confirmed the interaction between β-catenin and some of the predicted LEF1/TCF motifs. The results of these experiments validated four genes as authentic and direct targets of β-catenin: Gabra3 for the receptor of GABA neurotransmitter, Calb2 for the Ca2+-binding protein calretinin, and the Cacna1g and Kcna6 genes for voltage-gated ion channels. Two other genes from the latter cluster, Cacna2d2 and Kcnh8, appeared to be regulated by β-catenin, although the binding of β-catenin to the

  11. Genetic interaction analysis of point mutations enables interrogation of gene function at a residue-level resolution

    PubMed Central

    Braberg, Hannes; Moehle, Erica A.; Shales, Michael; Guthrie, Christine; Krogan, Nevan J.

    2014-01-01

    We have achieved a residue-level resolution of genetic interaction mapping – a technique that measures how the function of one gene is affected by the alteration of a second gene – by analyzing point mutations. Here, we describe how to interpret point mutant genetic interactions, and outline key applications for the approach, including interrogation of protein interaction interfaces and active sites, and examination of post-translational modifications. Genetic interaction analysis has proven effective for characterizing cellular processes; however, to date, systematic high-throughput genetic interaction screens have relied on gene deletions or knockdowns, which limits the resolution of gene function analysis and poses problems for multifunctional genes. Our point mutant approach addresses these issues, and further provides a tool for in vivo structure-function analysis that complements traditional biophysical methods. We also discuss the potential for genetic interaction mapping of point mutations in human cells and its application to personalized medicine. PMID:24842270

  12. Avirulence Genes in Cereal Powdery Mildews: The Gene-for-Gene Hypothesis 2.0

    PubMed Central

    Bourras, Salim; McNally, Kaitlin E.; Müller, Marion C.; Wicker, Thomas; Keller, Beat

    2016-01-01

    The gene-for-gene hypothesis states that for each gene controlling resistance in the host, there is a corresponding, specific gene controlling avirulence in the pathogen. Allelic series of the cereal mildew resistance genes Pm3 and Mla provide an excellent system for genetic and molecular analysis of resistance specificity. Despite this opportunity for molecular research, avirulence genes in mildews remain underexplored. Earlier work in barley powdery mildew (B.g. hordei) has shown that the reaction to some Mla resistance alleles is controlled by multiple genes. Similarly, several genes are involved in the specific interaction of wheat mildew (B.g. tritici) with the Pm3 allelic series. We found that two mildew genes control avirulence on Pm3f: one gene is involved in recognition by the resistance protein as demonstrated by functional studies in wheat and the heterologous host Nicotiana benthamiana. A second gene is a suppressor, and resistance is only observed in mildew genotypes combining the inactive suppressor and the recognized Avr. We propose that such suppressor/avirulence gene combinations provide the basis of specificity in mildews. Depending on the particular gene combinations in a mildew race, different genes will be genetically identified as the “avirulence” gene. Additionally, the observation of two LINE retrotransposon-encoded avirulence genes in B.g. hordei further suggests that the control of avirulence in mildew is more complex than a canonical gene-for-gene interaction. To fully understand the mildew–cereal interactions, more knowledge on avirulence determinants is needed and we propose ways how this can be achieved based on recent advances in the field. PMID:26973683

  13. The genetic interacting landscape of 63 candidate genes in Major Depressive Disorder: an explorative study.

    PubMed

    Lekman, Magnus; Hössjer, Ola; Andrews, Peter; Källberg, Henrik; Uvehag, Daniel; Charney, Dennis; Manji, Husseini; Rush, John A; McMahon, Francis J; Moore, Jason H; Kockum, Ingrid

    2014-01-01

    Genetic contributions to major depressive disorder (MDD) are thought to result from multiple genes interacting with each other. Different procedures have been proposed to detect such interactions. Which approach is best for explaining the risk of developing disease is unclear. This study sought to elucidate the genetic interaction landscape in candidate genes for MDD by conducting a SNP-SNP interaction analysis using an exhaustive search through 3,704 SNP-markers in 1,732 cases and 1,783 controls provided from the GAIN MDD study. We used three different methods to detect interactions: two logistic regression models (multiplicative and additive) and one data mining and machine learning (MDR) approach. Although none of the interactions survived correction for multiple comparisons, the results provide important information for future genetic interaction studies in complex disorders. Among the 0.5% most significant observations, none had been reported previously for risk to MDD. Within this group of interactions, less than 0.03% would have been detectable based on main effect approach or an a priori algorithm. We evaluated correlations among the three different models and conclude that all three algorithms detected the same interactions to a low degree. Although the top interactions had a surprisingly large effect size for MDD (e.g. additive dominant model P(uncorrected) = 9.10E-9 with attributable proportion (AP) value = 0.58 and multiplicative recessive model with P(uncorrected) = 6.95E-5 with odds ratio (OR estimated from β3) value = 4.99) the area under the curve (AUC) estimates were low (< 0.54). Moreover, the population attributable fraction (PAF) estimates were also low (< 0.15). We conclude that the top interactions on their own did not explain much of the genetic variance of MDD. The different statistical interaction methods we used in the present study did not identify the same pairs of interacting markers. Genetic interaction studies may uncover previously
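
    The additive and multiplicative views of interaction named in this abstract can disagree on the same data. The sketch below uses the textbook definitions of the relative excess risk due to interaction (RERI), the attributable proportion (AP), and the ratio of the joint odds ratio to the product of the single-factor odds ratios. Whether the study computed AP in exactly this form is an assumption, and the odds ratios are made-up numbers.

```python
def interaction_measures(or_10, or_01, or_11):
    """Summarize interaction between two binary risk factors.

    or_10: odds ratio with only the first factor present
    or_01: odds ratio with only the second factor present
    or_11: odds ratio with both present (reference: neither present)
    """
    reri = or_11 - or_10 - or_01 + 1.0            # departure from additivity
    ap = reri / or_11                             # share of joint effect due to interaction
    multiplicative_ratio = or_11 / (or_10 * or_01)  # 1.0 means no multiplicative interaction
    return {"RERI": reri, "AP": ap, "multiplicative_ratio": multiplicative_ratio}


# Made-up odds ratios: each factor alone doubles the odds, together they quadruple them.
measures = interaction_measures(or_10=2.0, or_01=2.0, or_11=4.0)
```

    With these numbers the multiplicative ratio is 1.0 (no interaction on the multiplicative scale) while AP is 0.25 (interaction on the additive scale), which is one reason different models can flag different SNP pairs.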

  14. Discovering disease-associated genes in weighted protein-protein interaction networks

    NASA Astrophysics Data System (ADS)

    Cui, Ying; Cai, Meng; Stanley, H. Eugene

    2018-04-01

    Although there have been many network-based attempts to discover disease-associated genes, most of them have not taken edge weight - which quantifies their relative strength - into consideration. We use connection weights in a protein-protein interaction (PPI) network to locate disease-related genes. We analyze the topological properties of both weighted and unweighted PPI networks and design an improved random forest classifier to distinguish disease genes from non-disease genes. We use a cross-validation test to confirm that weighted networks are better able to discover disease-associated genes than unweighted networks, which indicates that including link weight in the analysis of network properties provides a better model of complex genotype-phenotype associations.
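
    The difference between weighted and unweighted topological properties can be seen on a tiny network. The sketch below computes, for each protein, its degree (number of partners) and its strength (sum of edge weights); these are two standard node features and are used here only as an assumed example of the kind of input a classifier might receive. The protein labels and weights are hypothetical.

```python
from collections import defaultdict


def node_features(edges):
    """Return degree and strength for every node of an undirected weighted network.

    edges: iterable of (node_u, node_v, weight).
    """
    degree = defaultdict(int)
    strength = defaultdict(float)
    for u, v, w in edges:
        for node in (u, v):
            degree[node] += 1       # unweighted: count the interaction
            strength[node] += w     # weighted: add the interaction's weight
    return {n: {"degree": degree[n], "strength": strength[n]} for n in degree}


# Hypothetical proteins P1..P4 with hypothetical confidence weights.
edges = [("P1", "P2", 0.9), ("P1", "P3", 0.2), ("P2", "P3", 0.5), ("P3", "P4", 0.1)]
features = node_features(edges)
```

    P3 has the most partners but mostly weak links, so it ranks first by degree and below P1 and P2 by strength; an unweighted analysis cannot see that distinction.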

  15. PREFACE: Physics approaches to protein interactions and gene regulation

    NASA Astrophysics Data System (ADS)

    Nussinov, Ruth; Panchenko, Anna R.; Przytycka, Teresa

    2011-06-01

    networks have been identified, including scale free distribution of the vertex degree, network motifs, and modularity, to name a few. These studies of network organization require the network to be as complete as possible, which given the limitations of experimental techniques is not currently the case. Therefore, experimental procedures for detecting biomolecular interactions should be complemented by computational approaches. The paper by Lees et al provides a review of computational methods, integrating multiple independent sources of data to infer physical and functional protein-protein interaction networks. One of the important aspects of protein interactions that should be accounted for in the prediction of protein interaction networks is that many proteins are composed of distinct domains. Protein domains may mediate protein interactions while proteins and their interaction networks may gain complexity through gene duplication and expansion of existing domain architectures via domain rearrangements. The latter mechanisms have been explored in detail in the paper by Cohen-Gihon et al. Protein-protein interactions are not the only component of the cell's interactome. Regulation of cell activity can be achieved at the level of transcription and involve a transcription factor—DNA binding which typically requires recognition of a specific DNA sequence motif. Chip-Chip and the more recent Chip-Seq technologies allow in vivo identification of DNA binding sites and, together with novel in vitro approaches, provide data necessary for deciphering the corresponding binding motifs. Such information, complemented by structures of protein-DNA complexes and knowledge of the differences in binding sites among homologs, opens the door to constructing predictive binding models. The paper by Persikov and Singh provides an example of such a model in the Cys2His2 zinc finger family. 
    Recent studies have indicated that the presence of such binding motifs is, however, neither necessary

  16. Identification of interactive gene networks: a novel approach in gene array profiling of myometrial events during guinea pig pregnancy.

    PubMed

    Mason, Clifford W; Swaan, Peter W; Weiner, Carl P

    2006-06-01

    The transition from myometrial quiescence to activation is poorly understood, and the analysis of array data is limited by the available data mining tools. We applied functional analysis and logical operations along regulatory gene networks to identify molecular processes and pathways underlying quiescence and activation. We analyzed some 18,400 transcripts and variants in guinea pig myometrium at stages corresponding to quiescence and activation, and compared them to the nonpregnant (control) counterpart using a functional mapping tool, MetaCore (GeneGo, St Joseph, MI), to identify novel gene networks composed of biological pathways during mid (MP) and late (LP) pregnancy. Genes altered during quiescence and/or activation were identified following gene-specific comparisons with myometrium from nonpregnant animals, and then linked to curated pathways and formulated networks. The MP and LP networks were subtracted from each other to identify unique genomic events during those periods. For example, changes 2-fold or greater in genes mediating protein biosynthesis, programmed cell death, microtubule polymerization, and microtubule-based movement were noted during the transition to LP. We describe a novel approach combining microarrays and genetic data to identify networks associated with normal myometrial events. The resulting insights help identify potential biomarkers and permit future targeted investigations of these pathways or networks to confirm or refute their importance.

  17. Recurrent Targeted Genes of Hepatitis B Virus in the Liver Cancer Genomes Identified by a Next-Generation Sequencing–Based Approach

    PubMed Central

    Ding, Dong; Lou, Xiaoyan; Hua, Dasong; Yu, Wei; Li, Lisha; Wang, Jun; Gao, Feng; Zhao, Na; Ren, Guoping; Li, Lanjuan; Lin, Biaoyang

    2012-01-01

    Integration of the viral DNA into host chromosomes was found in most of the hepatitis B virus (HBV)–related hepatocellular carcinomas (HCCs). Here we devised a massive anchored parallel sequencing (MAPS) method using next-generation sequencing to isolate and sequence HBV integrants. Applying MAPS to 40 pairs of HBV–related HCC tissues (cancer and adjacent tissues), we identified 296 HBV integration events corresponding to 286 unique integration sites (UISs) with precise HBV–Human DNA junctions. HBV integration favored chromosome 17 and preferentially integrated into human transcript units. HBV targeted genes were enriched in GO terms: cAMP metabolic processes, T cell differentiation and activation, TGF beta receptor pathway, ncRNA catabolic process, and dsRNA fragmentation and cellular response to dsRNA. The HBV targeted genes include 7 genes (PTPRJ, CNTN6, IL12B, MYOM1, FNDC3B, LRFN2, FN1) containing IPR003961 (Fibronectin, type III domain), 7 genes (NRG3, MASP2, NELL1, LRP1B, ADAM21, NRXN1, FN1) containing IPR013032 (EGF-like region, conserved site), and three genes (PDE7A, PDE4B, PDE11A) containing IPR002073 (3′, 5′-cyclic-nucleotide phosphodiesterase). Enriched pathways include hsa04512 (ECM-receptor interaction), hsa04510 (Focal adhesion), and hsa04012 (ErbB signaling pathway). Fewer integration events were found in cancers compared to cancer-adjacent tissues, suggesting a clonal expansion model in HCC development. Finally, we identified 8 genes that were recurrent target genes by HBV integration including fibronectin 1 (FN1) and telomerase reverse transcriptase (TERT1), two known recurrent target genes, and additional novel target genes such as SMAD family member 5 (SMAD5), phosphatase and actin regulator 4 (PHACTR4), and RNA binding protein fox-1 homolog (C. elegans) 1 (RBFOX1). Integrating analysis with recently published whole-genome sequencing analysis, we identified 14 additional recurrent HBV target genes, greatly expanding the HBV recurrent

  18. batman Interacts with polycomb and trithorax group genes and encodes a BTB/POZ protein that is included in a complex containing GAGA factor.

    PubMed

    Faucheux, M; Roignant, J-Y; Netter, S; Charollais, J; Antoniewski, C; Théodore, L

    2003-02-01

    Polycomb and trithorax group genes maintain the appropriate repressed or activated state of homeotic gene expression throughout Drosophila melanogaster development. We have previously identified the batman gene as a Polycomb group candidate since its function is necessary for the repression of Sex combs reduced. However, our present genetic analysis indicates functions of batman in both activation and repression of homeotic genes. The 127-amino-acid Batman protein is almost reduced to a BTB/POZ domain, an evolutionarily conserved protein-protein interaction domain found in a large protein family. We show that this domain is involved in the interaction between Batman and the DNA binding GAGA factor encoded by the Trithorax-like gene. The GAGA factor and Batman codistribute on polytene chromosomes, coimmunoprecipitate from nuclear embryonic and larval extracts, and interact in the yeast two-hybrid assay. Batman, together with the GAGA factor, binds to MHS-70, a 70-bp fragment of the bithoraxoid Polycomb response element. This binding, like that of the GAGA factor, requires the presence of d(GA)n sequences. Together, our results suggest that batman belongs to a subset of the Polycomb/trithorax group of genes that includes Trithorax-like, whose products are involved in both activation and repression of homeotic genes.

  19. batman Interacts with Polycomb and trithorax Group Genes and Encodes a BTB/POZ Protein That Is Included in a Complex Containing GAGA Factor

    PubMed Central

    Faucheux, M.; Roignant, J.-Y.; Netter, S.; Charollais, J.; Antoniewski, C.; Théodore, L.

    2003-01-01

    Polycomb and trithorax group genes maintain the appropriate repressed or activated state of homeotic gene expression throughout Drosophila melanogaster development. We have previously identified the batman gene as a Polycomb group candidate since its function is necessary for the repression of Sex combs reduced. However, our present genetic analysis indicates functions of batman in both activation and repression of homeotic genes. The 127-amino-acid Batman protein consists almost entirely of a BTB/POZ domain, an evolutionarily conserved protein-protein interaction domain found in a large protein family. We show that this domain is involved in the interaction between Batman and the DNA-binding GAGA factor encoded by the Trithorax-like gene. The GAGA factor and Batman codistribute on polytene chromosomes, coimmunoprecipitate from nuclear embryonic and larval extracts, and interact in the yeast two-hybrid assay. Batman, together with the GAGA factor, binds to MHS-70, a 70-bp fragment of the bithoraxoid Polycomb response element. This binding, like that of the GAGA factor, requires the presence of d(GA)n sequences. Together, our results suggest that batman belongs to a subset of the Polycomb/trithorax group of genes that includes Trithorax-like, whose products are involved in both activation and repression of homeotic genes. PMID:12556479

  20. A data mining paradigm for identifying key factors in biological processes using gene expression data.

    PubMed

    Li, Jin; Zheng, Le; Uchiyama, Akihiko; Bin, Lianghua; Mauro, Theodora M; Elias, Peter M; Pawelczyk, Tadeusz; Sakowicz-Burkiewicz, Monika; Trzeciak, Magdalena; Leung, Donald Y M; Morasso, Maria I; Yu, Peng

    2018-06-13

    A large volume of biological data is being generated for studying mechanisms of various biological processes. These precious data enable large-scale computational analyses to gain biological insights. However, it remains a challenge to mine the data efficiently for knowledge discovery. The heterogeneity of these data makes it difficult to consistently integrate them, slowing down the process of biological discovery. We introduce a data processing paradigm to identify key factors in biological processes via systematic collection of gene expression datasets, primary analysis of data, and evaluation of consistent signals. To demonstrate its effectiveness, our paradigm was applied to epidermal development and identified many genes that play a potential role in this process. Besides the known epidermal development genes, a substantial proportion of the identified genes are still not supported by gain- or loss-of-function studies, yielding many novel genes for future studies. Among them, we selected a top gene for loss-of-function experimental validation and confirmed its function in epidermal differentiation, proving the ability of this paradigm to identify new factors in biological processes. In addition, this paradigm revealed many key genes in cold-induced thermogenesis using data from cold-challenged tissues, demonstrating its generalizability. This paradigm can lead to fruitful results for studying molecular mechanisms in an era of explosive accumulation of publicly available biological data.
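
    The "evaluation of consistent signals" step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the input format (one dictionary of log2 fold changes per dataset), the function name consistent_genes, the cutoffs lfc_cutoff and min_fraction, and the gene labels are all assumptions made for illustration. The idea shown is simply to keep genes that change in the same direction in a large enough share of independently analyzed datasets.

```python
from collections import Counter


def consistent_genes(datasets, lfc_cutoff=1.0, min_fraction=0.75):
    """Rank genes by how often they change in one direction across datasets.

    datasets: list of dicts mapping gene label -> log2 fold change
              (hypothetical format; one dict per primary analysis).
    lfc_cutoff, min_fraction: illustrative thresholds, not from the paper.
    """
    up, down = Counter(), Counter()
    for lfc_by_gene in datasets:
        for gene, lfc in lfc_by_gene.items():
            if lfc >= lfc_cutoff:
                up[gene] += 1
            elif lfc <= -lfc_cutoff:
                down[gene] += 1

    n = len(datasets)
    hits = []
    for gene in set(up) | set(down):
        # Pick the dominant direction for this gene (ties resolve to "up").
        if up[gene] >= down[gene]:
            direction, count = "up", up[gene]
        else:
            direction, count = "down", down[gene]
        # Keep the gene only if enough datasets agree on that direction.
        if count / n >= min_fraction:
            hits.append((gene, direction, count))

    # Most consistent genes first; gene label breaks ties.
    return sorted(hits, key=lambda h: (-h[2], h[0]))


if __name__ == "__main__":
    # Made-up fold changes for three placeholder genes in four datasets.
    example = [
        {"GENE_A": 2.0, "GENE_B": -1.5, "GENE_C": 0.2},
        {"GENE_A": 1.2, "GENE_B": -2.0, "GENE_C": 1.5},
        {"GENE_A": 1.8, "GENE_B": 0.1, "GENE_C": -1.1},
        {"GENE_A": 1.1, "GENE_B": -1.2},
    ]
    for gene, direction, count in consistent_genes(example):
        print(gene, direction, count)
```

    Counting directional agreement, rather than pooling effect sizes, is one simple way to tolerate heterogeneous datasets; a real analysis would likely also account for statistical significance and differing platforms.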