Sample records for identify causal genes

  1. A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

    PubMed

    Ishikawa, Akira

    2017-11-27

    Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.

  2. Genetic regulation of gene expression in the lung identifies CST3 and CD22 as potential causal genes for airflow obstruction.

    PubMed

    Lamontagne, Maxime; Timens, Wim; Hao, Ke; Bossé, Yohan; Laviolette, Michel; Steiling, Katrina; Campbell, Joshua D; Couture, Christian; Conti, Massimo; Sherwood, Karen; Hogg, James C; Brandsma, Corry-Anke; van den Berge, Maarten; Sandford, Andrew; Lam, Stephen; Lenburg, Marc E; Spira, Avrum; Paré, Peter D; Nickle, David; Sin, Don D; Postma, Dirkje S

    2014-11-01

    COPD is a complex chronic disease with poorly understood pathogenesis. Integrative genomic approaches have the potential to elucidate the biological networks underlying COPD and lung function. We recently combined genome-wide genotyping and gene expression in 1111 human lung specimens to map expression quantitative trait loci (eQTL). To determine causal associations between COPD and lung function-associated single nucleotide polymorphisms (SNPs) and lung tissue gene expression changes in our lung eQTL dataset. We evaluated causality between SNPs and gene expression for three COPD phenotypes: FEV(1)% predicted, FEV(1)/FVC and COPD as a categorical variable. Different models were assessed in the three cohorts independently and in a meta-analysis. SNPs associated with a COPD phenotype and gene expression were subjected to causal pathway modelling and manual curation. In silico analyses evaluated functional enrichment of biological pathways among newly identified causal genes. Biologically relevant causal genes were validated in two separate gene expression datasets of lung tissues and bronchial airway brushings. High reliability causal relations were found in SNP-mRNA-phenotype triplets for FEV(1)% predicted (n=169) and FEV(1)/FVC (n=80). Several genes of potential biological relevance for COPD were revealed. eQTL-SNPs upregulating cystatin C (CST3) and CD22 were associated with worse lung function. Signalling pathways enriched with causal genes included xenobiotic metabolism, apoptosis, protease-antiprotease and oxidant-antioxidant balance. By using integrative genomics and analysing the relationships of COPD phenotypes with SNPs and gene expression in lung tissue, we identified CST3 and CD22 as potential causal genes for airflow obstruction. This study also augmented the understanding of previously described COPD pathways. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  3. Pharmacological Validation of Candidate Causal Sleep Genes Identified in an N2 Cross

    PubMed Central

    Brunner, Joseph I.; Gotter, Anthony L.; Millstein, Joshua; Garson, Susan; Binns, Jacquelyn; Fox, Steven V.; Savitz, Alan T.; Yang, He S.; Fitzpatrick, Karrie; Zhou, Lili; Owens, Joseph R.; Webber, Andrea L.; Vitaterna, Martha H.; Kasarskis, Andrew; Uebele, Victor N.; Turek, Fred; Renger, John J.; Winrow, Christopher J.

    2013-01-01

    Despite the substantial impact of sleep disturbances on human health and the many years of study dedicated to understanding sleep pathologies, the underlying genetic mechanisms that govern sleep and wake largely remain unknown. Recently, we completed large scale genetic and gene expression analyses in a segregating inbred mouse cross and identified candidate causal genes that regulate the mammalian sleep-wake cycle, across multiple traits including total sleep time, amounts of REM, non-REM, sleep bout duration and sleep fragmentation. Here we describe a novel approach toward validating candidate causal genes, while also identifying potential targets for sleep-related indications. Select small molecule antagonists and agonists were used to interrogate candidate causal gene function in rodent sleep polysomnography assays to determine impact on overall sleep architecture and to evaluate alignment with associated sleep-wake traits. Significant effects on sleep architecture were observed in validation studies using compounds targeting the muscarinic acetylcholine receptor M3 subunit (Chrm3)(wake promotion), nicotinic acetylcholine receptor alpha4 subunit (Chrna4)(wake promotion), dopamine receptor D5 subunit (Drd5)(sleep induction), serotonin 1D receptor (Htr1d)(altered REM fragmentation), glucagon-like peptide-1 receptor (Glp1r)(light sleep promotion and reduction of deep sleep), and Calcium channel, voltage-dependent, T type, alpha 1I subunit (Cacna1i)(increased bout duration slow wave sleep). Taken together, these results show the complexity of genetic components that regulate sleep-wake traits and highlight the importance of evaluating this complex behavior at a systems level. Pharmacological validation of genetically identified putative targets provides a rapid alternative to generating knock out or transgenic animal models, and may ultimately lead towards new therapeutic opportunities. PMID:22091728

  4. Leveraging network analytics to infer patient syndrome and identify causal genes in rare disease cases.

    PubMed

    Krämer, Andreas; Shah, Sohela; Rebres, Robert Anthony; Tang, Susan; Richards, Daniel Rene

    2017-08-11

    Next-generation sequencing is widely used to identify disease-causing variants in patients with rare genetic disorders. Identifying those variants from whole-genome or exome data can be both scientifically challenging and time consuming. A significant amount of time is spent on variant annotation, and interpretation. Fully or partly automated solutions are therefore needed to streamline and scale this process. We describe Phenotype Driven Ranking (PDR), an algorithm integrated into Ingenuity Variant Analysis, that uses observed patient phenotypes to prioritize diseases and genes in order to expedite causal-variant discovery. Our method is based on a network of phenotype-disease-gene relationships derived from the QIAGEN Knowledge Base, which allows for efficient computational association of phenotypes to implicated diseases, and also enables scoring and ranking. We have demonstrated the utility and performance of PDR by applying it to a number of clinical rare-disease cases, where the true causal gene was known beforehand. It is also shown that PDR compares favorably to a representative alternative tool.

  5. Identification of causal genes for complex traits

    PubMed Central

    Hormozdiari, Farhad; Kichaev, Gleb; Yang, Wen-Yun; Pasaniuc, Bogdan; Eskin, Eleazar

    2015-01-01

    Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider ‘causal variants’ as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. Results: In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also have an average of 10% higher recall rate compared with the existing approaches. We validate our method using a real mouse high-density lipoprotein data (HDL) and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Availability and implementation: Software is freely available for download at genetics.cs.ucla.edu/caviar. Contact: eeskin@cs.ucla.edu PMID:26072484

  6. Identification of causal genes for complex traits.

    PubMed

    Hormozdiari, Farhad; Kichaev, Gleb; Yang, Wen-Yun; Pasaniuc, Bogdan; Eskin, Eleazar

    2015-06-15

    Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider 'causal variants' as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also have an average of 10% higher recall rate compared with the existing approaches. We validate our method using a real mouse high-density lipoprotein data (HDL) and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Software is freely available for download at genetics.cs.ucla.edu/caviar. © The Author 2015. Published by Oxford University Press.

  7. Identifying Causal Variants at Loci with Multiple Signals of Association

    PubMed Central

    Hormozdiari, Farhad; Kostem, Emrah; Kang, Eun Yong; Pasaniuc, Bogdan; Eskin, Eleazar

    2014-01-01

    Although genome-wide association studies have successfully identified thousands of risk loci for complex traits, only a handful of the biologically causal variants, responsible for association at these loci, have been successfully identified. Current statistical methods for identifying causal variants at risk loci either use the strength of the association signal in an iterative conditioning framework or estimate probabilities for variants to be causal. A main drawback of existing methods is that they rely on the simplifying assumption of a single causal variant at each risk locus, which is typically invalid at many risk loci. In this work, we propose a new statistical framework that allows for the possibility of an arbitrary number of causal variants when estimating the posterior probability of a variant being causal. A direct benefit of our approach is that we predict a set of variants for each locus that under reasonable assumptions will contain all of the true causal variants with a high confidence level (e.g., 95%) even when the locus contains multiple causal variants. We use simulations to show that our approach provides 20–50% improvement in our ability to identify the causal variants compared to the existing methods at loci harboring multiple causal variants. We validate our approach using empirical data from an expression QTL study of CHI3L2 to identify new causal variants that affect gene expression at this locus. CAVIAR is publicly available online at http://genetics.cs.ucla.edu/caviar/. PMID:25104515

  8. Identifying causal variants at loci with multiple signals of association.

    PubMed

    Hormozdiari, Farhad; Kostem, Emrah; Kang, Eun Yong; Pasaniuc, Bogdan; Eskin, Eleazar

    2014-10-01

    Although genome-wide association studies have successfully identified thousands of risk loci for complex traits, only a handful of the biologically causal variants, responsible for association at these loci, have been successfully identified. Current statistical methods for identifying causal variants at risk loci either use the strength of the association signal in an iterative conditioning framework or estimate probabilities for variants to be causal. A main drawback of existing methods is that they rely on the simplifying assumption of a single causal variant at each risk locus, which is typically invalid at many risk loci. In this work, we propose a new statistical framework that allows for the possibility of an arbitrary number of causal variants when estimating the posterior probability of a variant being causal. A direct benefit of our approach is that we predict a set of variants for each locus that under reasonable assumptions will contain all of the true causal variants with a high confidence level (e.g., 95%) even when the locus contains multiple causal variants. We use simulations to show that our approach provides 20-50% improvement in our ability to identify the causal variants compared to the existing methods at loci harboring multiple causal variants. We validate our approach using empirical data from an expression QTL study of CHI3L2 to identify new causal variants that affect gene expression at this locus. CAVIAR is publicly available online at http://genetics.cs.ucla.edu/caviar/. Copyright © 2014 by the Genetics Society of America.

  9. Causal network analysis of head and neck keloid tissue identifies potential master regulators.

    PubMed

    Garcia-Rodriguez, Laura; Jones, Lamont; Chen, Kang Mei; Datta, Indrani; Divine, George; Worsham, Maria J

    2016-10-01

    To generate novel insights and hypotheses in keloid development from potential master regulators. Prospective cohort. Six fresh keloid and six normal skin samples from 12 anonymous donors were used in a prospective cohort study. Genome-wide profiling was done previously on the cohort using the Infinium HumanMethylation450 BeadChip (Illumina, San Diego, CA). The 190 statistically significant CpG islands between keloid and normal tissue mapped to 152 genes (P < .05). The top 10 statistically significant genes (VAMP5, ACTR3C, GALNT3, KCNAB2, LRRC61, SCML4, SYNGR1, TNS1, PLEKHG5, PPP1R13-α, false discovery rate <.015) were uploaded into the Ingenuity Pathway Analysis software's Causal Network Analysis (QIAGEN, Redwood City, CA). To reflect expected gene expression direction in the context of methylation changes, the inverse of the methylation ratio from keloid versus normal tissue was used for the analysis. Causal Network Analysis identified disease-specific master regulator molecules based on downstream differentially expressed keloid-specific genes and expected directionality of expression (hypermethylated vs. hypomethylated). Causal Network Analysis software identified four hierarchical networks that included four master regulators (pyroxamide, tributyrin, PRKG2, and PENK) and 19 intermediate regulators. Causal Network Analysis of differentiated methylated gene data of keloid versus normal skin demonstrated four causal networks with four master regulators. These hierarchical networks suggest potential driver roles for their downstream keloid gene targets in the pathogenesis of the keloid phenotype, likely triggered due to perturbation/injury to normal tissue. NA Laryngoscope, 126:E319-E324, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.

  10. ICSNPathway: identify candidate causal SNPs and pathways from genome-wide association study by one analytical framework.

    PubMed

    Zhang, Kunlin; Chang, Suhua; Cui, Sijia; Guo, Liyuan; Zhang, Liuyan; Wang, Jing

    2011-07-01

    Genome-wide association study (GWAS) is widely utilized to identify genes involved in human complex disease or some other trait. One key challenge for GWAS data interpretation is to identify causal SNPs and provide profound evidence on how they affect the trait. Currently, researches are focusing on identification of candidate causal variants from the most significant SNPs of GWAS, while there is lack of support on biological mechanisms as represented by pathways. Although pathway-based analysis (PBA) has been designed to identify disease-related pathways by analyzing the full list of SNPs from GWAS, it does not emphasize on interpreting causal SNPs. To our knowledge, so far there is no web server available to solve the challenge for GWAS data interpretation within one analytical framework. ICSNPathway is developed to identify candidate causal SNPs and their corresponding candidate causal pathways from GWAS by integrating linkage disequilibrium (LD) analysis, functional SNP annotation and PBA. ICSNPathway provides a feasible solution to bridge the gap between GWAS and disease mechanism study by generating hypothesis of SNP → gene → pathway(s). The ICSNPathway server is freely available at http://icsnpathway.psych.ac.cn/.

  11. Gene expression meta-analysis identifies chromosomal regions and candidate genes involved in breast cancer metastasis.

    PubMed

    Thomassen, Mads; Tan, Qihua; Kruse, Torben A

    2009-01-01

    Breast cancer cells exhibit complex karyotypic alterations causing deregulation of numerous genes. Some of these genes are probably causal for cancer formation and local growth whereas others are causal for the various steps of metastasis. In a fraction of tumors deregulation of the same genes might be caused by epigenetic modulations, point mutations or the influence of other genes. We have investigated the relation of gene expression and chromosomal position, using eight datasets including more than 1200 breast tumors, to identify chromosomal regions and candidate genes possibly causal for breast cancer metastasis. By use of "Gene Set Enrichment Analysis" we have ranked chromosomal regions according to their relation to metastasis. Overrepresentation analysis identified regions with increased expression for chromosome 1q41-42, 8q24, 12q14, 16q22, 16q24, 17q12-21.2, 17q21-23, 17q25, 20q11, and 20q13 among metastasizing tumors and reduced gene expression at 1p31-21, 8p22-21, and 14q24. By analysis of genes with extremely imbalanced expression in these regions we identified DIRAS3 at 1p31, PSD3, LPL, EPHX2 at 8p21-22, and FOS at 14q24 as candidate metastasis suppressor genes. Potential metastasis promoting genes includes RECQL4 at 8q24, PRMT7 at 16q22, GINS2 at 16q24, and AURKA at 20q13.

  12. Causal gene identification using combinatorial V-structure search.

    PubMed

    Cai, Ruichu; Zhang, Zhenjie; Hao, Zhifeng

    2013-07-01

    With the advances of biomedical techniques in the last decade, the costs of human genomic sequencing and genomic activity monitoring are coming down rapidly. To support the huge genome-based business in the near future, researchers are eager to find killer applications based on human genome information. Causal gene identification is one of the most promising applications, which may help the potential patients to estimate the risk of certain genetic diseases and locate the target gene for further genetic therapy. Unfortunately, existing pattern recognition techniques, such as Bayesian networks, cannot be directly applied to find the accurate causal relationship between genes and diseases. This is mainly due to the insufficient number of samples and the extremely high dimensionality of the gene space. In this paper, we present the first practical solution to causal gene identification, utilizing a new combinatorial formulation over V-Structures commonly used in conventional Bayesian networks, by exploring the combinations of significant V-Structures. We prove the NP-hardness of the combinatorial search problem under a general settings on the significance measure on the V-Structures, and present a greedy algorithm to find sub-optimal results. Extensive experiments show that our proposal is both scalable and effective, particularly with interesting findings on the causal genes over real human genome data. Copyright © 2013 Elsevier Ltd. All rights reserved.

  13. Identification of Causal Genes, Networks, and Transcriptional Regulators of REM Sleep and Wake

    PubMed Central

    Millstein, Joshua; Winrow, Christopher J.; Kasarskis, Andrew; Owens, Joseph R.; Zhou, Lili; Summa, Keith C.; Fitzpatrick, Karrie; Zhang, Bin; Vitaterna, Martha H.; Schadt, Eric E.; Renger, John J.; Turek, Fred W.

    2011-01-01

    Study Objective: Sleep-wake traits are well-known to be under substantial genetic control, but the specific genes and gene networks underlying primary sleep-wake traits have largely eluded identification using conventional approaches, especially in mammals. Thus, the aim of this study was to use systems genetics and statistical approaches to uncover the genetic networks underlying 2 primary sleep traits in the mouse: 24-h duration of REM sleep and wake. Design: Genome-wide RNA expression data from 3 tissues (anterior cortex, hypothalamus, thalamus/midbrain) were used in conjunction with high-density genotyping to identify candidate causal genes and networks mediating the effects of 2 QTL regulating the 24-h duration of REM sleep and one regulating the 24-h duration of wake. Setting: Basic sleep research laboratory. Patients or Participants: Male [C57BL/6J × (BALB/cByJ × C57BL/6J*) F1] N2 mice (n = 283). Interventions: None. Measurements and Results: The genetic variation of a mouse N2 mapping cross was leveraged against sleep-state phenotypic variation as well as quantitative gene expression measurement in key brain regions using integrative genomics approaches to uncover multiple causal sleep-state regulatory genes, including several surprising novel candidates, which interact as components of networks that modulate REM sleep and wake. In particular, it was discovered that a core network module, consisting of 20 genes, involved in the regulation of REM sleep duration is conserved across the cortex, hypothalamus, and thalamus. A novel application of a formal causal inference test was also used to identify those genes directly regulating sleep via control of expression. Conclusion: Systems genetics approaches reveal novel candidate genes, complex networks and specific transcriptional regulators of REM sleep and wake duration in mammals. Citation: Millstein J; Winrow CJ; Kasarskis A; Owens JR; Zhou L; Summa KC; Fitzpatrick K; Zhang B; Vitaterna MH; Schadt EE

  14. Multiple-input multiple-output causal strategies for gene selection.

    PubMed

    Bontempi, Gianluca; Haibe-Kains, Benjamin; Desmedt, Christine; Sotiriou, Christos; Quackenbush, John

    2011-11-25

    Traditional strategies for selecting variables in high dimensional classification problems aim to find sets of maximally relevant variables able to explain the target variations. If these techniques may be effective in generalization accuracy they often do not reveal direct causes. The latter is essentially related to the fact that high correlation (or relevance) does not imply causation. In this study, we show how to efficiently incorporate causal information into gene selection by moving from a single-input single-output to a multiple-input multiple-output setting. We show in synthetic case study that a better prioritization of causal variables can be obtained by considering a relevance score which incorporates a causal term. In addition we show, in a meta-analysis study of six publicly available breast cancer microarray datasets, that the improvement occurs also in terms of accuracy. The biological interpretation of the results confirms the potential of a causal approach to gene selection. Integrating causal information into gene selection algorithms is effective both in terms of prediction accuracy and biological interpretation.

  15. An Integrative Genetics Approach to Identify Candidate Genes Regulating BMD: Combining Linkage, Gene Expression, and Association

    PubMed Central

    Farber, Charles R; van Nas, Atila; Ghazalpour, Anatole; Aten, Jason E; Doss, Sudheer; Sos, Brandon; Schadt, Eric E; Ingram-Drake, Leslie; Davis, Richard C; Horvath, Steve; Smith, Desmond J; Drake, Thomas A; Lusis, Aldons J

    2009-01-01

    Numerous quantitative trait loci (QTLs) affecting bone traits have been identified in the mouse; however, few of the underlying genes have been discovered. To improve the process of transitioning from QTL to gene, we describe an integrative genetics approach, which combines linkage analysis, expression QTL (eQTL) mapping, causality modeling, and genetic association in outbred mice. In C57BL/6J × C3H/HeJ (BXH) F2 mice, nine QTLs regulating femoral BMD were identified. To select candidate genes from within each QTL region, microarray gene expression profiles from individual F2 mice were used to identify 148 genes whose expression was correlated with BMD and regulated by local eQTLs. Many of the genes that were the most highly correlated with BMD have been previously shown to modulate bone mass or skeletal development. Candidates were further prioritized by determining whether their expression was predicted to underlie variation in BMD. Using network edge orienting (NEO), a causality modeling algorithm, 18 of the 148 candidates were predicted to be causally related to differences in BMD. To fine-map QTLs, markers in outbred MF1 mice were tested for association with BMD. Three chromosome 11 SNPs were identified that were associated with BMD within the Bmd11 QTL. Finally, our approach provides strong support for Wnt9a, Rasd1, or both underlying Bmd11. Integration of multiple genetic and genomic data sets can substantially improve the efficiency of QTL fine-mapping and candidate gene identification. PMID:18767929

  16. Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits.

    PubMed

    Mancuso, Nicholas; Shi, Huwenbo; Goddard, Pagé; Kichaev, Gleb; Gusev, Alexander; Pasaniuc, Bogdan

    2017-03-02

    Although genome-wide association studies (GWASs) have identified thousands of risk loci for many complex traits and diseases, the causal variants and genes at these loci remain largely unknown. Here, we introduce a method for estimating the local genetic correlation between gene expression and a complex trait and utilize it to estimate the genetic correlation due to predicted expression between pairs of traits. We integrated gene expression measurements from 45 expression panels with summary GWAS data to perform 30 multi-tissue transcriptome-wide association studies (TWASs). We identified 1,196 genes whose expression is associated with these traits; of these, 168 reside more than 0.5 Mb away from any previously reported GWAS significant variant. We then used our approach to find 43 pairs of traits with significant genetic correlation at the level of predicted expression; of these, eight were not found through genetic correlation at the SNP level. Finally, we used bi-directional regression to find evidence that BMI causally influences triglyceride levels and that triglyceride levels causally influence low-density lipoprotein. Together, our results provide insight into the role of gene expression in the susceptibility of complex traits and diseases. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  17. Prioritizing causal disease genes using unbiased genomic features.

    PubMed

    Deo, Rahul C; Musso, Gabriel; Tasan, Murat; Tang, Paul; Poon, Annie; Yuan, Christiana; Felix, Janine F; Vasan, Ramachandran S; Beroukhim, Rameen; De Marco, Teresa; Kwok, Pui-Yan; MacRae, Calum A; Roth, Frederick P

    2014-12-03

    Cardiovascular disease (CVD) is the leading cause of death in the developed world. Human genetic studies, including genome-wide sequencing and SNP-array approaches, promise to reveal disease genes and mechanisms representing new therapeutic targets. In practice, however, identification of the actual genes contributing to disease pathogenesis has lagged behind identification of associated loci, thus limiting the clinical benefits. To aid in localizing causal genes, we develop a machine learning approach, Objective Prioritization for Enhanced Novelty (OPEN), which quantitatively prioritizes gene-disease associations based on a diverse group of genomic features. This approach uses only unbiased predictive features and thus is not hampered by a preference towards previously well-characterized genes. We demonstrate success in identifying genetic determinants for CVD-related traits, including cholesterol levels, blood pressure, and conduction system and cardiomyopathy phenotypes. Using OPEN, we prioritize genes, including FLNC, for association with increased left ventricular diameter, which is a defining feature of a prevalent cardiovascular disorder, dilated cardiomyopathy or DCM. Using a zebrafish model, we experimentally validate FLNC and identify a novel FLNC splice-site mutation in a patient with severe DCM. Our approach stands to assist interpretation of large-scale genetic studies without compromising their fundamentally unbiased nature.

  18. Functional clustering of time series gene expression data by Granger causality

    PubMed Central

    2012-01-01

    Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425

  19. Capture Hi-C identifies a novel causal gene, IL20RA, in the pan-autoimmune genetic susceptibility region 6q23.

    PubMed

    McGovern, Amanda; Schoenfelder, Stefan; Martin, Paul; Massey, Jonathan; Duffus, Kate; Plant, Darren; Yarwood, Annie; Pratt, Arthur G; Anderson, Amy E; Isaacs, John D; Diboll, Julie; Thalayasingam, Nishanthi; Ospelt, Caroline; Barton, Anne; Worthington, Jane; Fraser, Peter; Eyre, Stephen; Orozco, Gisela

    2016-11-01

    The identification of causal genes from genome-wide association studies (GWAS) is the next important step for the translation of genetic findings into biologically meaningful mechanisms of disease and potential therapeutic targets. Using novel chromatin interaction detection techniques and allele specific assays in T and B cell lines, we provide compelling evidence that redefines causal genes at the 6q23 locus, one of the most important loci that confers autoimmunity risk. Although the function of disease-associated non-coding single nucleotide polymorphisms (SNPs) at 6q23 is unknown, the association is generally assigned to TNFAIP3, the closest gene. However, the DNA fragment containing the associated SNPs interacts through chromatin looping not only with TNFAIP3, but also with IL20RA, located 680 kb upstream. The risk allele of the most likely causal SNP, rs6927172, is correlated with both a higher frequency of interactions and increased expression of IL20RA, along with a stronger binding of both the NFκB transcription factor and chromatin marks characteristic of active enhancers in T-cells. Our results highlight the importance of gene assignment for translating GWAS findings into biologically meaningful mechanisms of disease and potential therapeutic targets; indeed, monoclonal antibody therapy targeting IL-20 is effective in the treatment of rheumatoid arthritis and psoriasis, both with strong GWAS associations to this region.

  20. Inferring causal genomic alterations in breast cancer using gene expression data

    PubMed Central

    2011-01-01

    Background One of the primary objectives in cancer research is to identify causal genomic alterations, such as somatic copy number variation (CNV) and somatic mutations, during tumor development. Many valuable studies lack genomic data to detect CNV; therefore, methods that are able to infer CNVs from gene expression data would help maximize the value of these studies. Results We developed a framework for identifying recurrent regions of CNV and distinguishing the cancer driver genes from the passenger genes in the regions. By inferring CNV regions across many datasets we were able to identify 109 recurrent amplified/deleted CNV regions. Many of these regions are enriched for genes involved in many important processes associated with tumorigenesis and cancer progression. Genes in these recurrent CNV regions were then examined in the context of gene regulatory networks to prioritize putative cancer driver genes. The cancer driver genes uncovered by the framework include not only well-known oncogenes but also a number of novel cancer susceptibility genes validated via siRNA experiments. Conclusions To our knowledge, this is the first effort to systematically identify and validate drivers for expression based CNV regions in breast cancer. The framework where the wavelet analysis of copy number alteration based on expression coupled with the gene regulatory network analysis, provides a blueprint for leveraging genomic data to identify key regulatory components and gene targets. This integrative approach can be applied to many other large-scale gene expression studies and other novel types of cancer data such as next-generation sequencing based expression (RNA-Seq) as well as CNV data. PMID:21806811

  1. New insights into old methods for identifying causal rare variants.

    PubMed

    Wang, Haitian; Huang, Chien-Hsun; Lo, Shaw-Hwa; Zheng, Tian; Hu, Inchi

    2011-11-29

    The advance of high-throughput next-generation sequencing technology makes possible the analysis of rare variants. However, the investigation of rare variants in unrelated-individuals data sets faces the challenge of low power, and most methods circumvent the difficulty by using various collapsing procedures based on genes, pathways, or gene clusters. We suggest a new way to identify causal rare variants using the F-statistic and sliced inverse regression. The procedure is tested on the data set provided by the Genetic Analysis Workshop 17 (GAW17). After preliminary data reduction, we ranked markers according to their F-statistic values. Top-ranked markers were then subjected to sliced inverse regression, and those with higher absolute coefficients in the most significant sliced inverse regression direction were selected. The procedure yields good false discovery rates for the GAW17 data and thus is a promising method for future study on rare variants.

  2. Identifying causal linkages between environmental variables and African conflicts

    NASA Astrophysics Data System (ADS)

    Nguy-Robertson, A. L.; Dartevelle, S.

    2017-12-01

    Environmental variables that contribute to droughts, flooding, and other natural hazards are often identified as factors contributing to conflict; however, few studies attempt to quantify these causal linkages. Recent research has demonstrated that the environment operates within a dynamical system framework and the influence of variables can be identified from convergent cross mapping (CCM) between shadow manifolds. We propose to use CCM to identify causal linkages between environmental variables and incidences of conflict. This study utilizes time series data from Climate Forecast System ver. 2 and MODIS satellite sensors processed using Google Earth Engine to aggregate country and regional trends. These variables are then compared to Armed Conflict Location & Event Data Project observations at similar scales. Results provide relative rankings of variables and their linkage to conflict. Being able to identify which factors contributed more strongly to a conflict can allow policy makers to prepare solutions to mitigate future crises. Knowledge of the primary environmental factors can lead to the identification of other variables to examine in the causal network influencing conflict.

  3. Prior knowledge driven Granger causality analysis on gene regulatory network discovery

    DOE PAGES

    Yao, Shun; Yoo, Shinjae; Yu, Dantong

    2015-08-28

    Our study focuses on discovering gene regulatory networks from time series gene expression data using the Granger causality (GC) model. However, the number of available time points (T) usually is much smaller than the number of target genes (n) in biological datasets. The widely applied pairwise GC model (PGC) and other regularization strategies can lead to a significant number of false identifications when n>>T. In this study, we proposed a new method, viz., CGC-2SPR (CGC using two-step prior Ridge regularization) to resolve the problem by incorporating prior biological knowledge about a target gene data set. In our simulation experiments, themore » propose new methodology CGC-2SPR showed significant performance improvement in terms of accuracy over other widely used GC modeling (PGC, Ridge and Lasso) and MI-based (MRNET and ARACNE) methods. In addition, we applied CGC-2SPR to a real biological dataset, i.e., the yeast metabolic cycle, and discovered more true positive edges with CGC-2SPR than with the other existing methods. In our research, we noticed a “ 1+1>2” effect when we combined prior knowledge and gene expression data to discover regulatory networks. Based on causality networks, we made a functional prediction that the Abm1 gene (its functions previously were unknown) might be related to the yeast’s responses to different levels of glucose. In conclusion, our research improves causality modeling by combining heterogeneous knowledge, which is well aligned with the future direction in system biology. Furthermore, we proposed a method of Monte Carlo significance estimation (MCSE) to calculate the edge significances which provide statistical meanings to the discovered causality networks. All of our data and source codes will be available under the link https://bitbucket.org/dtyu/granger-causality/wiki/Home.« less

  4. Identifying candidate genes for Type 2 Diabetes Mellitus and obesity through gene expression profiling in multiple tissues or cells.

    PubMed

    Chen, Junhui; Meng, Yuhuan; Zhou, Jinghui; Zhuo, Min; Ling, Fei; Zhang, Yu; Du, Hongli; Wang, Xiaoning

    2013-01-01

    Type 2 Diabetes Mellitus (T2DM) and obesity have become increasingly prevalent in recent years. Recent studies have focused on identifying causal variations or candidate genes for obesity and T2DM via analysis of expression quantitative trait loci (eQTL) within a single tissue. T2DM and obesity are affected by comprehensive sets of genes in multiple tissues. In the current study, gene expression levels in multiple human tissues from GEO datasets were analyzed, and 21 candidate genes displaying high percentages of differential expression were filtered out. Specifically, DENND1B, LYN, MRPL30, POC1B, PRKCB, RP4-655J12.3, HIBADH, and TMBIM4 were identified from the T2DM-control study, and BCAT1, BMP2K, CSRNP2, MYNN, NCKAP5L, SAP30BP, SLC35B4, SP1, BAP1, GRB14, HSP90AB1, ITGA5, and TOMM5 were identified from the obesity-control study. The majority of these genes are known to be involved in T2DM and obesity. Therefore, analysis of gene expression in various tissues using GEO datasets may be an effective and feasible method to determine novel or causal genes associated with T2DM and obesity.

  5. Leveraging lung tissue transcriptome to uncover candidate causal genes in COPD genetic associations.

    PubMed

    Lamontagne, Maxime; Bérubé, Jean-Christophe; Obeidat, Ma'en; Cho, Michael H; Hobbs, Brian D; Sakornsakolpat, Phuwanat; de Jong, Kim; Boezen, H Marike; Nickle, David; Hao, Ke; Timens, Wim; van den Berge, Maarten; Joubert, Philippe; Laviolette, Michel; Sin, Don D; Paré, Peter D; Bossé, Yohan

    2018-05-15

    Causal genes of chronic obstructive pulmonary disease (COPD) remain elusive. The current study aims at integrating genome-wide association studies (GWAS) and lung expression quantitative trait loci (eQTL) data to map COPD candidate causal genes and gain biological insights into the recently discovered COPD susceptibility loci. Two complementary genomic datasets on COPD were studied. First, the lung eQTL dataset which included whole-genome gene expression and genotyping data from 1038 individuals. Second, the largest COPD GWAS to date from the International COPD Genetics Consortium (ICGC) with 13 710 cases and 38 062 controls. Methods that integrated GWAS with eQTL signals including transcriptome-wide association study (TWAS), colocalization and Mendelian randomization-based (SMR) approaches were used to map causality genes, i.e. genes with the strongest evidence of being the functional effector at specific loci. These methods were applied at the genome-wide level and at COPD risk loci derived from the GWAS literature. Replication was performed using lung data from GTEx. We collated 129 non-overlapping risk loci for COPD from the GWAS literature. At the genome-wide scale, 12 new COPD candidate genes/loci were revealed and six replicated in GTEx including CAMK2A, DMPK, MYO15A, TNFRSF10A, BTN3A2 and TRBV30. In addition, we mapped candidate causal genes for 60 out of the 129 GWAS-nominated loci and 23 of them were replicated in GTEx. Mapping candidate causal genes in lung tissue represents an important contribution to the genetics of COPD, enriches our biological interpretation of GWAS findings, and brings us closer to clinical translation of genetic associations.

  6. Identifying interactions in the time and frequency domains in local and global networks - A Granger Causality Approach.

    PubMed

    Zou, Cunlu; Ladroue, Christophe; Guo, Shuixia; Feng, Jianfeng

    2010-06-21

    Reverse-engineering approaches such as Bayesian network inference, ordinary differential equations (ODEs) and information theory are widely applied to deriving causal relationships among different elements such as genes, proteins, metabolites, neurons, brain areas and so on, based upon multi-dimensional spatial and temporal data. There are several well-established reverse-engineering approaches to explore causal relationships in a dynamic network, such as ordinary differential equations (ODE), Bayesian networks, information theory and Granger Causality. Here we focused on Granger causality both in the time and frequency domain and in local and global networks, and applied our approach to experimental data (genes and proteins). For a small gene network, Granger causality outperformed all the other three approaches mentioned above. A global protein network of 812 proteins was reconstructed, using a novel approach. The obtained results fitted well with known experimental findings and predicted many experimentally testable results. In addition to interactions in the time domain, interactions in the frequency domain were also recovered. The results on the proteomic data and gene data confirm that Granger causality is a simple and accurate approach to recover the network structure. Our approach is general and can be easily applied to other types of temporal data.

  7. An Evaluation of Active Learning Causal Discovery Methods for Reverse-Engineering Local Causal Pathways of Gene Regulation

    PubMed Central

    Ma, Sisi; Kemmeren, Patrick; Aliferis, Constantin F.; Statnikov, Alexander

    2016-01-01

    Reverse-engineering of causal pathways that implicate diseases and vital cellular functions is a fundamental problem in biomedicine. Discovery of the local causal pathway of a target variable (that consists of its direct causes and direct effects) is essential for effective intervention and can facilitate accurate diagnosis and prognosis. Recent research has provided several active learning methods that can leverage passively observed high-throughput data to draft causal pathways and then refine the inferred relations with a limited number of experiments. The current study provides a comprehensive evaluation of the performance of active learning methods for local causal pathway discovery in real biological data. Specifically, 54 active learning methods/variants from 3 families of algorithms were applied for local causal pathways reconstruction of gene regulation for 5 transcription factors in S. cerevisiae. Four aspects of the methods’ performance were assessed, including adjacency discovery quality, edge orientation accuracy, complete pathway discovery quality, and experimental cost. The results of this study show that some methods provide significant performance benefits over others and therefore should be routinely used for local causal pathway discovery tasks. This study also demonstrates the feasibility of local causal pathway reconstruction in real biological systems with significant quality and low experimental cost. PMID:26939894

  8. Temporal Expression Profiling Identifies Pathways Mediating Effect of Causal Variant on Phenotype

    PubMed Central

    Gupta, Saumya; Radhakrishnan, Aparna; Raharja-Liu, Pandu; Lin, Gen; Steinmetz, Lars M.; Gagneur, Julien; Sinha, Himanshu

    2015-01-01

    Even with identification of multiple causal genetic variants for common human diseases, understanding the molecular processes mediating the causal variants’ effect on the disease remains a challenge. This understanding is crucial for the development of therapeutic strategies to prevent and treat disease. While static profiling of gene expression is primarily used to get insights into the biological bases of diseases, it makes differentiating the causative from the correlative effects difficult, as the dynamics of the underlying biological processes are not monitored. Using yeast as a model, we studied genome-wide gene expression dynamics in the presence of a causal variant as the sole genetic determinant, and performed allele-specific functional validation to delineate the causal effects of the genetic variant on the phenotype. Here, we characterized the precise genetic effects of a functional MKT1 allelic variant in sporulation efficiency variation. A mathematical model describing meiotic landmark events and conditional activation of MKT1 expression during sporulation specified an early meiotic role of this variant. By analyzing the early meiotic genome-wide transcriptional response, we demonstrate an MKT1-dependent role of novel modulators, namely, RTG1/3, regulators of mitochondrial retrograde signaling, and DAL82, regulator of nitrogen starvation, in additively effecting sporulation efficiency. In the presence of functional MKT1 allele, better respiration during early sporulation was observed, which was dependent on the mitochondrial retrograde regulator, RTG3. Furthermore, our approach showed that MKT1 contributes to sporulation independent of Puf3, an RNA-binding protein that steady-state transcription profiling studies have suggested to mediate MKT1-pleiotropic effects during mitotic growth. These results uncover interesting regulatory links between meiosis and mitochondrial retrograde signaling. In this study, we highlight the advantage of analyzing

  9. Identifying Seizure Onset Zone From the Causal Connectivity Inferred Using Directed Information

    NASA Astrophysics Data System (ADS)

    Malladi, Rakesh; Kalamangalam, Giridhar; Tandon, Nitin; Aazhang, Behnaam

    2016-10-01

    In this paper, we developed a model-based and a data-driven estimator for directed information (DI) to infer the causal connectivity graph between electrocorticographic (ECoG) signals recorded from brain and to identify the seizure onset zone (SOZ) in epileptic patients. Directed information, an information theoretic quantity, is a general metric to infer causal connectivity between time-series and is not restricted to a particular class of models unlike the popular metrics based on Granger causality or transfer entropy. The proposed estimators are shown to be almost surely convergent. Causal connectivity between ECoG electrodes in five epileptic patients is inferred using the proposed DI estimators, after validating their performance on simulated data. We then proposed a model-based and a data-driven SOZ identification algorithm to identify SOZ from the causal connectivity inferred using model-based and data-driven DI estimators respectively. The data-driven SOZ identification outperforms the model-based SOZ identification algorithm when benchmarked against visual analysis by neurologist, the current clinical gold standard. The causal connectivity analysis presented here is the first step towards developing novel non-surgical treatments for epilepsy.

  10. Toward the identification of causal genes in complex diseases: a gene-centric joint test of significance combining genomic and transcriptomic data.

    PubMed

    Charlesworth, Jac C; Peralta, Juan M; Drigalenko, Eugene; Göring, Harald Hh; Almasy, Laura; Dyer, Thomas D; Blangero, John

    2009-12-15

    Gene identification using linkage, association, or genome-wide expression is often underpowered. We propose that formal combination of information from multiple gene-identification approaches may lead to the identification of novel loci that are missed when only one form of information is available. Firstly, we analyze the Genetic Analysis Workshop 16 Framingham Heart Study Problem 2 genome-wide association data for HDL-cholesterol using a "gene-centric" approach. Then we formally combine the association test results with genome-wide transcriptional profiling data for high-density lipoprotein cholesterol (HDL-C), from the San Antonio Family Heart Study, using a Z-transform test (Stouffer's method). We identified 39 genes by the joint test at a conservative 1% false-discovery rate, including 9 from the significant gene-based association test and 23 whose expression was significantly correlated with HDL-C. Seven genes identified as significant in the joint test were not independently identified by either the association or expression tests. This combined approach has increased power and leads to the direct nomination of novel candidate genes likely to be involved in the determination of HDL-C levels. Such information can then be used as justification for a more exhaustive search for functional sequence variation within the nominated genes. We anticipate that this type of analysis will improve our speed of identification of regulatory genes causally involved in disease risk.

  11. Knowing Who Dunnit: Infants Identify the Causal Agent in an Unseen Causal Interaction

    ERIC Educational Resources Information Center

    Saxe, Rebecca; Tzelnic, Tania; Carey, Susan

    2007-01-01

    Preverbal infants can represent the causal structure of events, including distinguishing the agentive and receptive roles and categorizing entities according to stable causal dispositions. This study investigated how infants combine these 2 kinds of causal inference. In Experiments 1 and 2, 9.5-month-olds used the position of a human hand or a…

  12. Prophetic Granger Causality to infer gene regulatory networks.

    PubMed

    Carlin, Daniel E; Paull, Evan O; Graim, Kiley; Wong, Christopher K; Bivol, Adrian; Ryabinin, Peter; Ellrott, Kyle; Sokolov, Artem; Stuart, Joshua M

    2017-01-01

    We introduce a novel method called Prophetic Granger Causality (PGC) for inferring gene regulatory networks (GRNs) from protein-level time series data. The method uses an L1-penalized regression adaptation of Granger Causality to model protein levels as a function of time, stimuli, and other perturbations. When combined with a data-independent network prior, the framework outperformed all other methods submitted to the HPN-DREAM 8 breast cancer network inference challenge. Our investigations reveal that PGC provides complementary information to other approaches, raising the performance of ensemble learners, while on its own achieves moderate performance. Thus, PGC serves as a valuable new tool in the bioinformatics toolkit for analyzing temporal datasets. We investigate the general and cell-specific interactions predicted by our method and find several novel interactions, demonstrating the utility of the approach in charting new tumor wiring.

  13. Prophetic Granger Causality to infer gene regulatory networks

    PubMed Central

    Carlin, Daniel E.; Paull, Evan O.; Graim, Kiley; Wong, Christopher K.; Bivol, Adrian; Ryabinin, Peter; Ellrott, Kyle; Sokolov, Artem

    2017-01-01

    We introduce a novel method called Prophetic Granger Causality (PGC) for inferring gene regulatory networks (GRNs) from protein-level time series data. The method uses an L1-penalized regression adaptation of Granger Causality to model protein levels as a function of time, stimuli, and other perturbations. When combined with a data-independent network prior, the framework outperformed all other methods submitted to the HPN-DREAM 8 breast cancer network inference challenge. Our investigations reveal that PGC provides complementary information to other approaches, raising the performance of ensemble learners, while on its own achieves moderate performance. Thus, PGC serves as a valuable new tool in the bioinformatics toolkit for analyzing temporal datasets. We investigate the general and cell-specific interactions predicted by our method and find several novel interactions, demonstrating the utility of the approach in charting new tumor wiring. PMID:29211761

  14. Identifying direct miRNA-mRNA causal regulatory relationships in heterogeneous data.

    PubMed

    Zhang, Junpeng; Le, Thuc Duy; Liu, Lin; Liu, Bing; He, Jianfeng; Goodall, Gregory J; Li, Jiuyong

    2014-12-01

    Discovering the regulatory relationships between microRNAs (miRNAs) and mRNAs is an important problem that interests many biologists and medical researchers. A number of computational methods have been proposed to infer miRNA-mRNA regulatory relationships, and are mostly based on the statistical associations between miRNAs and mRNAs discovered in observational data. The miRNA-mRNA regulatory relationships identified by these methods can be both direct and indirect regulations. However, differentiating direct regulatory relationships from indirect ones is important for biologists in experimental designs. In this paper, we present a causal discovery based framework (called DirectTarget) to infer direct miRNA-mRNA causal regulatory relationships in heterogeneous data, including expression profiles of miRNAs and mRNAs, and miRNA target information. DirectTarget is applied to the Epithelial to Mesenchymal Transition (EMT) datasets. The validation by experimentally confirmed target databases suggests that the proposed method can effectively identify direct miRNA-mRNA regulatory relationships. To explore the upstream regulators of miRNA regulation, we further identify the causal feedforward patterns (CFFPs) of TF-miRNA-mRNA to provide insights into the miRNA regulation in EMT. DirectTarget has the potential to be applied to other datasets to elucidate the direct miRNA-mRNA causal regulatory relationships and to explore the regulatory patterns. Copyright © 2014 Elsevier Inc. All rights reserved.

  15. A transposon-based genetic screen in mice identifies genes altered in colorectal cancer.

    PubMed

    Starr, Timothy K; Allaei, Raha; Silverstein, Kevin A T; Staggs, Rodney A; Sarver, Aaron L; Bergemann, Tracy L; Gupta, Mihir; O'Sullivan, M Gerard; Matise, Ilze; Dupuy, Adam J; Collier, Lara S; Powers, Scott; Oberg, Ann L; Asmann, Yan W; Thibodeau, Stephen N; Tessarollo, Lino; Copeland, Neal G; Jenkins, Nancy A; Cormier, Robert T; Largaespada, David A

    2009-03-27

    Human colorectal cancers (CRCs) display a large number of genetic and epigenetic alterations, some of which are causally involved in tumorigenesis (drivers) and others that have little functional impact (passengers). To help distinguish between these two classes of alterations, we used a transposon-based genetic screen in mice to identify candidate genes for CRC. Mice harboring mutagenic Sleeping Beauty (SB) transposons were crossed with mice expressing SB transposase in gastrointestinal tract epithelium. Most of the offspring developed intestinal lesions, including intraepithelial neoplasia, adenomas, and adenocarcinomas. Analysis of over 16,000 transposon insertions identified 77 candidate CRC genes, 60 of which are mutated and/or dysregulated in human CRC and thus are most likely to drive tumorigenesis. These genes include APC, PTEN, and SMAD4. The screen also identified 17 candidate genes that had not previously been implicated in CRC, including POLI, PTPRK, and RSPO2.

  16. CaSPIAN: A Causal Compressive Sensing Algorithm for Discovering Directed Interactions in Gene Networks

    PubMed Central

    Emad, Amin; Milenkovic, Olgica

    2014-01-01

    We introduce a novel algorithm for inference of causal gene interactions, termed CaSPIAN (Causal Subspace Pursuit for Inference and Analysis of Networks), which is based on coupling compressive sensing and Granger causality techniques. The core of the approach is to discover sparse linear dependencies between shifted time series of gene expressions using a sequential list-version of the subspace pursuit reconstruction algorithm and to estimate the direction of gene interactions via Granger-type elimination. The method is conceptually simple and computationally efficient, and it allows for dealing with noisy measurements. Its performance as a stand-alone platform without biological side-information was tested on simulated networks, on the synthetic IRMA network in Saccharomyces cerevisiae, and on data pertaining to the human HeLa cell network and the SOS network in E. coli. The results produced by CaSPIAN are compared to the results of several related algorithms, demonstrating significant improvements in inference accuracy of documented interactions. These findings highlight the importance of Granger causality techniques for reducing the number of false-positives, as well as the influence of noise and sampling period on the accuracy of the estimates. In addition, the performance of the method was tested in conjunction with biological side information of the form of sparse “scaffold networks”, to which new edges were added using available RNA-seq or microarray data. These biological priors aid in increasing the sensitivity and precision of the algorithm in the small sample regime. PMID:24622336

  17. An integrative systems genetics approach reveals potential causal genes and pathways related to obesity.

    PubMed

    Kogelman, Lisette J A; Zhernakova, Daria V; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N

    2015-10-20

    Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach. Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data. Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. Furthermore, we

  18. A Sparse Reconstruction Approach for Identifying Gene Regulatory Networks Using Steady-State Experiment Data

    PubMed Central

    Zhang, Wanhong; Zhou, Tong

    2015-01-01

    Motivation Identifying gene regulatory networks (GRNs) which consist of a large number of interacting units has become a problem of paramount importance in systems biology. Situations exist extensively in which causal interacting relationships among these units are required to be reconstructed from measured expression data and other a priori information. Though numerous classical methods have been developed to unravel the interactions of GRNs, these methods either have higher computing complexities or have lower estimation accuracies. Note that great similarities exist between identification of genes that directly regulate a specific gene and a sparse vector reconstruction, which often relates to the determination of the number, location and magnitude of nonzero entries of an unknown vector by solving an underdetermined system of linear equations y = Φx. Based on these similarities, we propose a novel framework of sparse reconstruction to identify the structure of a GRN, so as to increase accuracy of causal regulation estimations, as well as to reduce their computational complexity. Results In this paper, a sparse reconstruction framework is proposed on basis of steady-state experiment data to identify GRN structure. Different from traditional methods, this approach is adopted which is well suitable for a large-scale underdetermined problem in inferring a sparse vector. We investigate how to combine the noisy steady-state experiment data and a sparse reconstruction algorithm to identify causal relationships. Efficiency of this method is tested by an artificial linear network, a mitogen-activated protein kinase (MAPK) pathway network and the in silico networks of the DREAM challenges. The performance of the suggested approach is compared with two state-of-the-art algorithms, the widely adopted total least-squares (TLS) method and those available results on the DREAM project. Actual results show that, with a lower computational cost, the proposed method can

  19. QTL Mapping and CRISPR/Cas9 Editing to Identify a Drug Resistance Gene in Toxoplasma gondii.

    PubMed

    Shen, Bang; Powell, Robin H; Behnke, Michael S

    2017-06-22

    Scientific knowledge is intrinsically linked to available technologies and methods. This article will present two methods that allowed for the identification and verification of a drug resistance gene in the Apicomplexan parasite Toxoplasma gondii, the method of Quantitative Trait Locus (QTL) mapping using a Whole Genome Sequence (WGS) -based genetic map and the method of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/Cas9 -based gene editing. The approach of QTL mapping allows one to test if there is a correlation between a genomic region(s) and a phenotype. Two datasets are required to run a QTL scan, a genetic map based on the progeny of a recombinant cross and a quantifiable phenotype assessed in each of the progeny of that cross. These datasets are then formatted to be compatible with R/qtl software that generates a QTL scan to identify significant loci correlated with the phenotype. Although this can greatly narrow the search window of possible candidates, QTLs span regions containing a number of genes from which the causal gene needs to be identified. Having WGS of the progeny was critical to identify the causal drug resistance mutation at the gene level. Once identified, the candidate mutation can be verified by genetic manipulation of drug sensitive parasites. The most facile and efficient method to genetically modify T. gondii is the CRISPR/Cas9 system. This system comprised of just 2 components both encoded on a single plasmid, a single guide RNA (gRNA) containing a 20 bp sequence complementary to the genomic target and the Cas9 endonuclease that generates a double-strand DNA break (DSB) at the target, repair of which allows for insertion or deletion of sequences around the break site. This article provides detailed protocols to use CRISPR/Cas9 based genome editing tools to verify the gene responsible for sinefungin resistance and to construct transgenic parasites.

  20. A transcriptome-wide association study of 229,000 women identifies new candidate susceptibility genes for breast cancer.

    PubMed

    Wu, Lang; Shi, Wei; Long, Jirong; Guo, Xingyi; Michailidou, Kyriaki; Beesley, Jonathan; Bolla, Manjeet K; Shu, Xiao-Ou; Lu, Yingchang; Cai, Qiuyin; Al-Ejeh, Fares; Rozali, Esdy; Wang, Qin; Dennis, Joe; Li, Bingshan; Zeng, Chenjie; Feng, Helian; Gusev, Alexander; Barfield, Richard T; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Aronson, Kristan J; Auer, Paul L; Barrdahl, Myrto; Baynes, Caroline; Beckmann, Matthias W; Benitez, Javier; Bermisheva, Marina; Blomqvist, Carl; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broberg, Per; Brucker, Sara Y; Burwinkel, Barbara; Caldés, Trinidad; Canzian, Federico; Carter, Brian D; Castelao, J Esteban; Chang-Claude, Jenny; Chen, Xiaoqing; Cheng, Ting-Yuan David; Christiansen, Hans; Clarke, Christine L; Collée, Margriet; Cornelissen, Sten; Couch, Fergus J; Cox, David; Cox, Angela; Cross, Simon S; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Devilee, Peter; Doheny, Kimberly F; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dumont, Martine; Dwek, Miriam; Eccles, Diana M; Eilber, Ursula; Eliassen, A Heather; Engel, Christoph; Eriksson, Mikael; Fachal, Laura; Fasching, Peter A; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gabrielson, Marike; Gago-Dominguez, Manuela; Gapstur, Susan M; García-Closas, Montserrat; Gaudet, Mia M; Ghoussaini, Maya; Giles, Graham G; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Guénel, Pascal; Hahnen, Eric; Haiman, Christopher A; Håkansson, Niclas; Hall, Per; Hallberg, Emily; Hamann, Ute; Harrington, Patricia; Hein, Alexander; Hicks, Belynda; Hillemanns, Peter; Hollestelle, Antoinette; Hoover, Robert N; Hopper, John L; Huang, Guanmengqian; Humphreys, Keith; Hunter, David J; Jakubowska, Anna; Janni, Wolfgang; John, Esther M; Johnson, Nichola; Jones, Kristine; Jones, Michael E; Jung, Audrey; Kaaks, Rudolf; Kerin, Michael J; Khusnutdinova, Elza; Kosma, Veli-Matti; Kristensen, Vessela N; Lambrechts, Diether; Le Marchand, Loic; Li, Jingmei; Lindström, Sara; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Lubinski, Jan; Luccarini, Craig; Lux, Michael P; MacInnis, Robert J; Maishman, Tom; Kostovska, Ivana Maleva; Mannermaa, Arto; Manson, JoAnn E; Margolin, Sara; Mavroudis, Dimitrios; Meijers-Heijboer, Hanne; Meindl, Alfons; Menon, Usha; Meyer, Jeffery; Mulligan, Anna Marie; Neuhausen, Susan L; Nevanlinna, Heli; Neven, Patrick; Nielsen, Sune F; Nordestgaard, Børge G; Olopade, Olufunmilayo I; Olson, Janet E; Olsson, Håkan; Peterlongo, Paolo; Peto, Julian; Plaseska-Karanfilska, Dijana; Prentice, Ross; Presneau, Nadege; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rennert, Gad; Rennert, Hedy S; Rhenius, Valerie; Romero, Atocha; Romm, Jane; Rudolph, Anja; Saloustros, Emmanouil; Sandler, Dale P; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Schneeweiss, Andreas; Scott, Rodney J; Scott, Christopher G; Seal, Sheila; Shah, Mitul; Shrubsole, Martha J; Smeets, Ann; Southey, Melissa C; Spinelli, John J; Stone, Jennifer; Surowy, Harald; Swerdlow, Anthony J; Tamimi, Rulla M; Tapper, William; Taylor, Jack A; Terry, Mary Beth; Tessier, Daniel C; Thomas, Abigail; Thöne, Kathrin; Tollenaar, Rob A E M; Torres, Diana; Truong, Thérèse; Untch, Michael; Vachon, Celine; Van Den Berg, David; Vincent, Daniel; Waisfisz, Quinten; Weinberg, Clarice R; Wendt, Camilla; Whittemore, Alice S; Wildiers, Hans; Willett, Walter C; Winqvist, Robert; Wolk, Alicja; Xia, Lucy; Yang, Xiaohong R; Ziogas, Argyrios; Ziv, Elad; Dunning, Alison M; Pharoah, Paul D P; Simard, Jacques; Milne, Roger L; Edwards, Stacey L; Kraft, Peter; Easton, Douglas F; Chenevix-Trench, Georgia; Zheng, Wei

    2018-06-18

    The breast cancer risk variants identified in genome-wide association studies explain only a small fraction of the familial relative risk, and the genes responsible for these associations remain largely unknown. To identify novel risk loci and likely causal genes, we performed a transcriptome-wide association study evaluating associations of genetically predicted gene expression with breast cancer risk in 122,977 cases and 105,974 controls of European ancestry. We used data from the Genotype-Tissue Expression Project to establish genetic models to predict gene expression in breast tissue and evaluated model performance using data from The Cancer Genome Atlas. Of the 8,597 genes evaluated, significant associations were identified for 48 at a Bonferroni-corrected threshold of P < 5.82 × 10 -6 , including 14 genes at loci not yet reported for breast cancer. We silenced 13 genes and showed an effect for 11 on cell proliferation and/or colony-forming efficiency. Our study provides new insights into breast cancer genetics and biology.

  1. Pathway-driven gene stability selection of two rheumatoid arthritis GWAS identifies and validates new susceptibility genes in receptor mediated signalling pathways.

    PubMed

    Eleftherohorinou, Hariklia; Hoggart, Clive J; Wright, Victoria J; Levin, Michael; Coin, Lachlan J M

    2011-09-01

    Rheumatoid arthritis (RA) is the commonest chronic, systemic, inflammatory disorder affecting ∼1% of the world population. It has a strong genetic component and a growing number of associated genes have been discovered in genome-wide association studies (GWAS), which nevertheless only account for 23% of the total genetic risk. We aimed to identify additional susceptibility loci through the analysis of GWAS in the context of biological function. We bridge the gap between pathway and gene-oriented analyses of GWAS, by introducing a pathway-driven gene stability-selection methodology that identifies potential causal genes in the top-associated disease pathways that may be driving the pathway association signals. We analysed the WTCCC and the NARAC studies of ∼5000 and ∼2000 subjects, respectively. We examined 700 pathways comprising ∼8000 genes. Ranking pathways by significance revealed that the NARAC top-ranked ∼6% laid within the top 10% of WTCCC. Gene selection on those pathways identified 58 genes in WTCCC and 61 in NARAC; 21 of those were common (P(overlap)< 10(-21)), of which 16 were novel discoveries. Among the identified genes, we validated 10 known RA associations in WTCCC and 13 in NARAC, not discovered using single-SNP approaches on the same data. Gene ontology functional enrichment analysis on the identified genes showed significant over-representation of signalling activity (P< 10(-29)) in both studies. Our findings suggest a novel model of RA genetic predisposition, which involves cell-membrane receptors and genes in second messenger signalling systems, in addition to genes that regulate immune responses, which have been the focus of interest previously.

  2. QTL Mapping and CRISPR/Cas9 Editing to Identify a Drug Resistance Gene in Toxoplasma gondii

    PubMed Central

    Shen, Bang; Powell, Robin H.; Behnke, Michael S.

    2017-01-01

    Scientific knowledge is intrinsically linked to available technologies and methods. This article will present two methods that allowed for the identification and verification of a drug resistance gene in the Apicomplexan parasite Toxoplasma gondii, the method of Quantitative Trait Locus (QTL) mapping using a Whole Genome Sequence (WGS) -based genetic map and the method of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/Cas9 -based gene editing. The approach of QTL mapping allows one to test if there is a correlation between a genomic region(s) and a phenotype. Two datasets are required to run a QTL scan, a genetic map based on the progeny of a recombinant cross and a quantifiable phenotype assessed in each of the progeny of that cross. These datasets are then formatted to be compatible with R/qtl software that generates a QTL scan to identify significant loci correlated with the phenotype. Although this can greatly narrow the search window of possible candidates, QTLs span regions containing a number of genes from which the causal gene needs to be identified. Having WGS of the progeny was critical to identify the causal drug resistance mutation at the gene level. Once identified, the candidate mutation can be verified by genetic manipulation of drug sensitive parasites. The most facile and efficient method to genetically modify T. gondii is the CRISPR/Cas9 system. This system comprised of just 2 components both encoded on a single plasmid, a single guide RNA (gRNA) containing a 20 bp sequence complementary to the genomic target and the Cas9 endonuclease that generates a double-strand DNA break (DSB) at the target, repair of which allows for insertion or deletion of sequences around the break site. This article provides detailed protocols to use CRISPR/Cas9 based genome editing tools to verify the gene responsible for sinefungin resistance and to construct transgenic parasites. PMID:28671645

  3. Under What Assumptions Do Site-by-Treatment Instruments Identify Average Causal Effects?

    ERIC Educational Resources Information Center

    Reardon, Sean F.; Raudenbush, Stephen W.

    2011-01-01

    The purpose of this paper is to clarify the assumptions that must be met if this--multiple site, multiple mediator--strategy, hereafter referred to as "MSMM," is to identify the average causal effects (ATE) in the populations of interest. The authors' investigation of the assumptions of the multiple-mediator, multiple-site IV model demonstrates…

  4. Partial Granger causality--eliminating exogenous inputs and latent variables.

    PubMed

    Guo, Shuixia; Seth, Anil K; Kendrick, Keith M; Zhou, Cong; Feng, Jianfeng

    2008-07-15

    Attempts to identify causal interactions in multivariable biological time series (e.g., gene data, protein data, physiological data) can be undermined by the confounding influence of environmental (exogenous) inputs. Compounding this problem, we are commonly only able to record a subset of all related variables in a system. These recorded variables are likely to be influenced by unrecorded (latent) variables. To address this problem, we introduce a novel variant of a widely used statistical measure of causality--Granger causality--that is inspired by the definition of partial correlation. Our 'partial Granger causality' measure is extensively tested with toy models, both linear and nonlinear, and is applied to experimental data: in vivo multielectrode array (MEA) local field potentials (LFPs) recorded from the inferotemporal cortex of sheep. Our results demonstrate that partial Granger causality can reveal the underlying interactions among elements in a network in the presence of exogenous inputs and latent variables in many cases where the existing conditional Granger causality fails.

  5. Significance of functional disease-causal/susceptible variants identified by whole-genome analyses for the understanding of human diseases.

    PubMed

    Hitomi, Yuki; Tokunaga, Katsushi

    2017-01-01

    Human genome variation may cause differences in traits and disease risks. Disease-causal/susceptible genes and variants for both common and rare diseases can be detected by comprehensive whole-genome analyses, such as whole-genome sequencing (WGS), using next-generation sequencing (NGS) technology and genome-wide association studies (GWAS). Here, in addition to the application of an NGS as a whole-genome analysis method, we summarize approaches for the identification of functional disease-causal/susceptible variants from abundant genetic variants in the human genome and methods for evaluating their functional effects in human diseases, using an NGS and in silico and in vitro functional analyses. We also discuss the clinical applications of the functional disease causal/susceptible variants to personalized medicine.

  6. Identifying Mendelian disease genes with the Variant Effect Scoring Tool

    PubMed Central

    2013-01-01

    Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~ 45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. VEST is

  7. Identifying Causal Risk Factors for Violence among Discharged Patients

    PubMed Central

    Coid, Jeremy W.; Kallis, Constantinos; Doyle, Mike; Shaw, Jenny; Ullrich, Simone

    2015-01-01

    Background Structured Professional Judgement (SPJ) is routinely administered in mental health and criminal justice settings but cannot identify violence risk above moderate accuracy. There is no current evidence that violence can be prevented using SPJ. This may be explained by routine application of predictive instead of causal statistical models when standardising SPJ instruments. Methods We carried out a prospective cohort study of 409 male and female patients discharged from medium secure services in England and Wales to the community. Measures were taken at baseline (pre-discharge), 6 and 12 months post-discharge using the Historical, Clinical and Risk-20 items version 3 (HCR-20v3) and Structural Assessment of Protective Factors (SAPROF). Information on violence was obtained via the McArthur community violence instrument and the Police National Computer. Results In a lagged model, HCR-20v3 and SAPROF items were poor predictors of violence. Eight items of the HCR-20v3 and 4 SAPROF items did not predict violent behaviour better than chance. In re-analyses considering temporal proximity of risk/ protective factors (exposure) on violence (outcome), risk was elevated due to violent ideation (OR 6.98, 95% CI 13.85–12.65, P<0.001), instability (OR 5.41, 95% CI 3.44–8.50, P<0.001), and poor coping/ stress (OR 8.35, 95% CI 4.21–16.57, P<0.001). All 3 risk factors were explanatory variables which drove the association with violent outcome. Self-control (OR 0.13, 95% CI 0.08–0.24, P<0.001) conveyed protective effects and explained the association of other protective factors with violence. Conclusions Using two standardised SPJ instruments, predictive (lagged) methods could not identify risk and protective factors which must be targeted in interventions for discharged patients with severe mental illness. Predictive methods should be abandoned if the aim is to progress from risk assessment to effective risk management and replaced by methods which identify factors

  8. Dynamics and causalities of atmospheric and oceanic data identified by complex networks and Granger causality analysis

    NASA Astrophysics Data System (ADS)

    Charakopoulos, A. K.; Katsouli, G. A.; Karakasidis, T. E.

    2018-04-01

    Understanding the underlying processes and extracting detailed characteristics of spatiotemporal dynamics of ocean and atmosphere as well as their interaction is of significant interest and has not been well thoroughly established. The purpose of this study was to examine the performance of two main additional methodologies for the identification of spatiotemporal underlying dynamic characteristics and patterns among atmospheric and oceanic variables from Seawatch buoys from Aegean and Ionian Sea, provided by the Hellenic Center for Marine Research (HCMR). The first approach involves the estimation of cross correlation analysis in an attempt to investigate time-lagged relationships, and further in order to identify the direction of interactions between the variables we performed the Granger causality method. According to the second approach the time series are converted into complex networks and then the main topological network properties such as degree distribution, average path length, diameter, modularity and clustering coefficient are evaluated. Our results show that the proposed analysis of complex network analysis of time series can lead to the extraction of hidden spatiotemporal characteristics. Also our findings indicate high level of positive and negative correlations and causalities among variables, both from the same buoy and also between buoys from different stations, which cannot be determined from the use of simple statistical measures.

  9. Estimating causal effects with a non-paranormal method for the design of efficient intervention experiments

    PubMed Central

    2014-01-01

    Background Knockdown or overexpression of genes is widely used to identify genes that play important roles in many aspects of cellular functions and phenotypes. Because next-generation sequencing generates high-throughput data that allow us to detect genes, it is important to identify genes that drive functional and phenotypic changes of cells. However, conventional methods rely heavily on the assumption of normality and they often give incorrect results when the assumption is not true. To relax the Gaussian assumption in causal inference, we introduce the non-paranormal method to test conditional independence in the PC-algorithm. Then, we present the non-paranormal intervention-calculus when the directed acyclic graph (DAG) is absent (NPN-IDA), which incorporates the cumulative nature of effects through a cascaded pathway via causal inference for ranking causal genes against a phenotype with the non-paranormal method for estimating DAGs. Results We demonstrate that causal inference with the non-paranormal method significantly improves the performance in estimating DAGs on synthetic data in comparison with the original PC-algorithm. Moreover, we show that NPN-IDA outperforms the conventional methods in exploring regulators of the flowering time in Arabidopsis thaliana and regulators that control the browning of white adipocytes in mice. Our results show that performance improvement in estimating DAGs contributes to an accurate estimation of causal effects. Conclusions Although the simplest alternative procedure was used, our proposed method enables us to design efficient intervention experiments and can be applied to a wide range of research purposes, including drug discovery, because of its generality. PMID:24980787

  10. Estimating causal effects with a non-paranormal method for the design of efficient intervention experiments.

    PubMed

    Teramoto, Reiji; Saito, Chiaki; Funahashi, Shin-ichi

    2014-06-30

    Knockdown or overexpression of genes is widely used to identify genes that play important roles in many aspects of cellular functions and phenotypes. Because next-generation sequencing generates high-throughput data that allow us to detect genes, it is important to identify genes that drive functional and phenotypic changes of cells. However, conventional methods rely heavily on the assumption of normality and they often give incorrect results when the assumption is not true. To relax the Gaussian assumption in causal inference, we introduce the non-paranormal method to test conditional independence in the PC-algorithm. Then, we present the non-paranormal intervention-calculus when the directed acyclic graph (DAG) is absent (NPN-IDA), which incorporates the cumulative nature of effects through a cascaded pathway via causal inference for ranking causal genes against a phenotype with the non-paranormal method for estimating DAGs. We demonstrate that causal inference with the non-paranormal method significantly improves the performance in estimating DAGs on synthetic data in comparison with the original PC-algorithm. Moreover, we show that NPN-IDA outperforms the conventional methods in exploring regulators of the flowering time in Arabidopsis thaliana and regulators that control the browning of white adipocytes in mice. Our results show that performance improvement in estimating DAGs contributes to an accurate estimation of causal effects. Although the simplest alternative procedure was used, our proposed method enables us to design efficient intervention experiments and can be applied to a wide range of research purposes, including drug discovery, because of its generality.

  11. QTL and gene expression analyses identify genes affecting carcass weight and marbling on BTA14 in Hanwoo (Korean Cattle).

    PubMed

    Lee, Seung Hwan; van der Werf, J H J; Kim, Nam Kuk; Lee, Sang Hong; Gondro, C; Park, Eung Woo; Oh, Sung Jong; Gibson, J P; Thompson, J M

    2011-10-01

    Causal mutations affecting quantitative trait variation can be good targets for marker-assisted selection for carcass traits in beef cattle. In this study, linkage and linkage disequilibrium analysis (LDLA) for four carcass traits was undertaken using 19 markers on bovine chromosome 14. The LDLA analysis detected quantitative trait loci (QTL) for carcass weight (CWT) and eye muscle area (EMA) at the same position at around 50 cM and surrounded by the markers FABP4SNP2774C>G and FABP4_μsat3237. The QTL for marbling (MAR) was identified at the midpoint of markers BMS4513 and RM137 in a 3.5-cM marker interval. The most likely position for a second QTL for CWT was found at the midpoint of tenth marker bracket (FABP4SNP2774C>G and FABP4_μsat3237). For this marker bracket, the total number of haplotypes was 34 with a most common frequency of 0.118. Effects of haplotypes on CWT varied from a -5-kg deviation for haplotype 6 to +8 kg for haplotype 23. To determine which genes contribute to the QTL effect, gene expression analysis was performed in muscle for a wide range of phenotypes. The results demonstrate that two genes, LOC781182 (p = 0.002) and TRPS1 (p = 0.006) were upregulated with increasing CWT and EMA, whereas only LOC614744 (p = 0.04) has a significant effect on intramuscular fat (IMF) content. Two genetic markers detected in FABP4 were the most likely QTL position in this QTL study, but FABP4 did not show a significant effect on both traits (CWT and EMA) in gene expression analysis. We conclude that three genes could be potential causal genes affecting carcass traits CWT, EMA, and IMF in Hanwoo.

  12. Causal relationship between the AHSG gene and BMD through fetuin-A and BMI: multiple mediation analysis.

    PubMed

    Sritara, C; Thakkinstian, A; Ongphiphadhanakul, B; Chailurkit, L; Chanprasertyothin, S; Ratanachaiwong, W; Vathesatogkit, P; Sritara, P

    2014-05-01

    Using mediation analysis, a causal relationship between the AHSG gene and bone mineral density (BMD) through fetuin-A and body mass index (BMI) mediators was suggested. Fetuin-A, a multifunctional protein of hepatic origin, is associated with bone mineral density. It is unclear if this association is causal. This study aimed at clarification of this issue. A cross-sectional study was conducted among 1,741 healthy workers from the Electricity Generating Authority of Thailand (EGAT) cohort. The alpha-2-Heremans-Schmid glycoprotein (AHSG) rs2248690 gene was genotyped. Three mediation models were constructed using seemingly unrelated regression analysis. First, the ln[fetuin-A] group was regressed on the AHSG gene. Second, the BMI group was regressed on the AHSG gene and the ln[fetuin-A] group. Finally, the BMD model was constructed by fitting BMD on two mediators (ln[fetuin-A] and BMI) and the independent AHSG variable. All three analyses were adjusted for confounders. The prevalence of the minor T allele for the AHSG locus was 15.2%. The AHSG locus was highly related to serum fetuin-A levels (P < 0.001). Multiple mediation analyses showed that AHSG was significantly associated with BMD through the ln[fetuin-A] and BMI pathway, with beta coefficients of 0.0060 (95% CI 0.0038, 0.0083) and 0.0030 (95% CI 0.0020, 0.0045) at the total hip and lumbar spine, respectively. About 27.3 and 26.0% of total genetic effects on hip and spine BMD, respectively, were explained by the mediation effects of fetuin-A and BMI. Our study suggested evidence of a causal relationship between the AHSG gene and BMD through fetuin-A and BMI mediators.

  13. An innovative strategy for the molecular diagnosis of Usher syndrome identifies causal biallelic mutations in 93% of European patients.

    PubMed

    Bonnet, Crystel; Riahi, Zied; Chantot-Bastaraud, Sandra; Smagghe, Luce; Letexier, Mélanie; Marcaillou, Charles; Lefèvre, Gaëlle M; Hardelin, Jean-Pierre; El-Amraoui, Aziz; Singh-Estivalet, Amrit; Mohand-Saïd, Saddek; Kohl, Susanne; Kurtenbach, Anne; Sliesoraityte, Ieva; Zobor, Ditta; Gherbi, Souad; Testa, Francesco; Simonelli, Francesca; Banfi, Sandro; Fakin, Ana; Glavač, Damjan; Jarc-Vidmar, Martina; Zupan, Andrej; Battelino, Saba; Martorell Sampol, Loreto; Claveria, Maria Antonia; Catala Mora, Jaume; Dad, Shzeena; Møller, Lisbeth B; Rodriguez Jorge, Jesus; Hawlina, Marko; Auricchio, Alberto; Sahel, José-Alain; Marlin, Sandrine; Zrenner, Eberhart; Audo, Isabelle; Petit, Christine

    2016-12-01

    Usher syndrome (USH), the most prevalent cause of hereditary deafness-blindness, is an autosomal recessive and genetically heterogeneous disorder. Three clinical subtypes (USH1-3) are distinguishable based on the severity of the sensorineural hearing impairment, the presence or absence of vestibular dysfunction, and the age of onset of the retinitis pigmentosa. A total of 10 causal genes, 6 for USH1, 3 for USH2, and 1 for USH3, and an USH2 modifier gene, have been identified. A robust molecular diagnosis is required not only to improve genetic counseling, but also to advance gene therapy in USH patients. Here, we present an improved diagnostic strategy that is both cost- and time-effective. It relies on the sequential use of three different techniques to analyze selected genomic regions: targeted exome sequencing, comparative genome hybridization, and quantitative exon amplification. We screened a large cohort of 427 patients (139 USH1, 282 USH2, and six of undefined clinical subtype) from various European medical centers for mutations in all USH genes and the modifier gene. We identified a total of 421 different sequence variants predicted to be pathogenic, about half of which had not been previously reported. Remarkably, we detected large genomic rearrangements, most of which were novel and unique, in 9% of the patients. Thus, our strategy led to the identification of biallelic and monoallelic mutations in 92.7% and 5.8% of the USH patients, respectively. With an overall 98.5% mutation characterization rate, the diagnosis efficiency was substantially improved compared with previously reported methods.

  14. An innovative strategy for the molecular diagnosis of Usher syndrome identifies causal biallelic mutations in 93% of European patients

    PubMed Central

    Bonnet, Crystel; Riahi, Zied; Chantot-Bastaraud, Sandra; Smagghe, Luce; Letexier, Mélanie; Marcaillou, Charles; Lefèvre, Gaëlle M; Hardelin, Jean-Pierre; El-Amraoui, Aziz; Singh-Estivalet, Amrit; Mohand-Saïd, Saddek; Kohl, Susanne; Kurtenbach, Anne; Sliesoraityte, Ieva; Zobor, Ditta; Gherbi, Souad; Testa, Francesco; Simonelli, Francesca; Banfi, Sandro; Fakin, Ana; Glavač, Damjan; Jarc-Vidmar, Martina; Zupan, Andrej; Battelino, Saba; Martorell Sampol, Loreto; Claveria, Maria Antonia; Catala Mora, Jaume; Dad, Shzeena; Møller, Lisbeth B; Rodriguez Jorge, Jesus; Hawlina, Marko; Auricchio, Alberto; Sahel, José-Alain; Marlin, Sandrine; Zrenner, Eberhart; Audo, Isabelle; Petit, Christine

    2016-01-01

    Usher syndrome (USH), the most prevalent cause of hereditary deafness–blindness, is an autosomal recessive and genetically heterogeneous disorder. Three clinical subtypes (USH1–3) are distinguishable based on the severity of the sensorineural hearing impairment, the presence or absence of vestibular dysfunction, and the age of onset of the retinitis pigmentosa. A total of 10 causal genes, 6 for USH1, 3 for USH2, and 1 for USH3, and an USH2 modifier gene, have been identified. A robust molecular diagnosis is required not only to improve genetic counseling, but also to advance gene therapy in USH patients. Here, we present an improved diagnostic strategy that is both cost- and time-effective. It relies on the sequential use of three different techniques to analyze selected genomic regions: targeted exome sequencing, comparative genome hybridization, and quantitative exon amplification. We screened a large cohort of 427 patients (139 USH1, 282 USH2, and six of undefined clinical subtype) from various European medical centers for mutations in all USH genes and the modifier gene. We identified a total of 421 different sequence variants predicted to be pathogenic, about half of which had not been previously reported. Remarkably, we detected large genomic rearrangements, most of which were novel and unique, in 9% of the patients. Thus, our strategy led to the identification of biallelic and monoallelic mutations in 92.7% and 5.8% of the USH patients, respectively. With an overall 98.5% mutation characterization rate, the diagnosis efficiency was substantially improved compared with previously reported methods. PMID:27460420

  15. 15 years of research on Oral-Facial-Digital syndromes: from 1 to 16 causal genes

    PubMed Central

    Bruel, Ange-Line; Franco, Brunella; Duffourd, Yannis; Thevenon, Julien; Jego, Laurence; Lopez, Estelle; Deleuze, Jean-François; Doummar, Diane; Giles, Rachel H.; Johnson, Colin A.; Huynen, Martijn A.; Chevrier, Véronique; Burglen, Lydie; Morleo, Manuela; Desguerres, Isabelle; Pierquin, Geneviève; Doray, Bérénice; Gilbert-Dussardier, Brigitte; Reversade, Bruno; Steichen-Gersdorf, Elisabeth; Baumann, Clarisse; Panigrahi, Inusha; Fargeot-Espaliat, Anne; Dieux, Anne; David, Albert; Goldenberg, Alice; Bongers, Ernie; Gaillard, Dominique; Argente, Jesús; Aral, Bernard; Gigot, Nadège; St-Onge, Judith; Birnbaum, Daniel; Phadke, Shubha R.; Cormier-Daire, Valérie; Eguether, Thibaut; Pazour, Gregory J.; Herranz-Pérez, Vicente; Lee, Jaclyn S.; Pasquier, Laurent; Loget, Philippe; Saunier, Sophie; Mégarbané, André; Rosnet, Olivier; Leroux, Michel R.; Wallingford, John B.; Blacque, Oliver E.; Nachury, Maxence V.; Attie-Bitach, Tania; Rivière, Jean-Baptiste; Faivre, Laurence; Thauvin-Robinet, Christel

    2017-01-01

    Oral-facial-digital syndromes (OFDS) gather rare genetic disorders characterized by facial, oral and digital abnormalities associated with a wide range of additional features (polycystic kidney disease, cerebral malformations and several others) to delineate a growing list of OFD subtypes. The most frequent, OFD type I, is caused by a heterozygous mutation in the OFD1 gene encoding a centrosomal protein. The wide clinical heterogeneity of OFDS suggests the involvement of other ciliary genes. For 15 years, we have aimed to identify the molecular bases of OFDS. This effort has been greatly helped by the recent development of whole exome sequencing (WES). Here, we present all our published and unpublished results for WES in 24 OFDS cases. We identified causal variants in five new genes (C2CD3, TMEM107, INTU, KIAA0753, IFT57) and related the clinical spectrum of four genes in other ciliopathies (C5orf42, TMEM138, TMEM231, WDPCP) to OFDS. Mutations were also detected in two genes previously implicated in OFDS. Functional studies revealed the involvement of centriole elongation, transition zone and intraflagellar transport defects in OFDS, thus characterizing three ciliary protein modules: the complex KIAA0753-FOPNL-OFD1, a regulator of centriole elongation; the MKS module, a major component of the transition zone; and the CPLANE complex necessary for IFT-A assembly. OFDS now appear to be a distinct subgroup of ciliopathies with wide heterogeneity, which makes the initial classification obsolete. A clinical classification restricted to the three frequent/well-delineated subtypes could be proposed, and for patients who do not fit one of these 3 main subtypes, a further classification could be based on the genotype. PMID:28289185

  16. Identifying the Average Causal Mediation Effects with Multiple Mediators in the Presence of Treatment Non-Compliance

    ERIC Educational Resources Information Center

    Park, Soojin

    2015-01-01

    Identifying the causal mechanisms is becoming more essential in social and medical sciences. In the presence of treatment non-compliance, the Intent-To-Treated effect (hereafter, ITT effect) is identified as long as the treatment is randomized (Angrist et al., 1996). However, the mediated portion of effect is not identified without additional…

  17. DNA Methylation and BMI: Investigating Identified Methylation Sites at HIF3A in a Causal Framework

    PubMed Central

    Richmond, Rebecca C.; Ward, Mary E.; Fraser, Abigail; Lyttleton, Oliver; McArdle, Wendy L.; Ring, Susan M.; Gaunt, Tom R.; Lawlor, Debbie A.; Davey Smith, George; Relton, Caroline L.

    2016-01-01

    Multiple differentially methylated sites and regions associated with adiposity have now been identified in large-scale cross-sectional studies. We tested for replication of associations between previously identified CpG sites at HIF3A and adiposity in ∼1,000 mother-offspring pairs from the Avon Longitudinal Study of Parents and Children (ALSPAC). Availability of methylation and adiposity measures at multiple time points, as well as genetic data, allowed us to assess the temporal associations between adiposity and methylation and to make inferences regarding causality and directionality. Overall, our results were discordant with those expected if HIF3A methylation has a causal effect on BMI and provided more evidence for causality in the reverse direction (i.e., an effect of BMI on HIF3A methylation). These results are based on robust evidence from longitudinal analyses and were also partially supported by Mendelian randomization analysis, although this latter analysis was underpowered to detect a causal effect of BMI on HIF3A methylation. Our results also highlight an apparent long-lasting intergenerational influence of maternal BMI on offspring methylation at this locus, which may confound associations between own adiposity and HIF3A methylation. Further work is required to replicate and uncover the mechanisms underlying the direct and intergenerational effect of adiposity on DNA methylation. PMID:26861784

  18. Genetic Susceptibility to Vitiligo: GWAS Approaches for Identifying Vitiligo Susceptibility Genes and Loci

    PubMed Central

    Shen, Changbing; Gao, Jing; Sheng, Yujun; Dou, Jinfa; Zhou, Fusheng; Zheng, Xiaodong; Ko, Randy; Tang, Xianfa; Zhu, Caihong; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Zhang, Xuejun

    2016-01-01

    Vitiligo is an autoimmune disease with a strong genetic component, characterized by areas of depigmented skin resulting from loss of epidermal melanocytes. Genetic factors are known to play key roles in vitiligo through discoveries in association studies and family studies. Previously, vitiligo susceptibility genes were mainly revealed through linkage analysis and candidate gene studies. Recently, our understanding of the genetic basis of vitiligo has been rapidly advancing through genome-wide association study (GWAS). More than 40 robust susceptible loci have been identified and confirmed to be associated with vitiligo by using GWAS. Most of these associated genes participate in important pathways involved in the pathogenesis of vitiligo. Many susceptible loci with unknown functions in the pathogenesis of vitiligo have also been identified, indicating that additional molecular mechanisms may contribute to the risk of developing vitiligo. In this review, we summarize the key loci that are of genome-wide significance, which have been shown to influence vitiligo risk. These genetic loci may help build the foundation for genetic diagnosis and personalize treatment for patients with vitiligo in the future. However, substantial additional studies, including gene-targeted and functional studies, are required to confirm the causality of the genetic variants and their biological relevance in the development of vitiligo. PMID:26870082

  19. Integrative analysis of DNA methylation and gene expression data identifies EPAS1 as a key regulator of COPD.

    PubMed

    Yoo, Seungyeul; Takikawa, Sachiko; Geraghty, Patrick; Argmann, Carmen; Campbell, Joshua; Lin, Luan; Huang, Tao; Tu, Zhidong; Foronjy, Robert F; Feronjy, Robert; Spira, Avrum; Schadt, Eric E; Powell, Charles A; Zhu, Jun

    2015-01-01

    Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple genes sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a 'causal' role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology.

  20. DNA Methylation and BMI: Investigating Identified Methylation Sites at HIF3A in a Causal Framework.

    PubMed

    Richmond, Rebecca C; Sharp, Gemma C; Ward, Mary E; Fraser, Abigail; Lyttleton, Oliver; McArdle, Wendy L; Ring, Susan M; Gaunt, Tom R; Lawlor, Debbie A; Davey Smith, George; Relton, Caroline L

    2016-05-01

    Multiple differentially methylated sites and regions associated with adiposity have now been identified in large-scale cross-sectional studies. We tested for replication of associations between previously identified CpG sites at HIF3A and adiposity in ∼1,000 mother-offspring pairs from the Avon Longitudinal Study of Parents and Children (ALSPAC). Availability of methylation and adiposity measures at multiple time points, as well as genetic data, allowed us to assess the temporal associations between adiposity and methylation and to make inferences regarding causality and directionality. Overall, our results were discordant with those expected if HIF3A methylation has a causal effect on BMI and provided more evidence for causality in the reverse direction (i.e., an effect of BMI on HIF3A methylation). These results are based on robust evidence from longitudinal analyses and were also partially supported by Mendelian randomization analysis, although this latter analysis was underpowered to detect a causal effect of BMI on HIF3A methylation. Our results also highlight an apparent long-lasting intergenerational influence of maternal BMI on offspring methylation at this locus, which may confound associations between own adiposity and HIF3A methylation. Further work is required to replicate and uncover the mechanisms underlying the direct and intergenerational effect of adiposity on DNA methylation. © 2016 by the American Diabetes Association. Readers may use this article as long as the work is properly cited, the use is educational and not for profit, and the work is not altered.

  1. Next generation sequencing identifies abnormal Y chromosome and candidate causal variants in premature ovarian failure patients.

    PubMed

    Lee, Yujung; Kim, Changshin; Park, YoungJoon; Pyun, Jung-A; Kwack, KyuBum

    2016-12-01

    Premature ovarian failure (POF) is characterized by heterogeneous genetic causes such as chromosomal abnormalities and variants in causal genes. Recently, development of techniques made next generation sequencing (NGS) possible to detect genome wide variants including chromosomal abnormalities. Among 37 Korean POF patients, XY karyotype with distal part deletions of Y chromosome, Yp11.32-31 and Yp12 end part, was observed in two patients through NGS. Six deleterious variants in POF genes were also detected which might explain the pathogenesis of POF with abnormalities in the sex chromosomes. Additionally, the two POF patients had no mutation in SRY but three non-synonymous variants were detected in genes regarding sex reversal. These findings suggest candidate causes of POF and sex reversal and show the propriety of NGS to approach the heterogeneous pathogenesis of POF. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

    PubMed

    Morton, Nicholas M; Nelson, Yvonne B; Michailidou, Zoi; Di Rollo, Emma M; Ramage, Lynne; Hadoke, Patrick W F; Seckl, Jonathan R; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J; Dunbar, Donald R

    2011-01-01

    Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.

  3. Identifying X-consumers using causal recipes: "whales" and "jumbo shrimps" casino gamblers.

    PubMed

    Woodside, Arch G; Zhang, Mann

    2012-03-01

    X-consumers are the extremely frequent (top 2-3%) users who typically consume 25% of a product category. This article shows how to use fuzzy-set qualitative comparative analysis (QCA) to provide "causal recipes" sufficient for profiling X-consumers accurately. The study extends Dik Twedt's "heavy-half" product users for building theory and strategies to nurture or control X-behavior. The study here applies QCA to offer configurations that are sufficient in identifying "whales" and "jumbo shrimps" among X-casino gamblers. The findings support the principle that not all X-consumers are alike. The theory and method are applicable for identifying the degree of consistency and coverage of alternative X-consumers among users of all product-service category and brands.

  4. Fifteen years of research on oral-facial-digital syndromes: from 1 to 16 causal genes.

    PubMed

    Bruel, Ange-Line; Franco, Brunella; Duffourd, Yannis; Thevenon, Julien; Jego, Laurence; Lopez, Estelle; Deleuze, Jean-François; Doummar, Diane; Giles, Rachel H; Johnson, Colin A; Huynen, Martijn A; Chevrier, Véronique; Burglen, Lydie; Morleo, Manuela; Desguerres, Isabelle; Pierquin, Geneviève; Doray, Bérénice; Gilbert-Dussardier, Brigitte; Reversade, Bruno; Steichen-Gersdorf, Elisabeth; Baumann, Clarisse; Panigrahi, Inusha; Fargeot-Espaliat, Anne; Dieux, Anne; David, Albert; Goldenberg, Alice; Bongers, Ernie; Gaillard, Dominique; Argente, Jesús; Aral, Bernard; Gigot, Nadège; St-Onge, Judith; Birnbaum, Daniel; Phadke, Shubha R; Cormier-Daire, Valérie; Eguether, Thibaut; Pazour, Gregory J; Herranz-Pérez, Vicente; Goldstein, Jaclyn S; Pasquier, Laurent; Loget, Philippe; Saunier, Sophie; Mégarbané, André; Rosnet, Olivier; Leroux, Michel R; Wallingford, John B; Blacque, Oliver E; Nachury, Maxence V; Attie-Bitach, Tania; Rivière, Jean-Baptiste; Faivre, Laurence; Thauvin-Robinet, Christel

    2017-06-01

    Oral-facial-digital syndromes (OFDS) gather rare genetic disorders characterised by facial, oral and digital abnormalities associated with a wide range of additional features (polycystic kidney disease, cerebral malformations and several others) to delineate a growing list of OFDS subtypes. The most frequent, OFD type I, is caused by a heterozygous mutation in the OFD1 gene encoding a centrosomal protein. The wide clinical heterogeneity of OFDS suggests the involvement of other ciliary genes. For 15 years, we have aimed to identify the molecular bases of OFDS. This effort has been greatly helped by the recent development of whole-exome sequencing (WES). Here, we present all our published and unpublished results for WES in 24 cases with OFDS. We identified causal variants in five new genes ( C2CD3 , TMEM107 , INTU , KIAA0753 and IFT57 ) and related the clinical spectrum of four genes in other ciliopathies ( C5orf42 , TMEM138 , TMEM231 and WDPCP ) to OFDS. Mutations were also detected in two genes previously implicated in OFDS. Functional studies revealed the involvement of centriole elongation, transition zone and intraflagellar transport defects in OFDS, thus characterising three ciliary protein modules: the complex KIAA0753-FOPNL-OFD1, a regulator of centriole elongation; the Meckel-Gruber syndrome module, a major component of the transition zone; and the CPLANE complex necessary for IFT-A assembly. OFDS now appear to be a distinct subgroup of ciliopathies with wide heterogeneity, which makes the initial classification obsolete. A clinical classification restricted to the three frequent/well-delineated subtypes could be proposed, and for patients who do not fit one of these three main subtypes, a further classification could be based on the genotype. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  5. Obesity and infection: reciprocal causality.

    PubMed

    Hainer, V; Zamrazilová, H; Kunešová, M; Bendlová, B; Aldhoon-Hainerová, I

    2015-01-01

    Associations between different infectious agents and obesity have been reported in humans for over thirty years. In many cases, as in nosocomial infections, this relationship reflects the greater susceptibility of obese individuals to infection due to impaired immunity. In such cases, the infection is not related to obesity as a causal factor but represents a complication of obesity. In contrast, several infections have been suggested as potential causal factors in human obesity. However, evidence of a causal linkage to human obesity has only been provided for adenovirus 36 (Adv36). This virus activates lipogenic and proinflammatory pathways in adipose tissue, improves insulin sensitivity, lipid profile and hepatic steatosis. The E4orf1 gene of Adv36 exerts insulin senzitizing effects, but is devoid of its pro-inflammatory modalities. The development of a vaccine to prevent Adv36-induced obesity or the use of E4orf1 as a ligand for novel antidiabetic drugs could open new horizons in the prophylaxis and treatment of obesity and diabetes. More experimental and clinical studies are needed to elucidate the mutual relations between infection and obesity, identify additional infectious agents causing human obesity, as well as define the conditions that predispose obese individuals to specific infections.

  6. Discovering causal signaling pathways through gene-expression patterns

    PubMed Central

    Parikh, Jignesh R.; Klinger, Bertram; Xia, Yu; Marto, Jarrod A.; Blüthgen, Nils

    2010-01-01

    High-throughput gene-expression studies result in lists of differentially expressed genes. Most current meta-analyses of these gene lists include searching for significant membership of the translated proteins in various signaling pathways. However, such membership enrichment algorithms do not provide insight into which pathways caused the genes to be differentially expressed in the first place. Here, we present an intuitive approach for discovering upstream signaling pathways responsible for regulating these differentially expressed genes. We identify consistently regulated signature genes specific for signal transduction pathways from a panel of single-pathway perturbation experiments. An algorithm that detects overrepresentation of these signature genes in a gene group of interest is used to infer the signaling pathway responsible for regulation. We expose our novel resource and algorithm through a web server called SPEED: Signaling Pathway Enrichment using Experimental Data sets. SPEED can be freely accessed at http://speed.sys-bio.net/. PMID:20494976

  7. NIH Researchers Identify OCD Risk Gene

    MedlinePlus

    ... News From NIH NIH Researchers Identify OCD Risk Gene Past Issues / Summer 2006 Table of Contents For ... and Alcoholism (NIAAA) have identified a previously unknown gene variant that doubles an individual's risk for obsessive- ...

  8. Causal inference in nonlinear systems: Granger causality versus time-delayed mutual information

    NASA Astrophysics Data System (ADS)

    Li, Songting; Xiao, Yanyang; Zhou, Douglas; Cai, David

    2018-05-01

    The Granger causality (GC) analysis has been extensively applied to infer causal interactions in dynamical systems arising from economy and finance, physics, bioinformatics, neuroscience, social science, and many other fields. In the presence of potential nonlinearity in these systems, the validity of the GC analysis in general is questionable. To illustrate this, here we first construct minimal nonlinear systems and show that the GC analysis fails to infer causal relations in these systems—it gives rise to all types of incorrect causal directions. In contrast, we show that the time-delayed mutual information (TDMI) analysis is able to successfully identify the direction of interactions underlying these nonlinear systems. We then apply both methods to neuroscience data collected from experiments and demonstrate that the TDMI analysis but not the GC analysis can identify the direction of interactions among neuronal signals. Our work exemplifies inference hazards in the GC analysis in nonlinear systems and suggests that the TDMI analysis can be an appropriate tool in such a case.

  9. A Stratified Transcriptomics Analysis of Polygenic Fat and Lean Mouse Adipose Tissues Identifies Novel Candidate Obesity Genes

    PubMed Central

    Morton, Nicholas M.; Nelson, Yvonne B.; Michailidou, Zoi; Di Rollo, Emma M.; Ramage, Lynne; Hadoke, Patrick W. F.; Seckl, Jonathan R.; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J.; Dunbar, Donald R.

    2011-01-01

    Background Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. Results To enrich for adipose tissue obesity genes a ‘snap-shot’ pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. Conclusions A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes

  10. Causal diagrams for empirical legal research: a methodology for identifying causation, avoiding bias and interpreting results

    PubMed Central

    VanderWeele, Tyler J.; Staudt, Nancy

    2014-01-01

    In this paper we introduce methodology—causal directed acyclic graphs—that empirical researchers can use to identify causation, avoid bias, and interpret empirical results. This methodology has become popular in a number of disciplines, including statistics, biostatistics, epidemiology and computer science, but has yet to appear in the empirical legal literature. Accordingly we outline the rules and principles underlying this new methodology and then show how it can assist empirical researchers through both hypothetical and real-world examples found in the extant literature. While causal directed acyclic graphs are certainly not a panacea for all empirical problems, we show they have potential to make the most basic and fundamental tasks, such as selecting covariate controls, relatively easy and straightforward. PMID:25685055

  11. Type 2 diabetes mellitus disease risk genes identified by genome wide copy number variation scan in normal populations.

    PubMed

    Prabhanjan, Manasa; Suresh, Raviraj V; Murthy, Megha N; Ramachandra, Nallur B

    2016-03-01

    To identify the role of copy number variations (CNVs) on disease risk genes and its effect on disease phenotypes in type 2 diabetes mellitus (T2DM) in 12 random populations using high throughput arrays. CNV analysis was carried out on a total of 1715 individuals from 12 populations, from ArrayExpress Archive of the European Bioinformatics Institute along with our subjects using Affymetrix Genome Wide SNP 6.0 array. CNV effect on T2DM genes were analyzed using several bioinformatics tools and a molecular protein interaction network was constructed to identify the disease mechanism altered by the CNVs. Analysis showed 34.4% of the total population to be under CNV burden for T2DM, with 83 disease causal and associated genes being under CNV influence. Hotspots were identified on chromosomes 22, 12, 6, 19 and 11.Overlap studies with case cohorts revealed significant disease risk genes such as EGFR, E2F1, PPP1R3A, HLA and TSPAN8. CNVs play a significant role in predisposing T2DM in normal cohorts and contribute to the phenotypic effects. Thus, CNVs should be considered as one of the major contributors in predisposition of the disease. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  12. A Systems Biology Framework Identifies Molecular Underpinnings of Coronary Heart Disease

    PubMed Central

    Huan, Tianxiao; Zhang, Bin; Wang, Zhi; Joehanes, Roby; Zhu, Jun; Johnson, Andrew D.; Ying, Saixia; Munson, Peter J.; Raghavachari, Nalini; Wang, Richard; Liu, Poching; Courchesne, Paul; Hwang, Shih-Jen; Assimes, Themistocles L.; McPherson, Ruth; Samani, Nilesh J.; Schunkert, Heribert; Meng, Qingying; Suver, Christine; O'Donnell, Christopher J.; Derry, Jonathan; Yang, Xia; Levy, Daniel

    2013-01-01

    Objective Genetic approaches have identified numerous loci associated with coronary heart disease (CHD). The molecular mechanisms underlying CHD gene-disease associations, however, remain unclear. We hypothesized that genetic variants with both strong and subtle effects drive gene subnetworks that in turn affect CHD. Approach and Results We surveyed CHD-associated molecular interactions by constructing coexpression networks using whole blood gene expression profiles from 188 CHD cases and 188 age- and sex-matched controls. 24 coexpression modules were identified including one case-specific and one control-specific differential module (DM). The DMs were enriched for genes involved in B-cell activation, immune response, and ion transport. By integrating the DMs with altered gene expression associated SNPs (eSNPs) and with results of GWAS of CHD and its risk factors, the control-specific DM was implicated as CHD-causal based on its significant enrichment for both CHD and lipid eSNPs. This causal DM was further integrated with tissue-specific Bayesian networks and protein-protein interaction networks to identify regulatory key driver (KD) genes. Multi-tissue KDs (SPIB and TNFRSF13C) and tissue-specific KDs (e.g. EBF1) were identified. Conclusions Our network-driven integrative analysis not only identified CHD-related genes, but also defined network structure that sheds light on the molecular interactions of genes associated with CHD risk. PMID:23539213

  13. Inductive reasoning about causally transmitted properties.

    PubMed

    Shafto, Patrick; Kemp, Charles; Bonawitz, Elizabeth Baraff; Coley, John D; Tenenbaum, Joshua B

    2008-11-01

    Different intuitive theories constrain and guide inferences in different contexts. Formalizing simple intuitive theories as probabilistic processes operating over structured representations, we present a new computational model of category-based induction about causally transmitted properties. A first experiment demonstrates undergraduates' context-sensitive use of taxonomic and food web knowledge to guide reasoning about causal transmission and shows good qualitative agreement between model predictions and human inferences. A second experiment demonstrates strong quantitative and qualitative fits to inferences about a more complex artificial food web. A third experiment investigates human reasoning about complex novel food webs where species have known taxonomic relations. Results demonstrate a double-dissociation between the predictions of our causal model and a related taxonomic model [Kemp, C., & Tenenbaum, J. B. (2003). Learning domain structures. In Proceedings of the 25th annual conference of the cognitive science society]: the causal model predicts human inferences about diseases but not genes, while the taxonomic model predicts human inferences about genes but not diseases. We contrast our framework with previous models of category-based induction and previous formal instantiations of intuitive theories, and outline challenges in developing a complete model of context-sensitive reasoning.

  14. Genetic Obesity and the Risk of Atrial Fibrillation – Causal Estimates from Mendelian Randomization

    PubMed Central

    Chatterjee, Neal A.; Arking, Dan E.; Ellinor, Patrick T.; Heeringa, Jan; Lin, Honghuang; Lubitz, Steven A.; Soliman, Elsayed Z.; Verweij, Niek; Alonso, Alvaro; Benjamin, Emelia J.; Gudnason, Vilmundur; Stricker, Bruno H. C.; Van Der Harst, Pim; Chasman, Daniel I.; Albert, Christine M.

    2017-01-01

    Background Observational studies have identified an association between body mass index (BMI) and incident atrial fibrillation (AF). Inferring causality from observational studies, however, is subject to residual confounding, reverse causation, and bias. The primary objective of this study was to evaluate the causal association between BMI and AF using genetic predictors of BMI. Methods We identified 51 646 individuals of European ancestry without AF at baseline from seven prospective population-based cohorts initiated between 1987 and 2002 in the United States, Iceland, and the Netherlands with incident AF ascertained between 1987 and 2012. Cohort-specific mean follow-up ranged 7.4 to 19.2 years, over which period there were a total of 4178 cases of incident AF. We performed a Mendelian randomization with instrumental variable analysis to estimate a cohort-specific causal hazard ratio for the association between BMI and AF. Two genetic instruments for BMI were utilized: FTO genotype (rs1558902) and a BMI gene score comprised of 39 single nucleotide polymorphisms identified by genome-wide association studies to be associated with BMI. Cohort-specific estimates were combined by random-effects, inverse variance weighted meta-analysis. Results In age- and sex-adjusted meta-analysis, both genetic instruments were significantly associated with BMI (FTO: 0.43 [95% CI: 0.32 – 0.54] kg/m2 per A-allele, p<0.001); BMI gene score: 1.05 [95% CI: 0.90-1.20] kg/m2 per 1 unit increase, p<0.001) and incident AF (FTO – HR: 1.07 [1.02-1.11] per A-allele, p=0.004; BMI gene score – HR: 1.11 [1.05-1.18] per 1-unit increase, p<0.001). Age- and sex-adjusted instrumental variable estimates for the causal association between BMI and incident AF were HR 1.15 [1.04-1.26] per kg/m2, p=0.005 (FTO) and 1.11 [1.05-1.17] per kg/m2, p<0.001 (BMI gene score). Both of these estimates were consistent with the meta-analyzed estimate between observed BMI and AF (age- and sex-adjusted HR 1.05 [1

  15. Causal and causally separable processes

    NASA Astrophysics Data System (ADS)

    Oreshkov, Ognyan; Giarmatzi, Christina

    2016-09-01

    The idea that events are equipped with a partial causal order is central to our understanding of physics in the tested regimes: given two pointlike events A and B, either A is in the causal past of B, B is in the causal past of A, or A and B are space-like separated. Operationally, the meaning of these order relations corresponds to constraints on the possible correlations between experiments performed in the vicinities of the respective events: if A is in the causal past of B, an experimenter at A could signal to an experimenter at B but not the other way around, while if A and B are space-like separated, no signaling is possible in either direction. In the context of a concrete physical theory, the correlations compatible with a given causal configuration may obey further constraints. For instance, space-like correlations in quantum mechanics arise from local measurements on joint quantum states, while time-like correlations are established via quantum channels. Similarly to other variables, however, the causal order of a set of events could be random, and little is understood about the constraints that causality implies in this case. A main difficulty concerns the fact that the order of events can now generally depend on the operations performed at the locations of these events, since, for instance, an operation at A could influence the order in which B and C occur in A’s future. So far, no formal theory of causality compatible with such dynamical causal order has been developed. Apart from being of fundamental interest in the context of inferring causal relations, such a theory is imperative for understanding recent suggestions that the causal order of events in quantum mechanics can be indefinite. Here, we develop such a theory in the general multipartite case. Starting from a background-independent definition of causality, we derive an iteratively formulated canonical decomposition of multipartite causal correlations. For a fixed number of settings and

  16. MIIC online: a web server to reconstruct causal or non-causal networks from non-perturbative data.

    PubMed

    Sella, Nadir; Verny, Louis; Uguzzoni, Guido; Affeldt, Séverine; Isambert, Hervé

    2018-07-01

    We present a web server running the MIIC algorithm, a network learning method combining constraint-based and information-theoretic frameworks to reconstruct causal, non-causal or mixed networks from non-perturbative data, without the need for an a priori choice on the class of reconstructed network. Starting from a fully connected network, the algorithm first removes dispensable edges by iteratively subtracting the most significant information contributions from indirect paths between each pair of variables. The remaining edges are then filtered based on their confidence assessment or oriented based on the signature of causality in observational data. MIIC online server can be used for a broad range of biological data, including possible unobserved (latent) variables, from single-cell gene expression data to protein sequence evolution and outperforms or matches state-of-the-art methods for either causal or non-causal network reconstruction. MIIC online can be freely accessed at https://miic.curie.fr. Supplementary data are available at Bioinformatics online.

  17. Epidemiological causality.

    PubMed

    Morabia, Alfredo

    2005-01-01

    Epidemiological methods, which combine population thinking and group comparisons, can primarily identify causes of disease in populations. There is therefore a tension between our intuitive notion of a cause, which we want to be deterministic and invariant at the individual level, and the epidemiological notion of causes, which are invariant only at the population level. Epidemiologists have given heretofore a pragmatic solution to this tension. Causal inference in epidemiology consists in checking the logical coherence of a causality statement and determining whether what has been found grossly contradicts what we think we already know: how strong is the association? Is there a dose-response relationship? Does the cause precede the effect? Is the effect biologically plausible? Etc. This approach to causal inference can be traced back to the English philosophers David Hume and John Stuart Mill. On the other hand, the mode of establishing causality, devised by Jakob Henle and Robert Koch, which has been fruitful in bacteriology, requires that in every instance the effect invariably follows the cause (e.g., inoculation of Koch bacillus and tuberculosis). This is incompatible with epidemiological causality which has to deal with probabilistic effects (e.g., smoking and lung cancer), and is therefore invariant only for the population.

  18. Identifying Conditions That Support Causal Inference in Observational Studies in Education: Empirical Evidence from within Study Comparisons

    ERIC Educational Resources Information Center

    Hallberg, Kelly

    2013-01-01

    This dissertation is a collection of three papers that employ empirical within study comparisons (WSCs) to identify conditions that support causal inference in observational studies. WSC studies empirically estimate the extent to which a given observational study reproduces the result of a randomized clinical trial (RCT) when both share the same…

  19. Fine Mapping of a Clubroot Resistance Gene in Chinese Cabbage Using SNP Markers Identified from Bulked Segregant RNA Sequencing

    PubMed Central

    Huang, Zhen; Peng, Gary; Liu, Xunjia; Deora, Abhinandan; Falk, Kevin C.; Gossen, Bruce D.; McDonald, Mary R.; Yu, Fengqun

    2017-01-01

    Clubroot, caused by Plasmodiophora brassicae, is an important disease of canola (Brassica napus) in western Canada and worldwide. In this study, a clubroot resistance gene (Rcr2) was identified and fine mapped in Chinese cabbage cv. “Jazz” using single-nucleotide polymorphisms (SNP) markers identified from bulked segregant RNA sequencing (BSR-Seq) and molecular markers were developed for use in marker assisted selection. In total, 203.9 million raw reads were generated from one pooled resistant (R) and one pooled susceptible (S) sample, and >173,000 polymorphic SNP sites were identified between the R and S samples. One significant peak was observed between 22 and 26 Mb of chromosome A03, which had been predicted by BSR-Seq to contain the causal gene Rcr2. There were 490 polymorphic SNP sites identified in the region. A segregating population consisting of 675 plants was analyzed with 15 SNP sites in the region using the Kompetitive Allele Specific PCR method, and Rcr2 was fine mapped between two SNP markers, SNP_A03_32 and SNP_A03_67 with 0.1 and 0.3 cM from Rcr2, respectively. Five SNP markers co-segregated with Rcr2 in this region. Variants were identified in 14 of 36 genes annotated in the Rcr2 target region. The numbers of poly variants differed among the genes. Four genes encode TIR-NBS-LRR proteins and two of them Bra019410 and Bra019413, had high numbers of polymorphic variants and so are the most likely candidates of Rcr2. PMID:28894454

  20. Causality and Causal Inference in Social Work: Quantitative and Qualitative Perspectives

    PubMed Central

    Palinkas, Lawrence A.

    2015-01-01

    Achieving the goals of social work requires matching a specific solution to a specific problem. Understanding why the problem exists and why the solution should work requires a consideration of cause and effect. However, it is unclear whether it is desirable for social workers to identify cause and effect, whether it is possible for social workers to identify cause and effect, and, if so, what is the best means for doing so. These questions are central to determining the possibility of developing a science of social work and how we go about doing it. This article has four aims: (1) provide an overview of the nature of causality; (2) examine how causality is treated in social work research and practice; (3) highlight the role of quantitative and qualitative methods in the search for causality; and (4) demonstrate how both methods can be employed to support a “science” of social work. PMID:25821393

  1. Nightmares in the general population: identifying potential causal factors.

    PubMed

    Rek, Stephanie; Sheaves, Bryony; Freeman, Daniel

    2017-09-01

    Nightmares are inherently distressing, prevent restorative sleep, and are associated with a number of psychiatric problems, but have rarely been the subject of empirical study. Negative affect, linked to stressful events, is generally considered the key trigger of nightmares; hence nightmares have most often been considered in the context of post-traumatic stress disorder (PTSD). However, many individuals with heightened negative affect do not have nightmares. The objective of this study was to identify mechanistically plausible factors, beyond negative affect, that may explain why individuals experience nightmares. 846 participants from the UK general population completed an online survey about nightmare occurrence and severity (pre-occupation, distress, and impairment), negative affect, worry, depersonalisation, hallucinatory experiences, paranoia, alcohol use, sleep duration, physical activity levels, PTSD symptoms, and stressful life events. Associations of nightmares with the putative predictive factors were tested controlling for levels of negative affect. Analyses were also repeated controlling for levels of PTSD and the recent occurrence of stressful life events. Nightmare occurrence, adjusting for negative affect, was associated with higher levels of worry, depersonalisation, hallucinatory experiences, paranoia, and sleep duration (odds ratios 1.25-1.45). Nightmare severity, controlling for negative affect, was associated with higher levels of worry, depersonalisation, hallucinatory experiences, and paranoia (R 2 s: 0.33-0.39). Alcohol use and physical activity levels were not associated with nightmares. The study identifies a number of potential predictors of the occurrence and severity of nightmares. Causal roles require testing in future longitudinal, experimental, and treatment studies.

  2. Chemical-gene interaction networks and causal reasoning for ...

    EPA Pesticide Factsheets

    Evaluating the potential human health and ecological risks associated with exposures to complex chemical mixtures in the environment is one of the main challenges of chemical safety assessment and environmental protection. There is a need for approaches that can help to integrate chemical monitoring and biological effects data to evaluate risks associated with chemicals present in the environment. Here, we used prior knowledge about chemical-gene interactions to develop a knowledge assembly model for detected chemicals at five locations near the North Branch and Chisago wastewater treatment plants (WWTP) in the St. Croix River Basin, MN and WI. The assembly model was used to generate hypotheses about the biological impacts of the chemicals at each location. The hypotheses were tested using empirical hepatic gene expression data from fathead minnows exposed for 12 d at each location. Empirical gene expression data were also mapped to the assembly models to evaluate the likelihood of a chemical contributing to the observed biological responses using richness and concordance statistics. The prior knowledge approach was able predict the observed biological pathways impacted at one site but not the other. Atrazine was identified as a potential contributor to the observed gene expression responses at a location upstream of the North Branch WTTP. Four chemicals were identified as contributors to the observed biological responses at the effluent and downstream o

  3. An integrative framework for Bayesian variable selection with informative priors for identifying genes and pathways.

    PubMed

    Peng, Bin; Zhu, Dianwen; Ander, Bradley P; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with 'large p, small n' problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed.

  4. Utilizing Gene Tree Variation to Identify Candidate Effector Genes in Zymoseptoria tritici

    PubMed Central

    McDonald, Megan C.; McGinness, Lachlan; Hane, James K.; Williams, Angela H.; Milgate, Andrew; Solomon, Peter S.

    2016-01-01

    Zymoseptoria tritici is a host-specific, necrotrophic pathogen of wheat. Infection by Z. tritici is characterized by its extended latent period, which typically lasts 2 wks, and is followed by extensive host cell death, and rapid proliferation of fungal biomass. This work characterizes the level of genomic variation in 13 isolates, for which we have measured virulence on 11 wheat cultivars with differential resistance genes. Between the reference isolate, IPO323, and the 13 Australian isolates we identified over 800,000 single nucleotide polymorphisms, of which ∼10% had an effect on the coding regions of the genome. Furthermore, we identified over 1700 probable presence/absence polymorphisms in genes across the Australian isolates using de novo assembly. Finally, we developed a gene tree sorting method that quickly identifies groups of isolates within a single gene alignment whose sequence haplotypes correspond with virulence scores on a single wheat cultivar. Using this method, we have identified < 100 candidate effector genes whose gene sequence correlates with virulence toward a wheat cultivar carrying a major resistance gene. PMID:26837952

  5. Mutation screening of 75 candidate genes in 152 complex I deficiency cases identifies pathogenic variants in 16 genes including NDUFB9.

    PubMed

    Haack, Tobias B; Madignier, Florence; Herzer, Martina; Lamantea, Eleonora; Danhauser, Katharina; Invernizzi, Federica; Koch, Johannes; Freitag, Martin; Drost, Rene; Hillier, Ingo; Haberberger, Birgit; Mayr, Johannes A; Ahting, Uwe; Tiranti, Valeria; Rötig, Agnes; Iuso, Arcangela; Horvath, Rita; Tesarova, Marketa; Baric, Ivo; Uziel, Graziella; Rolinski, Boris; Sperl, Wolfgang; Meitinger, Thomas; Zeviani, Massimo; Freisinger, Peter; Prokisch, Holger

    2012-02-01

    Mitochondrial complex I deficiency is the most common cause of mitochondrial disease in childhood. Identification of the molecular basis is difficult given the clinical and genetic heterogeneity. Most patients lack a molecular definition in routine diagnostics. A large-scale mutation screen of 75 candidate genes in 152 patients with complex I deficiency was performed by high-resolution melting curve analysis and Sanger sequencing. The causal role of a new disease allele was confirmed by functional complementation assays. The clinical phenotype of patients carrying mutations was documented using a standardised questionnaire. Causative mutations were detected in 16 genes, 15 of which had previously been associated with complex I deficiency: three mitochondrial DNA genes encoding complex I subunits, two mitochondrial tRNA genes and nuclear DNA genes encoding six complex I subunits and four assembly factors. For the first time, a causal mutation is described in NDUFB9, coding for a complex I subunit, resulting in reduction in NDUFB9 protein and both amount and activity of complex I. These features were rescued by expression of wild-type NDUFB9 in patient-derived fibroblasts. Mutant NDUFB9 is a new cause of complex I deficiency. A molecular diagnosis related to complex I deficiency was established in 18% of patients. However, most patients are likely to carry mutations in genes so far not associated with complex I function. The authors conclude that the high degree of genetic heterogeneity in complex I disorders warrants the implementation of unbiased genome-wide strategies for the complete molecular dissection of mitochondrial complex I deficiency.

  6. Pla2g12b and Hpn Are Genes Identified by Mouse ENU Mutagenesis That Affect HDL Cholesterol

    PubMed Central

    Aljakna, Aleksandra; Choi, Seungbum; Savage, Holly; Hageman Blair, Rachael; Gu, Tongjun; Svenson, Karen L.; Churchill, Gary A.; Hibbs, Matt; Korstanje, Ron

    2012-01-01

    Despite considerable progress understanding genes that affect the HDL particle, its function, and cholesterol content, genes identified to date explain only a small percentage of the genetic variation. We used N-ethyl-N-nitrosourea mutagenesis in mice to discover novel genes that affect HDL cholesterol levels. Two mutant lines (Hlb218 and Hlb320) with low HDL cholesterol levels were established. Causal mutations in these lines were mapped using linkage analysis: for line Hlb218 within a 12 Mbp region on Chr 10; and for line Hlb320 within a 21 Mbp region on Chr 7. High-throughput sequencing of Hlb218 liver RNA identified a mutation in Pla2g12b. The transition of G to A leads to a cysteine to tyrosine change and most likely causes a loss of a disulfide bridge. Microarray analysis of Hlb320 liver RNA showed a 7-fold downregulation of Hpn; sequencing identified a mutation in the 3′ splice site of exon 8. Northern blot confirmed lower mRNA expression level in Hlb320 and did not show a difference in splicing, suggesting that the mutation only affects the splicing rate. In addition to affecting HDL cholesterol, the mutated genes also lead to reduction in serum non-HDL cholesterol and triglyceride levels. Despite low HDL cholesterol levels, the mice from both mutant lines show similar atherosclerotic lesion sizes compared to control mice. These new mutant mouse models are valuable tools to further study the role of these genes, their affect on HDL cholesterol levels, and metabolism. PMID:22912808

  7. PPARα siRNA–Treated Expression Profiles Uncover the Causal Sufficiency Network for Compound-Induced Liver Hypertrophy

    PubMed Central

    Dai, Xudong; Souza, Angus T. De; Dai, Hongyue; Lewis, David L; Lee, Chang-kyu; Spencer, Andy G; Herweijer, Hans; Hagstrom, Jim E; Linsley, Peter S; Bassett, Douglas E; Ulrich, Roger G; He, Yudong D

    2007-01-01

    Uncovering pathways underlying drug-induced toxicity is a fundamental objective in the field of toxicogenomics. Developing mechanism-based toxicity biomarkers requires the identification of such novel pathways and the order of their sufficiency in causing a phenotypic response. Genome-wide RNA interference (RNAi) phenotypic screening has emerged as an effective tool in unveiling the genes essential for specific cellular functions and biological activities. However, eliciting the relative contribution of and sufficiency relationships among the genes identified remains challenging. In the rodent, the most widely used animal model in preclinical studies, it is unrealistic to exhaustively examine all potential interactions by RNAi screening. Application of existing computational approaches to infer regulatory networks with biological outcomes in the rodent is limited by the requirements for a large number of targeted permutations. Therefore, we developed a two-step relay method that requires only one targeted perturbation for genome-wide de novo pathway discovery. Using expression profiles in response to small interfering RNAs (siRNAs) against the gene for peroxisome proliferator-activated receptor α (Ppara), our method unveiled the potential causal sufficiency order network for liver hypertrophy in the rodent. The validity of the inferred 16 causal transcripts or 15 known genes for PPARα-induced liver hypertrophy is supported by their ability to predict non-PPARα–induced liver hypertrophy with 84% sensitivity and 76% specificity. Simulation shows that the probability of achieving such predictive accuracy without the inferred causal relationship is exceedingly small (p < 0.005). Five of the most sufficient causal genes have been previously disrupted in mouse models; the resulting phenotypic changes in the liver support the inferred causal roles in liver hypertrophy. Our results demonstrate the feasibility of defining pathways mediating drug-induced toxicity from

  8. Fine-Scale Mapping at 9p22.2 Identifies Candidate Causal Variants That Modify Ovarian Cancer Risk in BRCA1 and BRCA2 Mutation Carriers

    PubMed Central

    Vigorito, Elena; Kuchenbaecker, Karoline B.; Beesley, Jonathan; Adlard, Julian; Agnarsson, Bjarni A.; Andrulis, Irene L.; Arun, Banu K.; Barjhoux, Laure; Belotti, Muriel; Benitez, Javier; Berger, Andreas; Bojesen, Anders; Bonanni, Bernardo; Brewer, Carole; Caldes, Trinidad; Caligo, Maria A.; Campbell, Ian; Chan, Salina B.; Claes, Kathleen B. M.; Cohn, David E.; Cook, Jackie; Daly, Mary B.; Damiola, Francesca; Davidson, Rosemarie; de Pauw, Antoine; Delnatte, Capucine; Diez, Orland; Domchek, Susan M.; Dumont, Martine; Durda, Katarzyna; Dworniczak, Bernd; Easton, Douglas F.; Eccles, Diana; Edwinsdotter Ardnor, Christina; Eeles, Ros; Ejlertsen, Bent; Ellis, Steve; Evans, D. Gareth; Feliubadalo, Lidia; Fostira, Florentia; Foulkes, William D.; Friedman, Eitan; Frost, Debra; Gaddam, Pragna; Ganz, Patricia A.; Garber, Judy; Garcia-Barberan, Vanesa; Gauthier-Villars, Marion; Gehrig, Andrea; Gerdes, Anne-Marie; Giraud, Sophie; Godwin, Andrew K.; Goldgar, David E.; Hake, Christopher R.; Hansen, Thomas V. O.; Healey, Sue; Hodgson, Shirley; Hogervorst, Frans B. L.; Houdayer, Claude; Hulick, Peter J.; Imyanitov, Evgeny N.; Isaacs, Claudine; Izatt, Louise; Izquierdo, Angel; Jacobs, Lauren; Jakubowska, Anna; Janavicius, Ramunas; Jaworska-Bieniek, Katarzyna; Jensen, Uffe Birk; John, Esther M.; Vijai, Joseph; Karlan, Beth Y.; Kast, Karin; Investigators, KConFab; Khan, Sofia; Kwong, Ava; Laitman, Yael; Lester, Jenny; Lesueur, Fabienne; Liljegren, Annelie; Lubinski, Jan; Mai, Phuong L.; Manoukian, Siranoush; Mazoyer, Sylvie; Meindl, Alfons; Mensenkamp, Arjen R.; Montagna, Marco; Nathanson, Katherine L.; Neuhausen, Susan L.; Nevanlinna, Heli; Niederacher, Dieter; Olah, Edith; Olopade, Olufunmilayo I.; Ong, Kai-ren; Osorio, Ana; Park, Sue Kyung; Paulsson-Karlsson, Ylva; Pedersen, Inge Sokilde; Peissel, Bernard; Peterlongo, Paolo; Pfeiler, Georg; Phelan, Catherine M.; Piedmonte, Marion; Poppe, Bruce; Pujana, Miquel Angel; Radice, Paolo; Rennert, Gad; Rodriguez, Gustavo C.; Rookus, Matti A.; Ross, Eric A.; Schmutzler, Rita Katharina; Simard, Jacques; Singer, Christian F.; Slavin, Thomas P.; Soucy, Penny; Southey, Melissa; Steinemann, Doris; Stoppa-Lyonnet, Dominique; Sukiennicki, Grzegorz; Sutter, Christian; Szabo, Csilla I.; Tea, Muy-Kheng; Teixeira, Manuel R.; Teo, Soo-Hwang; Terry, Mary Beth; Thomassen, Mads; Tibiletti, Maria Grazia; Tihomirova, Laima; Tognazzo, Silvia; van Rensburg, Elizabeth J.; Varesco, Liliana; Varon-Mateeva, Raymonda; Vratimos, Athanassios; Weitzel, Jeffrey N.; McGuffog, Lesley; Kirk, Judy; Toland, Amanda Ewart; Hamann, Ute; Lindor, Noralane; Ramus, Susan J.; Greene, Mark H.; Couch, Fergus J.; Offit, Kenneth; Pharoah, Paul D. P.; Chenevix-Trench, Georgia; Antoniou, Antonis C.

    2016-01-01

    Population-based genome wide association studies have identified a locus at 9p22.2 associated with ovarian cancer risk, which also modifies ovarian cancer risk in BRCA1 and BRCA2 mutation carriers. We conducted fine-scale mapping at 9p22.2 to identify potential causal variants in BRCA1 and BRCA2 mutation carriers. Genotype data were available for 15,252 (2,462 ovarian cancer cases) BRCA1 and 8,211 (631 ovarian cancer cases) BRCA2 mutation carriers. Following genotype imputation, ovarian cancer associations were assessed for 4,873 and 5,020 SNPs in BRCA1 and BRCA 2 mutation carriers respectively, within a retrospective cohort analytical framework. In BRCA1 mutation carriers one set of eight correlated candidate causal variants for ovarian cancer risk modification was identified (top SNP rs10124837, HR: 0.73, 95%CI: 0.68 to 0.79, p-value 2× 10−16). These variants were located up to 20 kb upstream of BNC2. In BRCA2 mutation carriers one region, up to 45 kb upstream of BNC2, and containing 100 correlated SNPs was identified as candidate causal (top SNP rs62543585, HR: 0.69, 95%CI: 0.59 to 0.80, p-value 1.0 × 10−6). The candidate causal in BRCA1 mutation carriers did not include the strongest associated variant at this locus in the general population. In sum, we identified a set of candidate causal variants in a region that encompasses the BNC2 transcription start site. The ovarian cancer association at 9p22.2 may be mediated by different variants in BRCA1 mutation carriers and in the general population. Thus, potentially different mechanisms may underlie ovarian cancer risk for mutation carriers and the general population. PMID:27463617

  9. Fine-Scale Mapping at 9p22.2 Identifies Candidate Causal Variants That Modify Ovarian Cancer Risk in BRCA1 and BRCA2 Mutation Carriers.

    PubMed

    Vigorito, Elena; Kuchenbaecker, Karoline B; Beesley, Jonathan; Adlard, Julian; Agnarsson, Bjarni A; Andrulis, Irene L; Arun, Banu K; Barjhoux, Laure; Belotti, Muriel; Benitez, Javier; Berger, Andreas; Bojesen, Anders; Bonanni, Bernardo; Brewer, Carole; Caldes, Trinidad; Caligo, Maria A; Campbell, Ian; Chan, Salina B; Claes, Kathleen B M; Cohn, David E; Cook, Jackie; Daly, Mary B; Damiola, Francesca; Davidson, Rosemarie; Pauw, Antoine de; Delnatte, Capucine; Diez, Orland; Domchek, Susan M; Dumont, Martine; Durda, Katarzyna; Dworniczak, Bernd; Easton, Douglas F; Eccles, Diana; Edwinsdotter Ardnor, Christina; Eeles, Ros; Ejlertsen, Bent; Ellis, Steve; Evans, D Gareth; Feliubadalo, Lidia; Fostira, Florentia; Foulkes, William D; Friedman, Eitan; Frost, Debra; Gaddam, Pragna; Ganz, Patricia A; Garber, Judy; Garcia-Barberan, Vanesa; Gauthier-Villars, Marion; Gehrig, Andrea; Gerdes, Anne-Marie; Giraud, Sophie; Godwin, Andrew K; Goldgar, David E; Hake, Christopher R; Hansen, Thomas V O; Healey, Sue; Hodgson, Shirley; Hogervorst, Frans B L; Houdayer, Claude; Hulick, Peter J; Imyanitov, Evgeny N; Isaacs, Claudine; Izatt, Louise; Izquierdo, Angel; Jacobs, Lauren; Jakubowska, Anna; Janavicius, Ramunas; Jaworska-Bieniek, Katarzyna; Jensen, Uffe Birk; John, Esther M; Vijai, Joseph; Karlan, Beth Y; Kast, Karin; Investigators, KConFab; Khan, Sofia; Kwong, Ava; Laitman, Yael; Lester, Jenny; Lesueur, Fabienne; Liljegren, Annelie; Lubinski, Jan; Mai, Phuong L; Manoukian, Siranoush; Mazoyer, Sylvie; Meindl, Alfons; Mensenkamp, Arjen R; Montagna, Marco; Nathanson, Katherine L; Neuhausen, Susan L; Nevanlinna, Heli; Niederacher, Dieter; Olah, Edith; Olopade, Olufunmilayo I; Ong, Kai-Ren; Osorio, Ana; Park, Sue Kyung; Paulsson-Karlsson, Ylva; Pedersen, Inge Sokilde; Peissel, Bernard; Peterlongo, Paolo; Pfeiler, Georg; Phelan, Catherine M; Piedmonte, Marion; Poppe, Bruce; Pujana, Miquel Angel; Radice, Paolo; Rennert, Gad; Rodriguez, Gustavo C; Rookus, Matti A; Ross, Eric A; Schmutzler, Rita Katharina; Simard, Jacques; Singer, Christian F; Slavin, Thomas P; Soucy, Penny; Southey, Melissa; Steinemann, Doris; Stoppa-Lyonnet, Dominique; Sukiennicki, Grzegorz; Sutter, Christian; Szabo, Csilla I; Tea, Muy-Kheng; Teixeira, Manuel R; Teo, Soo-Hwang; Terry, Mary Beth; Thomassen, Mads; Tibiletti, Maria Grazia; Tihomirova, Laima; Tognazzo, Silvia; van Rensburg, Elizabeth J; Varesco, Liliana; Varon-Mateeva, Raymonda; Vratimos, Athanassios; Weitzel, Jeffrey N; McGuffog, Lesley; Kirk, Judy; Toland, Amanda Ewart; Hamann, Ute; Lindor, Noralane; Ramus, Susan J; Greene, Mark H; Couch, Fergus J; Offit, Kenneth; Pharoah, Paul D P; Chenevix-Trench, Georgia; Antoniou, Antonis C

    2016-01-01

    Population-based genome wide association studies have identified a locus at 9p22.2 associated with ovarian cancer risk, which also modifies ovarian cancer risk in BRCA1 and BRCA2 mutation carriers. We conducted fine-scale mapping at 9p22.2 to identify potential causal variants in BRCA1 and BRCA2 mutation carriers. Genotype data were available for 15,252 (2,462 ovarian cancer cases) BRCA1 and 8,211 (631 ovarian cancer cases) BRCA2 mutation carriers. Following genotype imputation, ovarian cancer associations were assessed for 4,873 and 5,020 SNPs in BRCA1 and BRCA 2 mutation carriers respectively, within a retrospective cohort analytical framework. In BRCA1 mutation carriers one set of eight correlated candidate causal variants for ovarian cancer risk modification was identified (top SNP rs10124837, HR: 0.73, 95%CI: 0.68 to 0.79, p-value 2× 10-16). These variants were located up to 20 kb upstream of BNC2. In BRCA2 mutation carriers one region, up to 45 kb upstream of BNC2, and containing 100 correlated SNPs was identified as candidate causal (top SNP rs62543585, HR: 0.69, 95%CI: 0.59 to 0.80, p-value 1.0 × 10-6). The candidate causal in BRCA1 mutation carriers did not include the strongest associated variant at this locus in the general population. In sum, we identified a set of candidate causal variants in a region that encompasses the BNC2 transcription start site. The ovarian cancer association at 9p22.2 may be mediated by different variants in BRCA1 mutation carriers and in the general population. Thus, potentially different mechanisms may underlie ovarian cancer risk for mutation carriers and the general population.

  10. Developing Causal Understanding with Causal Maps: The Impact of Total Links, Temporal Flow, and Lateral Position of Outcome Nodes

    ERIC Educational Resources Information Center

    Jeong, Allan; Lee, Woon Jee

    2012-01-01

    This study examined some of the methodological approaches used by students to construct causal maps in order to determine which approaches help students understand the underlying causes and causal mechanisms in a complex system. This study tested the relationship between causal understanding (ratio of root causes correctly/incorrectly identified,…

  11. Causal Inference and Developmental Psychology

    ERIC Educational Resources Information Center

    Foster, E. Michael

    2010-01-01

    Causal inference is of central importance to developmental psychology. Many key questions in the field revolve around improving the lives of children and their families. These include identifying risk factors that if manipulated in some way would foster child development. Such a task inherently involves causal inference: One wants to know whether…

  12. Gene panel sequencing improves the diagnostic work-up of patients with idiopathic erythrocytosis and identifies new mutations

    PubMed Central

    Camps, Carme; Petousi, Nayia; Bento, Celeste; Cario, Holger; Copley, Richard R.; McMullin, Mary Frances; van Wijk, Richard; Ratcliffe, Peter J.; Robbins, Peter A.; Taylor, Jenny C.

    2016-01-01

    Erythrocytosis is a rare disorder characterized by increased red cell mass and elevated hemoglobin concentration and hematocrit. Several genetic variants have been identified as causes for erythrocytosis in genes belonging to different pathways including oxygen sensing, erythropoiesis and oxygen transport. However, despite clinical investigation and screening for these mutations, the cause of disease cannot be found in a considerable number of patients, who are classified as having idiopathic erythrocytosis. In this study, we developed a targeted next-generation sequencing panel encompassing the exonic regions of 21 genes from relevant pathways (~79 Kb) and sequenced 125 patients with idiopathic erythrocytosis. The panel effectively screened 97% of coding regions of these genes, with an average coverage of 450×. It identified 51 different rare variants, all leading to alterations of protein sequence, with 57 out of 125 cases (45.6%) having at least one of these variants. Ten of these were known erythrocytosis-causing variants, which had been missed following existing diagnostic algorithms. Twenty-two were novel variants in erythrocytosis-associated genes (EGLN1, EPAS1, VHL, BPGM, JAK2, SH2B3) and in novel genes included in the panel (e.g. EPO, EGLN2, HIF3A, OS9), some with a high likelihood of functionality, for which future segregation, functional and replication studies will be useful to provide further evidence for causality. The rest were classified as polymorphisms. Overall, these results demonstrate the benefits of using a gene panel rather than existing methods in which focused genetic screening is performed depending on biochemical measurements: the gene panel improves diagnostic accuracy and provides the opportunity for discovery of novel variants. PMID:27651169

  13. Gene panel sequencing improves the diagnostic work-up of patients with idiopathic erythrocytosis and identifies new mutations.

    PubMed

    Camps, Carme; Petousi, Nayia; Bento, Celeste; Cario, Holger; Copley, Richard R; McMullin, Mary Frances; van Wijk, Richard; Ratcliffe, Peter J; Robbins, Peter A; Taylor, Jenny C

    2016-11-01

    Erythrocytosis is a rare disorder characterized by increased red cell mass and elevated hemoglobin concentration and hematocrit. Several genetic variants have been identified as causes for erythrocytosis in genes belonging to different pathways including oxygen sensing, erythropoiesis and oxygen transport. However, despite clinical investigation and screening for these mutations, the cause of disease cannot be found in a considerable number of patients, who are classified as having idiopathic erythrocytosis. In this study, we developed a targeted next-generation sequencing panel encompassing the exonic regions of 21 genes from relevant pathways (~79 Kb) and sequenced 125 patients with idiopathic erythrocytosis. The panel effectively screened 97% of coding regions of these genes, with an average coverage of 450×. It identified 51 different rare variants, all leading to alterations of protein sequence, with 57 out of 125 cases (45.6%) having at least one of these variants. Ten of these were known erythrocytosis-causing variants, which had been missed following existing diagnostic algorithms. Twenty-two were novel variants in erythrocytosis-associated genes (EGLN1, EPAS1, VHL, BPGM, JAK2, SH2B3) and in novel genes included in the panel (e.g. EPO, EGLN2, HIF3A, OS9), some with a high likelihood of functionality, for which future segregation, functional and replication studies will be useful to provide further evidence for causality. The rest were classified as polymorphisms. Overall, these results demonstrate the benefits of using a gene panel rather than existing methods in which focused genetic screening is performed depending on biochemical measurements: the gene panel improves diagnostic accuracy and provides the opportunity for discovery of novel variants. Copyright© Ferrata Storti Foundation.

  14. Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors

    PubMed Central

    Bii, Victor M.; Trobridge, Grant D.

    2016-01-01

    Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types. PMID:27792127

  15. Integrating Genetic and Gene Co-expression Analysis Identifies Gene Networks Involved in Alcohol and Stress Responses

    PubMed Central

    Luo, Jie; Xu, Pei; Cao, Peijian; Wan, Hongjian; Lv, Xiaonan; Xu, Shengchun; Wang, Gangjun; Cook, Melloni N.; Jones, Byron C.; Lu, Lu; Wang, Xusheng

    2018-01-01

    Although the link between stress and alcohol is well recognized, the underlying mechanisms of how they interplay at the molecular level remain unclear. The purpose of this study is to identify molecular networks underlying the effects of alcohol and stress responses, as well as their interaction on anxiety behaviors in the hippocampus of mice using a systems genetics approach. Here, we applied a gene co-expression network approach to transcriptomes of 41 BXD mouse strains under four conditions: stress, alcohol, stress-induced alcohol and control. The co-expression analysis identified 14 modules and characterized four expression patterns across the four conditions. The four expression patterns include up-regulation in no restraint stress and given an ethanol injection (NOE) but restoration in restraint stress followed by an ethanol injection (RSE; pattern 1), down-regulation in NOE but rescue in RSE (pattern 2), up-regulation in both restraint stress followed by a saline injection (RSS) and NOE, and further amplification in RSE (pattern 3), and up-regulation in RSS but reduction in both NOE and RSE (pattern 4). We further identified four functional subnetworks by superimposing protein-protein interactions (PPIs) to the 14 co-expression modules, including γ-aminobutyric acid receptor (GABA) signaling, glutamate signaling, neuropeptide signaling, cAMP-dependent signaling. We further performed module specificity analysis to identify modules that are specific to stress, alcohol, or stress-induced alcohol responses. Finally, we conducted causality analysis to link genetic variation to these identified modules, and anxiety behaviors after stress and alcohol treatments. This study underscores the importance of integrative analysis and offers new insights into the molecular networks underlying stress and alcohol responses. PMID:29674951

  16. An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways

    PubMed Central

    Ander, Bradley P.; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R.; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. PMID:23844055

  17. An algorithm for direct causal learning of influences on patient outcomes.

    PubMed

    Rathnam, Chandramouli; Lee, Sanghoon; Jiang, Xia

    2017-01-01

    these three algorithms for this network type. However, when we use a more continuous measure of accuracy, we find that all the DCL methods are able to better partially predict more direct causes than FGS and CPC for the complex networks. In addition, DCL consistently had faster runtimes than the other algorithms. In the application to the real datasets, DCL identified rs6784615, located on the NISCH gene, and rs10824310, located on the PRKG1 gene, as direct causes of late onset Alzheimer's disease (LOAD) development. In addition, DCL identified ER category as a direct predictor of breast cancer mortality within 5 years, and HER2 status as a direct predictor of 10-year breast cancer mortality. These predictors have been identified in previous studies to have a direct causal relationship with their respective phenotypes, supporting the predictive power of DCL. When the other algorithms discovered predictors from the real datasets, these predictors were either also found by DCL or could not be supported by previous studies. Our results show that DCL outperforms FGS, PC, CPC, and FCI in almost every case, demonstrating its potential to advance causal learning. Furthermore, our DCL algorithm effectively identifies direct causes in the LOAD and Metabric GWAS datasets, which indicates its potential for clinical applications. Copyright © 2016 Elsevier B.V. All rights reserved.

  18. Causal discovery and inference: concepts and recent methodological advances.

    PubMed

    Spirtes, Peter; Zhang, Kun

    This paper aims to give a broad coverage of central concepts and principles involved in automated causal inference and emerging approaches to causal discovery from i.i.d data and from time series. After reviewing concepts including manipulations, causal models, sample predictive modeling, causal predictive modeling, and structural equation models, we present the constraint-based approach to causal discovery, which relies on the conditional independence relationships in the data, and discuss the assumptions underlying its validity. We then focus on causal discovery based on structural equations models, in which a key issue is the identifiability of the causal structure implied by appropriately defined structural equation models: in the two-variable case, under what conditions (and why) is the causal direction between the two variables identifiable? We show that the independence between the error term and causes, together with appropriate structural constraints on the structural equation, makes it possible. Next, we report some recent advances in causal discovery from time series. Assuming that the causal relations are linear with nonGaussian noise, we mention two problems which are traditionally difficult to solve, namely causal discovery from subsampled data and that in the presence of confounding time series. Finally, we list a number of open questions in the field of causal discovery and inference.

  19. Stable Causal Relationships Are Better Causal Relationships.

    PubMed

    Vasilyeva, Nadya; Blanchard, Thomas; Lombrozo, Tania

    2018-05-01

    We report three experiments investigating whether people's judgments about causal relationships are sensitive to the robustness or stability of such relationships across a range of background circumstances. In Experiment 1, we demonstrate that people are more willing to endorse causal and explanatory claims based on stable (as opposed to unstable) relationships, even when the overall causal strength of the relationship is held constant. In Experiment 2, we show that this effect is not driven by a causal generalization's actual scope of application. In Experiment 3, we offer evidence that stable causal relationships may be seen as better guides to action. Collectively, these experiments document a previously underappreciated factor that shapes people's causal reasoning: the stability of the causal relationship. Copyright © 2018 Cognitive Science Society, Inc.

  20. Biological interpretation of genome-wide association studies using predicted gene functions.

    PubMed

    Pers, Tune H; Karjalainen, Juha M; Chan, Yingleong; Westra, Harm-Jan; Wood, Andrew R; Yang, Jian; Lui, Julian C; Vedantam, Sailaja; Gustafsson, Stefan; Esko, Tonu; Frayling, Tim; Speliotes, Elizabeth K; Boehnke, Michael; Raychaudhuri, Soumya; Fehrmann, Rudolf S N; Hirschhorn, Joel N; Franke, Lude

    2015-01-19

    The main challenge for gaining biological insights from genetic associations is identifying which genes and pathways explain the associations. Here we present DEPICT, an integrative tool that employs predicted gene functions to systematically prioritize the most likely causal genes at associated loci, highlight enriched pathways and identify tissues/cell types where genes from associated loci are highly expressed. DEPICT is not limited to genes with established functions and prioritizes relevant gene sets for many phenotypes.

  1. Candidate genes for panhypopituitarism identified by gene expression profiling

    PubMed Central

    Mortensen, Amanda H.; MacDonald, James W.; Ghosh, Debashis

    2011-01-01

    Mutations in the transcription factors PROP1 and PIT1 (POU1F1) lead to pituitary hormone deficiency and hypopituitarism in mice and humans. The dysmorphology of developing Prop1 mutant pituitaries readily distinguishes them from those of Pit1 mutants and normal mice. This and other features suggest that Prop1 controls the expression of genes besides Pit1 that are important for pituitary cell migration, survival, and differentiation. To identify genes involved in these processes we used microarray analysis of gene expression to compare pituitary RNA from newborn Prop1 and Pit1 mutants and wild-type littermates. Significant differences in gene expression were noted between each mutant and their normal littermates, as well as between Prop1 and Pit1 mutants. Otx2, a gene critical for normal eye and pituitary development in humans and mice, exhibited elevated expression specifically in Prop1 mutant pituitaries. We report the spatial and temporal regulation of Otx2 in normal mice and Prop1 mutants, and the results suggest Otx2 could influence pituitary development by affecting signaling from the ventral diencephalon and regulation of gene expression in Rathke's pouch. The discovery that Otx2 expression is affected by Prop1 deficiency provides support for our hypothesis that identifying molecular differences in mutants will contribute to understanding the molecular mechanisms that control pituitary organogenesis and lead to human pituitary disease. PMID:21828248

  2. Causality and headache triggers

    PubMed Central

    Turner, Dana P.; Smitherman, Todd A.; Martin, Vincent T.; Penzien, Donald B.; Houle, Timothy T.

    2013-01-01

    Objective The objective of this study was to explore the conditions necessary to assign causal status to headache triggers. Background The term “headache trigger” is commonly used to label any stimulus that is assumed to cause headaches. However, the assumptions required for determining if a given stimulus in fact has a causal-type relationship in eliciting headaches have not been explicated. Methods A synthesis and application of Rubin’s Causal Model is applied to the context of headache causes. From this application the conditions necessary to infer that one event (trigger) causes another (headache) are outlined using basic assumptions and examples from relevant literature. Results Although many conditions must be satisfied for a causal attribution, three basic assumptions are identified for determining causality in headache triggers: 1) constancy of the sufferer; 2) constancy of the trigger effect; and 3) constancy of the trigger presentation. A valid evaluation of a potential trigger’s effect can only be undertaken once these three basic assumptions are satisfied during formal or informal studies of headache triggers. Conclusions Evaluating these assumptions is extremely difficult or infeasible in clinical practice, and satisfying them during natural experimentation is unlikely. Researchers, practitioners, and headache sufferers are encouraged to avoid natural experimentation to determine the causal effects of headache triggers. Instead, formal experimental designs or retrospective diary studies using advanced statistical modeling techniques provide the best approaches to satisfy the required assumptions and inform causal statements about headache triggers. PMID:23534872

  3. Influence of Resting Venous Blood Volume Fraction on Dynamic Causal Modeling and System Identifiability

    PubMed Central

    Hu, Zhenghui; Ni, Pengyu; Wan, Qun; Zhang, Yan; Shi, Pengcheng; Lin, Qiang

    2016-01-01

    Changes in BOLD signals are sensitive to the regional blood content associated with the vasculature, which is known as V0 in hemodynamic models. In previous studies involving dynamic causal modeling (DCM) which embodies the hemodynamic model to invert the functional magnetic resonance imaging signals into neuronal activity, V0 was arbitrarily set to a physiolog-ically plausible value to overcome the ill-posedness of the inverse problem. It is interesting to investigate how the V0 value influences DCM. In this study we addressed this issue by using both synthetic and real experiments. The results show that the ability of DCM analysis to reveal information about brain causality depends critically on the assumed V0 value used in the analysis procedure. The choice of V0 value not only directly affects the strength of system connections, but more importantly also affects the inferences about the network architecture. Our analyses speak to a possible refinement of how the hemody-namic process is parameterized (i.e., by making V0 a free parameter); however, the conditional dependencies induced by a more complex model may create more problems than they solve. Obtaining more realistic V0 information in DCM can improve the identifiability of the system and would provide more reliable inferences about the properties of brain connectivity. PMID:27389074

  4. Influence of Resting Venous Blood Volume Fraction on Dynamic Causal Modeling and System Identifiability.

    PubMed

    Hu, Zhenghui; Ni, Pengyu; Wan, Qun; Zhang, Yan; Shi, Pengcheng; Lin, Qiang

    2016-07-08

    Changes in BOLD signals are sensitive to the regional blood content associated with the vasculature, which is known as V0 in hemodynamic models. In previous studies involving dynamic causal modeling (DCM) which embodies the hemodynamic model to invert the functional magnetic resonance imaging signals into neuronal activity, V0 was arbitrarily set to a physiolog-ically plausible value to overcome the ill-posedness of the inverse problem. It is interesting to investigate how the V0 value influences DCM. In this study we addressed this issue by using both synthetic and real experiments. The results show that the ability of DCM analysis to reveal information about brain causality depends critically on the assumed V0 value used in the analysis procedure. The choice of V0 value not only directly affects the strength of system connections, but more importantly also affects the inferences about the network architecture. Our analyses speak to a possible refinement of how the hemody-namic process is parameterized (i.e., by making V0 a free parameter); however, the conditional dependencies induced by a more complex model may create more problems than they solve. Obtaining more realistic V0 information in DCM can improve the identifiability of the system and would provide more reliable inferences about the properties of brain connectivity.

  5. Ectodermal dysplasias: A clinical classification and a causal review

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pinheiro, M.; Freire-Maia, N.

    1994-11-01

    The authors present a causal review of 154 ectodermal dysplasias (EDs) as classified into 11 clinical subgroups. The number of EDs in each subgroup varies from one to 43. The numbers of conditions due to autosomal dominant, autosomal recessive, and X-linked genes are, respectively, 41, 52, and 8. In 53 conditions cause is unknown; 35 of them present some causal (genetic) suggestion.

  6. Causal Networks or Causal Islands? The Representation of Mechanisms and the Transitivity of Causal Judgment

    ERIC Educational Resources Information Center

    Johnson, Samuel G. B.; Ahn, Woo-kyoung

    2015-01-01

    Knowledge of mechanisms is critical for causal reasoning. We contrasted two possible organizations of causal knowledge--an interconnected causal "network," where events are causally connected without any boundaries delineating discrete mechanisms; or a set of disparate mechanisms--causal "islands"--such that events in different…

  7. Biological interpretation of genome-wide association studies using predicted gene functions

    PubMed Central

    Pers, Tune H.; Karjalainen, Juha M.; Chan, Yingleong; Westra, Harm-Jan; Wood, Andrew R.; Yang, Jian; Lui, Julian C.; Vedantam, Sailaja; Gustafsson, Stefan; Esko, Tonu; Frayling, Tim; Speliotes, Elizabeth K.; Boehnke, Michael; Raychaudhuri, Soumya; Fehrmann, Rudolf S.N.; Hirschhorn, Joel N.; Franke, Lude

    2015-01-01

    The main challenge for gaining biological insights from genetic associations is identifying which genes and pathways explain the associations. Here we present DEPICT, an integrative tool that employs predicted gene functions to systematically prioritize the most likely causal genes at associated loci, highlight enriched pathways and identify tissues/cell types where genes from associated loci are highly expressed. DEPICT is not limited to genes with established functions and prioritizes relevant gene sets for many phenotypes. PMID:25597830

  8. Causal Networks or Causal Islands? The Representation of Mechanisms and the Transitivity of Causal Judgment.

    PubMed

    Johnson, Samuel G B; Ahn, Woo-kyoung

    2015-09-01

    Knowledge of mechanisms is critical for causal reasoning. We contrasted two possible organizations of causal knowledge—an interconnected causal network, where events are causally connected without any boundaries delineating discrete mechanisms; or a set of disparate mechanisms—causal islands—such that events in different mechanisms are not thought to be related even when they belong to the same causal chain. To distinguish these possibilities, we tested whether people make transitive judgments about causal chains by inferring, given A causes B and B causes C, that A causes C. Specifically, causal chains schematized as one chunk or mechanism in semantic memory (e.g., exercising, becoming thirsty, drinking water) led to transitive causal judgments. On the other hand, chains schematized as multiple chunks (e.g., having sex, becoming pregnant, becoming nauseous) led to intransitive judgments despite strong intermediate links ((Experiments 1-3). Normative accounts of causal intransitivity could not explain these intransitive judgments (Experiments 4 and 5). Copyright © 2015 Cognitive Science Society, Inc.

  9. Molecular genetic and functional characterization implicate muscle-restricted coiled-coil gene (MURC) as a causal gene for familial dilated cardiomyopathy.

    PubMed

    Rodriguez, Gabriela; Ueyama, Tomomi; Ogata, Takehiro; Czernuszewicz, Grazyna; Tan, Yanli; Dorn, Gerald W; Bogaev, Roberta; Amano, Katsuya; Oh, Hidemasa; Matsubara, Hiroaki; Willerson, James T; Marian, Ali J

    2011-08-01

    Dilated cardiomyopathy (DCM) and hypertrophic cardiomyopathy (HCM) are classic forms of systolic and diastolic heart failure, respectively. Mutations in genes encoding sarcomere and cytoskeletal proteins are major causes of HCM and DCM. MURC, encoding muscle-restricted coiled-coil, a Z-line protein, regulates cardiac function in mice. We investigated potential causal role of MURC in human cardiomyopathies. We sequenced MURC in 1199 individuals, including 383 probands with DCM, 307 with HCM, and 509 healthy control subjects. We found 6 heterozygous DCM-specific missense variants (p.N128K, p.R140W, p.L153P, p.S307T, p.P324L, and p.S364L) in 8 unrelated probands. Variants p.N128K and p.S307T segregated with inheritance of DCM in small families (χ(2)=8.5, P=0.003). Variants p.N128K, p.R140W, p.L153P, and p.S364L were considered probably or possibly damaging. Variant p.P324L recurred in 3 independent probands, including 1 proband with a TPM1 mutation (p.M245T). A deletion variant (p.L232-R238del) was present in 3 unrelated HCM probands, but it did not segregate with HCM in a family who also had a MYH7 mutation (p.L907V). The phenotype in mutation carriers was notable for progressive heart failure leading to heart transplantation in 4 patients, conduction defects, and atrial arrhythmias. Expression of mutant MURC proteins in neonatal rat cardiac myocytes transduced with recombinant adenoviruses was associated with reduced RhoA activity, lower mRNA levels of hypertrophic markers and smaller myocyte size as compared with wild-type MURC. MURC mutations impart loss-of-function effects on MURC functions and probably are causal variants in human DCM. The causal role of a deletion mutation in HCM is uncertain.

  10. Causal inference in biology networks with integrated belief propagation.

    PubMed

    Chang, Rui; Karr, Jonathan R; Schadt, Eric E

    2015-01-01

    Inferring causal relationships among molecular and higher order phenotypes is a critical step in elucidating the complexity of living systems. Here we propose a novel method for inferring causality that is no longer constrained by the conditional dependency arguments that limit the ability of statistical causal inference methods to resolve causal relationships within sets of graphical models that are Markov equivalent. Our method utilizes Bayesian belief propagation to infer the responses of perturbation events on molecular traits given a hypothesized graph structure. A distance measure between the inferred response distribution and the observed data is defined to assess the 'fitness' of the hypothesized causal relationships. To test our algorithm, we infer causal relationships within equivalence classes of gene networks in which the form of the functional interactions that are possible are assumed to be nonlinear, given synthetic microarray and RNA sequencing data. We also apply our method to infer causality in real metabolic network with v-structure and feedback loop. We show that our method can recapitulate the causal structure and recover the feedback loop only from steady-state data which conventional method cannot.

  11. Identifying potential maternal genes of Bombyx mori using digital gene expression profiling

    PubMed Central

    Xu, Pingzhen

    2018-01-01

    Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to detect the changing situation of gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these material genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160

  12. Detectability of Granger causality for subsampled continuous-time neurophysiological processes.

    PubMed

    Barnett, Lionel; Seth, Anil K

    2017-01-01

    Granger causality is well established within the neurosciences for inference of directed functional connectivity from neurophysiological data. These data usually consist of time series which subsample a continuous-time biophysiological process. While it is well known that subsampling can lead to imputation of spurious causal connections where none exist, less is known about the effects of subsampling on the ability to reliably detect causal connections which do exist. We present a theoretical analysis of the effects of subsampling on Granger-causal inference. Neurophysiological processes typically feature signal propagation delays on multiple time scales; accordingly, we base our analysis on a distributed-lag, continuous-time stochastic model, and consider Granger causality in continuous time at finite prediction horizons. Via exact analytical solutions, we identify relationships among sampling frequency, underlying causal time scales and detectability of causalities. We reveal complex interactions between the time scale(s) of neural signal propagation and sampling frequency. We demonstrate that detectability decays exponentially as the sample time interval increases beyond causal delay times, identify detectability "black spots" and "sweet spots", and show that downsampling may potentially improve detectability. We also demonstrate that the invariance of Granger causality under causal, invertible filtering fails at finite prediction horizons, with particular implications for inference of Granger causality from fMRI data. Our analysis emphasises that sampling rates for causal analysis of neurophysiological time series should be informed by domain-specific time scales, and that state-space modelling should be preferred to purely autoregressive modelling. On the basis of a very general model that captures the structure of neurophysiological processes, we are able to help identify confounds, and offer practical insights, for successful detection of causal connectivity

  13. Genome-Wide Association Study in African Americans with Acute Respiratory Distress Syndrome Identifies the Selectin P Ligand Gene as a Risk Factor.

    PubMed

    Bime, Christian; Pouladi, Nima; Sammani, Saad; Batai, Ken; Casanova, Nancy; Zhou, Tong; Kempf, Carrie L; Sun, Xiaoguang; Camp, Sara M; Wang, Ting; Kittles, Rick A; Lussier, Yves A; Jones, Tiffanie K; Reilly, John P; Meyer, Nuala J; Christie, Jason D; Karnes, Jason H; Gonzalez-Garay, Manuel; Christiani, David C; Yates, Charles R; Wurfel, Mark M; Meduri, Gianfranco U; Garcia, Joe G N

    2018-06-01

    Genetic factors are involved in acute respiratory distress syndrome (ARDS) susceptibility. Identification of novel candidate genes associated with increased risk and severity will improve our understanding of ARDS pathophysiology and enhance efforts to develop novel preventive and therapeutic approaches. To identify genetic susceptibility targets for ARDS. A genome-wide association study was performed on 232 African American patients with ARDS and 162 at-risk control subjects. The Identify Candidate Causal SNPs and Pathways platform was used to infer the association of known gene sets with the top prioritized intragenic SNPs. Preclinical validation of SELPLG (selectin P ligand gene) was performed using mouse models of LPS- and ventilator-induced lung injury. Exonic variation within SELPLG distinguishing patients with ARDS from sepsis control subjects was confirmed in an independent cohort. Pathway prioritization analysis identified a nonsynonymous coding SNP (rs2228315) within SELPLG, encoding P-selectin glycoprotein ligand 1, to be associated with increased susceptibility. In an independent cohort, two exonic SELPLG SNPs were significantly associated with ARDS susceptibility. Additional support for SELPLG as an ARDS candidate gene was derived from preclinical ARDS models where SELPLG gene expression in lung tissues was significantly increased in both ventilator-induced (twofold increase) and LPS-induced (5.7-fold increase) murine lung injury models compared with controls. Furthermore, Selplg -/- mice exhibited significantly reduced LPS-induced inflammatory lung injury compared with wild-type C57/B6 mice. Finally, an antibody that neutralizes P-selectin glycoprotein ligand 1 significantly attenuated LPS-induced lung inflammation. These findings identify SELPLG as a novel ARDS susceptibility gene among individuals of European and African descent.

  14. Exploratory Causal Analysis in Bivariate Time Series Data

    NASA Astrophysics Data System (ADS)

    McCracken, James M.

    Many scientific disciplines rely on observational data of systems for which it is difficult (or impossible) to implement controlled experiments and data analysis techniques are required for identifying causal information and relationships directly from observational data. This need has lead to the development of many different time series causality approaches and tools including transfer entropy, convergent cross-mapping (CCM), and Granger causality statistics. In this thesis, the existing time series causality method of CCM is extended by introducing a new method called pairwise asymmetric inference (PAI). It is found that CCM may provide counter-intuitive causal inferences for simple dynamics with strong intuitive notions of causality, and the CCM causal inference can be a function of physical parameters that are seemingly unrelated to the existence of a driving relationship in the system. For example, a CCM causal inference might alternate between ''voltage drives current'' and ''current drives voltage'' as the frequency of the voltage signal is changed in a series circuit with a single resistor and inductor. PAI is introduced to address both of these limitations. Many of the current approaches in the times series causality literature are not computationally straightforward to apply, do not follow directly from assumptions of probabilistic causality, depend on assumed models for the time series generating process, or rely on embedding procedures. A new approach, called causal leaning, is introduced in this work to avoid these issues. The leaning is found to provide causal inferences that agree with intuition for both simple systems and more complicated empirical examples, including space weather data sets. The leaning may provide a clearer interpretation of the results than those from existing time series causality tools. A practicing analyst can explore the literature to find many proposals for identifying drivers and causal connections in times series data

  15. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    PubMed

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  16. Causal imprinting in causal structure learning.

    PubMed

    Taylor, Eric G; Ahn, Woo-Kyoung

    2012-11-01

    Suppose one observes a correlation between two events, B and C, and infers that B causes C. Later one discovers that event A explains away the correlation between B and C. Normatively, one should now dismiss or weaken the belief that B causes C. Nonetheless, participants in the current study who observed a positive contingency between B and C followed by evidence that B and C were independent given A, persisted in believing that B causes C. The authors term this difficulty in revising initially learned causal structures "causal imprinting." Throughout four experiments, causal imprinting was obtained using multiple dependent measures and control conditions. A Bayesian analysis showed that causal imprinting may be normative under some conditions, but causal imprinting also occurred in the current study when it was clearly non-normative. It is suggested that causal imprinting occurs due to the influence of prior knowledge on how reasoners interpret later evidence. Consistent with this view, when participants first viewed the evidence showing that B and C are independent given A, later evidence with only B and C did not lead to the belief that B causes C. Copyright © 2012 Elsevier Inc. All rights reserved.

  17. Causal Imprinting in Causal Structure Learning

    PubMed Central

    Taylor, Eric G.; Ahn, Woo-kyoung

    2012-01-01

    Suppose one observes a correlation between two events, B and C, and infers that B causes C. Later one discovers that event A explains away the correlation between B and C. Normatively, one should now dismiss or weaken the belief that B causes C. Nonetheless, participants in the current study who observed a positive contingency between B and C followed by evidence that B and C were independent given A, persisted in believing that B causes C. The authors term this difficulty in revising initially learned causal structures “causal imprinting.” Throughout four experiments, causal imprinting was obtained using multiple dependent measures and control conditions. A Bayesian analysis showed that causal imprinting may be normative under some conditions, but causal imprinting also occurred in the current study when it was clearly non-normative. It is suggested that causal imprinting occurs due to the influence of prior knowledge on how reasoners interpret later evidence. Consistent with this view, when participants first viewed the evidence showing that B and C are independent given A, later evidence with only B and C did not lead to the belief that B causes C. PMID:22859019

  18. Identifying gene networks underlying the neurobiology of ethanol and alcoholism.

    PubMed

    Wolen, Aaron R; Miles, Michael F

    2012-01-01

    For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.

  19. Defining a new candidate gene for amelogenesis imperfecta: from molecular genetics to biochemistry.

    PubMed

    Urzúa, Blanca; Ortega-Pinto, Ana; Morales-Bozo, Irene; Rojas-Alcayaga, Gonzalo; Cifuentes, Víctor

    2011-02-01

    Amelogenesis imperfecta is a group of genetic conditions that affect the structure and clinical appearance of tooth enamel. The types (hypoplastic, hypocalcified, and hypomature) are correlated with defects in different stages of the process of enamel synthesis. Autosomal dominant, recessive, and X-linked types have been previously described. These disorders are considered clinically and genetically heterogeneous in etiology, involving a variety of genes, such as AMELX, ENAM, DLX3, FAM83H, MMP-20, KLK4, and WDR72. The mutations identified within these causal genes explain less than half of all cases of amelogenesis imperfecta. Most of the candidate and causal genes currently identified encode proteins involved in enamel synthesis. We think it is necessary to refocus the search for candidate genes using biochemical processes. This review provides theoretical evidence that the human SLC4A4 gene (sodium bicarbonate cotransporter) may be a new candidate gene.

  20. Tools for Detecting Causality in Space Systems

    NASA Astrophysics Data System (ADS)

    Johnson, J.; Wing, S.

    2017-12-01

    Complex systems such as the solar and magnetospheric envivonment often exhibit patterns of behavior that suggest underlying organizing principles. Causality is a key organizing principle that is particularly difficult to establish in strongly coupled nonlinear systems, but essential for understanding and modeling the behavior of systems. While traditional methods of time-series analysis can identify linear correlations, they do not adequately quantify the distinction between causal and coincidental dependence. We discuss tools for detecting causality including: granger causality, transfer entropy, conditional redundancy, and convergent cross maps. The tools are illustrated by applications to magnetospheric and solar physics including radiation belt, Dst (a magnetospheric state variable), substorm, and solar cycle dynamics.

  1. Identifying key genes in glaucoma based on a benchmarked dataset and the gene regulatory network.

    PubMed

    Chen, Xi; Wang, Qiao-Ling; Zhang, Meng-Hui

    2017-10-01

    The current study aimed to identify key genes in glaucoma based on a benchmarked dataset and gene regulatory network (GRN). Local and global noise was added to the gene expression dataset to produce a benchmarked dataset. Differentially-expressed genes (DEGs) between patients with glaucoma and normal controls were identified utilizing the Linear Models for Microarray Data (Limma) package based on benchmarked dataset. A total of 5 GRN inference methods, including Zscore, GeneNet, context likelihood of relatedness (CLR) algorithm, Partial Correlation coefficient with Information Theory (PCIT) and GEne Network Inference with Ensemble of Trees (Genie3) were evaluated using receiver operating characteristic (ROC) and precision and recall (PR) curves. The interference method with the best performance was selected to construct the GRN. Subsequently, topological centrality (degree, closeness and betweenness) was conducted to identify key genes in the GRN of glaucoma. Finally, the key genes were validated by performing reverse transcription-quantitative polymerase chain reaction (RT-qPCR). A total of 176 DEGs were detected from the benchmarked dataset. The ROC and PR curves of the 5 methods were analyzed and it was determined that Genie3 had a clear advantage over the other methods; thus, Genie3 was used to construct the GRN. Following topological centrality analysis, 14 key genes for glaucoma were identified, including IL6 , EPHA2 and GSTT1 and 5 of these 14 key genes were validated by RT-qPCR. Therefore, the current study identified 14 key genes in glaucoma, which may be potential biomarkers to use in the diagnosis of glaucoma and aid in identifying the molecular mechanism of this disease.

  2. Diametrical clustering for identifying anti-correlated gene clusters.

    PubMed

    Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

    2003-09-01

    Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive-genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i). re-partitioning the genes and (ii). computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.

  3. Optimal causal inference: estimating stored information and approximating causal architecture.

    PubMed

    Still, Susanne; Crutchfield, James P; Ellison, Christopher J

    2010-09-01

    We introduce an approach to inferring the causal architecture of stochastic dynamical systems that extends rate-distortion theory to use causal shielding--a natural principle of learning. We study two distinct cases of causal inference: optimal causal filtering and optimal causal estimation. Filtering corresponds to the ideal case in which the probability distribution of measurement sequences is known, giving a principled method to approximate a system's causal structure at a desired level of representation. We show that in the limit in which a model-complexity constraint is relaxed, filtering finds the exact causal architecture of a stochastic dynamical system, known as the causal-state partition. From this, one can estimate the amount of historical information the process stores. More generally, causal filtering finds a graded model-complexity hierarchy of approximations to the causal architecture. Abrupt changes in the hierarchy, as a function of approximation, capture distinct scales of structural organization. For nonideal cases with finite data, we show how the correct number of the underlying causal states can be found by optimal causal estimation. A previously derived model-complexity control term allows us to correct for the effect of statistical fluctuations in probability estimates and thereby avoid overfitting.

  4. Amodal causal capture in the tunnel effect.

    PubMed

    Bae, Gi Yeul; Flombaum, Jonathan I

    2011-01-01

    In addition to identifying individual objects in the world, the visual system must also characterize the relationships between objects, for instance when objects occlude one another or cause one another to move. Here we explored the relationship between perceived causality and occlusion. Can one perceive causality in an occluded location? In several experiments, observers judged whether a centrally presented event involved a single object passing behind an occluder, or one object causally launching another (out of view and behind the occluder). With no additional context, the centrally presented event was typically judged as a non-causal pass, even when the occluding and disoccluding objects were different colors--an illusion known as the 'tunnel effect' that results from spatiotemporal continuity. However, when a synchronized context event involved an unambiguous causal launch, participants perceived a causal launch behind the occluder. This percept of an occluded causal interaction could also be driven by grouping and synchrony cues in the absence of any explicitly causal interaction. These results reinforce the hypothesis that causality is an aspect of perception. It is among the interpretations of the world that are independently available to vision when resolving ambiguity, and that the visual system can 'fill in' amodally.

  5. Rapid identification of causal mutations in tomato EMS populations via mapping-by-sequencing.

    PubMed

    Garcia, Virginie; Bres, Cécile; Just, Daniel; Fernandez, Lucie; Tai, Fabienne Wong Jun; Mauxion, Jean-Philippe; Le Paslier, Marie-Christine; Bérard, Aurélie; Brunel, Dominique; Aoki, Koh; Alseekh, Saleh; Fernie, Alisdair R; Fraser, Paul D; Rothan, Christophe

    2016-12-01

    The tomato is the model species of choice for fleshy fruit development and for the Solanaceae family. Ethyl methanesulfonate (EMS) mutants of tomato have already proven their utility for analysis of gene function in plants, leading to improved breeding stocks and superior tomato varieties. However, until recently, the identification of causal mutations that underlie particular phenotypes has been a very lengthy task that many laboratories could not afford because of spatial and technical limitations. Here, we describe a simple protocol for identifying causal mutations in tomato using a mapping-by-sequencing strategy. Plants displaying phenotypes of interest are first isolated by screening an EMS mutant collection generated in the miniature cultivar Micro-Tom. A recombinant F 2 population is then produced by crossing the mutant with a wild-type (WT; non-mutagenized) genotype, and F 2 segregants displaying the same phenotype are subsequently pooled. Finally, whole-genome sequencing and analysis of allele distributions in the pools allow for the identification of the causal mutation. The whole process, from the isolation of the tomato mutant to the identification of the causal mutation, takes 6-12 months. This strategy overcomes many previous limitations, is simple to use and can be applied in most laboratories with limited facilities for plant culture and genotyping.

  6. Graphical modeling of gene expression in monocytes suggests molecular mechanisms explaining increased atherosclerosis in smokers.

    PubMed

    Verdugo, Ricardo A; Zeller, Tanja; Rotival, Maxime; Wild, Philipp S; Münzel, Thomas; Lackner, Karl J; Weidmann, Henri; Ninio, Ewa; Trégouët, David-Alexandre; Cambien, François; Blankenberg, Stefan; Tiret, Laurence

    2013-01-01

    Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path "smoking→gene expression→plaques". Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the "smoking→gene expression→plaques" causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts. Inspection of

  7. Graphical Modeling of Gene Expression in Monocytes Suggests Molecular Mechanisms Explaining Increased Atherosclerosis in Smokers

    PubMed Central

    Verdugo, Ricardo A.; Zeller, Tanja; Rotival, Maxime; Wild, Philipp S.; Münzel, Thomas; Lackner, Karl J.; Weidmann, Henri; Ninio, Ewa; Trégouët, David-Alexandre; Cambien, François; Blankenberg, Stefan; Tiret, Laurence

    2013-01-01

    Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path “smoking→gene expression→plaques”. Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the “smoking→gene expression→plaques” causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts

  8. ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

    PubMed Central

    Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

    2012-01-01

    Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617

  9. Causality

    NASA Astrophysics Data System (ADS)

    Pearl, Judea

    2000-03-01

    Written by one of the pre-eminent researchers in the field, this book provides a comprehensive exposition of modern analysis of causation. It shows how causality has grown from a nebulous concept into a mathematical theory with significant applications in the fields of statistics, artificial intelligence, philosophy, cognitive science, and the health and social sciences. Pearl presents a unified account of the probabilistic, manipulative, counterfactual and structural approaches to causation, and devises simple mathematical tools for analyzing the relationships between causal connections, statistical associations, actions and observations. The book will open the way for including causal analysis in the standard curriculum of statistics, artifical intelligence, business, epidemiology, social science and economics. Students in these areas will find natural models, simple identification procedures, and precise mathematical definitions of causal concepts that traditional texts have tended to evade or make unduly complicated. This book will be of interest to professionals and students in a wide variety of fields. Anyone who wishes to elucidate meaningful relationships from data, predict effects of actions and policies, assess explanations of reported events, or form theories of causal understanding and causal speech will find this book stimulating and invaluable.

  10. Molecular Genetic and Functional Characterization Implicate Muscle-Restricted Coiled-Coil Gene (MURC) as a Causal Gene for Familial Dilated Cardiomyopathy

    PubMed Central

    Rodriguez, Gabriela; Ueyama, Tomomi; Ogata, Takehiro; Czernuszewicz', Grazyna; Tan, Yanli; Dorn, Gerald W.; Bogaev, Roberta; Amano, Katsuya; Oh, Hidemasa; Matsubara, Hiroaki; Willerson, James T.; Marian, Ali J.

    2011-01-01

    Background Dilated cardiomyopathy (DCM) and hypertrophic cardiomyopathy (HCM) are classic forms of systolic and diastolic heart failure, respectively. Mutations in genes encoding sarcomere and cytoskeletal proteins are major causes of HCM and DCM. MURC, encoding muscle-restricted coiled-coil, a Z line protein, regulates cardiac function in mice. We investigated potential causal role of MURC in human cardiomyopathies. Methods and Results We sequenced MURC in 1,199 individuals including 383 probands with DCM, 307 with HCM and 509 healthy controls. We found six heterozygous DCM-specific missense variants (p.N128K, p.R140W, p.L153P, p.S307T, p.P324L and p.S364L) in eight unrelated probands. Variants p.N128K and p.S307T segregated with inheritance of DCM in small families (χ2=8.5, p=0.003). Variants p.N128K, p.R140W, p.L153P and p.S364L were considered probably or possibly damaging. Variant p.P324L recurred in three independent probands, including one proband with a TPM1 mutation (p.M245T). A deletion variant (p.L232-R238del) was present in three unrelated HCM probands but it did not segregate with HCM in a family who also had a MYH7 mutation (p.L970V). The phenotype in mutation carriers was notable for progressive heart failure leading to heart transplantation in four patients, conduction defects and atrial arrhythmias. Expression of mutant MURC proteins in neonatal rat cardiac myocytes transduced with recombinant adenoviruses was associated with reduced RhoA activity, lower mRNA levels of hypertrophic markers and smaller myocyte size as compared to wild type MURC. Conclusions MURC mutations impart loss-of-function effects on MURC functions and are likely causal variants in human DCM. The causal role of a deletion mutation in HCM is uncertain. PMID:21642240

  11. Links between causal effects and causal association for surrogacy evaluation in a gaussian setting.

    PubMed

    Conlon, Anna; Taylor, Jeremy; Li, Yun; Diaz-Ordaz, Karla; Elliott, Michael

    2017-11-30

    Two paradigms for the evaluation of surrogate markers in randomized clinical trials have been proposed: the causal effects paradigm and the causal association paradigm. Each of these paradigms rely on assumptions that must be made to proceed with estimation and to validate a candidate surrogate marker (S) for the true outcome of interest (T). We consider the setting in which S and T are Gaussian and are generated from structural models that include an unobserved confounder. Under the assumed structural models, we relate the quantities used to evaluate surrogacy within both the causal effects and causal association frameworks. We review some of the common assumptions made to aid in estimating these quantities and show that assumptions made within one framework can imply strong assumptions within the alternative framework. We demonstrate that there is a similarity, but not exact correspondence between the quantities used to evaluate surrogacy within each framework, and show that the conditions for identifiability of the surrogacy parameters are different from the conditions, which lead to a correspondence of these quantities. Copyright © 2017 John Wiley & Sons, Ltd.

  12. Diversity of the causal genes in hearing impaired Algerian individuals identified by whole exome sequencing

    PubMed Central

    Ammar-Khodja, Fatima; Bonnet, Crystel; Dahmani, Malika; Ouhab, Sofiane; Lefèvre, Gaelle M; Ibrahim, Hassina; Hardelin, Jean-Pierre; Weil, Dominique; Louha, Malek; Petit, Christine

    2015-01-01

    The genetic heterogeneity of congenital hearing disorders makes molecular diagnosis expensive and time-consuming using conventional techniques such as Sanger sequencing of DNA. In order to design an appropriate strategy of molecular diagnosis in the Algerian population, we explored the diversity of the involved mutations by studying 65 families affected by autosomal recessive forms of nonsyndromic hearing impairment (DFNB forms), which are the most prevalent early onset forms. We first carried out a systematic screening for mutations in GJB2 and the recurrent p.(Arg34*) mutation in TMC1, which were found in 31 (47.7%) families and 1 (1.5%) family, respectively. We then performed whole exome sequencing in nine of the remaining families, and identified the causative mutations in all the patients analyzed, either in the homozygous state (eight families) or in the compound heterozygous state (one family): (c.709C>T: p.(Arg237*)) and (c.2122C>T: p.(Arg708*)) in OTOF, (c.1334T>G: p.(Leu445Trp)) in SLC26A4, (c.764T>A: p.(Met255Lys)) in GIPC3, (c.518T>A: p.(Cys173Ser)) in LHFPL5, (c.5336T>C: p.(Leu1779Pro)) in MYO15A, (c.1807G>T: p.(Val603Phe)) in OTOA, (c.6080dup: p.(Asn2027Lys*9)) in PTPRQ, and (c.6017del: p.(Gly2006Alafs*13); c.7188_7189ins14: p.(Val2397Leufs*2)) in GPR98. Notably, 7 of these 10 mutations affecting 8 different genes had not been reported previously. These results highlight for the first time the genetic heterogeneity of the early onset forms of nonsyndromic deafness in Algerian families. PMID:26029705

  13. EPS8, encoding an actin-binding protein of cochlear hair cell stereocilia, is a new causal gene for autosomal recessive profound deafness

    PubMed Central

    2014-01-01

    Background Almost 90% of all cases of congenital, non-syndromic, severe to profound inherited deafness display an autosomal recessive mode of transmission (DFNB forms). To date, 47 causal DFNB genes have been identified, but many others remain to be discovered. We report the study of two siblings born to consanguineous Algerian parents and affected by isolated, profound congenital deafness. Method Whole-exome sequencing was carried out on these patients after a failure to identify mutations in the DFNB genes frequently involved. Results A biallelic nonsense mutation, c.88C > T (p.Gln30*), was identified in EPS8 that encodes epidermal growth factor receptor pathway substrate 8, a 822 amino-acid protein involved in actin dynamics. This mutation predicts a truncated inactive protein or no protein at all. The mutation was also present, in the heterozygous state, in one clinically unaffected sibling and in both unaffected parents, and was absent from the other two unaffected siblings. It was not found in 120 Algerian normal hearing control individuals or in the Exome Variant Server database. EPS8 is an F-actin capping and bundling protein. Mutant mice lacking EPS8 (Eps8−/− mice), which is present in the hair bundle, the sensory antenna of the auditory sensory cells that operate the mechano-electrical transduction, are also profoundly deaf and have abnormally short hair bundle stereocilia. Conclusion This new DFNB form is likely to arise from abnormal hair bundles resulting in compromised detection of physiological sound pressures. PMID:24741995

  14. CAUSAL INFERENCE WITH A GRAPHICAL HIERARCHY OF INTERVENTIONS

    PubMed Central

    Shpitser, Ilya; Tchetgen, Eric Tchetgen

    2017-01-01

    Identifying causal parameters from observational data is fraught with subtleties due to the issues of selection bias and confounding. In addition, more complex questions of interest, such as effects of treatment on the treated and mediated effects may not always be identified even in data where treatment assignment is known and under investigator control, or may be identified under one causal model but not another. Increasingly complex effects of interest, coupled with a diversity of causal models in use resulted in a fragmented view of identification. This fragmentation makes it unnecessarily difficult to determine if a given parameter is identified (and in what model), and what assumptions must hold for this to be the case. This, in turn, complicates the development of estimation theory and sensitivity analysis procedures. In this paper, we give a unifying view of a large class of causal effects of interest, including novel effects not previously considered, in terms of a hierarchy of interventions, and show that identification theory for this large class reduces to an identification theory of random variables under interventions from this hierarchy. Moreover, we show that one type of intervention in the hierarchy is naturally associated with queries identified under the Finest Fully Randomized Causally Interpretable Structure Tree Graph (FFRCISTG) model of Robins (via the extended g-formula), and another is naturally associated with queries identified under the Non-Parametric Structural Equation Model with Independent Errors (NPSEM-IE) of Pearl, via a more general functional we call the edge g-formula. Our results motivate the study of estimation theory for the edge g-formula, since we show it arises both in mediation analysis, and in settings where treatment assignment has unobserved causes, such as models associated with Pearl’s front-door criterion. PMID:28919652

  15. CAUSAL INFERENCE WITH A GRAPHICAL HIERARCHY OF INTERVENTIONS.

    PubMed

    Shpitser, Ilya; Tchetgen, Eric Tchetgen

    2016-12-01

    Identifying causal parameters from observational data is fraught with subtleties due to the issues of selection bias and confounding. In addition, more complex questions of interest, such as effects of treatment on the treated and mediated effects may not always be identified even in data where treatment assignment is known and under investigator control, or may be identified under one causal model but not another. Increasingly complex effects of interest, coupled with a diversity of causal models in use resulted in a fragmented view of identification. This fragmentation makes it unnecessarily difficult to determine if a given parameter is identified (and in what model), and what assumptions must hold for this to be the case. This, in turn, complicates the development of estimation theory and sensitivity analysis procedures. In this paper, we give a unifying view of a large class of causal effects of interest, including novel effects not previously considered, in terms of a hierarchy of interventions, and show that identification theory for this large class reduces to an identification theory of random variables under interventions from this hierarchy. Moreover, we show that one type of intervention in the hierarchy is naturally associated with queries identified under the Finest Fully Randomized Causally Interpretable Structure Tree Graph (FFRCISTG) model of Robins (via the extended g-formula), and another is naturally associated with queries identified under the Non-Parametric Structural Equation Model with Independent Errors (NPSEM-IE) of Pearl, via a more general functional we call the edge g-formula. Our results motivate the study of estimation theory for the edge g-formula, since we show it arises both in mediation analysis, and in settings where treatment assignment has unobserved causes, such as models associated with Pearl's front-door criterion.

  16. Causality Analysis: Identifying the Leading Element in a Coupled Dynamical System

    PubMed Central

    BozorgMagham, Amir E.; Motesharrei, Safa; Penny, Stephen G.; Kalnay, Eugenia

    2015-01-01

    Physical systems with time-varying internal couplings are abundant in nature. While the full governing equations of these systems are typically unknown due to insufficient understanding of their internal mechanisms, there is often interest in determining the leading element. Here, the leading element is defined as the sub-system with the largest coupling coefficient averaged over a selected time span. Previously, the Convergent Cross Mapping (CCM) method has been employed to determine causality and dominant component in weakly coupled systems with constant coupling coefficients. In this study, CCM is applied to a pair of coupled Lorenz systems with time-varying coupling coefficients, exhibiting switching between dominant sub-systems in different periods. Four sets of numerical experiments are carried out. The first three cases consist of different coupling coefficient schemes: I) Periodic–constant, II) Normal, and III) Mixed Normal/Non-normal. In case IV, numerical experiment of cases II and III are repeated with imposed temporal uncertainties as well as additive normal noise. Our results show that, through detecting directional interactions, CCM identifies the leading sub-system in all cases except when the average coupling coefficients are approximately equal, i.e., when the dominant sub-system is not well defined. PMID:26125157

  17. LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network.

    PubMed

    Wei, Pi-Jing; Zhang, Di; Xia, Junfeng; Zheng, Chun-Hou

    2016-12-23

    Cancer is a complex disease which is characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of the next-generation sequencing technology, multiple omics data, such as cancer genomic, epigenomic and transcriptomic data etc., can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contributes to tumorigenesis, from millions of functional neutral passenger mutations. In this paper, in order to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. Then we integrated the mutation data and expression data into the gene-gene interaction network. Lastly, greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. Also, it can identify not only frequently mutated drivers, but also rare candidate driver genes.

  18. Maternal age at first birth and offspring criminality: Using the children-of-twins design to test causal hypotheses

    PubMed Central

    Coyne, Claire A; Långström, Niklas; Rickert, Martin E; Lichtenstein, Paul; D’Onofrio, Brian M

    2013-01-01

    Teenage childbirth is a risk factor for poor offspring outcomes, particularly offspring antisocial behaviour. It is not clear if maternal age at first birth (MAFB) is causally associated with offspring antisocial behavior or if this association is due to selection factors that influence both the likelihood that a young woman gives birth early and that her offspring engage in antisocial behavior. The current study addresses the limitations of previous research by using longitudinal data from Swedish national registries and children-of-siblings and children-of-twins comparisons to identify the extent to which the association between MAFB and offspring criminal convictions is consistent with a causal influence and confounded by genetic or environmental factors that make cousins similar. We found offspring born to mothers who began childbearing earlier were more likely to be convicted of a crime than offspring born to mothers who delayed childbearing. The results from comparisons of differentially exposed cousins, especially born to discordant MZ twin sisters, provide support for a causal association between MAFB and offspring criminal convictions. The analyses also found little evidence for genetic confounding due to passive gene-environment correlation. Future studies are needed to replicate these findings and to identify environmental risk factors that mediate this causal association. PMID:23398750

  19. Maternal age at first birth and offspring criminality: using the children of twins design to test causal hypotheses.

    PubMed

    Coyne, Claire A; Långström, Niklas; Rickert, Martin E; Lichtenstein, Paul; D'Onofrio, Brian M

    2013-02-01

    Teenage childbirth is a risk factor for poor offspring outcomes, particularly offspring antisocial behavior. It is not clear, however, if maternal age at first birth (MAFB) is causally associated with offspring antisocial behavior or if this association is due to selection factors that influence both the likelihood that a young woman gives birth early and that her offspring engage in antisocial behavior. The current study addresses the limitations of previous research by using longitudinal data from Swedish national registries and children of siblings and children of twins comparisons to identify the extent to which the association between MAFB and offspring criminal convictions is consistent with a causal influence and confounded by genetic or environmental factors that make cousins similar. We found offspring born to mothers who began childbearing earlier were more likely to be convicted of a crime than offspring born to mothers who delayed childbearing. The results from comparisons of differentially exposed cousins, especially born to discordant monozygotic twin sisters, provide support for a causal association between MAFB and offspring criminal convictions. The analyses also found little evidence for genetic confounding due to passive gene-environment correlation. Future studies are needed to replicate these findings and to identify environmental risk factors that mediate this causal association.

  20. Novel mutations in CRB1 gene identified in a chinese pedigree with retinitis pigmentosa by targeted capture and next generation sequencing

    PubMed Central

    Lo, David; Weng, Jingning; Liu, xiaohong; Yang, Juhua; He, Fen; Wang, Yun; Liu, Xuyang

    2016-01-01

    PURPOSE To detect the disease-causing gene in a Chinese pedigree with autosomal-recessive retinitis pigmentosa (ARRP). METHODS All subjects in this family underwent a complete ophthalmic examination. Targeted-capture next generation sequencing (NGS) was performed on the proband to detect variants. All variants were verified in the remaining family members by PCR amplification and Sanger sequencing. RESULTS All the affected subjects in this pedigree were diagnosed with retinitis pigmentosa (RP). The compound heterozygous c.138delA (p.Asp47IlefsX24) and c.1841G>T (p.Gly614Val) mutations in the Crumbs homolog 1 (CRB1) gene were identified in all the affected patients but not in the unaffected individuals in this family. These mutations were inherited from their parents, respectively. CONCLUSION The novel compound heterozygous mutations in CRB1 were identified in a Chinese pedigree with ARRP using targeted-capture next generation sequencing. After evaluating the significant heredity and impaired protein function, the compound heterozygous c.138delA (p.Asp47IlefsX24) and c.1841G>T (p.Gly614Val) mutations are the causal genes of early onset ARRP in this pedigree. To the best of our knowledge, there is no previous report regarding the compound mutations. PMID:27806333

  1. Identifying key genes associated with acute myocardial infarction.

    PubMed

    Cheng, Ming; An, Shoukuan; Li, Junquan

    2017-10-01

    This study aimed to identify key genes associated with acute myocardial infarction (AMI) by reanalyzing microarray data. Three gene expression profile datasets GSE66360, GSE34198, and GSE48060 were downloaded from GEO database. After data preprocessing, genes without heterogeneity across different platforms were subjected to differential expression analysis between the AMI group and the control group using metaDE package. P < .05 was used as the cutoff for a differentially expressed gene (DEG). The expression data matrices of DEGs were imported in ReactomeFIViz to construct a gene functional interaction (FI) network. Then, DEGs in each module were subjected to pathway enrichment analysis using DAVID. MiRNAs and transcription factors predicted to regulate target DEGs were identified. Quantitative real-time polymerase chain reaction (RT-PCR) was applied to verify the expression of genes. A total of 913 upregulated genes and 1060 downregulated genes were identified in the AMI group. A FI network consists of 21 modules and DEGs in 12 modules were significantly enriched in pathways. The transcription factor-miRNA-gene network contains 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p. RT-PCR validations showed that expression levels of FOXO3 and MYBL2 were significantly increased in AMI, and expression levels of hsa-miR-21-5p and hsa-miR-30c-5p were obviously decreased in AMI. A total of 41 DEGs, such as SOCS3, VAPA, and COL5A2, are speculated to have roles in the pathogenesis of AMI; 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p may be involved in the regulation of the expression of these DEGs.

  2. A genomic approach to identify hybrid incompatibility genes.

    PubMed

    Cooper, Jacob C; Phadnis, Nitin

    2016-07-02

    Uncovering the genetic and molecular basis of barriers to gene flow between populations is key to understanding how new species are born. Intrinsic postzygotic reproductive barriers such as hybrid sterility and hybrid inviability are caused by deleterious genetic interactions known as hybrid incompatibilities. The difficulty in identifying these hybrid incompatibility genes remains a rate-limiting step in our understanding of the molecular basis of speciation. We recently described how whole genome sequencing can be applied to identify hybrid incompatibility genes, even from genetically terminal hybrids. Using this approach, we discovered a new hybrid incompatibility gene, gfzf, between Drosophila melanogaster and Drosophila simulans, and found that it plays an essential role in cell cycle regulation. Here, we discuss the history of the hunt for incompatibility genes between these species, discuss the molecular roles of gfzf in cell cycle regulation, and explore how intragenomic conflict drives the evolution of fundamental cellular mechanisms that lead to the developmental arrest of hybrids.

  3. A genomic approach to identify hybrid incompatibility genes

    PubMed Central

    Cooper, Jacob C.; Phadnis, Nitin

    2016-01-01

    ABSTRACT Uncovering the genetic and molecular basis of barriers to gene flow between populations is key to understanding how new species are born. Intrinsic postzygotic reproductive barriers such as hybrid sterility and hybrid inviability are caused by deleterious genetic interactions known as hybrid incompatibilities. The difficulty in identifying these hybrid incompatibility genes remains a rate-limiting step in our understanding of the molecular basis of speciation. We recently described how whole genome sequencing can be applied to identify hybrid incompatibility genes, even from genetically terminal hybrids. Using this approach, we discovered a new hybrid incompatibility gene, gfzf, between Drosophila melanogaster and Drosophila simulans, and found that it plays an essential role in cell cycle regulation. Here, we discuss the history of the hunt for incompatibility genes between these species, discuss the molecular roles of gfzf in cell cycle regulation, and explore how intragenomic conflict drives the evolution of fundamental cellular mechanisms that lead to the developmental arrest of hybrids. PMID:27230814

  4. Effective connectivity: Influence, causality and biophysical modeling

    PubMed Central

    Valdes-Sosa, Pedro A.; Roebroeck, Alard; Daunizeau, Jean; Friston, Karl

    2011-01-01

    This is the final paper in a Comments and Controversies series dedicated to “The identification of interacting networks in the brain using fMRI: Model selection, causality and deconvolution”. We argue that discovering effective connectivity depends critically on state-space models with biophysically informed observation and state equations. These models have to be endowed with priors on unknown parameters and afford checks for model Identifiability. We consider the similarities and differences among Dynamic Causal Modeling, Granger Causal Modeling and other approaches. We establish links between past and current statistical causal modeling, in terms of Bayesian dependency graphs and Wiener–Akaike–Granger–Schweder influence measures. We show that some of the challenges faced in this field have promising solutions and speculate on future developments. PMID:21477655

  5. Literature mining, gene-set enrichment and pathway analysis for target identification in Behçet's disease.

    PubMed

    Wilson, Paul; Larminie, Christopher; Smith, Rona

    2016-01-01

    To use literature mining to catalogue Behçet's associated genes, and advanced computational methods to improve the understanding of the pathways and signalling mechanisms that lead to the typical clinical characteristics of Behçet's patients. To extend this technique to identify potential treatment targets for further experimental validation. Text mining methods combined with gene enrichment tools, pathway analysis and causal analysis algorithms. This approach identified 247 human genes associated with Behçet's disease and the resulting disease map, comprising 644 nodes and 19220 edges, captured important details of the relationships between these genes and their associated pathways, as described in diverse data repositories. Pathway analysis has identified how Behçet's associated genes are likely to participate in innate and adaptive immune responses. Causal analysis algorithms have identified a number of potential therapeutic strategies for further investigation. Computational methods have captured pertinent features of the prominent disease characteristics presented in Behçet's disease and have highlighted NOD2, ICOS and IL18 signalling as potential therapeutic strategies.

  6. Inferring causal relationships between phenotypes using summary statistics from genome-wide association studies.

    PubMed

    Meng, Xiang-He; Shen, Hui; Chen, Xiang-Ding; Xiao, Hong-Mei; Deng, Hong-Wen

    2018-03-01

    Genome-wide association studies (GWAS) have successfully identified numerous genetic variants associated with diverse complex phenotypes and diseases, and provided tremendous opportunities for further analyses using summary association statistics. Recently, Pickrell et al. developed a robust method for causal inference using independent putative causal SNPs. However, this method may fail to infer the causal relationship between two phenotypes when only a limited number of independent putative causal SNPs identified. Here, we extended Pickrell's method to make it more applicable for the general situations. We extended the causal inference method by replacing the putative causal SNPs with the lead SNPs (the set of the most significant SNPs in each independent locus) and tested the performance of our extended method using both simulation and empirical data. Simulations suggested that when the same number of genetic variants is used, our extended method had similar distribution of test statistic under the null model as well as comparable power under the causal model compared with the original method by Pickrell et al. But in practice, our extended method would generally be more powerful because the number of independent lead SNPs was often larger than the number of independent putative causal SNPs. And including more SNPs, on the other hand, would not cause more false positives. By applying our extended method to summary statistics from GWAS for blood metabolites and femoral neck bone mineral density (FN-BMD), we successfully identified ten blood metabolites that may causally influence FN-BMD. We extended a causal inference method for inferring putative causal relationship between two phenotypes using summary statistics from GWAS, and identified a number of potential causal metabolites for FN-BMD, which may provide novel insights into the pathophysiological mechanisms underlying osteoporosis.

  7. Identifying a gene expression signature of cluster headache in blood

    PubMed Central

    Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

    2017-01-01

    Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859

  8. Inferring Causalities in Landscape Genetics: An Extension of Wright's Causal Modeling to Distance Matrices.

    PubMed

    Fourtune, Lisa; Prunier, Jérôme G; Paz-Vinas, Ivan; Loot, Géraldine; Veyssière, Charlotte; Blanchet, Simon

    2018-04-01

    Identifying landscape features that affect functional connectivity among populations is a major challenge in fundamental and applied sciences. Landscape genetics combines landscape and genetic data to address this issue, with the main objective of disentangling direct and indirect relationships among an intricate set of variables. Causal modeling has strong potential to address the complex nature of landscape genetic data sets. However, this statistical approach was not initially developed to address the pairwise distance matrices commonly used in landscape genetics. Here, we aimed to extend the applicability of two causal modeling methods-that is, maximum-likelihood path analysis and the directional separation test-by developing statistical approaches aimed at handling distance matrices and improving functional connectivity inference. Using simulations, we showed that these approaches greatly improved the robustness of the absolute (using a frequentist approach) and relative (using an information-theoretic approach) fits of the tested models. We used an empirical data set combining genetic information on a freshwater fish species (Gobio occitaniae) and detailed landscape descriptors to demonstrate the usefulness of causal modeling to identify functional connectivity in wild populations. Specifically, we demonstrated how direct and indirect relationships involving altitude, temperature, and oxygen concentration influenced within- and between-population genetic diversity of G. occitaniae.

  9. Identifying key genes associated with acute myocardial infarction

    PubMed Central

    Cheng, Ming; An, Shoukuan; Li, Junquan

    2017-01-01

    Abstract Background: This study aimed to identify key genes associated with acute myocardial infarction (AMI) by reanalyzing microarray data. Methods: Three gene expression profile datasets GSE66360, GSE34198, and GSE48060 were downloaded from GEO database. After data preprocessing, genes without heterogeneity across different platforms were subjected to differential expression analysis between the AMI group and the control group using metaDE package. P < .05 was used as the cutoff for a differentially expressed gene (DEG). The expression data matrices of DEGs were imported in ReactomeFIViz to construct a gene functional interaction (FI) network. Then, DEGs in each module were subjected to pathway enrichment analysis using DAVID. MiRNAs and transcription factors predicted to regulate target DEGs were identified. Quantitative real-time polymerase chain reaction (RT-PCR) was applied to verify the expression of genes. Result: A total of 913 upregulated genes and 1060 downregulated genes were identified in the AMI group. A FI network consists of 21 modules and DEGs in 12 modules were significantly enriched in pathways. The transcription factor-miRNA-gene network contains 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p. RT-PCR validations showed that expression levels of FOXO3 and MYBL2 were significantly increased in AMI, and expression levels of hsa-miR-21–5p and hsa-miR-30c-5p were obviously decreased in AMI. Conclusion: A total of 41 DEGs, such as SOCS3, VAPA, and COL5A2, are speculated to have roles in the pathogenesis of AMI; 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p may be involved in the regulation of the expression of these DEGs. PMID:29049183

  10. Gene-based rare allele analysis identified a risk gene of Alzheimer's disease.

    PubMed

    Kim, Jong Hun; Song, Pamela; Lim, Hyunsun; Lee, Jae-Hyung; Lee, Jun Hong; Park, Sun Ah

    2014-01-01

    Alzheimer's disease (AD) has a strong propensity to run in families. However, the known risk genes excluding APOE are not clinically useful. In various complex diseases, gene studies have targeted rare alleles for unsolved heritability. Our study aims to elucidate previously unknown risk genes for AD by targeting rare alleles. We used data from five publicly available genetic studies from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and the database of Genotypes and Phenotypes (dbGaP). A total of 4,171 cases and 9,358 controls were included. The genotype information of rare alleles was imputed using 1,000 genomes. We performed gene-based analysis of rare alleles (minor allele frequency≤3%). The genome-wide significance level was defined as meta P<1.8×10(-6) (0.05/number of genes in human genome = 0.05/28,517). ZNF628, which is located at chromosome 19q13.42, showed a genome-wide significant association with AD. The association of ZNF628 with AD was not dependent on APOE ε4. APOE and TREM2 were also significantly associated with AD, although not at genome-wide significance levels. Other genes identified by targeting common alleles could not be replicated in our gene-based rare allele analysis. We identified that rare variants in ZNF628 are associated with AD. The protein encoded by ZNF628 is known as a transcription factor. Furthermore, the associations of APOE and TREM2 with AD were highly significant, even in gene-based rare allele analysis, which implies that further deep sequencing of these genes is required in AD heritability studies.

  11. [Causal analysis approaches in epidemiology].

    PubMed

    Dumas, O; Siroux, V; Le Moual, N; Varraso, R

    2014-02-01

    Epidemiological research is mostly based on observational studies. Whether such studies can provide evidence of causation remains discussed. Several causal analysis methods have been developed in epidemiology. This paper aims at presenting an overview of these methods: graphical models, path analysis and its extensions, and models based on the counterfactual approach, with a special emphasis on marginal structural models. Graphical approaches have been developed to allow synthetic representations of supposed causal relationships in a given problem. They serve as qualitative support in the study of causal relationships. The sufficient-component cause model has been developed to deal with the issue of multicausality raised by the emergence of chronic multifactorial diseases. Directed acyclic graphs are mostly used as a visual tool to identify possible confounding sources in a study. Structural equations models, the main extension of path analysis, combine a system of equations and a path diagram, representing a set of possible causal relationships. They allow quantifying direct and indirect effects in a general model in which several relationships can be tested simultaneously. Dynamic path analysis further takes into account the role of time. The counterfactual approach defines causality by comparing the observed event and the counterfactual event (the event that would have been observed if, contrary to the fact, the subject had received a different exposure than the one he actually received). This theoretical approach has shown limits of traditional methods to address some causality questions. In particular, in longitudinal studies, when there is time-varying confounding, classical methods (regressions) may be biased. Marginal structural models have been developed to address this issue. In conclusion, "causal models", though they were developed partly independently, are based on equivalent logical foundations. A crucial step in the application of these models is the

  12. The causal meaning of Fisher’s average effect

    PubMed Central

    LEE, JAMES J.; CHOW, CARSON C.

    2013-01-01

    Summary In order to formulate the Fundamental Theorem of Natural Selection, Fisher defined the average excess and average effect of a gene substitution. Finding these notions to be somewhat opaque, some authors have recommended reformulating Fisher’s ideas in terms of covariance and regression, which are classical concepts of statistics. We argue that Fisher intended his two averages to express a distinction between correlation and causation. On this view, the average effect is a specific weighted average of the actual phenotypic changes that result from physically changing the allelic states of homologous genes. We show that the statistical and causal conceptions of the average effect, perceived as inconsistent by Falconer, can be reconciled if certain relationships between the genotype frequencies and non-additive residuals are conserved. There are certain theory-internal considerations favouring Fisher’s original formulation in terms of causality; for example, the frequency-weighted mean of the average effects equaling zero at each locus becomes a derivable consequence rather than an arbitrary constraint. More broadly, Fisher’s distinction between correlation and causation is of critical importance to gene-trait mapping studies and the foundations of evolutionary biology. PMID:23938113

  13. Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors.

    PubMed

    Burgess, Stephen; Scott, Robert A; Timpson, Nicholas J; Davey Smith, George; Thompson, Simon G

    2015-07-01

    Finding individual-level data for adequately-powered Mendelian randomization analyses may be problematic. As publicly-available summarized data on genetic associations with disease outcomes from large consortia are becoming more abundant, use of published data is an attractive analysis strategy for obtaining precise estimates of the causal effects of risk factors on outcomes. We detail the necessary steps for conducting Mendelian randomization investigations using published data, and present novel statistical methods for combining data on the associations of multiple (correlated or uncorrelated) genetic variants with the risk factor and outcome into a single causal effect estimate. A two-sample analysis strategy may be employed, in which evidence on the gene-risk factor and gene-outcome associations are taken from different data sources. These approaches allow the efficient identification of risk factors that are suitable targets for clinical intervention from published data, although the ability to assess the assumptions necessary for causal inference is diminished. Methods and guidance are illustrated using the example of the causal effect of serum calcium levels on fasting glucose concentrations. The estimated causal effect of a 1 standard deviation (0.13 mmol/L) increase in calcium levels on fasting glucose (mM) using a single lead variant from the CASR gene region is 0.044 (95 % credible interval -0.002, 0.100). In contrast, using our method to account for the correlation between variants, the corresponding estimate using 17 genetic variants is 0.022 (95 % credible interval 0.009, 0.035), a more clearly positive causal effect.

  14. Causal reasoning with forces

    PubMed Central

    Wolff, Phillip; Barbey, Aron K.

    2015-01-01

    Causal composition allows people to generate new causal relations by combining existing causal knowledge. We introduce a new computational model of such reasoning, the force theory, which holds that people compose causal relations by simulating the processes that join forces in the world, and compare this theory with the mental model theory (Khemlani et al., 2014) and the causal model theory (Sloman et al., 2009), which explain causal composition on the basis of mental models and structural equations, respectively. In one experiment, the force theory was uniquely able to account for people's ability to compose causal relationships from complex animations of real-world events. In three additional experiments, the force theory did as well as or better than the other two theories in explaining the causal compositions people generated from linguistically presented causal relations. Implications for causal learning and the hierarchical structure of causal knowledge are discussed. PMID:25653611

  15. A gene-trap strategy identifies quiescence-induced genes in synchronized myoblasts.

    PubMed

    Sambasivan, Ramkumar; Pavlath, Grace K; Dhawan, Jyotsna

    2008-03-01

    Cellular quiescence is characterized not only by reduced mitotic and metabolic activity but also by altered gene expression. Growing evidence suggests that quiescence is not merely a basal state but is regulated by active mechanisms. To understand the molecular programme that governs reversible cell cycle exit, we focused on quiescence-related gene expression in a culture model of myogenic cell arrest and activation. Here we report the identification of quiescence-induced genes using a gene-trap strategy. Using a retroviral vector, we generated a library of gene traps in C2C12 myoblasts that were screened for arrest-induced insertions by live cell sorting (FACS-gal). Several independent gene- trap lines revealed arrest-dependent induction of betagal activity, confirming the efficacy of the FACS screen. The locus of integration was identified in 15 lines. In three lines,insertion occurred in genes previously implicated in the control of quiescence, i.e. EMSY - a BRCA2--interacting protein, p8/com1 - a p300HAT -- binding protein and MLL5 - a SET domain protein. Our results demonstrate that expression of chromatin modulatory genes is induced in G0, providing support to the notion that this reversibly arrested state is actively regulated.

  16. Granger causality revisited

    PubMed Central

    Friston, Karl J.; Bastos, André M.; Oswal, Ashwini; van Wijk, Bernadette; Richter, Craig; Litvak, Vladimir

    2014-01-01

    This technical paper offers a critical re-evaluation of (spectral) Granger causality measures in the analysis of biological timeseries. Using realistic (neural mass) models of coupled neuronal dynamics, we evaluate the robustness of parametric and nonparametric Granger causality. Starting from a broad class of generative (state-space) models of neuronal dynamics, we show how their Volterra kernels prescribe the second-order statistics of their response to random fluctuations; characterised in terms of cross-spectral density, cross-covariance, autoregressive coefficients and directed transfer functions. These quantities in turn specify Granger causality — providing a direct (analytic) link between the parameters of a generative model and the expected Granger causality. We use this link to show that Granger causality measures based upon autoregressive models can become unreliable when the underlying dynamics is dominated by slow (unstable) modes — as quantified by the principal Lyapunov exponent. However, nonparametric measures based on causal spectral factors are robust to dynamical instability. We then demonstrate how both parametric and nonparametric spectral causality measures can become unreliable in the presence of measurement noise. Finally, we show that this problem can be finessed by deriving spectral causality measures from Volterra kernels, estimated using dynamic causal modelling. PMID:25003817

  17. A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes

    PubMed Central

    Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong

    2015-01-01

    In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data. PMID:26201006

  18. A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes.

    PubMed

    Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong

    2015-01-01

    In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data.

  19. Determining causal miRNAs and their signaling cascade in diseases using an influence diffusion model.

    PubMed

    Nalluri, Joseph J; Rana, Pratip; Barh, Debmalya; Azevedo, Vasco; Dinh, Thang N; Vladimirov, Vladimir; Ghosh, Preetam

    2017-08-15

    In recent studies, miRNAs have been found to be extremely influential in many of the essential biological processes. They exhibit a self-regulatory mechanism through which they act as positive/negative regulators of expression of genes and other miRNAs. This has direct implications in the regulation of various pathophysiological conditions, signaling pathways and different types of cancers. Studying miRNA-disease associations has been an extensive area of research; however deciphering miRNA-miRNA network regulatory patterns in several diseases remains a challenge. In this study, we use information diffusion theory to quantify the influence diffusion in a miRNA-miRNA regulation network across multiple disease categories. Our proposed methodology determines the critical disease specific miRNAs which play a causal role in their signaling cascade and hence may regulate disease progression. We extensively validate our framework using existing computational tools from the literature. Furthermore, we implement our framework on a comprehensive miRNA expression data set for alcohol dependence and identify the causal miRNAs for alcohol-dependency in patients which were validated by the phase-shift in their expression scores towards the early stages of the disease. Finally, our computational framework for identifying causal miRNAs implicated in diseases is available as a free online tool for the greater scientific community.

  20. Copy number analysis reveals a novel multiexon deletion of the COLQ gene in congenital myasthenia.

    PubMed

    Wang, Wei; Wu, Yanhong; Wang, Chen; Jiao, Jinsong; Klein, Christopher J

    2016-12-01

    Congenital myasthenic syndrome (CMS) is genetically and clinically heterogeneous. 1 Despite a considerable number of causal genes discovered, many patients are left without a specific diagnosis after genetic testing. The presumption is that novel genes yet to be discovered will account for the majority of such patients. However, it is also possible that we are neglecting a type of genetic variation: copy number changes (>50 bp) as causal for some of these patients. Next-generation sequencing (NGS) can simultaneously screen all known causal genes 2 and is increasingly being validated to have a potential to identify copy number changes. 3 We present a CMS case who did not receive a genetic diagnosis from previous Sanger sequencing, but through a novel copy number analysis algorithm integrated into our targeted NGS panel, we discovered a novel copy number mutation in the COLQ gene and made a genetic diagnosis. This discovery expands the genotype-phenotype correlation of CMS, leads to improved genetic counsel, and allows for specific pharmacologic treatment. 1 .

  1. What Women Think: Cancer Causal Attributions in a Diverse Sample of Women

    PubMed Central

    Rodríguez, Vivian M.; Gyure, Maria E.; Corona, Rosalie; Bodurtha, Joann N.; Bowen, Deborah J.; Quillin, John M.

    2014-01-01

    Women hold diverse beliefs about cancer etiology, potentially affecting their use of cancer preventive behaviors. To date, research has greatly focused on the causal attributions cancer patients and survivors hold about cancer, and studies have been conducted primarily with White participants. Less is known about causal attributions held by women with and without a family history of cancer from a diverse community sample. This study sought to identify cancer causal attributions of women with and without a family history of cancer, and explore its relation to socio-cultural factors. Diverse women (60% African-American) recruited at an urban, safety-net women's health clinic (N=471) reported factors they believed cause cancer. Responses were coded into nine attributions and analyzed using chi-squares and logistic regressions. Lifestyle-choices (63%), genetics/heredity (34%), and environmental-exposures (19%) were the top causal attributions identified. Women without a family history of cancer were more likely to identify genetics/heredity as an attribution for cancer than women with a history of cancer in their families. Women who identified as White, who had a higher educational attainment, and had commercial insurance were more likely to report genetics/heredity as a causal attribution for cancer. These findings suggest that socio-cultural factors may play a role in the causal attributions individuals make about cancer, which can, in turn, inform cancer awareness and prevention messages. PMID:25398057

  2. Granger Causality Testing with Intensive Longitudinal Data.

    PubMed

    Molenaar, Peter C M

    2018-06-01

    The availability of intensive longitudinal data obtained by means of ambulatory assessment opens up new prospects for prevention research in that it allows the derivation of subject-specific dynamic networks of interacting variables by means of vector autoregressive (VAR) modeling. The dynamic networks thus obtained can be subjected to Granger causality testing in order to identify causal relations among the observed time-dependent variables. VARs have two equivalent representations: standard and structural. Results obtained with Granger causality testing depend upon which representation is chosen, yet no criteria exist on which this important choice can be based. A new equivalent representation is introduced called hybrid VARs with which the best representation can be chosen in a data-driven way. Partial directed coherence, a frequency-domain statistic for Granger causality testing, is shown to perform optimally when based on hybrid VARs. An application to real data is provided.

  3. Discovering graphical Granger causality using the truncating lasso penalty

    PubMed Central

    Shojaie, Ali; Michailidis, George

    2010-01-01

    Motivation: Components of biological systems interact with each other in order to carry out vital cell functions. Such information can be used to improve estimation and inference, and to obtain better insights into the underlying cellular mechanisms. Discovering regulatory interactions among genes is therefore an important problem in systems biology. Whole-genome expression data over time provides an opportunity to determine how the expression levels of genes are affected by changes in transcription levels of other genes, and can therefore be used to discover regulatory interactions among genes. Results: In this article, we propose a novel penalization method, called truncating lasso, for estimation of causal relationships from time-course gene expression data. The proposed penalty can correctly determine the order of the underlying time series, and improves the performance of the lasso-type estimators. Moreover, the resulting estimate provides information on the time lag between activation of transcription factors and their effects on regulated genes. We provide an efficient algorithm for estimation of model parameters, and show that the proposed method can consistently discover causal relationships in the large p, small n setting. The performance of the proposed model is evaluated favorably in simulated, as well as real, data examples. Availability: The proposed truncating lasso method is implemented in the R-package ‘grangerTlasso’ and is freely available at http://www.stat.lsa.umich.edu/∼shojaie/ Contact: shojaie@umich.edu PMID:20823316

  4. Inferring action structure and causal relationships in continuous sequences of human action.

    PubMed

    Buchsbaum, Daphna; Griffiths, Thomas L; Plunkett, Dillon; Gopnik, Alison; Baldwin, Dare

    2015-02-01

    In the real world, causal variables do not come pre-identified or occur in isolation, but instead are embedded within a continuous temporal stream of events. A challenge faced by both human learners and machine learning algorithms is identifying subsequences that correspond to the appropriate variables for causal inference. A specific instance of this problem is action segmentation: dividing a sequence of observed behavior into meaningful actions, and determining which of those actions lead to effects in the world. Here we present a Bayesian analysis of how statistical and causal cues to segmentation should optimally be combined, as well as four experiments investigating human action segmentation and causal inference. We find that both people and our model are sensitive to statistical regularities and causal structure in continuous action, and are able to combine these sources of information in order to correctly infer both causal relationships and segmentation boundaries. Copyright © 2014. Published by Elsevier Inc.

  5. Localization of causal locus in the genome of the brown macroalga Ectocarpus: NGS-based mapping and positional cloning approaches

    PubMed Central

    Billoud, Bernard; Jouanno, Émilie; Nehr, Zofia; Carton, Baptiste; Rolland, Élodie; Chenivesse, Sabine; Charrier, Bénédicte

    2015-01-01

    Mutagenesis is the only process by which unpredicted biological gene function can be identified. Despite that several macroalgal developmental mutants have been generated, their causal mutation was never identified, because experimental conditions were not gathered at that time. Today, progresses in macroalgal genomics and judicious choices of suitable genetic models make mutated gene identification possible. This article presents a comparative study of two methods aiming at identifying a genetic locus in the brown alga Ectocarpus siliculosus: positional cloning and Next-Generation Sequencing (NGS)-based mapping. Once necessary preliminary experimental tools were gathered, we tested both analyses on an Ectocarpus morphogenetic mutant. We show how a narrower localization results from the combination of the two methods. Advantages and drawbacks of these two approaches as well as potential transfer to other macroalgae are discussed. PMID:25745426

  6. The cradle of causal reasoning: newborns' preference for physical causality.

    PubMed

    Mascalzoni, Elena; Regolin, Lucia; Vallortigara, Giorgio; Simion, Francesca

    2013-05-01

    Perception of mechanical (i.e. physical) causality, in terms of a cause-effect relationship between two motion events, appears to be a powerful mechanism in our daily experience. In spite of a growing interest in the earliest causal representations, the role of experience in the origin of this sensitivity is still a matter of dispute. Here, we asked the question about the innate origin of causal perception, never tested before at birth. Three experiments were carried out to investigate sensitivity at birth to some visual spatiotemporal cues present in a launching event. Newborn babies, only a few hours old, showed that they significantly preferred a physical causality event (i.e. Michotte's Launching effect) when matched to a delay event (i.e. a delayed launching; Experiment 1) or to a non-causal event completely identical to the causal one except for the order of the displacements of the two objects involved which was swapped temporally (Experiment 3). This preference for the launching event, moreover, also depended on the continuity of the trajectory between the objects involved in the event (Experiment 2). These results support the hypothesis that the human system possesses an early available, possibly innate basic mechanism to compute causality, such a mechanism being sensitive to the additive effect of certain well-defined spatiotemporal cues present in the causal event independently of any prior visual experience. © 2013 Blackwell Publishing Ltd.

  7. Cause and Event: Supporting Causal Claims through Logistic Models

    ERIC Educational Resources Information Center

    O'Connell, Ann A.; Gray, DeLeon L.

    2011-01-01

    Efforts to identify and support credible causal claims have received intense interest in the research community, particularly over the past few decades. In this paper, we focus on the use of statistical procedures designed to support causal claims for a treatment or intervention when the response variable of interest is dichotomous. We identify…

  8. Detecting causal drivers and empirical prediction of the Indian Summer Monsoon

    NASA Astrophysics Data System (ADS)

    Di Capua, G.; Vellore, R.; Raghavan, K.; Coumou, D.

    2017-12-01

    The Indian summer monsoon (ISM) is crucial for the economy, society and natural ecosystems on the Indian peninsula. Predict the total seasonal rainfall at several months lead time would help to plan effective water management strategies, improve flood or drought protection programs and prevent humanitarian crisis. However, the complexity and strong internal variability of the ISM circulation system make skillful seasonal forecasting challenging. Moreover, to adequately identify the low-frequency, and far-away processes which influence ISM behavior novel tools are needed. We applied a Response-Guided Causal Precursor Detection (RGCPD) scheme, which is a novel empirical prediction method which unites a response-guided community detection scheme with a causal discovery algorithm (CEN). These tool allow us to assess causal pathways between different components of the ISM circulation system and with far-away regions in the tropics, mid-latitudes or Arctic. The scheme has successfully been used to identify causal precursors of the Stratospheric polar vortex enabling skillful predictions at (sub) seasonal timescales (Kretschmer et al. 2016, J.Clim., Kretschmer et al. 2017, GRL). We analyze observed ISM monthly rainfall over the monsoon trough region. Applying causal discovery techniques, we identify several causal precursor communities in the fields of 2m-temperature, sea level pressure and snow depth over Eurasia. Specifically, our results suggest that surface temperature conditions in both tropical and Arctic regions contribute to ISM variability. A linear regression prediction model based on the identified set of communities has good hindcasting skills with 4-5 months lead times. Further we separate El Nino, La Nina and ENSO-neutral years from each other and find that the causal precursors are different dependent on ENSO state. The ENSO-state dependent causal precursors give even higher skill, especially for La Nina years when the ISM is relatively strong. These

  9. Spot the difference: Causal contrasts in scientific diagrams.

    PubMed

    Scholl, Raphael

    2016-12-01

    An important function of scientific diagrams is to identify causal relationships. This commonly relies on contrasts that highlight the effects of specific difference-makers. However, causal contrast diagrams are not an obvious and easy to recognize category because they appear in many guises. In this paper, four case studies are presented to examine how causal contrast diagrams appear in a wide range of scientific reports, from experimental to observational and even purely theoretical studies. It is shown that causal contrasts can be expressed in starkly different formats, including photographs of complexly visualized macromolecules as well as line graphs, bar graphs, or plots of state spaces. Despite surface differences, however, there is a measure of conceptual unity among such diagrams. In empirical studies they often serve not only to infer and communicate specific causal claims, but also as evidence for them. The key data of some studies is given nowhere except in the diagrams. Many diagrams show multiple causal contrasts in order to demonstrate both that an effect exists and that the effect is specific - that is, to narrowly circumscribe the phenomenon to be explained. In a large range of scientific reports, causal contrast diagrams reflect the core epistemic claims of the researchers. Copyright © 2016. Published by Elsevier Ltd.

  10. Recursive causality in evolution: a model for epigenetic mechanisms in cancer development.

    PubMed

    Haslberger, A; Varga, F; Karlic, H

    2006-01-01

    Interactions between adaptative and selective processes are illustrated in the model of recursive causality as defined in Rupert Riedl's systems theory of evolution. One of the main features of this theory also termed as theory of evolving complexity is the centrality of the notion of 'recursive' or 'feedback' causality - 'the idea that every biological effect in living systems, in some way, feeds back to its own cause'. Our hypothesis is that "recursive" or "feedback" causality provides a model for explaining the consequences of interacting genetic and epigenetic mechanisms which are known to play a key role in development of cancer. Epigenetics includes any process that alters gene activity without changes of the DNA sequence. The most important epigenetic mechanisms are DNA-methylation and chromatin remodeling. Hypomethylation of so-called oncogenes and hypermethylation of tumor suppressor genes appear to be critical determinants of cancer. Folic acid, vitamin B12 and other nutrients influence the function of enzymes that participate in various methylation processes by affecting the supply of methyl groups into a variety of molecules which may be directly or indirectly associated with cancerogenesis. We present an example from our own studies by showing that vitamin D3 has the potential to de-methylate the osteocalcin-promoter in MG63 osteosarcoma cells. Consequently, a stimulation of osteocalcin synthesis can be observed. The above mentioned enzymes also play a role in development and differentiation of cells and organisms and thus illustrate the close association between evolutionary and developmental mechanisms. This enabled new ways to understand the interaction between the genome and environment and may improve biomedical concepts including environmental health aspects where epigenetic and genetic modifications are closely associated. Recent observations showed that methylated nucleotides in the gene promoter may serve as a target for solar UV

  11. Normalizing the causality between time series.

    PubMed

    Liang, X San

    2015-08-01

    Recently, a rigorous yet concise formula was derived to evaluate information flow, and hence the causality in a quantitative sense, between time series. To assess the importance of a resulting causality, it needs to be normalized. The normalization is achieved through distinguishing a Lyapunov exponent-like, one-dimensional phase-space stretching rate and a noise-to-signal ratio from the rate of information flow in the balance of the marginal entropy evolution of the flow recipient. It is verified with autoregressive models and applied to a real financial analysis problem. An unusually strong one-way causality is identified from IBM (International Business Machines Corporation) to GE (General Electric Company) in their early era, revealing to us an old story, which has almost faded into oblivion, about "Seven Dwarfs" competing with a giant for the mainframe computer market.

  12. Normalizing the causality between time series

    NASA Astrophysics Data System (ADS)

    Liang, X. San

    2015-08-01

    Recently, a rigorous yet concise formula was derived to evaluate information flow, and hence the causality in a quantitative sense, between time series. To assess the importance of a resulting causality, it needs to be normalized. The normalization is achieved through distinguishing a Lyapunov exponent-like, one-dimensional phase-space stretching rate and a noise-to-signal ratio from the rate of information flow in the balance of the marginal entropy evolution of the flow recipient. It is verified with autoregressive models and applied to a real financial analysis problem. An unusually strong one-way causality is identified from IBM (International Business Machines Corporation) to GE (General Electric Company) in their early era, revealing to us an old story, which has almost faded into oblivion, about "Seven Dwarfs" competing with a giant for the mainframe computer market.

  13. Gene-Trap Mutagenesis Identifies Mammalian Genes Contributing to Intoxication by Clostridium perfringens ε-Toxin

    PubMed Central

    Ivie, Susan E.; Fennessey, Christine M.; Sheng, Jinsong; Rubin, Donald H.; McClain, Mark S.

    2011-01-01

    The Clostridium perfringens ε-toxin is an extremely potent toxin associated with lethal toxemias in domesticated ruminants and may be toxic to humans. Intoxication results in fluid accumulation in various tissues, most notably in the brain and kidneys. Previous studies suggest that the toxin is a pore-forming toxin, leading to dysregulated ion homeostasis and ultimately cell death. However, mammalian host factors that likely contribute to ε-toxin-induced cytotoxicity are poorly understood. A library of insertional mutant Madin Darby canine kidney (MDCK) cells, which are highly susceptible to the lethal affects of ε-toxin, was used to select clones of cells resistant to ε-toxin-induced cytotoxicity. The genes mutated in 9 surviving resistant cell clones were identified. We focused additional experiments on one of the identified genes as a means of validating the experimental approach. Gene expression microarray analysis revealed that one of the identified genes, hepatitis A virus cellular receptor 1 (HAVCR1, KIM-1, TIM1), is more abundantly expressed in human kidney cell lines than it is expressed in human cells known to be resistant to ε-toxin. One human kidney cell line, ACHN, was found to be sensitive to the toxin and expresses a larger isoform of the HAVCR1 protein than the HAVCR1 protein expressed by other, toxin-resistant human kidney cell lines. RNA interference studies in MDCK and in ACHN cells confirmed that HAVCR1 contributes to ε-toxin-induced cytotoxicity. Additionally, ε-toxin was shown to bind to HAVCR1 in vitro. The results of this study indicate that HAVCR1 and the other genes identified through the use of gene-trap mutagenesis and RNA interference strategies represent important targets for investigation of the process by which ε-toxin induces cell death and new targets for potential therapeutic intervention. PMID:21412435

  14. Gene-trap mutagenesis identifies mammalian genes contributing to intoxication by Clostridium perfringens ε-toxin.

    PubMed

    Ivie, Susan E; Fennessey, Christine M; Sheng, Jinsong; Rubin, Donald H; McClain, Mark S

    2011-03-11

    The Clostridium perfringens ε-toxin is an extremely potent toxin associated with lethal toxemias in domesticated ruminants and may be toxic to humans. Intoxication results in fluid accumulation in various tissues, most notably in the brain and kidneys. Previous studies suggest that the toxin is a pore-forming toxin, leading to dysregulated ion homeostasis and ultimately cell death. However, mammalian host factors that likely contribute to ε-toxin-induced cytotoxicity are poorly understood. A library of insertional mutant Madin Darby canine kidney (MDCK) cells, which are highly susceptible to the lethal affects of ε-toxin, was used to select clones of cells resistant to ε-toxin-induced cytotoxicity. The genes mutated in 9 surviving resistant cell clones were identified. We focused additional experiments on one of the identified genes as a means of validating the experimental approach. Gene expression microarray analysis revealed that one of the identified genes, hepatitis A virus cellular receptor 1 (HAVCR1, KIM-1, TIM1), is more abundantly expressed in human kidney cell lines than it is expressed in human cells known to be resistant to ε-toxin. One human kidney cell line, ACHN, was found to be sensitive to the toxin and expresses a larger isoform of the HAVCR1 protein than the HAVCR1 protein expressed by other, toxin-resistant human kidney cell lines. RNA interference studies in MDCK and in ACHN cells confirmed that HAVCR1 contributes to ε-toxin-induced cytotoxicity. Additionally, ε-toxin was shown to bind to HAVCR1 in vitro. The results of this study indicate that HAVCR1 and the other genes identified through the use of gene-trap mutagenesis and RNA interference strategies represent important targets for investigation of the process by which ε-toxin induces cell death and new targets for potential therapeutic intervention.

  15. Efficient and accurate causal inference with hidden confounders from genome-transcriptome variation data

    PubMed Central

    2017-01-01

    Mapping gene expression as a quantitative trait using whole genome-sequencing and transcriptome analysis allows to discover the functional consequences of genetic variation. We developed a novel method and ultra-fast software Findr for higly accurate causal inference between gene expression traits using cis-regulatory DNA variations as causal anchors, which improves current methods by taking into consideration hidden confounders and weak regulations. Findr outperformed existing methods on the DREAM5 Systems Genetics challenge and on the prediction of microRNA and transcription factor targets in human lymphoblastoid cells, while being nearly a million times faster. Findr is publicly available at https://github.com/lingfeiwang/findr. PMID:28821014

  16. A Penalized Robust Method for Identifying Gene-Environment Interactions

    PubMed Central

    Shi, Xingjie; Liu, Jin; Huang, Jian; Zhou, Yong; Xie, Yang; Ma, Shuangge

    2015-01-01

    In high-throughput studies, an important objective is to identify gene-environment interactions associated with disease outcomes and phenotypes. Many commonly adopted methods assume specific parametric or semiparametric models, which may be subject to model mis-specification. In addition, they usually use significance level as the criterion for selecting important interactions. In this study, we adopt the rank-based estimation, which is much less sensitive to model specification than some of the existing methods and includes several commonly encountered data and models as special cases. Penalization is adopted for the identification of gene-environment interactions. It achieves simultaneous estimation and identification and does not rely on significance level. For computation feasibility, a smoothed rank estimation is further proposed. Simulation shows that under certain scenarios, for example with contaminated or heavy-tailed data, the proposed method can significantly outperform the existing alternatives with more accurate identification. We analyze a lung cancer prognosis study with gene expression measurements under the AFT (accelerated failure time) model. The proposed method identifies interactions different from those using the alternatives. Some of the identified genes have important implications. PMID:24616063

  17. Sensory Impairments and Autism: A Re-Examination of Causal Modelling

    ERIC Educational Resources Information Center

    Gerrard, Sue; Rugg, Gordon

    2009-01-01

    Sensory impairments are widely reported in autism, but remain largely unexplained by existing models. This article examines Kanner's causal reasoning and identifies unsupported assumptions implicit in later empirical work. Our analysis supports a heterogeneous causal model for autistic characteristics. We propose that the development of a…

  18. Predicting hepatocellular carcinoma through cross-talk genes identified by risk pathways

    PubMed Central

    Shao, Zhuo; Huo, Diwei; Zhang, Denan; Xie, Hongbo; Yang, Jingbo; Liu, Qiuqi; Chen, Xiujie

    2018-01-01

    Hepatocellular carcinoma (HCC) is the most frequent type of liver cancer with poor survival rate and high mortality. Despite efforts on the mechanism of HCC, new molecular markers are needed for exact diagnosis, evaluation and treatment. Here, we combined transcriptome of HCC with networks and pathways to identify reliable molecular markers. Through integrating 249 differentially expressed genes with syncretic protein interaction networks, we constructed a HCC-specific network, from which we further extracted 480 pivotal genes. Based on the cross-talk between the enriched pathways of the pivotal genes, we finally identified a HCC signature of 45 genes, which could accurately distinguish HCC patients with normal individuals and reveal the prognosis of HCC patients. Among these 45 genes, 15 showed dysregulated expression patterns and a part have been reported to be associated with HCC and/or other cancers. These findings suggested that our identified 45 gene signature could be potential and valuable molecular markers for diagnosis and evaluation of HCC. PMID:29765536

  19. Inferring Gene Family Histories in Yeast Identifies Lineage Specific Expansions

    PubMed Central

    Ames, Ryan M.; Money, Daniel; Lovell, Simon C.

    2014-01-01

    The complement of genes found in the genome is a balance between gene gain and gene loss. Knowledge of the specific genes that are gained and lost over evolutionary time allows an understanding of the evolution of biological functions. Here we use new evolutionary models to infer gene family histories across complete yeast genomes; these models allow us to estimate the relative genome-wide rates of gene birth, death, innovation and extinction (loss of an entire family) for the first time. We show that the rates of gene family evolution vary both between gene families and between species. We are also able to identify those families that have experienced rapid lineage specific expansion/contraction and show that these families are enriched for specific functions. Moreover, we find that families with specific functions are repeatedly expanded in multiple species, suggesting the presence of common adaptations and that these family expansions/contractions are not random. Additionally, we identify potential specialisations, unique to specific species, in the functions of lineage specific expanded families. These results suggest that an important mechanism in the evolution of genome content is the presence of lineage-specific gene family changes. PMID:24921666

  20. Repeated causal decision making.

    PubMed

    Hagmayer, York; Meder, Björn

    2013-01-01

    Many of our decisions refer to actions that have a causal impact on the external environment. Such actions may not only allow for the mere learning of expected values or utilities but also for acquiring knowledge about the causal structure of our world. We used a repeated decision-making paradigm to examine what kind of knowledge people acquire in such situations and how they use their knowledge to adapt to changes in the decision context. Our studies show that decision makers' behavior is strongly contingent on their causal beliefs and that people exploit their causal knowledge to assess the consequences of changes in the decision problem. A high consistency between hypotheses about causal structure, causally expected values, and actual choices was observed. The experiments show that (a) existing causal hypotheses guide the interpretation of decision feedback, (b) consequences of decisions are used to revise existing causal beliefs, and (c) decision makers use the experienced feedback to induce a causal model of the choice situation even when they have no initial causal hypotheses, which (d) enables them to adapt their choices to changes of the decision problem. (PsycINFO Database Record (c) 2013 APA, all rights reserved).

  1. Mapping of Gene Expression Reveals CYP27A1 as a Susceptibility Gene for Sporadic ALS

    PubMed Central

    van Rheenen, Wouter; Franke, Lude; Jansen, Ritsert C.; van Es, Michael A.; van Vught, Paul W. J.; Blauw, Hylke M.; Groen, Ewout J. N.; Horvath, Steve; Estrada, Karol; Rivadeneira, Fernando; Hofman, Albert; Uitterlinden, Andre G.; Robberecht, Wim; Andersen, Peter M.; Melki, Judith; Meininger, Vincent; Hardiman, Orla; Landers, John E.; Brown, Robert H.; Shatunov, Aleksey; Shaw, Christopher E.; Leigh, P. Nigel; Al-Chalabi, Ammar; Ophoff, Roel A.

    2012-01-01

    Amyotrophic lateral sclerosis (ALS) is a progressive, neurodegenerative disease characterized by loss of upper and lower motor neurons. ALS is considered to be a complex trait and genome-wide association studies (GWAS) have implicated a few susceptibility loci. However, many more causal loci remain to be discovered. Since it has been shown that genetic variants associated with complex traits are more likely to be eQTLs than frequency-matched variants from GWAS platforms, we conducted a two-stage genome-wide screening for eQTLs associated with ALS. In addition, we applied an eQTL analysis to finemap association loci. Expression profiles using peripheral blood of 323 sporadic ALS patients and 413 controls were mapped to genome-wide genotyping data. Subsequently, data from a two-stage GWAS (3,568 patients and 10,163 controls) were used to prioritize eQTLs identified in the first stage (162 ALS, 207 controls). These prioritized eQTLs were carried forward to the second sample with both gene-expression and genotyping data (161 ALS, 206 controls). Replicated eQTL SNPs were then tested for association in the second-stage GWAS data to find SNPs associated with disease, that survived correction for multiple testing. We thus identified twelve cis eQTLs with nominally significant associations in the second-stage GWAS data. Eight SNP-transcript pairs of highest significance (lowest p = 1.27×10−51) withstood multiple-testing correction in the second stage and modulated CYP27A1 gene expression. Additionally, we show that C9orf72 appears to be the only gene in the 9p21.2 locus that is regulated in cis, showing the potential of this approach in identifying causative genes in association loci in ALS. This study has identified candidate genes for sporadic ALS, most notably CYP27A1. Mutations in CYP27A1 are causal to cerebrotendinous xanthomatosis which can present as a clinical mimic of ALS with progressive upper motor neuron loss, making it a plausible susceptibility gene for

  2. The genetics of alcoholism: identifying specific genes through family studies.

    PubMed

    Edenberg, Howard J; Foroud, Tatiana

    2006-09-01

    Alcoholism is a complex disorder with both genetic and environmental risk factors. Studies in humans have begun to elucidate the genetic underpinnings of the risk for alcoholism. Here we briefly review strategies for identifying individual genes in which variations affect the risk for alcoholism and related phenotypes, in the context of one large study that has successfully identified such genes. The Collaborative Study on the Genetics of Alcoholism (COGA) is a family-based study that has collected detailed phenotypic data on individuals in families with multiple alcoholic members. A genome-wide linkage approach led to the identification of chromosomal regions containing genes that influenced alcoholism risk and related phenotypes. Subsequently, single nucleotide polymorphisms (SNPs) were genotyped in positional candidate genes located within the linked chromosomal regions, and analyzed for association with these phenotypes. Using this sequential approach, COGA has detected association with GABRA2, CHRM2 and ADH4; these associations have all been replicated by other researchers. COGA has detected association to additional genes including GABRG3, TAS2R16, SNCA, OPRK1 and PDYN, results that are awaiting confirmation. These successes demonstrate that genes contributing to the risk for alcoholism can be reliably identified using human subjects.

  3. Gene expression patterns combined with bioinformatics analysis identify genes associated with cholangiocarcinoma.

    PubMed

    Li, Chen; Shen, Weixing; Shen, Sheng; Ai, Zhilong

    2013-12-01

    To explore the molecular mechanisms of cholangiocarcinoma (CC), microarray technology was used to find biomarkers for early detection and diagnosis. The gene expression profiles from 6 patients with CC and 5 normal controls were downloaded from Gene Expression Omnibus and compared. As a result, 204 differentially co-expressed genes (DCGs) in CC patients compared to normal controls were identified using a computational bioinformatics analysis. These genes were mainly involved in coenzyme metabolic process, peptidase activity and oxidation reduction. A regulatory network was constructed by mapping the DCGs to known regulation data. Four transcription factors, FOXC1, ZIC2, NKX2-2 and GCGR, were hub nodes in the network. In conclusion, this study provides a set of targets useful for future investigations into molecular biomarker studies. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. Causality as a Rigorous Notion and Quantitative Causality Analysis with Time Series

    NASA Astrophysics Data System (ADS)

    Liang, X. S.

    2017-12-01

    Given two time series, can one faithfully tell, in a rigorous and quantitative way, the cause and effect between them? Here we show that this important and challenging question (one of the major challenges in the science of big data), which is of interest in a wide variety of disciplines, has a positive answer. Particularly, for linear systems, the maximal likelihood estimator of the causality from a series X2 to another series X1, written T2→1, turns out to be concise in form: T2→1 = [C11 C12 C2,d1 — C112 C1,d1] / [C112 C22 — C11C122] where Cij (i,j=1,2) is the sample covariance between Xi and Xj, and Ci,dj the covariance between Xi and ΔXj/Δt, the difference approximation of dXj/dt using the Euler forward scheme. An immediate corollary is that causation implies correlation, but not vice versa, resolving the long-standing debate over causation versus correlation. The above formula has been validated with touchstone series purportedly generated with one-way causality that evades the classical approaches such as Granger causality test and transfer entropy analysis. It has also been applied successfully to the investigation of many real problems. Through a simple analysis with the stock series of IBM and GE, an unusually strong one-way causality is identified from the former to the latter in their early era, revealing to us an old story, which has almost faded into oblivion, about "Seven Dwarfs" competing with a "Giant" for the computer market. Another example presented here regards the cause-effect relation between the two climate modes, El Niño and Indian Ocean Dipole (IOD). In general, these modes are mutually causal, but the causality is asymmetric. To El Niño, the information flowing from IOD manifests itself as a propagation of uncertainty from the Indian Ocean. In the third example, an unambiguous one-way causality is found between CO2 and the global mean temperature anomaly. While it is confirmed that CO2 indeed drives the recent global warming

  5. DEVELOPMENT PLAN FOR THE CAUSAL ANALYSIS ...

    EPA Pesticide Factsheets

    The Causal Analysis/Diagnosis Decision Information System (CADDIS) is a web-based system that provides technical support for states, tribes and other users of the Office of Water's Stressor Identification Guidance. The Stressor Identification Guidance provides a rigorous and scientifically defensible method for determining the causes of biological impairments of aquatic ecosystems. It is being used by states as part of the TMDL process and is being applied to other impaired ecosystems such as Superfund sites. However, because of the complexity of causal relationships in ecosystems, and because the guidance includes a strength-of-evidence analysis which uses multiple causal considerations, the process is complex and information intensive. CADDIS helps users deal with that inherent complexity. Increasingly, the regulatory, remedial, and restoration actions taken to manage impaired environments are based on measurement and analysis of the biotic community. When an aquatic assemblage has been identified as impaired, an accurate and defensible assessment of the cause can help ensure that appropriate actions are taken. The U.S. EPA's Stressor Identification Guidance describes a methodology for identifying the most likely causes of observed impairments in aquatic systems. Stressor identification requires extensive knowledge of the mechanisms, symptoms, and stressor-response relationships for various specific stressors as well as the ability to use that knowledge in a

  6. A Causal Model of Sentence Recall: Effects of Familiarity, Concreteness, Comprehensibility, and Interestingness.

    ERIC Educational Resources Information Center

    Sadoski, Mark; And Others

    1993-01-01

    Presents and tests a theoretically derived causal model of the recall of sentences. Notes that the causal model identifies familiarity and concreteness as causes of comprehensibility; familiarity, concreteness, and comprehensibility as causes of interestingness; and all the identified variables as causes of both immediate and delayed recall.…

  7. Relativistic causality

    NASA Astrophysics Data System (ADS)

    Valente, Giovanni; Owen Weatherall, James

    2014-11-01

    Relativity theory is often taken to include, or to imply, a prohibition on superluminal propagation of causal processes. Yet, what exactly the prohibition on superluminal propagation amounts to and how one should deal with its possible violation have remained open philosophical problems, both in the context of the metaphysics of causation and the foundations of physics. In particular, recent work in philosophy of physics has focused on the causal structure of spacetime in relativity theory and on how this causal structure manifests itself in our most fundamental theories of matter. These topics were the subject of a workshop on "Relativistic Causality in Quantum Field Theory and General Relativity" that we organized (along with John Earman) at the Center for Philosophy of Science in Pittsburgh on April 5-7, 2013. The present Special Issue comprises contributions by speakers in that workshop as well as several other experts exploring different aspects of relativistic causality. We are grateful to the journal for hosting this Special Issue, to the journal's managing editor, Femke Kuiling, for her help and support in putting the issue together, and to the authors and the referees for their excellent work.

  8. Encoding dependence in Bayesian causal networks

    USDA-ARS?s Scientific Manuscript database

    Bayesian networks (BNs) represent complex, uncertain spatio-temporal dynamics by propagation of conditional probabilities between identifiable states with a testable causal interaction model. Typically, they assume random variables are discrete in time and space with a static network structure that ...

  9. Identifying conserved gene clusters in the presence of homology families.

    PubMed

    He, Xin; Goldwasser, Michael H

    2005-01-01

    The study of conserved gene clusters is important for understanding the forces behind genome organization and evolution, as well as the function of individual genes or gene groups. In this paper, we present a new model and algorithm for identifying conserved gene clusters from pairwise genome comparison. This generalizes a recent model called "gene teams." A gene team is a set of genes that appear homologously in two or more species, possibly in a different order yet with the distance of adjacent genes in the team for each chromosome always no more than a certain threshold. We remove the constraint in the original model that each gene must have a unique occurrence in each chromosome and thus allow the analysis on complex prokaryotic or eukaryotic genomes with extensive paralogs. Our algorithm analyzes a pair of chromosomes in O(mn) time and uses O(m+n) space, where m and n are the number of genes in the respective chromosomes. We demonstrate the utility of our methods by studying two bacterial genomes, E. coli K-12 and B. subtilis. Many of the teams identified by our algorithm correlate with documented E. coli operons, while several others match predicted operons, previously suggested by computational techniques. Our implementation and data are publicly available at euler.slu.edu/ approximately goldwasser/homologyteams/.

  10. Causality re-established.

    PubMed

    D'Ariano, Giacomo Mauro

    2018-07-13

    Causality has never gained the status of a 'law' or 'principle' in physics. Some recent literature has even popularized the false idea that causality is a notion that should be banned from theory. Such misconception relies on an alleged universality of the reversibility of the laws of physics, based either on the determinism of classical theory, or on the multiverse interpretation of quantum theory, in both cases motivated by mere interpretational requirements for realism of the theory. Here, I will show that a properly defined unambiguous notion of causality is a theorem of quantum theory, which is also a falsifiable proposition of the theory. Such a notion of causality appeared in the literature within the framework of operational probabilistic theories. It is a genuinely theoretical notion, corresponding to establishing a definite partial order among events, in the same way as we do by using the future causal cone on Minkowski space. The notion of causality is logically completely independent of the misidentified concept of 'determinism', and, being a consequence of quantum theory, is ubiquitous in physics. In addition, as classical theory can be regarded as a restriction of quantum theory, causality holds also in the classical case, although the determinism of the theory trivializes it. I then conclude by arguing that causality naturally establishes an arrow of time. This implies that the scenario of the 'block Universe' and the connected 'past hypothesis' are incompatible with causality, and thus with quantum theory: they are both doomed to remain mere interpretations and, as such, are not falsifiable, similar to the hypothesis of 'super-determinism'.This article is part of a discussion meeting issue 'Foundations of quantum mechanics and their impact on contemporary society'. © 2018 The Author(s).

  11. Fine-mapping the human leukocyte antigen locus in rheumatoid arthritis and other rheumatic diseases: identifying causal amino acid variants?

    PubMed

    van Heemst, Jurgen; Huizinga, Tom J W; van der Woude, Diane; Toes, René E M

    2015-05-01

    To provide an update on and the context of the recent findings obtained with novel statistical methods on the association of the human leukocyte antigen (HLA) locus with rheumatic diseases. Novel single nucleotide polymorphism fine-mapping data obtained for the HLA locus have indicated the strongest association with amino acid positions 11 and 13 of HLA-DRB1 molecule for several rheumatic diseases. On the basis of these data, a dominant role for position 11/13 in driving the association with these diseases is proposed and the identification of causal variants in the HLA region in relation to disease susceptibility implicated. The HLA class II locus is the most important risk factor for several rheumatic diseases. Recently, new statistical approaches have identified previously unrecognized amino acid positions in the HLA-DR molecule that associate with anticitrullinated protein antibody-negative and anticitrullinated protein antibody-positive rheumatoid arthritis. Likewise, similar findings have been made for other rheumatic conditions such as giant-cell arteritis and systemic lupus erythematosus. Interestingly, all these studies point toward an association with the same amino acid positions: amino acid positions 11 and 13 of the HLA-DR β chain. As both these positions influence peptide binding by HLA-DR and have been implicated in antigen presentation, the novel fine-mapping approach is proposed to map causal variants in the HLA region relevant to rheumatoid arthritis and several rheumatic diseases. If these interpretations are correct, they would direct the biological research aiming to address the explanation for the HLA-disease association. Here, we provide an overview of the recent findings and evidence from literature that, although relevant new insights have been obtained on HLA-disease associations, the interpretation of the biological role of these amino acids as causal variants explaining that such associations should be taken with caution.

  12. Learning to learn causal models.

    PubMed

    Kemp, Charles; Goodman, Noah D; Tenenbaum, Joshua B

    2010-09-01

    Learning to understand a single causal system can be an achievement, but humans must learn about multiple causal systems over the course of a lifetime. We present a hierarchical Bayesian framework that helps to explain how learning about several causal systems can accelerate learning about systems that are subsequently encountered. Given experience with a set of objects, our framework learns a causal model for each object and a causal schema that captures commonalities among these causal models. The schema organizes the objects into categories and specifies the causal powers and characteristic features of these categories and the characteristic causal interactions between categories. A schema of this kind allows causal models for subsequent objects to be rapidly learned, and we explore this accelerated learning in four experiments. Our results confirm that humans learn rapidly about the causal powers of novel objects, and we show that our framework accounts better for our data than alternative models of causal learning. Copyright © 2010 Cognitive Science Society, Inc.

  13. Disease causality extraction based on lexical semantics and document-clause frequency from biomedical literature.

    PubMed

    Lee, Dong-Gi; Shin, Hyunjung

    2017-05-18

    Recently, research on human disease network has succeeded and has become an aid in figuring out the relationship between various diseases. In most disease networks, however, the relationship between diseases has been simply represented as an association. This representation results in the difficulty of identifying prior diseases and their influence on posterior diseases. In this paper, we propose a causal disease network that implements disease causality through text mining on biomedical literature. To identify the causality between diseases, the proposed method includes two schemes: the first is the lexicon-based causality term strength, which provides the causal strength on a variety of causality terms based on lexicon analysis. The second is the frequency-based causality strength, which determines the direction and strength of causality based on document and clause frequencies in the literature. We applied the proposed method to 6,617,833 PubMed literature, and chose 195 diseases to construct a causal disease network. From all possible pairs of disease nodes in the network, 1011 causal pairs of 149 diseases were extracted. The resulting network was compared with that of a previous study. In terms of both coverage and quality, the proposed method showed outperforming results; it determined 2.7 times more causalities and showed higher correlation with associated diseases than the existing method. This research has novelty in which the proposed method circumvents the limitations of time and cost in applying all possible causalities in biological experiments and it is a more advanced text mining technique by defining the concepts of causality term strength.

  14. Gene Signature in Sessile Serrated Polyps Identifies Colon Cancer Subtype

    PubMed Central

    Kanth, Priyanka; Bronner, Mary P.; Boucher, Kenneth M.; Burt, Randall W.; Neklason, Deborah W.; Hagedorn, Curt H.; Delker, Don A.

    2016-01-01

    Sessile serrated colon adenoma/polyps (SSA/Ps) are found during routine screening colonoscopy and may account for 20–30% of colon cancers. However, differentiating SSA/Ps from hyperplastic polyps (HP) with little risk of cancer is challenging and complementary molecular markers are needed. Additionally, the molecular mechanisms of colon cancer development from SSA/Ps are poorly understood. RNA sequencing was performed on 21 SSA/Ps, 10 HPs, 10 adenomas, 21 uninvolved colon and 20 control colon specimens. Differential expression and leave-one-out cross validation methods were used to define a unique gene signature of SSA/Ps. Our SSA/P gene signature was evaluated in colon cancer RNA-Seq data from The Cancer Genome Atlas (TCGA) to identify a subtype of colon cancers that may develop from SSA/Ps. A total of 1422 differentially expressed genes were found in SSA/Ps relative to controls. Serrated polyposis syndrome (n=12) and sporadic SSA/Ps (n=9) exhibited almost complete (96%) gene overlap. A 51-gene panel in SSA/P showed similar expression in a subset of TCGA colon cancers with high microsatellite instability (MSI-H). A smaller seven-gene panel showed high sensitivity and specificity in identifying BRAF mutant, CpG island methylator phenotype high (CIMP-H) and MLH1 silenced colon cancers. We describe a unique gene signature in SSA/Ps that identifies a subset of colon cancers likely to develop through the serrated pathway. These gene panels may be utilized for improved differentiation of SSA/Ps from HPs and provide insights into novel molecular pathways altered in colon cancer arising from the serrated pathway. PMID:27026680

  15. A quantum causal discovery algorithm

    NASA Astrophysics Data System (ADS)

    Giarmatzi, Christina; Costa, Fabio

    2018-03-01

    Finding a causal model for a set of classical variables is now a well-established task—but what about the quantum equivalent? Even the notion of a quantum causal model is controversial. Here, we present a causal discovery algorithm for quantum systems. The input to the algorithm is a process matrix describing correlations between quantum events. Its output consists of different levels of information about the underlying causal model. Our algorithm determines whether the process is causally ordered by grouping the events into causally ordered non-signaling sets. It detects if all relevant common causes are included in the process, which we label Markovian, or alternatively if some causal relations are mediated through some external memory. For a Markovian process, it outputs a causal model, namely the causal relations and the corresponding mechanisms, represented as quantum states and channels. Our algorithm opens the route to more general quantum causal discovery methods.

  16. Imputation of adverse drug reactions: Causality assessment in hospitals

    PubMed Central

    Mastroianni, Patricia de Carvalho

    2017-01-01

    Background & objectives Different algorithms have been developed to standardize the causality assessment of adverse drug reactions (ADR). Although most share common characteristics, the results of the causality assessment are variable depending on the algorithm used. Therefore, using 10 different algorithms, the study aimed to compare inter-rater and multi-rater agreement for ADR causality assessment and identify the most consistent to hospitals. Methods Using ten causality algorithms, four judges independently assessed the first 44 cases of ADRs reported during the first year of implementation of a risk management service in a medium complexity hospital in the state of Sao Paulo (Brazil). Owing to variations in the terminology used for causality, the equivalent imputation terms were grouped into four categories: definite, probable, possible and unlikely. Inter-rater and multi-rater agreement analysis was performed by calculating the Cohen´s and Light´s kappa coefficients, respectively. Results None of the algorithms showed 100% reproducibility in the causal imputation. Fair inter-rater and multi-rater agreement was found. Emanuele (1984) and WHO-UMC (2010) algorithms showed a fair rate of agreement between the judges (k = 0.36). Interpretation & conclusions Although the ADR causality assessment algorithms were poorly reproducible, our data suggest that WHO-UMC algorithm is the most consistent for imputation in hospitals, since it allows evaluating the quality of the report. However, to improve the ability of assessing the causality using algorithms, it is necessary to include criteria for the evaluation of drug-related problems, which may be related to confounding variables that underestimate the causal association. PMID:28166274

  17. Whole-exome sequencing and high throughput genotyping identified KCNJ11 as the thirteenth MODY gene.

    PubMed

    Bonnefond, Amélie; Philippe, Julien; Durand, Emmanuelle; Dechaume, Aurélie; Huyvaert, Marlène; Montagne, Louise; Marre, Michel; Balkau, Beverley; Fajardy, Isabelle; Vambergue, Anne; Vatin, Vincent; Delplanque, Jérôme; Le Guilcher, David; De Graeve, Franck; Lecoeur, Cécile; Sand, Olivier; Vaxillaire, Martine; Froguel, Philippe

    2012-01-01

    Maturity-onset of the young (MODY) is a clinically heterogeneous form of diabetes characterized by an autosomal-dominant mode of inheritance, an onset before the age of 25 years, and a primary defect in the pancreatic beta-cell function. Approximately 30% of MODY families remain genetically unexplained (MODY-X). Here, we aimed to use whole-exome sequencing (WES) in a four-generation MODY-X family to identify a new susceptibility gene for MODY. WES (Agilent-SureSelect capture/Illumina-GAIIx sequencing) was performed in three affected and one non-affected relatives in the MODY-X family. We then performed a high-throughput multiplex genotyping (Illumina-GoldenGate assay) of the putative causal mutations in the whole family and in 406 controls. A linkage analysis was also carried out. By focusing on variants of interest (i.e. gains of stop codon, frameshift, non-synonymous and splice-site variants not reported in dbSNP130) present in the three affected relatives and not present in the control, we found 69 mutations. However, as WES was not uniform between samples, a total of 324 mutations had to be assessed in the whole family and in controls. Only one mutation (p.Glu227Lys in KCNJ11) co-segregated with diabetes in the family (with a LOD-score of 3.68). No KCNJ11 mutation was found in 25 other MODY-X unrelated subjects. Beyond neonatal diabetes mellitus (NDM), KCNJ11 is also a MODY gene ('MODY13'), confirming the wide spectrum of diabetes related phenotypes due to mutations in NDM genes (i.e. KCNJ11, ABCC8 and INS). Therefore, the molecular diagnosis of MODY should include KCNJ11 as affected carriers can be ideally treated with oral sulfonylureas.

  18. Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes

    PubMed Central

    Edmunds, Richard C.; Su, Baofeng; Balhoff, James P.; Eames, B. Frank; Dahdul, Wasila M.; Lapp, Hilmar; Lundberg, John G.; Vision, Todd J.; Dunham, Rex A.; Mabee, Paula M.; Westerfield, Monte

    2016-01-01

    Phenotypes resulting from mutations in genetic model organisms can help reveal candidate genes for evolutionarily important phenotypic changes in related taxa. Although testing candidate gene hypotheses experimentally in nonmodel organisms is typically difficult, ontology-driven information systems can help generate testable hypotheses about developmental processes in experimentally tractable organisms. Here, we tested candidate gene hypotheses suggested by expert use of the Phenoscape Knowledgebase, specifically looking for genes that are candidates responsible for evolutionarily interesting phenotypes in the ostariophysan fishes that bear resemblance to mutant phenotypes in zebrafish. For this, we searched ZFIN for genetic perturbations that result in either loss of basihyal element or loss of scales phenotypes, because these are the ancestral phenotypes observed in catfishes (Siluriformes). We tested the identified candidate genes by examining their endogenous expression patterns in the channel catfish, Ictalurus punctatus. The experimental results were consistent with the hypotheses that these features evolved through disruption in developmental pathways at, or upstream of, brpf1 and eda/edar for the ancestral losses of basihyal element and scales, respectively. These results demonstrate that ontological annotations of the phenotypic effects of genetic alterations in model organisms, when aggregated within a knowledgebase, can be used effectively to generate testable, and useful, hypotheses about evolutionary changes in morphology. PMID:26500251

  19. Common Marker Genes Identified from Various Sample Types for Systemic Lupus Erythematosus.

    PubMed

    Bing, Peng-Fei; Xia, Wei; Wang, Lan; Zhang, Yong-Hong; Lei, Shu-Feng; Deng, Fei-Yan

    2016-01-01

    Systemic lupus erythematosus (SLE) is a complex auto-immune disease. Gene expression studies have been conducted to identify SLE-related genes in various types of samples. It is unknown whether there are common marker genes significant for SLE but independent of sample types, which may have potentials for follow-up translational research. The aim of this study is to identify common marker genes across various sample types for SLE. Based on four public microarray gene expression datasets for SLE covering three representative types of blood-born samples (monocyte; peripheral blood mononuclear cell, PBMC; whole blood), we utilized three statistics (fold-change, FC; t-test p value; false discovery rate adjusted p value) to scrutinize genes simultaneously regulated with SLE across various sample types. For common marker genes, we conducted the Gene Ontology enrichment analysis and Protein-Protein Interaction analysis to gain insights into their functions. We identified 10 common marker genes associated with SLE (IFI6, IFI27, IFI44L, OAS1, OAS2, EIF2AK2, PLSCR1, STAT1, RNASE2, and GSTO1). Significant up-regulation of IFI6, IFI27, and IFI44L with SLE was observed in all the studied sample types, though the FC was most striking in monocyte, compared with PBMC and whole blood (8.82-251.66 vs. 3.73-74.05 vs. 1.19-1.87). Eight of the above 10 genes, except RNASE2 and GSTO1, interact with each other and with known SLE susceptibility genes, participate in immune response, RNA and protein catabolism, and cell death. Our data suggest that there exist common marker genes across various sample types for SLE. The 10 common marker genes, identified herein, deserve follow-up studies to dissert their potentials as diagnostic or therapeutic markers to predict SLE or treatment response.

  20. Applying causal mediation analysis to personality disorder research.

    PubMed

    Walters, Glenn D

    2018-01-01

    This article is designed to address fundamental issues in the application of causal mediation analysis to research on personality disorders. Causal mediation analysis is used to identify mechanisms of effect by testing variables as putative links between the independent and dependent variables. As such, it would appear to have relevance to personality disorder research. It is argued that proper implementation of causal mediation analysis requires that investigators take several factors into account. These factors are discussed under 5 headings: variable selection, model specification, significance evaluation, effect size estimation, and sensitivity testing. First, care must be taken when selecting the independent, dependent, mediator, and control variables for a mediation analysis. Some variables make better mediators than others and all variables should be based on reasonably reliable indicators. Second, the mediation model needs to be properly specified. This requires that the data for the analysis be prospectively or historically ordered and possess proper causal direction. Third, it is imperative that the significance of the identified pathways be established, preferably with a nonparametric bootstrap resampling approach. Fourth, effect size estimates should be computed or competing pathways compared. Finally, investigators employing the mediation method are advised to perform a sensitivity analysis. Additional topics covered in this article include parallel and serial multiple mediation designs, moderation, and the relationship between mediation and moderation. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  1. Integrative Analysis of GWASs, Human Protein Interaction, and Gene Expression Identified Gene Modules Associated With BMDs

    PubMed Central

    He, Hao; Zhang, Lei; Li, Jian; Wang, Yu-Ping; Zhang, Ji-Gang; Shen, Jie; Guo, Yan-Fang

    2014-01-01

    Context: To date, few systems genetics studies in the bone field have been performed. We designed our study from a systems-level perspective by integrating genome-wide association studies (GWASs), human protein-protein interaction (PPI) network, and gene expression to identify gene modules contributing to osteoporosis risk. Methods: First we searched for modules significantly enriched with bone mineral density (BMD)-associated genes in human PPI network by using 2 large meta-analysis GWAS datasets through a dense module search algorithm. One included 7 individual GWAS samples (Meta7). The other was from the Genetic Factors for Osteoporosis Consortium (GEFOS2). One was assigned as a discovery dataset and the other as an evaluation dataset, and vice versa. Results: In total, 42 modules and 129 modules were identified significantly in both Meta7 and GEFOS2 datasets for femoral neck and spine BMD, respectively. There were 3340 modules identified for hip BMD only in Meta7. As candidate modules, they were assessed for the biological relevance to BMD by gene set enrichment analysis in 2 expression profiles generated from circulating monocytes in subjects with low versus high BMD values. Interestingly, there were 2 modules significantly enriched in monocytes from the low BMD group in both gene expression datasets (nominal P value <.05). Two modules had 16 nonredundant genes. Functional enrichment analysis revealed that both modules were enriched for genes involved in Wnt receptor signaling and osteoblast differentiation. Conclusion: We highlighted 2 modules and novel genes playing important roles in the regulation of bone mass, providing important clues for therapeutic approaches for osteoporosis. PMID:25119315

  2. Causal Analysis After Haavelmo

    PubMed Central

    Heckman, James; Pinto, Rodrigo

    2014-01-01

    Haavelmo's seminal 1943 and 1944 papers are the first rigorous treatment of causality. In them, he distinguished the definition of causal parameters from their identification. He showed that causal parameters are defined using hypothetical models that assign variation to some of the inputs determining outcomes while holding all other inputs fixed. He thus formalized and made operational Marshall's (1890) ceteris paribus analysis. We embed Haavelmo's framework into the recursive framework of Directed Acyclic Graphs (DAGs) used in one influential recent approach to causality (Pearl, 2000) and in the related literature on Bayesian nets (Lauritzen, 1996). We compare the simplicity of an analysis of causality based on Haavelmo's methodology with the complex and nonintuitive approach used in the causal literature of DAGs—the “do-calculus” of Pearl (2009). We discuss the severe limitations of DAGs and in particular of the do-calculus of Pearl in securing identification of economic models. We extend our framework to consider models for simultaneous causality, a central contribution of Haavelmo. In general cases, DAGs cannot be used to analyze models for simultaneous causality, but Haavelmo's approach naturally generalizes to cover them. PMID:25729123

  3. Biological causal links on physiological and evolutionary time scales.

    PubMed

    Karmon, Amit; Pilpel, Yitzhak

    2016-04-26

    Correlation does not imply causation. If two variables, say A and B, are correlated, it could be because A causes B, or that B causes A, or because a third factor affects them both. We suggest that in many cases in biology, the causal link might be bi-directional: A causes B through a fast-acting physiological process, while B causes A through a slowly accumulating evolutionary process. Furthermore, many trained biologists tend to consistently focus at first on the fast-acting direction, and overlook the slower process in the opposite direction. We analyse several examples from modern biology that demonstrate this bias (codon usage optimality and gene expression, gene duplication and genetic dispensability, stem cell division and cancer risk, and the microbiome and host metabolism) and also discuss an example from linguistics. These examples demonstrate mutual effects between the fast physiological processes and the slow evolutionary ones. We believe that building awareness of inference biases among biologists who tend to prefer one causal direction over another could improve scientific reasoning.

  4. Entanglement, holography and causal diamonds

    NASA Astrophysics Data System (ADS)

    de Boer, Jan; Haehl, Felix M.; Heller, Michal P.; Myers, Robert C.

    2016-08-01

    We argue that the degrees of freedom in a d-dimensional CFT can be reorganized in an insightful way by studying observables on the moduli space of causal diamonds (or equivalently, the space of pairs of timelike separated points). This 2 d-dimensional space naturally captures some of the fundamental nonlocality and causal structure inherent in the entanglement of CFT states. For any primary CFT operator, we construct an observable on this space, which is defined by smearing the associated one-point function over causal diamonds. Known examples of such quantities are the entanglement entropy of vacuum excitations and its higher spin generalizations. We show that in holographic CFTs, these observables are given by suitably defined integrals of dual bulk fields over the corresponding Ryu-Takayanagi minimal surfaces. Furthermore, we explain connections to the operator product expansion and the first law of entanglemententropy from this unifying point of view. We demonstrate that for small perturbations of the vacuum, our observables obey linear two-derivative equations of motion on the space of causal diamonds. In two dimensions, the latter is given by a product of two copies of a two-dimensional de Sitter space. For a class of universal states, we show that the entanglement entropy and its spin-three generalization obey nonlinear equations of motion with local interactions on this moduli space, which can be identified with Liouville and Toda equations, respectively. This suggests the possibility of extending the definition of our new observables beyond the linear level more generally and in such a way that they give rise to new dynamically interacting theories on the moduli space of causal diamonds. Various challenges one has to face in order to implement this idea are discussed.

  5. Gene networks specific for innate immunity define post-traumatic stress disorder.

    PubMed

    Breen, M S; Maihofer, A X; Glatt, S J; Tylee, D S; Chandler, S D; Tsuang, M T; Risbrough, V B; Baker, D G; O'Connor, D T; Nievergelt, C M; Woelk, C H

    2015-12-01

    The molecular factors involved in the development of Post-Traumatic Stress Disorder (PTSD) remain poorly understood. Previous transcriptomic studies investigating the mechanisms of PTSD apply targeted approaches to identify individual genes under a cross-sectional framework lack a holistic view of the behaviours and properties of these genes at the system-level. Here we sought to apply an unsupervised gene-network based approach to a prospective experimental design using whole-transcriptome RNA-Seq gene expression from peripheral blood leukocytes of U.S. Marines (N=188), obtained both pre- and post-deployment to conflict zones. We identified discrete groups of co-regulated genes (i.e., co-expression modules) and tested them for association to PTSD. We identified one module at both pre- and post-deployment containing putative causal signatures for PTSD development displaying an over-expression of genes enriched for functions of innate-immune response and interferon signalling (Type-I and Type-II). Importantly, these results were replicated in a second non-overlapping independent dataset of U.S. Marines (N=96), further outlining the role of innate immune and interferon signalling genes within co-expression modules to explain at least part of the causal pathophysiology for PTSD development. A second module, consequential of trauma exposure, contained PTSD resiliency signatures and an over-expression of genes involved in hemostasis and wound responsiveness suggesting that chronic levels of stress impair proper wound healing during/after exposure to the battlefield while highlighting the role of the hemostatic system as a clinical indicator of chronic-based stress. These findings provide novel insights for early preventative measures and advanced PTSD detection, which may lead to interventions that delay or perhaps abrogate the development of PTSD.

  6. Guidelines for investigating causality of sequence variants in human disease

    PubMed Central

    MacArthur, D. G.; Manolio, T. A.; Dimmock, D. P.; Rehm, H. L.; Shendure, J.; Abecasis, G. R.; Adams, D. R.; Altman, R. B.; Antonarakis, S. E.; Ashley, E. A.; Barrett, J. C.; Biesecker, L. G.; Conrad, D. F.; Cooper, G. M.; Cox, N. J.; Daly, M. J.; Gerstein, M. B.; Goldstein, D. B.; Hirschhorn, J. N.; Leal, S. M.; Pennacchio, L. A.; Stamatoyannopoulos, J. A.; Sunyaev, S. R.; Valle, D.; Voight, B. F.; Winckler, W.; Gunter, C.

    2014-01-01

    The discovery of rare genetic variants is accelerating, and clear guidelines for distinguishing disease-causing sequence variants from the many potentially functional variants present in any human genome are urgently needed. Without rigorous standards we risk an acceleration of false-positive reports of causality, which would impede the translation of genomic research findings into the clinical diagnostic setting and hinder biological understanding of disease. Here we discuss the key challenges of assessing sequence variants in human disease, integrating both gene-level and variant-level support for causality. We propose guidelines for summarizing confidence in variant pathogenicity and highlight several areas that require further resource development. PMID:24759409

  7. Guidelines for investigating causality of sequence variants in human disease.

    PubMed

    MacArthur, D G; Manolio, T A; Dimmock, D P; Rehm, H L; Shendure, J; Abecasis, G R; Adams, D R; Altman, R B; Antonarakis, S E; Ashley, E A; Barrett, J C; Biesecker, L G; Conrad, D F; Cooper, G M; Cox, N J; Daly, M J; Gerstein, M B; Goldstein, D B; Hirschhorn, J N; Leal, S M; Pennacchio, L A; Stamatoyannopoulos, J A; Sunyaev, S R; Valle, D; Voight, B F; Winckler, W; Gunter, C

    2014-04-24

    The discovery of rare genetic variants is accelerating, and clear guidelines for distinguishing disease-causing sequence variants from the many potentially functional variants present in any human genome are urgently needed. Without rigorous standards we risk an acceleration of false-positive reports of causality, which would impede the translation of genomic research findings into the clinical diagnostic setting and hinder biological understanding of disease. Here we discuss the key challenges of assessing sequence variants in human disease, integrating both gene-level and variant-level support for causality. We propose guidelines for summarizing confidence in variant pathogenicity and highlight several areas that require further resource development.

  8. A Kernel Embedding-Based Approach for Nonstationary Causal Model Inference.

    PubMed

    Hu, Shoubo; Chen, Zhitang; Chan, Laiwan

    2018-05-01

    Although nonstationary data are more common in the real world, most existing causal discovery methods do not take nonstationarity into consideration. In this letter, we propose a kernel embedding-based approach, ENCI, for nonstationary causal model inference where data are collected from multiple domains with varying distributions. In ENCI, we transform the complicated relation of a cause-effect pair into a linear model of variables of which observations correspond to the kernel embeddings of the cause-and-effect distributions in different domains. In this way, we are able to estimate the causal direction by exploiting the causal asymmetry of the transformed linear model. Furthermore, we extend ENCI to causal graph discovery for multiple variables by transforming the relations among them into a linear nongaussian acyclic model. We show that by exploiting the nonstationarity of distributions, both cause-effect pairs and two kinds of causal graphs are identifiable under mild conditions. Experiments on synthetic and real-world data are conducted to justify the efficacy of ENCI over major existing methods.

  9. Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

    PubMed

    Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

    2015-06-01

    To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.

  10. The discourse of causal explanations in school science

    NASA Astrophysics Data System (ADS)

    Slater, Tammy Jayne Anne

    Researchers and educators working from a systemic functional linguistic perspective have provided a body of work on science discourse which offers an excellent starting point for examining the linguistic aspects of the development of causal discourse in school science, discourse which Derewianka (1995) claimed is critical to success in secondary school. No work has yet described the development of causal language by identifying the linguistic features present in oral discourse or by comparing the causal discourse of native and non-native (ESL) speakers of English. The current research responds to this gap by examining the oral discourse collected from ESL and non-ESL students at the primary and high school grades. Specifically, it asks the following questions: (1) How do the teachers and students in these four contexts develop causal explanations and their relevant taxonomies through classroom interactions? (2) What are the causal discourse features being used by the students in these four contexts to construct oral causal explanations? The findings of the social practice analysis showed that the teachers in the four contexts differed in their approaches to teaching, with the primary school mainstream teacher focusing largely on the hands-on practice , the primary school ESL teacher moving from practice to theory, the high school mainstream teacher moving from theory to practice, and the high school ESL teacher relying primarily on theory. The findings from the quantitative, small corpus approach suggest that the developmental path of cause which has been identified in the writing of experts shows up not only in written texts but also in the oral texts which learners construct. Moreover, this move appears when the discourse of high school ESL and non-ESL students is compared, suggesting a developmental progression in the acquisition of these features by these students. The findings also reveal that the knowledge constructed, as shown by the concept maps created

  11. The role of causal criteria in causal inferences: Bradford Hill's "aspects of association".

    PubMed

    Ward, Andrew C

    2009-06-17

    As noted by Wesley Salmon and many others, causal concepts are ubiquitous in every branch of theoretical science, in the practical disciplines and in everyday life. In the theoretical and practical sciences especially, people often base claims about causal relations on applications of statistical methods to data. However, the source and type of data place important constraints on the choice of statistical methods as well as on the warrant attributed to the causal claims based on the use of such methods. For example, much of the data used by people interested in making causal claims come from non-experimental, observational studies in which random allocations to treatment and control groups are not present. Thus, one of the most important problems in the social and health sciences concerns making justified causal inferences using non-experimental, observational data. In this paper, I examine one method of justifying such inferences that is especially widespread in epidemiology and the health sciences generally - the use of causal criteria. I argue that while the use of causal criteria is not appropriate for either deductive or inductive inferences, they do have an important role to play in inferences to the best explanation. As such, causal criteria, exemplified by what Bradford Hill referred to as "aspects of [statistical] associations", have an indispensible part to play in the goal of making justified causal claims.

  12. The role of causal criteria in causal inferences: Bradford Hill's "aspects of association"

    PubMed Central

    Ward, Andrew C

    2009-01-01

    As noted by Wesley Salmon and many others, causal concepts are ubiquitous in every branch of theoretical science, in the practical disciplines and in everyday life. In the theoretical and practical sciences especially, people often base claims about causal relations on applications of statistical methods to data. However, the source and type of data place important constraints on the choice of statistical methods as well as on the warrant attributed to the causal claims based on the use of such methods. For example, much of the data used by people interested in making causal claims come from non-experimental, observational studies in which random allocations to treatment and control groups are not present. Thus, one of the most important problems in the social and health sciences concerns making justified causal inferences using non-experimental, observational data. In this paper, I examine one method of justifying such inferences that is especially widespread in epidemiology and the health sciences generally – the use of causal criteria. I argue that while the use of causal criteria is not appropriate for either deductive or inductive inferences, they do have an important role to play in inferences to the best explanation. As such, causal criteria, exemplified by what Bradford Hill referred to as "aspects of [statistical] associations", have an indispensible part to play in the goal of making justified causal claims. PMID:19534788

  13. A cross-species bi-clustering approach to identifying conserved co-regulated genes.

    PubMed

    Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo

    2016-06-15

    A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regularized genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach is ineffective to deal with the noise in the expression data introduced by the complicated procedures in quantifying gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, thus difficult to detect conserved co-regulated genes. We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes that are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages that the clustered genes co-regulate. Efficient optimization algorithm has been developed with convergence analysis. This approach was first validated on synthetic data and compared

  14. A Theory of Causal Learning in Children: Causal Maps and Bayes Nets

    ERIC Educational Resources Information Center

    Gopnik, Alison; Glymour, Clark; Sobel, David M.; Schulz, Laura E.; Kushnir, Tamar; Danks, David

    2004-01-01

    The authors outline a cognitive and computational account of causal learning in children. They propose that children use specialized cognitive systems that allow them to recover an accurate "causal map" of the world: an abstract, coherent, learned representation of the causal relations among events. This kind of knowledge can be perspicuously…

  15. Conjectures on the relations of linking and causality in causally simple spacetimes

    NASA Astrophysics Data System (ADS)

    Chernov, Vladimir

    2018-05-01

    We formulate the generalization of the Legendrian Low conjecture of Natario and Tod (proved by Nemirovski and myself before) to the case of causally simple spacetimes. We prove a weakened version of the corresponding statement. In all known examples, a causally simple spacetime can be conformally embedded as an open subset into some globally hyperbolic and the space of light rays in is an open submanifold of the space of light rays in . If this is always the case, this provides an approach to solving the conjectures relating causality and linking in causally simple spacetimes.

  16. Analogy in causal inference: rethinking Austin Bradford Hill's neglected consideration.

    PubMed

    Weed, Douglas L

    2018-05-01

    The purpose of this article was to rethink and resurrect Austin Bradford Hill's "criterion" of analogy as an important consideration in causal inference. In epidemiology today, analogy is either completely ignored (e.g., in many textbooks), or equated with biologic plausibility or coherence, or aligned with the scientist's imagination. None of these examples, however, captures Hill's description of analogy. His words suggest that there may be something gained by contrasting two bodies of evidence, one from an established causal relationship, the other not. Coupled with developments in the methods of systematic assessments of evidence-including but not limited to meta-analysis-analogy can be restructured as a key component in causal inference. This new approach will require that a collection-a library-of known cases of causal inference (i.e., bodies of evidence involving established causal relationships) be developed. This library would likely include causal assessments by organizations such as the International Agency for Research on Cancer, the National Toxicology Program, and the United States Environmental Protection Agency. In addition, a process for describing key features of a causal relationship would need to be developed along with what will be considered paradigm cases of causation. Finally, it will be important to develop ways to objectively compare a "new" body of evidence with the relevant paradigm case of causation. Analogy, along with all other existing methods and causal considerations, may improve our ability to identify causal relationships. Copyright © 2018 Elsevier Inc. All rights reserved.

  17. Intellectual disability and autism spectrum disorders: causal genes and molecular mechanisms.

    PubMed

    Srivastava, Anand K; Schwartz, Charles E

    2014-10-01

    Intellectual disability (ID) and autism spectrum disorder (ASD) are the most common developmental disorders present in humans. Combined, they affect between 3 and 5% of the population. Additionally, they can be found together in the same individual thereby complicating treatment. The causative factors (genes, epigenetic and environmental) are quite varied and likely interact so as to further complicate the assessment of an individual patient. Nonetheless, much valuable information has been gained by identifying candidate genes for ID or ASD. Understanding the etiology of either ID or ASD is of utmost importance for families. It allows a determination of the risk of recurrence, the possibility of other comorbidity medical problems, the molecular and cellular nature of the pathobiology and hopefully potential therapeutic approaches. Copyright © 2014 Elsevier Ltd. All rights reserved.

  18. Causal Systems Categories: Differences in Novice and Expert Categorization of Causal Phenomena

    ERIC Educational Resources Information Center

    Rottman, Benjamin M.; Gentner, Dedre; Goldwater, Micah B.

    2012-01-01

    We investigated the understanding of causal systems categories--categories defined by common causal structure rather than by common domain content--among college students. We asked students who were either novices or experts in the physical sciences to sort descriptions of real-world phenomena that varied in their causal structure (e.g., negative…

  19. A survey of single nucleotide polymorphisms identified from whole-genome sequencing and their functional effect in the porcine genome

    USDA-ARS?s Scientific Manuscript database

    Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a mor...

  20. A survey of FLS2 genes from multiple citrus species identifies candidates for enhancing disease resistance to Xanthomonas citri ssp. citri.

    PubMed Central

    Shi, Qingchun; Febres, Vicente J; Jones, Jeffrey B; Moore, Gloria A

    2016-01-01

    Pathogen-associated molecular patterns (PAMPs)-triggered immunity (PTI) is an important component of plant innate immunity. In a previous study, we showed that the PAMP flg22 from Xanthomonas citri ssp. citri (Xflg22), the causal agent of citrus canker, induced PTI in citrus, which correlated with the observed levels of canker resistance. Here, we identified and sequenced two bacterial flagellin/flg22 receptors (FLS2-1 and FLS2-2) from ‘Duncan’ grapefruit (Citrus paradisi, CpFLS2-1 and CpFLS2-2) and ‘Sun Chu Sha’ mandarin (C. reticulata, CrFLS2-1 and CrFLS2-2). We were able to isolate only one FLS2 from ‘Nagami’ kumquat (Fortunella margarita, FmFLS2-1) and gene flanking sequences suggest a rearrangement event that resulted in the deletion of FLS2-2 from the genome. Phylogenetic analysis, gene structure and presence of critical amino acid domains all indicate we identified the true FLS2 genes in citrus. FLS2-2 was more transcriptionally responsive to Xflg22 than FLS2-1, with induced expression levels higher in canker-resistant citrus than in susceptible ones. Interestingly, ‘Nagami’ kumquat showed the highest FLS2-1 steady-state expression levels, although it was not induced by Xflg22. We selected FmFLS2-1, CrFLS2-2 and CpFLS2-2 to further evaluate their capacity to enhance bacterial resistance using Agrobacterium-mediated transient expression assays. Both FmFLS2-1 and CrFLS2-2, the two proteins from canker-resistant species, conferred stronger Xflg22 responses and reduced canker symptoms in leaves of the susceptible grapefruit genotype. These two citrus genes will be useful resources to enhance PTI and achieve resistance against canker and possibly other bacterial pathogens in susceptible citrus types. PMID:27222722

  1. Inability to activate Rac1-dependent forgetting contributes to behavioral inflexibility in mutants of multiple autism-risk genes

    PubMed Central

    Dong, Tao; He, Jing; Wang, Shiqing; Wang, Lianzhang; Cheng, Yuqi; Zhong, Yi

    2016-01-01

    The etiology of autism is so complicated because it involves the effects of variants of several hundred risk genes along with the contribution of environmental factors. Therefore, it has been challenging to identify the causal paths that lead to the core autistic symptoms such as social deficit, repetitive behaviors, and behavioral inflexibility. As an alternative approach, extensive efforts have been devoted to identifying the convergence of the targets and functions of the autism-risk genes to facilitate mapping out causal paths. In this study, we used a reversal-learning task to measure behavioral flexibility in Drosophila and determined the effects of loss-of-function mutations in multiple autism-risk gene homologs in flies. Mutations of five autism-risk genes with diversified molecular functions all led to a similar phenotype of behavioral inflexibility indicated by impaired reversal-learning. These reversal-learning defects resulted from the inability to forget or rather, specifically, to activate Rac1 (Ras-related C3 botulinum toxin substrate 1)-dependent forgetting. Thus, behavior-evoked activation of Rac1-dependent forgetting has a converging function for autism-risk genes. PMID:27335463

  2. Causal mechanisms and balancing selection inferred from genetic associations with polycystic ovary syndrome.

    PubMed

    Day, Felix R; Hinds, David A; Tung, Joyce Y; Stolk, Lisette; Styrkarsdottir, Unnur; Saxena, Richa; Bjonnes, Andrew; Broer, Linda; Dunger, David B; Halldorsson, Bjarni V; Lawlor, Debbie A; Laval, Guillaume; Mathieson, Iain; McCardle, Wendy L; Louwers, Yvonne; Meun, Cindy; Ring, Susan; Scott, Robert A; Sulem, Patrick; Uitterlinden, André G; Wareham, Nicholas J; Thorsteinsdottir, Unnur; Welt, Corrine; Stefansson, Kari; Laven, Joop S E; Ong, Ken K; Perry, John R B

    2015-09-29

    Polycystic ovary syndrome (PCOS) is the most common reproductive disorder in women, yet there is little consensus regarding its aetiology. Here we perform a genome-wide association study of PCOS in up to 5,184 self-reported cases of White European ancestry and 82,759 controls, with follow-up in a further ∼2,000 clinically validated cases and ∼100,000 controls. We identify six signals for PCOS at genome-wide statistical significance (P<5 × 10(-8)), in/near genes ERBB4/HER4, YAP1, THADA, FSHB, RAD50 and KRR1. Variants in/near three of the four epidermal growth factor receptor genes (ERBB2/HER2, ERBB3/HER3 and ERBB4/HER4) are associated with PCOS at or near genome-wide significance. Mendelian randomization analyses indicate causal roles in PCOS aetiology for higher BMI (P=2.5 × 10(-9)), higher insulin resistance (P=6 × 10(-4)) and lower serum sex hormone binding globulin concentrations (P=5 × 10(-4)). Furthermore, genetic susceptibility to later menopause is associated with higher PCOS risk (P=1.6 × 10(-8)) and PCOS-susceptibility alleles are associated with higher serum anti-Müllerian hormone concentrations in girls (P=8.9 × 10(-5)). This large-scale study implicates an aetiological role of the epidermal growth factor receptors, infers causal mechanisms relevant to clinical management and prevention, and suggests balancing selection mechanisms involved in PCOS risk.

  3. ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

    PubMed

    Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

    2015-01-01

    Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.

  4. Identifying differentially expressed genes in cancer patients using a non-parameter Ising model.

    PubMed

    Li, Xumeng; Feltus, Frank A; Sun, Xiaoqian; Wang, James Z; Luo, Feng

    2011-10-01

    Identification of genes and pathways involved in diseases and physiological conditions is a major task in systems biology. In this study, we developed a novel non-parameter Ising model to integrate protein-protein interaction network and microarray data for identifying differentially expressed (DE) genes. We also proposed a simulated annealing algorithm to find the optimal configuration of the Ising model. The Ising model was applied to two breast cancer microarray data sets. The results showed that more cancer-related DE sub-networks and genes were identified by the Ising model than those by the Markov random field model. Furthermore, cross-validation experiments showed that DE genes identified by Ising model can improve classification performance compared with DE genes identified by Markov random field model. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  5. A recellularized human colon model identifies cancer driver genes

    PubMed Central

    Chen, Huanhuan Joyce; Wei, Zhubo; Sun, Jian; Bhattacharya, Asmita; Savage, David J; Serda, Rita; Mackeyev, Yuri; Curley, Steven A.; Bu, Pengcheng; Wang, Lihua; Chen, Shuibing; Cohen-Gould, Leona; Huang, Emina; Shen, Xiling; Lipkin, Steven M.; Copeland, Neal G.; Jenkins, Nancy A.; Shuler, Michael L.

    2016-01-01

    Refined cancer models are needed to bridge the gap between cell-line, animal and clinical research. Here we describe the engineering of an organotypic colon cancer model by recellularization of a native human matrix that contains cell-populated mucosa and an intact muscularis mucosa layer. This ex vivo system recapitulates the pathophysiological progression from APC-mutant neoplasia to submucosal invasive tumor. We used it to perform a Sleeping Beauty transposon mutagenesis screen to identify genes that cooperate with mutant APC in driving invasive neoplasia. 38 candidate invasion driver genes were identified, 17 of which have been previously implicated in colorectal cancer progression, including TCF7L2, TWIST2, MSH2, DCC and EPHB1/2. Six invasion driver genes that to our knowledge have not been previously described were validated in vitro using cell proliferation, migration and invasion assays, and ex vivo using recellularized human colon. These results demonstrate the utility of our organoid model for studying cancer biology. PMID:27398792

  6. A review of causal inference for biomedical informatics

    PubMed Central

    Kleinberg, Samantha; Hripcsak, George

    2011-01-01

    Causality is an important concept throughout the health sciences and is particularly vital for informatics work such as finding adverse drug events or risk factors for disease using electronic health records. While philosophers and scientists working for centuries on formalizing what makes something a cause have not reached a consensus, new methods for inference show that we can make progress in this area in many practical cases. This article reviews core concepts in understanding and identifying causality and then reviews current computational methods for inference and explanation, focusing on inference from large-scale observational data. While the problem is not fully solved, we show that graphical models and Granger causality provide useful frameworks for inference and that a more recent approach based on temporal logic addresses some of the limitations of these methods. PMID:21782035

  7. A noncoding melanophilin gene (MLPH) SNP at the splice donor of exon 1 represents a candidate causal mutation for coat color dilution in dogs.

    PubMed

    Drögemüller, Cord; Philipp, Ute; Haase, Bianca; Günzel-Apel, Anne-Rose; Leeb, Tosso

    2007-01-01

    Coat color dilution in several breeds of dog is characterized by a specific pigmentation phenotype and sometimes accompanied by hair loss and recurrent skin inflammation, the so-called color dilution alopecia or black hair follicular dysplasia. Coat color dilution (d) is inherited as a Mendelian autosomal recessive trait. In a previous study, MLPH polymorphisms showed perfect cosegregation with the dilute phenotype within breeds. However, different dilute haplotypes were found in different breeds, and no single polymorphism was identified in the coding sequence that was likely to be causative for the dilute phenotype. We resequenced the 5'-region of the canine MLPH gene and identified a strong candidate single nucleotide polymorphism within the nontranslated exon 1, which showed perfect association to the dilute phenotype in 65 dilute dogs from 7 different breeds. The A/G polymorphism is located at the last nucleotide of exon 1 and the mutant A-allele is predicted to reduce splicing efficiency 8-fold. An MLPH mRNA expression study using quantitative reverse transcriptase-polymerase chain reaction confirmed that dd animals had only about approximately 25% of the MLPH transcript compared with DD animals. These results provide preliminary evidence that the reported regulatory MLPH mutation might represent a causal mutation for coat color dilution in dogs.

  8. Time-varying causality between energy consumption, CO2 emissions, and economic growth: evidence from US states.

    PubMed

    Tzeremes, Panayiotis

    2018-02-01

    This study is the first attempt to investigate the relationship between CO 2 emissions, energy consumption, and economic growth at a state level, for the 50 US states, through a time-varying causality approach using annual data over the periods 1960-2010. The time-varying causality test facilitates the better understanding of the causal relationship between the covariates owing to the fact that it might identify causalities when the time-constant hypothesis is rejected. Our findings indicate the existence of a time-varying causality at the state level. Specifically, the results probe eight bidirectional time-varying causalities between energy consumption and CO 2 emission, six cases of two-way time-varying causalities between economic growth and energy consumption, and five bidirectional time-varying causalities between economic growth and CO 2 emission. Moreover, we examine the traditional environmental Kuznets curve hypothesis for the states. Notably, our results do not endorse the validity of the EKC, albeit the majority of states support an inverted N-shaped relationship. Lastly, we can identify multiple policy implications based on the empirical results.

  9. Gene expression profiling combined with bioinformatics analysis identify biomarkers for Parkinson disease.

    PubMed

    Diao, Hongyu; Li, Xinxing; Hu, Sheng; Liu, Yunhui

    2012-01-01

    Parkinson disease (PD) progresses relentlessly and affects approximately 4% of the population aged over 80 years old. It is difficult to diagnose in its early stages. The purpose of our study is to identify molecular biomarkers for PD initiation using a computational bioinformatics analysis of gene expression. We downloaded the gene expression profile of PD from Gene Expression Omnibus and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in PD patients compared to controls. Besides, we built a regulatory network by mapping the DCGs to known regulatory data between transcription factors (TFs) and target genes and calculated the regulatory impact factor of each transcription factor. As the results, a total of 1004 genes associated with PD initiation were identified. Pathway enrichment of these genes suggests that biological processes of protein turnover were impaired in PD. In the regulatory network, HLF, E2F1 and STAT4 were found have altered expression levels in PD patients. The expression levels of other transcription factors, NKX3-1, TAL1, RFX1 and EGR3, were not found altered. However, they regulated differentially expressed genes. In conclusion, we suggest that HLF, E2F1 and STAT4 may be used as molecular biomarkers for PD; however, more work is needed to validate our result.

  10. Gene Expression Profiling Combined with Bioinformatics Analysis Identify Biomarkers for Parkinson Disease

    PubMed Central

    Diao, Hongyu; Li, Xinxing; Hu, Sheng; Liu, Yunhui

    2012-01-01

    Parkinson disease (PD) progresses relentlessly and affects approximately 4% of the population aged over 80 years old. It is difficult to diagnose in its early stages. The purpose of our study is to identify molecular biomarkers for PD initiation using a computational bioinformatics analysis of gene expression. We downloaded the gene expression profile of PD from Gene Expression Omnibus and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in PD patients compared to controls. Besides, we built a regulatory network by mapping the DCGs to known regulatory data between transcription factors (TFs) and target genes and calculated the regulatory impact factor of each transcription factor. As the results, a total of 1004 genes associated with PD initiation were identified. Pathway enrichment of these genes suggests that biological processes of protein turnover were impaired in PD. In the regulatory network, HLF, E2F1 and STAT4 were found have altered expression levels in PD patients. The expression levels of other transcription factors, NKX3-1, TAL1, RFX1 and EGR3, were not found altered. However, they regulated differentially expressed genes. In conclusion, we suggest that HLF, E2F1 and STAT4 may be used as molecular biomarkers for PD; however, more work is needed to validate our result. PMID:23284986

  11. Axon Regeneration Genes Identified by RNAi Screening in C. elegans

    PubMed Central

    Nix, Paola; Hammarlund, Marc; Hauth, Linda; Lachnit, Martina; Jorgensen, Erik M.

    2014-01-01

    Axons of the mammalian CNS lose the ability to regenerate soon after development due to both an inhibitory CNS environment and the loss of cell-intrinsic factors necessary for regeneration. The complex molecular events required for robust regeneration of mature neurons are not fully understood, particularly in vivo. To identify genes affecting axon regeneration in Caenorhabditis elegans, we performed both an RNAi-based screen for defective motor axon regeneration in unc-70/β-spectrin mutants and a candidate gene screen. From these screens, we identified at least 50 conserved genes with growth-promoting or growth-inhibiting functions. Through our analysis of mutants, we shed new light on certain aspects of regeneration, including the role of β-spectrin and membrane dynamics, the antagonistic activity of MAP kinase signaling pathways, and the role of stress in promoting axon regeneration. Many gene candidates had not previously been associated with axon regeneration and implicate new pathways of interest for therapeutic intervention. PMID:24403161

  12. Does Causal Action Facilitate Causal Perception in Infants Younger than 6 Months of Age?

    ERIC Educational Resources Information Center

    Rakison, David H.; Krogh, Lauren

    2012-01-01

    Previous research has established that infants are unable to perceive causality until 6 1/4 months of age. The current experiments examined whether infants' ability to engage in causal action could facilitate causal perception prior to this age. In Experiment 1, 4 1/2-month-olds were randomly assigned to engage in causal action experience via…

  13. Complex Causal Process Diagrams for Analyzing the Health Impacts of Policy Interventions

    PubMed Central

    Joffe, Michael; Mindell, Jennifer

    2006-01-01

    Causal diagrams are rigorous tools for controlling confounding. They also can be used to describe complex causal systems, which is done routinely in communicable disease epidemiology. The use of change diagrams has advantages over static diagrams, because change diagrams are more tractable, relate better to interventions, and have clearer interpretations. Causal diagrams are a useful basis for modeling. They make assumptions explicit, provide a framework for analysis, generate testable predictions, explore the effects of interventions, and identify data gaps. Causal diagrams can be used to integrate different types of information and to facilitate communication both among public health experts and between public health experts and experts in other fields. Causal diagrams allow the use of instrumental variables, which can help control confounding and reverse causation. PMID:16449586

  14. Gene expression profiles analysis identifies key genes for acute lung injury in patients with sepsis.

    PubMed

    Guo, Zhiqiang; Zhao, Chuncheng; Wang, Zheng

    2014-09-26

    To identify critical genes and biological pathways in acute lung injury (ALI), a comparative analysis of gene expression profiles of patients with ALI + sepsis compared with patients with sepsis alone were performed with bioinformatic tools. GSE10474 was downloaded from Gene Expression Omnibus, including a collective of 13 whole blood samples with ALI + sepsis and 21 whole blood samples with sepsis alone. After pre-treatment with robust multichip averaging (RMA) method, differential analysis was conducted using simpleaffy package based upon t-test and fold change. Hierarchical clustering was also performed using function hclust from package stats. Beisides, functional enrichment analysis was conducted using iGepros. Moreover, the gene regulatory network was constructed with information from Kyoto Encyclopedia of Genes and Genomes (KEGG) and then visualized by Cytoscape. A total of 128 differentially expressed genes (DEGs) were identified, including 47 up- and 81 down-regulated genes. The significantly enriched functions included negative regulation of cell proliferation, regulation of response to stimulus and cellular component morphogenesis. A total of 27 DEGs were significantly enriched in 16 KEGG pathways, such as protein digestion and absorption, fatty acid metabolism, amoebiasis, etc. Furthermore, the regulatory network of these 27 DEGs was constructed, which involved several key genes, including protein tyrosine kinase 2 (PTK2), v-src avian sarcoma (SRC) and Caveolin 2 (CAV2). PTK2, SRC and CAV2 may be potential markers for diagnosis and treatment of ALI. The virtual slide(s) for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/5865162912987143.

  15. Common Viral Integration Sites Identified in Avian Leukosis Virus-Induced B-Cell Lymphomas

    PubMed Central

    Justice, James F.; Morgan, Robin W.

    2015-01-01

    ABSTRACT Avian leukosis virus (ALV) induces B-cell lymphoma and other neoplasms in chickens by integrating within or near cancer genes and perturbing their expression. Four genes—MYC, MYB, Mir-155, and TERT—have previously been identified as common integration sites in these virus-induced lymphomas and are thought to play a causal role in tumorigenesis. In this study, we employ high-throughput sequencing to identify additional genes driving tumorigenesis in ALV-induced B-cell lymphomas. In addition to the four genes implicated previously, we identify other genes as common integration sites, including TNFRSF1A, MEF2C, CTDSPL, TAB2, RUNX1, MLL5, CXorf57, and BACH2. We also analyze the genome-wide ALV integration landscape in vivo and find increased frequency of ALV integration near transcriptional start sites and within transcripts. Previous work has shown ALV prefers a weak consensus sequence for integration in cultured human cells. We confirm this consensus sequence for ALV integration in vivo in the chicken genome. PMID:26670384

  16. Combining Genome-Scale Experimental and Computational Methods To Identify Essential Genes in Rhodobacter sphaeroides

    DOE PAGES

    Burger, Brian T.; Imam, Saheed; Scarborough, Matthew J.; ...

    2017-06-06

    Rhodobacter sphaeroides is one of the best-studied alphaproteobacteria from biochemical, genetic, and genomic perspectives. To gain a better systems-level understanding of this organism, we generated a large transposon mutant library and used transposon sequencing (Tn-seq) to identify genes that are essential under several growth conditions. Using newly developed Tn-seq analysis software (TSAS), we identified 493 genes as essential for aerobic growth on a rich medium. We then used the mutant library to identify conditionally essential genes under two laboratory growth conditions, identifying 85 additional genes required for aerobic growth in a minimal medium and 31 additional genes required for photosyntheticmore » growth. In all instances, our analyses confirmed essentiality for many known genes and identified genes not previously considered to be essential. We used the resulting Tn-seq data to refine and improve a genome-scale metabolic network model (GEM) for R. sphaeroides. Together, we demonstrate how genetic, genomic, and computational approaches can be combined to obtain a systems-level understanding of the genetic framework underlying metabolic diversity in bacterial species.« less

  17. A Functional Genomics Approach to Identify Novel Breast Cancer Gene Targets in Yeast

    DTIC Science & Technology

    2004-05-01

    AD Award Number: DAMD17-03-1-0232 TITLE: A Functional Genomics Approach to Identify Novel Breast Cancer Gene Targets in Yeast PRINCIPAL INVESTIGATOR...Approach to Identify Novel Breast DAMD17-03-1-0232 Cancer Gene Targets in Yeast 6. A UTHOR(S) Craig Bennett, Ph.D. 7. PERFORMING ORGANIZA TION NAME(S...Unlimited 13. ABSTRACT (Maximum 200 Words) We are using the yeast Saccharomyces cerevisiae to identify new cancer gene targets that interact with the

  18. Agency, time, and causality

    PubMed Central

    Widlok, Thomas

    2014-01-01

    Cognitive Scientists interested in causal cognition increasingly search for evidence from non-Western Educational Industrial Rich Democratic people but find only very few cross-cultural studies that specifically target causal cognition. This article suggests how information about causality can be retrieved from ethnographic monographs, specifically from ethnographies that discuss agency and concepts of time. Many apparent cultural differences with regard to causal cognition dissolve when cultural extensions of agency and personhood to non-humans are taken into account. At the same time considerable variability remains when we include notions of time, linearity and sequence. The article focuses on ethnographic case studies from Africa but provides a more general perspective on the role of ethnography in research on the diversity and universality of causal cognition. PMID:25414683

  19. Photoreceptor dysplasia (pd) in miniature schnauzer dogs: evaluation of candidate genes by molecular genetic analysis.

    PubMed

    Zhang, Q; Baldwin, V J; Acland, G M; Parshall, C J; Haskel, J; Aguirre, G D; Ray, K

    1999-01-01

    Photoreceptor dysplasia (pd) is one of a group of at least six distinct autosomal and one X-linked retinal disorders identified in dogs which are collectively known as progressive retinal atrophy (PRA). It is an early onset retinal disease identified in miniature schnauzer dogs, and pedigree analysis and breeding studies have established autosomal recessive inheritance of the disease. Using a gene-based approach, a number of retina-expressed genes, including some members of the phototransduction pathway, have been causally implicated in retinal diseases of humans and other animals. Here we examined seven such potential candidate genes (opsin, RDS/peripherin, ROM1, rod cGMP-gated cation channel alpha-subunit, and three subunits of transducin) for their causal association with the pd locus by testing segregation of intragenic markers with the disease locus, or, in the absence of informative polymorphisms, sequencing of the coding regions of the genes. Based on these results, we have conclusively excluded four photoreceptor-specific genes as candidates for pd by linkage analysis. For three other photoreceptor-specific genes, we did not find any mutation in the coding sequences of the genes and have excluded them provisionally. Formal exclusion would require investigation of the levels of expression of the candidate genes in pd-affected dogs relative to age-matched controls. At present we are building suitable informative pedigrees for the disease locus with a sufficient number of meiosis to be useful for genomewide screening. This should identify markers linked to the disease locus and eventually permit progress toward the identification of the photoreceptor dysplasia gene and the disease-causing mutation.

  20. ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes

    PubMed Central

    Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

    2015-01-01

    Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614

  1. MMTV insertional mutagenesis identifies genes, gene families and pathways involved in mammary cancer.

    PubMed

    Theodorou, Vassiliki; Kimm, Melanie A; Boer, Mandy; Wessels, Lodewyk; Theelen, Wendy; Jonkers, Jos; Hilkens, John

    2007-06-01

    We performed a high-throughput retroviral insertional mutagenesis screen in mouse mammary tumor virus (MMTV)-induced mammary tumors and identified 33 common insertion sites, of which 17 genes were previously not known to be associated with mammary cancer and 13 had not previously been linked to cancer in general. Although members of the Wnt and fibroblast growth factors (Fgf) families were frequently tagged, our exhaustive screening for MMTV insertion sites uncovered a new repertoire of candidate breast cancer oncogenes. We validated one of these genes, Rspo3, as an oncogene by overexpression in a p53-deficient mammary epithelial cell line. The human orthologs of the candidate oncogenes were frequently deregulated in human breast cancers and associated with several tumor parameters. Computational analysis of all MMTV-tagged genes uncovered specific gene families not previously associated with cancer and showed a significant overrepresentation of protein domains and signaling pathways mainly associated with development and growth factor signaling. Comparison of all tagged genes in MMTV and Moloney murine leukemia virus-induced malignancies showed that both viruses target mostly different genes that act predominantly in distinct pathways.

  2. Structural Equations and Causal Explanations: Some Challenges for Causal SEM

    ERIC Educational Resources Information Center

    Markus, Keith A.

    2010-01-01

    One common application of structural equation modeling (SEM) involves expressing and empirically investigating causal explanations. Nonetheless, several aspects of causal explanation that have an impact on behavioral science methodology remain poorly understood. It remains unclear whether applications of SEM should attempt to provide complete…

  3. Chapter 15: Disease Gene Prioritization

    PubMed Central

    Bromberg, Yana

    2013-01-01

    Disease-causing aberrations in the normal function of a gene define that gene as a disease gene. Proving a causal link between a gene and a disease experimentally is expensive and time-consuming. Comprehensive prioritization of candidate genes prior to experimental testing drastically reduces the associated costs. Computational gene prioritization is based on various pieces of correlative evidence that associate each gene with the given disease and suggest possible causal links. A fair amount of this evidence comes from high-throughput experimentation. Thus, well-developed methods are necessary to reliably deal with the quantity of information at hand. Existing gene prioritization techniques already significantly improve the outcomes of targeted experimental studies. Faster and more reliable techniques that account for novel data types are necessary for the development of new diagnostics, treatments, and cure for many diseases. PMID:23633938

  4. The development of causal reasoning.

    PubMed

    Kuhn, Deanna

    2012-05-01

    How do inference rules for causal learning themselves change developmentally? A model of the development of causal reasoning must address this question, as well as specify the inference rules. Here, the evidence for developmental changes in processes of causal reasoning is reviewed, with the distinction made between diagnostic causal inference and causal prediction. Also addressed is the paradox of a causal reasoning literature that highlights the competencies of young children and the proneness to error among adults. WIREs Cogn Sci 2012, 3:327-335. doi: 10.1002/wcs.1160 For further resources related to this article, please visit the WIREs website. Copyright © 2012 John Wiley & Sons, Ltd.

  5. Gene-Based Genome-Wide Association Analysis in European and Asian Populations Identified Novel Genes for Rheumatoid Arthritis.

    PubMed

    Zhu, Hong; Xia, Wei; Mo, Xing-Bo; Lin, Xiang; Qiu, Ying-Hua; Yi, Neng-Jun; Zhang, Yong-Hong; Deng, Fei-Yan; Lei, Shu-Feng

    2016-01-01

    Rheumatoid arthritis (RA) is a complex autoimmune disease. Using a gene-based association research strategy, the present study aims to detect unknown susceptibility to RA and to address the ethnic differences in genetic susceptibility to RA between European and Asian populations. Gene-based association analyses were performed with KGG 2.5 by using publicly available large RA datasets (14,361 RA cases and 43,923 controls of European subjects, 4,873 RA cases and 17,642 controls of Asian Subjects). For the newly identified RA-associated genes, gene set enrichment analyses and protein-protein interactions analyses were carried out with DAVID and STRING version 10.0, respectively. Differential expression verification was conducted using 4 GEO datasets. The expression levels of three selected 'highly verified' genes were measured by ELISA among our in-house RA cases and controls. A total of 221 RA-associated genes were newly identified by gene-based association study, including 71'overlapped', 76 'European-specific' and 74 'Asian-specific' genes. Among them, 105 genes had significant differential expressions between RA patients and health controls at least in one dataset, especially for 20 genes including 11 'overlapped' (ABCF1, FLOT1, HLA-F, IER3, TUBB, ZKSCAN4, BTN3A3, HSP90AB1, CUTA, BRD2, HLA-DMA), 5 'European-specific' (PHTF1, RPS18, BAK1, TNFRSF14, SUOX) and 4 'Asian-specific' (RNASET2, HFE, BTN2A2, MAPK13) genes whose differential expressions were significant at least in three datasets. The protein expressions of two selected genes FLOT1 (P value = 1.70E-02) and HLA-DMA (P value = 4.70E-02) in plasma were significantly different in our in-house samples. Our study identified 221 novel RA-associated genes and especially highlighted the importance of 20 candidate genes on RA. The results addressed ethnic genetic background differences for RA susceptibility between European and Asian populations and detected a long list of overlapped or ethnic specific RA genes. The

  6. THE CAUSAL ANALYSIS / DIAGNOSIS DECISION ...

    EPA Pesticide Factsheets

    CADDIS is an on-line decision support system that helps investigators in the regions, states and tribes find, access, organize, use and share information to produce causal evaluations in aquatic systems. It is based on the US EPA's Stressor Identification process which is a formal method for identifying causes of impairments in aquatic systems. CADDIS 2007 increases access to relevant information useful for causal analysis and provides methods and tools that practitioners can use to analyze their own data. The new Candidate Cause section provides overviews of commonly encountered causes of impairments to aquatic systems: metals, sediments, nutrients, flow alteration, temperature, ionic strength, and low dissolved oxygen. CADDIS includes new Conceptual Models that illustrate the relationships from sources to stressors to biological effects. An Interactive Conceptual Model for phosphorus links the diagram with supporting literature citations. The new Analyzing Data section helps practitioners analyze their data sets and interpret and use those results as evidence within the USEPA causal assessment process. Downloadable tools include a graphical user interface statistical package (CADStat), and programs for use with the freeware R statistical package, and a Microsoft Excel template. These tools can be used to quantify associations between causes and biological impairments using innovative methods such as species-sensitivity distributions, biological inferenc

  7. Systematic analysis of microarray datasets to identify Parkinson's disease‑associated pathways and genes.

    PubMed

    Feng, Yinling; Wang, Xuefeng

    2017-03-01

    In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.

  8. ‘Candidatus Liberibacter asiaticus’, causal agent of citrus Huanglongbing, is reduced by treatment with Brassinosteroids

    DOE PAGES

    Canales, Eduardo; Coll, Yamilet; Hernández, Ingrid; ...

    2016-01-05

    Huanglongbing (HLB) constitutes the most destructive disease of citrus worldwide, yet no established efficient management measures exist for it. Brassinosteroids, a family of plant steroidal compounds, are essential for plant growth, development and stress tolerance. As a possible control strategy for HLB, epibrassinolide was applied to as a foliar spray to citrus plants infected with the causal agent of HLB, ‘Candidatus Liberibacter asiaticus’. The bacterial titers were reduced after treatment with epibrassinolide under both greenhouse and field conditions but were stronger in the greenhouse. Known defense genes were induced in leaves by epibrassinolide. With the SuperSAGE technology combined with nextmore » generation sequencing, induction of genes known to be associated with defense response to bacteria and hormone transduction pathways were identified. Lastly, the results demonstrate that epibrassinolide may provide a useful tool for the management of HLB.« less

  9. ‘Candidatus Liberibacter asiaticus’, causal agent of citrus Huanglongbing, is reduced by treatment with Brassinosteroids

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Canales, Eduardo; Coll, Yamilet; Hernández, Ingrid

    Huanglongbing (HLB) constitutes the most destructive disease of citrus worldwide, yet no established efficient management measures exist for it. Brassinosteroids, a family of plant steroidal compounds, are essential for plant growth, development and stress tolerance. As a possible control strategy for HLB, epibrassinolide was applied to as a foliar spray to citrus plants infected with the causal agent of HLB, ‘Candidatus Liberibacter asiaticus’. The bacterial titers were reduced after treatment with epibrassinolide under both greenhouse and field conditions but were stronger in the greenhouse. Known defense genes were induced in leaves by epibrassinolide. With the SuperSAGE technology combined with nextmore » generation sequencing, induction of genes known to be associated with defense response to bacteria and hormone transduction pathways were identified. Lastly, the results demonstrate that epibrassinolide may provide a useful tool for the management of HLB.« less

  10. Learning a theory of causality.

    PubMed

    Goodman, Noah D; Ullman, Tomer D; Tenenbaum, Joshua B

    2011-01-01

    The very early appearance of abstract knowledge is often taken as evidence for innateness. We explore the relative learning speeds of abstract and specific knowledge within a Bayesian framework and the role for innate structure. We focus on knowledge about causality, seen as a domain-general intuitive theory, and ask whether this knowledge can be learned from co-occurrence of events. We begin by phrasing the causal Bayes nets theory of causality and a range of alternatives in a logical language for relational theories. This allows us to explore simultaneous inductive learning of an abstract theory of causality and a causal model for each of several causal systems. We find that the correct theory of causality can be learned relatively quickly, often becoming available before specific causal theories have been learned--an effect we term the blessing of abstraction. We then explore the effect of providing a variety of auxiliary evidence and find that a collection of simple perceptual input analyzers can help to bootstrap abstract knowledge. Together, these results suggest that the most efficient route to causal knowledge may be to build in not an abstract notion of causality but a powerful inductive learning mechanism and a variety of perceptual supports. While these results are purely computational, they have implications for cognitive development, which we explore in the conclusion.

  11. Finding the Cause: Verbal Framing Helps Children Extract Causal Evidence Embedded in a Complex Scene

    ERIC Educational Resources Information Center

    Butler, Lucas P.; Markman, Ellen M.

    2012-01-01

    In making causal inferences, children must both identify a causal problem and selectively attend to meaningful evidence. Four experiments demonstrate that verbally framing an event ("Which animals make Lion laugh?") helps 4-year-olds extract evidence from a complex scene to make accurate causal inferences. Whereas framing was unnecessary when…

  12. Analyzing brain networks with PCA and conditional Granger causality.

    PubMed

    Zhou, Zhenyu; Chen, Yonghong; Ding, Mingzhou; Wright, Paul; Lu, Zuhong; Liu, Yijun

    2009-07-01

    Identifying directional influences in anatomical and functional circuits presents one of the greatest challenges for understanding neural computations in the brain. Granger causality mapping (GCM) derived from vector autoregressive models of data has been employed for this purpose, revealing complex temporal and spatial dynamics underlying cognitive processes. However, the traditional GCM methods are computationally expensive, as signals from thousands of voxels within selected regions of interest (ROIs) are individually processed, and being based on pairwise Granger causality, they lack the ability to distinguish direct from indirect connectivity among brain regions. In this work a new algorithm called PCA based conditional GCM is proposed to overcome these problems. The algorithm implements the following two procedures: (i) dimensionality reduction in ROIs of interest with principle component analysis (PCA), and (ii) estimation of the direct causal influences in local brain networks, using conditional Granger causality. Our results show that the proposed method achieves greater accuracy in detecting network connectivity than the commonly used pairwise Granger causality method. Furthermore, the use of PCA components in conjunction with conditional GCM greatly reduces the computational cost relative to the use of individual voxel time series. Copyright 2009 Wiley-Liss, Inc

  13. Genetic architecture for human aggression: A study of gene-phenotype relationship in OMIM.

    PubMed

    Zhang-James, Yanli; Faraone, Stephen V

    2016-07-01

    Genetic studies of human aggression have mainly focused on known candidate genes and pathways regulating serotonin and dopamine signaling and hormonal functions. These studies have taught us much about the genetics of human aggression, but no genetic locus has yet achieved genome-significance. We here present a review based on a paradoxical hypothesis that studies of rare, functional genetic variations can lead to a better understanding of the molecular mechanisms underlying complex multifactorial disorders such as aggression. We examined all aggression phenotypes catalogued in Online Mendelian Inheritance in Man (OMIM), an Online Catalog of Human Genes and Genetic Disorders. We identified 95 human disorders that have documented aggressive symptoms in at least one individual with a well-defined genetic variant. Altogether, we retrieved 86 causal genes. Although most of these genes had not been implicated in human aggression by previous studies, the most significantly enriched canonical pathways had been previously implicated in aggression (e.g., serotonin and dopamine signaling). Our findings provide strong evidence to support the causal role of these pathways in the pathogenesis of aggression. In addition, the novel genes and pathways we identified suggest additional mechanisms underlying the origins of human aggression. Genome-wide association studies with very large samples will be needed to determine if common variants in these genes are risk factors for aggression. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.

  14. Identifying Candidate Reprogramming Genes in Mouse Induced Pluripotent Stem Cells.

    PubMed

    Gao, Fang; Li, Jingyu; Zhang, Heng; Yang, Xu; An, Tiezhu

    2017-08-01

    Factor-based induced reprogramming approaches have tremendous potential for human regenerative medicine, but the efficiencies of these approaches are still low. In this study, we analyzed the global transcriptional profiles of mouse induced pluripotent stem cells (miPSCs) and mouse embryonic stem cells (mESCs) from seven different labs and present here the first successful clustering according to cell type, not by lab of origin. We identified 2131 different expression genes (DEs) as candidate pluripotency-associated genes by comparing mESCs/miPSCs with somatic cells and 720 DEs between miPSCs and mESCs. Interestingly, there was a significant overlap between the two DE sets. Therefore, we defined the overlap DEs as "consensus DEs" including 313 miPSC-specific genes expressed at a higher level in miPSCs versus mESCs and 184 mESC-specific genes in total and reasoned that these may contribute to the differences in pluripotency between mESCs and miPSCs. A classification of "consensus DEs" according to their different expression levels between somatic cells and mESCs/miPSCs shows that 86% of the miPSC-specific genes are more highly expressed in somatic cells, while 73% of mESC-specific genes are highly expressed in mESCs/miPSCs, indicating that the miPSCs have not efficiently silenced the expression pattern of the somatic cells from which they are derived and failed to completely induce the genes with high expression levels in mESCs. We further revealed a strong correlation between oocyte-enriched factors and insufficiently induced mESC-specific genes and identified 11 hub genes via network analysis. In light of these findings, we postulated that these key hub genes might not only drive somatic cell nuclear transfer (SCNT) reprogramming but also augment the efficiency and quality of miPSC reprogramming.

  15. AOP: An R Package For Sufficient Causal Analysis in Pathway ...

    EPA Pesticide Factsheets

    Summary: How can I quickly find the key events in a pathway that I need to monitor to predict that a/an beneficial/adverse event/outcome will occur? This is a key question when using signaling pathways for drug/chemical screening in pharma-cology, toxicology and risk assessment. By identifying these sufficient causal key events, we have fewer events to monitor for a pathway, thereby decreasing assay costs and time, while maximizing the value of the information. I have developed the “aop” package which uses backdoor analysis of causal net-works to identify these minimal sets of key events that are suf-ficient for making causal predictions. Availability and Implementation: The source and binary are available online through the Bioconductor project (http://www.bioconductor.org/) as an R package titled “aop”. The R/Bioconductor package runs within the R statistical envi-ronment. The package has functions that can take pathways (as directed graphs) formatted as a Cytoscape JSON file as input, or pathways can be represented as directed graphs us-ing the R/Bioconductor “graph” package. The “aop” package has functions that can perform backdoor analysis to identify the minimal set of key events for making causal predictions.Contact: burgoon.lyle@epa.gov This paper describes an R/Bioconductor package that was developed to facilitate the identification of key events within an AOP that are the minimal set of sufficient key events that need to be tested/monit

  16. Exome Sequencing Identifies Three Novel Candidate Genes Implicated in Intellectual Disability

    PubMed Central

    Azam, Maleeha; Ayub, Humaira; Vissers, Lisenka E. L. M.; Gilissen, Christian; Ali, Syeda Hafiza Benish; Riaz, Moeen; Veltman, Joris A.; Pfundt, Rolph; van Bokhoven, Hans; Qamar, Raheel

    2014-01-01

    Intellectual disability (ID) is a major health problem mostly with an unknown etiology. Recently exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. Whole exome of three ID probands was sequenced. Missense variations in two plausible novel genes implicated in autosomal recessive ID were identified: lysine (K)-specific methyltransferase 2B (KMT2B), zinc finger protein 589 (ZNF589), as well as hedgehog acyltransferase (HHAT) with a de novo mutation with autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that large number of ID genes converge on limited number of common networks i.e. ZNF589 belongs to KRAB-domain zinc-finger proteins previously implicated in ID, HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID, KMT2B associated with syndromic ID fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID. PMID:25405613

  17. Causality discovery technology

    NASA Astrophysics Data System (ADS)

    Chen, M.; Ertl, T.; Jirotka, M.; Trefethen, A.; Schmidt, A.; Coecke, B.; Bañares-Alcántara, R.

    2012-11-01

    Causality is the fabric of our dynamic world. We all make frequent attempts to reason causation relationships of everyday events (e.g., what was the cause of my headache, or what has upset Alice?). We attempt to manage causality all the time through planning and scheduling. The greatest scientific discoveries are usually about causality (e.g., Newton found the cause for an apple to fall, and Darwin discovered natural selection). Meanwhile, we continue to seek a comprehensive understanding about the causes of numerous complex phenomena, such as social divisions, economic crisis, global warming, home-grown terrorism, etc. Humans analyse and reason causality based on observation, experimentation and acquired a priori knowledge. Today's technologies enable us to make observations and carry out experiments in an unprecedented scale that has created data mountains everywhere. Whereas there are exciting opportunities to discover new causation relationships, there are also unparalleled challenges to benefit from such data mountains. In this article, we present a case for developing a new piece of ICT, called Causality Discovery Technology. We reason about the necessity, feasibility and potential impact of such a technology.

  18. Distinguishing time-delayed causal interactions using convergent cross mapping

    PubMed Central

    Ye, Hao; Deyle, Ethan R.; Gilarranz, Luis J.; Sugihara, George

    2015-01-01

    An important problem across many scientific fields is the identification of causal effects from observational data alone. Recent methods (convergent cross mapping, CCM) have made substantial progress on this problem by applying the idea of nonlinear attractor reconstruction to time series data. Here, we expand upon the technique of CCM by explicitly considering time lags. Applying this extended method to representative examples (model simulations, a laboratory predator-prey experiment, temperature and greenhouse gas reconstructions from the Vostok ice core, and long-term ecological time series collected in the Southern California Bight), we demonstrate the ability to identify different time-delayed interactions, distinguish between synchrony induced by strong unidirectional-forcing and true bidirectional causality, and resolve transitive causal chains. PMID:26435402

  19. Reasoning with Causal Cycles

    ERIC Educational Resources Information Center

    Rehder, Bob

    2017-01-01

    This article assesses how people reason with categories whose features are related in causal cycles. Whereas models based on causal graphical models (CGMs) have enjoyed success modeling category-based judgments as well as a number of other cognitive phenomena, CGMs are only able to represent causal structures that are acyclic. A number of new…

  20. Using SCOPE to identify potential regulatory motifs in coregulated genes.

    PubMed

    Martyanov, Viktor; Gross, Robert H

    2011-05-31

    SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data. In this article, we utilize a web version of SCOPE to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs and has been used in other studies. The three algorithms that comprise SCOPE are BEAM, which finds non-degenerate motifs (ACCGGT), PRISM, which finds degenerate motifs (ASCGWT), and SPACER, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well. Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor. Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run. Scope has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from

  1. [Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

    PubMed

    Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

    2012-07-01

    In order to screen out important genes from large gene data of gene microarray after nerve injury, we combine gene ontology (GO) method and computer pattern recognition technology to find key genes responding to nerve injury, and then verify one of these screened-out genes. Data mining and gene ontology analysis of gene chip data GSE26350 was carried out through MATLAB software. Cd44 was selected from screened-out key gene molecular spectrum by comparing genes' different GO terms and positions on score map of principal component. Function interferences were employed to influence the normal binding of Cd44 and one of its ligands, chondroitin sulfate C (CSC), to observe neurite extension. Gene ontology analysis showed that the first genes on score map (marked by red *) mainly distributed in molecular transducer activity, receptor activity, protein binding et al molecular function GO terms. Cd44 is one of six effector protein genes, and attracted us with its function diversity. After adding different reagents into the medium to interfere the normal binding of CSC and Cd44, varying-degree remissions of CSC's inhibition on neurite extension were observed. CSC can inhibit neurite extension through binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.

  2. Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development

    PubMed Central

    Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.

    2016-01-01

    Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499

  3. Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development.

    PubMed

    Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G

    2016-04-05

    Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.

  4. Implementation and reporting of causal mediation analysis in 2015: a systematic review in epidemiological studies.

    PubMed

    Liu, Shao-Hsien; Ulbricht, Christine M; Chrysanthopoulou, Stavroula A; Lapane, Kate L

    2016-07-20

    Causal mediation analysis is often used to understand the impact of variables along the causal pathway of an occurrence relation. How well studies apply and report the elements of causal mediation analysis remains unknown. We systematically reviewed epidemiological studies published in 2015 that employed causal mediation analysis to estimate direct and indirect effects of observed associations between an exposure on an outcome. We identified potential epidemiological studies through conducting a citation search within Web of Science and a keyword search within PubMed. Two reviewers independently screened studies for eligibility. For eligible studies, one reviewer performed data extraction, and a senior epidemiologist confirmed the extracted information. Empirical application and methodological details of the technique were extracted and summarized. Thirteen studies were eligible for data extraction. While the majority of studies reported and identified the effects of measures, most studies lacked sufficient details on the extent to which identifiability assumptions were satisfied. Although most studies addressed issues of unmeasured confounders either from empirical approaches or sensitivity analyses, the majority did not examine the potential bias arising from the measurement error of the mediator. Some studies allowed for exposure-mediator interaction and only a few presented results from models both with and without interactions. Power calculations were scarce. Reporting of causal mediation analysis is varied and suboptimal. Given that the application of causal mediation analysis will likely continue to increase, developing standards of reporting of causal mediation analysis in epidemiological research would be prudent.

  5. Novel Myopia Genes and Pathways Identified From Syndromic Forms of Myopia

    PubMed Central

    Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.

    2018-01-01

    Purpose To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia and the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1

  6. Causal-explanatory pluralism: How intentions, functions, and mechanisms influence causal ascriptions.

    PubMed

    Lombrozo, Tania

    2010-12-01

    Both philosophers and psychologists have argued for the existence of distinct kinds of explanations, including teleological explanations that cite functions or goals, and mechanistic explanations that cite causal mechanisms. Theories of causation, in contrast, have generally been unitary, with dominant theories focusing either on counterfactual dependence or on physical connections. This paper argues that both approaches to causation are psychologically real, with different modes of explanation promoting judgments more or less consistent with each approach. Two sets of experiments isolate the contributions of counterfactual dependence and physical connections in causal ascriptions involving events with people, artifacts, or biological traits, and manipulate whether the events are construed teleologically or mechanistically. The findings suggest that when events are construed teleologically, causal ascriptions are sensitive to counterfactual dependence and relatively insensitive to the presence of physical connections, but when events are construed mechanistically, causal ascriptions are sensitive to both counterfactual dependence and physical connections. The conclusion introduces an account of causation, an "exportable dependence theory," that provides a way to understand the contributions of physical connections and teleology in terms of the functions of causal ascriptions. Copyright © 2010 Elsevier Inc. All rights reserved.

  7. Expression profiling identifies novel Hh/Gli regulated genes in developing zebrafish embryos.

    PubMed Central

    Bergeron, Sadie A.; Milla, Luis A.; Villegas, Rosario; Shen, Meng-Chieh; Burgess, Shawn M.; Allende, Miguel L.; Karlstrom, Rolf O.; Palma, Verónica

    2008-01-01

    The Hedgehog (Hh) signaling pathway plays critical instructional roles during embryonic development. Mis-regulation of Hh/Gli signaling is a major causative factor in human congenital disorders and in a variety of cancers. The zebrafish is a powerful genetic model for the study of Hh signaling during embryogenesis, as a large number of mutants have been identified affecting different components of the Hh/Gli signaling system. By performing global profiling of gene expression in different Hh/Gli gain- and loss-of-function scenarios we identified several known (e.g. ptc1 and nkx2.2a) as well as a large number of novel Hh regulated genes that are differentially expressed in embryos with altered Hh/Gli signaling function. By uncovering changes in tissue specific gene expression, we revealed new embryological processes that are influenced by Hh signaling. We thus provide a comprehensive survey of Hh/Gli regulated genes during embryogenesis and we identify new Hh-regulated genes that may be targets of mis-regulation during tumorogenesis. PMID:18055165

  8. Identifying Stress Transcription Factors Using Gene Expression and TF-Gene Association Data

    PubMed Central

    Wu, Wei-Sheng; Chen, Bor-Sen

    2007-01-01

    Unicellular organisms such as yeasts have evolved to survive environmental stresses by rapidly reorganizing the genomic expression program to meet the challenges of harsh environments. The complex adaptation mechanisms to stress remain to be elucidated. In this study, we developed Stress Transcription Factor Identification Algorithm (STFIA), which integrates gene expression and TF-gene association data to identify the stress transcription factors (TFs) of six kinds of stresses. We identified some general stress TFs that are in response to various stresses, and some specific stress TFs that are in response to one specific stress. The biological significance of our findings is validated by the literature. We found that a small number of TFs may be sufficient to control a wide variety of expression patterns in yeast under different stresses. Two implications can be inferred from this observation. First, the adaptation mechanisms to different stresses may have a bow-tie structure. Second, there may exist extensive regulatory cross-talk among different stress responses. In conclusion, this study proposes a network of the regulators of stress responses and their mechanism of action. PMID:20066130

  9. Structure and Strength in Causal Induction

    ERIC Educational Resources Information Center

    Griffiths, Thomas L.; Tenenbaum, Joshua B.

    2005-01-01

    We present a framework for the rational analysis of elemental causal induction--learning about the existence of a relationship between a single cause and effect--based upon causal graphical models. This framework makes precise the distinction between causal structure and causal strength: the difference between asking whether a causal relationship…

  10. Genomic analyses identify hundreds of variants associated with age at menarche and support a role for puberty timing in cancer risk.

    PubMed

    Day, Felix R; Thompson, Deborah J; Helgason, Hannes; Chasman, Daniel I; Finucane, Hilary; Sulem, Patrick; Ruth, Katherine S; Whalen, Sean; Sarkar, Abhishek K; Albrecht, Eva; Altmaier, Elisabeth; Amini, Marzyeh; Barbieri, Caterina M; Boutin, Thibaud; Campbell, Archie; Demerath, Ellen; Giri, Ayush; He, Chunyan; Hottenga, Jouke J; Karlsson, Robert; Kolcic, Ivana; Loh, Po-Ru; Lunetta, Kathryn L; Mangino, Massimo; Marco, Brumat; McMahon, George; Medland, Sarah E; Nolte, Ilja M; Noordam, Raymond; Nutile, Teresa; Paternoster, Lavinia; Perjakova, Natalia; Porcu, Eleonora; Rose, Lynda M; Schraut, Katharina E; Segrè, Ayellet V; Smith, Albert V; Stolk, Lisette; Teumer, Alexander; Andrulis, Irene L; Bandinelli, Stefania; Beckmann, Matthias W; Benitez, Javier; Bergmann, Sven; Bochud, Murielle; Boerwinkle, Eric; Bojesen, Stig E; Bolla, Manjeet K; Brand, Judith S; Brauch, Hiltrud; Brenner, Hermann; Broer, Linda; Brüning, Thomas; Buring, Julie E; Campbell, Harry; Catamo, Eulalia; Chanock, Stephen; Chenevix-Trench, Georgia; Corre, Tanguy; Couch, Fergus J; Cousminer, Diana L; Cox, Angela; Crisponi, Laura; Czene, Kamila; Davey Smith, George; de Geus, Eco J C N; de Mutsert, Renée; De Vivo, Immaculata; Dennis, Joe; Devilee, Peter; Dos-Santos-Silva, Isabel; Dunning, Alison M; Eriksson, Johan G; Fasching, Peter A; Fernández-Rhodes, Lindsay; Ferrucci, Luigi; Flesch-Janys, Dieter; Franke, Lude; Gabrielson, Marike; Gandin, Ilaria; Giles, Graham G; Grallert, Harald; Gudbjartsson, Daniel F; Guénel, Pascal; Hall, Per; Hallberg, Emily; Hamann, Ute; Harris, Tamara B; Hartman, Catharina A; Heiss, Gerardo; Hooning, Maartje J; Hopper, John L; Hu, Frank; Hunter, David J; Ikram, M Arfan; Im, Hae Kyung; Järvelin, Marjo-Riitta; Joshi, Peter K; Karasik, David; Kellis, Manolis; Kutalik, Zoltan; LaChance, Genevieve; Lambrechts, Diether; Langenberg, Claudia; Launer, Lenore J; Laven, Joop S E; Lenarduzzi, Stefania; Li, Jingmei; Lind, Penelope A; Lindstrom, Sara; Liu, YongMei; Luan, Jian'an; Mägi, Reedik; Mannermaa, Arto; Mbarek, Hamdi; McCarthy, Mark I; Meisinger, Christa; Meitinger, Thomas; Menni, Cristina; Metspalu, Andres; Michailidou, Kyriaki; Milani, Lili; Milne, Roger L; Montgomery, Grant W; Mulligan, Anna M; Nalls, Mike A; Navarro, Pau; Nevanlinna, Heli; Nyholt, Dale R; Oldehinkel, Albertine J; O'Mara, Tracy A; Padmanabhan, Sandosh; Palotie, Aarno; Pedersen, Nancy; Peters, Annette; Peto, Julian; Pharoah, Paul D P; Pouta, Anneli; Radice, Paolo; Rahman, Iffat; Ring, Susan M; Robino, Antonietta; Rosendaal, Frits R; Rudan, Igor; Rueedi, Rico; Ruggiero, Daniela; Sala, Cinzia F; Schmidt, Marjanka K; Scott, Robert A; Shah, Mitul; Sorice, Rossella; Southey, Melissa C; Sovio, Ulla; Stampfer, Meir; Steri, Maristella; Strauch, Konstantin; Tanaka, Toshiko; Tikkanen, Emmi; Timpson, Nicholas J; Traglia, Michela; Truong, Thérèse; Tyrer, Jonathan P; Uitterlinden, André G; Edwards, Digna R Velez; Vitart, Veronique; Völker, Uwe; Vollenweider, Peter; Wang, Qin; Widen, Elisabeth; van Dijk, Ko Willems; Willemsen, Gonneke; Winqvist, Robert; Wolffenbuttel, Bruce H R; Zhao, Jing Hua; Zoledziewska, Magdalena; Zygmunt, Marek; Alizadeh, Behrooz Z; Boomsma, Dorret I; Ciullo, Marina; Cucca, Francesco; Esko, Tõnu; Franceschini, Nora; Gieger, Christian; Gudnason, Vilmundur; Hayward, Caroline; Kraft, Peter; Lawlor, Debbie A; Magnusson, Patrik K E; Martin, Nicholas G; Mook-Kanamori, Dennis O; Nohr, Ellen A; Polasek, Ozren; Porteous, David; Price, Alkes L; Ridker, Paul M; Snieder, Harold; Spector, Tim D; Stöckl, Doris; Toniolo, Daniela; Ulivi, Sheila; Visser, Jenny A; Völzke, Henry; Wareham, Nicholas J; Wilson, James F; Spurdle, Amanda B; Thorsteindottir, Unnur; Pollard, Katherine S; Easton, Douglas F; Tung, Joyce Y; Chang-Claude, Jenny; Hinds, David; Murray, Anna; Murabito, Joanne M; Stefansson, Kari; Ong, Ken K; Perry, John R B

    2017-06-01

    The timing of puberty is a highly polygenic childhood trait that is epidemiologically associated with various adult diseases. Using 1000 Genomes Project-imputed genotype data in up to ∼370,000 women, we identify 389 independent signals (P < 5 × 10 -8 ) for age at menarche, a milestone in female pubertal development. In Icelandic data, these signals explain ∼7.4% of the population variance in age at menarche, corresponding to ∼25% of the estimated heritability. We implicate ∼250 genes via coding variation or associated expression, demonstrating significant enrichment in neural tissues. Rare variants near the imprinted genes MKRN3 and DLK1 were identified, exhibiting large effects when paternally inherited. Mendelian randomization analyses suggest causal inverse associations, independent of body mass index (BMI), between puberty timing and risks for breast and endometrial cancers in women and prostate cancer in men. In aggregate, our findings highlight the complexity of the genetic regulation of puberty timing and support causal links with cancer susceptibility.

  11. Genomic analyses identify hundreds of variants associated with age at menarche and support a role for puberty timing in cancer risk

    PubMed Central

    Day, Felix R; Thompson, Deborah J; Helgason, Hannes; Chasman, Daniel I; Finucane, Hilary; Sulem, Patrick; Ruth, Katherine S; Whalen, Sean; Sarkar, Abhishek K; Albrecht, Eva; Altmaier, Elisabeth; Amini, Marzyeh; Barbieri, Caterina M; Boutin, Thibaud; Campbell, Archie; Demerath, Ellen; Giri, Ayush; He, Chunyan; Hottenga, Jouke J; Karlsson, Robert; Kolcic, Ivana; Loh, Po-Ru; Lunetta, Kathryn L; Mangino, Massimo; Marco, Brumat; McMahon, George; Medland, Sarah E; Nolte, Ilja M; Noordam, Raymond; Nutile, Teresa; Paternoster, Lavinia; Perjakova, Natalia; Porcu, Eleonora; Rose, Lynda M; Schraut, Katharina E; Segrè, Ayellet V; Smith, Albert V; Stolk, Lisette; Teumer, Alexander; Andrulis, Irene L; Bandinelli, Stefania; Beckmann, Matthias W; Benitez, Javier; Bergmann, Sven; Bochud, Murielle; Boerwinkle, Eric; Bojesen, Stig E; Bolla, Manjeet K; Brand, Judith S; Brauch, Hiltrud; Brenner, Hermann; Broer, Linda; Brüning, Thomas; Buring, Julie E; Campbell, Harry; Catamo, Eulalia; Chanock, Stephen; Chenevix-Trench, Georgia; Corre, Tanguy; Couch, Fergus J; Cousminer, Diana L; Cox, Angela; Crisponi, Laura; Czene, Kamila; Smith, George Davey; de Geus, Eco JCN; de Mutsert, Renée; De Vivo, Immaculata; Dennis, Joe; Devilee, Peter; dos-Santos-Silva, Isabel; Dunning, Alison M; Eriksson, Johan G; Fasching, Peter A; Fernández-Rhodes, Lindsay; Ferrucci, Luigi; Flesch-Janys, Dieter; Franke, Lude; Gabrielson, Marike; Gandin, Ilaria; Giles, Graham G; Grallert, Harald; Gudbjartsson, Daniel F; Guénel, Pascal; Hall, Per; Hallberg, Emily; Hamann, Ute; Harris, Tamara B; Hartman, Catharina A; Heiss, Gerardo; Hooning, Maartje J; Hopper, John L; Hu, Frank; Hunter, David J; Ikram, M Arfan; Im, Hae Kyung; Järvelin, Marjo-Riitta; Joshi, Peter K; Karasik, David; Kellis, Manolis; Kutalik, Zoltan; LaChance, Genevieve; Lambrechts, Diether; Langenberg, Claudia; Launer, Lenore J; Laven, Joop S E; Lenarduzzi, Stefania; Li, Jingmei; Lind, Penelope A; Lindstrom, Sara; Liu, YongMei; Luan, Jian’an; Mägi, Reedik; Mannermaa, Arto; Mbarek, Hamdi; McCarthy, Mark I; Meisinger, Christa; Meitinger, Thomas; Menni, Cristina; Metspalu, Andres; Michailidou, Kyriaki; Milani, Lili; Milne, Roger L; Montgomery, Grant W; Mulligan, Anna M; Nalls, Mike A; Navarro, Pau; Nevanlinna, Heli; Nyholt, Dale R; Oldehinkel, Albertine J; O’Mara, Tracy A; Padmanabhan, Sandosh; Palotie, Aarno; Pedersen, Nancy; Peters, Annette; Peto, Julian; Pharoah, Paul D P; Pouta, Anneli; Radice, Paolo; Rahman, Iffat; Ring, Susan M; Robino, Antonietta; Rosendaal, Frits R; Rudan, Igor; Rueedi, Rico; Ruggiero, Daniela; Sala, Cinzia F; Schmidt, Marjanka K; Scott, Robert A; Shah, Mitul; Sorice, Rossella; Southey, Melissa C; Sovio, Ulla; Stampfer, Meir; Steri, Maristella; Strauch, Konstantin; Tanaka, Toshiko; Tikkanen, Emmi; Timpson, Nicholas J; Traglia, Michela; Truong, Thérèse; Tyrer, Jonathan P; Uitterlinden, André G; Velez Edwards, Digna R; Vitart, Veronique; Völker, Uwe; Vollenweider, Peter; Wang, Qin; Widen, Elisabeth; van Dijk, Ko Willems; Willemsen, Gonneke; Winqvist, Robert; Wolffenbuttel, Bruce H R; Zhao, Jing Hua; Zoledziewska, Magdalena; Zygmunt, Marek; Alizadeh, Behrooz Z; Boomsma, Dorret I; Ciullo, Marina; Cucca, Francesco; Esko, Tõnu; Franceschini, Nora; Gieger, Christian; Gudnason, Vilmundur; Hayward, Caroline; Kraft, Peter; Lawlor, Debbie A; Magnusson, Patrik K E; Martin, Nicholas G; Mook-Kanamori, Dennis O; Nohr, Ellen A; Polasek, Ozren; Porteous, David; Price, Alkes L; Ridker, Paul M; Snieder, Harold; Spector, Tim D; Stöckl, Doris; Toniolo, Daniela; Ulivi, Sheila; Visser, Jenny A; Völzke, Henry; Wareham, Nicholas J; Wilson, James F; Spurdle, Amanda B; Thorsteindottir, Unnur; Pollard, Katherine S; Easton, Douglas F; Tung, Joyce Y; Chang-Claude, Jenny; Hinds, David; Murray, Anna; Murabito, Joanne M; Stefansson, Kari; Ong, Ken K; Perry, John R B

    2018-01-01

    The timing of puberty is a highly polygenic childhood trait that is epidemiologically associated with various adult diseases. Using 1000 Genomes Project–imputed genotype data in up to ~370,000 women, we identify 389 independent signals (P < 5 × 10−8) for age at menarche, a milestone in female pubertal development. In Icelandic data, these signals explain ~7.4% of the population variance in age at menarche, corresponding to ~25% of the estimated heritability. We implicate ~250 genes via coding variation or associated expression, demonstrating significant enrichment in neural tissues. Rare variants near the imprinted genes MKRN3 and DLK1 were identified, exhibiting large effects when paternally inherited. Mendelian randomization analyses suggest causal inverse associations, independent of body mass index (BMI), between puberty timing and risks for breast and endometrial cancers in women and prostate cancer in men. In aggregate, our findings highlight the complexity of the genetic regulation of puberty timing and support causal links with cancer susceptibility. PMID:28436984

  12. GeneCOST: a novel scoring-based prioritization framework for identifying disease causing genes.

    PubMed

    Ozer, Bugra; Sağıroğlu, Mahmut; Demirci, Hüseyin

    2015-11-15

    Due to the big data produced by next-generation sequencing studies, there is an evident need for methods to extract the valuable information gathered from these experiments. In this work, we propose GeneCOST, a novel scoring-based method to evaluate every gene for their disease association. Without any prior filtering and any prior knowledge, we assign a disease likelihood score to each gene in correspondence with their variations. Then, we rank all genes based on frequency, conservation, pedigree and detailed variation information to find out the causative reason of the disease state. We demonstrate the usage of GeneCOST with public and real life Mendelian disease cases including recessive, dominant, compound heterozygous and sporadic models. As a result, we were able to identify causative reason behind the disease state in top rankings of our list, proving that this novel prioritization framework provides a powerful environment for the analysis in genetic disease studies alternative to filtering-based approaches. GeneCOST software is freely available at www.igbam.bilgem.tubitak.gov.tr/en/softwares/genecost-en/index.html. buozer@gmail.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  13. Kindness in the blood: A randomized controlled trial of the gene regulatory impact of prosocial behavior.

    PubMed

    Nelson-Coffey, S Katherine; Fritz, Megan M; Lyubomirsky, Sonja; Cole, Steve W

    2017-07-01

    Prosocial behavior is linked to longevity, but few studies have experimentally manipulated prosocial behavior to identify the causal mechanisms underlying this association. One possible mediating pathway involves changes in gene expression that may subsequently influence disease development or resistance. In the current study, we examined changes in a leukocyte gene expression profile known as the Conserved Transcriptional Response to Adversity (CTRA) in 159 adults who were randomly assigned for 4 weeks to engage in prosocial behavior directed towards specific others, prosocial behavior directed towards the world in general, self-focused kindness, or a neutral control task. Those randomized to prosocial behavior towards specific others demonstrated improvements (i.e., reductions) in leukocyte expression of CTRA indicator genes. No significant changes in CTRA gene expression were observed in the other 3 conditions. These findings suggest that prosocial behavior can causally impact leukocyte gene expression profiles in ways that might potentially help explain the previously observed health advantages associated with social ties. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. Use of RNA-seq to identify cardiac genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy

    PubMed Central

    Friedenberg, Steven G.; Chdid, Lhoucine; Keene, Bruce; Sherry, Barbara; Motsinger-Reif, Alison; Meurs, Kathryn M.

    2017-01-01

    OBJECTIVE To identify cardiac tissue genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy (DCM). ANIMALS 8 dogs with and 5 dogs without DCM. PROCEDURES Following euthanasia, samples of left ventricular myocardium were collected from each dog. Total RNA was extracted from tissue samples, and RNA sequencing was performed on each sample. Samples from dogs with and without DCM were grouped to identify genes that were differentially regulated between the 2 populations. Overrepresentation analysis was performed on upregulated and downregulated gene sets to identify altered molecular pathways in dogs with DCM. RESULTS Genes involved in cellular energy metabolism, especially metabolism of carbohydrates and fats, were significantly downregulated in dogs with DCM. Expression of cardiac structural proteins was also altered in affected dogs. CONCLUSIONS AND CLINICAL RELEVANCE Results suggested that RNA sequencing may provide important insights into the pathogenesis of DCM in dogs and highlight pathways that should be explored to identify causative mutations and develop novel therapeutic interventions. PMID:27347821

  15. Use of RNA-seq to identify cardiac genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy.

    PubMed

    Friedenberg, Steven G; Chdid, Lhoucine; Keene, Bruce; Sherry, Barbara; Motsinger-Reif, Alison; Meurs, Kathryn M

    2016-07-01

    OBJECTIVE To identify cardiac tissue genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy (DCM). ANIMALS 8 dogs with and 5 dogs without DCM. PROCEDURES Following euthanasia, samples of left ventricular myocardium were collected from each dog. Total RNA was extracted from tissue samples, and RNA sequencing was performed on each sample. Samples from dogs with and without DCM were grouped to identify genes that were differentially regulated between the 2 populations. Overrepresentation analysis was performed on upregulated and downregulated gene sets to identify altered molecular pathways in dogs with DCM. RESULTS Genes involved in cellular energy metabolism, especially metabolism of carbohydrates and fats, were significantly downregulated in dogs with DCM. Expression of cardiac structural proteins was also altered in affected dogs. CONCLUSIONS AND CLINICAL RELEVANCE Results suggested that RNA sequencing may provide important insights into the pathogenesis of DCM in dogs and highlight pathways that should be explored to identify causative mutations and develop novel therapeutic interventions.

  16. Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

    PubMed

    Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

    2012-02-01

    The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.

  17. Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes

    PubMed Central

    Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

    2012-01-01

    The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404

  18. A general method for identifying major hybrid male sterility genes in Drosophila.

    PubMed

    Zeng, L W; Singh, R S

    1995-10-01

    The genes responsible for hybrid male sterility in species crosses are usually identified by introgressing chromosome segments, monitored by visible markers, between closely related species by continuous backcrosses. This commonly used method, however, suffers from two problems. First, it relies on the availability of markers to monitor the introgressed regions and so the portion of the genome examined is limited to the marked regions. Secondly, the introgressed regions are usually large and it is impossible to tell if the effects of the introgressed regions are the result of single (or few) major genes or many minor genes (polygenes). Here we introduce a simple and general method for identifying putative major hybrid male sterility genes which is free of these problems. In this method, the actual hybrid male sterility genes (rather than markers), or tightly linked gene complexes with large effects, are selectively introgressed from one species into the background of another species by repeated backcrosses. This is performed by selectively backcrossing heterozygous (for hybrid male sterility gene or genes) females producing fertile and sterile sons in roughly equal proportions to males of either parental species. As no marker gene is required for this procedure, this method can be used with any species pairs that produce unisexual sterility. With the application of this method, a small X chromosome region of Drosophila mauritiana which produces complete hybrid male sterility (aspermic testes) in the background of D. simulans was identified. Recombination analysis reveals that this region contains a second major hybrid male sterility gene linked to the forked locus located at either 62.7 +/- 0.66 map units or at the centromere region of the X chromosome of D. mauritiana.

  19. Epidermal growth factor gene is a newly identified candidate gene for gout.

    PubMed

    Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

    2016-08-10

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male patients gout and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67-0.88, Padjusted = 6.42 × 10(-3)). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations.

  20. Causal mechanisms and balancing selection inferred from genetic associations with polycystic ovary syndrome

    PubMed Central

    Day, Felix R.; Hinds, David A.; Tung, Joyce Y.; Stolk, Lisette; Styrkarsdottir, Unnur; Saxena, Richa; Bjonnes, Andrew; Broer, Linda; Dunger, David B.; Halldorsson, Bjarni V.; Lawlor, Debbie A.; Laval, Guillaume; Mathieson, Iain; McCardle, Wendy L.; Louwers, Yvonne; Meun, Cindy; Ring, Susan; Scott, Robert A.; Sulem, Patrick; Uitterlinden, André G.; Wareham, Nicholas J.; Thorsteinsdottir, Unnur; Welt, Corrine; Stefansson, Kari; Laven, Joop S. E.; Ong, Ken K.; Perry, John R. B.

    2015-01-01

    Polycystic ovary syndrome (PCOS) is the most common reproductive disorder in women, yet there is little consensus regarding its aetiology. Here we perform a genome-wide association study of PCOS in up to 5,184 self-reported cases of White European ancestry and 82,759 controls, with follow-up in a further ∼2,000 clinically validated cases and ∼100,000 controls. We identify six signals for PCOS at genome-wide statistical significance (P<5 × 10−8), in/near genes ERBB4/HER4, YAP1, THADA, FSHB, RAD50 and KRR1. Variants in/near three of the four epidermal growth factor receptor genes (ERBB2/HER2, ERBB3/HER3 and ERBB4/HER4) are associated with PCOS at or near genome-wide significance. Mendelian randomization analyses indicate causal roles in PCOS aetiology for higher BMI (P=2.5 × 10−9), higher insulin resistance (P=6 × 10−4) and lower serum sex hormone binding globulin concentrations (P=5 × 10−4). Furthermore, genetic susceptibility to later menopause is associated with higher PCOS risk (P=1.6 × 10−8) and PCOS-susceptibility alleles are associated with higher serum anti-Müllerian hormone concentrations in girls (P=8.9 × 10−5). This large-scale study implicates an aetiological role of the epidermal growth factor receptors, infers causal mechanisms relevant to clinical management and prevention, and suggests balancing selection mechanisms involved in PCOS risk. PMID:26416764

  1. Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility.

    PubMed

    Bruse, Shannon; Moreau, Michael; Bromberg, Yana; Jang, Jun-Ho; Wang, Nan; Ha, Hongseok; Picchi, Maria; Lin, Yong; Langley, Raymond J; Qualls, Clifford; Klensney-Tait, Julia; Zabner, Joseph; Leng, Shuguang; Mao, Jenny; Belinsky, Steven A; Xing, Jinchuan; Nyunoya, Toru

    2016-01-07

    Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15-20 % smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.

  2. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    PubMed

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

  3. Genes Important for Schizosaccharomyces pombe Meiosis Identified Through a Functional Genomics Screen

    PubMed Central

    Blyth, Julie; Makrantoni, Vasso; Barton, Rachael E.; Spanos, Christos; Rappsilber, Juri; Marston, Adele L.

    2018-01-01

    Meiosis is a specialized cell division that generates gametes, such as eggs and sperm. Errors in meiosis result in miscarriages and are the leading cause of birth defects; however, the molecular origins of these defects remain unknown. Studies in model organisms are beginning to identify the genes and pathways important for meiosis, but the parts list is still poorly defined. Here we present a comprehensive catalog of genes important for meiosis in the fission yeast, Schizosaccharomyces pombe. Our genome-wide functional screen surveyed all nonessential genes for roles in chromosome segregation and spore formation. Novel genes important at distinct stages of the meiotic chromosome segregation and differentiation program were identified. Preliminary characterization implicated three of these genes in centrosome/spindle pole body, centromere, and cohesion function. Our findings represent a near-complete parts list of genes important for meiosis in fission yeast, providing a valuable resource to advance our molecular understanding of meiosis. PMID:29259000

  4. A Multi-Ethnic Meta-Analysis of Genome-Wide Association Studies in Over 100,000 Subjects Identifies 23 Fibrinogen-Associated Loci but no Strong Evidence of a Causal Association between Circulating Fibrinogen and Cardiovascular Disease

    PubMed Central

    Sabater-Lleal, Maria; Huang, Jie; Chasman, Daniel; Naitza, Silvia; Dehghan, Abbas; Johnson, Andrew D; Teumer, Alexander; Reiner, Alex P; Folkersen, Lasse; Basu, Saonli; Rudnicka, Alicja R; Trompet, Stella; Mälarstig, Anders; Baumert, Jens; Bis, Joshua C.; Guo, Xiuqing; Hottenga, Jouke J; Shin, So-Youn; Lopez, Lorna M; Lahti, Jari; Tanaka, Toshiko; Yanek, Lisa R; Oudot-Mellakh, Tiphaine; Wilson, James F; Navarro, Pau; Huffman, Jennifer E; Zemunik, Tatijana; Redline, Susan; Mehra, Reena; Pulanic, Drazen; Rudan, Igor; Wright, Alan F; Kolcic, Ivana; Polasek, Ozren; Wild, Sarah H; Campbell, Harry; Curb, J David; Wallace, Robert; Liu, Simin; Eaton, Charles B.; Becker, Diane M.; Becker, Lewis C.; Bandinelli, Stefania; Räikkönen, Katri; Widen, Elisabeth; Palotie, Aarno; Fornage, Myriam; Green, David; Gross, Myron; Davies, Gail; Harris, Sarah E; Liewald, David C; Starr, John M; Williams, Frances M.K.; Grant, P.J.; Spector, Timothy D.; Strawbridge, Rona J; Silveira, Angela; Sennblad, Bengt; Rivadeneira, Fernando; Uitterlinden, Andre G; Franco, Oscar H; Hofman, Albert; van Dongen, Jenny; Willemsen, G; Boomsma, Dorret I; Yao, Jie; Jenny, Nancy Swords; Haritunians, Talin; McKnight, Barbara; Lumley, Thomas; Taylor, Kent D; Rotter, Jerome I; Psaty, Bruce M; Peters, Annette; Gieger, Christian; Illig, Thomas; Grotevendt, Anne; Homuth, Georg; Völzke, Henry; Kocher, Thomas; Goel, Anuj; Franzosi, Maria Grazia; Seedorf, Udo; Clarke, Robert; Steri, Maristella; Tarasov, Kirill V; Sanna, Serena; Schlessinger, David; Stott, David J; Sattar, Naveed; Buckley, Brendan M; Rumley, Ann; Lowe, Gordon D; McArdle, Wendy L; Chen, Ming-Huei; Tofler, Geoffrey H; Song, Jaejoon; Boerwinkle, Eric; Folsom, Aaron R.; Rose, Lynda M.; Franco-Cereceda, Anders; Teichert, Martina; Ikram, M Arfan; Mosley, Thomas H; Bevan, Steve; Dichgans, Martin; Rothwell, Peter M.; Sudlow, Cathie L M; Hopewell, Jemma C.; Chambers, John C.; Saleheen, Danish; Kooner, Jaspal S.; Danesh, John; Nelson, Christopher P; Erdmann, Jeanette; Reilly, Muredach P.; Kathiresan, Sekar; Schunkert, Heribert; Morange, Pierre-Emmanuel; Ferrucci, Luigi; Eriksson, Johan G; Jacobs, David; Deary, Ian J; Soranzo, Nicole; Witteman, Jacqueline CM; de Geus, Eco JC; Tracy, Russell P.; Hayward, Caroline; Koenig, Wolfgang; Cucca, Francesco; Jukema, J Wouter; Eriksson, Per; Seshadri, Sudha; Markus, Hugh S.; Watkins, Hugh; Samani, Nilesh J; Wallaschofski, Henri; Smith, Nicholas L.; Tregouet, David; Ridker, Paul M.; Tang, Weihong; Strachan, David P.; Hamsten, Anders; O’Donnell, Christopher J.

    2013-01-01

    Background Estimates of the heritability of plasma fibrinogen concentration, an established predictor of cardiovascular disease (CVD), range from 34 to 50%. Genetic variants so far identified by genome-wide association (GWA) studies only explain a small proportion (< 2%) of its variation. Methods and Results We conducted a meta-analysis of 28 GWA studies, including more than 90,000 subjects of European ancestry, the first GWA meta-analysis of fibrinogen levels in 7 African Americans studies totaling 8,289 samples, and a GWA study in Hispanic-Americans totaling 1,366 samples. Evaluation for association of SNPs with clinical outcomes included a total of 40,695 cases and 85,582 controls for coronary artery disease (CAD), 4,752 cases and 24,030 controls for stroke, and 3,208 cases and 46,167 controls for venous thromboembolism (VTE). Overall, we identified 24 genome-wide significant (P<5×10−8) independent signals in 23 loci, including 15 novel associations, together accounting for 3.7% of plasma fibrinogen variation. Gene-set enrichment analysis highlighted key roles in fibrinogen regulation for the three structural fibrinogen genes and pathways related to inflammation, adipocytokines and thyrotrophin-releasing hormone signaling. Whereas lead SNPs in a few loci were significantly associated with CAD, the combined effect of all 24 fibrinogen-associated lead SNPs was not significant for CAD, stroke or VTE. Conclusion We identify 23 robustly associated fibrinogen loci, 15 of which are new. Clinical outcome analysis of these loci does not support a causal relationship between circulating levels of fibrinogen and CAD, stroke or VTE. PMID:23969696

  5. Microarray and differential display identify genes involved in jasmonate-dependent anther development.

    PubMed

    Mandaokar, Ajin; Kumar, V Dinesh; Amway, Matt; Browse, John

    2003-07-01

    Jasmonate (JA) is a signaling compound essential for anther development and pollen fertility in Arabidopsis. Mutations that block the pathway of JA synthesis result into male sterility. To understand the processes of anther and pollen maturation, we used microarray and differential display approaches to compare gene expression pattern in anthers of wild-type Arabidopsis and the male-sterile mutant, opr3. Microarray experiment revealed 25 genes that were up-regulated more than 1.8-fold in wild-type anthers as compared to mutant anthers. Experiments based on differential display identified 13 additional genes up-regulated in wild-type anthers compared to opr3 for a total of 38 differentially expressed genes. Searches of the Arabidopsis and non-redundant databases disclosed known or likely functions for 28 of the 38 genes identified, while 10 genes encode proteins of unknown function. Northern blot analysis of eight representative clones as probes confirmed low expression in opr3 anthers compared with wild-type anthers. JA responsiveness of these same genes was also investigated by northern blot analysis of anther RNA isolated from wild-type and opr3 plants, In these experiments, four genes were induced in opr3 anthers within 0.5-1 h of JA treatment while the remaining genes were up-regulated only 1-8 h after JA application. None of these genes was induced by JA in anthers of the coil mutant that is deficient in JA responsiveness. The four early-induced genes in opr3 encode lipoxygenase, a putative bHLH transcription factor, epithiospecifier protein and an unknown protein. We propose that these and other early components may be involved in JA signaling and in the initiation of developmental processes. The four late genes encode an extensin-like protein, a peptide transporter and two unknown proteins, which may represent components required later in anther and pollen maturation. Transcript profiling has provided a successful approach to identify genes involved in

  6. Reasoning about Causal Relationships: Inferences on Causal Networks

    PubMed Central

    Rottman, Benjamin Margolin; Hastie, Reid

    2013-01-01

    Over the last decade, a normative framework for making causal inferences, Bayesian Probabilistic Causal Networks, has come to dominate psychological studies of inference based on causal relationships. The following causal networks—[X→Y→Z, X←Y→Z, X→Y←Z]—supply answers for questions like, “Suppose both X and Y occur, what is the probability Z occurs?” or “Suppose you intervene and make Y occur, what is the probability Z occurs?” In this review, we provide a tutorial for how normatively to calculate these inferences. Then, we systematically detail the results of behavioral studies comparing human qualitative and quantitative judgments to the normative calculations for many network structures and for several types of inferences on those networks. Overall, when the normative calculations imply that an inference should increase, judgments usually go up; when calculations imply a decrease, judgments usually go down. However, two systematic deviations appear. First, people’s inferences violate the Markov assumption. For example, when inferring Z from the structure X→Y→Z, people think that X is relevant even when Y completely mediates the relationship between X and Z. Second, even when people’s inferences are directionally consistent with the normative calculations, they are often not as sensitive to the parameters and the structure of the network as they should be. We conclude with a discussion of productive directions for future research. PMID:23544658

  7. Lentiviral vector-based insertional mutagenesis identifies genes associated with liver cancer

    PubMed Central

    Ranzani, Marco; Cesana, Daniela; Bartholomae, Cynthia C.; Sanvito, Francesca; Pala, Mauro; Benedicenti, Fabrizio; Gallina, Pierangela; Sergi, Lucia Sergi; Merella, Stefania; Bulfone, Alessandro; Doglioni, Claudio; von Kalle, Christof; Kim, Yoon Jun; Schmidt, Manfred; Tonon, Giovanni; Naldini, Luigi; Montini, Eugenio

    2013-01-01

    Transposons and γ-retroviruses have been efficiently used as insertional mutagens in different tissues to identify molecular culprits of cancer. However, these systems are characterized by recurring integrations that accumulate in tumor cells, hampering the identification of early cancer-driving events amongst bystander and progression-related events. We developed an insertional mutagenesis platform based on lentiviral vectors (LVV) by which we could efficiently induce hepatocellular carcinoma (HCC) in 3 different mouse models. By virtue of LVV’s replication-deficient nature and broad genome-wide integration pattern, LVV-based insertional mutagenesis allowed identification of 4 new liver cancer genes from a limited number of integrations. We validated the oncogenic potential of all the identified genes in vivo, with different levels of penetrance. Our newly identified cancer genes are likely to play a role in human disease, since they are upregulated and/or amplified/deleted in human HCCs and can predict clinical outcome of patients. PMID:23314173

  8. Dynamics of Quantum Causal Structures

    NASA Astrophysics Data System (ADS)

    Castro-Ruiz, Esteban; Giacomini, Flaminia; Brukner, Časlav

    2018-01-01

    It was recently suggested that causal structures are both dynamical, because of general relativity, and indefinite, because of quantum theory. The process matrix formalism furnishes a framework for quantum mechanics on indefinite causal structures, where the order between operations of local laboratories is not definite (e.g., one cannot say whether operation in laboratory A occurs before or after operation in laboratory B ). Here, we develop a framework for "dynamics of causal structures," i.e., for transformations of process matrices into process matrices. We show that, under continuous and reversible transformations, the causal order between operations is always preserved. However, the causal order between a subset of operations can be changed under continuous yet nonreversible transformations. An explicit example is that of the quantum switch, where a party in the past affects the causal order of operations of future parties, leading to a transition from a channel from A to B , via superposition of causal orders, to a channel from B to A . We generalize our framework to construct a hierarchy of quantum maps based on transformations of process matrices and transformations thereof.

  9. Causality and causal inference in epidemiology: the need for a pluralistic approach

    PubMed Central

    Vandenbroucke, Jan P; Broadbent, Alex; Pearce, Neil

    2016-01-01

    Abstract Causal inference based on a restricted version of the potential outcomes approach reasoning is assuming an increasingly prominent place in the teaching and practice of epidemiology. The proposed concepts and methods are useful for particular problems, but it would be of concern if the theory and practice of the complete field of epidemiology were to become restricted to this single approach to causal inference. Our concerns are that this theory restricts the questions that epidemiologists may ask and the study designs that they may consider. It also restricts the evidence that may be considered acceptable to assess causality, and thereby the evidence that may be considered acceptable for scientific and public health decision making. These restrictions are based on a particular conceptual framework for thinking about causality. In Section 1, we describe the characteristics of the restricted potential outcomes approach (RPOA) and show that there is a methodological movement which advocates these principles, not just for solving particular problems, but as ideals for which epidemiology as a whole should strive. In Section 2, we seek to show that the limitation of epidemiology to one particular view of the nature of causality is problematic. In Section 3, we argue that the RPOA is also problematic with regard to the assessment of causality. We argue that it threatens to restrict study design choice, to wrongly discredit the results of types of observational studies that have been very useful in the past and to damage the teaching of epidemiological reasoning. Finally, in Section 4 we set out what we regard as a more reasonable ‘working hypothesis’ as to the nature of causality and its assessment: pragmatic pluralism. PMID:26800751

  10. Redundant variables and Granger causality

    NASA Astrophysics Data System (ADS)

    Angelini, L.; de Tommaso, M.; Marinazzo, D.; Nitti, L.; Pellicoro, M.; Stramaglia, S.

    2010-03-01

    We discuss the use of multivariate Granger causality in presence of redundant variables: the application of the standard analysis, in this case, leads to under estimation of causalities. Using the un-normalized version of the causality index, we quantitatively develop the notions of redundancy and synergy in the frame of causality and propose two approaches to group redundant variables: (i) for a given target, the remaining variables are grouped so as to maximize the total causality and (ii) the whole set of variables is partitioned to maximize the sum of the causalities between subsets. We show the application to a real neurological experiment, aiming to a deeper understanding of the physiological basis of abnormal neuronal oscillations in the migraine brain. The outcome by our approach reveals the change in the informational pattern due to repetitive transcranial magnetic stimulations.

  11. Identifying HIV associated neurocognitive disorder using large-scale Granger causality analysis on resting-state functional MRI

    NASA Astrophysics Data System (ADS)

    DSouza, Adora M.; Abidin, Anas Z.; Leistritz, Lutz; Wismüller, Axel

    2017-02-01

    We investigate the applicability of large-scale Granger Causality (lsGC) for extracting a measure of multivariate information flow between pairs of regional brain activities from resting-state functional MRI (fMRI) and test the effectiveness of these measures for predicting a disease state. Such pairwise multivariate measures of interaction provide high-dimensional representations of connectivity profiles for each subject and are used in a machine learning task to distinguish between healthy controls and individuals presenting with symptoms of HIV Associated Neurocognitive Disorder (HAND). Cognitive impairment in several domains can occur as a result of HIV infection of the central nervous system. The current paradigm for assessing such impairment is through neuropsychological testing. With fMRI data analysis, we aim at non-invasively capturing differences in brain connectivity patterns between healthy subjects and subjects presenting with symptoms of HAND. To classify the extracted interaction patterns among brain regions, we use a prototype-based learning algorithm called Generalized Matrix Learning Vector Quantization (GMLVQ). Our approach to characterize connectivity using lsGC followed by GMLVQ for subsequent classification yields good prediction results with an accuracy of 87% and an area under the ROC curve (AUC) of up to 0.90. We obtain a statistically significant improvement (p<0.01) over a conventional Granger causality approach (accuracy = 0.76, AUC = 0.74). High accuracy and AUC values using our multivariate method to connectivity analysis suggests that our approach is able to better capture changes in interaction patterns between different brain regions when compared to conventional Granger causality analysis known from the literature.

  12. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature.

    PubMed

    Ye, Ning; Yin, Hengfu; Liu, Jingjing; Dai, Xiaogang; Yin, Tongming

    2015-01-01

    The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.

  13. Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments.

    PubMed

    Duan, Fenghai; Xu, Ye

    2017-01-01

    To analyze a microarray experiment to identify the genes with expressions varying after the diagnosis of breast cancer. A total of 44 928 probe sets in an Affymetrix microarray data publicly available on Gene Expression Omnibus from 249 patients with breast cancer were analyzed by the nonparametric multivariate adaptive splines. Then, the identified genes with turning points were grouped by K-means clustering, and their network relationship was subsequently analyzed by the Ingenuity Pathway Analysis. In total, 1640 probe sets (genes) were reliably identified to have turning points along with the age at diagnosis in their expression profiling, of which 927 expressed lower after turning points and 713 expressed higher after the turning points. K-means clustered them into 3 groups with turning points centering at 54, 62.5, and 72, respectively. The pathway analysis showed that the identified genes were actively involved in various cancer-related functions or networks. In this article, we applied the nonparametric multivariate adaptive splines method to a publicly available gene expression data and successfully identified genes with expressions varying before and after breast cancer diagnosis.

  14. Occupational safety management: the role of causal attribution.

    PubMed

    Gyekye, Seth Ayim

    2010-12-01

    The paper addresses the causal attribution theory, an old and well-established theme in social psychology which denotes the everyday, commonsense explanations that people use to explain events and the world around them. The attribution paradigm is considered one of the most appropriate analytical tools for exploratory and descriptive studies in social psychology and organizational literature. It affords the possibility of describing accident processes as objectively as possible and with as much detail as possible. Causal explanations are vital to the formal analysis of workplace hazards and accidents, as they determine how organizations act to prevent accident recurrence. Accordingly, they are regarded as fundamental and prerequisite elements for safety management policies. The paper focuses primarily on the role of causal attributions in occupational and industrial accident analyses and implementation of safety interventions. It thus serves as a review of the contribution of attribution theory to occupational and industrial accidents. It comprises six sections. The first section presents an introduction to the classic attribution theories, and the second an account of the various ways in which the attribution paradigm has been applied in organizational settings. The third and fourth sections review the literature on causal attributions and demographic and organizational variables respectively. The sources of attributional biases in social psychology and how they manifest and are identified in the causal explanations for industrial and occupational accidents are treated in the fifth section. Finally, conclusion and recommendations are presented. The recommendations are particularly important for the reduction of workplace accidents and associated costs. The paper touches on the need for unbiased causal analyses, belief in the preventability of accidents, and the imperative role of management in occupational safety management.

  15. TGMI: an efficient algorithm for identifying pathway regulators through evaluation of triple-gene mutual interaction

    PubMed Central

    Gunasekara, Chathura; Zhang, Kui; Deng, Wenping; Brown, Laura

    2018-01-01

    Abstract Despite their important roles, the regulators for most metabolic pathways and biological processes remain elusive. Presently, the methods for identifying metabolic pathway and biological process regulators are intensively sought after. We developed a novel algorithm called triple-gene mutual interaction (TGMI) for identifying these regulators using high-throughput gene expression data. It first calculated the regulatory interactions among triple gene blocks (two pathway genes and one transcription factor (TF)), using conditional mutual information, and then identifies significantly interacted triple genes using a newly identified novel mutual interaction measure (MIM), which was substantiated to reflect strengths of regulatory interactions within each triple gene block. The TGMI calculated the MIM for each triple gene block and then examined its statistical significance using bootstrap. Finally, the frequencies of all TFs present in all significantly interacted triple gene blocks were calculated and ranked. We showed that the TFs with higher frequencies were usually genuine pathway regulators upon evaluating multiple pathways in plants, animals and yeast. Comparison of TGMI with several other algorithms demonstrated its higher accuracy. Therefore, TGMI will be a valuable tool that can help biologists to identify regulators of metabolic pathways and biological processes from the exploded high-throughput gene expression data in public repositories. PMID:29579312

  16. Causality analysis in business performance measurement system using system dynamics methodology

    NASA Astrophysics Data System (ADS)

    Yusof, Zainuridah; Yusoff, Wan Fadzilah Wan; Maarof, Faridah

    2014-07-01

    One of the main components of the Balanced Scorecard (BSC) that differentiates it from any other performance measurement system (PMS) is the Strategy Map with its unidirectional causality feature. Despite its apparent popularity, criticisms on the causality have been rigorously discussed by earlier researchers. In seeking empirical evidence of causality, propositions based on the service profit chain theory were developed and tested using the econometrics analysis, Granger causality test on the 45 data points. However, the insufficiency of well-established causality models was found as only 40% of the causal linkages were supported by the data. Expert knowledge was suggested to be used in the situations of insufficiency of historical data. The Delphi method was selected and conducted in obtaining the consensus of the causality existence among the 15 selected expert persons by utilizing 3 rounds of questionnaires. Study revealed that only 20% of the propositions were not supported. The existences of bidirectional causality which demonstrate significant dynamic environmental complexity through interaction among measures were obtained from both methods. With that, a computer modeling and simulation using System Dynamics (SD) methodology was develop as an experimental platform to identify how policies impacting the business performance in such environments. The reproduction, sensitivity and extreme condition tests were conducted onto developed SD model to ensure their capability in mimic the reality, robustness and validity for causality analysis platform. This study applied a theoretical service management model within the BSC domain to a practical situation using SD methodology where very limited work has been done.

  17. Examination of tetrahydrobiopterin pathway genes in autism.

    PubMed

    Schnetz-Boutaud, N C; Anderson, B M; Brown, K D; Wright, H H; Abramson, R K; Cuccaro, M L; Gilbert, J R; Pericak-Vance, M A; Haines, J L

    2009-11-01

    Autism is a complex disorder with a high degree of heritability and significant phenotypic and genotypic heterogeneity. Although candidate gene studies and genome-wide screens have failed to identify major causal loci associated with autism, numerous studies have proposed association with several variations in genes in the dopaminergic and serotonergic pathways. Because tetrahydrobiopterin (BH4) is the essential cofactor in the synthesis of these two neurotransmitters, we genotyped 25 SNPs in nine genes of the BH4 pathway in a total of 403 families. Significant nominal association was detected in the gene for 6-pyruvoyl-tetrahydropterin synthase, PTS (chromosome 11), with P = 0.009; this result was not restricted to an affected male-only subset. Multilocus interaction was detected in the BH4 pathway alone, but not across the serotonin, dopamine and BH4 pathways.

  18. Epidermal growth factor gene is a newly identified candidate gene for gout

    PubMed Central

    Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

    2016-01-01

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male patients gout and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67–0.88, Padjusted = 6.42 × 10−3). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations. PMID:27506295

  19. Experimental test of nonlocal causality.

    PubMed

    Ringbauer, Martin; Giarmatzi, Christina; Chaves, Rafael; Costa, Fabio; White, Andrew G; Fedrizzi, Alessandro

    2016-08-01

    Explaining observations in terms of causes and effects is central to empirical science. However, correlations between entangled quantum particles seem to defy such an explanation. This implies that some of the fundamental assumptions of causal explanations have to give way. We consider a relaxation of one of these assumptions, Bell's local causality, by allowing outcome dependence: a direct causal influence between the outcomes of measurements of remote parties. We use interventional data from a photonic experiment to bound the strength of this causal influence in a two-party Bell scenario, and observational data from a Bell-type inequality test for the considered models. Our results demonstrate the incompatibility of quantum mechanics with a broad class of nonlocal causal models, which includes Bell-local models as a special case. Recovering a classical causal picture of quantum correlations thus requires an even more radical modification of our classical notion of cause and effect.

  20. Identifying the genes of unconventional high temperature superconductors.

    PubMed

    Hu, Jiangping

    We elucidate a recently emergent framework in unifying the two families of high temperature (high [Formula: see text]) superconductors, cuprates and iron-based superconductors. The unification suggests that the latter is simply the counterpart of the former to realize robust extended s-wave pairing symmetries in a square lattice. The unification identifies that the key ingredients (gene) of high [Formula: see text] superconductors is a quasi two dimensional electronic environment in which the d -orbitals of cations that participate in strong in-plane couplings to the p -orbitals of anions are isolated near Fermi energy. With this gene, the superexchange magnetic interactions mediated by anions could maximize their contributions to superconductivity. Creating the gene requires special arrangements between local electronic structures and crystal lattice structures. The speciality explains why high [Formula: see text] superconductors are so rare. An explicit prediction is made to realize high [Formula: see text] superconductivity in Co/Ni-based materials with a quasi two dimensional hexagonal lattice structure formed by trigonal bipyramidal complexes.

  1. Combining gene expression and genetic analyses to identify candidate genes involved in cold responses in pea.

    PubMed

    Legrand, Sylvain; Marque, Gilles; Blassiau, Christelle; Bluteau, Aurélie; Canoy, Anne-Sophie; Fontaine, Véronique; Jaminon, Odile; Bahrman, Nasser; Mautord, Julie; Morin, Julie; Petit, Aurélie; Baranger, Alain; Rivière, Nathalie; Wilmer, Jeroen; Delbreil, Bruno; Lejeune-Hénaut, Isabelle

    2013-09-01

    Cold stress affects plant growth and development. In order to better understand the responses to cold (chilling or freezing tolerance), we used two contrasted pea lines. Following a chilling period, the Champagne line becomes tolerant to frost whereas the Terese line remains sensitive. Four suppression subtractive hybridisation libraries were obtained using mRNAs isolated from pea genotypes Champagne and Terese. Using quantitative polymerase chain reaction (qPCR) performed on 159 genes, 43 and 54 genes were identified as differentially expressed at the initial time point and during the time course study, respectively. Molecular markers were developed from the differentially expressed genes and were genotyped on a population of 164 RILs derived from a cross between Champagne and Terese. We identified 5 candidate genes colocalizing with 3 different frost damage quantitative trait loci (QTL) intervals and a protein quantity locus (PQL) rich region previously reported. This investigation revealed the role of constitutive differences between both genotypes in the cold responses, in particular with genes related to glycine degradation pathway that could confer to Champagne a better frost tolerance. We showed that freezing tolerance involves a decrease of expression of genes related to photosynthesis and the expression of a gene involved in the production of cysteine and methionine that could act as cryoprotectant molecules. Although it remains to be confirmed, this study could also reveal the involvement of the jasmonate pathway in the cold responses, since we observed that two genes related to this pathway were mapped in a frost damage QTL interval and in a PQL rich region interval, respectively. Copyright © 2013 Elsevier GmbH. All rights reserved.

  2. Causality and causal inference in epidemiology: the need for a pluralistic approach.

    PubMed

    Vandenbroucke, Jan P; Broadbent, Alex; Pearce, Neil

    2016-12-01

    Causal inference based on a restricted version of the potential outcomes approach reasoning is assuming an increasingly prominent place in the teaching and practice of epidemiology. The proposed concepts and methods are useful for particular problems, but it would be of concern if the theory and practice of the complete field of epidemiology were to become restricted to this single approach to causal inference. Our concerns are that this theory restricts the questions that epidemiologists may ask and the study designs that they may consider. It also restricts the evidence that may be considered acceptable to assess causality, and thereby the evidence that may be considered acceptable for scientific and public health decision making. These restrictions are based on a particular conceptual framework for thinking about causality. In Section 1, we describe the characteristics of the restricted potential outcomes approach (RPOA) and show that there is a methodological movement which advocates these principles, not just for solving particular problems, but as ideals for which epidemiology as a whole should strive. In Section 2, we seek to show that the limitation of epidemiology to one particular view of the nature of causality is problematic. In Section 3, we argue that the RPOA is also problematic with regard to the assessment of causality. We argue that it threatens to restrict study design choice, to wrongly discredit the results of types of observational studies that have been very useful in the past and to damage the teaching of epidemiological reasoning. Finally, in Section 4 we set out what we regard as a more reasonable 'working hypothesis' as to the nature of causality and its assessment: pragmatic pluralism. © The Author 2016. Published by Oxford University Press on behalf of the International Epidemiological Association.

  3. The Causal Effects of Father Absence

    PubMed Central

    McLanahan, Sara; Tach, Laura; Schneider, Daniel

    2014-01-01

    The literature on father absence is frequently criticized for its use of cross-sectional data and methods that fail to take account of possible omitted variable bias and reverse causality. We review studies that have responded to this critique by employing a variety of innovative research designs to identify the causal effect of father absence, including studies using lagged dependent variable models, growth curve models, individual fixed effects models, sibling fixed effects models, natural experiments, and propensity score matching models. Our assessment is that studies using more rigorous designs continue to find negative effects of father absence on offspring well-being, although the magnitude of these effects is smaller than what is found using traditional cross-sectional designs. The evidence is strongest and most consistent for outcomes such as high school graduation, children’s social-emotional adjustment, and adult mental health. PMID:24489431

  4. Instrumental variable approaches to identifying the causal effect of educational attainment on dementia risk

    PubMed Central

    Nguyen, Thu T.; Tchetgen Tchetgen, Eric J.; Kawachi, Ichiro; Gilman, Stephen E.; Walter, Stefan; Liu, Sze Y.; Manly, Jennifer; Glymour, M. Maria

    2015-01-01

    Purpose Education is an established correlate of cognitive status in older adulthood, but whether expanding educational opportunities would improve cognitive functioning remains unclear given limitations of prior studies for causal inference. Therefore, we conducted instrumental variable (IV) analyses of the association between education and dementia risk, using for the first time in this area, genetic variants as instruments as well as state-level school policies. Methods IV analyses in the Health and Retirement Study cohort (1998–2010) used two sets of instruments: 1) a genetic risk score constructed from three single nucleotide polymorphisms (SNPs) (n=8,054); and 2) compulsory schooling laws (CSLs) and state school characteristics (term length, student teacher ratios, and expenditures) (n=13,167). Results Employing the genetic risk score as an IV, there was a 1.1% reduction in dementia risk per year of schooling (95% CI: −2.4, 0.02). Leveraging compulsory schooling laws and state school characteristics as IVs, there was a substantially larger protective effect (−9.5%; 95% CI: −14.8, −4.2). Analyses evaluating the plausibility of the IV assumptions indicated estimates derived from analyses relying on CSLs provide the best estimates of the causal effect of education. Conclusion IV analyses suggest education is protective against risk of dementia in older adulthood. PMID:26633592

  5. Causal pathways linking Farm to School to childhood obesity prevention.

    PubMed

    Joshi, Anupama; Ratcliffe, Michelle M

    2012-08-01

    Farm to School programs are rapidly gaining attention as a potential strategy for preventing childhood obesity; however, the causal linkages between Farm to School activities and health outcomes are not well documented. To capitalize on the increased interest in and momentum for Farm to School, researchers and practitioners need to move from developing and implementing evidence informed programs and policies to ones that are evidence-based. The purpose of this article is to outline a framework for facilitating an evidence base for Farm to School programs and policies through a systematic and coordinated approach. Employing the concepts of causal pathways, the authors introduce a proposed framework for organizing and systematically testing out multiple hypotheses (or potential causal links) for how, why, and under what conditions Farm to School Inputs and Activities may result in what Outputs, Effects, and Impacts. Using the causal pathways framework may help develop and test competing hypotheses, identify multicausality, strength, and interactions of causes, and discern the difference between catalysts and causes. In this article, we introduce causal pathways, present menus of potential independent and dependent variables from which to create and test causal pathways linking Farm to School interventions and their role in preventing childhood obesity, discuss their applicability to Farm to School research and practice, and outline proposed next steps for developing a coordinated research framework for Farm to School programs.

  6. Causal Reasoning on Biological Networks: Interpreting Transcriptional Changes

    NASA Astrophysics Data System (ADS)

    Chindelevitch, Leonid; Ziemek, Daniel; Enayetallah, Ahmed; Randhawa, Ranjit; Sidders, Ben; Brockel, Christoph; Huang, Enoch

    Over the past decade gene expression data sets have been generated at an increasing pace. In addition to ever increasing data generation, the biomedical literature is growing exponentially. The PubMed database (Sayers et al., 2010) comprises more than 20 million citations as of October 2010. The goal of our method is the prediction of putative upstream regulators of observed expression changes based on a set of over 400,000 causal relationships. The resulting putative regulators constitute directly testable hypotheses for follow-up.

  7. Experimental test of nonlocal causality

    PubMed Central

    Ringbauer, Martin; Giarmatzi, Christina; Chaves, Rafael; Costa, Fabio; White, Andrew G.; Fedrizzi, Alessandro

    2016-01-01

    Explaining observations in terms of causes and effects is central to empirical science. However, correlations between entangled quantum particles seem to defy such an explanation. This implies that some of the fundamental assumptions of causal explanations have to give way. We consider a relaxation of one of these assumptions, Bell’s local causality, by allowing outcome dependence: a direct causal influence between the outcomes of measurements of remote parties. We use interventional data from a photonic experiment to bound the strength of this causal influence in a two-party Bell scenario, and observational data from a Bell-type inequality test for the considered models. Our results demonstrate the incompatibility of quantum mechanics with a broad class of nonlocal causal models, which includes Bell-local models as a special case. Recovering a classical causal picture of quantum correlations thus requires an even more radical modification of our classical notion of cause and effect. PMID:27532045

  8. Causal inference in public health.

    PubMed

    Glass, Thomas A; Goodman, Steven N; Hernán, Miguel A; Samet, Jonathan M

    2013-01-01

    Causal inference has a central role in public health; the determination that an association is causal indicates the possibility for intervention. We review and comment on the long-used guidelines for interpreting evidence as supporting a causal association and contrast them with the potential outcomes framework that encourages thinking in terms of causes that are interventions. We argue that in public health this framework is more suitable, providing an estimate of an action's consequences rather than the less precise notion of a risk factor's causal effect. A variety of modern statistical methods adopt this approach. When an intervention cannot be specified, causal relations can still exist, but how to intervene to change the outcome will be unclear. In application, the often-complex structure of causal processes needs to be acknowledged and appropriate data collected to study them. These newer approaches need to be brought to bear on the increasingly complex public health challenges of our globalized world.

  9. Causal conditionals and counterfactuals

    PubMed Central

    Frosch, Caren A.; Byrne, Ruth M.J.

    2012-01-01

    Causal counterfactuals e.g., ‘if the ignition key had been turned then the car would have started’ and causal conditionals e.g., ‘if the ignition key was turned then the car started’ are understood by thinking about multiple possibilities of different sorts, as shown in six experiments using converging evidence from three different types of measures. Experiments 1a and 1b showed that conditionals that comprise enabling causes, e.g., ‘if the ignition key was turned then the car started’ primed people to read quickly conjunctions referring to the possibility of the enabler occurring without the outcome, e.g., ‘the ignition key was turned and the car did not start’. Experiments 2a and 2b showed that people paraphrased causal conditionals by using causal or temporal connectives (because, when), whereas they paraphrased causal counterfactuals by using subjunctive constructions (had…would have). Experiments 3a and 3b showed that people made different inferences from counterfactuals presented with enabling conditions compared to none. The implications of the results for alternative theories of conditionals are discussed. PMID:22858874

  10. Microarray expression profiling identifies genes with altered expression in HDL-deficient mice

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Callow, Matthew J.; Dudoit, Sandrine; Gong, Elaine L.

    2000-05-05

    Based on the assumption that severe alterations in the expression of genes known to be involved in HDL metabolism may affect the expression of other genes we screened an array of over 5000 mouse expressed sequence tags (ESTs) for altered gene expression in the livers of two lines of mice with dramatic decreases in HDL plasma concentrations. Labeled cDNA from livers of apolipoprotein AI (apo AI) knockout mice, Scavenger Receptor BI (SR-BI) transgenic mice and control mice were co-hybridized to microarrays. Two-sample t-statistics were used to identify genes with altered expression levels in the knockout or transgenic mice compared withmore » the control mice. In the SR-BI group we found 9 array elements representing at least 5 genes to be significantly altered on the basis of an adjusted p value of less than 0.05. In the apo AI knockout group 8 array elements representing 4 genes were altered compared with the control group (p < 0.05). Several of the genes identified in the SR-BI transgenic suggest altered sterol metabolism and oxidative processes. These studies illustrate the use of multiple-testing methods for the identification of genes with altered expression in replicated microarray experiments of apo AI knockout and SR-BI transgenic mice.« less

  11. Causal Discovery of Dynamic Systems

    ERIC Educational Resources Information Center

    Voortman, Mark

    2010-01-01

    Recently, several philosophical and computational approaches to causality have used an interventionist framework to clarify the concept of causality [Spirtes et al., 2000, Pearl, 2000, Woodward, 2005]. The characteristic feature of the interventionist approach is that causal models are potentially useful in predicting the effects of manipulations.…

  12. Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.

    PubMed

    Allman, Elizabeth S; Degnan, James H; Rhodes, John A

    2011-06-01

    Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals-each with many genes-splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for 5 or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled.

  13. Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines

    PubMed Central

    Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

    2016-01-01

    Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours’ biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription–quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes. PMID:29263807

  14. Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines.

    PubMed

    Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

    2016-01-01

    Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.

  15. Haplotype Analysis in Multiple Crosses to Identify a QTL Gene

    PubMed Central

    Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly

    2004-01-01

    Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P ≤ 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene. PMID:15310659

  16. Haplotype analysis in multiple crosses to identify a QTL gene.

    PubMed

    Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly

    2004-09-01

    Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P < or = 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene.

  17. Formalizing Neurath's ship: Approximate algorithms for online causal learning.

    PubMed

    Bramley, Neil R; Dayan, Peter; Griffiths, Thomas L; Lagnado, David A

    2017-04-01

    Higher-level cognition depends on the ability to learn models of the world. We can characterize this at the computational level as a structure-learning problem with the goal of best identifying the prevailing causal relationships among a set of relata. However, the computational cost of performing exact Bayesian inference over causal models grows rapidly as the number of relata increases. This implies that the cognitive processes underlying causal learning must be substantially approximate. A powerful class of approximations that focuses on the sequential absorption of successive inputs is captured by the Neurath's ship metaphor in philosophy of science, where theory change is cast as a stochastic and gradual process shaped as much by people's limited willingness to abandon their current theory when considering alternatives as by the ground truth they hope to approach. Inspired by this metaphor and by algorithms for approximating Bayesian inference in machine learning, we propose an algorithmic-level model of causal structure learning under which learners represent only a single global hypothesis that they update locally as they gather evidence. We propose a related scheme for understanding how, under these limitations, learners choose informative interventions that manipulate the causal system to help elucidate its workings. We find support for our approach in the analysis of 3 experiments. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  18. Experimental verification of an indefinite causal order

    PubMed Central

    Rubino, Giulia; Rozema, Lee A.; Feix, Adrien; Araújo, Mateus; Zeuner, Jonas M.; Procopio, Lorenzo M.; Brukner, Časlav; Walther, Philip

    2017-01-01

    Investigating the role of causal order in quantum mechanics has recently revealed that the causal relations of events may not be a priori well defined in quantum theory. Although this has triggered a growing interest on the theoretical side, creating processes without a causal order is an experimental task. We report the first decisive demonstration of a process with an indefinite causal order. To do this, we quantify how incompatible our setup is with a definite causal order by measuring a “causal witness.” This mathematical object incorporates a series of measurements that are designed to yield a certain outcome only if the process under examination is not consistent with any well-defined causal order. In our experiment, we perform a measurement in a superposition of causal orders—without destroying the coherence—to acquire information both inside and outside of a “causally nonordered process.” Using this information, we experimentally determine a causal witness, demonstrating by almost 7 SDs that the experimentally implemented process does not have a definite causal order. PMID:28378018

  19. Comparison of gene expression in segregating families identifies genes and genomic regions involved in a novel adaptation, zinc hyperaccumulation.

    PubMed

    Filatov, Victor; Dowdle, John; Smirnoff, Nicholas; Ford-Lloyd, Brian; Newbury, H John; Macnair, Mark R

    2006-09-01

    One of the challenges of comparative genomics is to identify specific genetic changes associated with the evolution of a novel adaptation or trait. We need to be able to disassociate the genes involved with a particular character from all the other genetic changes that take place as lineages diverge. Here we show that by comparing the transcriptional profile of segregating families with that of parent species differing in a novel trait, it is possible to narrow down substantially the list of potential target genes. In addition, by assuming synteny with a related model organism for which the complete genome sequence is available, it is possible to use the cosegregation of markers differing in transcription level to identify regions of the genome which probably contain quantitative trait loci (QTLs) for the character. This novel combination of genomics and classical genetics provides a very powerful tool to identify candidate genes. We use this methodology to investigate zinc hyperaccumulation in Arabidopsis halleri, the sister species to the model plant, Arabidopsis thaliana. We compare the transcriptional profile of A. halleri with that of its sister nonaccumulator species, Arabidopsis petraea, and between accumulator and nonaccumulator F(3)s derived from the cross between the two species. We identify eight genes which consistently show greater expression in accumulator phenotypes in both roots and shoots, including two metal transporter genes (NRAMP3 and ZIP6), and cytoplasmic aconitase, a gene involved in iron homeostasis in mammals. We also show that there appear to be two QTLs for zinc accumulation, on chromosomes 3 and 7.

  20. Transitive closure of subsumption and causal relations in a large ontology of radiological diagnosis.

    PubMed

    Kahn, Charles E

    2016-06-01

    The Radiology Gamuts Ontology (RGO)-an ontology of diseases, interventions, and imaging findings-was developed to aid in decision support, education, and translational research in diagnostic radiology. The ontology defines a subsumption (is_a) relation between more general and more specific terms, and a causal relation (may_cause) to express the relationship between disorders and their possible imaging manifestations. RGO incorporated 19,745 terms with their synonyms and abbreviations, 1768 subsumption relations, and 55,558 causal relations. Transitive closure was computed iteratively; it yielded 2154 relations over subsumption and 1,594,896 relations over causality. Five causal cycles were discovered, all with path length of no more than 5. The graph-theoretic metrics of in-degree and out-degree were explored; the most useful metric to prioritize modification of the ontology was found to be the product of the in-degree of transitive closure over subsumption and the out-degree of transitive closure over causality. Two general types of error were identified: (1) causal assertions that used overly general terms because they implicitly assumed an organ-specific context and (2) subsumption relations where a site-specific disorder was asserted to be a subclass of the general disorder. Transitive closure helped identify incorrect assertions, prioritized and guided ontology revision, and aided resources that applied the ontology's knowledge. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Confounding factors in determining causal soil moisture-precipitation feedback

    NASA Astrophysics Data System (ADS)

    Tuttle, Samuel E.; Salvucci, Guido D.

    2017-07-01

    Identification of causal links in the land-atmosphere system is important for construction and testing of land surface and general circulation models. However, the land and atmosphere are highly coupled and linked by a vast number of complex, interdependent processes. Statistical methods, such as Granger causality, can help to identify feedbacks from observational data, independent of the different parameterizations of physical processes and spatiotemporal resolution effects that influence feedbacks in models. However, statistical causal identification methods can easily be misapplied, leading to erroneous conclusions about feedback strength and sign. Here, we discuss three factors that must be accounted for in determination of causal soil moisture-precipitation feedback in observations and model output: seasonal and interannual variability, precipitation persistence, and endogeneity. The effect of neglecting these factors is demonstrated in simulated and observational data. The results show that long-timescale variability and precipitation persistence can have a substantial effect on detected soil moisture-precipitation feedback strength, while endogeneity has a smaller effect that is often masked by measurement error and thus is more likely to be an issue when analyzing model data or highly accurate observational data.

  2. Common variants at the CHEK2 gene locus and risk of epithelial ovarian cancer

    PubMed Central

    Lawrenson, Kate; Iversen, Edwin S.; Tyrer, Jonathan; Weber, Rachel Palmieri; Concannon, Patrick; Hazelett, Dennis J.; Li, Qiyuan; Marks, Jeffrey R.; Berchuck, Andrew; Lee, Janet M.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Plisiecka-Halasa, Joanna; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Gronwald, Jacek; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Jakubowska, Anna; Paul, James; Jensen, Allan; Karlan, Beth Y.; Kjaer, Susanne Kruger; Kelemen, Linda E.; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Cannioto, Rikki; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; Nevanlinna, Heli; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Noor Azmi, Mat Adenan; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Pearce, Celeste L.; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Phelan, Catherine M.; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Budzilowska, Agnieszka; Sellers, Thomas A.; Shu, Xiao-Ou; Shvetsov, Yurii B.; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J.; Timorek, Agnieszka; Tworoger, Shelley S.; Nieuwenhuysen, Els Van; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Coetzee, Gerhard A.; Freedman, Matthew L.; Monteiro, Alvaro N.A.; Moes-Sosnowska, Joanna; Kupryjanczyk, Jolanta; Pharoah, Paul D.; Gayther, Simon A.; Schildkraut, Joellen M.

    2015-01-01

    Genome-wide association studies have identified 20 genomic regions associated with risk of epithelial ovarian cancer (EOC), but many additional risk variants may exist. Here, we evaluated associations between common genetic variants [single nucleotide polymorphisms (SNPs) and indels] in DNA repair genes and EOC risk. We genotyped 2896 common variants at 143 gene loci in DNA samples from 15 397 patients with invasive EOC and controls. We found evidence of associations with EOC risk for variants at FANCA, EXO1, E2F4, E2F2, CREB5 and CHEK2 genes (P ≤ 0.001). The strongest risk association was for CHEK2 SNP rs17507066 with serous EOC (P = 4.74 x 10–7). Additional genotyping and imputation of genotypes from the 1000 genomes project identified a slightly more significant association for CHEK2 SNP rs6005807 (r 2 with rs17507066 = 0.84, odds ratio (OR) 1.17, 95% CI 1.11–1.24, P = 1.1×10−7). We identified 293 variants in the region with likelihood ratios of less than 1:100 for representing the causal variant. Functional annotation identified 25 candidate SNPs that alter transcription factor binding sites within regulatory elements active in EOC precursor tissues. In The Cancer Genome Atlas dataset, CHEK2 gene expression was significantly higher in primary EOCs compared to normal fallopian tube tissues (P = 3.72×10−8). We also identified an association between genotypes of the candidate causal SNP rs12166475 (r 2 = 0.99 with rs6005807) and CHEK2 expression (P = 2.70×10-8). These data suggest that common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene. PMID:26424751

  3. Common variants at the CHEK2 gene locus and risk of epithelial ovarian cancer.

    PubMed

    Lawrenson, Kate; Iversen, Edwin S; Tyrer, Jonathan; Weber, Rachel Palmieri; Concannon, Patrick; Hazelett, Dennis J; Li, Qiyuan; Marks, Jeffrey R; Berchuck, Andrew; Lee, Janet M; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Bandera, Elisa V; Bean, Yukie; Beckmann, Matthias W; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Ann; Chen, Zhihua; Cook, Linda S; Cramer, Daniel W; Cunningham, Julie M; Cybulski, Cezary; Plisiecka-Halasa, Joanna; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Eccles, Diana; Easton, Douglas T; Edwards, Robert P; Eilber, Ursula; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Gronwald, Jacek; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Jakubowska, Anna; Paul, James; Jensen, Allan; Karlan, Beth Y; Kjaer, Susanne Kruger; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph L; Kiemeney, Lambertus A; Krakstad, Camilla; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Cannioto, Rikki; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F A G; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Nevanlinna, Heli; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Noor Azmi, Mat Adenan; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Pearce, Celeste L; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Budzilowska, Agnieszka; Sellers, Thomas A; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston, Lara; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tworoger, Shelley S; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Coetzee, Gerhard A; Freedman, Matthew L; Monteiro, Alvaro N A; Moes-Sosnowska, Joanna; Kupryjanczyk, Jolanta; Pharoah, Paul D; Gayther, Simon A; Schildkraut, Joellen M

    2015-11-01

    Genome-wide association studies have identified 20 genomic regions associated with risk of epithelial ovarian cancer (EOC), but many additional risk variants may exist. Here, we evaluated associations between common genetic variants [single nucleotide polymorphisms (SNPs) and indels] in DNA repair genes and EOC risk. We genotyped 2896 common variants at 143 gene loci in DNA samples from 15 397 patients with invasive EOC and controls. We found evidence of associations with EOC risk for variants at FANCA, EXO1, E2F4, E2F2, CREB5 and CHEK2 genes (P ≤ 0.001). The strongest risk association was for CHEK2 SNP rs17507066 with serous EOC (P = 4.74 x 10(-7)). Additional genotyping and imputation of genotypes from the 1000 genomes project identified a slightly more significant association for CHEK2 SNP rs6005807 (r (2) with rs17507066 = 0.84, odds ratio (OR) 1.17, 95% CI 1.11-1.24, P = 1.1×10(-7)). We identified 293 variants in the region with likelihood ratios of less than 1:100 for representing the causal variant. Functional annotation identified 25 candidate SNPs that alter transcription factor binding sites within regulatory elements active in EOC precursor tissues. In The Cancer Genome Atlas dataset, CHEK2 gene expression was significantly higher in primary EOCs compared to normal fallopian tube tissues (P = 3.72×10(-8)). We also identified an association between genotypes of the candidate causal SNP rs12166475 (r (2) = 0.99 with rs6005807) and CHEK2 expression (P = 2.70×10(-8)). These data suggest that common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  4. Is There a Causal Effect of High School Math on Labor Market Outcomes?

    ERIC Educational Resources Information Center

    Joensen, Juanna Schroter; Nielsen, Helena Skyt

    2009-01-01

    In this paper, we exploit a high school pilot scheme to identify the causal effect of advanced high school math on labor market outcomes. The pilot scheme reduced the costs of choosing advanced math because it allowed for a more flexible combination of math with other courses. We find clear evidence of a causal relationship between math and…

  5. Integrating genome-wide association study and expression quantitative trait loci data identifies multiple genes and gene set associated with neuroticism.

    PubMed

    Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng

    2017-08-01

    Neuroticism is a fundamental personality trait with significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data of genome wide association study (GWAS) and expression quantitative trait locus (eQTL) study. GWAS summary data was driven from published studies of neuroticism, totally involving 170,906 subjects. eQTL dataset containing 927,753 eQTLs were obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted by summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10 -10 ), MGC57346 (p value=6.92×10 -7 ), BLK (p value=1.01×10 -6 ), XKR6 (p value=1.11×10 -6 ), C17ORF69 (p value=1.12×10 -6 ) and KIAA1267 (p value=4.00×10 -6 ). Gene set enrichment analysis observed significant association for Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.

  6. A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.

    PubMed

    Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C

    2017-10-01

    Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.

  7. Misregulation of Gene Expression and Sterility in Interspecies Hybrids: Causal Links and Alternative Hypotheses.

    PubMed

    Civetta, Alberto

    2016-05-01

    Understanding the origin of species is of interest to biologist in general and evolutionary biologist in particular. Hybrid male sterility (HMS) has been a focus in studies of speciation because sterility imposes a barrier to free gene flow between organisms, thus effectively isolating them as distinct species. In this review, I focus on the role of differential gene expression in HMS and speciation. Microarray and qPCR assays have established associations between misregulation of gene expression and sterility in hybrids between closely related species. These studies originally proposed disrupted expression of spermatogenesis genes as a causative of sterility. Alternatively, rapid genetic divergence of regulatory elements, particularly as they relate to the male sex (fast-male evolution), can drive the misregulation of sperm developmental genes in the absence of sterility. The use of fertile hybrids (both backcross and F1 progeny) as controls has lent support to this alternative explanation. Differences in gene expression between fertile and sterile hybrids can also be influenced by a pattern of faster evolution of the sex chromosome (fast-X evolution) than autosomes. In particular, it would be desirable to establish whether known X-chromosome sterility factors can act as trans-regulatory drivers of genome-wide patterns of misregulation. Genome-wide expression studies coupled with assays of proxies of sterility in F1 and BC progeny have identified candidate HMS genes but functional assays, and a better phenotypic characterization of sterility phenotypes, are needed to rigorously test how these genes might contribute to HMS.

  8. A large-scale RNA interference screen identifies genes that regulate autophagy at different stages.

    PubMed

    Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man; He, Bin; Zhang, Liqing; Varmark, Hanne; Green, Michael R; Sheng, Zhi

    2018-02-12

    Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed a large-scale RNA interference screen in K562 human chronic myeloid leukemia cells using monodansylcadaverine staining, an autophagy-detecting approach equivalent to immunoblotting of the autophagy marker LC3B or fluorescence microscopy of GFP-LC3B. By coupling monodansylcadaverine staining with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells where we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. 20 genes have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays revealed that 57 autophagy-regulating genes suppressed autophagy initiation, whereas 21 candidates promoted autophagy maturation. Our RNA interference screen identifies identified genes that regulate autophagy at different stages, which helps decode autophagy regulation in cancer and offers novel avenues to develop autophagy-related therapies for cancer.

  9. Overexpression screens identify conserved dosage chromosome instability genes in yeast and human cancer

    PubMed Central

    Duffy, Supipi; Fam, Hok Khim; Wang, Yi Kan; Styles, Erin B.; Kim, Jung-Hyun; Ang, J. Sidney; Singh, Tejomayee; Larionov, Vladimir; Shah, Sohrab P.; Andrews, Brenda; Boerkoel, Cornelius F.; Hieter, Philip

    2016-01-01

    Somatic copy number amplification and gene overexpression are common features of many cancers. To determine the role of gene overexpression on chromosome instability (CIN), we performed genome-wide screens in the budding yeast for yeast genes that cause CIN when overexpressed, a phenotype we refer to as dosage CIN (dCIN), and identified 245 dCIN genes. This catalog of genes reveals human orthologs known to be recurrently overexpressed and/or amplified in tumors. We show that two genes, TDP1, a tyrosyl-DNA-phosphdiesterase, and TAF12, an RNA polymerase II TATA-box binding factor, cause CIN when overexpressed in human cells. Rhabdomyosarcoma lines with elevated human Tdp1 levels also exhibit CIN that can be partially rescued by siRNA-mediated knockdown of TDP1. Overexpression of dCIN genes represents a genetic vulnerability that could be leveraged for selective killing of cancer cells through targeting of an unlinked synthetic dosage lethal (SDL) partner. Using SDL screens in yeast, we identified a set of genes that when deleted specifically kill cells with high levels of Tdp1. One gene was the histone deacetylase RPD3, for which there are known inhibitors. Both HT1080 cells overexpressing hTDP1 and rhabdomyosarcoma cells with elevated levels of hTdp1 were more sensitive to histone deacetylase inhibitors valproic acid (VPA) and trichostatin A (TSA), recapitulating the SDL interaction in human cells and suggesting VPA and TSA as potential therapeutic agents for tumors with elevated levels of hTdp1. The catalog of dCIN genes presented here provides a candidate list to identify genes that cause CIN when overexpressed in cancer, which can then be leveraged through SDL to selectively target tumors. PMID:27551064

  10. Quantum correlations with no causal order

    PubMed Central

    Oreshkov, Ognyan; Costa, Fabio; Brukner, Časlav

    2012-01-01

    The idea that events obey a definite causal order is deeply rooted in our understanding of the world and at the basis of the very notion of time. But where does causal order come from, and is it a necessary property of nature? Here, we address these questions from the standpoint of quantum mechanics in a new framework for multipartite correlations that does not assume a pre-defined global causal structure but only the validity of quantum mechanics locally. All known situations that respect causal order, including space-like and time-like separated experiments, are captured by this framework in a unified way. Surprisingly, we find correlations that cannot be understood in terms of definite causal order. These correlations violate a 'causal inequality' that is satisfied by all space-like and time-like correlations. We further show that in a classical limit causal order always arises, which suggests that space-time may emerge from a more fundamental structure in a quantum-to-classical transition. PMID:23033068

  11. Paradoxical Behavior of Granger Causality

    NASA Astrophysics Data System (ADS)

    Witt, Annette; Battaglia, Demian; Gail, Alexander

    2013-03-01

    Granger causality is a standard tool for the description of directed interaction of network components and is popular in many scientific fields including econometrics, neuroscience and climate science. For time series that can be modeled as bivariate auto-regressive processes we analytically derive an expression for spectrally decomposed Granger Causality (SDGC) and show that this quantity depends only on two out of four groups of model parameters. Then we present examples of such processes whose SDGC expose paradoxical behavior in the sense that causality is high for frequency ranges with low spectral power. For avoiding misinterpretations of Granger causality analysis we propose to complement it by partial spectral analysis. Our findings are illustrated by an example from brain electrophysiology. Finally, we draw implications for the conventional definition of Granger causality. Bernstein Center for Computational Neuroscience Goettingen

  12. Integrating mean and variance heterogeneities to identify differentially expressed genes.

    PubMed

    Ouyang, Weiwei; An, Qiang; Zhao, Jinying; Qin, Huaizhen

    2016-12-06

    In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (aka, the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration; and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both mean and some higher-order characteristics of the distributions of expression levels of susceptible genes. In this report, we put forth a conception of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existent mean heterogeneity tests and variance heterogeneity tests. Based on the independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT well controlled type I error rates, and so did existent mean heterogeneity test (i.e., the Welch t test (WT), the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment

  13. G20210A prothrombin gene mutation identified in patients with venous leg ulcers.

    PubMed

    Jebeleanu, G; Procopciuc, L

    2001-01-01

    The G20210A mutation variant of prothrombin gene is the second most frequent mutation identified in patients with deep venous thrombosis, after factor V Leiden. The risk for developing deep venous thrombosis is high in patients identified as heterozygous for G20210A mutation. In order to identify this polymorphism in the gene coding prothrombin, the 345bp fragment in the 3'- untranslated region of the prothrombin gene was amplified using amplification by polymerase chain reaction and enzymatic digestion by HindIII (restriction endonuclease enzyme). The products of amplification and enzymatic's digestion were analized using agarose gel electrophoresis. We investigated 20 patients with venous leg ulcers and we found 2 heterozygous (10%) for G20210A mutation. None of the patients in the control group had G20210A mutation. Our study confirms the presence of G20210A mutation in the Romanian population. Our study also shows the link between venous leg ulcers and this polymorphism in the prothrombin gene.

  14. Fostering Deeper Critical Inquiry with Causal Layered Analysis

    ERIC Educational Resources Information Center

    Haigh, Martin

    2016-01-01

    Causal layered analysis (CLA) is a technique that enables deeper critical inquiry through a structured exploration of four layers of causation. CLA's layers reach down from the surface litany of media understanding, through the layer of systemic causes identified by conventional research, to underpinning worldviews, ideologies and philosophies,…

  15. A Novel Yeast Genomics Method for Identifying New Breast Cancer Susceptibility Genes

    DTIC Science & Technology

    2007-05-01

    find new candidate genes for breast cancer susceptibility in women and identifying these human genes can further improve monitoring and treatment...breast cancer susceptibility genes in humans that are currently unknown and not deducible from current methodologies. It is a fundamental...template to faithfully repair the broken strand. In human cancer it is loss of HR, rather than NHEJ, that is more important in increasing cancer

  16. On the Inference of Functional Circadian Networks Using Granger Causality

    PubMed Central

    Pourzanjani, Arya; Herzog, Erik D.; Petzold, Linda R.

    2015-01-01

    Being able to infer one way direct connections in an oscillatory network such as the suprachiastmatic nucleus (SCN) of the mammalian brain using time series data is difficult but crucial to understanding network dynamics. Although techniques have been developed for inferring networks from time series data, there have been no attempts to adapt these techniques to infer directional connections in oscillatory time series, while accurately distinguishing between direct and indirect connections. In this paper an adaptation of Granger Causality is proposed that allows for inference of circadian networks and oscillatory networks in general called Adaptive Frequency Granger Causality (AFGC). Additionally, an extension of this method is proposed to infer networks with large numbers of cells called LASSO AFGC. The method was validated using simulated data from several different networks. For the smaller networks the method was able to identify all one way direct connections without identifying connections that were not present. For larger networks of up to twenty cells the method shows excellent performance in identifying true and false connections; this is quantified by an area-under-the-curve (AUC) 96.88%. We note that this method like other Granger Causality-based methods, is based on the detection of high frequency signals propagating between cell traces. Thus it requires a relatively high sampling rate and a network that can propagate high frequency signals. PMID:26413748

  17. Association between expression of random gene sets and survival is evident in multiple cancer types and may be explained by sub-classification.

    PubMed

    Shimoni, Yishai

    2018-02-01

    One of the goals of cancer research is to identify a set of genes that cause or control disease progression. However, although multiple such gene sets were published, these are usually in very poor agreement with each other, and very few of the genes proved to be functional therapeutic targets. Furthermore, recent findings from a breast cancer gene-expression cohort showed that sets of genes selected randomly can be used to predict survival with a much higher probability than expected. These results imply that many of the genes identified in breast cancer gene expression analysis may not be causal of cancer progression, even though they can still be highly predictive of prognosis. We performed a similar analysis on all the cancer types available in the cancer genome atlas (TCGA), namely, estimating the predictive power of random gene sets for survival. Our work shows that most cancer types exhibit the property that random selections of genes are more predictive of survival than expected. In contrast to previous work, this property is not removed by using a proliferation signature, which implies that proliferation may not always be the confounder that drives this property. We suggest one possible solution in the form of data-driven sub-classification to reduce this property significantly. Our results suggest that the predictive power of random gene sets may be used to identify the existence of sub-classes in the data, and thus may allow better understanding of patient stratification. Furthermore, by reducing the observed bias this may allow more direct identification of biologically relevant, and potentially causal, genes.

  18. Association between expression of random gene sets and survival is evident in multiple cancer types and may be explained by sub-classification

    PubMed Central

    2018-01-01

    One of the goals of cancer research is to identify a set of genes that cause or control disease progression. However, although multiple such gene sets were published, these are usually in very poor agreement with each other, and very few of the genes proved to be functional therapeutic targets. Furthermore, recent findings from a breast cancer gene-expression cohort showed that sets of genes selected randomly can be used to predict survival with a much higher probability than expected. These results imply that many of the genes identified in breast cancer gene expression analysis may not be causal of cancer progression, even though they can still be highly predictive of prognosis. We performed a similar analysis on all the cancer types available in the cancer genome atlas (TCGA), namely, estimating the predictive power of random gene sets for survival. Our work shows that most cancer types exhibit the property that random selections of genes are more predictive of survival than expected. In contrast to previous work, this property is not removed by using a proliferation signature, which implies that proliferation may not always be the confounder that drives this property. We suggest one possible solution in the form of data-driven sub-classification to reduce this property significantly. Our results suggest that the predictive power of random gene sets may be used to identify the existence of sub-classes in the data, and thus may allow better understanding of patient stratification. Furthermore, by reducing the observed bias this may allow more direct identification of biologically relevant, and potentially causal, genes. PMID:29470520

  19. GWAS of clinically defined gout and subtypes identifies multiple susceptibility loci that include urate transporter genes

    PubMed Central

    Nakayama, Akiyoshi; Nakaoka, Hirofumi; Yamamoto, Ken; Sakiyama, Masayuki; Shaukat, Amara; Toyoda, Yu; Okada, Yukinori; Kamatani, Yoichiro; Nakamura, Takahiro; Takada, Tappei; Inoue, Katsuhisa; Yasujima, Tomoya; Yuasa, Hiroaki; Shirahama, Yuko; Nakashima, Hiroshi; Shimizu, Seiko; Higashino, Toshihide; Kawamura, Yusuke; Ogata, Hiraku; Kawaguchi, Makoto; Ohkawa, Yasuyuki; Danjoh, Inaho; Tokumasu, Atsumi; Ooyama, Keiko; Ito, Toshimitsu; Kondo, Takaaki; Wakai, Kenji; Stiburkova, Blanka; Pavelka, Karel; Stamp, Lisa K; Dalbeth, Nicola; Sakurai, Yutaka; Suzuki, Hiroshi; Hosoyamada, Makoto; Fujimori, Shin; Yokoo, Takashi; Hosoya, Tatsuo; Inoue, Ituro; Takahashi, Atsushi; Kubo, Michiaki; Ooyama, Hiroshi; Shimizu, Toru; Ichida, Kimiyoshi; Shinomiya, Nariyoshi; Merriman, Tony R; Matsuo, Hirotaka

    2017-01-01

    Objective A genome-wide association study (GWAS) of gout and its subtypes was performed to identify novel gout loci, including those that are subtype-specific. Methods Putative causal association signals from a GWAS of 945 clinically defined gout cases and 1213 controls from Japanese males were replicated with 1396 cases and 1268 controls using a custom chip of 1961 single nucleotide polymorphisms (SNPs). We also first conducted GWASs of gout subtypes. Replication with Caucasian and New Zealand Polynesian samples was done to further validate the loci identified in this study. Results In addition to the five loci we reported previously, further susceptibility loci were identified at a genome-wide significance level (p<5.0×10−8): urate transporter genes (SLC22A12 and SLC17A1) and HIST1H2BF-HIST1H4E for all gout cases, and NIPAL1 and FAM35A for the renal underexcretion gout subtype. While NIPAL1 encodes a magnesium transporter, functional analysis did not detect urate transport via NIPAL1, suggesting an indirect association with urate handling. Localisation analysis in the human kidney revealed expression of NIPAL1 and FAM35A mainly in the distal tubules, which suggests the involvement of the distal nephron in urate handling in humans. Clinically ascertained male patients with gout and controls of Caucasian and Polynesian ancestries were also genotyped, and FAM35A was associated with gout in all cases. A meta-analysis of the three populations revealed FAM35A to be associated with gout at a genome-wide level of significance (pmeta=3.58×10−8). Conclusions Our findings including novel gout risk loci provide further understanding of the molecular pathogenesis of gout and lead to a novel concept for the therapeutic target of gout/hyperuricaemia. PMID:27899376

  20. Causal capture effects in chimpanzees (Pan troglodytes).

    PubMed

    Matsuno, Toyomi; Tomonaga, Masaki

    2017-01-01

    Extracting a cause-and-effect structure from the physical world is an important demand for animals living in dynamically changing environments. Human perceptual and cognitive mechanisms are known to be sensitive and tuned to detect and interpret such causal structures. In contrast to rigorous investigations of human causal perception, the phylogenetic roots of this perception are not well understood. In the present study, we aimed to investigate the susceptibility of nonhuman animals to mechanical causality by testing whether chimpanzees perceived an illusion called causal capture (Scholl & Nakayama, 2002). Causal capture is a phenomenon in which a type of bistable visual motion of objects is perceived as causal collision due to a bias from a co-occurring causal event. In our experiments, we assessed the susceptibility of perception of a bistable stream/bounce motion event to a co-occurring causal event in chimpanzees. The results show that, similar to in humans, causal "bounce" percepts were significantly increased in chimpanzees with the addition of a task-irrelevant causal bounce event that was synchronously presented. These outcomes suggest that the perceptual mechanisms behind the visual interpretation of causal structures in the environment are evolutionarily shared between human and nonhuman animals. Copyright © 2016 Elsevier B.V. All rights reserved.

  1. You've gotta be lucky: Coverage and the elusive gene-gene interaction.

    PubMed

    Reimherr, Matthew; Nicolae, Dan L

    2011-01-01

    Genome-wide association studies (GWAS) have led to a large number of single-SNP association findings, but there has been, so far, no investigation resulting in the discovery of a replicable gene-gene interaction. In this paper, we examine some of the possible explanations for the lack of findings, and argue that coverage of causal variation not only has a large effect on the loss in power, but that the effect is larger than in the single-SNP analyses. We show that the product of linkage disequilibrium measures, r², between causal and tested SNPs offers a good approximation to the loss in efficiency as defined by the ratio of sample sizes that lead to similar power. We also demonstrate that, in addition to the huge search space, the loss in power due to coverage when using commercially available platforms makes the search for gene-gene interactions daunting. © 2010 The Authors Annals of Human Genetics © 2010 Blackwell Publishing Ltd/University College London.

  2. A Genome-Wide mQTL Analysis in Human Adipose Tissue Identifies Genetic Variants Associated with DNA Methylation, Gene Expression and Metabolic Traits

    PubMed Central

    Volkov, Petr; Olsson, Anders H.; Gillberg, Linn; Jørgensen, Sine W.; Brøns, Charlotte; Eriksson, Karl-Fredrik; Groop, Leif; Jansson, Per-Anders; Nilsson, Emma; Rönn, Tina; Vaag, Allan; Ling, Charlotte

    2016-01-01

    Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human adipose tissue of 119 men, where 592,794 single nucleotide polymorphisms (SNPs) were related to DNA methylation of 477,891 CpG sites, covering 99% of RefSeq genes. SNPs in significant mQTLs were further related to gene expression in adipose tissue and obesity related traits. We found 101,911 SNP-CpG pairs (mQTLs) in cis and 5,342 SNP-CpG pairs in trans showing significant associations between genotype and DNA methylation in adipose tissue after correction for multiple testing, where cis is defined as distance less than 500 kb between a SNP and CpG site. These mQTLs include reported obesity, lipid and type 2 diabetes loci, e.g. ADCY3/POMC, APOA5, CETP, FADS2, GCKR, SORT1 and LEPR. Significant mQTLs were overrepresented in intergenic regions meanwhile underrepresented in promoter regions and CpG islands. We further identified 635 SNPs in significant cis-mQTLs associated with expression of 86 genes in adipose tissue including CHRNA5, G6PC2, GPX7, RPL27A, THNSL2 and ZFP57. SNPs in significant mQTLs were also associated with body mass index (BMI), lipid traits and glucose and insulin levels in our study cohort and public available consortia data. Importantly, the Causal Inference Test (CIT) demonstrates how genetic variants mediate their effects on metabolic traits (e.g. BMI, cholesterol, high-density lipoprotein (HDL), hemoglobin A1c (HbA1c) and homeostatic model assessment of insulin resistance (HOMA-IR)) via altered DNA methylation in human adipose tissue. This study identifies genome-wide interactions between genetic and epigenetic variation in both cis and trans positions influencing gene expression in adipose tissue and in vivo (dys)metabolic traits associated with the development of obesity and

  3. Transcriptome analysis of an apple (Malus × domestica) yellow fruit somatic mutation identifies a gene network module highly associated with anthocyanin and epigenetic regulation

    PubMed Central

    El-Sharkawy, Islam; Liang, Dong; Xu, Kenong

    2015-01-01

    Using RNA-seq, this study analysed an apple (Malus×domestica) anthocyanin-deficient yellow-skin somatic mutant ‘Blondee’ (BLO) and its red-skin parent ‘Kidd’s D-8’ (KID), the original name of ‘Gala’, to understand the molecular mechanisms underlying the mutation. A total of 3299 differentially expressed genes (DEGs) were identified between BLO and KID at four developmental stages and/or between two adjacent stages within BLO and/or KID. A weighted gene co-expression network analysis (WGCNA) of the DEGs uncovered a network module of 34 genes highly correlated (r=0.95, P=9.0×10–13) with anthocyanin contents. Although 12 of the 34 genes in the WGCNA module were characterized and known of roles in anthocyanin, the remainder 22 appear to be novel. Examining the expression of ten representative genes in the module in 14 diverse apples revealed that at least eight were significantly correlated with anthocyanin variation. MdMYB10 (MDP0000259614) and MdGST (MDP0000252292) were among the most suppressed module member genes in BLO despite being undistinguishable in their corresponding sequences between BLO and KID. Methylation assay of MdMYB10 and MdGST in fruit skin revealed that two regions (MR3 and MR7) in the MdMYB10 promoter exhibited remarkable differences between BLO and KID. In particular, methylation was high and progressively increased alongside fruit development in BLO while was correspondingly low and constant in KID. The methylation levels in both MR3 and MR7 were negatively correlated with anthocyanin content as well as the expression of MdMYB10 and MdGST. Clearly, the collective repression of the 34 genes explains the loss-of-colour in BLO while the methylation in MdMYB10 promoter is likely causal for the mutation. PMID:26417021

  4. Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

    PubMed

    Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong

    2018-03-01

    A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F 2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding sites-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.

  5. A 6-gene signature identifies four molecular subgroups of neuroblastoma

    PubMed Central

    2011-01-01

    Background There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB); Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) was associated to unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples). Four distinct clusters were identified by Principal Components Analysis (PCA) in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples) using a set of 74 discriminative genes. The expression signature of six NB-associated genes ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p < 0.05, one-way ANOVA test). PCA clusters p1, p2, and p3 were found to correspond well to the postulated subtypes 1, 2A, and 2B, respectively. Remarkably, a fourth novel cluster was detected in all three independent data sets. This cluster comprised mainly 11q-deleted MNA-negative tumours with low expression of ALK, BIRC5, and PHOX2B, and was significantly associated with higher tumour stage, poor outcome and poor survival compared to the Type 1-corresponding favourable group (INSS stage 4 and/or dead of disease, p < 0.05, Fisher's exact test). Conclusions Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently made to further investigate this group's specific characteristics. PMID:21492432

  6. Exome sequencing of a large family identifies potential candidate genes contributing risk to bipolar disorder.

    PubMed

    Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P

    2018-03-01

    Bipolar disorder is a mental illness with lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1(Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Systems approach identifies an organic nitrogen-responsive gene network that is regulated by the master clock control gene CCA1.

    PubMed

    Gutiérrez, Rodrigo A; Stokes, Trevor L; Thum, Karen; Xu, Xiaodong; Obertello, Mariana; Katari, Manpreet S; Tanurdzic, Milos; Dean, Alexis; Nero, Damion C; McClung, C Robertson; Coruzzi, Gloria M

    2008-03-25

    Understanding how nutrients affect gene expression will help us to understand the mechanisms controlling plant growth and development as a function of nutrient availability. Nitrate has been shown to serve as a signal for the control of gene expression in Arabidopsis. There is also evidence, on a gene-by-gene basis, that downstream products of nitrogen (N) assimilation such as glutamate (Glu) or glutamine (Gln) might serve as signals of organic N status that in turn regulate gene expression. To identify genome-wide responses to such organic N signals, Arabidopsis seedlings were transiently treated with ammonium nitrate in the presence or absence of MSX, an inhibitor of glutamine synthetase, resulting in a block of Glu/Gln synthesis. Genes that responded to organic N were identified as those whose response to ammonium nitrate treatment was blocked in the presence of MSX. We showed that some genes previously identified to be regulated by nitrate are under the control of an organic N-metabolite. Using an integrated network model of molecular interactions, we uncovered a subnetwork regulated by organic N that included CCA1 and target genes involved in N-assimilation. We validated some of the predicted interactions and showed that regulation of the master clock control gene CCA1 by Glu or a Glu-derived metabolite in turn regulates the expression of key N-assimilatory genes. Phase response curve analysis shows that distinct N-metabolites can advance or delay the CCA1 phase. Regulation of CCA1 by organic N signals may represent a novel input mechanism for N-nutrients to affect plant circadian clock function.

  8. Measuring causal perception: connections to representational momentum?

    PubMed

    Choi, Hoon; Scholl, Brian J

    2006-01-01

    In a collision between two objects, we can perceive not only low-level properties, such as color and motion, but also the seemingly high-level property of causality. It has proven difficult, however, to measure causal perception in a quantitatively rigorous way which goes beyond perceptual reports. Here we focus on the possibility of measuring perceived causality using the phenomenon of representational momentum (RM). Recent studies suggest a relationship between causal perception and RM, based on the fact that RM appears to be attenuated for causally 'launched' objects. This is explained by appeal to the visual expectation that a 'launched' object is inert and thus should eventually cease its movement after a collision, without a source of self-propulsion. We first replicated these demonstrations, and then evaluated this alleged connection by exploring RM for different types of displays, including the contrast between causal launching and non-causal 'passing'. These experiments suggest that the RM-attenuation effect is not a pure measure of causal perception, but rather may reflect lower-level spatiotemporal correlates of only some causal displays. We conclude by discussing the strengths and pitfalls of various methods of measuring causal perception.

  9. Investigating the multi-causal and complex nature of the accident causal influence of construction project features.

    PubMed

    Manu, Patrick A; Ankrah, Nii A; Proverbs, David G; Suresh, Subashini

    2012-09-01

    Construction project features (CPFs) are organisational, physical and operational attributes that characterise construction projects. Although previous studies have examined the accident causal influence of CPFs, the multi-causal attribute of this causal phenomenon still remain elusive and thus requires further investigation. Aiming to shed light on this facet of the accident causal phenomenon of CPFs, this study examines relevant literature and crystallises the attained insight of the multi-causal attribute by a graphical model which is subsequently operationalised by a derived mathematical risk expression that offers a systematic approach for evaluating the potential of CPFs to cause harm and consequently their health and safety (H&S) risk implications. The graphical model and the risk expression put forth by the study thus advance current understanding of the accident causal phenomenon of CPFs and they present an opportunity for project participants to manage the H&S risk associated with CPFs from the early stages of project procurement. Copyright © 2011 Elsevier Ltd. All rights reserved.

  10. The major origin of seedless grapes is associated with a missense mutation in the MADS-box gene VviAGL11.

    PubMed

    Royo, Carolina; Torres-Pérez, Rafael; Mauri, Nuria; Diestro, Nieves; Cabezas, José Antonio; Marchal, Cécile; Lacombe, Thierry; Ibáñez, Javier; Tornel, Manuel; Carreño, Juan; Martínez-Zapater, José M; Carbonell-Bejerano, Pablo

    2018-05-31

    Seedlessness is greatly prized by consumers of fresh grapes. While stenospermocarpic seed abortion determined by the SEED DEVELOPMENT INHIBITOR (SDI) locus is the usual source of seedlessness in commercial grapevine (Vitis vinifera) cultivars, the underlying sdi mutation remains unknown. Here, we undertook an integrative approach to identify the causal mutation. Quantitative genetics and fine mapping in two 'Crimson Seedless' (CS)-derived F1 mapping populations confirmed the major effect of the SDI locus and delimited the sdi mutation to a 323-kb region on chromosome 18. RNA-seq comparing seed traces of seedless and seeds of seeded F1 individuals identified processes triggered during sdi-determined seed abortion, including activation of salicylic acid-dependent defenses. The RNA-seq dataset was investigated for candidate genes and, while no evidence for causal cis-acting regulatory mutations was detected, deleterious nucleotide changes in coding sequences of the seedless haplotype were predicted in two genes within the sdi fine mapping interval. Targeted re-sequencing of the two genes in a collection of 124 grapevine cultivars showed that only the point variation causing the Arg197Leu substitution in the seed morphogenesis regulator gene AGAMOUS-LIKE 11 (VviAGL11) was fully linked with stenospermocarpy. The concurrent post-zygotic variation identified for this missense polymorphism and seedlessness phenotype in seeded somatic variants of the original stenospermocarpic cultivar supports a causal effect. We postulate that seed abortion caused by this amino acid substitution in VviAGL11 is the major cause of seedlessness in cultivated grapevine. This information can be exploited to boost seedless grape breeding. {copyright, serif} 2018 American Society of Plant Biologists. All rights reserved.

  11. Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

    PubMed

    Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

    2017-11-13

    The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly

  12. Analysis of global gene expression profiles to identify differentially expressed genes critical for embryo development in Brassica rapa.

    PubMed

    Zhang, Yu; Peng, Lifang; Wu, Ya; Shen, Yanyue; Wu, Xiaoming; Wang, Jianbo

    2014-11-01

    Embryo development represents a crucial developmental period in the life cycle of flowering plants. To gain insights into the genetic programs that control embryo development in Brassica rapa L., RNA sequencing technology was used to perform transcriptome profiling analysis of B. rapa developing embryos. The results generated 42,906,229 sequence reads aligned with 32,941 genes. In total, 27,760, 28,871, 28,384, and 25,653 genes were identified from embryos at globular, heart, early cotyledon, and mature developmental stages, respectively, and analysis between stages revealed a subset of stage-specific genes. We next investigated 9,884 differentially expressed genes with more than fivefold changes in expression and false discovery rate ≤ 0.001 from three adjacent-stage comparisons; 1,514, 3,831, and 6,633 genes were detected between globular and heart stage embryo libraries, heart stage and early cotyledon stage, and early cotyledon and mature stage, respectively. Large numbers of genes related to cellular process, metabolism process, response to stimulus, and biological process were expressed during the early and middle stages of embryo development. Fatty acid biosynthesis, biosynthesis of secondary metabolites, and photosynthesis-related genes were expressed predominantly in embryos at the middle stage. Genes for lipid metabolism and storage proteins were highly expressed in the middle and late stages of embryo development. We also identified 911 transcription factor genes that show differential expression across embryo developmental stages. These results increase our understanding of the complex molecular and cellular events during embryo development in B. rapa and provide a foundation for future studies on other oilseed crops.

  13. Towards graphical causal structures

    NASA Astrophysics Data System (ADS)

    Paulsson, K. Johan

    2012-12-01

    Folowing recent work by R. Spekkens, M. Leifer and B. Coecke we investigate causal settings in a limited categorical version of the conditional density operator formalism. We particularly show how the compact structure for causal and acausal settings apply on the measurements of stabiliser theory.

  14. HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease.

    PubMed

    Ward, Lucas D; Kellis, Manolis

    2016-01-04

    More than 90% of common variants associated with complex traits do not affect proteins directly, but instead the circuits that control gene expression. This has increased the urgency of understanding the regulatory genome as a key component for translating genetic results into mechanistic insights and ultimately therapeutics. To address this challenge, we developed HaploReg (http://compbio.mit.edu/HaploReg) to aid the functional dissection of genome-wide association study (GWAS) results, the prediction of putative causal variants in haplotype blocks, the prediction of likely cell types of action, and the prediction of candidate target genes by systematic mining of comparative, epigenomic and regulatory annotations. Since first launching the website in 2011, we have greatly expanded HaploReg, increasing the number of chromatin state maps to 127 reference epigenomes from ENCODE 2012 and Roadmap Epigenomics, incorporating regulator binding data, expanding regulatory motif disruption annotations, and integrating expression quantitative trait locus (eQTL) variants and their tissue-specific target genes from GTEx, Geuvadis, and other recent studies. We present these updates as HaploReg v4, and illustrate a use case of HaploReg for attention deficit hyperactivity disorder (ADHD)-associated SNPs with putative brain regulatory mechanisms. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Formalizing the Role of Agent-Based Modeling in Causal Inference and Epidemiology

    PubMed Central

    Marshall, Brandon D. L.; Galea, Sandro

    2015-01-01

    Calls for the adoption of complex systems approaches, including agent-based modeling, in the field of epidemiology have largely centered on the potential for such methods to examine complex disease etiologies, which are characterized by feedback behavior, interference, threshold dynamics, and multiple interacting causal effects. However, considerable theoretical and practical issues impede the capacity of agent-based methods to examine and evaluate causal effects and thus illuminate new areas for intervention. We build on this work by describing how agent-based models can be used to simulate counterfactual outcomes in the presence of complexity. We show that these models are of particular utility when the hypothesized causal mechanisms exhibit a high degree of interdependence between multiple causal effects and when interference (i.e., one person's exposure affects the outcome of others) is present and of intrinsic scientific interest. Although not without challenges, agent-based modeling (and complex systems methods broadly) represent a promising novel approach to identify and evaluate complex causal effects, and they are thus well suited to complement other modern epidemiologic methods of etiologic inquiry. PMID:25480821

  16. Principal stratification in causal inference.

    PubMed

    Frangakis, Constantine E; Rubin, Donald B

    2002-03-01

    Many scientific problems require that treatment comparisons be adjusted for posttreatment variables, but the estimands underlying standard methods are not causal effects. To address this deficiency, we propose a general framework for comparing treatments adjusting for posttreatment variables that yields principal effects based on principal stratification. Principal stratification with respect to a posttreatment variable is a cross-classification of subjects defined by the joint potential values of that posttreatment variable tinder each of the treatments being compared. Principal effects are causal effects within a principal stratum. The key property of principal strata is that they are not affected by treatment assignment and therefore can be used just as any pretreatment covariate. such as age category. As a result, the central property of our principal effects is that they are always causal effects and do not suffer from the complications of standard posttreatment-adjusted estimands. We discuss briefly that such principal causal effects are the link between three recent applications with adjustment for posttreatment variables: (i) treatment noncompliance, (ii) missing outcomes (dropout) following treatment noncompliance. and (iii) censoring by death. We then attack the problem of surrogate or biomarker endpoints, where we show, using principal causal effects, that all current definitions of surrogacy, even when perfectly true, do not generally have the desired interpretation as causal effects of treatment on outcome. We go on to forrmulate estimands based on principal stratification and principal causal effects and show their superiority.

  17. Evolutionary Inference across Eukaryotes Identifies Specific Pressures Favoring Mitochondrial Gene Retention.

    PubMed

    Johnston, Iain G; Williams, Ben P

    2016-02-24

    Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modeling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondrial genomes, we inferred evolutionary trajectories of mtDNA gene loss across the eukaryotic tree of life. We find that proteins comprising the structural cores of the electron transport chain are preferentially encoded within mitochondrial genomes across eukaryotes. A combination of high GC content and high protein hydrophobicity is required to explain patterns of mtDNA gene retention; a model that accounts for these selective pressures can also predict the success of artificial gene transfer experiments in vivo. This work provides a general method for data-driven inference of the ordering of evolutionary and progressive events, here identifying the distinct features shaping mitochondrial genomes of present-day species. Copyright © 2016 Elsevier Inc. All rights reserved.

  18. Children's Counterfactual Reasoning About Causally Overdetermined Events.

    PubMed

    Nyhout, Angela; Henke, Lena; Ganea, Patricia A

    2017-08-07

    In two experiments, one hundred and sixty-two 6- to 8-year-olds were asked to reason counterfactually about events with different causal structures. All events involved overdetermined outcomes in which two different causal events led to the same outcome. In Experiment 1, children heard stories with either an ambiguous causal relation between events or causally unrelated events. Children in the causally unrelated version performed better than chance and better than those in the ambiguous condition. In Experiment 2, children heard stories in which antecedent events were causally connected or causally disconnected. Eight-year-olds performed above chance in both conditions, whereas 6-year-olds performed above chance only in the connected condition. This work provides the first evidence that children can reason counterfactually in causally overdetermined contexts by age 8. © 2017 The Authors. Child Development © 2017 Society for Research in Child Development, Inc.

  19. Coalitional game theory as a promising approach to identify candidate autism genes.

    PubMed

    Gupta, Anika; Sun, Min Woo; Paskov, Kelley Marie; Stockham, Nate Tyler; Jung, Jae-Yoon; Wall, Dennis Paul

    2018-01-01

    Despite mounting evidence for the strong role of genetics in the phenotypic manifestation of Autism Spectrum Disorder (ASD), the specific genes responsible for the variable forms of ASD remain undefined. ASD may be best explained by a combinatorial genetic model with varying epistatic interactions across many small effect mutations. Coalitional or cooperative game theory is a technique that studies the combined effects of groups of players, known as coalitions, seeking to identify players who tend to improve the performance--the relationship to a specific disease phenotype--of any coalition they join. This method has been previously shown to boost biologically informative signal in gene expression data but to-date has not been applied to the search for cooperative mutations among putative ASD genes. We describe our approach to highlight genes relevant to ASD using coalitional game theory on alteration data of 1,965 fully sequenced genomes from 756 multiplex families. Alterations were encoded into binary matrices for ASD (case) and unaffected (control) samples, indicating likely gene-disrupting, inherited mutations in altered genes. To determine individual gene contributions given an ASD phenotype, a "player" metric, referred to as the Shapley value, was calculated for each gene in the case and control cohorts. Sixty seven genes were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Using network and cross-study analysis, we found that these genes are involved in biological pathways known to be affected in the autism cases and that a subset directly interact with several genes known to have strong associations to autism. These findings suggest that coalitional game theory can be applied to large-scale genomic data to identify hidden yet influential players in complex polygenic disorders such as autism.

  20. The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

    PubMed

    Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

    2013-10-01

    The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.

  1. A vector space model approach to identify genetically related diseases.

    PubMed

    Sarkar, Indra Neil

    2012-01-01

    The relationship between diseases and their causative genes can be complex, especially in the case of polygenic diseases. Further exacerbating the challenges in their study is that many genes may be causally related to multiple diseases. This study explored the relationship between diseases through the adaptation of an approach pioneered in the context of information retrieval: vector space models. A vector space model approach was developed that bridges gene disease knowledge inferred across three knowledge bases: Online Mendelian Inheritance in Man, GenBank, and Medline. The approach was then used to identify potentially related diseases for two target diseases: Alzheimer disease and Prader-Willi Syndrome. In the case of both Alzheimer Disease and Prader-Willi Syndrome, a set of plausible diseases were identified that may warrant further exploration. This study furthers seminal work by Swanson, et al. that demonstrated the potential for mining literature for putative correlations. Using a vector space modeling approach, information from both biomedical literature and genomic resources (like GenBank) can be combined towards identification of putative correlations of interest. To this end, the relevance of the predicted diseases of interest in this study using the vector space modeling approach were validated based on supporting literature. The results of this study suggest that a vector space model approach may be a useful means to identify potential relationships between complex diseases, and thereby enable the coordination of gene-based findings across multiple complex diseases.

  2. A powerful score-based test statistic for detecting gene-gene co-association.

    PubMed

    Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

    2016-01-29

    The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.

  3. Transposon mutagenesis identifies genes that cooperate with mutant Pten in breast cancer progression

    PubMed Central

    Rangel, Roberto; Lee, Song-Choon; Hon-Kim Ban, Kenneth; Guzman-Rojas, Liliana; Mann, Michael B.; Newberg, Justin Y.; McNoe, Leslie A.; Selvanesan, Luxmanan; Ward, Jerrold M.; Rust, Alistair G.; Chin, Kuan-Yew; Black, Michael A.; Jenkins, Nancy A.; Copeland, Neal G.

    2016-01-01

    Triple-negative breast cancer (TNBC) has the worst prognosis of any breast cancer subtype. To better understand the genetic forces driving TNBC, we performed a transposon mutagenesis screen in a phosphatase and tensin homolog (Pten) mutant mice and identified 12 candidate trunk drivers and a much larger number of progression genes. Validation studies identified eight TNBC tumor suppressor genes, including the GATA-like transcriptional repressor TRPS1. Down-regulation of TRPS1 in TNBC cells promoted epithelial-to-mesenchymal transition (EMT) by deregulating multiple EMT pathway genes, in addition to increasing the expression of SERPINE1 and SERPINB2 and the subsequent migration, invasion, and metastasis of tumor cells. Transposon mutagenesis has thus provided a better understanding of the genetic forces driving TNBC and discovered genes with potential clinical importance in TNBC. PMID:27849608

  4. A Hierarchical Causal Taxonomy of Psychopathology across the Life Span

    PubMed Central

    Lahey, Benjamin B.; Krueger, Robert F.; Rathouz, Paul J.; Waldman, Irwin D.; Zald, David H.

    2016-01-01

    We propose a taxonomy of psychopathology based on patterns of shared causal influences identified in a review of multivariate behavior genetic studies that distinguish genetic and environmental influences that are either common to multiple dimensions of psychopathology or unique to each dimension. At the phenotypic level, first-order dimensions are defined by correlations among symptoms; correlations among first-order dimensions similarly define higher-order domains (e.g., internalizing or externalizing psychopathology). We hypothesize that the robust phenotypic correlations among first-order dimensions reflect a hierarchy of increasingly specific etiologic influences. Some nonspecific etiologic factors increase risk for all first-order dimensions of psychopathology to varying degrees through a general factor of psychopathology. Other nonspecific etiologic factors increase risk only for all first-order dimensions within a more specific higher-order domain. Furthermore, each first-order dimension has its own unique causal influences. Genetic and environmental influences common to family members tend to be nonspecific, whereas environmental influences unique to each individual are more dimension-specific. We posit that these causal influences on psychopathology are moderated by sex and developmental processes. This causal taxonomy also provides a novel framework for understanding the heterogeneity of each first-order dimension: Different persons exhibiting similar symptoms may be influenced by different combinations of etiologic influences from each of the three levels of the etiologic hierarchy. Furthermore, we relate the proposed causal taxonomy to transdimensional psychobiological processes, which also impact the heterogeneity of each psychopathology dimension. This causal taxonomy implies the need for changes in strategies for studying the etiology, psychobiology, prevention, and treatment of psychopathology. PMID:28004947

  5. Exploring individual differences in preschoolers' causal stance.

    PubMed

    Alvarez, Aubry; Booth, Amy E

    2016-03-01

    Preschoolers, as a group, are highly attuned to causality, and this attunement is known to facilitate memory, learning, and problem solving. However, recent work reveals substantial individual variability in the strength of children's "causal stance," as demonstrated by their curiosity about and preference for new causal information. In this study, we explored the coherence and short-term stability of individual differences in children's causal stance. We also began to investigate the origins of this variability, focusing particularly on the potential role of mothers' explanatory talk in shaping the causal stance of their children. Two measures of causal stance correlated with each other, as well as themselves across time. Both also revealed internal consistency of response. The strength of children's causal stance also correlated with mother's responses on the same tasks and the frequency with which mothers emphasized causality during naturalistic joint activities with their children. Implications for theory and practice are discussed. (c) 2016 APA, all rights reserved).

  6. Granger causality for state-space models

    NASA Astrophysics Data System (ADS)

    Barnett, Lionel; Seth, Anil K.

    2015-04-01

    Granger causality has long been a prominent method for inferring causal interactions between stochastic variables for a broad range of complex physical systems. However, it has been recognized that a moving average (MA) component in the data presents a serious confound to Granger causal analysis, as routinely performed via autoregressive (AR) modeling. We solve this problem by demonstrating that Granger causality may be calculated simply and efficiently from the parameters of a state-space (SS) model. Since SS models are equivalent to autoregressive moving average models, Granger causality estimated in this fashion is not degraded by the presence of a MA component. This is of particular significance when the data has been filtered, downsampled, observed with noise, or is a subprocess of a higher dimensional process, since all of these operations—commonplace in application domains as diverse as climate science, econometrics, and the neurosciences—induce a MA component. We show how Granger causality, conditional and unconditional, in both time and frequency domains, may be calculated directly from SS model parameters via solution of a discrete algebraic Riccati equation. Numerical simulations demonstrate that Granger causality estimators thus derived have greater statistical power and smaller bias than AR estimators. We also discuss how the SS approach facilitates relaxation of the assumptions of linearity, stationarity, and homoscedasticity underlying current AR methods, thus opening up potentially significant new areas of research in Granger causal analysis.

  7. Identifying Novel Transcriptional and Epigenetic Features of Nuclear Lamina-associated Genes.

    PubMed

    Wu, Feinan; Yao, Jie

    2017-03-07

    Because a large portion of the mammalian genome is associated with the nuclear lamina (NL), it is interesting to study how native genes resided there are transcribed and regulated. In this study, we report unique transcriptional and epigenetic features of nearly 3,500 NL-associated genes (NL genes). Promoter regions of active NL genes are often excluded from NL-association, suggesting that NL-promoter interactions may repress transcription. Active NL genes with higher RNA polymerase II (Pol II) recruitment levels tend to display Pol II promoter-proximal pausing, while Pol II recruitment and Pol II pausing are not correlated among non-NL genes. At the genome-wide scale, NL-association and H3K27me3 distinguishes two large gene classes with low transcriptional activities. Notably, NL-association is anti-correlated with both transcription and active histone mark levels among genes not significantly enriched with H3K9me3 or H3K27me3, suggesting that NL-association may represent a novel gene repression pathway. Interestingly, an NL gene subgroup is not significantly enriched with H3K9me3 or H3K27me3 and is transcribed at higher levels than the rest of NL genes. Furthermore, we identified distal enhancers associated with active NL genes and reported their epigenetic features.

  8. Using reporter gene assays to identify cis regulatory differences between humans and chimpanzees.

    PubMed

    Chabot, Adrien; Shrit, Ralla A; Blekhman, Ran; Gilad, Yoav

    2007-08-01

    Most phenotypic differences between human and chimpanzee are likely to result from differences in gene regulation, rather than changes to protein-coding regions. To date, however, only a handful of human-chimpanzee nucleotide differences leading to changes in gene regulation have been identified. To hone in on differences in regulatory elements between human and chimpanzee, we focused on 10 genes that were previously found to be differentially expressed between the two species. We then designed reporter gene assays for the putative human and chimpanzee promoters of the 10 genes. Of seven promoters that we found to be active in human liver cell lines, human and chimpanzee promoters had significantly different activity in four cases, three of which recapitulated the gene expression difference seen in the microarray experiment. For these three genes, we were therefore able to demonstrate that a change in cis influences expression differences between humans and chimpanzees. Moreover, using site-directed mutagenesis on one construct, the promoter for the DDA3 gene, we were able to identify three nucleotides that together lead to a cis regulatory difference between the species. High-throughput application of this approach can provide a map of regulatory element differences between humans and our close evolutionary relatives.

  9. Genome-Wide Association Study Identifies Candidate Genes for Starch Content Regulation in Maize Kernels

    PubMed Central

    Liu, Na; Xue, Yadong; Guo, Zhanyong; Li, Weihua; Tang, Jihua

    2016-01-01

    Kernel starch content is an important trait in maize (Zea mays L.) as it accounts for 65–75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60 to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM) as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001), among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437) is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops. PMID:27512395

  10. Causal knowledge and the development of inductive reasoning.

    PubMed

    Bright, Aimée K; Feeney, Aidan

    2014-06-01

    We explored the development of sensitivity to causal relations in children's inductive reasoning. Children (5-, 8-, and 12-year-olds) and adults were given trials in which they decided whether a property known to be possessed by members of one category was also possessed by members of (a) a taxonomically related category or (b) a causally related category. The direction of the causal link was either predictive (prey→predator) or diagnostic (predator→prey), and the property that participants reasoned about established either a taxonomic or causal context. There was a causal asymmetry effect across all age groups, with more causal choices when the causal link was predictive than when it was diagnostic. Furthermore, context-sensitive causal reasoning showed a curvilinear development, with causal choices being most frequent for 8-year-olds regardless of context. Causal inductions decreased thereafter because 12-year-olds and adults made more taxonomic choices when reasoning in the taxonomic context. These findings suggest that simple causal relations may often be the default knowledge structure in young children's inductive reasoning, that sensitivity to causal direction is present early on, and that children over-generalize their causal knowledge when reasoning. Copyright © 2013 Elsevier Inc. All rights reserved.

  11. Understanding environmental contributions to autism: Causal concepts and the state of science.

    PubMed

    Hertz-Picciotto, Irva; Schmidt, Rebecca J; Krakowiak, Paula

    2018-04-01

    The complexity of neurodevelopment, the rapidity of early neurogenesis, and over 100 years of research identifying environmental influences on neurodevelopment serve as backdrop to understanding factors that influence risk and severity of autism spectrum disorder (ASD). This Keynote Lecture, delivered at the May 2016 annual meeting of the International Society for Autism Research, describes concepts of causation, outlines the trajectory of research on nongenetic factors beginning in the 1960s, and briefly reviews the current state of this science. Causal concepts are introduced, including root causes; pitfalls in interpreting time trends as clues to etiologic factors; susceptible time windows for exposure; and implications of a multi-factorial model of ASD. An historical background presents early research into the origins of ASD. The epidemiologic literature from the last fifteen years is briefly but critically reviewed for potential roles of, for example, air pollution, pesticides, plastics, prenatal vitamins, lifestyle and family factors, and maternal obstetric and metabolic conditions during her pregnancy. Three examples from the case-control CHildhood Autism Risks from Genes and the Environment Study are probed to illustrate methodological approaches to central challenges in observational studies: capturing environmental exposure; causal inference when a randomized controlled clinical trial is either unethical or infeasible; and the integration of genetic, epigenetic, and environmental influences on development. We conclude with reflections on future directions, including exposomics, new technologies, the microbiome, gene-by-environment interaction in the era of -omics, and epigenetics as the interface of those two. As the environment is malleable, this research advances the goal of a productive and fulfilling life for all children, teen-agers and adults. Autism Res 2018, 11: 554-586. © 2018 International Society for Autism Research, Wiley Periodicals, Inc

  12. Theory-Based Causal Induction

    ERIC Educational Resources Information Center

    Griffiths, Thomas L.; Tenenbaum, Joshua B.

    2009-01-01

    Inducing causal relationships from observations is a classic problem in scientific inference, statistics, and machine learning. It is also a central part of human learning, and a task that people perform remarkably well given its notorious difficulties. People can learn causal structure in various settings, from diverse forms of data: observations…

  13. GENE EXPRESSION PROFILING TO IDENTIFY MECHANISMS OF MALE REPRODUCTIVE TOXICITY

    EPA Science Inventory

    Gene Expression Profiling to Identify Mechanisms of Male Reproductive Toxicity
    David J. Dix
    National Health and Environmental Effects Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27711, USA.
    Ab...

  14. Designing Effective Supports for Causal Reasoning

    ERIC Educational Resources Information Center

    Jonassen, David H.; Ionas, Ioan Gelu

    2008-01-01

    Causal reasoning represents one of the most basic and important cognitive processes that underpin all higher-order activities, such as conceptual understanding and problem solving. Hume called causality the "cement of the universe" [Hume (1739/2000). Causal reasoning is required for making predictions, drawing implications and…

  15. The selective power of causality on memory errors.

    PubMed

    Marsh, Jessecae K; Kulkofsky, Sarah

    2015-01-01

    We tested the influence of causal links on the production of memory errors in a misinformation paradigm. Participants studied a set of statements about a person, which were presented as either individual statements or pairs of causally linked statements. Participants were then provided with causally plausible and causally implausible misinformation. We hypothesised that studying information connected with causal links would promote representing information in a more abstract manner. As such, we predicted that causal information would not provide an overall protection against memory errors, but rather would preferentially help in the rejection of misinformation that was causally implausible, given the learned causal links. In two experiments, we measured whether the causal linkage of information would be generally protective against all memory errors or only selectively protective against certain types of memory errors. Causal links helped participants reject implausible memory lures, but did not protect against plausible lures. Our results suggest that causal information may promote an abstract storage of information that helps prevent only specific types of memory errors.

  16. Use of Genome Sequence Information for Meat Quality Trait QTL Mining for Causal Genes and Mutations on Pig Chromosome 17

    PubMed Central

    Hu, Zhi-Liang; Ramos, Antonio M.; Humphray, Sean J.; Rogers, Jane; Reecy, James M.; Rothschild, Max F.

    2011-01-01

    The newly available pig genome sequence has provided new information to fine map quantitative trait loci (QTL) in order to eventually identify causal variants. With targeted genomic sequencing efforts, we were able to obtain high quality BAC sequences that cover a region on pig chromosome 17 where a number of meat quality QTL have been previously discovered. Sequences from 70 BAC clones were assembled to form an 8-Mbp contig. Subsequently, we successfully mapped five previously identified QTL, three for meat color and two for lactate related traits, to the contig. With an additional 25 genetic markers that were identified by sequence comparison, we were able to carry out further linkage disequilibrium analysis to narrow down the genomic locations of these QTL, which allowed identification of the chromosomal regions that likely contain the causative variants. This research has provided one practical approach to combine genetic and molecular information for QTL mining. PMID:22303339

  17. Different Kinds of Causality in Event Cognition

    ERIC Educational Resources Information Center

    Radvansky, Gabriel A.; Tamplin, Andrea K.; Armendarez, Joseph; Thompson, Alexis N.

    2014-01-01

    Narrative memory is better for information that is more causally connected and occurs at event boundaries, such as a causal break. However, it is unclear whether there are common or distinct influences of causality. For the event boundaries that arise as a result of causal breaks, the events that follow may subsequently become more causally…

  18. Arsenic metabolism efficiency has a causal role in arsenic toxicity: Mendelian randomization and gene-environment interaction.

    PubMed

    Pierce, Brandon L; Tong, Lin; Argos, Maria; Gao, Jianjun; Farzana, Jasmine; Roy, Shantanu; Paul-Brutus, Rachelle; Rahaman, Ronald; Rakibuz-Zaman, Muhammad; Parvez, Faruque; Ahmed, Alauddin; Quasem, Iftekhar; Hore, Samar K; Alam, Shafiul; Islam, Tariqul; Harjes, Judith; Sarwar, Golam; Slavkovich, Vesna; Gamble, Mary V; Chen, Yu; Yunus, Mohammad; Rahman, Mahfuzar; Baron, John A; Graziano, Joseph H; Ahsan, Habibul

    2013-12-01

    Arsenic exposure through drinking water is a serious global health issue. Observational studies suggest that individuals who metabolize arsenic efficiently are at lower risk for toxicities such as arsenical skin lesions. Using two single nucleotide polymorphisms(SNPs) in the 10q24.32 region (near AS3MT) that show independent associations with metabolism efficiency, Mendelian randomization can be used to assess whether the association between metabolism efficiency and skin lesions is likely to be causal. Using data on 2060 arsenic-exposed Bangladeshi individuals, we estimated associations for two 10q24.32 SNPs with relative concentrations of three urinary arsenic species (representing metabolism efficiency): inorganic arsenic (iAs), monomethylarsonic acid(MMA) and dimethylarsinic acid (DMA). SNP-based predictions of iAs%, MMA% and DMA% were tested for association with skin lesion status among 2483 cases and 2857 controls. Causal odds ratios for skin lesions were 0.90 (95% confidence interval[CI]: 0.87, 0.95), 1.19 (CI: 1.10, 1.28) and 1.23 (CI: 1.12, 1.36)for a one standard deviation increase in DMA%, MMA% and iAs%,respectively. We demonstrated genotype-arsenic interaction, with metabolism-related variants showing stronger associations with skin lesion risk among individuals with high arsenic exposure (synergy index: 1.37; CI: 1.11, 1.62). We provide strong evidence for a causal relationship between arsenic metabolism efficiency and skin lesion risk. Mendelian randomization can be used to assess the causal role of arsenic exposure and metabolism in a wide array of health conditions.exposure and metabolism in a wide array of health conditions.Developing interventions that increase arsenic metabolism efficiency are likely to reduce the impact of arsenic exposure on health.

  19. Neural theory for the perception of causal actions.

    PubMed

    Fleischer, Falk; Christensen, Andrea; Caggiano, Vittorio; Thier, Peter; Giese, Martin A

    2012-07-01

    The efficient prediction of the behavior of others requires the recognition of their actions and an understanding of their action goals. In humans, this process is fast and extremely robust, as demonstrated by classical experiments showing that human observers reliably judge causal relationships and attribute interactive social behavior to strongly simplified stimuli consisting of simple moving geometrical shapes. While psychophysical experiments have identified critical visual features that determine the perception of causality and agency from such stimuli, the underlying detailed neural mechanisms remain largely unclear, and it is an open question why humans developed this advanced visual capability at all. We created pairs of naturalistic and abstract stimuli of hand actions that were exactly matched in terms of their motion parameters. We show that varying critical stimulus parameters for both stimulus types leads to very similar modulations of the perception of causality. However, the additional form information about the hand shape and its relationship with the object supports more fine-grained distinctions for the naturalistic stimuli. Moreover, we show that a physiologically plausible model for the recognition of goal-directed hand actions reproduces the observed dependencies of causality perception on critical stimulus parameters. These results support the hypothesis that selectivity for abstract action stimuli might emerge from the same neural mechanisms that underlie the visual processing of natural goal-directed action stimuli. Furthermore, the model proposes specific detailed neural circuits underlying this visual function, which can be evaluated in future experiments.

  20. A data mining paradigm for identifying key factors in biological processes using gene expression data.

    PubMed

    Li, Jin; Zheng, Le; Uchiyama, Akihiko; Bin, Lianghua; Mauro, Theodora M; Elias, Peter M; Pawelczyk, Tadeusz; Sakowicz-Burkiewicz, Monika; Trzeciak, Magdalena; Leung, Donald Y M; Morasso, Maria I; Yu, Peng

    2018-06-13

    A large volume of biological data is being generated for studying mechanisms of various biological processes. These precious data enable large-scale computational analyses to gain biological insights. However, it remains a challenge to mine the data efficiently for knowledge discovery. The heterogeneity of these data makes it difficult to consistently integrate them, slowing down the process of biological discovery. We introduce a data processing paradigm to identify key factors in biological processes via systematic collection of gene expression datasets, primary analysis of data, and evaluation of consistent signals. To demonstrate its effectiveness, our paradigm was applied to epidermal development and identified many genes that play a potential role in this process. Besides the known epidermal development genes, a substantial proportion of the identified genes are still not supported by gain- or loss-of-function studies, yielding many novel genes for future studies. Among them, we selected a top gene for loss-of-function experimental validation and confirmed its function in epidermal differentiation, proving the ability of this paradigm to identify new factors in biological processes. In addition, this paradigm revealed many key genes in cold-induced thermogenesis using data from cold-challenged tissues, demonstrating its generalizability. This paradigm can lead to fruitful results for studying molecular mechanisms in an era of explosive accumulation of publicly available biological data.

  1. How causal analysis can reveal autonomy in models of biological systems

    NASA Astrophysics Data System (ADS)

    Marshall, William; Kim, Hyunju; Walker, Sara I.; Tononi, Giulio; Albantakis, Larissa

    2017-11-01

    Standard techniques for studying biological systems largely focus on their dynamical or, more recently, their informational properties, usually taking either a reductionist or holistic perspective. Yet, studying only individual system elements or the dynamics of the system as a whole disregards the organizational structure of the system-whether there are subsets of elements with joint causes or effects, and whether the system is strongly integrated or composed of several loosely interacting components. Integrated information theory offers a theoretical framework to (1) investigate the compositional cause-effect structure of a system and to (2) identify causal borders of highly integrated elements comprising local maxima of intrinsic cause-effect power. Here we apply this comprehensive causal analysis to a Boolean network model of the fission yeast (Schizosaccharomyces pombe) cell cycle. We demonstrate that this biological model features a non-trivial causal architecture, whose discovery may provide insights about the real cell cycle that could not be gained from holistic or reductionist approaches. We also show how some specific properties of this underlying causal architecture relate to the biological notion of autonomy. Ultimately, we suggest that analysing the causal organization of a system, including key features like intrinsic control and stable causal borders, should prove relevant for distinguishing life from non-life, and thus could also illuminate the origin of life problem. This article is part of the themed issue 'Reconceptualizing the origins of life'.

  2. Causal essentialism in kinds.

    PubMed

    Ahn, Woo-kyoung; Taylor, Eric G; Kato, Daniel; Marsh, Jessecae K; Bloom, Paul

    2013-06-01

    The current study examines causal essentialism, derived from psychological essentialism of concepts. We examine whether people believe that members of a category share some underlying essence that is both necessary and sufficient for category membership and that also causes surface features. The main claim is that causal essentialism is restricted to categories that correspond to our intuitive notions of existing kinds and hence is more attenuated for categories that are based on arbitrary criteria. Experiments 1 and 3 found that people overtly endorse causal essences in nonarbitrary kinds but are less likely to do so for arbitrary categories. Experiments 2 and 4 found that people were more willing to generalize a member's known causal relations (or lack thereof) when dealing with a kind than when dealing with an arbitrary category. These differences between kinds and arbitrary categories were found across various domains-not only for categories of living things, but also for artefacts. These findings have certain real-world implications, including how people make sense of mental disorders that are treated as real kinds.

  3. X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes.

    PubMed

    Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M

    2016-01-01

    X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4(-/-) mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases.

  4. X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes

    PubMed Central

    Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M

    2016-01-01

    X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4−/− mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases. PMID:25644381

  5. Using genetics to test the causal relationship of total adiposity and periodontitis: Mendelian randomization analyses in the Gene-Lifestyle Interactions and Dental Endpoints (GLIDE) Consortium

    PubMed Central

    Shungin, Dmitry; Cornelis, Marilyn C; Divaris, Kimon; Holtfreter, Birte; Shaffer, John R; Yu, Yau-Hua; Barros, Silvana P; Beck, James D; Biffar, Reiner; Boerwinkle, Eric A; Crout, Richard J.; Ganna, Andrea; Hallmans, Goran; Hindy, George; Hu, Frank B; Kraft, Peter; McNeil, Daniel W; Melander, Olle; Moss, Kevin L; North, Kari E; Orho-Melander, Marju; Pedersen, Nancy L; Ridker, Paul M; Rimm, Eric B; Rose, Lynda M; Rukh, Gull; Teumer, Alexander; Weyant, Robert J; Chasman, Daniel I; Joshipura, Kaumudi; Kocher, Thomas; Magnusson, Patrik KE; Marazita, Mary L; Nilsson, Peter; Offenbacher, Steve; Davey Smith, George; Lundberg, Pernilla; Palmer, Tom M; Timpson, Nicholas J; Johansson, Ingegerd; Franks, Paul W

    2015-01-01

    Background: The observational relationship between obesity and periodontitis is widely known, yet causal evidence is lacking. Our objective was to investigate causal associations between periodontitis and body mass index (BMI). Methods: We performed Mendelian randomization analyses with BMI-associated loci combined in a genetic risk score (GRS) as the instrument for BMI. All analyses were conducted within the Gene-Lifestyle Interactions and Dental Endpoints (GLIDE) Consortium in 13 studies from Europe and the USA, including 49 066 participants with clinically assessed (seven studies, 42.1% of participants) and self-reported (six studies, 57.9% of participants) periodontitis and genotype data (17 672/31 394 with/without periodontitis); 68 761 participants with BMI and genotype data; and 57 871 participants (18 881/38 990 with/without periodontitis) with data on BMI and periodontitis. Results: In the observational meta-analysis of all participants, the pooled crude observational odds ratio (OR) for periodontitis was 1.13 [95% confidence interval (CI): 1.03, 1.24] per standard deviation increase of BMI. Controlling for potential confounders attenuated this estimate (OR = 1.08; 95% CI:1.03, 1.12). For clinically assessed periodontitis, corresponding ORs were 1.25 (95% CI: 1.10, 1.42) and 1.13 (95% CI: 1.10, 1.17), respectively. In the genetic association meta-analysis, the OR for periodontitis was 1.01 (95% CI: 0.99, 1.03) per GRS unit (per one effect allele) in all participants and 1.00 (95% CI: 0.97, 1.03) in participants with clinically assessed periodontitis. The instrumental variable meta-analysis of all participants yielded an OR of 1.05 (95% CI: 0.80, 1.38) per BMI standard deviation, and 0.90 (95% CI: 0.56, 1.46) in participants with clinical data. Conclusions: Our study does not support total adiposity as a causal risk factor for periodontitis, as the point estimate is very close to the null in the causal inference analysis, with wide

  6. A Genome-Wide Knockout Screen to Identify Genes Involved in Acquired Carboplatin Resistance

    DTIC Science & Technology

    2016-07-01

    library screen to identify genes that when knocked out render human ovarian cells > 2.5-fold resistant to CBDCA; 2) Validate the ability of...a GeCKOv2 library screen to identify genes that when knocked out render human ovarian cells > 2.5-fold resistant to CBDCA; 2) validate the ability of...resistance in either cell lines or clinical samples. The CRIPSR-cas9 technology now provides us with a major new tool to introduce knock out mutations

  7. A computational approach to identify cellular heterogeneity and tissue-specific gene regulatory networks.

    PubMed

    Jambusaria, Ankit; Klomp, Jeff; Hong, Zhigang; Rafii, Shahin; Dai, Yang; Malik, Asrar B; Rehman, Jalees

    2018-06-07

    The heterogeneity of cells across tissue types represents a major challenge for studying biological mechanisms as well as for therapeutic targeting of distinct tissues. Computational prediction of tissue-specific gene regulatory networks may provide important insights into the mechanisms underlying the cellular heterogeneity of cells in distinct organs and tissues. Using three pathway analysis techniques, gene set enrichment analysis (GSEA), parametric analysis of gene set enrichment (PGSEA), alongside our novel model (HeteroPath), which assesses heterogeneously upregulated and downregulated genes within the context of pathways, we generated distinct tissue-specific gene regulatory networks. We analyzed gene expression data derived from freshly isolated heart, brain, and lung endothelial cells and populations of neurons in the hippocampus, cingulate cortex, and amygdala. In both datasets, we found that HeteroPath segregated the distinct cellular populations by identifying regulatory pathways that were not identified by GSEA or PGSEA. Using simulated datasets, HeteroPath demonstrated robustness that was comparable to what was seen using existing gene set enrichment methods. Furthermore, we generated tissue-specific gene regulatory networks involved in vascular heterogeneity and neuronal heterogeneity by performing motif enrichment of the heterogeneous genes identified by HeteroPath and linking the enriched motifs to regulatory transcription factors in the ENCODE database. HeteroPath assesses contextual bidirectional gene expression within pathways and thus allows for transcriptomic assessment of cellular heterogeneity. Unraveling tissue-specific heterogeneity of gene expression can lead to a better understanding of the molecular underpinnings of tissue-specific phenotypes.

  8. Constitutional epimutation as a mechanism for cancer causality and heritability?

    PubMed

    Hitchins, Megan P

    2015-10-01

    Constitutional epimutation, which is an aberration in gene expression due to an altered epigenotype that is widely distributed in normal tissues (albeit frequently mosaic), provides an alternative mechanism to genetic mutation for cancer predisposition. Observational studies in cancer-affected families have revealed intergenerational inheritance of constitutional epimutation, providing unique insights into the heritability of epigenetic traits in humans. In this Opinion article, the potential contribution of constitutional epimutation to the 'missing' causality and heritability of cancer is explored.

  9. Expectations and Interpretations during Causal Learning

    ERIC Educational Resources Information Center

    Luhmann, Christian C.; Ahn, Woo-kyoung

    2011-01-01

    In existing models of causal induction, 4 types of covariation information (i.e., presence/absence of an event followed by presence/absence of another event) always exert identical influences on causal strength judgments (e.g., joint presence of events always suggests a generative causal relationship). In contrast, we suggest that, due to…

  10. Representing Personal Determinants in Causal Structures.

    ERIC Educational Resources Information Center

    Bandura, Albert

    1984-01-01

    Responds to Staddon's critique of the author's earlier article and addresses issues raised by Staddon's (1984) alternative models of causality. The author argues that it is not the formalizability of causal processes that is the issue but whether cognitive determinants of behavior are reducible to past stimulus inputs in causal structures.…

  11. Identifying candidate driver genes by integrative ovarian cancer genomics data

    NASA Astrophysics Data System (ADS)

    Lu, Xinguo; Lu, Jibo

    2017-08-01

    Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.

  12. Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network.

    PubMed

    Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

    2016-05-05

    Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer.

  13. Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network

    PubMed Central

    Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

    2016-01-01

    Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer. PMID:27149165

  14. A whole-blood transcriptome meta-analysis identifies gene expression signatures of cigarette smoking

    PubMed Central

    Huan, Tianxiao; Joehanes, Roby; Schurmann, Claudia; Schramm, Katharina; Pilling, Luke C.; Peters, Marjolein J.; Mägi, Reedik; DeMeo, Dawn; O'Connor, George T.; Ferrucci, Luigi; Teumer, Alexander; Homuth, Georg; Biffar, Reiner; Völker, Uwe; Herder, Christian; Waldenberger, Melanie; Peters, Annette; Zeilinger, Sonja; Metspalu, Andres; Hofman, Albert; Uitterlinden, André G.; Hernandez, Dena G.; Singleton, Andrew B.; Bandinelli, Stefania; Munson, Peter J.; Lin, Honghuang; Benjamin, Emelia J.; Esko, Tõnu; Grabe, Hans J.; Prokisch, Holger; van Meurs, Joyce B.J.; Melzer, David; Levy, Daniel

    2016-01-01

    Abstract Cigarette smoking is a leading modifiable cause of death worldwide. We hypothesized that cigarette smoking induces extensive transcriptomic changes that lead to target-organ damage and smoking-related diseases. We performed a meta-analysis of transcriptome-wide gene expression using whole blood-derived RNA from 10,233 participants of European ancestry in six cohorts (including 1421 current and 3955 former smokers) to identify associations between smoking and altered gene expression levels. At a false discovery rate (FDR) <0.1, we identified 1270 differentially expressed genes in current vs. never smokers, and 39 genes in former vs. never smokers. Expression levels of 12 genes remained elevated up to 30 years after smoking cessation, suggesting that the molecular consequence of smoking may persist for decades. Gene ontology analysis revealed enrichment of smoking-related genes for activation of platelets and lymphocytes, immune response, and apoptosis. Many of the top smoking-related differentially expressed genes, including LRRN3 and GPR15, have DNA methylation loci in promoter regions that were recently reported to be hypomethylated among smokers. By linking differential gene expression with smoking-related disease phenotypes, we demonstrated that stroke and pulmonary function show enrichment for smoking-related gene expression signatures. Mediation analysis revealed the expression of several genes (e.g. ALAS2) to be putative mediators of the associations between smoking and inflammatory biomarkers (IL6 and C-reactive protein levels). Our transcriptomic study provides potential insights into the effects of cigarette smoking on gene expression in whole blood and their relations to smoking-related diseases. The results of such analyses may highlight attractive targets for treating or preventing smoking-related health effects. PMID:28158590

  15. Repeated Causal Decision Making

    ERIC Educational Resources Information Center

    Hagmayer, York; Meder, Bjorn

    2013-01-01

    Many of our decisions refer to actions that have a causal impact on the external environment. Such actions may not only allow for the mere learning of expected values or utilities but also for acquiring knowledge about the causal structure of our world. We used a repeated decision-making paradigm to examine what kind of knowledge people acquire in…

  16. Causality in Classical Electrodynamics

    ERIC Educational Resources Information Center

    Savage, Craig

    2012-01-01

    Causality in electrodynamics is a subject of some confusion, especially regarding the application of Faraday's law and the Ampere-Maxwell law. This has led to the suggestion that we should not teach students that electric and magnetic fields can cause each other, but rather focus on charges and currents as the causal agents. In this paper I argue…

  17. Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

    PubMed

    Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

    1998-08-01

    The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.

  18. Gene-environment interaction involving recently identified colorectal cancer susceptibility loci

    PubMed Central

    Kantor, Elizabeth D.; Hutter, Carolyn M.; Minnier, Jessica; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cotterchio, Michelle; Du, Mengmeng; Duggan, David; Fuchs, Charles S.; Giovannucci, Edward L.; Gong, Jian; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Jenkins, Mark A.; Jiao, Shuo; Kolonel, Laurence N.; Le Marchand, Loic; Lemire, Mathieu; Ma, Jing; Newcomb, Polly A.; Ochs-Balcom, Heather M.; Pflugeisen, Bethann M.; Potter, John D.; Rudolph, Anja; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; Stelling, Deanna L.; Thomas, Fridtjof; Thornquist, Mark; Ulrich, Cornelia M.; Warnick, Greg S.; Zanke, Brent W.; Peters, Ulrike; Hsu, Li; White, Emily

    2014-01-01

    BACKGROUND Genome-wide association studies have identified several single nucleotide polymorphisms (SNPs) that are associated with risk of colorectal cancer (CRC). Prior research has evaluated the presence of gene-environment interaction involving the first 10 identified susceptibility loci, but little work has been conducted on interaction involving SNPs at recently identified susceptibility loci, including: rs10911251, rs6691170, rs6687758, rs11903757, rs10936599, rs647161, rs1321311, rs719725, rs1665650, rs3824999, rs7136702, rs11169552, rs59336, rs3217810, rs4925386, and rs2423279. METHODS Data on 9160 cases and 9280 controls from the Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) and Colon Cancer Family Registry (CCFR) were used to evaluate the presence of interaction involving the above-listed SNPs and sex, body mass index (BMI), alcohol consumption, smoking, aspirin use, post-menopausal hormone (PMH) use, as well as intake of dietary calcium, dietary fiber, dietary folate, red meat, processed meat, fruit, and vegetables. Interaction was evaluated using a fixed-effects meta-analysis of an efficient Empirical Bayes estimator, and permutation was used to account for multiple comparisons. RESULTS None of the permutation-adjusted p-values reached statistical significance. CONCLUSIONS The associations between recently identified genetic susceptibility loci and CRC are not strongly modified by sex, BMI, alcohol, smoking, aspirin, PMH use, and various dietary factors. IMPACT Results suggest no evidence of strong gene-environment interactions involving the recently identified 16 susceptibility loci for CRC taken one at a time. PMID:24994789

  19. LGscore: A method to identify disease-related genes using biological literature and Google data.

    PubMed

    Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun

    2015-04-01

    Since the genome project in 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Genetically Determined Susceptibility to Tuberculosis in Mice Causally Involves Accelerated and Enhanced Recruitment of Granulocytes

    PubMed Central

    Keller, Christine; Hoffmann, Reinhard; Lang, Roland; Brandau, Sven; Hermann, Corinna; Ehlers, Stefan

    2006-01-01

    Classical twin studies and recent linkage analyses of African populations have revealed a potential involvement of host genetic factors in susceptibility or resistance to Mycobacterium tuberculosis infection. In order to identify the candidate genes involved and test their causal implication, we capitalized on the mouse model of tuberculosis, since inbred mouse strains also differ substantially in their susceptibility to infection. Two susceptible and two resistant mouse strains were aerogenically infected with 1,000 CFU of M. tuberculosis, and the regulation of gene expression was examined by Affymetrix GeneChip U74A array with total lung RNA 2 and 4 weeks postinfection. Four weeks after infection, 96 genes, many of which are involved in inflammatory cell recruitment and activation, were regulated in common. One hundred seven genes were differentially regulated in susceptible mouse strains, whereas 43 genes were differentially expressed only in resistant mice. Data mining revealed a bias towards the expression of genes involved in granulocyte pathophysiology in susceptible mice, such as an upregulation of those for the neutrophil chemoattractant LIX (CXCL5), interleukin 17 receptor, phosphoinositide kinase 3 delta, or gamma interferon-inducible protein 10. Following M. tuberculosis challenge in both airways or peritoneum, granulocytes were recruited significantly faster and at higher numbers in susceptible than in resistant mice. When granulocytes were efficiently depleted by either of two regimens at the onset of infection, only susceptible mice survived aerosol challenge with M. tuberculosis significantly longer than control mice. We conclude that initially enhanced recruitment of granulocytes contributes to susceptibility to tuberculosis. PMID:16790804

  1. Identifying candidate genes affecting developmental time in Drosophila melanogaster: pervasive pleiotropy and gene-by-environment interaction

    PubMed Central

    Mensch, Julián; Lavagnino, Nicolás; Carreira, Valeria Paula; Massaldi, Ana; Hasson, Esteban; Fanara, Juan José

    2008-01-01

    Background Understanding the genetic architecture of ecologically relevant adaptive traits requires the contribution of developmental and evolutionary biology. The time to reach the age of reproduction is a complex life history trait commonly known as developmental time. In particular, in holometabolous insects that occupy ephemeral habitats, like fruit flies, the impact of developmental time on fitness is further exaggerated. The present work is one of the first systematic studies of the genetic basis of developmental time, in which we also evaluate the impact of environmental variation on the expression of the trait. Results We analyzed 179 co-isogenic single P[GT1]-element insertion lines of Drosophila melanogaster to identify novel genes affecting developmental time in flies reared at 25°C. Sixty percent of the lines showed a heterochronic phenotype, suggesting that a large number of genes affect this trait. Mutant lines for the genes Merlin and Karl showed the most extreme phenotypes exhibiting a developmental time reduction and increase, respectively, of over 2 days and 4 days relative to the control (a co-isogenic P-element insertion free line). In addition, a subset of 42 lines selected at random from the initial set of 179 lines was screened at 17°C. Interestingly, the gene-by-environment interaction accounted for 52% of total phenotypic variance. Plastic reaction norms were found for a large number of developmental time candidate genes. Conclusion We identified components of several integrated time-dependent pathways affecting egg-to-adult developmental time in Drosophila. At the same time, we also show that many heterochronic phenotypes may arise from changes in genes involved in several developmental mechanisms that do not explicitly control the timing of specific events. We also demonstrate that many developmental time genes have pleiotropic effects on several adult traits and that the action of most of them is sensitive to temperature during

  2. The application of artificial intelligence to microarray data: identification of a novel gene signature to identify bladder cancer progression.

    PubMed

    Catto, James W F; Abbod, Maysam F; Wild, Peter J; Linkens, Derek A; Pilarsky, Christian; Rehman, Ishtiaq; Rosario, Derek J; Denzinger, Stefan; Burger, Maximilian; Stoehr, Robert; Knuechel, Ruth; Hartmann, Arndt; Hamdy, Freddie C

    2010-03-01

    New methods for identifying bladder cancer (BCa) progression are required. Gene expression microarrays can reveal insights into disease biology and identify novel biomarkers. However, these experiments produce large datasets that are difficult to interpret. To develop a novel method of microarray analysis combining two forms of artificial intelligence (AI): neurofuzzy modelling (NFM) and artificial neural networks (ANN) and validate it in a BCa cohort. We used AI and statistical analyses to identify progression-related genes in a microarray dataset (n=66 tumours, n=2800 genes). The AI-selected genes were then investigated in a second cohort (n=262 tumours) using immunohistochemistry. We compared the accuracy of AI and statistical approaches to identify tumour progression. AI identified 11 progression-associated genes (odds ratio [OR]: 0.70; 95% confidence interval [CI], 0.56-0.87; p=0.0004), and these were more discriminate than genes chosen using statistical analyses (OR: 1.24; 95% CI, 0.96-1.60; p=0.09). The expression of six AI-selected genes (LIG3, FAS, KRT18, ICAM1, DSG2, and BRCA2) was determined using commercial antibodies and successfully identified tumour progression (concordance index: 0.66; log-rank test: p=0.01). AI-selected genes were more discriminate than pathologic criteria at determining progression (Cox multivariate analysis: p=0.01). Limitations include the use of statistical correlation to identify 200 genes for AI analysis and that we did not compare regression identified genes with immunohistochemistry. AI and statistical analyses use different techniques of inference to determine gene-phenotype associations and identify distinct prognostic gene signatures that are equally valid. We have identified a prognostic gene signature whose members reflect a variety of carcinogenic pathways that could identify progression in non-muscle-invasive BCa. 2009 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  3. Kant on causal laws and powers.

    PubMed

    Henschen, Tobias

    2014-12-01

    The aim of the paper is threefold. Its first aim is to defend Eric Watkins's claim that for Kant, a cause is not an event but a causal power: a power that is borne by a substance, and that, when active, brings about its effect, i.e. a change of the states of another substance, by generating a continuous flow of intermediate states of that substance. The second aim of the paper is to argue against Watkins that the Kantian concept of causal power is not the pre-critical concept of real ground but the category of causality, and that Kant holds with Hume that causal laws cannot be inferred non-inductively (that he accordingly has no intention to show in the Second analogy or elsewhere that events fall under causal laws). The third aim of the paper is to compare the Kantian position on causality with central tenets of contemporary powers ontology: it argues that unlike the variants endorsed by contemporary powers theorists, the Kantian variants of these tenets are resistant to objections that neo-Humeans raise to these tenets.

  4. A hierarchical causal taxonomy of psychopathology across the life span.

    PubMed

    Lahey, Benjamin B; Krueger, Robert F; Rathouz, Paul J; Waldman, Irwin D; Zald, David H

    2017-02-01

    We propose a taxonomy of psychopathology based on patterns of shared causal influences identified in a review of multivariate behavior genetic studies that distinguish genetic and environmental influences that are either common to multiple dimensions of psychopathology or unique to each dimension. At the phenotypic level, first-order dimensions are defined by correlations among symptoms; correlations among first-order dimensions similarly define higher-order domains (e.g., internalizing or externalizing psychopathology). We hypothesize that the robust phenotypic correlations among first-order dimensions reflect a hierarchy of increasingly specific etiologic influences . Some nonspecific etiologic factors increase risk for all first-order dimensions of psychopathology to varying degrees through a general factor of psychopathology. Other nonspecific etiologic factors increase risk only for all first-order dimensions within a more specific higher-order domain. Furthermore, each first-order dimension has its own unique causal influences. Genetic and environmental influences common to family members tend to be nonspecific, whereas environmental influences unique to each individual are more dimension-specific. We posit that these causal influences on psychopathology are moderated by sex and developmental processes. This causal taxonomy also provides a novel framework for understanding the heterogeneity of each first-order dimension: Different persons exhibiting similar symptoms may be influenced by different combinations of etiologic influences from each of the 3 levels of the etiologic hierarchy. Furthermore, we relate the proposed causal taxonomy to transdimensional psychobiological processes, which also impact the heterogeneity of each psychopathology dimension. This causal taxonomy implies the need for changes in strategies for studying the etiology, psychobiology, prevention, and treatment of psychopathology. (PsycINFO Database Record (c) 2017 APA, all rights

  5. An assessment of predominant causal factors of pilot deviations that contribute to runway incursions

    NASA Astrophysics Data System (ADS)

    Campbell, Denado M.

    The aim of this study was to identify predominant causal factors of pilot deviations in runway incursions over a two-year period. Runway incursion reports were obtained from NASA's Aviation Safety Reporting System (ASRS), and a qualitative method was used by classifying and coding each report to a specific causal factor(s). The causal factors that were used were substantiated by research from the Aircraft Owner's and Pilot's Association that found that these causal factors were the most common in runway incursion incidents and accidents. An additional causal factor was also utilized to determine the significance of pilot training in relation to runway incursions. From the reports examined, it was found that miscommunication and situational awareness have the greatest impact on pilots and are most often the major causes of runway incursions. This data can be used to assist airports, airlines, and the FAA to understand trends in pilot deviations, and to find solutions for specific problem areas in runway incursion incidents.

  6. Comprehensive Ex Vivo Transposon Mutagenesis Identifies Genes That Promote Growth Factor Independence and Leukemogenesis.

    PubMed

    Guo, Yabin; Updegraff, Barrett L; Park, Sunho; Durakoglugil, Deniz; Cruz, Victoria H; Maddux, Sarah; Hwang, Tae Hyun; O'Donnell, Kathryn A

    2016-02-15

    Aberrant signaling through cytokine receptors and their downstream signaling pathways is a major oncogenic mechanism underlying hematopoietic malignancies. To better understand how these pathways become pathologically activated and to potentially identify new drivers of hematopoietic cancers, we developed a high-throughput functional screening approach using ex vivo mutagenesis with the Sleeping Beauty transposon. We analyzed over 1,100 transposon-mutagenized pools of Ba/F3 cells, an IL3-dependent pro-B-cell line, which acquired cytokine independence and tumor-forming ability. Recurrent transposon insertions could be mapped to genes in the JAK/STAT and MAPK pathways, confirming the ability of this strategy to identify known oncogenic components of cytokine signaling pathways. In addition, recurrent insertions were identified in a large set of genes that have been found to be mutated in leukemia or associated with survival, but were not previously linked to the JAK/STAT or MAPK pathways nor shown to functionally contribute to leukemogenesis. Forced expression of these novel genes resulted in IL3-independent growth in vitro and tumorigenesis in vivo, validating this mutagenesis-based approach for identifying new genes that promote cytokine signaling and leukemogenesis. Therefore, our findings provide a broadly applicable approach for classifying functionally relevant genes in diverse malignancies and offer new insights into the impact of cytokine signaling on leukemia development. ©2015 American Association for Cancer Research.

  7. Detecting dynamic causal inference in nonlinear two-phase fracture flow

    NASA Astrophysics Data System (ADS)

    Faybishenko, Boris

    2017-08-01

    Identifying dynamic causal inference involved in flow and transport processes in complex fractured-porous media is generally a challenging task, because nonlinear and chaotic variables may be positively coupled or correlated for some periods of time, but can then become spontaneously decoupled or non-correlated. In his 2002 paper (Faybishenko, 2002), the author performed a nonlinear dynamical and chaotic analysis of time-series data obtained from the fracture flow experiment conducted by Persoff and Pruess (1995), and, based on the visual examination of time series data, hypothesized that the observed pressure oscillations at both inlet and outlet edges of the fracture result from a superposition of both forward and return waves of pressure propagation through the fracture. In the current paper, the author explores an application of a combination of methods for detecting nonlinear chaotic dynamics behavior along with the multivariate Granger Causality (G-causality) time series test. Based on the G-causality test, the author infers that his hypothesis is correct, and presents a causation loop diagram of the spatial-temporal distribution of gas, liquid, and capillary pressures measured at the inlet and outlet of the fracture. The causal modeling approach can be used for the analysis of other hydrological processes, for example, infiltration and pumping tests in heterogeneous subsurface media, and climatic processes, for example, to find correlations between various meteorological parameters, such as temperature, solar radiation, barometric pressure, etc.

  8. CADDIS Volume 1. Stressor Identification: About Causal Assessment

    EPA Pesticide Factsheets

    An introduction to the history of our approach to causal assessment, A chronology of causal history and philosophy, An introduction to causal history and philosophy, References for the Causal Assessment Background section of Stressor Identification

  9. Selection on plant male function genes identifies candidates for reproductive isolation of yellow monkeyflowers.

    PubMed

    Aagaard, Jan E; George, Renee D; Fishman, Lila; Maccoss, Michael J; Swanson, Willie J

    2013-01-01

    Understanding the genetic basis of reproductive isolation promises insight into speciation and the origins of biological diversity. While progress has been made in identifying genes underlying barriers to reproduction that function after fertilization (post-zygotic isolation), we know much less about earlier acting pre-zygotic barriers. Of particular interest are barriers involved in mating and fertilization that can evolve extremely rapidly under sexual selection, suggesting they may play a prominent role in the initial stages of reproductive isolation. A significant challenge to the field of speciation genetics is developing new approaches for identification of candidate genes underlying these barriers, particularly among non-traditional model systems. We employ powerful proteomic and genomic strategies to study the genetic basis of conspecific pollen precedence, an important component of pre-zygotic reproductive isolation among yellow monkeyflowers (Mimulus spp.) resulting from male pollen competition. We use isotopic labeling in combination with shotgun proteomics to identify more than 2,000 male function (pollen tube) proteins within maternal reproductive structures (styles) of M. guttatus flowers where pollen competition occurs. We then sequence array-captured pollen tube exomes from a large outcrossing population of M. guttatus, and identify those genes with evidence of selective sweeps or balancing selection consistent with their role in pollen competition. We also test for evidence of positive selection on these genes more broadly across yellow monkeyflowers, because a signal of adaptive divergence is a common feature of genes causing reproductive isolation. Together the molecular evolution studies identify 159 pollen tube proteins that are candidate genes for conspecific pollen precedence. Our work demonstrates how powerful proteomic and genomic tools can be readily adapted to non-traditional model systems, allowing for genome-wide screens towards the

  10. Comparative Transcriptional Profiling of the Axolotl Limb Identifies a Tripartite Regeneration-Specific Gene Program

    PubMed Central

    Knapp, Dunja; Schulz, Herbert; Rascon, Cynthia Alexander; Volkmer, Michael; Scholz, Juliane; Nacu, Eugen; Le, Mu; Novozhilov, Sergey; Tazaki, Akira; Protze, Stephanie; Jacob, Tina; Hubner, Norbert; Habermann, Bianca; Tanaka, Elly M.

    2013-01-01

    Understanding how the limb blastema is established after the initial wound healing response is an important aspect of regeneration research. Here we performed parallel expression profile time courses of healing lateral wounds versus amputated limbs in axolotl. This comparison between wound healing and regeneration allowed us to identify amputation-specific genes. By clustering the expression profiles of these samples, we could detect three distinguishable phases of gene expression – early wound healing followed by a transition-phase leading to establishment of the limb development program, which correspond to the three phases of limb regeneration that had been defined by morphological criteria. By focusing on the transition-phase, we identified 93 strictly amputation-associated genes many of which are implicated in oxidative-stress response, chromatin modification, epithelial development or limb development. We further classified the genes based on whether they were or were not significantly expressed in the developing limb bud. The specific localization of 53 selected candidates within the blastema was investigated by in situ hybridization. In summary, we identified a set of genes that are expressed specifically during regeneration and are therefore, likely candidates for the regulation of blastema formation. PMID:23658691

  11. Genome-wide gene by lead exposure interaction analysis identifies UNC5D as a candidate gene for neurodevelopment.

    PubMed

    Wang, Zhaoxi; Claus Henn, Birgit; Wang, Chaolong; Wei, Yongyue; Su, Li; Sun, Ryan; Chen, Han; Wagner, Peter J; Lu, Quan; Lin, Xihong; Wright, Robert; Bellinger, David; Kile, Molly; Mazumdar, Maitreyi; Tellez-Rojo, Martha Maria; Schnaas, Lourdes; Christiani, David C

    2017-07-28

    Neurodevelopment is a complex process involving both genetic and environmental factors. Prenatal exposure to lead (Pb) has been associated with lower performance on neurodevelopmental tests. Adverse neurodevelopmental outcomes are more frequent and/or more severe when toxic exposures interact with genetic susceptibility. To explore possible loci associated with increased susceptibility to prenatal Pb exposure, we performed a genome-wide gene-environment interaction study (GWIS) in young children from Mexico (n = 390) and Bangladesh (n = 497). Prenatal Pb exposure was estimated by cord blood Pb concentration. Neurodevelopment was assessed using the Bayley Scales of Infant Development. We identified a locus on chromosome 8, containing UNC5D, and demonstrated evidence of its genome-wide significance with mental composite scores (rs9642758, p meta  = 4.35 × 10 -6 ). Within this locus, the joint effects of two independent single nucleotide polymorphisms (SNPs, rs9642758 and rs10503970) had a p-value of 4.38 × 10 -9 for mental composite scores. Correlating GWIS results with in vitro transcriptomic profiles identified one common gene, SLC1A5, which is involved in synaptic function, neuronal development, and excitotoxicity. Further analysis revealed interconnected interactions that formed a large network of 52 genes enriched with oxidative stress genes and neurodevelopmental genes. Our findings suggest that certain genetic polymorphisms within/near genes relevant to neurodevelopment might modify the toxic effects of Pb exposure via oxidative stress.

  12. Preferential Allele Expression Analysis Identifies Shared Germline and Somatic Driver Genes in Advanced Ovarian Cancer

    PubMed Central

    Halabi, Najeeb M.; Martinez, Alejandra; Al-Farsi, Halema; Mery, Eliane; Puydenus, Laurence; Pujol, Pascal; Khalak, Hanif G.; McLurcan, Cameron; Ferron, Gwenael; Querleu, Denis; Al-Azwani, Iman; Al-Dous, Eman; Mohamoud, Yasmin A.; Malek, Joel A.; Rafii, Arash

    2016-01-01

    Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. PMID:26735499

  13. An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data.

    PubMed

    Liu, Jian; Cheng, Yuhu; Wang, Xuesong; Zhang, Lin; Liu, Hui

    2017-08-17

    It is urgent to diagnose colorectal cancer in the early stage. Some feature genes which are important to colorectal cancer development have been identified. However, for the early stage of colorectal cancer, less is known about the identity of specific cancer genes that are associated with advanced clinical stage. In this paper, we conducted a feature extraction method named Optimal Mean based Block Robust Feature Extraction method (OMBRFE) to identify feature genes associated with advanced colorectal cancer in clinical stage by using the integrated colorectal cancer data. Firstly, based on the optimal mean and L 2,1 -norm, a novel feature extraction method called Optimal Mean based Robust Feature Extraction method (OMRFE) is proposed to identify feature genes. Then the OMBRFE method which introduces the block ideology into OMRFE method is put forward to process the colorectal cancer integrated data which includes multiple genomic data: copy number alterations, somatic mutations, methylation expression alteration, as well as gene expression changes. Experimental results demonstrate that the OMBRFE is more effective than previous methods in identifying the feature genes. Moreover, genes identified by OMBRFE are verified to be closely associated with advanced colorectal cancer in clinical stage.

  14. Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association

    PubMed Central

    Grinde, Kelsey E.; Arbet, Jaron; Green, Alden; O'Connell, Michael; Valcarcel, Alessandra; Westra, Jason; Tintle, Nathan

    2017-01-01

    To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as “winner's curse.” We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p < 2.2 × 10−6) and, consequently, substantially improves mean squared error and variant prioritization/ranking. The method is particularly helpful in adjustment for winner's curse effects when the initial gene-based test has low power and for relatively more common, non-causal variants. Adjustment for winner's curse is recommended for all post-hoc estimation and ranking of variants after a gene-based test. Further work is necessary to continue seeking ways to reduce bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures. PMID:28959274

  15. Causal inference in economics and marketing.

    PubMed

    Varian, Hal R

    2016-07-05

    This is an elementary introduction to causal inference in economics written for readers familiar with machine learning methods. The critical step in any causal analysis is estimating the counterfactual-a prediction of what would have happened in the absence of the treatment. The powerful techniques used in machine learning may be useful for developing better estimates of the counterfactual, potentially improving causal inference.

  16. The Causality of Evolution on Different Fitness Landscapes

    NASA Astrophysics Data System (ADS)

    Vyawahare, Saurabh; Austin, Robert; Zhang, Qiucen; Kim, Hyunsung; Bestoso, John

    2013-03-01

    Evolution of antibiotic resistance is a growing problem. One major reason why most antibiotics fail is because of mutations on drug targets (e.g. essential enzymes). Sequencing of clinically resistant isolates have shown that multiple mutational-hotspots exist in coding regions, which could potentially prohibit the binding of drugs. However, it is not clear whether the appearance of each mutation is random or influenced by other factors. In this paper, we compare evolution of resistance to ciprofloxacin from two distinct but well characterized genetic backgrounds. By combining our recently developed evolution reactor and deep whole-genome sequencing, we show different alleles of σs factor lead to fixation of different mutations in gyrA gene that confer ciprofloxacin resistance to bacteria Escherichia coli. Such causality of evolution in different genes provides an opportunity to control the evolution of antibiotic resistance. Sponsored by the NCI/NIH Physical Sciences Oncology Centers

  17. Exome sequencing in amyotrophic lateral sclerosis identifies risk genes and pathways.

    PubMed

    Cirulli, Elizabeth T; Lasseigne, Brittany N; Petrovski, Slavé; Sapp, Peter C; Dion, Patrick A; Leblond, Claire S; Couthouis, Julien; Lu, Yi-Fan; Wang, Quanli; Krueger, Brian J; Ren, Zhong; Keebler, Jonathan; Han, Yujun; Levy, Shawn E; Boone, Braden E; Wimbish, Jack R; Waite, Lindsay L; Jones, Angela L; Carulli, John P; Day-Williams, Aaron G; Staropoli, John F; Xin, Winnie W; Chesi, Alessandra; Raphael, Alya R; McKenna-Yasek, Diane; Cady, Janet; Vianney de Jong, J M B; Kenna, Kevin P; Smith, Bradley N; Topp, Simon; Miller, Jack; Gkazi, Athina; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan; Silani, Vincenzo; Ticozzi, Nicola; Shaw, Christopher E; Baloh, Robert H; Appel, Stanley; Simpson, Ericka; Lagier-Tourenne, Clotilde; Pulst, Stefan M; Gibson, Summer; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Grossman, Murray; Shneider, Neil A; Chung, Wendy K; Ravits, John M; Glass, Jonathan D; Sims, Katherine B; Van Deerlin, Vivianna M; Maniatis, Tom; Hayes, Sebastian D; Ordureau, Alban; Swarup, Sharan; Landers, John; Baas, Frank; Allen, Andrew S; Bedlack, Richard S; Harper, J Wade; Gitler, Aaron D; Rouleau, Guy A; Brown, Robert; Harms, Matthew B; Cooper, Gregory M; Harris, Tim; Myers, Richard M; Goldstein, David B

    2015-03-27

    Amyotrophic lateral sclerosis (ALS) is a devastating neurological disease with no effective treatment. We report the results of a moderate-scale sequencing study aimed at increasing the number of genes known to contribute to predisposition for ALS. We performed whole-exome sequencing of 2869 ALS patients and 6405 controls. Several known ALS genes were found to be associated, and TBK1 (the gene encoding TANK-binding kinase 1) was identified as an ALS gene. TBK1 is known to bind to and phosphorylate a number of proteins involved in innate immunity and autophagy, including optineurin (OPTN) and p62 (SQSTM1/sequestosome), both of which have also been implicated in ALS. These observations reveal a key role of the autophagic pathway in ALS and suggest specific targets for therapeutic intervention. Copyright © 2015, American Association for the Advancement of Science.

  18. Whither Causal Models in the Neuroscience of ADHD?

    ERIC Educational Resources Information Center

    Coghill, Dave; Nigg, Joel; Rothenberger, Aribert; Sonuga-Barke, Edmund; Tannock, Rosemary

    2005-01-01

    In this paper we examine the current status of the science of ADHD from a theoretical point of view. While the field has reached the point at which a number of causal models have been proposed, it remains some distance away from demonstrating the viability of such models empirically. We identify a number of existing barriers and make proposals as…

  19. Interactions of information transfer along separable causal paths

    NASA Astrophysics Data System (ADS)

    Jiang, Peishi; Kumar, Praveen

    2018-04-01

    Complex systems arise as a result of interdependences between multiple variables, whose causal interactions can be visualized in a time-series graph. Transfer entropy and information partitioning approaches have been used to characterize such dependences. However, these approaches capture net information transfer occurring through a multitude of pathways involved in the interaction and as a result mask our ability to discern the causal interaction within a subgraph of interest through specific pathways. We build on recent developments of momentary information transfer along causal paths proposed by Runge [Phys. Rev. E 92, 062829 (2015), 10.1103/PhysRevE.92.062829] to develop a framework for quantifying information partitioning along separable causal paths. Momentary information transfer along causal paths captures the amount of information transfer between any two variables lagged at two specific points in time. Our approach expands this concept to characterize the causal interaction in terms of synergistic, unique, and redundant information transfer through separable causal paths. Through a graphical model, we analyze the impact of the separable and nonseparable causal paths and the causality structure embedded in the graph as well as the noise effect on information partitioning by using synthetic data generated from two coupled logistic equation models. Our approach can provide a valuable reference for an autonomous information partitioning along separable causal paths which form a causal subgraph influencing a target.

  20. MADGiC: a model-based approach for identifying driver genes in cancer

    PubMed Central

    Korthauer, Keegan D.; Kendziorski, Christina

    2015-01-01

    Motivation: Identifying and prioritizing somatic mutations is an important and challenging area of cancer research that can provide new insights into gene function as well as new targets for drug development. Most methods for prioritizing mutations rely primarily on frequency-based criteria, where a gene is identified as having a driver mutation if it is altered in significantly more samples than expected according to a background model. Although useful, frequency-based methods are limited in that all mutations are treated equally. It is well known, however, that some mutations have no functional consequence, while others may have a major deleterious impact. The spatial pattern of mutations within a gene provides further insight into their functional consequence. Properly accounting for these factors improves both the power and accuracy of inference. Also important is an accurate background model. Results: Here, we develop a Model-based Approach for identifying Driver Genes in Cancer (termed MADGiC) that incorporates both frequency and functional impact criteria and accommodates a number of factors to improve the background model. Simulation studies demonstrate advantages of the approach, including a substantial increase in power over competing methods. Further advantages are illustrated in an analysis of ovarian and lung cancer data from The Cancer Genome Atlas (TCGA) project. Availability and implementation: R code to implement this method is available at http://www.biostat.wisc.edu/ kendzior/MADGiC/. Contact: kendzior@biostat.wisc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25573922

  1. Causal biological network database: a comprehensive platform of causal biological network models focused on the pulmonary and vascular systems

    PubMed Central

    Boué, Stéphanie; Talikka, Marja; Westra, Jurjen Willem; Hayes, William; Di Fabio, Anselmo; Park, Jennifer; Schlage, Walter K.; Sewer, Alain; Fields, Brett; Ansari, Sam; Martin, Florian; Veljkovic, Emilija; Kenney, Renee; Peitsch, Manuel C.; Hoeng, Julia

    2015-01-01

    With the wealth of publications and data available, powerful and transparent computational approaches are required to represent measured data and scientific knowledge in a computable and searchable format. We developed a set of biological network models, scripted in the Biological Expression Language, that reflect causal signaling pathways across a wide range of biological processes, including cell fate, cell stress, cell proliferation, inflammation, tissue repair and angiogenesis in the pulmonary and cardiovascular context. This comprehensive collection of networks is now freely available to the scientific community in a centralized web-based repository, the Causal Biological Network database, which is composed of over 120 manually curated and well annotated biological network models and can be accessed at http://causalbionet.com. The website accesses a MongoDB, which stores all versions of the networks as JSON objects and allows users to search for genes, proteins, biological processes, small molecules and keywords in the network descriptions to retrieve biological networks of interest. The content of the networks can be visualized and browsed. Nodes and edges can be filtered and all supporting evidence for the edges can be browsed and is linked to the original articles in PubMed. Moreover, networks may be downloaded for further visualization and evaluation. Database URL: http://causalbionet.com PMID:25887162

  2. Does Causality Matter More Now? Increase in the Proportion of Causal Language in English Texts.

    PubMed

    Iliev, Rumen; Axelrod, Robert

    2016-05-01

    The vast majority of the work on culture and cognition has focused on cross-cultural comparisons, largely ignoring the dynamic aspects of culture. In this article, we provide a diachronic analysis of causal cognition over time. We hypothesized that the increased role of education, science, and technology in Western societies should be accompanied by greater attention to causal connections. To test this hypothesis, we compared word frequencies in English texts from different time periods and found an increase in the use of causal language of about 40% over the past two centuries. The observed increase was not attributable to general language effects or to changing semantics of causal words. We also found that there was a consistent difference between the 19th and the 20th centuries, and that the increase happened mainly in the 20th century. © The Author(s) 2016.

  3. Draft Genome Sequence of an Isolate of Colletotrichum fructicola, a Causal Agent of Mango Anthracnose.

    PubMed

    Li, Qili; Bu, Junyan; Yu, Zhihe; Tang, Lihua; Huang, Suiping; Guo, Tangxun; Mo, Jianyou; Hsiang, Tom

    2018-02-22

    Here, we present a draft genome sequence of isolate 15060 of Colletotrichum fructicola , a causal agent of mango anthracnose. The final assembly consists of 1,048 scaffolds totaling 56,493,063 bp (G+C content, 53.38%) and 15,180 predicted genes. Copyright © 2018 Li et al.

  4. Causal inference, probability theory, and graphical insights.

    PubMed

    Baker, Stuart G

    2013-11-10

    Causal inference from observational studies is a fundamental topic in biostatistics. The causal graph literature typically views probability theory as insufficient to express causal concepts in observational studies. In contrast, the view here is that probability theory is a desirable and sufficient basis for many topics in causal inference for the following two reasons. First, probability theory is generally more flexible than causal graphs: Besides explaining such causal graph topics as M-bias (adjusting for a collider) and bias amplification and attenuation (when adjusting for instrumental variable), probability theory is also the foundation of the paired availability design for historical controls, which does not fit into a causal graph framework. Second, probability theory is the basis for insightful graphical displays including the BK-Plot for understanding Simpson's paradox with a binary confounder, the BK2-Plot for understanding bias amplification and attenuation in the presence of an unobserved binary confounder, and the PAD-Plot for understanding the principal stratification component of the paired availability design. Published 2013. This article is a US Government work and is in the public domain in the USA.

  5. Generalized Causal Quantum Theories

    NASA Astrophysics Data System (ADS)

    Parmeggiani, Claudio

    2007-12-01

    We shall show that is always possible to construct causal Quantum Theories fully equivalent (as predictive tools) to acausal, standard Quantum Theory, relativistic or not relativistic; we re-obtain, as a particular case, the usual Quantum Bohmian Theory. Then we consider the measurement process, in causal theories, and we conclude that the state of affairs is not really improved, with respect to standard theories.

  6. A Sleeping Beauty forward genetic screen identifies new genes and pathways driving osteosarcoma development and metastasis

    PubMed Central

    Moriarity, Branden S; Otto, George M; Rahrmann, Eric P; Rathe, Susan K; Wolf, Natalie K; Weg, Madison T; Manlove, Luke A; LaRue, Rebecca S; Temiz, Nuri A; Molyneux, Sam D; Choi, Kwangmin; Holly, Kevin J; Sarver, Aaron L; Scott, Milcah C; Forster, Colleen L; Modiano, Jaime F; Khanna, Chand; Hewitt, Stephen M; Khokha, Rama; Yang, Yi; Gorlick, Richard; Dyer, Michael A; Largaespada, David A

    2016-01-01

    Osteosarcomas are sarcomas of the bone, derived from osteoblasts or their precursors, with a high propensity to metastasize. Osteosarcoma is associated with massive genomic instability, making it problematic to identify driver genes using human tumors or prototypical mouse models, many of which involve loss of Trp53 function. To identify the genes driving osteosarcoma development and metastasis, we performed a Sleeping Beauty (SB) transposon-based forward genetic screen in mice with and without somatic loss of Trp53. Common insertion site (CIS) analysis of 119 primary tumors and 134 metastatic nodules identified 232 sites associated with osteosarcoma development and 43 sites associated with metastasis, respectively. Analysis of CIS-associated genes identified numerous known and new osteosarcoma-associated genes enriched in the ErbB, PI3K-AKT-mTOR and MAPK signaling pathways. Lastly, we identified several oncogenes involved in axon guidance, including Sema4d and Sema6d, which we functionally validated as oncogenes in human osteosarcoma. PMID:25961939

  7. Male specific genes from dioecious white campion identified by fluorescent differential display.

    PubMed

    Scutt, Charles P; Jenkins, Tom; Furuya, Masaki; Gilmartin, Philip M

    2002-05-01

    Fluorescent differential display (FDD) has been used to screen for cDNAs that are differentially up-regulated in male flowers of the dioecious plant Silene latifolia in which an X/Y chromosome system of sex determination operates. To adapt FDD to the cloning of large numbers of differential cDNAs, a novel method of confirming the differential expression of these has been devised. FDD gels were Southern electro-blotted and probed with mixtures of individual cDNA clones derived from different FDD product ligation reactions. These Southern blots were then stripped and re-probed with further mixtures of individual cloned FDD products to identify the maximum number of recombinant clones carrying the true differential amplification products. Of 135 differential bands identified by FDD, 56 differential amplification products were confirmed; these represent 23 unique differentially expressed genes as determined by virtual Northern analysis and two genes expressed at or below the level of detection by virtual Northern analysis. These two low expressed genes show bands of hybridization on genomic Southern blots that are specific to male plants, indicating that they are derived from, or closely related to, Y chromosome genes.

  8. Gene-set analysis based on the pharmacological profiles of drugs to identify repurposing opportunities in schizophrenia.

    PubMed

    de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome

    2016-08-01

    Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment wide significance (corrected p<0.05), highly ranked gene-sets reaching suggestive significance including the dopamine receptor antagonists metoclopramide and trifluoperazine and the tyrosine kinase inhibitor neratinib. This is a proof of principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.

  9. Causal inference in economics and marketing

    PubMed Central

    Varian, Hal R.

    2016-01-01

    This is an elementary introduction to causal inference in economics written for readers familiar with machine learning methods. The critical step in any causal analysis is estimating the counterfactual—a prediction of what would have happened in the absence of the treatment. The powerful techniques used in machine learning may be useful for developing better estimates of the counterfactual, potentially improving causal inference. PMID:27382144

  10. A Bayesian Nonparametric Causal Model for Regression Discontinuity Designs

    ERIC Educational Resources Information Center

    Karabatsos, George; Walker, Stephen G.

    2013-01-01

    The regression discontinuity (RD) design (Thistlewaite & Campbell, 1960; Cook, 2008) provides a framework to identify and estimate causal effects from a non-randomized design. Each subject of a RD design is assigned to the treatment (versus assignment to a non-treatment) whenever her/his observed value of the assignment variable equals or…

  11. Causal learning with local computations.

    PubMed

    Fernbach, Philip M; Sloman, Steven A

    2009-05-01

    The authors proposed and tested a psychological theory of causal structure learning based on local computations. Local computations simplify complex learning problems via cues available on individual trials to update a single causal structure hypothesis. Structural inferences from local computations make minimal demands on memory, require relatively small amounts of data, and need not respect normative prescriptions as inferences that are principled locally may violate those principles when combined. Over a series of 3 experiments, the authors found (a) systematic inferences from small amounts of data; (b) systematic inference of extraneous causal links; (c) influence of data presentation order on inferences; and (d) error reduction through pretraining. Without pretraining, a model based on local computations fitted data better than a Bayesian structural inference model. The data suggest that local computations serve as a heuristic for learning causal structure. Copyright 2009 APA, all rights reserved.

  12. Exome Sequencing and Linkage Analysis Identified Novel Candidate Genes in Recessive Intellectual Disability Associated with Ataxia.

    PubMed

    Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia

    2015-10-01

    Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).

  13. Integrated sequence analysis pipeline provides one-stop solution for identifying disease-causing mutations.

    PubMed

    Hu, Hao; Wienker, Thomas F; Musante, Luciana; Kalscheuer, Vera M; Kahrizi, Kimia; Najmabadi, Hossein; Ropers, H Hilger

    2014-12-01

    Next-generation sequencing has greatly accelerated the search for disease-causing defects, but even for experts the data analysis can be a major challenge. To facilitate the data processing in a clinical setting, we have developed a novel medical resequencing analysis pipeline (MERAP). MERAP assesses the quality of sequencing, and has optimized capacity for calling variants, including single-nucleotide variants, insertions and deletions, copy-number variation, and other structural variants. MERAP identifies polymorphic and known causal variants by filtering against public domain databases, and flags nonsynonymous and splice-site changes. MERAP uses a logistic model to estimate the causal likelihood of a given missense variant. MERAP considers the relevant information such as phenotype and interaction with known disease-causing genes. MERAP compares favorably with GATK, one of the widely used tools, because of its higher sensitivity for detecting indels, its easy installation, and its economical use of computational resources. Upon testing more than 1,200 individuals with mutations in known and novel disease genes, MERAP proved highly reliable, as illustrated here for five families with disease-causing variants. We believe that the clinical implementation of MERAP will expedite the diagnostic process of many disease-causing defects. © 2014 WILEY PERIODICALS, INC.

  14. Localizing epileptic seizure onsets with Granger causality

    NASA Astrophysics Data System (ADS)

    Adhikari, Bhim M.; Epstein, Charles M.; Dhamala, Mukesh

    2013-09-01

    Accurate localization of the epileptic seizure onset zones (SOZs) is crucial for successful surgery, which usually depends on the information obtained from intracranial electroencephalography (IEEG) recordings. The visual criteria and univariate methods of analyzing IEEG recordings have not always produced clarity on the SOZs for resection and ultimate seizure freedom for patients. Here, to contribute to improving the localization of the SOZs and to understanding the mechanism of seizure propagation over the brain, we applied spectral interdependency methods to IEEG time series recorded from patients during seizures. We found that the high-frequency (>80 Hz) Granger causality (GC) occurs before the onset of any visible ictal activity and causal relationships involve the recording electrodes where clinically identifiable seizures later develop. These results suggest that high-frequency oscillatory network activities precede and underlie epileptic seizures, and that GC spectral measures derived from IEEG can assist in precise delineation of seizure onset times and SOZs.

  15. High-Throughput Screening to Identify Regulators of Meiosis-Specific Gene Expression in Saccharomyces cerevisiae.

    PubMed

    Kassir, Yona

    2017-01-01

    Meiosis and gamete formation are processes that are essential for sexual reproduction in all eukaryotic organisms. Multiple intracellular and extracellular signals feed into pathways that converge on transcription factors that induce the expression of meiosis-specific genes. Once triggered the meiosis-specific gene expression program proceeds in a cascade that drives progress through the events of meiosis and gamete formation. Meiosis-specific gene expression is tightly controlled by a balance of positive and negative regulatory factors that respond to a plethora of signaling pathways. The budding yeast Saccharomyces cerevisiae has proven to be an outstanding model for the dissection of gametogenesis owing to the sophisticated genetic manipulations that can be performed with the cells. It is possible to use a variety selection and screening methods to identify genes and their functions. High-throughput screening technology has been developed to allow an array of all viable yeast gene deletion mutants to be screened for phenotypes and for regulators of gene expression. This chapter describes a protocol that has been used to screen a library of homozygous diploid yeast deletion strains to identify regulators of the meiosis-specific IME1 gene.

  16. Transposon mutagenesis identifies genes and cellular processes driving epithelial-mesenchymal transition in hepatocellular carcinoma

    PubMed Central

    Kodama, Takahiro; Newberg, Justin Y.; Kodama, Michiko; Rangel, Roberto; Yoshihara, Kosuke; Tien, Jean C.; Parsons, Pamela H.; Wu, Hao; Finegold, Milton J.; Copeland, Neal G.; Jenkins, Nancy A.

    2016-01-01

    Epithelial-mesenchymal transition (EMT) is thought to contribute to metastasis and chemoresistance in patients with hepatocellular carcinoma (HCC), leading to their poor prognosis. The genes driving EMT in HCC are not yet fully understood, however. Here, we show that mobilization of Sleeping Beauty (SB) transposons in immortalized mouse hepatoblasts induces mesenchymal liver tumors on transplantation to nude mice. These tumors show significant down-regulation of epithelial markers, along with up-regulation of mesenchymal markers and EMT-related transcription factors (EMT-TFs). Sequencing of transposon insertion sites from tumors identified 233 candidate cancer genes (CCGs) that were enriched for genes and cellular processes driving EMT. Subsequent trunk driver analysis identified 23 CCGs that are predicted to function early in tumorigenesis and whose mutation or alteration in patients with HCC is correlated with poor patient survival. Validation of the top trunk drivers identified in the screen, including MET (MET proto-oncogene, receptor tyrosine kinase), GRB2-associated binding protein 1 (GAB1), HECT, UBA, and WWE domain containing 1 (HUWE1), lysine-specific demethylase 6A (KDM6A), and protein-tyrosine phosphatase, nonreceptor-type 12 (PTPN12), showed that deregulation of these genes activates an EMT program in human HCC cells that enhances tumor cell migration. Finally, deregulation of these genes in human HCC was found to confer sorafenib resistance through apoptotic tolerance and reduced proliferation, consistent with recent studies showing that EMT contributes to the chemoresistance of tumor cells. Our unique cell-based transposon mutagenesis screen appears to be an excellent resource for discovering genes involved in EMT in human HCC and potentially for identifying new drug targets. PMID:27247392

  17. Causal localizations in relativistic quantum mechanics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Castrigiano, Domenico P. L., E-mail: castrig@ma.tum.de; Leiseifer, Andreas D., E-mail: andreas.leiseifer@tum.de

    2015-07-15

    Causal localizations describe the position of quantum systems moving not faster than light. They are constructed for the systems with finite spinor dimension. At the center of interest are the massive relativistic systems. For every positive mass, there is the sequence of Dirac tensor-localizations, which provides a complete set of inequivalent irreducible causal localizations. They obey the principle of special relativity and are fully Poincaré covariant. The boosters are determined by the causal position operator and the other Poincaré generators. The localization with minimal spinor dimension is the Dirac localization. Thus, the Dirac equation is derived here as a meremore » consequence of the principle of causality. Moreover, the higher tensor-localizations, not known so far, follow from Dirac’s localization by a simple construction. The probability of localization for positive energy states results to be described by causal positive operator valued (PO-) localizations, which are the traces of the causal localizations on the subspaces of positive energy. These causal Poincaré covariant PO-localizations for every irreducible massive relativistic system were, all the more, not known before. They are shown to be separated. Hence, the positive energy systems can be localized within every open region by a suitable preparation as accurately as desired. Finally, the attempt is made to provide an interpretation of the PO-localization operators within the frame of conventional quantum mechanics attributing an important role to the negative energy states.« less

  18. Causal localizations in relativistic quantum mechanics

    NASA Astrophysics Data System (ADS)

    Castrigiano, Domenico P. L.; Leiseifer, Andreas D.

    2015-07-01

    Causal localizations describe the position of quantum systems moving not faster than light. They are constructed for the systems with finite spinor dimension. At the center of interest are the massive relativistic systems. For every positive mass, there is the sequence of Dirac tensor-localizations, which provides a complete set of inequivalent irreducible causal localizations. They obey the principle of special relativity and are fully Poincaré covariant. The boosters are determined by the causal position operator and the other Poincaré generators. The localization with minimal spinor dimension is the Dirac localization. Thus, the Dirac equation is derived here as a mere consequence of the principle of causality. Moreover, the higher tensor-localizations, not known so far, follow from Dirac's localization by a simple construction. The probability of localization for positive energy states results to be described by causal positive operator valued (PO-) localizations, which are the traces of the causal localizations on the subspaces of positive energy. These causal Poincaré covariant PO-localizations for every irreducible massive relativistic system were, all the more, not known before. They are shown to be separated. Hence, the positive energy systems can be localized within every open region by a suitable preparation as accurately as desired. Finally, the attempt is made to provide an interpretation of the PO-localization operators within the frame of conventional quantum mechanics attributing an important role to the negative energy states.

  19. Causal structures in Gauss-Bonnet gravity

    NASA Astrophysics Data System (ADS)

    Izumi, Keisuke

    2014-08-01

    We analyze causal structures in Gauss-Bonnet gravity. It is known that Gauss-Bonnet gravity potentially has superluminal propagation of gravitons due to its noncanonical kinetic terms. In a theory with superluminal modes, an analysis of causality based on null curves makes no sense, and thus, we need to analyze them in a different way. In this paper, using the method of the characteristics, we analyze the causal structure in Gauss-Bonnet gravity. We have the result that, on a Killing horizon, gravitons can propagate in the null direction tangent to the Killing horizon. Therefore, a Killing horizon can be a causal edge as in the case of general relativity; i.e. a Killing horizon is the "event horizon" in the sense of causality. We also analyze causal structures on nonstationary solutions with (D-2)-dimensional maximal symmetry, including spherically symmetric and flat spaces. If the geometrical null energy condition, RABNANB≥0 for any null vector NA, is satisfied, the radial velocity of gravitons must be less than or equal to that of light. However, if the geometrical null energy condition is violated, gravitons can propagate faster than light. Hence, on an evaporating black hole where the geometrical null energy condition is expected not to hold, classical gravitons can escape from the "black hole" defined with null curves. That is, the causal structures become nontrivial. It may be one of the possible solutions for the information loss paradox of evaporating black holes.

  20. Resequencing a core collection of upland cotton identifies genomic variation and loci influencing fiber quality and yield.

    PubMed

    Ma, Zhiying; He, Shoupu; Wang, Xingfen; Sun, Junling; Zhang, Yan; Zhang, Guiyin; Wu, Liqiang; Li, Zhikun; Liu, Zhihao; Sun, Gaofei; Yan, Yuanyuan; Jia, Yinhua; Yang, Jun; Pan, Zhaoe; Gu, Qishen; Li, Xueyuan; Sun, Zhengwen; Dai, Panhong; Liu, Zhengwen; Gong, Wenfang; Wu, Jinhua; Wang, Mi; Liu, Hengwei; Feng, Keyun; Ke, Huifeng; Wang, Junduo; Lan, Hongyu; Wang, Guoning; Peng, Jun; Wang, Nan; Wang, Liru; Pang, Baoyin; Peng, Zhen; Li, Ruiqiang; Tian, Shilin; Du, Xiongming

    2018-05-07

    Upland cotton is the most important natural-fiber crop. The genomic variation of diverse germplasms and alleles underpinning fiber quality and yield should be extensively explored. Here, we resequenced a core collection comprising 419 accessions with 6.55-fold coverage depth and identified approximately 3.66 million SNPs for evaluating the genomic variation. We performed phenotyping across 12 environments and conducted genome-wide association study of 13 fiber-related traits. 7,383 unique SNPs were significantly associated with these traits and were located within or near 4,820 genes; more associated loci were detected for fiber quality than fiber yield, and more fiber genes were detected in the D than the A subgenome. Several previously undescribed causal genes for days to flowering, fiber length, and fiber strength were identified. Phenotypic selection for these traits increased the frequency of elite alleles during domestication and breeding. These results provide targets for molecular selection and genetic manipulation in cotton improvement.

  1. Causal beliefs about depression in different cultural groups—what do cognitive psychological theories of causal learning and reasoning predict?

    PubMed Central

    Hagmayer, York; Engelmann, Neele

    2014-01-01

    Cognitive psychological research focuses on causal learning and reasoning while cognitive anthropological and social science research tend to focus on systems of beliefs. Our aim was to explore how these two types of research can inform each other. Cognitive psychological theories (causal model theory and causal Bayes nets) were used to derive predictions for systems of causal beliefs. These predictions were then applied to lay theories of depression as a specific test case. A systematic literature review on causal beliefs about depression was conducted, including original, quantitative research. Thirty-six studies investigating 13 non-Western and 32 Western cultural groups were analyzed by classifying assumed causes and preferred forms of treatment into common categories. Relations between beliefs and treatment preferences were assessed. Substantial agreement between cultural groups was found with respect to the impact of observable causes. Stress was generally rated as most important. Less agreement resulted for hidden, especially supernatural causes. Causal beliefs were clearly related to treatment preferences in Western groups, while evidence was mostly lacking for non-Western groups. Overall predictions were supported, but there were considerable methodological limitations. Pointers to future research, which may combine studies on causal beliefs with experimental paradigms on causal reasoning, are given. PMID:25505432

  2. Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture.

    PubMed

    González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R

    2016-01-01

    Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.

  3. Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes.

    PubMed

    Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki

    2016-05-26

    Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes.

  4. Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes

    PubMed Central

    Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki

    2016-01-01

    Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414

  5. Transcriptome analysis of an apple (Malus × domestica) yellow fruit somatic mutation identifies a gene network module highly associated with anthocyanin and epigenetic regulation.

    PubMed

    El-Sharkawy, Islam; Liang, Dong; Xu, Kenong

    2015-12-01

    Using RNA-seq, this study analysed an apple (Malus×domestica) anthocyanin-deficient yellow-skin somatic mutant 'Blondee' (BLO) and its red-skin parent 'Kidd's D-8' (KID), the original name of 'Gala', to understand the molecular mechanisms underlying the mutation. A total of 3299 differentially expressed genes (DEGs) were identified between BLO and KID at four developmental stages and/or between two adjacent stages within BLO and/or KID. A weighted gene co-expression network analysis (WGCNA) of the DEGs uncovered a network module of 34 genes highly correlated (r=0.95, P=9.0×10(-13)) with anthocyanin contents. Although 12 of the 34 genes in the WGCNA module were characterized and known of roles in anthocyanin, the remainder 22 appear to be novel. Examining the expression of ten representative genes in the module in 14 diverse apples revealed that at least eight were significantly correlated with anthocyanin variation. MdMYB10 (MDP0000259614) and MdGST (MDP0000252292) were among the most suppressed module member genes in BLO despite being undistinguishable in their corresponding sequences between BLO and KID. Methylation assay of MdMYB10 and MdGST in fruit skin revealed that two regions (MR3 and MR7) in the MdMYB10 promoter exhibited remarkable differences between BLO and KID. In particular, methylation was high and progressively increased alongside fruit development in BLO while was correspondingly low and constant in KID. The methylation levels in both MR3 and MR7 were negatively correlated with anthocyanin content as well as the expression of MdMYB10 and MdGST. Clearly, the collective repression of the 34 genes explains the loss-of-colour in BLO while the methylation in MdMYB10 promoter is likely causal for the mutation. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  6. Specific PCR primers directed to identify cryI and cryIII genes within a Bacillus thuringiensis strain collection.

    PubMed Central

    Cerón, J; Ortíz, A; Quintero, R; Güereca, L; Bravo, A

    1995-01-01

    In this paper we describe a PCR strategy that can be used to rapidly identify Bacillus thuringiensis strains that harbor any of the known cryI or cryIII genes. Four general PCR primers which amplify DNA fragments from the known cryI or cryIII genes were selected from conserved regions. Once a strain was identified as an organism that contains a particular type of cry gene, it could be easily characterized by performing additional PCR with specific cryI and cryIII primers selected from variable regions. The method described in this paper can be used to identify the 10 different cryI genes and the five different cryIII genes. One feature of this screening method is that each cry gene is expected to produce a PCR product having a precise molecular weight. The genes which produce PCR products having different sizes probably represent strains that harbor a potentially novel cry gene. Finally, we present evidence that novel crystal genes can be identified by the method described in this paper. PMID:8526493

  7. Bayesian networks improve causal environmental ...

    EPA Pesticide Factsheets

    Rule-based weight of evidence approaches to ecological risk assessment may not account for uncertainties and generally lack probabilistic integration of lines of evidence. Bayesian networks allow causal inferences to be made from evidence by including causal knowledge about the problem, using this knowledge with probabilistic calculus to combine multiple lines of evidence, and minimizing biases in predicting or diagnosing causal relationships. Too often, sources of uncertainty in conventional weight of evidence approaches are ignored that can be accounted for with Bayesian networks. Specifying and propagating uncertainties improve the ability of models to incorporate strength of the evidence in the risk management phase of an assessment. Probabilistic inference from a Bayesian network allows evaluation of changes in uncertainty for variables from the evidence. The network structure and probabilistic framework of a Bayesian approach provide advantages over qualitative approaches in weight of evidence for capturing the impacts of multiple sources of quantifiable uncertainty on predictions of ecological risk. Bayesian networks can facilitate the development of evidence-based policy under conditions of uncertainty by incorporating analytical inaccuracies or the implications of imperfect information, structuring and communicating causal issues through qualitative directed graph formulations, and quantitatively comparing the causal power of multiple stressors on value

  8. Meta-Analysis of Placental Transcriptome Data Identifies a Novel Molecular Pathway Related to Preeclampsia.

    PubMed

    van Uitert, Miranda; Moerland, Perry D; Enquobahrie, Daniel A; Laivuori, Hannele; van der Post, Joris A M; Ris-Stalpers, Carrie; Afink, Gijs B

    2015-01-01

    Studies using the placental transcriptome to identify key molecules relevant for preeclampsia are hampered by a relatively small sample size. In addition, they use a variety of bioinformatics and statistical methods, making comparison of findings challenging. To generate a more robust preeclampsia gene expression signature, we performed a meta-analysis on the original data of 11 placenta RNA microarray experiments, representing 139 normotensive and 116 preeclamptic pregnancies. Microarray data were pre-processed and analyzed using standardized bioinformatics and statistical procedures and the effect sizes were combined using an inverse-variance random-effects model. Interactions between genes in the resulting gene expression signature were identified by pathway analysis (Ingenuity Pathway Analysis, Gene Set Enrichment Analysis, Graphite) and protein-protein associations (STRING). This approach has resulted in a comprehensive list of differentially expressed genes that led to a 388-gene meta-signature of preeclamptic placenta. Pathway analysis highlights the involvement of the previously identified hypoxia/HIF1A pathway in the establishment of the preeclamptic gene expression profile, while analysis of protein interaction networks indicates CREBBP/EP300 as a novel element central to the preeclamptic placental transcriptome. In addition, there is an apparent high incidence of preeclampsia in women carrying a child with a mutation in CREBBP/EP300 (Rubinstein-Taybi Syndrome). The 388-gene preeclampsia meta-signature offers a vital starting point for further studies into the relevance of these genes (in particular CREBBP/EP300) and their concomitant pathways as biomarkers or functional molecules in preeclampsia. This will result in a better understanding of the molecular basis of this disease and opens up the opportunity to develop rational therapies targeting the placental dysfunction causal to preeclampsia.

  9. Formalizing the role of agent-based modeling in causal inference and epidemiology.

    PubMed

    Marshall, Brandon D L; Galea, Sandro

    2015-01-15

    Calls for the adoption of complex systems approaches, including agent-based modeling, in the field of epidemiology have largely centered on the potential for such methods to examine complex disease etiologies, which are characterized by feedback behavior, interference, threshold dynamics, and multiple interacting causal effects. However, considerable theoretical and practical issues impede the capacity of agent-based methods to examine and evaluate causal effects and thus illuminate new areas for intervention. We build on this work by describing how agent-based models can be used to simulate counterfactual outcomes in the presence of complexity. We show that these models are of particular utility when the hypothesized causal mechanisms exhibit a high degree of interdependence between multiple causal effects and when interference (i.e., one person's exposure affects the outcome of others) is present and of intrinsic scientific interest. Although not without challenges, agent-based modeling (and complex systems methods broadly) represent a promising novel approach to identify and evaluate complex causal effects, and they are thus well suited to complement other modern epidemiologic methods of etiologic inquiry. © The Author 2014. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  10. Quantum-coherent mixtures of causal relations

    NASA Astrophysics Data System (ADS)

    Maclean, Jean-Philippe W.; Ried, Katja; Spekkens, Robert W.; Resch, Kevin J.

    2017-05-01

    Understanding the causal influences that hold among parts of a system is critical both to explaining that system's natural behaviour and to controlling it through targeted interventions. In a quantum world, understanding causal relations is equally important, but the set of possibilities is far richer. The two basic ways in which a pair of time-ordered quantum systems may be causally related are by a cause-effect mechanism or by a common-cause acting on both. Here we show a coherent mixture of these two possibilities. We realize this nonclassical causal relation in a quantum optics experiment and derive a set of criteria for witnessing the coherence based on a quantum version of Berkson's effect, whereby two independent causes can become correlated on observation of their common effect. The interplay of causality and quantum theory lies at the heart of challenging foundational puzzles, including Bell's theorem and the search for quantum gravity.

  11. Quantum-coherent mixtures of causal relations

    PubMed Central

    MacLean, Jean-Philippe W.; Ried, Katja; Spekkens, Robert W.; Resch, Kevin J.

    2017-01-01

    Understanding the causal influences that hold among parts of a system is critical both to explaining that system's natural behaviour and to controlling it through targeted interventions. In a quantum world, understanding causal relations is equally important, but the set of possibilities is far richer. The two basic ways in which a pair of time-ordered quantum systems may be causally related are by a cause-effect mechanism or by a common-cause acting on both. Here we show a coherent mixture of these two possibilities. We realize this nonclassical causal relation in a quantum optics experiment and derive a set of criteria for witnessing the coherence based on a quantum version of Berkson's effect, whereby two independent causes can become correlated on observation of their common effect. The interplay of causality and quantum theory lies at the heart of challenging foundational puzzles, including Bell's theorem and the search for quantum gravity. PMID:28485394

  12. Quantum-coherent mixtures of causal relations.

    PubMed

    MacLean, Jean-Philippe W; Ried, Katja; Spekkens, Robert W; Resch, Kevin J

    2017-05-09

    Understanding the causal influences that hold among parts of a system is critical both to explaining that system's natural behaviour and to controlling it through targeted interventions. In a quantum world, understanding causal relations is equally important, but the set of possibilities is far richer. The two basic ways in which a pair of time-ordered quantum systems may be causally related are by a cause-effect mechanism or by a common-cause acting on both. Here we show a coherent mixture of these two possibilities. We realize this nonclassical causal relation in a quantum optics experiment and derive a set of criteria for witnessing the coherence based on a quantum version of Berkson's effect, whereby two independent causes can become correlated on observation of their common effect. The interplay of causality and quantum theory lies at the heart of challenging foundational puzzles, including Bell's theorem and the search for quantum gravity.

  13. A method to identify differential expression profiles of time-course gene data with Fourier transformation.

    PubMed

    Kim, Jaehee; Ogden, Robert Todd; Kim, Haseong

    2013-10-18

    Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization.The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be

  14. A method to identify differential expression profiles of time-course gene data with Fourier transformation

    PubMed Central

    2013-01-01

    Background Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. Results This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization. The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Conclusions Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The

  15. Causal uncertainty, claimed and behavioural self-handicapping.

    PubMed

    Thompson, Ted; Hepburn, Jonathan

    2003-06-01

    Causal uncertainty beliefs involve doubts about the causes of events, and arise as a consequence of non-contingent evaluative feedback: feedback that leaves the individual uncertain about the causes of his or her achievement outcomes. Individuals high in causal uncertainty are frequently unable to confidently attribute their achievement outcomes, experience anxiety in achievement situations and as a consequence are likely to engage in self-handicapping behaviour. Accordingly, we sought to establish links between trait causal uncertainty, claimed and behavioural self-handicapping. Participants were N=72 undergraduate students divided equally between high and low causally uncertain groups. We used a 2 (causal uncertainty status: high, low) x 3 (performance feedback condition: success, non-contingent success, non-contingent failure) between-subjects factorial design to examine the effects of causal uncertainty on achievement behaviour. Following performance feedback, participants completed 20 single-solution anagrams and 12 remote associate tasks serving as performance measures, and 16 unicursal tasks to assess practice effort. Participants also completed measures of claimed handicaps, state anxiety and attributions. Relative to low causally uncertain participants, high causally uncertain participants claimed more handicaps prior to performance on the anagrams and remote associates, reported higher anxiety, attributed their failure to internal, stable factors, and reduced practice effort on the unicursal tasks, evident in fewer unicursal tasks solved. These findings confirm links between trait causal uncertainty and claimed and behavioural self-handicapping, highlighting the need for educators to facilitate means by which students can achieve surety in the manner in which they attribute the causes of their achievement outcomes.

  16. Transcriptome Sequencing Identified Genes and Gene Ontologies Associated with Early Freezing Tolerance in Maize

    PubMed Central

    Li, Zhao; Hu, Guanghui; Liu, Xiangfeng; Zhou, Yao; Li, Yu; Zhang, Xu; Yuan, Xiaohui; Zhang, Qian; Yang, Deguang; Wang, Tianyu; Zhang, Zhiwu

    2016-01-01

    Originating in a tropical climate, maize has faced great challenges as cultivation has expanded to the majority of the world's temperate zones. In these zones, frost and cold temperatures are major factors that prevent maize from reaching its full yield potential. Among 30 elite maize inbred lines adapted to northern China, we identified two lines of extreme, but opposite, freezing tolerance levels—highly tolerant and highly sensitive. During the seedling stage of these two lines, we used RNA-seq to measure changes in maize whole genome transcriptome before and after freezing treatment. In total, 19,794 genes were expressed, of which 4550 exhibited differential expression due to either treatment (before or after freezing) or line type (tolerant or sensitive). Of the 4550 differently expressed genes, 948 exhibited differential expression due to treatment within line or lines under freezing condition. Analysis of gene ontology found that these 948 genes were significantly enriched for binding functions (DNA binding, ATP binding, and metal ion binding), protein kinase activity, and peptidase activity. Based on their enrichment, literature support, and significant levels of differential expression, 30 of these 948 genes were selected for quantitative real-time PCR (qRT-PCR) validation. The validation confirmed our RNA-Seq-based findings, with squared correlation coefficients of 80% and 50% in the tolerance and sensitive lines, respectively. This study provided valuable resources for further studies to enhance understanding of the molecular mechanisms underlying maize early freezing response and enable targeted breeding strategies for developing varieties with superior frost resistance to achieve yield potential. PMID:27774095

  17. Causality or Relatedness Assessment in Adverse Drug Reaction and Its Relevance in Dermatology.

    PubMed

    Pande, Sushil

    2018-01-01

    Causality assessment essentially means finding a causal association or relationship between a drug and drug reaction. Identifying the culprit drug or drugs can be lifesaving or helpful in preventing the further damage caused by the drug to our body systems. In dermatology practice, when it comes to cutaneous adverse drug reaction, this is much more important and relevant because many aetiologies can produce a similar cutaneous manifestation. There are multiple criteria or algorithms available as of now for establishing a causal relationship in cases of adverse drug reaction (ADR), indicating that none of them is specific or complete. Most of these causality assessment tools (CATs) use four cardinal principles of diagnosis of ADR such as temporal relationship of drug with the drug reaction, biological plausibility of the drug causing a reaction, dechallenge, and rechallenge. The present study reviews some of the established or commonly used CATs and its implications or relevance to dermatology in clinical practice.

  18. Unveiling causal activity of complex networks

    NASA Astrophysics Data System (ADS)

    Williams-García, Rashid V.; Beggs, John M.; Ortiz, Gerardo

    2017-07-01

    We introduce a novel tool for analyzing complex network dynamics, allowing for cascades of causally-related events, which we call causal webs (c-webs), to be separated from other non-causally-related events. This tool shows that traditionally-conceived avalanches may contain mixtures of spatially-distinct but temporally-overlapping cascades of events, and dynamical disorder or noise. In contrast, c-webs separate these components, unveiling previously hidden features of the network and dynamics. We apply our method to mouse cortical data with resulting statistics which demonstrate for the first time that neuronal avalanches are not merely composed of causally-related events. The original version of this article was uploaded to the arXiv on March 17th, 2016 [1].

  19. GWAS of clinically defined gout and subtypes identifies multiple susceptibility loci that include urate transporter genes.

    PubMed

    Nakayama, Akiyoshi; Nakaoka, Hirofumi; Yamamoto, Ken; Sakiyama, Masayuki; Shaukat, Amara; Toyoda, Yu; Okada, Yukinori; Kamatani, Yoichiro; Nakamura, Takahiro; Takada, Tappei; Inoue, Katsuhisa; Yasujima, Tomoya; Yuasa, Hiroaki; Shirahama, Yuko; Nakashima, Hiroshi; Shimizu, Seiko; Higashino, Toshihide; Kawamura, Yusuke; Ogata, Hiraku; Kawaguchi, Makoto; Ohkawa, Yasuyuki; Danjoh, Inaho; Tokumasu, Atsumi; Ooyama, Keiko; Ito, Toshimitsu; Kondo, Takaaki; Wakai, Kenji; Stiburkova, Blanka; Pavelka, Karel; Stamp, Lisa K; Dalbeth, Nicola; Sakurai, Yutaka; Suzuki, Hiroshi; Hosoyamada, Makoto; Fujimori, Shin; Yokoo, Takashi; Hosoya, Tatsuo; Inoue, Ituro; Takahashi, Atsushi; Kubo, Michiaki; Ooyama, Hiroshi; Shimizu, Toru; Ichida, Kimiyoshi; Shinomiya, Nariyoshi; Merriman, Tony R; Matsuo, Hirotaka

    2017-05-01

    A genome-wide association study (GWAS) of gout and its subtypes was performed to identify novel gout loci, including those that are subtype-specific. Putative causal association signals from a GWAS of 945 clinically defined gout cases and 1213 controls from Japanese males were replicated with 1396 cases and 1268 controls using a custom chip of 1961 single nucleotide polymorphisms (SNPs). We also first conducted GWASs of gout subtypes. Replication with Caucasian and New Zealand Polynesian samples was done to further validate the loci identified in this study. In addition to the five loci we reported previously, further susceptibility loci were identified at a genome-wide significance level (p<5.0×10 -8 ): urate transporter genes ( SLC22A12 and SLC17A1 ) and HIST1H2BF-HIST1H4E for all gout cases, and NIPAL1 and FAM35A for the renal underexcretion gout subtype. While NIPAL1 encodes a magnesium transporter, functional analysis did not detect urate transport via NIPAL1, suggesting an indirect association with urate handling. Localisation analysis in the human kidney revealed expression of NIPAL1 and FAM35A mainly in the distal tubules, which suggests the involvement of the distal nephron in urate handling in humans. Clinically ascertained male patients with gout and controls of Caucasian and Polynesian ancestries were also genotyped, and FAM35A was associated with gout in all cases. A meta-analysis of the three populations revealed FAM35A to be associated with gout at a genome-wide level of significance (p meta =3.58×10 -8 ). Our findings including novel gout risk loci provide further understanding of the molecular pathogenesis of gout and lead to a novel concept for the therapeutic target of gout/hyperuricaemia. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.

  20. Expert Causal Reasoning and Explanation.

    ERIC Educational Resources Information Center

    Kuipers, Benjamin

    The relationship between cognitive psychologists and researchers in artificial intelligence carries substantial benefits for both. An ongoing investigation in causal reasoning in medical problem solving systems illustrates this interaction. This paper traces a dialectic of sorts in which three different types of causal resaoning for medical…

  1. Causal Inference in Retrospective Studies.

    ERIC Educational Resources Information Center

    Holland, Paul W.; Rubin, Donald B.

    1988-01-01

    The problem of drawing causal inferences from retrospective case-controlled studies is considered. A model for causal inference in prospective studies is applied to retrospective studies. Limitations of case-controlled studies are formulated concerning relevant parameters that can be estimated in such studies. A coffee-drinking/myocardial…

  2. Causal Learning with Local Computations

    ERIC Educational Resources Information Center

    Fernbach, Philip M.; Sloman, Steven A.

    2009-01-01

    The authors proposed and tested a psychological theory of causal structure learning based on local computations. Local computations simplify complex learning problems via cues available on individual trials to update a single causal structure hypothesis. Structural inferences from local computations make minimal demands on memory, require…

  3. Utility and Limitations of Using Gene Expression Data to Identify Functional Associations

    PubMed Central

    Peng, Cheng; Shiu, Shin-Han

    2016-01-01

    Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets. PMID:27935950

  4. A Gene Co-Expression Network in Whole Blood of Schizophrenia Patients Is Independent of Antipsychotic-Use and Enriched for Brain-Expressed Genes

    PubMed Central

    de Jong, Simone; Boks, Marco P. M.; Fuller, Tova F.; Strengman, Eric; Janson, Esther; de Kovel, Carolien G. F.; Ori, Anil P. S.; Vi, Nancy; Mulder, Flip; Blom, Jan Dirk; Glenthøj, Birte; Schubart, Chris D.; Cahn, Wiepke; Kahn, René S.; Horvath, Steve; Ophoff, Roel A.

    2012-01-01

    Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1), is located in, and regulated by the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network. PMID:22761806

  5. The Role of a Novel TRMT1 Gene Mutation and Rare GRM1 Gene Defect in Intellectual Disability in Two Azeri Families.

    PubMed

    Davarniya, Behzad; Hu, Hao; Kahrizi, Kimia; Musante, Luciana; Fattahi, Zohreh; Hosseini, Masoumeh; Maqsoud, Fariba; Farajollahi, Reza; Wienker, Thomas F; Ropers, H Hilger; Najmabadi, Hossein

    2015-01-01

    Cognitive impairment or intellectual disability (ID) is a widespread neurodevelopmental disorder characterized by low IQ (below 70). ID is genetically heterogeneous and is estimated to affect 1-3% of the world's population. In affected children from consanguineous families, autosomal recessive inheritance is common, and identifying the underlying genetic cause is an important issue in clinical genetics. In the framework of a larger project, aimed at identifying candidate genes for autosomal recessive intellectual disorder (ARID), we recently carried out single nucleotide polymorphism-based genome-wide linkage analysis in several families from Ardabil province in Iran. The identification of homozygosity-by-descent loci in these families, in combination with whole exome sequencing, led us to identify possible causative homozygous changes in two families. In the first family, a missense variant was found in GRM1 gene, while in the second family, a frameshift alteration was identified in TRMT1, both of which were found to co-segregate with the disease. GRM1, a known causal gene for autosomal recessive spinocerebellar ataxia (SCAR13, MIM#614831), encodes the metabotropic glutamate receptor1 (mGluR1). This gene plays an important role in synaptic plasticity and cerebellar development. Conversely, the TRMT1 gene encodes a tRNA methyltransferase that dimethylates a single guanine residue at position 26 of most tRNAs using S-adenosyl methionine as the methyl group donor. We recently presented TRMT1 as a candidate gene for ARID in a consanguineous Iranian family (Najmabadi et al., 2011). We believe that this second Iranian family with a biallelic loss-of-function mutation in TRMT1 gene supports the idea that this gene likely has function in development of the disorder.

  6. Gene Network for Identifying the Entropy Changes of Different Modules in Pediatric Sepsis.

    PubMed

    Yang, Jing; Zhang, Pingli; Wang, Lumin

    2016-01-01

    Pediatric sepsis is a disease that threatens life of children. The incidence of pediatric sepsis is higher in developing countries due to various reasons, such as insufficient immunization and nutrition, water and air pollution, etc. Exploring the potential genes via different methods is of significance for the prevention and treatment of pediatric sepsis. This study aimed to identify potential genes associated with pediatric sepsis utilizing analysis of gene network and entropy. The mRNA expression in the blood samples collected from 20 septic children and 30 healthy controls was quantified by using Affymetrix HG-U133A microarray. Two condition-specific protein-protein interaction networks (PINs), one for the healthy control and the other one for the children with sepsis, were deduced by combining the fundamental human PINs with gene expression profiles in the two phenotypes. Subsequently, distinct modules from the two conditional networks were extracted by adopting a maximal clique-merging approach. Delta entropy (ΔS) was calculated between sepsis and control modules. Then, key genes displaying changes in gene composition were identified by matching the control and sepsis modules. Two objective modules were obtained, in which ribosomal protein RPL4 and RPL9 as well as TOP2A were probably considered as the key genes differentiating sepsis from healthy controls. According to previous reports and this work, TOP2A is the potential gene therapy target for pediatric sepsis. The relationship between pediatric sepsis and RPL4 and RPL9 needs further investigation. © 2016 The Author(s) Published by S. Karger AG, Basel.

  7. Non-Gaussian Methods for Causal Structure Learning.

    PubMed

    Shimizu, Shohei

    2018-05-22

    Causal structure learning is one of the most exciting new topics in the fields of machine learning and statistics. In many empirical sciences including prevention science, the causal mechanisms underlying various phenomena need to be studied. Nevertheless, in many cases, classical methods for causal structure learning are not capable of estimating the causal structure of variables. This is because it explicitly or implicitly assumes Gaussianity of data and typically utilizes only the covariance structure. In many applications, however, non-Gaussian data are often obtained, which means that more information may be contained in the data distribution than the covariance matrix is capable of containing. Thus, many new methods have recently been proposed for using the non-Gaussian structure of data and inferring the causal structure of variables. This paper introduces prevention scientists to such causal structure learning methods, particularly those based on the linear, non-Gaussian, acyclic model known as LiNGAM. These non-Gaussian data analysis tools can fully estimate the underlying causal structures of variables under assumptions even in the presence of unobserved common causes. This feature is in contrast to other approaches. A simulated example is also provided.

  8. Wheat differential gene expression induced by different races of Puccinia triticina.

    PubMed

    Neugebauer, Kerri A; Bruce, Myron; Todd, Tim; Trick, Harold N; Fellers, John P

    2018-01-01

    Puccinia triticina, the causal agent of wheat leaf rust, causes significant losses in wheat yield and quality each year worldwide. During leaf rust infection, the host plant recognizes numerous molecules, some of which trigger host defenses. Although P. triticina reproduces clonally, there is still variation within the population due to a high mutation frequency, host specificity, and environmental adaptation. This study explores how wheat responds on a gene expression level to different P. triticina races. Six P. triticina races were inoculated onto a susceptible wheat variety and samples were taken at six days post inoculation, just prior to pustule eruption. RNA sequence data identified 63 wheat genes differentially expressed between the six races. A time course, conducted over the first seven days post inoculation, was used to examine the expression pattern of 63 genes during infection. Forty-seven wheat genes were verified to have differential expression. Three common expression patterns were identified. In addition, two genes were associated with race specific gene expression. Differential expression of an ER molecular chaperone gene was associated with races from two different P. triticina lineages. Also, differential expression in an alanine glyoxylate aminotransferase gene was associated with races with virulence shifts for leaf rust resistance genes.

  9. Inferring hidden causal relations between pathway members using reduced Google matrix of directed biological networks

    PubMed Central

    2018-01-01

    Signaling pathways represent parts of the global biological molecular network which connects them into a seamless whole through complex direct and indirect (hidden) crosstalk whose structure can change during development or in pathological conditions. We suggest a novel methodology, called Googlomics, for the structural analysis of directed biological networks using spectral analysis of their Google matrices, using parallels with quantum scattering theory, developed for nuclear and mesoscopic physics and quantum chaos. We introduce analytical “reduced Google matrix” method for the analysis of biological network structure. The method allows inferring hidden causal relations between the members of a signaling pathway or a functionally related group of genes. We investigate how the structure of hidden causal relations can be reprogrammed as a result of changes in the transcriptional network layer during cancerogenesis. The suggested Googlomics approach rigorously characterizes complex systemic changes in the wiring of large causal biological networks in a computationally efficient way. PMID:29370181

  10. Illness causal beliefs in Turkish immigrants

    PubMed Central

    Minas, Harry; Klimidis, Steven; Tuncer, Can

    2007-01-01

    Background People hold a wide variety of beliefs concerning the causes of illness. Such beliefs vary across cultures and, among immigrants, may be influenced by many factors, including level of acculturation, gender, level of education, and experience of illness and treatment. This study examines illness causal beliefs in Turkish-immigrants in Australia. Methods Causal beliefs about somatic and mental illness were examined in a sample of 444 members of the Turkish population of Melbourne. The socio-demographic characteristics of the sample were broadly similar to those of the Melbourne Turkish community. Five issues were examined: the structure of causal beliefs; the relative frequency of natural, supernatural and metaphysical beliefs; ascription of somatic, mental, or both somatic and mental conditions to the various causes; the correlations of belief types with socio-demographic, modernizing and acculturation variables; and the relationship between causal beliefs and current illness. Results Principal components analysis revealed two broad factors, accounting for 58 percent of the variation in scores on illness belief scales, distinctly interpretable as natural and supernatural beliefs. Second, beliefs in natural causes were more frequent than beliefs in supernatural causes. Third, some causal beliefs were commonly linked to both somatic and mental conditions while others were regarded as more specific to either somatic or mental disorders. Last, there was a range of correlations between endorsement of belief types and factors defining heterogeneity within the community, including with demographic factors, indicators of modernizing and acculturative processes, and the current presence of illness. Conclusion Results supported the classification of causal beliefs proposed by Murdock, Wilson & Frederick, with a division into natural and supernatural causes. While belief in natural causes is more common, belief in supernatural causes persists despite modernizing and

  11. Illness causal beliefs in Turkish immigrants.

    PubMed

    Minas, Harry; Klimidis, Steven; Tuncer, Can

    2007-07-24

    People hold a wide variety of beliefs concerning the causes of illness. Such beliefs vary across cultures and, among immigrants, may be influenced by many factors, including level of acculturation, gender, level of education, and experience of illness and treatment. This study examines illness causal beliefs in Turkish-immigrants in Australia. Causal beliefs about somatic and mental illness were examined in a sample of 444 members of the Turkish population of Melbourne. The socio-demographic characteristics of the sample were broadly similar to those of the Melbourne Turkish community. Five issues were examined: the structure of causal beliefs; the relative frequency of natural, supernatural and metaphysical beliefs; ascription of somatic, mental, or both somatic and mental conditions to the various causes; the correlations of belief types with socio-demographic, modernizing and acculturation variables; and the relationship between causal beliefs and current illness. Principal components analysis revealed two broad factors, accounting for 58 percent of the variation in scores on illness belief scales, distinctly interpretable as natural and supernatural beliefs. Second, beliefs in natural causes were more frequent than beliefs in supernatural causes. Third, some causal beliefs were commonly linked to both somatic and mental conditions while others were regarded as more specific to either somatic or mental disorders. Last, there was a range of correlations between endorsement of belief types and factors defining heterogeneity within the community, including with demographic factors, indicators of modernizing and acculturative processes, and the current presence of illness. Results supported the classification of causal beliefs proposed by Murdock, Wilson & Frederick, with a division into natural and supernatural causes. While belief in natural causes is more common, belief in supernatural causes persists despite modernizing and acculturative influences. Different

  12. Evolutionary analysis of vision genes identifies potential drivers of visual differences between giraffe and okapi

    PubMed Central

    Agaba, Morris; Cavener, Douglas R.

    2017-01-01

    Background The capacity of visually oriented species to perceive and respond to visual signal is integral to their evolutionary success. Giraffes are closely related to okapi, but the two species have broad range of phenotypic differences including their visual capacities. Vision studies rank giraffe’s visual acuity higher than all other artiodactyls despite sharing similar vision ecological determinants with many of them. The extent to which the giraffe’s unique visual capacity and its difference with okapi is reflected by changes in their vision genes is not understood. Methods The recent availability of giraffe and okapi genomes provided opportunity to identify giraffe and okapi vision genes. Multiple strategies were employed to identify thirty-six candidate mammalian vision genes in giraffe and okapi genomes. Quantification of selection pressure was performed by a combination of branch-site tests of positive selection and clade models of selection divergence through comparing giraffe and okapi vision genes and orthologous sequences from other mammals. Results Signatures of selection were identified in key genes that could potentially underlie giraffe and okapi visual adaptations. Importantly, some genes that contribute to optical transparency of the eye and those that are critical in light signaling pathway were found to show signatures of adaptive evolution or selection divergence. Comparison between giraffe and other ruminants identifies significant selection divergence in CRYAA and OPN1LW. Significant selection divergence was identified in SAG while positive selection was detected in LUM when okapi is compared with ruminants and other mammals. Sequence analysis of OPN1LW showed that at least one of the sites known to affect spectral sensitivity of the red pigment is uniquely divergent between giraffe and other ruminants. Discussion By taking a systemic approach to gene function in vision, the results provide the first molecular clues associated with

  13. Evolutionary analysis of vision genes identifies potential drivers of visual differences between giraffe and okapi.

    PubMed

    Ishengoma, Edson; Agaba, Morris; Cavener, Douglas R

    2017-01-01

    The capacity of visually oriented species to perceive and respond to visual signal is integral to their evolutionary success. Giraffes are closely related to okapi, but the two species have broad range of phenotypic differences including their visual capacities. Vision studies rank giraffe's visual acuity higher than all other artiodactyls despite sharing similar vision ecological determinants with many of them. The extent to which the giraffe's unique visual capacity and its difference with okapi is reflected by changes in their vision genes is not understood. The recent availability of giraffe and okapi genomes provided opportunity to identify giraffe and okapi vision genes. Multiple strategies were employed to identify thirty-six candidate mammalian vision genes in giraffe and okapi genomes. Quantification of selection pressure was performed by a combination of branch-site tests of positive selection and clade models of selection divergence through comparing giraffe and okapi vision genes and orthologous sequences from other mammals. Signatures of selection were identified in key genes that could potentially underlie giraffe and okapi visual adaptations. Importantly, some genes that contribute to optical transparency of the eye and those that are critical in light signaling pathway were found to show signatures of adaptive evolution or selection divergence. Comparison between giraffe and other ruminants identifies significant selection divergence in CRYAA and OPN1LW . Significant selection divergence was identified in SAG while positive selection was detected in LUM when okapi is compared with ruminants and other mammals. Sequence analysis of OPN1LW showed that at least one of the sites known to affect spectral sensitivity of the red pigment is uniquely divergent between giraffe and other ruminants. By taking a systemic approach to gene function in vision, the results provide the first molecular clues associated with giraffe and okapi vision adaptations. At

  14. Combining GWAS and RNA-Seq Approaches for Detection of the Causal Mutation for Hereditary Junctional Epidermolysis Bullosa in Sheep

    PubMed Central

    Suárez-Vega, Aroa; Gutiérrez-Gil, Beatriz; Benavides, Julio; Perez, Valentín; Tosser-Klopp, Gwenola; Klopp, Christophe; Keennel, Stephen J.; Arranz, Juan José

    2015-01-01

    In this study, we demonstrate the use of a genome-wide association mapping together with RNA-seq in a reduced number of samples, as an efficient approach to detect the causal mutation for a Mendelian disease. Junctional epidermolysis bullosa is a recessive genodermatosis that manifests with neonatal mechanical fragility of the skin, blistering confined to the lamina lucida of the basement membrane and severe alteration of the hemidesmosomal junctions. In Spanish Churra sheep, junctional epidermolysis bullosa (JEB) has been detected in two commercial flocks. The JEB locus was mapped to Ovis aries chromosome 11 by GWAS and subsequently fine-mapped to an 868-kb homozygous segment using the identical-by-descent method. The ITGB4, which is located within this region, was identified as the best positional and functional candidate gene. The RNA-seq variant analysis enabled us to discover a 4-bp deletion within exon 33 of the ITGB4 gene (c.4412_4415del). The c.4412_4415del mutation causes a frameshift resulting in a premature stop codon at position 1472 of the integrin β4 protein. A functional analysis of this deletion revealed decreased levels of mRNA in JEB skin samples and the absence of integrin β4 labeling in immunohistochemical assays. Genotyping of c.4412_4415del showed perfect concordance with the recessive mode of the disease phenotype. Selection against this causal mutation will now be used to solve the problem of JEB in flocks of Churra sheep. Furthermore, the identification of the ITGB4 mutation means that affected sheep can be used as a large mammal animal model for the human form of epidermolysis bullosa with aplasia cutis. Our approach evidences that RNA-seq offers cost-effective alternative to identify variants in the species in which high resolution exome-sequencing is not straightforward. PMID:25955497

  15. Combining GWAS and RNA-Seq Approaches for Detection of the Causal Mutation for Hereditary Junctional Epidermolysis Bullosa in Sheep.

    PubMed

    Suárez-Vega, Aroa; Gutiérrez-Gil, Beatriz; Benavides, Julio; Perez, Valentín; Tosser-Klopp, Gwenola; Klopp, Christophe; Keennel, Stephen J; Arranz, Juan José

    2015-01-01

    In this study, we demonstrate the use of a genome-wide association mapping together with RNA-seq in a reduced number of samples, as an efficient approach to detect the causal mutation for a Mendelian disease. Junctional epidermolysis bullosa is a recessive genodermatosis that manifests with neonatal mechanical fragility of the skin, blistering confined to the lamina lucida of the basement membrane and severe alteration of the hemidesmosomal junctions. In Spanish Churra sheep, junctional epidermolysis bullosa (JEB) has been detected in two commercial flocks. The JEB locus was mapped to Ovis aries chromosome 11 by GWAS and subsequently fine-mapped to an 868-kb homozygous segment using the identical-by-descent method. The ITGB4, which is located within this region, was identified as the best positional and functional candidate gene. The RNA-seq variant analysis enabled us to discover a 4-bp deletion within exon 33 of the ITGB4 gene (c.4412_4415del). The c.4412_4415del mutation causes a frameshift resulting in a premature stop codon at position 1472 of the integrin β4 protein. A functional analysis of this deletion revealed decreased levels of mRNA in JEB skin samples and the absence of integrin β4 labeling in immunohistochemical assays. Genotyping of c.4412_4415del showed perfect concordance with the recessive mode of the disease phenotype. Selection against this causal mutation will now be used to solve the problem of JEB in flocks of Churra sheep. Furthermore, the identification of the ITGB4 mutation means that affected sheep can be used as a large mammal animal model for the human form of epidermolysis bullosa with aplasia cutis. Our approach evidences that RNA-seq offers cost-effective alternative to identify variants in the species in which high resolution exome-sequencing is not straightforward.

  16. Genome-Wide and Gene-Based Meta-Analyses Identify Novel Loci Influencing Blood Pressure Response to Hydrochlorothiazide.

    PubMed

    Salvi, Erika; Wang, Zhiying; Rizzi, Federica; Gong, Yan; McDonough, Caitrin W; Padmanabhan, Sandosh; Hiltunen, Timo P; Lanzani, Chiara; Zaninello, Roberta; Chittani, Martina; Bailey, Kent R; Sarin, Antti-Pekka; Barcella, Matteo; Melander, Olle; Chapman, Arlene B; Manunta, Paolo; Kontula, Kimmo K; Glorioso, Nicola; Cusi, Daniele; Dominiczak, Anna F; Johnson, Julie A; Barlassina, Cristina; Boerwinkle, Eric; Cooper-DeHoff, Rhonda M; Turner, Stephen T

    2017-01-01

    This study aimed to identify novel loci influencing the antihypertensive response to hydrochlorothiazide monotherapy. A genome-wide meta-analysis of blood pressure (BP) response to hydrochlorothiazide was performed in 1739 white hypertensives from 6 clinical trials within the International Consortium for Antihypertensive Pharmacogenomics Studies, making it the largest study to date of its kind. No signals reached genome-wide significance (P<5×10 - 8 ), and the suggestive regions (P<10 -5 ) were cross-validated in 2 black cohorts treated with hydrochlorothiazide. In addition, a gene-based analysis was performed on candidate genes with previous evidence of involvement in diuretic response, in BP regulation, or in hypertension susceptibility. Using the genome-wide meta-analysis approach, with validation in blacks, we identified 2 suggestive regulatory regions linked to gap junction protein α1 gene (GJA1) and forkhead box A1 gene (FOXA1), relevant for cardiovascular and kidney function. With the gene-based approach, we identified hydroxy-delta-5-steroid dehydrogenase, 3 β- and steroid δ-isomerase 1 gene (HSD3B1) as significantly associated with BP response (P<2.28×10 - 4 ). HSD3B1 encodes the 3β-hydroxysteroid dehydrogenase enzyme and plays a crucial role in the biosynthesis of aldosterone and endogenous ouabain. By amassing all of the available pharmacogenomic studies of BP response to hydrochlorothiazide, and using 2 different analytic approaches, we identified 3 novel loci influencing BP response to hydrochlorothiazide. The gene-based analysis, never before applied to pharmacogenomics of antihypertensive drugs to our knowledge, provided a powerful strategy to identify a locus of interest, which was not identified in the genome-wide meta-analysis because of high allelic heterogeneity. These data pave the way for future investigations on new pathways and drug targets to enhance the current understanding of personalized antihypertensive treatment. © 2016

  17. Sufficiency and Necessity Assumptions in Causal Structure Induction

    ERIC Educational Resources Information Center

    Mayrhofer, Ralf; Waldmann, Michael R.

    2016-01-01

    Research on human causal induction has shown that people have general prior assumptions about causal strength and about how causes interact with the background. We propose that these prior assumptions about the parameters of causal systems do not only manifest themselves in estimations of causal strength or the selection of causes but also when…

  18. Exploring Individual Differences in Preschoolers' Causal Stance

    ERIC Educational Resources Information Center

    Alvarez, Aubry; Booth, Amy E.

    2016-01-01

    Preschoolers, as a group, are highly attuned to causality, and this attunement is known to facilitate memory, learning, and problem solving. However, recent work reveals substantial individual variability in the strength of children's "causal stance," as demonstrated by their curiosity about and preference for new causal information. In…

  19. Constraints on Children's Judgments of Magical Causality

    ERIC Educational Resources Information Center

    Woolley, Jacqueline D.; Browne, Cheryl A.; Boerger, Elizabeth A.

    2006-01-01

    In 3 studies we addressed the operation of constraints on children's causal judgments. Our primary focus was whether children's beliefs about magical causality, specifically wishing, are constrained by features that govern the attribution of ordinary causality. In Experiment 1, children witnessed situations in which a confederate's wish appeared…

  20. Identifying signatures of positive selection in pigmentation genes in two South Asian populations.

    PubMed

    Jonnalagadda, Manjari; Bharti, Neeraj; Patil, Yatish; Ozarkar, Shantanu; K, Sunitha Manjari; Joshi, Rajendra; Norton, Heather

    2017-09-10

    Skin pigmentation is a polygenic trait showing wide phenotypic variations among global populations. While numerous pigmentation genes have been identified to be under positive selection among European and East populations, genes contributing to phenotypic variation in skin pigmentation within and among South Asian populations are still poorly understood. The present study uses data from the Phase 3 of the 1000 genomes project focusing on two South Asian populations-GIH (Gujarati Indian from Houston, Texas) and ITU (Indian Telugu from UK), so as to decode the genetic architecture involved in adaptation to ultraviolet radiation in South Asian populations. Statistical tests included were (1) tests to identify deviations of the Site Frequency Spectrum (SFS) from neutral expectations (Tajima's D, Fay and Wu's H and Fu and Li's D* and F*), (2) tests focused on the identification of high-frequency haplotypes with extended linkage disequilibrium (iHS and Rsb), and (3) tests based on genetic differentiation between populations (LSBL). Twenty-two pigmentation genes fall in the top 1% for at least one statistic in the GIH population, 5 of which (LYST, OCA2, SLC24A5, SLC45A2, and TYR) have been previously associated with normal variation in skin, hair, or eye color. In comparison, 17 genes fall in the top 1% for at least one statistic in the ITU population. Twelve loci which are identified as outliers in the ITU scan were also identified in the GIH population. These results suggest that selection may have affected these loci broadly across the region. © 2017 Wiley Periodicals, Inc.

  1. Refined mapping of autoimmune disease associated genetic variants with gene expression suggests an important role for non-coding RNAs.

    PubMed

    Ricaño-Ponce, Isis; Zhernakova, Daria V; Deelen, Patrick; Luo, Oscar; Li, Xingwang; Isaacs, Aaron; Karjalainen, Juha; Di Tommaso, Jennifer; Borek, Zuzanna Agnieszka; Zorro, Maria M; Gutierrez-Achury, Javier; Uitterlinden, Andre G; Hofman, Albert; van Meurs, Joyce; Netea, Mihai G; Jonkers, Iris H; Withoff, Sebo; van Duijn, Cornelia M; Li, Yang; Ruan, Yijun; Franke, Lude; Wijmenga, Cisca; Kumar, Vinod

    2016-04-01

    Genome-wide association and fine-mapping studies in 14 autoimmune diseases (AID) have implicated more than 250 loci in one or more of these diseases. As more than 90% of AID-associated SNPs are intergenic or intronic, pinpointing the causal genes is challenging. We performed a systematic analysis to link 460 SNPs that are associated with 14 AID to causal genes using transcriptomic data from 629 blood samples. We were able to link 71 (39%) of the AID-SNPs to two or more nearby genes, providing evidence that for part of the AID loci multiple causal genes exist. While 54 of the AID loci are shared by one or more AID, 17% of them do not share candidate causal genes. In addition to finding novel genes such as ULK3, we also implicate novel disease mechanisms and pathways like autophagy in celiac disease pathogenesis. Furthermore, 42 of the AID SNPs specifically affected the expression of 53 non-coding RNA genes. To further understand how the non-coding genome contributes to AID, the SNPs were linked to functional regulatory elements, which suggest a model where AID genes are regulated by network of chromatin looping/non-coding RNAs interactions. The looping model also explains how a causal candidate gene is not necessarily the gene closest to the AID SNP, which was the case in nearly 50% of cases. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  2. Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture

    PubMed Central

    González-Plaza, Juan J.; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F.; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R.; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R.

    2016-01-01

    Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species. PMID:26973682

  3. The Development of Causal Categorization

    ERIC Educational Resources Information Center

    Hayes, Brett K.; Rehder, Bob

    2012-01-01

    Two experiments examined the impact of causal relations between features on categorization in 5- to 6-year-old children and adults. Participants learned artificial categories containing instances with causally related features and noncausal features. They then selected the most likely category member from a series of novel test pairs.…

  4. Transcriptomic analysis of genetically defined autism candidate genes reveals common mechanisms of action

    PubMed Central

    2013-01-01

    Background Austism spectrum disorder (ASD) is a heterogeneous behavioral disorder or condition characterized by severe impairment of social engagement and the presence of repetitive activities. The molecular etiology of ASD is still largely unknown despite a strong genetic component. Part of the difficulty in turning genetics into disease mechanisms and potentially new therapeutics is the sheer number and diversity of the genes that have been associated with ASD and ASD symptoms. The goal of this work is to use shRNA-generated models of genetic defects proposed as causative for ASD to identify the common pathways that might explain how they produce a core clinical disability. Methods Transcript levels of Mecp2, Mef2a, Mef2d, Fmr1, Nlgn1, Nlgn3, Pten, and Shank3 were knocked-down in mouse primary neuron cultures using shRNA constructs. Whole genome expression analysis was conducted for each of the knockdown cultures as well as a mock-transduced culture and a culture exposed to a lentivirus expressing an anti-luciferase shRNA. Gene set enrichment and a causal reasoning engine was employed to identify pathway level perturbations generated by the transcript knockdown. Results Quantification of the shRNA targets confirmed the successful knockdown at the transcript and protein levels of at least 75% for each of the genes. After subtracting out potential artifacts caused by viral infection, gene set enrichment and causal reasoning engine analysis showed that a significant number of gene expression changes mapped to pathways associated with neurogenesis, long-term potentiation, and synaptic activity. Conclusions This work demonstrates that despite the complex genetic nature of ASD, there are common molecular mechanisms that connect many of the best established autism candidate genes. By identifying the key regulatory checkpoints in the interlinking transcriptional networks underlying autism, we are better able to discover the ideal points of intervention that provide the

  5. Transcriptomic analysis of genetically defined autism candidate genes reveals common mechanisms of action.

    PubMed

    Lanz, Thomas A; Guilmette, Edward; Gosink, Mark M; Fischer, James E; Fitzgerald, Lawrence W; Stephenson, Diane T; Pletcher, Mathew T

    2013-11-15

    Austism spectrum disorder (ASD) is a heterogeneous behavioral disorder or condition characterized by severe impairment of social engagement and the presence of repetitive activities. The molecular etiology of ASD is still largely unknown despite a strong genetic component. Part of the difficulty in turning genetics into disease mechanisms and potentially new therapeutics is the sheer number and diversity of the genes that have been associated with ASD and ASD symptoms. The goal of this work is to use shRNA-generated models of genetic defects proposed as causative for ASD to identify the common pathways that might explain how they produce a core clinical disability. Transcript levels of Mecp2, Mef2a, Mef2d, Fmr1, Nlgn1, Nlgn3, Pten, and Shank3 were knocked-down in mouse primary neuron cultures using shRNA constructs. Whole genome expression analysis was conducted for each of the knockdown cultures as well as a mock-transduced culture and a culture exposed to a lentivirus expressing an anti-luciferase shRNA. Gene set enrichment and a causal reasoning engine was employed to identify pathway level perturbations generated by the transcript knockdown. Quantification of the shRNA targets confirmed the successful knockdown at the transcript and protein levels of at least 75% for each of the genes. After subtracting out potential artifacts caused by viral infection, gene set enrichment and causal reasoning engine analysis showed that a significant number of gene expression changes mapped to pathways associated with neurogenesis, long-term potentiation, and synaptic activity. This work demonstrates that despite the complex genetic nature of ASD, there are common molecular mechanisms that connect many of the best established autism candidate genes. By identifying the key regulatory checkpoints in the interlinking transcriptional networks underlying autism, we are better able to discover the ideal points of intervention that provide the broadest efficacy across the diverse

  6. Establishing causal coherence across sentences: an ERP study

    PubMed Central

    Kuperberg, Gina R.; Paczynski, Martin; Ditman, Tali

    2011-01-01

    This study examined neural activity associated with establishing causal relationships across sentences during online comprehension. ERPs were measured while participants read and judged the relatedness of three-sentence scenarios in which the final sentence was highly causally related, intermediately related and causally unrelated to its context. Lexico-semantic co-occurrence was matched across the three conditions using a Latent Semantic Analysis. Critical words in causally unrelated scenarios evoked a larger N400 than words in both highly causally related and intermediately related scenarios, regardless of whether they appeared before or at the sentence-final position. At midline sites, the N400 to intermediately related sentence-final words was attenuated to the same degree as to highly causally related words, but otherwise the N400 to intermediately related words fell in between that evoked by highly causally related and intermediately related words. No modulation of the Late Positivity/P600 component was observed across conditions. These results indicate that both simple and complex causal inferences can influence the earliest stages of semantically processing an incoming word. Further, they suggest that causal coherence, at the situation level, can influence incremental word-by-word discourse comprehension, even when semantic relationships between individual words are matched. PMID:20175676

  7. Granger-causality maps of diffusion processes.

    PubMed

    Wahl, Benjamin; Feudel, Ulrike; Hlinka, Jaroslav; Wächter, Matthias; Peinke, Joachim; Freund, Jan A

    2016-02-01

    Granger causality is a statistical concept devised to reconstruct and quantify predictive information flow between stochastic processes. Although the general concept can be formulated model-free it is often considered in the framework of linear stochastic processes. Here we show how local linear model descriptions can be employed to extend Granger causality into the realm of nonlinear systems. This novel treatment results in maps that resolve Granger causality in regions of state space. Through examples we provide a proof of concept and illustrate the utility of these maps. Moreover, by integration we convert the local Granger causality into a global measure that yields a consistent picture for a global Ornstein-Uhlenbeck process. Finally, we recover invariance transformations known from the theory of autoregressive processes.

  8. Robust Principal Component Analysis Regularized by Truncated Nuclear Norm for Identifying Differentially Expressed Genes.

    PubMed

    Wang, Ya-Xuan; Gao, Ying-Lian; Liu, Jin-Xing; Kong, Xiang-Zhen; Li, Hai-Jun

    2017-09-01

    Identifying differentially expressed genes from the thousands of genes is a challenging task. Robust principal component analysis (RPCA) is an efficient method in the identification of differentially expressed genes. RPCA method uses nuclear norm to approximate the rank function. However, theoretical studies showed that the nuclear norm minimizes all singular values, so it may not be the best solution to approximate the rank function. The truncated nuclear norm is defined as the sum of some smaller singular values, which may achieve a better approximation of the rank function than nuclear norm. In this paper, a novel method is proposed by replacing nuclear norm of RPCA with the truncated nuclear norm, which is named robust principal component analysis regularized by truncated nuclear norm (TRPCA). The method decomposes the observation matrix of genomic data into a low-rank matrix and a sparse matrix. Because the significant genes can be considered as sparse signals, the differentially expressed genes are viewed as the sparse perturbation signals. Thus, the differentially expressed genes can be identified according to the sparse matrix. The experimental results on The Cancer Genome Atlas data illustrate that the TRPCA method outperforms other state-of-the-art methods in the identification of differentially expressed genes.

  9. Genome-Wide Temporal Expression Profiling in Caenorhabditis elegans Identifies a Core Gene Set Related to Long-Term Memory.

    PubMed

    Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila

    2017-07-12

    The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.

  10. Temporal Information of Directed Causal Connectivity in Multi-Trial ERP Data using Partial Granger Causality.

    PubMed

    Youssofzadeh, Vahab; Prasad, Girijesh; Naeem, Muhammad; Wong-Lin, KongFatt

    2016-01-01

    Partial Granger causality (PGC) has been applied to analyse causal functional neural connectivity after effectively mitigating confounding influences caused by endogenous latent variables and exogenous environmental inputs. However, it is not known how this connectivity obtained from PGC evolves over time. Furthermore, PGC has yet to be tested on realistic nonlinear neural circuit models and multi-trial event-related potentials (ERPs) data. In this work, we first applied a time-domain PGC technique to evaluate simulated neural circuit models, and demonstrated that the PGC measure is more accurate and robust in detecting connectivity patterns as compared to conditional Granger causality and partial directed coherence, especially when the circuit is intrinsically nonlinear. Moreover, the connectivity in PGC settles faster into a stable and correct configuration over time. After method verification, we applied PGC to reveal the causal connections of ERP trials of a mismatch negativity auditory oddball paradigm. The PGC analysis revealed a significant bilateral but asymmetrical localised activity in the temporal lobe close to the auditory cortex, and causal influences in the frontal, parietal and cingulate cortical areas, consistent with previous studies. Interestingly, the time to reach a stable connectivity configuration (~250–300 ms) coincides with the deviation of ensemble ERPs of oddball from standard tones. Finally, using a sliding time window, we showed higher resolution dynamics of causal connectivity within an ERP trial. In summary, time-domain PGC is promising in deciphering directed functional connectivity in nonlinear and ERP trials accurately, and at a sufficiently early stage. This data-driven approach can reduce computational time, and determine the key architecture for neural circuit modeling.

  11. Drug Induced Liver Injury: Can Biomarkers Assist RUCAM in Causality Assessment?

    PubMed Central

    Teschke, Rolf; Schulze, Johannes; Eickhoff, Axel; Danan, Gaby

    2017-01-01

    Drug induced liver injury (DILI) is a potentially serious adverse reaction in a few susceptible individuals under therapy by various drugs. Health care professionals facing DILI are confronted with a wealth of drug-unrelated liver diseases with high incidence and prevalence rates, which can confound the DILI diagnosis. Searching for alternative causes is a key element of RUCAM (Roussel Uclaf Causality Assessment Method) to assess rigorously causality in suspected DILI cases. Diagnostic biomarkers as blood tests would be a great help to clinicians, regulators, and pharmaceutical industry would be more comfortable if, in addition to RUCAM, causality of DILI can be confirmed. High specificity and sensitivity are required for any diagnostic biomarker. Although some risk factors are available to evaluate liver safety of drugs in patients, no valid diagnostic or prognostic biomarker exists currently for idiosyncratic DILI when a liver injury occurred. Identifying a biomarker in idiosyncratic DILI requires detailed knowledge of cellular and biochemical disturbances leading to apoptosis or cell necrosis and causing leakage of specific products in blood. As idiosyncratic DILI is typically a human disease and hardly reproducible in animals, pathogenetic events and resulting possible biomarkers remain largely undisclosed. Potential new diagnostic biomarkers should be evaluated in patients with DILI and RUCAM-based established causality. In conclusion, causality assessment in cases of suspected idiosyncratic DILI is still best achieved using RUCAM since specific biomarkers as diagnostic blood tests that could enhance RUCAM results are not yet available. PMID:28398242

  12. DNA methylome profiling identifies novel methylated genes in African American patients with colorectal neoplasia.

    PubMed

    Ashktorab, Hassan; Daremipouran, M; Goel, Ajay; Varma, Sudhir; Leavitt, R; Sun, Xueguang; Brim, Hassan

    2014-04-01

    The identification of genes that are differentially methylated in colorectal cancer (CRC) has potential value for both diagnostic and therapeutic interventions specifically in high-risk populations such as African Americans (AAs). However, DNA methylation patterns in CRC, especially in AAs, have not been systematically explored and remain poorly understood. Here, we performed DNA methylome profiling to identify the methylation status of CpG islands within candidate genes involved in critical pathways important in the initiation and development of CRC. We used reduced representation bisulfite sequencing (RRBS) in colorectal cancer and adenoma tissues that were compared with DNA methylome from a healthy AA subject's colon tissue and peripheral blood DNA. The identified methylation markers were validated in fresh frozen CRC tissues and corresponding normal tissues from AA patients diagnosed with CRC at Howard University Hospital. We identified and validated the methylation status of 355 CpG sites located within 16 gene promoter regions associated with CpG islands. Fifty CpG sites located within CpG islands-in genes ATXN7L1 (2), BMP3 (7), EID3 (15), GAS7 (1), GPR75 (24), and TNFAIP2 (1)-were significantly hypermethylated in tumor vs. normal tissues (P<0.05). The methylation status of BMP3, EID3, GAS7, and GPR75 was confirmed in an independent, validation cohort. Ingenuity pathway analysis mapped three of these markers (GAS7, BMP3 and GPR) in the insulin and TGF-β1 network-the two key pathways in CRC. In addition to hypermethylated genes, our analysis also revealed that LINE-1 repeat elements were progressively hypomethylated in the normal-adenoma-cancer sequence. We conclude that DNA methylome profiling based on RRBS is an effective method for screening aberrantly methylated genes in CRC. While previous studies focused on the limited identification of hypermethylated genes, ours is the first study to systematically and comprehensively identify novel hypermethylated

  13. Cross-lagged relations between mentoring received from supervisors and employee OCBs: Disentangling causal direction and identifying boundary conditions.

    PubMed

    Eby, Lillian T; Butts, Marcus M; Hoffman, Brian J; Sauer, Julia B

    2015-07-01

    Although mentoring has documented relationships with employee attitudes and outcomes of interest to organizations, neither the causal direction nor boundary conditions of the relationship between mentoring and organizational citizenship behaviors (OCBs) has been fully explored. On the basis of Social Learning Theory (SLT; Bandura, 1977, 1986), we predicted that mentoring received by supervisors would causally precede OCBs, rather than employee OCBs resulting in the receipt of more mentoring from supervisors. Results from cross-lagged data collected at 2 points in time from 190 intact supervisor-employee dyads supported our predictions; however, only for OCBs directed at individuals (OCB-Is) and not for OCBs directed at the organization (OCB-Os). Further supporting our theoretical rationale for expecting mentoring to precede OCBs, we found that coworker support operates as a substitute for mentoring in predicting OCB-Is. By contrast, no moderating effects were found for perceived organizational support. The results are discussed in terms of theoretical implications for mentoring and OCB research, as well as practical suggestions for enhancing employee citizenship behaviors. (c) 2015 APA, all rights reserved).

  14. Dissecting the Gene Network of Dietary Restriction to Identify Evolutionarily Conserved Pathways and New Functional Genes

    PubMed Central

    Wuttke, Daniel; Connor, Richard; Vora, Chintan; Craig, Thomas; Li, Yang; Wood, Shona; Vasieva, Olga; Shmookler Reis, Robert; Tang, Fusheng; de Magalhães, João Pedro

    2012-01-01

    Dietary restriction (DR), limiting nutrient intake from diet without causing malnutrition, delays the aging process and extends lifespan in multiple organisms. The conserved life-extending effect of DR suggests the involvement of fundamental mechanisms, although these remain a subject of debate. To help decipher the life-extending mechanisms of DR, we first compiled a list of genes that if genetically altered disrupt or prevent the life-extending effects of DR. We called these DR–essential genes and identified more than 100 in model organisms such as yeast, worms, flies, and mice. In order for other researchers to benefit from this first curated list of genes essential for DR, we established an online database called GenDR (http://genomics.senescence.info/diet/). To dissect the interactions of DR–essential genes and discover the underlying lifespan-extending mechanisms, we then used a variety of network and systems biology approaches to analyze the gene network of DR. We show that DR–essential genes are more conserved at the molecular level and have more molecular interactions than expected by chance. Furthermore, we employed a guilt-by-association method to predict novel DR–essential genes. In budding yeast, we predicted nine genes related to vacuolar functions; we show experimentally that mutations deleting eight of those genes prevent the life-extending effects of DR. Three of these mutants (OPT2, FRE6, and RCR2) had extended lifespan under ad libitum, indicating that the lack of further longevity under DR is not caused by a general compromise of fitness. These results demonstrate how network analyses of DR using GenDR can be used to make phenotypically relevant predictions. Moreover, gene-regulatory circuits reveal that the DR–induced transcriptional signature in yeast involves nutrient-sensing, stress responses and meiotic transcription factors. Finally, comparing the influence of gene expression changes during DR on the interactomes of multiple

  15. Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.).

    PubMed

    Taylor, Candy M; Jost, Ricarda; Erskine, William; Nelson, Matthew N

    2016-01-01

    Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop) using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other potentially more

  16. Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.)

    PubMed Central

    Erskine, William; Nelson, Matthew N.

    2016-01-01

    Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop) using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other potentially more

  17. A screen to identify Drosophila genes required for integrin-mediated adhesion.

    PubMed Central

    Walsh, E P; Brown, N H

    1998-01-01

    Drosophila integrins have essential adhesive roles during development, including adhesion between the two wing surfaces. Most position-specific integrin mutations cause lethality, and clones of homozygous mutant cells in the wing do not adhere to the apposing surface, causing blisters. We have used FLP-FRT induced mitotic recombination to generate clones of randomly induced mutations in the F1 generation and screened for mutations that cause wing blisters. This phenotype is highly selective, since only 14 lethal complementation groups were identified in screens of the five major chromosome arms. Of the loci identified, 3 are PS integrin genes, 2 are blistered and bloated, and the remaining 9 appear to be newly characterized loci. All 11 nonintegrin loci are required on both sides of the wing, in contrast to integrin alpha subunit genes. Mutations in 8 loci only disrupt adhesion in the wing, similar to integrin mutations, while mutations in the 3 other loci cause additional wing defects. Mutations in 4 loci, like the strongest integrin mutations, cause a "tail-up" embryonic lethal phenotype, and mutant alleles of 1 of these loci strongly enhance an integrin mutation. Thus several of these loci are good candidates for genes encoding cytoplasmic proteins required for integrin function. PMID:9755209

  18. Multiple Causality: Consequences for Medical Practice

    PubMed Central

    Nydegger, Corinne N.

    1983-01-01

    When a scientifically trained health professional is called upon to deal with patients holding differing causal views of illness, the resulting lack of communication is frustrating to both. This discussion traces some implications for medical practice of significant cultural differences in two aspects of causal paradigms of illness: (1) terms accepted and (2) dimension or level of causality typically sought. The second is the more pervasive and intractable problem, having distinctive consequences for the role of curer, symptomatology, diagnosis and treatment. PMID:6858133

  19. Genomic Analyses Yield Markers for Identifying Agronomically Important Genes in Potato

    USDA-ARS?s Scientific Manuscript database

    This study explores the genetic architecture underling the potato evolution through a comprehensive assessment of wild and cultivated potato species based on the re-sequencing of 201 accessions of Solanum section Petota with >12 × genome coverage. We identified 450 domesticated genes, which showed e...

  20. [Antibibiotic resistance by nosocomial infections' causal agents].

    PubMed

    Salazar-Holguín, Héctor Daniel; Cisneros-Robledo, María Elena

    2016-01-01

    The antibibiotic resistance by nosocomial infections (NI) causal agents constitutes a seriously global problematic that involves the Mexican Institute of Social Security's Regional General Hospital 1 in Chihuahua, Mexico; although with special features that required to be specified and evaluated, in order to concrete an effective therapy. Observational, descriptive and prospective study; by means of active vigilance all along 2014 in order to detect the nosocomial infections, for epidemiologic study, culture and antibiogram to identify its causal agents and antibiotics resistance and sensitivity. Among 13527 hospital discharges, 1079 displayed NI (8 %), standed out: the related on vascular lines, of surgical site, pneumonia and urinal track; they added up two thirds of the total. We carried out culture and antibiogram about 300 of them (27.8 %); identifying 31 bacterian species, mainly seven of those (77.9 %): Escherichia coli, Staphylococcus aureus and epidermidis, Pseudomonas aeruginosa, Acinetobacter baumannii, Klebsiella pneumoniae and Enterobacter cloacae; showing multiresistance to 34 tested antibiotics, except in seven with low or without resistance at all: vancomycin, teicoplanin, linezolid, quinupristin-dalfopristin, piperacilin-tazobactam, amikacin and carbapenems. When we contrasted those results with the recommendations in the clinical practice guides, it aroused several contradictions; so they must be taken with reserves and has to be tested in each hospital, by means of cultures and antibiograms in practically every case of nosocomial infection.