Microbial genome-wide association studies: lessons from human GWAS.
Power, Robert A; Parkhill, Julian; de Oliveira, Tulio
2017-01-01
The reduced costs of sequencing have led to whole-genome sequences for a large number of microorganisms, enabling the application of microbial genome-wide association studies (GWAS). Given the successes of human GWAS in understanding disease aetiology and identifying potential drug targets, microbial GWAS are likely to further advance our understanding of infectious diseases. These advances include insights into pressing global health problems, such as antibiotic resistance and disease transmission. In this Review, we outline the methodologies of GWAS, the current state of the field of microbial GWAS, and how lessons from human GWAS can direct the future of the field.
Implications of genome-wide association studies in cancer therapeutics.
Patel, Jai N; McLeod, Howard L; Innocenti, Federico
2013-09-01
Genome wide association studies (GWAS) provide an agnostic approach to identifying potential genetic variants associated with disease susceptibility, prognosis of survival and/or predictive of drug response. Although these techniques are costly and interpretation of study results is challenging, they do allow for a more unbiased interrogation of the entire genome, resulting in the discovery of novel genes and understanding of novel biological associations. This review will focus on the implications of GWAS in cancer therapy, in particular germ-line mutations, including findings from major GWAS which have identified predictive genetic loci for clinical outcome and/or toxicity. Lessons and challenges in cancer GWAS are also discussed, including the need for functional analysis and replication, as well as future perspectives for biological and clinical utility. Given the large heterogeneity in response to cancer therapeutics, novel methods of identifying mechanisms and biology of variable drug response and ultimately treatment individualization will be indispensable. © 2013 The British Pharmacological Society.
The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog)
MacArthur, Jacqueline; Bowler, Emily; Cerezo, Maria; Gil, Laurent; Hall, Peggy; Hastings, Emma; Junkins, Heather; McMahon, Aoife; Milano, Annalisa; Morales, Joannella; Pendlington, Zoe May; Welter, Danielle; Burdett, Tony; Hindorff, Lucia; Flicek, Paul; Cunningham, Fiona; Parkinson, Helen
2017-01-01
The NHGRI-EBI GWAS Catalog has provided data from published genome-wide association studies since 2008. In 2015, the database was redesigned and relocated to EMBL-EBI. The new infrastructure includes a new graphical user interface (www.ebi.ac.uk/gwas/), ontology supported search functionality and an improved curation interface. These developments have improved the data release frequency by increasing automation of curation and providing scaling improvements. The range of available Catalog data has also been extended with structured ancestry and recruitment information added for all studies. The infrastructure improvements also support scaling for larger arrays, exome and sequencing studies, allowing the Catalog to adapt to the needs of evolving study design, genotyping technologies and user needs in the future. PMID:27899670
The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog).
MacArthur, Jacqueline; Bowler, Emily; Cerezo, Maria; Gil, Laurent; Hall, Peggy; Hastings, Emma; Junkins, Heather; McMahon, Aoife; Milano, Annalisa; Morales, Joannella; Pendlington, Zoe May; Welter, Danielle; Burdett, Tony; Hindorff, Lucia; Flicek, Paul; Cunningham, Fiona; Parkinson, Helen
2017-01-04
The NHGRI-EBI GWAS Catalog has provided data from published genome-wide association studies since 2008. In 2015, the database was redesigned and relocated to EMBL-EBI. The new infrastructure includes a new graphical user interface (www.ebi.ac.uk/gwas/), ontology supported search functionality and an improved curation interface. These developments have improved the data release frequency by increasing automation of curation and providing scaling improvements. The range of available Catalog data has also been extended with structured ancestry and recruitment information added for all studies. The infrastructure improvements also support scaling for larger arrays, exome and sequencing studies, allowing the Catalog to adapt to the needs of evolving study design, genotyping technologies and user needs in the future. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Hsu, Yi-Hsiang; Kiel, Douglas P
2012-10-01
The primary goals of genome-wide association studies (GWAS) are to discover new molecular and biological pathways involved in the regulation of bone metabolism that can be leveraged for drug development. In addition, the identified genetic determinants may be used to enhance current risk factor profiles. There have been more than 40 published GWAS on skeletal phenotypes, predominantly focused on dual-energy x-ray absorptiometry-derived bone mineral density (BMD) of the hip and spine. Sixty-six BMD loci have been replicated across all the published GWAS, confirming the highly polygenic nature of BMD variation. Only seven of the 66 previously reported genes (LRP5, SOST, ESR1, TNFRSF11B, TNFRSF11A, TNFSF11, PTH) from candidate gene association studies have been confirmed by GWAS. Among 59 novel BMD GWAS loci that have not been reported by previous candidate gene association studies, some have been shown to be involved in key biological pathways involving the skeleton, particularly Wnt signaling (AXIN1, LRP5, CTNNB1, DKK1, FOXC2, HOXC6, LRP4, MEF2C, PTHLH, RSPO3, SFRP4, TGFBR3, WLS, WNT3, WNT4, WNT5B, WNT16), bone development: ossification (CLCN7, CSF1, MEF2C, MEPE, PKDCC, PTHLH, RUNX2, SOX6, SOX9, SPP1, SP7), mesenchymal-stem-cell differentiation (FAM3C, MEF2C, RUNX2, SOX4, SOX9, SP7), osteoclast differentiation (JAG1, RUNX2), and TGF-signaling (FOXL1, SPTBN1, TGFBR3). There are still 30 BMD GWAS loci without prior molecular or biological evidence of their involvement in skeletal phenotypes. Other skeletal phenotypes that either have been or are being studied include hip geometry, bone ultrasound, quantitative computed tomography, high-resolution peripheral quantitative computed tomography, biochemical markers, and fractures such as vertebral, nonvertebral, hip, and forearm. Although several challenges lie ahead as GWAS moves into the next generation, there are prospects of new discoveries in skeletal biology. 
This review integrates findings from previous GWAS and provides a roadmap for future directions building on current GWAS successes.
Zhu, Qianqian; Shepherd, Lori; Lunetta, Kathryn L.; Yao, Song; Liu, Qian; Hu, Qiang; Haddad, Stephen A.; Sucheston-Campbell, Lara; Bensen, Jeannette T.; Bandera, Elisa V.; Rosenberg, Lynn; Liu, Song; Haiman, Christopher A.; Olshan, Andrew F.; Palmer, Julie R.; Ambrosone, Christine B.
2016-01-01
Leveraging population-distinct linkage disequilibrium (LD) patterns, trans-ethnic follow-up of variants discovered from genome-wide association studies (GWAS) has proved to be useful in facilitating the identification of bona fide causal variants. We previously developed the preferential LD approach, a novel method that successfully identified causal variants driving the GWAS signals within European-descent populations even when the causal variants were only weakly linked with the GWAS-discovered variants. To evaluate the performance of our approach in a trans-ethnic setting, we applied it to follow up breast cancer GWAS hits identified mostly from populations of European ancestry in African Americans (AA). We evaluated 74 breast cancer GWAS variants in 8,315 AA women from the African American Breast Cancer Epidemiology and Risk (AMBER) consortium. Only 27% of them were associated with breast cancer risk at significance level α=0.05, suggesting race-specificity of the identified breast cancer risk loci. We followed up on those replicated GWAS hits in the AMBER consortium utilizing the preferential LD approach, to search for causal variants or better breast cancer markers from the 1000 Genomes variant catalog. Our approach identified stronger breast cancer markers for 80% of the GWAS hits with at least nominal breast cancer association, and in 81% of these cases, the marker identified was among the top 10 of all 1000 Genomes variants in the corresponding locus. The results support trans-ethnic application of the preferential LD approach in search for candidate causal variants, and may have implications for future genetic research of breast cancer in AA women. PMID:27825120
Power considerations for λ inflation factor in meta-analyses of genome-wide association studies.
Georgiopoulos, Georgios; Evangelou, Evangelos
2016-05-19
The genomic control (GC) approach is extensively used to effectively control false positive signals due to population stratification in genome-wide association studies (GWAS). However, GC affects the statistical power of GWAS. The loss of power depends on the magnitude of the inflation factor (λ) that is used for GC. We simulated meta-analyses of different GWAS. Minor allele frequency (MAF) ranged from 0·001 to 0·5 and λ was sampled from two scenarios: (i) random scenario (empirically-derived distribution of real λ values) and (ii) selected scenario from simulation parameter modification. Adjustment for λ was considered under single correction (within study corrected standard errors) and double correction (additional λ corrected summary estimate). MAF was a pivotal determinant of observed power. In random λ scenario, double correction induced a symmetric power reduction in comparison to single correction. For MAF 1·2 and MAF >5%. Our results provide a quick but detailed index for power considerations of future meta-analyses of GWAS that enables a more flexible design from early steps based on the number of studies accumulated in different groups and the λ values observed in the single studies.
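The genomic control adjustment described in this abstract can be illustrated with a minimal sketch. It assumes the common convention that λ is the median observed 1-df chi-square statistic divided by the median of the null chi-square distribution (about 0.4549), and that the within-study ("single") correction multiplies standard errors by the square root of λ when λ exceeds 1. The function names are hypothetical and this is not the authors' simulation code.

```python
import math
import statistics

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def inflation_factor(z_scores):
    """Estimate lambda as median(observed chi-square) / median(null chi-square)."""
    chi2 = [z * z for z in z_scores]
    return statistics.median(chi2) / CHI2_1DF_MEDIAN


def gc_correct_se(se, lam):
    """Inflate a standard error by sqrt(lambda); values of lambda below 1 are left uncorrected."""
    return se * math.sqrt(max(lam, 1.0))


if __name__ == "__main__":
    z = [0.3, -1.2, 0.8, 2.1, -0.5, 1.7, -0.9]  # made-up test statistics
    lam = inflation_factor(z)
    print("lambda:", round(lam, 3))
    print("corrected SE:", round(gc_correct_se(0.05, lam), 4))
```

Under a double correction, the same adjustment would be applied a second time to the pooled estimate, using the λ observed across the meta-analysis results.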
Genome-wide association studies for the identification of biomarkers in metabolic diseases.
Pattin, Kristine A; Moore, Jason H
2010-01-01
The field of genetics as it relates to metabolic disorders such as obesity and type II diabetes is complicated, and along with the medical research community, great strides are being taken to begin to understand the biological and genetic underpinnings of these diseases, with the hope of improving therapeutic, diagnostic and preventive strategies. Although research on metabolic disorders has been continuing for decades, the completion of the Human Genome Project in 2003 and the International HapMap Project in 2005 gave rise to an abundance of research tools, such as genome-wide genotyping, which allow researchers to conduct genome-wide association studies (GWAS) for detecting genetic variants that confer increased or decreased susceptibility to such complex diseases. In this review, the complex nature of metabolic disorders is discussed, specifically obesity and type II diabetes, as well as the limitations of the GWAS as applied to these disorders. While acknowledging limitations of GWAS, it is hoped to provide an insight about how GWAS can be adapted and advantageous in the clinical setting, enhancing prevention, diagnosis and treatment of these diseases. To be able to use the GWAS in a clinical setting is a complex challenge, yet it is hoped that in the future this tool will ultimately allow the development of pharmaceutical options that are capable of targeting the cause of metabolic disorders, not just the symptoms themselves.
Iwata, Hiroyoshi; Hayashi, Takeshi; Terakami, Shingo; Takada, Norio; Sawamura, Yutaka; Yamamoto, Toshiya
2013-01-01
Although the potential of marker-assisted selection (MAS) in fruit tree breeding has been reported, bi-parental QTL mapping before MAS has hindered the introduction of MAS to fruit tree breeding programs. Genome-wide association studies (GWAS) are an alternative to bi-parental QTL mapping in long-lived perennials. Selection based on genomic predictions of breeding values (genomic selection: GS) is another alternative for MAS. This study examined the potential of GWAS and GS in pear breeding with 76 Japanese pear cultivars to detect significant associations of 162 markers with nine agronomic traits. We applied multilocus Bayesian models accounting for ordinal categorical phenotypes for GWAS and GS model training. Significant associations were detected at harvest time, black spot resistance and the number of spurs and two of the associations were closely linked to known loci. Genome-wide predictions for GS were accurate at the highest level (0.75) in harvest time, at medium levels (0.38–0.61) in resistance to black spot, firmness of flesh, fruit shape in longitudinal section, fruit size, acid content and number of spurs and at low levels (<0.2) in all soluble solid content and vigor of tree. Results suggest the potential of GWAS and GS for use in future breeding programs in Japanese pear. PMID:23641189
easyGWAS: A Cloud-Based Platform for Comparing the Results of Genome-Wide Association Studies.
Grimm, Dominik G; Roqueiro, Damian; Salomé, Patrice A; Kleeberger, Stefan; Greshake, Bastian; Zhu, Wangsheng; Liu, Chang; Lippert, Christoph; Stegle, Oliver; Schölkopf, Bernhard; Weigel, Detlef; Borgwardt, Karsten M
2017-01-01
The ever-growing availability of high-quality genotypes for a multitude of species has enabled researchers to explore the underlying genetic architecture of complex phenotypes at an unprecedented level of detail using genome-wide association studies (GWAS). The systematic comparison of results obtained from GWAS of different traits opens up new possibilities, including the analysis of pleiotropic effects. Other advantages that result from the integration of multiple GWAS are the ability to replicate GWAS signals and to increase statistical power to detect such signals through meta-analyses. In order to facilitate the simple comparison of GWAS results, we present easyGWAS, a powerful, species-independent online resource for computing, storing, sharing, annotating, and comparing GWAS. The easyGWAS tool supports multiple species, the uploading of private genotype data and summary statistics of existing GWAS, as well as advanced methods for comparing GWAS results across different experiments and data sets in an interactive and user-friendly interface. easyGWAS is also a public data repository for GWAS data and summary statistics and already includes published data and results from several major GWAS. We demonstrate the potential of easyGWAS with a case study of the model organism Arabidopsis thaliana, using flowering and growth-related traits. © 2016 American Society of Plant Biologists. All rights reserved.
Smith, Andrew J P; Deloukas, Panos; Munroe, Patricia B
2018-04-13
Over the last decade, genome-wide association studies (GWAS) have propelled the discovery of thousands of loci associated with complex diseases. The focus is now turning towards the function of these association signals, determining the causal variant(s) amongst those in strong linkage disequilibrium, and identifying their underlying mechanisms, such as long-range gene regulation. Genome-editing techniques utilising zinc-finger nucleases (ZFN), transcription activator-like effector nucleases (TALENs) and clustered regularly-interspaced short palindromic repeats with Cas9 nuclease (CRISPR-Cas9), are becoming the tools of choice to establish functionality for these variants, due to the ability to assess effects of single variants in vivo. This review will discuss examples of how these technologies have begun to aid functional analysis of GWAS loci for complex traits such as cardiovascular disease, type 2 diabetes, cancer, obesity and autoimmune disease. We focus on analysis of variants occurring within non-coding genomic regions, as these comprise the majority of GWAS variants, providing the greatest challenges to determining functionality, and compare editing strategies that provide different levels of evidence for variant functionality. The review describes molecular insights into some of these potentially causal variants, and how these may relate to the pathology of the trait, and looks towards future directions for these technologies in post-GWAS analysis, such as base-editing.
Cabrera, Claudia P; Ng, Fu Liang; Warren, Helen R; Barnes, Michael R; Munroe, Patricia B; Caulfield, Mark J
2015-01-01
Hypertension is a major risk factor for global mortality. Recent genome-wide association studies (GWAS) have led to successful identification of many genetic loci influencing blood pressure, although these studies account for less than 5% of heritability. While genetic discovery efforts continue, it is timely to pause and reflect on what information has been gained to date from reported loci. Knowledge from GWAS findings informs our understanding of the pathways and pleiotropy underpinning hypertension and aids in the identification of potential druggable targets. By reviewing blood pressure loci we aim to determine how much potential the current observations have for future clinical utility. The authors have declared no conflicts of interest for this article. © 2015 Wiley Periodicals, Inc.
Insights into the genetics of gastroesophageal reflux disease (GERD) and GERD-related disorders.
Böhmer, A C; Schumacher, J
2017-02-01
Gastroesophageal reflux disease (GERD) is associated with obesity and hiatal hernia, and often precedes the development of Barrett's esophagus (BE) and esophageal adenocarcinoma (EA). Epidemiological studies show that the global prevalence of GERD is increasing. GERD is a multifactorial disease with a complex genetic architecture. Genome-wide association studies (GWAS) have provided initial insights into the genetic background of GERD. The present review summarizes current knowledge of the genetics of GERD and a possible genetic overlap between GERD and BE and EA. The review discusses genes and cellular pathways that have been implicated through GWAS, and provides an outlook on how future molecular research will enhance understanding of GERD pathophysiology. © 2017 John Wiley & Sons Ltd.
The AraGWAS Catalog: a curated and standardized Arabidopsis thaliana GWAS catalog
Togninalli, Matteo; Seren, Ümit; Meng, Dazhe; Fitz, Joffrey; Nordborg, Magnus; Weigel, Detlef
2018-01-01
The abundance of high-quality genotype and phenotype data for the model organism Arabidopsis thaliana enables scientists to study the genetic architecture of many complex traits at an unprecedented level of detail using genome-wide association studies (GWAS). GWAS have been a great success in A. thaliana and many SNP-trait associations have been published. With the AraGWAS Catalog (https://aragwas.1001genomes.org) we provide a publicly available, manually curated and standardized GWAS catalog for all publicly available phenotypes from the central A. thaliana phenotype repository, AraPheno. All GWAS have been recomputed on the latest imputed genotype release of the 1001 Genomes Consortium using a standardized GWAS pipeline to ensure comparability between results. The catalog currently includes 167 phenotypes and more than 222 000 SNP-trait associations with P < 10^-4, of which 3887 are significantly associated using permutation-based thresholds. The AraGWAS Catalog can be accessed via a modern web-interface and provides various features to easily access, download and visualize the results and summary statistics across GWAS. PMID:29059333
Genomic regions associated with bovine milk fatty acids in both summer and winter milk samples
2012-01-01
Background: In this study we perform a genome-wide association study (GWAS) for bovine milk fatty acids from summer milk samples. This study replicates a previous study where we performed a GWAS for bovine milk fatty acids based on winter milk samples from the same population. Fatty acids from summer and winter milk are genetically similar traits and we therefore compare the regions detected in summer milk to the regions previously detected in winter milk GWAS to discover regions that explain genetic variation in both summer and winter milk. Results: The GWAS of summer milk samples resulted in 51 regions associated with one or more milk fatty acids. Results are in agreement with most associations that were previously detected in a GWAS of fatty acids from winter milk samples, including eight ‘new’ regions that were not considered in the individual studies. The high correlation between the –log10(P-values) and effects of SNPs that were found significant in both GWAS implies that the effects of the SNPs were similar on winter and summer milk fatty acids. Conclusions: The GWAS of fatty acids based on summer milk samples was in agreement with most of the associations detected in the GWAS of fatty acids based on winter milk samples. Associations that were in agreement between both GWAS are more likely to be involved in fatty acid synthesis compared to regions detected in only one GWAS and are therefore worthwhile to pursue in fine-mapping studies. PMID:23107417
2014-01-01
Background: Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. Methods: 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 695,193 SNPs were conducted using UNPHASED, which combines information across families and unrelated individuals. We attempted to replicate signals found in 23 genomic regions using existing data on nonoverlapping samples from the Psychiatric GWAS Consortium and Schizophrenia-GENE-plus cohorts (10,352 schizophrenia patients and 24,474 controls). Results: No individual SNP showed compelling evidence for association with psychosis in our data. However, we observed a trend for association with same risk alleles at loci previously associated with schizophrenia (one-sided p = .003). A polygenic score analysis found that the Psychiatric GWAS Consortium’s panel of SNPs associated with schizophrenia significantly predicted disease status in our sample (p = 5 × 10^-14) and explained approximately 2% of the phenotypic variance. Conclusions: Although narrowly defined phenotypes have their advantages, we believe new loci may also be discovered through meta-analysis across broad phenotypes. The novel statistical methodology we introduced to model effect size heterogeneity between studies should help future GWAS that combine association evidence from related phenotypes. Applying these approaches, we highlight three loci that warrant further investigation. We found that SNPs conveying risk for schizophrenia are also predictive of disease status in our data. PMID:23871474
Bramon, Elvira; Pirinen, Matti; Strange, Amy; Lin, Kuang; Freeman, Colin; Bellenguez, Céline; Su, Zhan; Band, Gavin; Pearson, Richard; Vukcevic, Damjan; Langford, Cordelia; Deloukas, Panos; Hunt, Sarah; Gray, Emma; Dronov, Serge; Potter, Simon C; Tashakkori-Ghanbaria, Avazeh; Edkins, Sarah; Bumpstead, Suzannah J; Arranz, Maria J; Bakker, Steven; Bender, Stephan; Bruggeman, Richard; Cahn, Wiepke; Chandler, David; Collier, David A; Crespo-Facorro, Benedicto; Dazzan, Paola; de Haan, Lieuwe; Di Forti, Marta; Dragović, Milan; Giegling, Ina; Hall, Jeremy; Iyegbe, Conrad; Jablensky, Assen; Kahn, René S; Kalaydjieva, Luba; Kravariti, Eugenia; Lawrie, Stephen; Linszen, Don H; Mata, Ignacio; McDonald, Colm; McIntosh, Andrew; Myin-Germeys, Inez; Ophoff, Roel A; Pariante, Carmine M; Paunio, Tiina; Picchioni, Marco; Ripke, Stephan; Rujescu, Dan; Sauer, Heinrich; Shaikh, Madiha; Sussmann, Jessika; Suvisaari, Jaana; Tosato, Sarah; Toulopoulou, Timothea; Van Os, Jim; Walshe, Muriel; Weisbrod, Matthias; Whalley, Heather; Wiersma, Durk; Blackwell, Jenefer M; Brown, Matthew A; Casas, Juan P; Corvin, Aiden; Duncanson, Audrey; Jankowski, Janusz A Z; Markus, Hugh S; Mathew, Christopher G; Palmer, Colin N A; Plomin, Robert; Rautanen, Anna; Sawcer, Stephen J; Trembath, Richard C; Wood, Nicholas W; Barroso, Ines; Peltonen, Leena; Lewis, Cathryn M; Murray, Robin M; Donnelly, Peter; Powell, John; Spencer, Chris C A
2014-03-01
Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 695,193 SNPs were conducted using UNPHASED, which combines information across families and unrelated individuals. We attempted to replicate signals found in 23 genomic regions using existing data on nonoverlapping samples from the Psychiatric GWAS Consortium and Schizophrenia-GENE-plus cohorts (10,352 schizophrenia patients and 24,474 controls). No individual SNP showed compelling evidence for association with psychosis in our data. However, we observed a trend for association with same risk alleles at loci previously associated with schizophrenia (one-sided p = .003). A polygenic score analysis found that the Psychiatric GWAS Consortium's panel of SNPs associated with schizophrenia significantly predicted disease status in our sample (p = 5 × 10^-14) and explained approximately 2% of the phenotypic variance. Although narrowly defined phenotypes have their advantages, we believe new loci may also be discovered through meta-analysis across broad phenotypes. The novel statistical methodology we introduced to model effect size heterogeneity between studies should help future GWAS that combine association evidence from related phenotypes. Applying these approaches, we highlight three loci that warrant further investigation. We found that SNPs conveying risk for schizophrenia are also predictive of disease status in our data. Copyright © 2014 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Zhu, Zhaozhong; Anttila, Verneri; Smoller, Jordan W; Lee, Phil H
2018-01-01
Advances in recent genome wide association studies (GWAS) suggest that pleiotropic effects on human complex traits are widespread. A number of classic and recent meta-analysis methods have been used to identify genetic loci with pleiotropic effects, but the overall performance of these methods is not well understood. In this work, we use extensive simulations and case studies of GWAS datasets to investigate the power and type-I error rates of ten meta-analysis methods. We specifically focus on three conditions commonly encountered in the studies of multiple traits: (1) extensive heterogeneity of genetic effects; (2) characterization of trait-specific association; and (3) inflated correlation of GWAS due to overlapping samples. Although the statistical power is highly variable under distinct study conditions, we found the superior power of several methods under diverse heterogeneity. In particular, classic fixed-effects model showed surprisingly good performance when a variant is associated with more than a half of study traits. As the number of traits with null effects increases, ASSET performed the best along with competitive specificity and sensitivity. With opposite directional effects, CPASSOC featured the first-rate power. However, caution is advised when using CPASSOC for studying genetically correlated traits with overlapping samples. We conclude with a discussion of unresolved issues and directions for future research.
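As an illustration of the classic fixed-effects model mentioned in this abstract, the following minimal sketch pools per-study (or per-trait) effect estimates by inverse-variance weighting. It assumes independent estimates with known standard errors and a normal approximation for the p-value. The function name and the input values are hypothetical, and the other methods compared in the paper (such as ASSET and CPASSOC) are not reproduced here.

```python
import math


def fixed_effects_meta(betas, ses):
    """Inverse-variance weighted fixed-effects pooling of effect estimates.

    Returns the pooled effect, its standard error, the z statistic,
    and a two-sided p-value from the normal approximation.
    """
    weights = [1.0 / (se * se) for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # equals 2 * (1 - Phi(|z|))
    return beta, se, z, p


if __name__ == "__main__":
    # Made-up effect sizes and standard errors for three studies.
    beta, se, z, p = fixed_effects_meta([0.12, 0.08, 0.15], [0.05, 0.04, 0.07])
    print(f"pooled beta={beta:.4f} se={se:.4f} z={z:.2f} p={p:.2e}")
```

Because every study receives a weight that depends only on its precision, this estimator is most powerful when effects share a direction and similar magnitude, which is consistent with the abstract's observation that it performs well when a variant is associated with more than half of the traits.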
Huo, Dezheng
2013-01-01
Numerous single nucleotide polymorphisms (SNPs) associated with breast cancer susceptibility have been identified by genome-wide association studies (GWAS). However, these SNPs were primarily discovered and validated in women of European and Asian ancestry. Because linkage disequilibrium is ancestry-dependent and heterogeneous among racial/ethnic populations, we evaluated common genetic variants at 22 GWAS-identified breast cancer susceptibility loci in a pooled sample of 1502 breast cancer cases and 1378 controls of African ancestry. None of the 22 GWAS index SNPs could be validated, challenging the direct generalizability of breast cancer risk variants identified in Caucasians or Asians to other populations. Novel breast cancer risk variants for women of African ancestry were identified in regions including 5p12 (odds ratio [OR] = 1.40, 95% confidence interval [CI] = 1.11–1.76; P = 0.004), 5q11.2 (OR = 1.22, 95% CI = 1.09–1.36; P = 0.00053) and 10p15.1 (OR = 1.22, 95% CI = 1.08–1.38; P = 0.0015). We also found positive association signals in three regions (6q25.1, 10q26.13 and 16q12.1–q12.2) previously confirmed by fine mapping in women of African ancestry. In addition, polygenic model indicated that eight best markers in this study, compared with 22 GWAS-identified SNPs, could better predict breast cancer risk in women of African ancestry (per-allele OR = 1.21, 95% CI = 1.16–1.27; P = 9.7 × 10^-16). Our results demonstrate that fine mapping is a powerful approach to better characterize the breast cancer risk alleles in diverse populations. Future studies and new GWAS in women of African ancestry hold promise to discover additional variants for breast cancer susceptibility with clinical implications throughout the African diaspora. PMID:23475944
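The reported odds ratios, confidence intervals and P values are linked by a simple relationship that can be sketched as follows. This assumes the intervals are Wald-type 95% intervals computed on the log-odds scale, which the abstract does not state, so the result is only an approximate consistency check. The function name is hypothetical.

```python
import math

Z_975 = 1.959964  # 97.5th percentile of the standard normal distribution


def p_from_or_ci(odds_ratio, ci_low, ci_high):
    """Approximate two-sided p-value from an OR and its 95% Wald CI.

    The standard error of log(OR) is recovered from the CI width,
    then a normal z-test is applied to log(OR).
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * Z_975)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return se, z, p


if __name__ == "__main__":
    # 5p12 signal from the abstract: OR = 1.40, 95% CI = 1.11-1.76.
    se, z, p = p_from_or_ci(1.40, 1.11, 1.76)
    print(f"se={se:.4f} z={z:.2f} p={p:.4f}")
```

For the 5p12 signal this approximation gives a p-value close to the reported P = 0.004. Small discrepancies are expected because the published OR and interval bounds are rounded.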
Shen, Changbing; Gao, Jing; Sheng, Yujun; Dou, Jinfa; Zhou, Fusheng; Zheng, Xiaodong; Ko, Randy; Tang, Xianfa; Zhu, Caihong; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Zhang, Xuejun
2016-01-01
Vitiligo is an autoimmune disease with a strong genetic component, characterized by areas of depigmented skin resulting from loss of epidermal melanocytes. Genetic factors are known to play key roles in vitiligo through discoveries in association studies and family studies. Previously, vitiligo susceptibility genes were mainly revealed through linkage analysis and candidate gene studies. Recently, our understanding of the genetic basis of vitiligo has been rapidly advancing through genome-wide association study (GWAS). More than 40 robust susceptible loci have been identified and confirmed to be associated with vitiligo by using GWAS. Most of these associated genes participate in important pathways involved in the pathogenesis of vitiligo. Many susceptible loci with unknown functions in the pathogenesis of vitiligo have also been identified, indicating that additional molecular mechanisms may contribute to the risk of developing vitiligo. In this review, we summarize the key loci that are of genome-wide significance, which have been shown to influence vitiligo risk. These genetic loci may help build the foundation for genetic diagnosis and personalize treatment for patients with vitiligo in the future. However, substantial additional studies, including gene-targeted and functional studies, are required to confirm the causality of the genetic variants and their biological relevance in the development of vitiligo. PMID:26870082
Further support for association between GWAS variant for positive emotion and reward systems.
Lancaster, T M; Ihssen, N; Brindley, L M; Linden, D E J
2017-01-31
A recent genome-wide association study (GWAS) identified a significant single-nucleotide polymorphism (SNP) for trait-positive emotion at rs322931 on chromosome 1, which was also associated with brain activation in the reward system of healthy individuals when observing positive stimuli in a functional magnetic resonance imaging (fMRI) study. In the current study, we aimed to further validate the role of variation at rs322931 in reward processing. Using a similar fMRI approach, we use two paradigms that elicit a strong ventral striatum (VS) blood oxygen-level-dependent (BOLD) response in a sample of young, healthy individuals (N=82). In the first study we use a similar picture-viewing task to the discovery sample (positive>neutral stimuli) to replicate an effect of the variant on emotion processing. In the second study we use a probabilistic reversal learning procedure to identify reward processing during decision-making under uncertainty (reward>punishment). In a region of interest (ROI) analysis of the bilateral VS, we show that the rs322931 genotype was associated with BOLD in the left VS during the positive>neutral contrast (ROI-corrected P = 0.045) and during the reward>punishment contrast (ROI-corrected P = 0.018), although the effect of passive picture viewing was in the opposite direction from that reported in the discovery sample. These findings suggest that the recently identified GWAS hit may influence positive emotion via individual differences in activity in the key hubs of the brain's reward system. Furthermore, these effects may not be limited to the passive viewing of positive emotional scenes, but may also be observed during dynamic decision-making. This study suggests that future studies of this GWAS locus may yield further insight into the biological mechanisms of psychopathologies characterised by deficits in reward processing and positive emotion.
Genomics-assisted breeding in fruit trees
Iwata, Hiroyoshi; Minamikawa, Mai F.; Kajiya-Kanegae, Hiromi; Ishimori, Motoyuki; Hayashi, Takeshi
2016-01-01
Recent advancements in genomic analysis technologies have opened up new avenues to promote the efficiency of plant breeding. Novel genomics-based approaches for plant breeding and genetics research, such as genome-wide association studies (GWAS) and genomic selection (GS), are useful, especially in fruit tree breeding. The breeding of fruit trees is hindered by their long generation time, large plant size, long juvenile phase, and the necessity to wait for the physiological maturity of the plant to assess the marketable product (fruit). In this article, we describe the potential of genomics-assisted breeding, which uses these novel genomics-based approaches, to break through these barriers in conventional fruit tree breeding. We first introduce the molecular marker systems and whole-genome sequence data that are available for fruit tree breeding. Next we introduce the statistical methods for biparental linkage and quantitative trait locus (QTL) mapping as well as GWAS and GS. We then review QTL mapping, GWAS, and GS studies conducted on fruit trees. We also review novel technologies for rapid generation advancement. Finally, we note the future prospects of genomics-assisted fruit tree breeding and problems that need to be overcome in the breeding. PMID:27069395
Genetic Structure of the Han Chinese Population Revealed by Genome-wide SNP Variation
Chen, Jieming; Zheng, Houfeng; Bei, Jin-Xin; Sun, Liangdan; Jia, Wei-hua; Li, Tao; Zhang, Furen; Seielstad, Mark; Zeng, Yi-Xin; Zhang, Xuejun; Liu, Jianjun
2009-01-01
Population stratification is a potential problem for genome-wide association studies (GWAS), confounding results and causing spurious associations. Hence, understanding how allele frequencies vary across geographic regions or among subpopulations is an important prelude to analyzing GWAS data. Using over 350,000 genome-wide autosomal SNPs in over 6000 Han Chinese samples from ten provinces of China, our study revealed a one-dimensional “north-south” population structure and a close correlation between geography and the genetic structure of the Han Chinese. The north-south population structure is consistent with the historical migration pattern of the Han Chinese population. Metropolitan cities in China were, however, more diffused “outliers,” probably because of the impact of modern migration of peoples. At a very local scale within the Guangdong province, we observed evidence of population structure among dialect groups, probably on account of endogamy within these dialects. Via simulation, we show that empirical levels of population structure observed across modern China can cause spurious associations in GWAS if not properly handled. In the Han Chinese, geographic matching is a good proxy for genetic matching, particularly in validation and candidate-gene studies in which population stratification cannot be directly accessed and accounted for because of the lack of genome-wide data, with the exception of the metropolitan cities, where geographical location is no longer a good indicator of ancestral origin. Our findings are important for designing GWAS in the Chinese population, an activity that is expected to intensify greatly in the near future. PMID:19944401
Genotype Imputation for Latinos Using the HapMap and 1000 Genomes Project Reference Panels.
Gao, Xiaoyi; Haritunians, Talin; Marjoram, Paul; McKean-Cowdin, Roberta; Torres, Mina; Taylor, Kent D; Rotter, Jerome I; Gauderman, William J; Varma, Rohit
2012-01-01
Genotype imputation is a vital tool in genome-wide association studies (GWAS) and meta-analyses of multiple GWAS results. Imputation enables researchers to increase genomic coverage and to pool data generated using different genotyping platforms. HapMap samples are often employed as the reference panel. More recently, the 1000 Genomes Project resource is becoming the primary source for reference panels. Multiple GWAS and meta-analyses are targeting Latinos, the most populous and fastest-growing minority group in the US. However, genotype imputation resources for Latinos are rather limited compared to individuals of European ancestry at present, largely because of the lack of good reference data. One choice of reference panel for Latinos is one derived from the population of Mexican individuals in Los Angeles contained in the HapMap Phase 3 project and the 1000 Genomes Project. However, a detailed evaluation of the quality of the imputed genotypes derived from the public reference panels has not yet been reported. Using simulation studies, the Illumina OmniExpress GWAS data from the Los Angeles Latino Eye Study and the MACH software package, we evaluated the accuracy of genotype imputation in Latinos. Our results show that the 1000 Genomes Project AMR + CEU + YRI reference panel provides the highest imputation accuracy for Latinos, and that also including Asian samples in the panel can reduce imputation accuracy. We also provide the imputation accuracy for each autosomal chromosome using the 1000 Genomes Project panel for Latinos. Our results serve as a guide to future imputation-based analyses in Latinos.
Prioritizing GWAS Results: A Review of Statistical Methods and Recommendations for Their Application
Cantor, Rita M.; Lange, Kenneth; Sinsheimer, Janet S.
2010-01-01
Genome-wide association studies (GWAS) have rapidly become a standard method for disease gene discovery. A substantial number of recent GWAS indicate that for most disorders, only a few common variants are implicated and the associated SNPs explain only a small fraction of the genetic risk. This review is written from the viewpoint that findings from the GWAS provide preliminary genetic information that is available for additional analysis by statistical procedures that accumulate evidence, and that these secondary analyses are very likely to provide valuable information that will help prioritize the strongest constellations of results. We review and discuss three analytic methods to combine preliminary GWAS statistics to identify genes, alleles, and pathways for deeper investigations. Meta-analysis seeks to pool information from multiple GWAS to increase the chances of finding true positives among the false positives and provides a way to combine associations across GWAS, even when the original data are unavailable. Testing for epistasis within a single GWAS study can identify the stronger results that are revealed when genes interact. Pathway analysis of GWAS results is used to prioritize genes and pathways within a biological context. Following a GWAS, association results can be assigned to pathways and tested in aggregate with computational tools and pathway databases. Reviews of published methods with recommendations for their application are provided within the framework for each approach. PMID:20074509
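As an illustration of how meta-analysis can pool summary statistics when the original genotype data are unavailable, the following is a minimal sketch of a fixed-effect inverse-variance meta-analysis. The review does not prescribe this particular estimator; the function name `inverse_variance_meta` and the example effect sizes are hypothetical and chosen only for illustration.

```python
import math


def inverse_variance_meta(betas, ses):
    """Fixed-effect inverse-variance meta-analysis of one SNP.

    betas: per-study effect estimates (e.g. log odds ratios)
    ses:   per-study standard errors, same order as betas
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and equal length")
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / (se ** 2) for se in ses]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal null.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Hypothetical summary statistics for one SNP from three studies.
beta, se, z, p = inverse_variance_meta([0.12, 0.08, 0.15], [0.05, 0.06, 0.07])
print(f"pooled beta={beta:.3f} se={se:.3f} z={z:.2f} p={p:.3g}")
```

Only the per-study estimates and standard errors are needed, which is why this style of pooling works on published summary statistics alone.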
van den Berg, Irene; Boichard, Didier; Lund, Mogens Sandø
2016-11-01
The objective of this study was to compare mapping precision and power of within-breed and multibreed genome-wide association studies (GWAS) and to compare the results obtained by the multibreed GWAS with 3 meta-analysis methods. The multibreed GWAS was expected to improve mapping precision compared with a within-breed GWAS because linkage disequilibrium is conserved over shorter distances across breeds than within breeds. The multibreed GWAS was also expected to increase detection power for quantitative trait loci (QTL) segregating across breeds. GWAS were performed for production traits in dairy cattle, using imputed full genome sequences of 16,031 bulls, originating from 6 French and Danish dairy cattle populations. Our results show that a multibreed GWAS can be a valuable tool for the detection and fine mapping of quantitative trait loci. The number of QTL detected with the multibreed GWAS was larger than the number detected by the within-breed GWAS, indicating an increase in power, especially when the 2 Holstein populations were combined. The largest number of QTL was detected when all populations were combined. The analysis combining all breeds was, however, dominated by Holstein, and QTL segregating in other breeds but not in Holstein were sometimes overshadowed by larger QTL segregating in Holstein. Therefore, the GWAS combining all breeds except Holstein was useful to detect such peaks. Combining all breeds except Holstein resulted in smaller QTL intervals on average, but this outcome was not the case when the Holstein populations were included in the analysis. Although no decrease in the average QTL size was observed, mapping precision did improve for several QTL. Out of 3 different multibreed meta-analysis methods, the weighted z-scores model resulted in the most similar results to the full multibreed GWAS and can be useful as an alternative to a full multibreed GWAS. 
Differences between the multibreed GWAS and the meta-analyses were larger when different breeds were combined than when the 2 Holstein populations were combined. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
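The weighted z-score approach mentioned above can be sketched as follows. The abstract does not state how the weights were chosen; this sketch assumes the common convention of weighting each population's z-score by the square root of its sample size, and the function name `weighted_z_meta` and the example numbers are hypothetical.

```python
import math


def weighted_z_meta(z_scores, sample_sizes):
    """Combine per-population z-scores for one variant.

    Assumption: weights are sqrt(sample size); other weightings are possible.
    Returns (combined_z, two_sided_p).
    """
    if len(z_scores) != len(sample_sizes) or not z_scores:
        raise ValueError("inputs must be non-empty and equal length")
    weights = [math.sqrt(n) for n in sample_sizes]
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    denominator = math.sqrt(sum(w * w for w in weights))
    combined_z = numerator / denominator
    # Two-sided p-value under a standard normal null.
    p = math.erfc(abs(combined_z) / math.sqrt(2.0))
    return combined_z, p


# Hypothetical within-breed z-scores and sample sizes for a single variant.
z, p = weighted_z_meta([2.5, 1.8, -0.4], [5000, 3000, 800])
print(f"combined z={z:.2f} p={p:.3g}")
```

Because z-scores are signed, effects in opposite directions across populations partly cancel, which is one reason such a meta-analysis can differ from a joint multibreed analysis.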
Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci
Brorsson, Caroline A.; Pociot, Flemming
2014-01-01
Long non-coding RNAs are a new class of non-coding RNAs that are implicated in many human diseases such as cancers, cardiovascular disorders, and inflammatory and autoimmune diseases such as Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systematically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with the disease phenotype. Disruption of lncRNA secondary structure due to the presence of GWAS SNPs provides valuable information that could be useful for future structure-function studies on lncRNAs. PMID:25144376
Impact of exome sequencing in inflammatory bowel disease
Cardinale, Christopher J; Kelsen, Judith R; Baldassano, Robert N; Hakonarson, Hakon
2013-01-01
Approaches to understanding the genetic contribution to inflammatory bowel disease (IBD) have continuously evolved from family- and population-based epidemiology, to linkage analysis, and most recently, to genome-wide association studies (GWAS). The next stage in this evolution seems to be the sequencing of the exome, that is, the regions of the human genome which encode proteins. The GWAS approach has been very fruitful in identifying at least 163 loci as being associated with IBD, and now, exome sequencing promises to take our genetic understanding to the next level. In this review we will discuss the possible contributions that can be made by an exome sequencing approach both at the individual patient level to aid with disease diagnosis and future therapies, as well as in advancing knowledge of the pathogenesis of IBD. PMID:24187447
SNP association study in PMS2-associated Lynch syndrome.
Ten Broeke, Sanne W; Elsayed, Fadwa A; Pagan, Lisa; Olderode-Berends, Maran J W; Garcia, Encarna Gomez; Gille, Hans J P; van Hest, Liselot P; Letteboer, Tom G W; van der Kolk, Lizet E; Mensenkamp, Arjen R; van Os, Theo A; Spruijt, Liesbeth; Redeker, Bert J W; Suerink, Manon; Vos, Yvonne J; Wagner, Anja; Wijnen, Juul T; Steyerberg, E W; Tops, Carli M J; van Wezel, Tom; Nielsen, Maartje
2017-11-17
Lynch syndrome (LS) patients are at high risk of developing colorectal cancer (CRC). Phenotypic variability might in part be explained by common susceptibility loci identified in Genome Wide Association Studies (GWAS). Previous studies focused mostly on MLH1, MSH2 and MSH6 carriers, with conflicting results. We aimed to determine the role of GWAS SNPs in PMS2 mutation carriers. A cohort study was performed in 507 PMS2 carriers (124 CRC cases), genotyped for 24 GWAS SNPs, including SNPs at 11q23.1 and 8q23.3. Hazard ratios (HRs) were calculated using a weighted Cox regression analysis to correct for ascertainment bias. Discrimination was assessed with a concordance statistic in a bootstrap cross-validation procedure. Individual SNPs only had non-significant associations with CRC occurrence with HRs lower than 2, although male carriers of allele A at rs1321311 (6p21.31) may have increased risk of CRC (HR = 2.1, 95% CI 1.2-3.0). A polygenic risk score (PRS) based on 24 HRs had an HR of 2.6 (95% CI 1.5-4.6) for the highest compared to the lowest quartile, but had no discriminative ability (c statistic 0.52). Previously suggested SNPs do not modify CRC risk in PMS2 carriers. Future large studies are needed for improved risk stratification among Lynch syndrome patients.
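To make the polygenic risk score (PRS) concrete, here is a minimal sketch that sums risk-allele counts weighted by the natural log of each SNP's hazard ratio. The abstract only says the PRS was "based on 24 HRs"; the log-HR weighting, the function name `polygenic_risk_score`, and the three example SNPs are assumptions for illustration, not the authors' stated procedure.

```python
import math


def polygenic_risk_score(allele_counts, hazard_ratios):
    """Weighted sum of risk-allele counts for one individual.

    allele_counts: number of risk alleles (0, 1 or 2) at each SNP
    hazard_ratios: per-allele hazard ratio for each SNP, same order
    Assumption: each SNP is weighted by ln(HR).
    """
    if len(allele_counts) != len(hazard_ratios):
        raise ValueError("allele_counts and hazard_ratios must align")
    return sum(count * math.log(hr)
               for count, hr in zip(allele_counts, hazard_ratios))


# Hypothetical carrier genotyped at three SNPs with illustrative HRs.
score = polygenic_risk_score([0, 1, 2], [1.2, 0.9, 1.1])
print(f"PRS = {score:.3f}")
```

Individuals can then be ranked by score and grouped into quartiles, which is the comparison (highest versus lowest quartile) reported in the abstract.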
Mirkovic, Bojan; Laurent, Claudine; Podlipski, Marc-Antoine; Frebourg, Thierry; Cohen, David; Gerardin, Priscille
2016-01-01
Suicidal behaviors (SBs), which range from suicidal ideation to suicide attempts and completed suicide, represent a fatal dimension of mental ill-health. The involvement of genetic risk factors in SB is supported by family, twin, and adoption studies. The aim of this paper is to review recent genetic association studies in SBs including (i) case–control studies, (ii) family-based association studies, and (iii) genome-wide association studies (GWAS). Various studies on genetic associations have tended to suggest that a number of genes [e.g., tryptophan hydroxylase, serotonin receptors and transporters, or brain-derived neurotrophic factors (BDNFs)] are linked to SBs, but these findings are not consistently supported by the results obtained. Although the candidate–gene approach is useful, it is hampered by the present state of knowledge concerning the pathophysiology of diseases. Interpretations of GWAS results are mostly hindered by a lack of annotation describing the functions of most variation throughout the genome. Association studies have addressed a wide range of single-nucleotide polymorphisms in numerous genes. We have included 104 such studies, of which 10 are family-based association studies and 11 are GWAS. Numerous meta-analyses of case–control studies have shown significant associations of SB with variants in the serotonin transporter gene (5-HTT or SLC6A4) and the tryptophan hydroxylase 1 gene (TPH1), but others report contradictory results. The gene encoding BDNF and its receptor (NTRK2) are also promising candidates. Only two of the GWAS showed any significant associations. Several pathways are mentioned in an attempt to understand the lack of reproducibility and the disappointing results. Consequently, we review and discuss here the following aspects: (i) sample characteristics and confounding factors; (ii) statistical limits; (iii) gene–gene interactions; (iv) gene, environment, and by time interactions; and (v) technological and theoretical limits. 
PMID:27721799
Adib-Samii, Poneh; Devan, William; Traylor, Matthew; Lanfranconi, Silvia; Zhang, Cathy R; Cloonan, Lisa; Falcone, Guido J; Radmanesh, Farid; Fitzpatrick, Kaitlin; Kanakis, Allison; Rothwell, Peter M; Sudlow, Cathie; Boncoraglio, Giorgio B; Meschia, James F; Levi, Chris; Dichgans, Martin; Bevan, Steve; Rosand, Jonathan; Rost, Natalia S; Markus, Hugh S
2015-02-01
Epidemiological studies suggest that white matter hyperintensities (WMH) are extremely heritable, but the underlying genetic variants are largely unknown. Pathophysiological heterogeneity is known to reduce the power of genome-wide association studies (GWAS). Hypertensive and nonhypertensive individuals with WMH might have different underlying pathologies. We used GWAS data to calculate the variance in WMH volume (WMHV) explained by common single nucleotide polymorphisms (SNPs) as a measure of heritability (SNP heritability [HSNP]) and tested the hypothesis that WMH heritability differs between hypertensive and nonhypertensive individuals. WMHV was measured on MRI in the stroke-free cerebral hemisphere of 2336 ischemic stroke cases with GWAS data. After adjustment for age and intracranial volume, we determined which cardiovascular risk factors were independent predictors of WMHV. We used the genome-wide complex trait analysis tool to estimate HSNP for WMHV overall and within subgroups stratified by risk factors found to be significant in multivariate analyses. A significant proportion of the variance of WMHV was attributable to common SNPs after adjustment for significant risk factors (HSNP=0.23; P=0.0026). HSNP estimates were higher among hypertensive individuals (HSNP=0.45; P=7.99×10⁻⁵); this increase was greater than expected by chance (P=0.012). In contrast, estimates were lower, and nonsignificant, in nonhypertensive individuals (HSNP=0.13; P=0.13). A quarter of variance is attributable to common SNPs, but this estimate was greater in hypertensive individuals. These findings suggest that the genetic architecture of WMH in ischemic stroke differs between hypertensives and nonhypertensives. Future WMHV GWAS studies may gain power by accounting for this interaction. © 2014 The Authors. Published on behalf of the American Heart Association, Inc., by Wolters Kluwer.
Yang, Jinliang; Jiang, Haiying; Yeh, Cheng-Ting; Yu, Jianming; Jeddeloh, Jeffrey A; Nettleton, Dan; Schnable, Patrick S
2015-11-01
Although approaches for performing genome-wide association studies (GWAS) are well developed, conventional GWAS requires high-density genotyping of large numbers of individuals from a diversity panel. Here we report a method for performing GWAS that does not require genotyping of large numbers of individuals. Instead XP-GWAS (extreme-phenotype GWAS) relies on genotyping pools of individuals from a diversity panel that have extreme phenotypes. This analysis measures allele frequencies in the extreme pools, enabling discovery of associations between genetic variants and traits of interest. This method was evaluated in maize (Zea mays) using the well-characterized kernel row number trait, which was selected to enable comparisons between the results of XP-GWAS and conventional GWAS. An exome-sequencing strategy was used to focus sequencing resources on genes and their flanking regions. A total of 0.94 million variants were identified and served as evaluation markers; comparisons among pools showed that 145 of these variants were statistically associated with the kernel row number phenotype. These trait-associated variants were significantly enriched in regions identified by conventional GWAS. XP-GWAS was able to resolve several linked QTL and detect trait-associated variants within a single gene under a QTL peak. XP-GWAS is expected to be particularly valuable for detecting genes or alleles responsible for quantitative variation in species for which extensive genotyping resources are not available, such as wild progenitors of crops, orphan crops, and other poorly characterized species such as those of ecological interest. © 2015 The Authors The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Zhang, Kunlin; Chang, Suhua; Cui, Sijia; Guo, Liyuan; Zhang, Liuyan; Wang, Jing
2011-07-01
Genome-wide association studies (GWAS) are widely used to identify genes involved in human complex diseases or other traits. One key challenge for GWAS data interpretation is to identify causal SNPs and provide strong evidence on how they affect the trait. Currently, research focuses on identifying candidate causal variants from the most significant SNPs of GWAS, while support for biological mechanisms as represented by pathways is lacking. Although pathway-based analysis (PBA) has been designed to identify disease-related pathways by analyzing the full list of SNPs from GWAS, it does not emphasize interpreting causal SNPs. To our knowledge, so far there is no web server available to solve the challenge for GWAS data interpretation within one analytical framework. ICSNPathway is developed to identify candidate causal SNPs and their corresponding candidate causal pathways from GWAS by integrating linkage disequilibrium (LD) analysis, functional SNP annotation and PBA. ICSNPathway provides a feasible solution to bridge the gap between GWAS and disease mechanism study by generating hypotheses of the form SNP → gene → pathway(s). The ICSNPathway server is freely available at http://icsnpathway.psych.ac.cn/.
Huang, Yen-Tsung; Liang, Liming; Moffatt, Miriam F; Cookson, William O C M; Lin, Xihong
2015-07-01
Genome-wide association studies (GWAS) have been a standard practice in identifying single nucleotide polymorphisms (SNPs) for disease susceptibility. We propose a new approach, termed integrative GWAS (iGWAS) that exploits the information of gene expressions to investigate the mechanisms of the association of SNPs with a disease phenotype, and to incorporate the family-based design for genetic association studies. Specifically, the relations among SNPs, gene expression, and disease are modeled within the mediation analysis framework, which allows us to disentangle the genetic effect on a disease phenotype into two parts: an effect mediated through a gene expression (mediation effect, ME) and an effect through other biological mechanisms or environment-mediated mechanisms (alternative effect, AE). We develop omnibus tests for the ME and AE that are robust to underlying true disease models. Numerical studies show that the iGWAS approach is able to facilitate discovering genetic association mechanisms, and outperforms the SNP-only method for testing genetic associations. We conduct a family-based iGWAS of childhood asthma that integrates genetic and genomic data. The iGWAS approach identifies six novel susceptibility genes (MANEA, MRPL53, LYCAT, ST8SIA4, NDFIP1, and PTCH1) using the omnibus test with false discovery rate less than 1%, whereas no gene using SNP-only analyses survives with the same cut-off. The iGWAS analyses further characterize that genetic effects of these genes are mostly mediated through their gene expressions. In summary, the iGWAS approach provides a new analytic framework to investigate the mechanism of genetic etiology, and identifies novel susceptibility genes of childhood asthma that were biologically meaningful. © 2015 WILEY PERIODICALS, INC.
Jin, Ying; Andersen, Genevieve; Yorgov, Daniel; Ferrara, Tracey M; Ben, Songtao; Brownson, Kelly M; Holland, Paulene J; Birlea, Stanca A; Siebert, Janet; Hartmann, Anke; Lienert, Anne; van Geel, Nanja; Lambert, Jo; Luiten, Rosalie M; Wolkerstorfer, Albert; Wietze van der Veen, J P; Bennett, Dorothy C; Taïeb, Alain; Ezzedine, Khaled; Kemp, E Helen; Gawkrodger, David J; Weetman, Anthony P; Kõks, Sulev; Prans, Ele; Kingo, Külli; Karelson, Maire; Wallace, Margaret R; McCormack, Wayne T; Overbeck, Andreas; Moretti, Silvia; Colucci, Roberta; Picardo, Mauro; Silverberg, Nanette B; Olsson, Mats; Valle, Yan; Korobko, Igor; Böhm, Markus; Lim, Henry W; Hamzavi, Iltefat; Zhou, Li; Mi, Qing-Sheng; Fain, Pamela R; Santorico, Stephanie A; Spritz, Richard A
2016-11-01
Vitiligo is an autoimmune disease in which depigmented skin results from the destruction of melanocytes, with epidemiological association with other autoimmune diseases. In previous linkage and genome-wide association studies (GWAS1 and GWAS2), we identified 27 vitiligo susceptibility loci in patients of European ancestry. We carried out a third GWAS (GWAS3) in European-ancestry subjects, with augmented GWAS1 and GWAS2 controls, genome-wide imputation, and meta-analysis of all three GWAS, followed by an independent replication. The combined analyses, with 4,680 cases and 39,586 controls, identified 23 new significantly associated loci and 7 suggestive loci. Most encode immune and apoptotic regulators, with some also associated with other autoimmune diseases, as well as several melanocyte regulators. Bioinformatic analyses indicate a predominance of causal regulatory variation, some of which corresponds to expression quantitative trait loci (eQTLs) at these loci. Together, the identified genes provide a framework for the genetic architecture and pathobiology of vitiligo, highlight relationships with other autoimmune diseases and melanoma, and offer potential targets for treatment.
Jin, Ying; Andersen, Genevieve; Yorgov, Daniel; Ferrara, Tracey M; Ben, Songtao; Brownson, Kelly M; Holland, Paulene J; Birlea, Stanca A; Siebert, Janet; Hartmann, Anke; Lienert, Anne; van Geel, Nanja; Lambert, Jo; Luiten, Rosalie M; Wolkerstorfer, Albert; van der Veen, JP Wietze; Bennett, Dorothy C; Taïeb, Alain; Ezzedine, Khaled; Kemp, E Helen; Gawkrodger, David J; Weetman, Anthony P; Kõks, Sulev; Prans, Ele; Kingo, Külli; Karelson, Maire; Wallace, Margaret R; McCormack, Wayne T; Overbeck, Andreas; Moretti, Silvia; Colucci, Roberta; Picardo, Mauro; Silverberg, Nanette B; Olsson, Mats; Valle, Yan; Korobko, Igor; Böhm, Markus; Lim, Henry W.; Hamzavi, Iltefat; Zhou, Li; Mi, Qing-Sheng; Fain, Pamela R.; Santorico, Stephanie A; Spritz, Richard A
2016-01-01
Vitiligo is an autoimmune disease in which depigmented skin results from destruction of melanocytes, with epidemiologic association with other autoimmune diseases. In previous linkage and genome-wide association studies (GWAS1, GWAS2), we identified 27 vitiligo susceptibility loci in patients of European (EUR) ancestry. We carried out a third GWAS (GWAS3) in EUR subjects, with augmented GWAS1 and GWAS2 controls, genome-wide imputation, and meta-analysis of all three GWAS, followed by an independent replication. The combined analyses, with 4,680 cases and 39,586 controls, identified 23 new loci and 7 suggestive loci, most encoding immune and apoptotic regulators, some also associated with other autoimmune diseases, as well as several melanocyte regulators. Bioinformatic analyses indicate a predominance of causal regulatory variation, some corresponding to eQTL at these loci. Together, the identified genes provide a framework for vitiligo genetic architecture and pathobiology, highlight relationships to other autoimmune diseases and melanoma, and offer potential targets for treatment. PMID:27723757
Privacy-preserving GWAS analysis on federated genomic datasets.
Constable, Scott D; Tang, Yuzhe; Wang, Shuang; Jiang, Xiaoqian; Chapin, Steve
2015-01-01
The biomedical community benefits from the increasing availability of genomic data to support meaningful scientific research, e.g., Genome-Wide Association Studies (GWAS). However, high quality GWAS usually requires a large number of samples, which can grow beyond the capability of a single institution. Federated genomic data analysis holds the promise of enabling cross-institution collaboration for effective GWAS, but it raises concerns about patient privacy and medical information confidentiality (as data are being exchanged across institutional boundaries), which inhibits its practical use. We present a privacy-preserving GWAS framework on federated genomic datasets. Our method is to layer the GWAS computations on top of secure multi-party computation (MPC) systems. This approach allows two parties in a distributed system to mutually perform secure GWAS computations without exposing their private data outside. We demonstrate our technique by implementing a framework for minor allele frequency counting and χ² statistic calculation, two typical computations used in GWAS. For efficient prototyping, we use a state-of-the-art MPC framework, i.e., Portable Circuit Format (PCF) 1. Our experimental results show promise in realizing both efficient and secure cross-institution GWAS computations.
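For readers unfamiliar with the two computations named above, the following sketch shows them in the clear, with no secure multi-party computation layer at all. It illustrates only the arithmetic that such a framework would evaluate securely; the function names, the 0/1/2 genotype coding, and the toy data are assumptions for illustration.

```python
def minor_allele_frequency(genotypes):
    """Minor allele frequency at one SNP.

    genotypes: per-individual count (0, 1 or 2) of the alternate allele.
    """
    if not genotypes:
        raise ValueError("genotypes must be non-empty")
    alt_freq = sum(genotypes) / (2 * len(genotypes))
    # The minor allele is whichever of the two alleles is rarer.
    return min(alt_freq, 1.0 - alt_freq)


def allelic_chi_square(case_genotypes, control_genotypes):
    """Pearson chi-square (1 d.f.) on the 2x2 allele-count table."""
    a = sum(case_genotypes)                    # alt alleles in cases
    b = 2 * len(case_genotypes) - a            # ref alleles in cases
    c = sum(control_genotypes)                 # alt alleles in controls
    d = 2 * len(control_genotypes) - c         # ref alleles in controls
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        return 0.0  # a row or column is empty, so no association is testable
    return n * (a * d - b * c) ** 2 / denominator


# Toy data: four cases and four controls genotyped at one SNP.
cases = [2, 1, 1, 2]
controls = [0, 1, 0, 0]
print(minor_allele_frequency(cases + controls))
print(allelic_chi_square(cases, controls))
```

Both quantities depend only on allele counts, which suggests why they are natural first targets for a secure protocol: each party could contribute counts without revealing individual-level genotypes.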
Dennis, Jessica; Medina-Rivera, Alejandra; Truong, Vinh; Antounians, Lina; Zwingerman, Nora; Carrasco, Giovana; Strug, Lisa; Wells, Phil; Trégouët, David-Alexandre; Morange, Pierre-Emmanuel; Wilson, Michael D; Gagnon, France
2017-07-01
Tissue factor pathway inhibitor (TFPI) regulates the formation of intravascular blood clots, which manifest clinically as ischemic heart disease, ischemic stroke, and venous thromboembolism (VTE). TFPI plasma levels are heritable, but the genetics underlying TFPI plasma level variability are poorly understood. Herein we report the first genome-wide association scan (GWAS) of TFPI plasma levels, conducted in 251 individuals from five extended French-Canadian families ascertained on VTE. To improve discovery, we also applied a hypothesis-driven (HD) GWAS approach that prioritized single nucleotide polymorphisms (SNPs) in (1) hemostasis pathway genes, and (2) vascular endothelial cell (EC) regulatory regions, which are among the highest expressers of TFPI. Our GWAS identified 131 SNPs with suggestive evidence of association (P-value < 5 × 10⁻⁸), but no SNPs reached the genome-wide threshold for statistical significance. Hemostasis pathway genes were not enriched for TFPI plasma level associated SNPs (global hypothesis test P-value = 0.147), but EC regulatory regions contained more TFPI plasma level associated SNPs than expected by chance (global hypothesis test P-value = 0.046). We therefore stratified our genome-wide SNPs, prioritizing those in EC regulatory regions via stratified false discovery rate (sFDR) control, and reranked the SNPs by q-value. The minimum q-value was 0.27, and the top-ranked SNPs did not show association evidence in the MARTHA replication sample of 1,033 unrelated VTE cases. Although this study did not result in new loci for TFPI, our work lays out a strategy to utilize epigenomic data in prioritization schemes for future GWAS studies. © 2017 WILEY PERIODICALS, INC.
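The stratify-then-rerank idea can be sketched as below. This is a simplified illustration, not the authors' pipeline: it uses Benjamini-Hochberg adjusted p-values as a stand-in for q-values, computes them separately within each stratum, and then ranks all SNPs together. The function names, stratum labels, and p-values are hypothetical.

```python
def bh_q_values(p_values):
    """Benjamini-Hochberg adjusted p-values, returned in input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    q = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q[i] = running_min
    return q


def stratified_q_values(p_values, strata):
    """Adjust p-values separately within each stratum label."""
    q = [None] * len(p_values)
    for label in set(strata):
        idx = [i for i, s in enumerate(strata) if s == label]
        adjusted = bh_q_values([p_values[i] for i in idx])
        for i, q_i in zip(idx, adjusted):
            q[i] = q_i
    return q


# Hypothetical SNPs: one in an EC regulatory region, three elsewhere.
p = [0.01, 0.04, 0.03, 0.02]
strata = ["ec_regulatory", "other", "other", "other"]
q = stratified_q_values(p, strata)
# Rerank SNP indices by q-value, smallest first.
ranking = sorted(range(len(p)), key=lambda i: q[i])
print(q, ranking)
```

A SNP in a small, signal-enriched stratum faces a lighter multiple-testing burden than it would genome-wide, so it can move up the ranking even when its raw p-value is unchanged.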
Jia, Peilin; Wang, Lily; Fanous, Ayman H.; Pato, Carlos N.; Edwards, Todd L.; Zhao, Zhongming
2012-01-01
With the recent success of genome-wide association studies (GWAS), a wealth of association data has been accumulated for more than 200 complex diseases/traits, creating a strong demand for data integration and interpretation. A combinatory analysis of multiple GWAS datasets, or an integrative analysis of GWAS data and other high-throughput data, has been particularly promising. In this study, we proposed an integrative analysis framework of multiple GWAS datasets by overlaying association signals onto the protein-protein interaction network, and demonstrated it using schizophrenia datasets. Building on a dense module search algorithm, we first searched for significantly enriched subnetworks for schizophrenia in each single GWAS dataset and then implemented a discovery-evaluation strategy to identify module genes with consistent association signals. We validated the module genes in an independent dataset, and also examined them through meta-analysis of the related SNPs using multiple GWAS datasets. As a result, we identified 205 module genes with a joint effect significantly associated with schizophrenia; these module genes included a number of well-studied candidate genes such as DISC1, GNA12, GNA13, GNAI1, GPR17, and GRIN2B. Further functional analysis suggested these genes are involved in neuron-related processes. Additionally, meta-analysis found that 18 SNPs in 9 module genes had P_meta < 1 × 10⁻⁴, including the gene HLA-DQA1 located in the MHC region on chromosome 6, which was reported in previous studies using the largest cohort of schizophrenia patients to date. These results demonstrated that our bi-directional network-based strategy is efficient for identifying disease-associated genes with modest signals in GWAS datasets. This approach can be applied to any other complex diseases/traits where multiple GWAS datasets are available. PMID:22792057
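A dense module search of the kind mentioned here can be sketched as a greedy expansion over the interaction network. The sketch below is a simplified illustration, not the authors' implementation: it assumes each gene carries a z-score, scores a module as the sum of z-scores divided by the square root of the module size, and only considers direct neighbors. The growth threshold `rate`, the toy network and the scores are all hypothetical, and the stopping rule presumes positive module scores.

```python
import math


def module_score(genes, z):
    """Combined score of a module: sum of gene z-scores over sqrt(size)."""
    return sum(z[g] for g in genes) / math.sqrt(len(genes))


def grow_module(seed, neighbors, z, rate=0.1):
    """Greedily add the best neighboring gene while the score keeps improving."""
    module = {seed}
    while True:
        current = module_score(module, z)
        candidates = {n for g in module for n in neighbors.get(g, ())} - module
        if not candidates:
            return module
        # sorted() makes tie-breaking deterministic.
        best = max(sorted(candidates),
                   key=lambda n: module_score(module | {n}, z))
        # Stop when the best addition does not raise the score by `rate`.
        if module_score(module | {best}, z) <= current * (1 + rate):
            return module
        module.add(best)


# Hypothetical protein-protein interaction network and gene-level z-scores.
z_scores = {"A": 2.0, "B": 1.8, "C": 0.1, "D": 2.5}
ppi = {"A": ["B", "C"], "B": ["A", "D"], "C": ["A"], "D": ["B"]}
module = grow_module("A", ppi, z_scores)
```

In this toy network the weakly associated gene C is left out, while B and D are pulled in because each raises the combined score by more than the threshold.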
GWASeq: targeted re-sequencing follow up to GWAS.
Salomon, Matthew P; Li, Wai Lok Sibon; Edlund, Christopher K; Morrison, John; Fortini, Barbara K; Win, Aung Ko; Conti, David V; Thomas, Duncan C; Duggan, David; Buchanan, Daniel D; Jenkins, Mark A; Hopper, John L; Gallinger, Steven; Le Marchand, Loïc; Newcomb, Polly A; Casey, Graham; Marjoram, Paul
2016-03-03
For the last decade the conceptual framework of the Genome-Wide Association Study (GWAS) has dominated the investigation of human disease and other complex traits. While GWAS have been successful in identifying a large number of variants associated with various phenotypes, the overall amount of heritability explained by these variants remains small. This raises the question of how best to follow up on a GWAS, localize causal variants accounting for GWAS hits, and as a consequence explain more of the so-called "missing" heritability. Advances in high throughput sequencing technologies now allow for the efficient and cost-effective collection of vast amounts of fine-scale genomic data to complement GWAS. We investigate these issues using a colon cancer dataset. After QC, our data consisted of 1993 cases, 899 controls. Using marginal tests of associations, we identify 10 variants distributed among six targeted regions that are significantly associated with colorectal cancer, with eight of the variants being novel to this study. Additionally, we perform so-called 'SNP-set' tests of association and identify two sets of variants that implicate both common and rare variants in the etiology of colorectal cancer. Here we present a large-scale targeted re-sequencing resource focusing on genomic regions implicated in colorectal cancer susceptibility previously identified in several GWAS, which aims to 1) provide fine-scale targeted sequencing data for fine-mapping and 2) provide data resources to address methodological questions regarding the design of sequencing-based follow-up studies to GWAS. Additionally, we show that this strategy successfully identifies novel variants associated with colorectal cancer susceptibility and can implicate both common and rare variants.
The case of GWAS of obesity: does body weight control play by the rules?
Müller, Manfred J; Geisler, Corinna; Blundell, John; Dulloo, Abdul; Schutz, Yves; Krawczak, Michael; Bosy-Westphal, Anja; Enderle, Janna; Heymsfield, Steven B
2018-05-24
As yet, genome-wide association studies (GWAS) have not added much to our understanding of the mechanisms of body weight control and of the etiology of obesity. This shortcoming is widely attributed to the complexity of the issues. The appeal of this explanation notwithstanding, we surmise that (i) an oversimplification of the phenotype (namely by the use of crude anthropometric traits) and (ii) a lack of sound concepts of body weight control and, thus, a lack of a clear research focus have impeded better insights most. The idea of searching for polygenetic mechanisms underlying common forms of obesity was born out of the impressive findings made for monogenetic forms of extreme obesity. In the case of common obesity, however, observational studies on normal weight and overweight subjects never provided any strong evidence for a tight internal control of body weight. In addition, empirical studies of weight changes in normal weight and overweight subjects revealed an intra-individual variance that was similar to inter-individual variance, suggesting the absence of tight control of body weight. Not least, this lack of coerciveness is reflected by the present obesity epidemic. Finally, data on detailed body composition highlight that body weight is too heterogeneous a phenotype to be controlled as a single entity. In summary, GWAS of obesity using crude anthropometric traits have likely been misled by popular heritability estimates that may have been inflated in the first place. To facilitate more robust and useful insights into the mechanisms of internal control of human body weight and, consequently, the genetic basis of obesity, we argue in favor of a broad discussion between scientists from the areas of integrative physiology and of genomics. This discussion should aim at better conceived studies employing biologically more meaningful phenotypes based on in-depth body composition analysis. To advance, the scientific community, including the editors of our top journals, needs a re-launch of future GWAS of obesity.
Vaithilingam, R D; Safii, S H; Baharuddin, N A; Ng, C C; Cheong, S C; Bartold, P M; Schaefer, A S; Loos, B G
2014-12-01
Studies to elucidate the role of genetics as a risk factor for periodontal disease have gone through various phases. In the majority of cases, the initial 'hypothesis-dependent' candidate-gene polymorphism studies did not report valid genetic risk loci. Following a large-scale replication study, these initially positive results are believed to be caused by type 1 errors. However, susceptibility genes, such as CDKN2BAS (Cyclin Dependent KiNase 2B AntiSense RNA; alias ANRIL [ANtisense Rna In the Ink locus]), glycosyltransferase 6 domain containing 1 (GLT6D1) and cyclooxygenase 2 (COX2), have been reported as conclusive risk loci of periodontitis. The search for genetic risk factors accelerated with the advent of 'hypothesis-free' genome-wide association studies (GWAS). However, despite many different GWAS being performed for almost all human diseases, only three GWAS on periodontitis have been published: one reported genome-wide association of GLT6D1 with aggressive periodontitis (a severe phenotype of periodontitis), whereas the remaining two, which were performed on patients with chronic periodontitis, were not able to find significant associations. This review discusses the problems faced and the lessons learned from the search for genetic risk variants of periodontitis. Current and future strategies for identifying genetic variance in periodontitis, and the importance of planning a well-designed genetic study with large and sufficiently powered case-control samples of severe phenotypes, are also discussed. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Khramtsova, Ekaterina A; Stranger, Barbara E
2017-02-01
Over the last decade, genome-wide association studies (GWAS) have generated vast amounts of analysis results, requiring development of novel tools for data visualization. Quantile–quantile (QQ) plots and Manhattan plots are classical tools which have been utilized to visually summarize GWAS results and identify genetic variants significantly associated with traits of interest. However, static visualizations are limited in the information that can be shown. Here, we present Assocplots, a Python package for viewing and exploring GWAS results not only using classic static Manhattan and QQ plots, but also through a dynamic extension which allows interactive visualization of the relationships between GWAS results from multiple cohorts or studies. The Assocplots package is open source and distributed under the MIT license via GitHub (https://github.com/khramts/assocplots) along with examples, documentation and installation instructions. Contact: ekhramts@medicine.bsd.uchicago.edu or bstranger@medicine.bsd.uchicago.edu
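For readers unfamiliar with QQ plots, the coordinates they display can be computed in a few lines. This is a generic sketch that does not use the Assocplots API; the function name `qq_points` and the p-values are hypothetical. It pairs each observed -log10(p) with the value expected for its rank under the null hypothesis of uniformly distributed p-values.

```python
import math


def qq_points(pvalues):
    """Return (expected, observed) -log10(p) pairs for a QQ plot."""
    n = len(pvalues)
    observed = sorted(-math.log10(p) for p in pvalues)
    # Under the null, the i-th smallest of n p-values is about (i - 0.5) / n.
    expected = sorted(-math.log10((i - 0.5) / n) for i in range(1, n + 1))
    return list(zip(expected, observed))


# Hypothetical GWAS p-values; real studies have millions of them.
points = qq_points([0.5, 0.0001, 0.03, 0.2, 0.9, 0.6])
```

Points that rise above the diagonal (observed greater than expected) at the upper end suggest true associations, whereas a departure along the whole range usually indicates inflation, for example from population stratification.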
Genetic Predictors for Cardiovascular Disease in Hispanics
Qi, Lu; Campos, Hannia
2012-01-01
A less favorable cardiovascular risk factor profile, but paradoxically lower cardiovascular morbidity and mortality, have been observed in Hispanics, a pattern often referred to as the Hispanic Paradox. It has been proposed that the specific genetic susceptibility of this admixed population and gene-environment interactions may partly explain the paradox. The past few years have seen great advances in discovering genetic risk factors using genome-wide association studies (GWAS) for cardiovascular disease, especially in Caucasians. However, no GWAS of cardiovascular disease has been reported in Hispanics. In the Costa Rican Heart Study we reported both the consistency and disparity of genetic effects on risk of coronary heart disease (CHD) between Hispanics and other ethnic groups. We demonstrated that the improvement in discrimination of CHD in Hispanics provided by the identified genetic markers was modest. Future genetic research in Hispanics should consider the diversity in genetic structure, lifestyle and socioeconomics among various sub-populations, and comprehensively evaluate potential gene-environment interactions in relation to cardiovascular risk. PMID:22498015
Metabolomic and Genome-wide Association Studies Reveal Potential Endogenous Biomarkers for OATP1B1.
Yee, S W; Giacomini, M M; Hsueh, C-H; Weitz, D; Liang, X; Goswami, S; Kinchen, J M; Coelho, A; Zur, A A; Mertsch, K; Brian, W; Kroetz, D L; Giacomini, K M
2016-11-01
Transporter-mediated drug-drug interactions (DDIs) are a major cause of drug toxicities. Using published genome-wide association studies (GWAS) of the human metabolome, we identified 20 metabolites associated with genetic variants in organic anion transporter, OATP1B1 (P < 5 × 10⁻⁸). Of these, 12 metabolites were significantly higher in plasma samples from volunteers dosed with the OATP1B1 inhibitor, cyclosporine (CSA) vs. placebo (q-value < 0.2). Conjugated bile acids and fatty acid dicarboxylates were among the metabolites discovered using both GWAS and CSA administration. In vitro studies confirmed tetradecanedioate (TDA) and hexadecanedioate (HDA) were novel substrates of OATP1B1 as well as OAT1 and OAT3. This study highlights the use of multiple datasets for the discovery of endogenous metabolites that represent potential in vivo biomarkers for transporter-mediated DDIs. Future studies are needed to determine whether these metabolites can serve as qualified biomarkers for organic anion transporters. Quantitative relationships between metabolite levels and modulation of transporters should be established. © 2016 American Society for Clinical Pharmacology and Therapeutics.
Kwon, Ji-Sun; Kim, Jihye; Nam, Dougu; Kim, Sangsoo
2012-06-01
Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits for the unimputed and imputed datasets, respectively. Similar trends, although to a lesser degree, were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which are sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended.
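The sensitivity of a Z-statistic to the background level of association is easy to see from its definition. The sketch below is a generic illustration and not the GSA-SNP code: it assumes each gene has a score such as -log10 of its best SNP p-value, and compares the mean score of a gene set with the mean and standard deviation over all genes. Gene names and scores are hypothetical.

```python
import math
from statistics import mean, pstdev


def gene_set_z(gene_scores, gene_set):
    """Z-statistic of a gene set's mean score against the genome background."""
    members = [gene_scores[g] for g in gene_set if g in gene_scores]
    background = list(gene_scores.values())
    mu, sigma = mean(background), pstdev(background)
    # Standard error of the set mean, assuming genes are independent.
    return (mean(members) - mu) / (sigma / math.sqrt(len(members)))


# Hypothetical gene-level scores, e.g. -log10 of each gene's best p-value.
scores = {"g1": 4.0, "g2": 3.0, "g3": 1.0, "g4": 0.0, "g5": 1.0, "g6": 3.0}
z = gene_set_z(scores, {"g1", "g2"})
```

Because `mu` and `sigma` are estimated from all genes, anything that shifts the overall score distribution, such as adding imputed SNPs, changes the Z-statistic of every gene set even when the set's own scores are unchanged.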
Statistical methods to detect novel genetic variants using publicly available GWAS summary data.
Guo, Bin; Wu, Baolin
2018-03-01
We propose statistical methods to detect novel genetic variants using only genome-wide association studies (GWAS) summary data without access to raw genotype and phenotype data. With more and more summary data being posted for public access in the post-GWAS era, the proposed methods are of practical use for identifying additional interesting genetic variants and shedding light on the underlying disease mechanism. We illustrate the utility of our proposed methods with application to GWAS meta-analysis results of fasting glucose from the international MAGIC consortium. We found several novel genome-wide significant loci that are worth further study. Copyright © 2018 Elsevier Ltd. All rights reserved.
Qin, Pengfei; Li, Zhiqiang; Jin, Wenfei; Lu, Dongsheng; Lou, Haiyi; Shen, Jiawei; Jin, Li; Shi, Yongyong; Xu, Shuhua
2014-02-01
Population stratification acts as a confounding factor in genetic association studies and may lead to false-positive or false-negative results. Previous studies have analyzed the genetic substructures in Han Chinese population, the largest ethnic group in the world comprising ∼20% of the global human population. In this study, we examined 5540 Han Chinese individuals with about 1 million single-nucleotide polymorphisms (SNPs) and screened a panel of ancestry informative markers (AIMs) to facilitate the discerning and controlling of population structure in future association studies on Han Chinese. Based on genome-wide data, we first confirmed our previous observation of the north-south differentiation in Han Chinese population. Second, we developed a panel of 150 validated SNP AIMs to determine the northern or southern origin of each Han Chinese individual. We further evaluated the performance of our AIMs panel in association studies in simulation analysis. Our results showed that this AIMs panel had sufficient power to discern and control population stratification in Han Chinese, which could significantly reduce false-positive rates in both genome-wide association studies (GWAS) and candidate gene association studies (CGAS). We suggest this AIMs panel be genotyped and used to control and correct population stratification in the study design or data analysis of future association studies, especially in CGAS which is the most popular approach to validate previous reports on genetic associations of diseases in post-GWAS era.
Wei, Wen-Hua; Massey, Jonathan; Worthington, Jane; Barton, Anne; Warren, Richard B
2018-03-01
Genome-wide association studies (GWASs) have identified a number of loci for psoriasis but largely ignored non-additive effects. We report a genotypic variability-based GWAS (vGWAS) that can prioritize non-additive loci without requiring prior knowledge of interaction types or interacting factors. It proceeds in two steps: a mixed model partitions dichotomous phenotypes into an additive component and non-additive environmental residuals on the liability scale, and Levene's (Brown-Forsythe) test then assesses equality of the residual variances across genotype groups genome-wide. The vGWAS identified two genome-wide significant (P < 5.0e-08) non-additive loci, HLA-C and IL12B, that were also genome-wide significant in an accompanying GWAS in the discovery cohort. Both loci were statistically replicated in vGWAS of an independent cohort with a small sample size. HLA-C and IL12B have been reported in moderate gene-gene and/or gene-environment interactions on several occasions. We found a moderate interaction with age-of-onset of psoriasis, which was replicated indirectly. The vGWAS also revealed five suggestive loci (P < 6.76e-05), including FUT2, which was associated with psoriasis with environmental aspects triggered by virus infection and/or metabolic factors. Replication and functional investigation are needed to validate the suggestive vGWAS loci.
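The second step, testing whether residual variance differs across genotype groups, can be illustrated with a small implementation of the Brown-Forsythe statistic. This is a sketch under simple assumptions, not the authors' pipeline: residuals are already available and grouped by genotype (0, 1 or 2 copies of an allele), and the example values are hypothetical.

```python
from statistics import mean, median


def brown_forsythe(groups):
    """Brown-Forsythe W statistic for equality of variances across groups."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    # Absolute deviations of each value from its own group's median.
    devs = [[abs(x - median(g)) for x in g] for g in groups]
    group_means = [mean(d) for d in devs]
    grand_mean = sum(sum(d) for d in devs) / n_total
    between = sum(len(d) * (m - grand_mean) ** 2
                  for d, m in zip(devs, group_means))
    within = sum((x - m) ** 2
                 for d, m in zip(devs, group_means) for x in d)
    # Under the null hypothesis W follows an F(k - 1, n_total - k) distribution.
    return (n_total - k) / (k - 1) * between / within


# Hypothetical residuals for the three genotype groups at one SNP.
residuals_by_genotype = [[-1.0, 0.0, 1.0], [-2.0, 0.0, 2.0], [-3.0, 0.0, 3.0]]
w = brown_forsythe(residuals_by_genotype)
```

A large W at a SNP indicates that phenotype variability depends on genotype, which is the signature used to prioritize loci that may be involved in gene-gene or gene-environment interactions.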
Xu, Zheng; Zhang, Guosheng; Duan, Qing; Chai, Shengjie; Zhang, Baqun; Wu, Cong; Jin, Fulai; Yue, Feng; Li, Yun; Hu, Ming
2016-03-11
Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with complex traits and diseases. However, most of them are located in the non-protein coding regions, and therefore it is challenging to hypothesize the functions of these non-coding GWAS variants. Recent large efforts such as the ENCODE and Roadmap Epigenomics projects have predicted a large number of regulatory elements. However, the target genes of these regulatory elements remain largely unknown. Chromatin conformation capture based technologies such as Hi-C can directly measure the chromatin interactions and have generated an increasingly comprehensive catalog of the interactome between the distal regulatory elements and their potential target genes. Leveraging such information revealed by Hi-C holds the promise of elucidating the functions of genetic variants in human diseases. In this work, we present HiView, the first integrative genome browser to leverage Hi-C results for the interpretation of GWAS variants. HiView is able to display Hi-C data and statistical evidence for chromatin interactions in genomic regions surrounding any given GWAS variant, enabling straightforward visualization and interpretation. We believe that as the first GWAS variants-centered Hi-C genome browser, HiView is a useful tool guiding post-GWAS functional genomics studies. HiView is freely accessible at: http://www.unc.edu/~yunmli/HiView .
An, Ping; Miljkovic, Iva; Thyagarajan, Bharat; Kraja, Aldi T; Daw, E Warwick; Pankow, James S; Selvin, Elizabeth; Kao, W H Linda; Maruthur, Nisa M; Nalls, Micahel A; Liu, Yongmei; Harris, Tamara B; Lee, Joseph H; Borecki, Ingrid B; Christensen, Kaare; Eckfeldt, John H; Mayeux, Richard; Perls, Thomas T; Newman, Anne B; Province, Michael A
2014-04-01
Glycated hemoglobin (HbA1c) is a stable index of chronic glycemic status and hyperglycemia associated with progressive development of insulin resistance and frank diabetes. It is also associated with premature aging and increased mortality. To uncover novel loci for HbA1c that are associated with healthy aging, we conducted a genome-wide association study (GWAS) using non-diabetic participants in the Long Life Family Study (LLFS), a study with familial clustering of exceptional longevity in the US and Denmark. A total of 4088 non-diabetic subjects from the LLFS were used for GWAS discoveries, and a total of 8231 non-diabetic subjects from the Atherosclerosis Risk in Communities Study (ARIC, in the MAGIC Consortium) and the Health, Aging, and Body Composition Study (HABC) were used for GWAS replications. HbA1c was adjusted for age, sex, centers, 20 principal components, without and with BMI. A linear mixed effects model was used for association testing. Two known loci at GCK rs730497 (or rs2908282) and HK1 rs17476364 were confirmed (p<5e-8). Of 25 suggestive (5e-8
Carlson, Christopher S; Matise, Tara C; North, Kari E; Haiman, Christopher A; Fesinmeyer, Megan D; Buyske, Steven; Schumacher, Fredrick R; Peters, Ulrike; Franceschini, Nora; Ritchie, Marylyn D; Duggan, David J; Spencer, Kylee L; Dumitrescu, Logan; Eaton, Charles B; Thomas, Fridtjof; Young, Alicia; Carty, Cara; Heiss, Gerardo; Le Marchand, Loic; Crawford, Dana C; Hindorff, Lucia A; Kooperberg, Charles L
2013-09-01
The vast majority of genome-wide association study (GWAS) findings reported to date are from populations with European Ancestry (EA), and it is not yet clear how broadly the genetic associations described will generalize to populations of diverse ancestry. The Population Architecture Using Genomics and Epidemiology (PAGE) study is a consortium of multi-ancestry, population-based studies formed with the objective of refining our understanding of the genetic architecture of common traits emerging from GWAS. In the present analysis of five common diseases and traits, including body mass index, type 2 diabetes, and lipid levels, we compare direction and magnitude of effects for GWAS-identified variants in multiple non-EA populations against EA findings. We demonstrate that, in all populations analyzed, a significant majority of GWAS-identified variants have allelic associations in the same direction as in EA, with none showing a statistically significant effect in the opposite direction, after adjustment for multiple testing. However, 25% of tagSNPs identified in EA GWAS have significantly different effect sizes in at least one non-EA population, and these differential effects were most frequent in African Americans where all differential effects were diluted toward the null. We demonstrate that differential LD between tagSNPs and functional variants within populations contributes significantly to dilute effect sizes in this population. Although most variants identified from GWAS in EA populations generalize to all non-EA populations assessed, genetic models derived from GWAS findings in EA may generate spurious results in non-EA populations due to differential effect sizes. Regardless of the origin of the differential effects, caution should be exercised in applying any genetic risk prediction model based on tagSNPs outside of the ancestry group in which it was derived. Models based directly on functional variation may generalize more robustly, but the identification of functional variants remains challenging. PMID:24068893
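One simple way to ask whether "a significant majority" of variants share the direction of effect seen in another population is a one-sided binomial sign test. The abstract does not state which test was used, so the sketch below is an assumption for illustration only, and the counts are hypothetical.

```python
from math import comb


def sign_test_pvalue(n_concordant, n_total):
    """One-sided P(X >= n_concordant) for X ~ Binomial(n_total, 0.5)."""
    tail = sum(comb(n_total, k) for k in range(n_concordant, n_total + 1))
    return tail / 2 ** n_total


# Hypothetical example: 8 of 10 variants have the same effect direction.
p = sign_test_pvalue(8, 10)
```

Under the null hypothesis that direction is random, each variant is concordant with probability 0.5, so a small p-value supports generalization of effect direction without saying anything about effect size.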
[Genetic factors in myocardial infarction].
Hara, Masahiko; Sakata, Yasuhiko; Sato, Hiroshi
2013-02-01
One of the main mechanisms of acute myocardial infarction (AMI) is plaque rupture or erosion followed by intraluminal thrombus formation and occlusion of the coronary arteries. Thus far, many underlying conditions or environmental factors, such as hypertension, diabetes, dyslipidemia, smoking or obesity, as well as a family history of coronary artery diseases, have been identified as risks for the onset of AMI. These risks suggest that AMI occurs due to interactions between underlying conditions and multiple genetic susceptibilities. For this reason, many target gene-disease association studies have been performed, and the recent introduction of genome-wide association studies (GWAS) has revealed further genetic susceptibilities for AMI. A GWAS examines many common genetic variants across individuals, typically in a case-control design, to see whether any variant is associated with a trait; it usually focuses on associations between single-nucleotide polymorphisms (SNPs) and traits. A SNP on chromosome 9p21 is one of the most robust susceptibility variants for AMI and has been identified by many GWAS. In this review, we give an overview of the methodology of GWAS, introduce genetic variants identified by GWAS as conferring susceptibility to AMI, and describe the outlook for using GWAS to investigate genetic susceptibility to AMI.
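The case-control comparison at a single SNP can be made concrete with an allelic chi-square test, one common choice for this kind of analysis. The sketch below is a generic illustration with hypothetical allele counts; the review itself does not prescribe a specific test.

```python
import math


def allelic_chi_square(case_alt, case_ref, ctrl_alt, ctrl_ref):
    """Pearson chi-square (1 degree of freedom) for a 2x2 allele count table."""
    n = case_alt + case_ref + ctrl_alt + ctrl_ref
    numerator = n * (case_alt * ctrl_ref - case_ref * ctrl_alt) ** 2
    denominator = ((case_alt + case_ref) * (ctrl_alt + ctrl_ref)
                   * (case_alt + ctrl_alt) * (case_ref + ctrl_ref))
    return numerator / denominator


def chi_square_pvalue_1df(statistic):
    """Upper-tail p-value of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(statistic / 2.0))


# Hypothetical allele counts at one SNP: cases 60 alt / 40 ref,
# controls 40 alt / 60 ref.
chi2 = allelic_chi_square(60, 40, 40, 60)
p_value = chi_square_pvalue_1df(chi2)
```

In a GWAS this test is repeated for every SNP, which is why the significance threshold is set far below 0.05 (commonly 5 × 10⁻⁸) to account for the number of tests.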
BioSMACK: a linux live CD for genome-wide association analyses.
Hong, Chang Bum; Kim, Young Jin; Moon, Sanghoon; Shin, Young-Ah; Go, Min Jin; Kim, Dong-Joon; Lee, Jong-Young; Cho, Yoon Shin
2012-01-01
Recent advances in high-throughput genotyping technologies have enabled us to conduct a genome-wide association study (GWAS) on a large cohort. However, analyzing millions of single nucleotide polymorphisms (SNPs) is still a difficult task for researchers conducting a GWAS. Researchers often encounter difficulties such as compatibility and dependency problems when installing analytical tools. This is a huge obstacle to any research institute without computing facilities and specialists. Therefore, a proper research environment is urgently needed by researchers working on GWAS. We developed BioSMACK to provide a research environment for GWAS that requires no configuration and is easy to use. BioSMACK is based on the Ubuntu Live CD that offers a complete Linux-based operating system environment without installation. Moreover, we provide users with a GWAS manual consisting of a series of guidelines for GWAS and useful examples. BioSMACK is freely available at http://ksnp.cdc.go.kr/biosmack.
Hall, F Scott; Drgonova, Jana; Jain, Siddharth; Uhl, George R
2013-12-01
Substantial genetic contributions to addiction vulnerability are supported by data from twin studies, linkage studies, candidate gene association studies and, more recently, Genome Wide Association Studies (GWAS). Parallel to this work, animal studies have attempted to identify the genes that may contribute to responses to addictive drugs and addiction liability, initially focusing upon genes for the targets of the major drugs of abuse. These studies identified genes/proteins that affect responses to drugs of abuse; however, this does not necessarily mean that variation in these genes contributes to the genetic component of addiction liability. One of the major problems with initial linkage and candidate gene studies was an a priori focus on the genes thought to be involved in addiction based upon the known contributions of those proteins to drug actions, making the identification of novel genes unlikely. The GWAS approach is systematic and agnostic to such a priori assumptions. From the numerous GWAS now completed several conclusions may be drawn: (1) addiction is highly polygenic; each allelic variant contributing in a small, additive fashion to addiction vulnerability; (2) unexpected, compared to our a priori assumptions, classes of genes are most important in explaining addiction vulnerability; (3) although substantial genetic heterogeneity exists, there is substantial convergence of GWAS signals on particular genes. This review traces the history of this research; from initial transgenic mouse models based upon candidate gene and linkage studies, through the progression of GWAS for addiction and nicotine cessation, to the current human and transgenic mouse studies post-GWAS. © 2013.
Ischemic Stroke: From Next Generation Sequencing and GWAS to Community Genomics?
Black, Michael; Wang, Wenzhi; Wang, Wei
2015-08-01
Stroke is a major cause of mortality and morbidity in both the developed and developing world. Next generation sequencing (NGS) and multi-omics integrative biology research offer new opportunities in the way we research and understand stroke. These biotechnologies also signal a shift from genetics to genomics of stroke, which is highlighted in this review. Stroke is a focal neurological deficit resulting from disruption of the cerebral blood supply. There are two main types of common stroke, ischemic stroke (IS), which comprises 80% of cases, and hemorrhagic stroke (HS) that accounts for about 20% of cases. IS is a complex multi-factorial disease with multiple environmental and genomic determinants. We discuss here IS from genomics and bioinformatics perspectives, including the highlights of the genome wide association studies (GWAS), NGS progress to date, and exome studies. While both 'common variant, common disease' and 'rare variant, common disease' approaches need to be assessed in tandem, future studies into IS omics should also consider pedigree and/or community based sampling to take account of the complex diversity of IS genetics. We conclude by presenting an example of such community genomics research from China in an extended pedigree sample, and the ways in which the intersection of genomics and global society can usefully inform our understanding of IS pathophysiology and potential preventive medicine interventions in the future.
The challenges, advantages and future of phenome-wide association studies.
Hebbring, Scott J
2014-02-01
Over the last decade, significant technological breakthroughs have revolutionized human genomic research in the form of genome-wide association studies (GWASs). GWASs have identified thousands of statistically significant genetic variants associated with hundreds of human conditions including many with immunological aetiologies (e.g. multiple sclerosis, ankylosing spondylitis and rheumatoid arthritis). Unfortunately, most GWASs fail to identify clinically significant associations. Identifying biologically significant variants by GWAS also presents a challenge. The GWAS is a phenotype-to-genotype approach. As a complementary/alternative approach to the GWAS, investigators have begun to exploit extensive electronic medical record systems to conduct a genotype-to-phenotype approach when studying human disease - specifically, the phenome-wide association study (PheWAS). Although the PheWAS approach is in its infancy, this method has already demonstrated its capacity to rediscover important genetic associations related to immunological diseases/conditions. Furthermore, PheWAS has the advantage of identifying genetic variants with pleiotropic properties. This is particularly relevant for HLA variants. For example, PheWAS results have demonstrated that the HLA-DRB1 variant associated with multiple sclerosis may also be associated with erythematous conditions including rosacea. Likewise, PheWAS has demonstrated that the HLA-B genotype is not only associated with spondylopathies, uveitis, and variability in platelet count, but may also play an important role in other conditions, such as mastoiditis. This review will discuss and compare general PheWAS methodologies, describe both the challenges and advantages of the PheWAS, and provide insight into the potential directions in which PheWAS may lead. © 2013 The Authors. Immunology Published by John Wiley & Sons Ltd.
Common single nucleotide variants underlying drug addiction: more than a decade of research.
Bühler, Kora-Mareen; Giné, Elena; Echeverry-Alzate, Victor; Calleja-Conde, Javier; de Fonseca, Fernando Rodriguez; López-Moreno, Jose Antonio
2015-09-01
Drug-related phenotypes are common, complex and highly heritable traits. In the last few years, candidate gene (CGAS) and genome-wide association studies (GWAS) have identified a huge number of single nucleotide polymorphisms (SNPs) associated with drug use, abuse or dependence, mainly related to alcohol or nicotine. Nevertheless, few of these associations have been replicated in independent studies. The aim of this study was to provide a review of the SNPs that have been most significantly associated with alcohol-, nicotine-, cannabis- and cocaine-related phenotypes in humans between the years of 2000 and 2012. To this end, we selected CGAS, GWAS, family-based association and case-only studies published in peer-reviewed international scientific journals (using the PubMed/MEDLINE and Addiction GWAS Resource databases) in which a significant association was reported. A total of 371 studies fit the search criteria. We then filtered SNPs with at least one replication study and performed meta-analysis of the significance of the associations. SNPs in the alcohol metabolizing genes, in the cholinergic gene cluster CHRNA5-CHRNA3-CHRNB4, and in the DRD2 and ANKK1 genes, are, to date, the most replicated and significant gene variants associated with alcohol- and nicotine-related phenotypes. In the case of cannabis and cocaine, far fewer studies and replications have been reported, indicating either a need for further investigation or that the genetics of cannabis/cocaine addiction are more elusive. This review offers a global, state-of-the-art view of the behavioral genetics of addiction and contributes to the formulation of new hypotheses to guide future work. © 2015 Society for the Study of Addiction.
Genotype-based gene signature of glioma risk.
Huang, Yen-Tsung; Zhang, Yi; Wu, Zhijin; Michaud, Dominique S
2017-07-01
Glioma accounts for 80% of malignant brain tumors, but its etiologic determinants remain elusive. Despite genetic susceptibility loci identified by genome-wide association study (GWAS), the agnostic approach leaves open the possibility that other susceptibility genes remain to be discovered. Here we conduct a gene-centric integrative GWAS (iGWAS) of glioma risk that combines transcriptomics and genetics. We synthesized a brain transcriptomics dataset (n = 354), a GWAS dataset (n = 4203), and an advanced glioma tumor transcriptomic dataset (n = 483) to conduct an iGWAS. Using the expression quantitative trait loci (eQTL) dataset, we built models to predict gene expression for the GWAS data, based on eQTL genotypes. With the predicted gene expression, iGWAS analyses were performed using a novel statistical method. Gene signature risk score was constructed using a penalized logistic regression model. A total of 30527 transcripts were analyzed using the iGWAS approach. Four novel glioma susceptibility genes were identified with internal and external validation, including DRD5 (P = 3.0 × 10^-79), WDR1 (P = 8.4 × 10^-77), NOMO1 (P = 1.3 × 10^-25), and PDXDC1 (P = 8.3 × 10^-24). The genotype-predicted transcription pattern between cases and controls is consistent with that between tumor and its matched normal tissue. The genotype-based 4-gene signature improved the classification between glioma cases and controls based on age, gender, and population stratification, with area under the receiver operating characteristic curve increasing from 0.77 to 0.85 (P = 8.1 × 10^-23). A new genotype-based gene signature of glioma was identified using a novel iGWAS approach, which integrates multiplatform genomic data as well as different genetic association studies. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Neuro-Oncology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
Kim, Jihye; Yoo, Minjae; Shin, Jimin; Kim, Hyunmin; Kang, Jaewoo; Tan, Aik Choon
2018-01-01
Traditional Chinese medicine (TCM), which originated in ancient China, has been practiced for thousands of years to treat various symptoms and diseases. However, the molecular mechanisms of TCM in treating these diseases remain unknown. In this study, we employ a systems pharmacology-based approach for connecting GWAS diseases with TCM for potential drug repurposing and repositioning. We studied 102 TCM components and their target genes by analyzing microarray gene expression experiments. We constructed disease-gene networks from 2558 GWAS studies. We applied a systems pharmacology approach to prioritize disease-target genes. Using this bioinformatics approach, we analyzed 14,713 GWAS disease-TCM-target gene pairs and identified 115 disease-gene pairs with q value < 0.2. We validated several of these GWAS disease-TCM-target gene pairs with literature evidence, demonstrating that this computational approach could reveal novel indications for TCM. We also developed the TCM-Disease web application to facilitate traditional Chinese medicine drug repurposing efforts. Systems pharmacology is a promising approach for connecting GWAS diseases with TCM for potential drug repurposing and repositioning. The computational approaches described in this study could easily be extended to other disease-gene network analyses. PMID:29765977
Leveraging lung tissue transcriptome to uncover candidate causal genes in COPD genetic associations.
Lamontagne, Maxime; Bérubé, Jean-Christophe; Obeidat, Ma'en; Cho, Michael H; Hobbs, Brian D; Sakornsakolpat, Phuwanat; de Jong, Kim; Boezen, H Marike; Nickle, David; Hao, Ke; Timens, Wim; van den Berge, Maarten; Joubert, Philippe; Laviolette, Michel; Sin, Don D; Paré, Peter D; Bossé, Yohan
2018-05-15
Causal genes of chronic obstructive pulmonary disease (COPD) remain elusive. The current study aims at integrating genome-wide association studies (GWAS) and lung expression quantitative trait loci (eQTL) data to map COPD candidate causal genes and gain biological insights into the recently discovered COPD susceptibility loci. Two complementary genomic datasets on COPD were studied: first, the lung eQTL dataset, which included whole-genome gene expression and genotyping data from 1038 individuals; second, the largest COPD GWAS to date, from the International COPD Genetics Consortium (ICGC), with 13 710 cases and 38 062 controls. Methods that integrated GWAS with eQTL signals, including transcriptome-wide association study (TWAS), colocalization and Mendelian randomization-based (SMR) approaches, were used to map causal genes, i.e. genes with the strongest evidence of being the functional effector at specific loci. These methods were applied at the genome-wide level and at COPD risk loci derived from the GWAS literature. Replication was performed using lung data from GTEx. We collated 129 non-overlapping risk loci for COPD from the GWAS literature. At the genome-wide scale, 12 new COPD candidate genes/loci were revealed and six replicated in GTEx, including CAMK2A, DMPK, MYO15A, TNFRSF10A, BTN3A2 and TRBV30. In addition, we mapped candidate causal genes for 60 out of the 129 GWAS-nominated loci, and 23 of them were replicated in GTEx. Mapping candidate causal genes in lung tissue represents an important contribution to the genetics of COPD, enriches our biological interpretation of GWAS findings, and brings us closer to clinical translation of genetic associations.
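As a rough illustration of the Mendelian randomization-based (SMR) idea named in this abstract, the sketch below computes a Wald-ratio effect estimate and an approximate SMR test statistic for a single SNP from summary-level data. The function name, argument names and the example numbers are hypothetical and are not taken from the study. The statistic uses the commonly published form T = z_gwas^2 * z_eqtl^2 / (z_gwas^2 + z_eqtl^2), compared with a chi-square distribution with one degree of freedom, and may differ from what the authors actually ran.

```python
import math


def smr_test(beta_gwas, se_gwas, beta_eqtl, se_eqtl):
    """Approximate SMR test for one SNP (illustrative sketch only).

    beta_gwas / se_gwas: SNP effect on the trait and its standard error.
    beta_eqtl / se_eqtl: SNP effect on gene expression and its standard error.
    Returns (beta_xy, t_smr, p_value).
    """
    z_gwas = beta_gwas / se_gwas
    z_eqtl = beta_eqtl / se_eqtl

    # Wald ratio: estimated effect of expression on the trait.
    beta_xy = beta_gwas / beta_eqtl

    # Approximate chi-square (1 df) statistic combining both z-scores.
    t_smr = (z_gwas ** 2 * z_eqtl ** 2) / (z_gwas ** 2 + z_eqtl ** 2)

    # Upper tail of a 1-df chi-square: P(X > t) = erfc(sqrt(t / 2)).
    p_value = math.erfc(math.sqrt(t_smr / 2.0))
    return beta_xy, t_smr, p_value


if __name__ == "__main__":
    # Hypothetical summary statistics for a single SNP.
    print(smr_test(beta_gwas=0.08, se_gwas=0.02, beta_eqtl=0.5, se_eqtl=0.05))
```

Because the combined statistic is limited by the weaker of the two z-scores, a strong eQTL alone cannot produce a significant result without supporting GWAS evidence at the same SNP.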
Design and analysis of multiple diseases genome-wide association studies without controls.
Chen, Zhongxue; Huang, Hanwen; Ng, Hon Keung Tony
2012-11-15
In genome-wide association studies (GWAS), studying multiple diseases with shared controls is one of the case-control study designs. If data obtained from these studies are appropriately analyzed, this design can have several advantages, such as improving statistical power in detecting associations and reducing the time and cost of the data collection process. In this paper, we propose a study design for GWAS which involves multiple diseases but no controls. We also propose a corresponding statistical data analysis strategy for GWAS with multiple diseases but no controls. Through a simulation study, we show that the statistical association test with the proposed study design is more powerful than the test with a single disease sharing common controls, and it has comparable power to the overall test based on the whole dataset including the controls. We also apply the proposed method to a real GWAS dataset to illustrate the methodologies and the advantages of the proposed design. Some possible limitations of this study design and testing method and their solutions are also discussed. Our findings indicate that the proposed study design and statistical analysis strategy could be more efficient than the usual case-control GWAS as well as those with shared controls. Copyright © 2012 Elsevier B.V. All rights reserved.
Gurung, R; Prata, D P
2015-01-01
The powerful genome-wide association studies (GWAS) revealed common mutations that increase susceptibility for schizophrenia (SZ) and bipolar disorder (BD), but the vast majority were not known to be functional or associated with these illnesses. To help fill this gap, their impact on human brain structure and function has been examined. We systematically discuss this output to facilitate its timely integration in the psychosis research field; and encourage reflection for future research. Irrespective of imaging modality, studies addressing the effect of SZ/BD GWAS risk genes (ANK3, CACNA1C, MHC, TCF4, NRGN, DGKH, PBRM1, NCAN and ZNF804A) were included. Most GWAS risk variations were reported to affect neuroimaging phenotypes implicated in SZ/BD: white-matter integrity (ANK3 and ZNF804A), volume (CACNA1C and ZNF804A) and density (ZNF804A); grey-matter (CACNA1C, NRGN, TCF4 and ZNF804A) and ventricular (TCF4) volume; cortical folding (NCAN) and thickness (ZNF804A); regional activation during executive tasks (ANK3, CACNA1C, DGKH, NRGN and ZNF804A) and functional connectivity during executive tasks (CACNA1C and ZNF804A), facial affect recognition (CACNA1C and ZNF804A) and theory-of-mind (ZNF804A); but inconsistencies and non-replications also exist. Further efforts such as standardizing reporting and exploring complementary designs, are warranted to test the reproducibility of these early findings.
Ma, Li; Runesha, H Birali; Dvorkin, Daniel; Garbe, John R; Da, Yang
2008-01-01
Background: Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers provide opportunities to detect epistatic SNPs associated with quantitative traits and to detect the exact mode of an epistasis effect. Computational difficulty is the main bottleneck for epistasis testing in large scale GWAS. Results: The EPISNPmpi and EPISNP computer programs were developed for testing single-locus and epistatic SNP effects on quantitative traits in GWAS, including tests of three single-locus effects for each SNP (SNP genotypic effect, additive and dominance effects) and five epistasis effects for each pair of SNPs (two-locus interaction, additive × additive, additive × dominance, dominance × additive, and dominance × dominance) based on the extended Kempthorne model. EPISNPmpi is the parallel computing program for epistasis testing in large scale GWAS and achieved excellent scalability for large scale analysis and portability for various parallel computing platforms. EPISNP is the serial computing program based on the EPISNPmpi code for epistasis testing in small scale GWAS using commonly available operating systems and computer hardware. Three serial computing utility programs were developed for graphical viewing of test results and epistasis networks, and for estimating CPU time and disk space requirements. Conclusion: The EPISNPmpi parallel computing program provides an effective computing tool for epistasis testing in large scale GWAS, and the epiSNP serial computing programs are convenient tools for epistasis analysis in small scale GWAS using commonly available computer hardware. PMID:18644146
Evaluation of European Schizophrenia GWAS Loci in Asian Populations via Comprehensive Meta-Analyses.
Xiao, Xiao; Luo, Xiong-Jian; Chang, Hong; Liu, Zichao; Li, Ming
2017-08-01
Schizophrenia is a severe and highly heritable neuropsychiatric disorder. Recent genetic analyses including genome-wide association studies (GWAS) have implicated multiple genome-wide significant variants for schizophrenia among European populations. However, many of these risk variants have not been widely validated in populations of different ancestry, such as Asians. To validate whether these European GWAS significant loci are associated with schizophrenia in Asian populations, we conducted a systematic literature search and meta-analyses on 19 single nucleotide polymorphisms (SNPs) in Asian populations by combining all available case-control and family-based samples, including up to 30,000 individuals. We employed classical fixed (or random) effects inverse variance weighted methods to calculate summary odds ratios (ORs) and 95% confidence intervals (CIs). Among the 19 GWAS loci, we replicated the risk associations of nine markers (e.g., SNPs at VRK2, ITIH3/4, NDST3, NOTCH4) surpassing the significance level (two-tailed P < 0.05), and three additional SNPs in MIR137 and ZNF804A also showed trend associations (one-tailed P < 0.05). These risk associations show the same direction of allelic effect in the Asian replication samples as in the initial European GWAS findings, and the successful replication of these GWAS loci in a different ethnic group provides stronger evidence for their clinical associations with schizophrenia. Further studies focusing on the molecular mechanisms of these GWAS significant loci will become increasingly important for understanding the pathogenesis of schizophrenia.
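The fixed-effect inverse variance weighted method mentioned in this abstract can be sketched in a few lines. This is a minimal illustration, assuming each study reports an odds ratio with a 95% confidence interval that is symmetric on the log scale; the function name and the example inputs are hypothetical and do not come from the study.

```python
import math

Z_95 = 1.959964  # standard normal quantile for a two-sided 95% interval


def fixed_effect_meta(odds_ratios, ci_lowers, ci_uppers):
    """Pool per-study odds ratios with inverse-variance weights.

    Returns (pooled_or, pooled_ci_lower, pooled_ci_upper).
    """
    log_ors = []
    weights = []
    for odds_ratio, lower, upper in zip(odds_ratios, ci_lowers, ci_uppers):
        # Recover the standard error of log(OR) from the reported 95% CI.
        se = (math.log(upper) - math.log(lower)) / (2.0 * Z_95)
        log_ors.append(math.log(odds_ratio))
        weights.append(1.0 / se ** 2)  # inverse-variance weight

    total_weight = sum(weights)
    pooled_log_or = sum(w * b for w, b in zip(weights, log_ors)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)

    return (
        math.exp(pooled_log_or),
        math.exp(pooled_log_or - Z_95 * pooled_se),
        math.exp(pooled_log_or + Z_95 * pooled_se),
    )


if __name__ == "__main__":
    # Two hypothetical studies reporting the same OR and CI.
    print(fixed_effect_meta([1.2, 1.2], [1.0, 1.0], [1.44, 1.44]))
```

A random-effects variant would add a between-study variance term to each weight; that step is omitted here to keep the sketch short.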
Applications and Limitations of Mouse Models for Understanding Human Atherosclerosis
von Scheidt, Moritz; Zhao, Yuqi; Kurt, Zeyneb; Pan, Calvin; Zeng, Lingyao; Yang, Xia; Schunkert, Heribert; Lusis, Aldons J.
2017-01-01
Most of the biological understanding of mechanisms underlying coronary artery disease (CAD) derives from studies of mouse models. The identification of multiple CAD loci and strong candidate genes in large human genome-wide association studies (GWAS) presented an opportunity to examine the relevance of mouse models for the human disease. We comprehensively reviewed the mouse literature, including 827 literature-derived genes, and compared it to human data. First, we observed striking concordance of risk factors for atherosclerosis in mice and humans. Second, there was highly significant overlap of mouse genes with human genes identified by GWAS. In particular, of the 46 genes with strong association signals in CAD-GWAS that were studied in mouse models all but one exhibited consistent effects on atherosclerosis-related phenotypes. Third, we compared 178 CAD-associated pathways derived from human GWAS with 263 from mouse studies and observed that over 50% were consistent between both species. PMID:27916529
Progress of genome wide association study in domestic animals
2012-01-01
Domestic animals are invaluable resources for study of the molecular architecture of complex traits. Although the mapping of quantitative trait loci (QTL) responsible for economically important traits in domestic animals has achieved remarkable results in recent decades, not all of the genetic variation in the complex traits has been captured because of the low density of markers used in QTL mapping studies. The genome wide association study (GWAS), which utilizes high-density single-nucleotide polymorphism (SNP), provides a new way to tackle this issue. Encouraging achievements in dissection of the genetic mechanisms of complex diseases in humans have resulted from the use of GWAS. At present, GWAS has been applied to the field of domestic animal breeding and genetics, and some advances have been made. Many genes or markers that affect economic traits of interest in domestic animals have been identified. In this review, advances in the use of GWAS in domestic animals are described. PMID:22958308
Kuo, Kevin H M
2017-01-01
The issue of multiple testing, also termed multiplicity, is ubiquitous in studies where multiple hypotheses are tested simultaneously. Genome-wide association study (GWAS), a type of genetic association study that has gained popularity in the past decade, is most susceptible to the issue of multiple testing. Different methodologies have been employed to address the issue of multiple testing in GWAS. The purpose of the review is to examine the methodologies employed in dealing with multiple testing in the context of gene discovery using GWAS in sickle cell disease complications.
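Two widely used corrections for multiple testing can be sketched as follows: a Bonferroni threshold, and the Benjamini-Hochberg step-up procedure for controlling the false discovery rate. This is only an illustrative sketch; the function names and example p-values are hypothetical, and the review itself may cover other methods.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold that keeps the family-wise error rate at alpha."""
    return alpha / n_tests


def benjamini_hochberg(p_values, fdr=0.05):
    """Return a list of booleans marking which hypotheses are rejected.

    Step-up rule: find the largest rank k such that p_(k) <= k * fdr / m,
    then reject the k smallest p-values.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])

    largest_rank = 0
    for rank, index in enumerate(order, start=1):
        if p_values[index] <= rank * fdr / m:
            largest_rank = rank

    rejected = [False] * m
    for index in order[:largest_rank]:
        rejected[index] = True
    return rejected


if __name__ == "__main__":
    # With one million tests, alpha = 0.05 gives the familiar 5e-8 threshold.
    print(bonferroni_threshold(0.05, 1_000_000))
    print(benjamini_hochberg([0.001, 0.008, 0.039, 0.041, 0.6]))
```

Bonferroni is the more conservative of the two, so it typically rejects fewer hypotheses than the false discovery rate approach on the same set of p-values.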
Liu, Guiyou; Zhang, Fang; Jiang, Yongshuai; Hu, Yang; Gong, Zhongying; Liu, Shoufeng; Chen, Xiuju; Jiang, Qinghua; Hao, Junwei
2017-02-01
Much effort has been expended on identifying the genetic determinants of multiple sclerosis (MS). Existing large-scale genome-wide association study (GWAS) datasets provide strong support for using pathway and network-based analysis methods to investigate the mechanisms underlying MS. However, no shared genetic pathways have been identified to date. We hypothesize that shared genetic pathways may indeed exist in different MS-GWAS datasets. Here, we report results from a three-stage analysis of GWAS and expression datasets. In stage 1, we conducted multiple pathway analyses of two MS-GWAS datasets. In stage 2, we performed a candidate pathway analysis of the large-scale MS-GWAS dataset. In stage 3, we performed a pathway analysis using the dysregulated MS gene list from seven human MS case-control expression datasets. In stage 1, we identified 15 shared pathways. In stage 2, we successfully replicated 14 of these 15 significant pathways. In stage 3, we found that dysregulated MS genes were significantly enriched in 10 of 15 MS risk pathways identified in stages 1 and 2. We report shared genetic pathways in different MS-GWAS datasets and highlight some new MS risk pathways. Our findings provide new insights on the genetic determinants of MS.
Huang, Dandan; Yi, Xianfu; Zhang, Shijie; Zheng, Zhanye; Wang, Panwen; Xuan, Chenghao; Sham, Pak Chung; Wang, Junwen; Li, Mulin Jun
2018-05-16
Genome-wide association studies have identified thousands of susceptibility loci for many human complex traits, and yet for most of these associations the true causal variants remain unknown. Tissue/cell type-specific prediction and prioritization of non-coding regulatory variants will facilitate the identification of causal variants and underlying pathogenic mechanisms for particular complex diseases and traits. By leveraging recent large-scale functional genomics/epigenomics data, we develop an intuitive web server, GWAS4D (http://mulinlab.tmu.edu.cn/gwas4d or http://mulinlab.org/gwas4d), that systematically evaluates GWAS signals and identifies context-specific regulatory variants. The updated web server includes six major features: (i) updates the regulatory variant prioritization method with our new algorithm; (ii) incorporates 127 tissue/cell type-specific epigenomes data; (iii) integrates motifs of 1480 transcriptional regulators from 13 public resources; (iv) uniformly processes Hi-C data and generates significant interactions at 5 kb resolution across 60 tissues/cell types; (v) adds comprehensive non-coding variant functional annotations; (vi) provides a highly interactive visualization function for SNP-target interaction. Using a GWAS fine-mapped set for 161 coronary artery disease risk loci, we demonstrate that GWAS4D is able to efficiently prioritize disease-causal regulatory variants.
Frau, Francesca; Crowther, Daniel; Ruetten, Hartmut; Allebrandt, Karla V
2017-05-01
Genome-wide association studies (GWAs) for type 2 diabetes (T2D) have been successful in identifying many loci with robust association signals. Nevertheless, there is a clear need for post-GWAs strategies to understand the mechanism of action and clinical relevance of these variants. The association of several comorbidities with T2D suggests a common etiology for these phenotypes and complicates the management of the disease. In this study, we focused on the genetics underlying these relationships, using systems genomics to identify genetic variation associated with T2D and 12 other traits. GWAs summary statistics for pairwise comparisons were obtained for glycemic traits, obesity, coronary artery disease, and lipids from large consortia GWAs meta-analyses. We used a network medicine approach to leverage experimental information about the identified genes and variants with cross-trait effects for biological function interpretation. We identified a set of 38 genetic variants with cross-trait effects that point to a main network of genes that should be relevant for T2D and its comorbidities. We prioritized the T2D associated genes based on the number of traits they showed association with and the experimental evidence showing their relation to the disease etiology. In this study, we demonstrated how systems genomics and network medicine approaches can shed light on GWAs discoveries, translating findings into a more therapeutically relevant context. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Jostins, Luke; Levine, Adam P; Barrett, Jeffrey C
2013-01-01
A central focus of complex disease genetics after genome-wide association studies (GWAS) is to identify low frequency and rare risk variants, which may account for an important fraction of disease heritability unexplained by GWAS. A profusion of studies using next-generation sequencing are seeking such risk alleles. We describe how already-known complex trait loci (largely from GWAS) can be used to guide the design of these new studies by selecting cases, controls, or families who are most likely to harbor undiscovered risk alleles. We show that genetic risk prediction can select unrelated cases from large cohorts who are enriched for unknown risk factors, or multiply-affected families that are more likely to harbor high-penetrance risk alleles. We derive the frequency of an undiscovered risk allele in selected cases and controls, and show how this relates to the variance explained by the risk score, the disease prevalence and the population frequency of the risk allele. We also describe a new method for informing the design of sequencing studies using genetic risk prediction in large partially-genotyped families using an extension of the Inside-Outside algorithm for inference on trees. We explore several study design scenarios using both simulated and real data, and show that in many cases genetic risk prediction can provide significant increases in power to detect low-frequency and rare risk alleles. The same approach can also be used to aid discovery of non-genetic risk factors, suggesting possible future utility of genetic risk prediction in conventional epidemiology. Software implementing the methods in this paper is available in the R package Mangrove.
Genome-wide association study of alcohol dependence
Treutlein, Jens; Cichon, Sven; Ridinger, Monika; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Moessner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Fehr, Christoph; Scherbaum, Norbert; Steffens, Michael; Ludwig, Kerstin U.; Frank, Josef; Wichmann, H.- Erich; Schreiber, Stefan; Dragano, Nico; Sommer, Wolfgang; Leonardi-Essmann, Fernando; Lourdusamy, Anbarasu; Gebicke-Haerter, Peter; Wienker, Thomas F.; Sullivan, Patrick F.; Nöthen, Markus M.; Kiefer, Falk; Spanagel, Rainer; Mann, Karl; Rietschel, Marcella
2014-01-01
Context: Identification of genes contributing to alcohol dependence will improve our understanding of the mechanisms underlying this disorder. Objective: To identify susceptibility genes for alcohol dependence through a genome-wide association study (GWAS) and follow-up study in a population of German male inpatients with an early age at onset. Design: The GWAS included 487 male inpatients with DSM-IV alcohol dependence with an age at onset below 28 years and 1,358 population based control individuals. The follow-up study included 1,024 male inpatients and 996 age-matched male controls. All subjects were of German descent. The GWAS tested 524,396 single nucleotide polymorphisms (SNPs). All SNPs with p<10^-4 were subjected to the follow-up study. In addition, nominally significant SNPs from those genes that had also shown expression changes in rat brains after chronic alcohol consumption were selected for the follow-up step. Results: The GWAS produced 121 SNPs with nominal p<10^-4. These, together with 19 additional SNPs from homologs of rat genes showing differential expression, were genotyped in the follow-up sample. Fifteen SNPs showed significant association with the same allele as in the GWAS. In the combined analysis, two closely linked intergenic SNPs met genome-wide significance (rs7590720 p=9.72×10^-9; rs1344694 p=1.69×10^-8). They are located on chromosome 2q35, a region which has been implicated in linkage studies for alcohol phenotypes. Nine SNPs were located in genes, including CDH13 and ADH1C genes which have been reported to be associated with alcohol dependence. Conclusion: This is the first GWAS and follow-up study to identify a genome-wide significant association in alcohol dependence. Further independent studies are required to confirm these findings. PMID:19581569
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gilbert, Jack A.; Quinn, Robert A.; Debelius, Justine
Rapid advances in DNA sequencing, metabolomics, proteomics and computation dramatically increase accessibility of microbiome studies and identify links between the microbiome and disease. Microbial time-series and multiple molecular perspectives enable Microbiome-Wide Association Studies (MWAS), analogous to Genome-Wide Association Studies (GWAS). Rapid research advances point towards actionable results, although approved clinical tests based on MWAS are still in the future. Appreciating the complexity of interactions between diet, chemistry, health and the microbiome, and determining the frequency of observations needed to capture and integrate this dynamic interface, is paramount for addressing the need for personalized and precision microbiome-based diagnostics and therapies.
Fast and accurate genotype imputation in genome-wide association studies through pre-phasing
Howie, Bryan; Fuchsberger, Christian; Stephens, Matthew; Marchini, Jonathan; Abecasis, Gonçalo R.
2013-01-01
Sequencing efforts, including the 1000 Genomes Project and disease-specific efforts, are producing large collections of haplotypes that can be used for genotype imputation in genome-wide association studies (GWAS). Imputing from these reference panels can help identify new risk alleles, but the use of large panels with existing methods imposes a high computational burden. To keep imputation broadly accessible, we introduce a strategy called “pre-phasing” that maintains the accuracy of leading methods while cutting computational costs by orders of magnitude. In brief, we first statistically estimate the haplotypes for each GWAS individual (“pre-phasing”) and then impute missing genotypes into these estimated haplotypes. This reduces the computational cost because: (i) the GWAS samples must be phased only once, whereas standard methods would implicitly re-phase with each reference panel update; (ii) it is much faster to match a phased GWAS haplotype to one reference haplotype than to match unphased GWAS genotypes to a pair of reference haplotypes. This strategy will be particularly valuable for repeated imputation as reference panels evolve. PMID:22820512
Transethnic differences in GWAS signals: A simulation study.
Zanetti, Daniela; Weale, Michael E
2018-05-07
Genome-wide association studies (GWASs) have allowed researchers to identify thousands of single nucleotide polymorphisms (SNPs) and other variants associated with particular complex traits. Previous studies have reported differences in the strength and even the direction of GWAS signals across different populations. These differences could be due to a combination of (1) lack of power, (2) allele frequency differences, (3) linkage disequilibrium (LD) differences, and (4) true differences in causal variant effect sizes. To determine whether properties (1)-(3) on their own might be sufficient to explain the patterns previously noted in strong GWAS signals, we simulated case-control data of European, Asian and African ancestry, applying realistic allele frequencies and LD from 1000 Genomes data but enforcing equal causal effect sizes across populations. Much of the observed differences in strong GWAS signals could indeed be accounted for by allele frequency and LD differences, enhanced by the Euro-centric SNP bias and lower SNP coverage found in older GWAS panels. While we cannot rule out a role for true transethnic effect size differences, our results suggest that strong causal effects may be largely shared among human populations, motivating the use of transethnic data for fine-mapping. © 2018 John Wiley & Sons Ltd/University College London.
Software engineering the mixed model for genome-wide association studies on large samples.
Zhang, Zhiwu; Buckler, Edward S; Casstevens, Terry M; Bradbury, Peter J
2009-11-01
Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample size and number of markers used for GWAS is increasing dramatically, resulting in greater statistical power to detect those associations. The use of mixed models with increasingly large data sets depends on the availability of software for analyzing those models. While multiple software packages implement the mixed model method, no single package provides the best combination of fast computation, ability to handle large samples, flexible modeling and ease of use. Key elements of association analysis with mixed models are reviewed, including modeling phenotype-genotype associations using mixed models, population stratification, kinship and its estimation, variance component estimation, use of best linear unbiased predictors or residuals in place of raw phenotype, improving efficiency and software-user interaction. The available software packages are evaluated, and suggestions made for future software development.
Saeed, Mohammad
2017-05-01
Systemic lupus erythematosus (SLE) is a complex disorder. Genetic association studies of complex disorders suffer from the following three major issues: phenotypic heterogeneity, false positive (type I error), and false negative (type II error) results. Hence, genes with low to moderate effects are missed in standard analyses, especially after statistical corrections. OASIS is a novel linkage disequilibrium clustering algorithm that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders such as SLE. OASIS was applied to two SLE dbGaP GWAS datasets (6077 subjects; ∼0.75 million single-nucleotide polymorphisms). OASIS identified three known SLE genes, viz. IFIH1, TNIP1, and CD44, not previously reported using these GWAS datasets. In addition, 22 novel loci for SLE were identified and the 5 SLE genes previously reported using these datasets were verified. OASIS methodology was validated using single-variant replication and gene-based analysis with GATES. This led to the verification of 60% of OASIS loci. New SLE genes that OASIS identified and that were further verified include TNFAIP6, DNAJB3, TTF1, GRIN2B, MON2, LATS2, SNX6, RBFOX1, NCOA3, and CHAF1B. This study presents the OASIS algorithm, software, and the meta-analyses of two publicly available SLE GWAS datasets along with the novel SLE genes. Hence, OASIS is a novel linkage disequilibrium clustering method that can be universally applied to existing GWAS datasets for the identification of new genes.
e-GRASP: an integrated evolutionary and GRASP resource for exploring disease associations.
Karim, Sajjad; NourEldin, Hend Fakhri; Abusamra, Heba; Salem, Nada; Alhathli, Elham; Dudley, Joel; Sanderford, Max; Scheinfeldt, Laura B; Chaudhary, Adeel G; Al-Qahtani, Mohammed H; Kumar, Sudhir
2016-10-17
Genome-wide association studies (GWAS) have become a mainstay of biological research concerned with discovering genetic variation linked to phenotypic traits and diseases. Both discrete and continuous traits can be analyzed in GWAS to discover associations between single nucleotide polymorphisms (SNPs) and traits of interest. Associations are typically determined by estimating the significance of the statistical relationship between genetic loci and the given trait. However, the prioritization of bona fide, reproducible genetic associations from GWAS results remains a central challenge in identifying genomic loci underlying common complex diseases. Evolutionary-aware meta-analysis of the growing GWAS literature is one way to address this challenge and to advance from association to causation in the discovery of genotype-phenotype relationships. We have created an evolutionary GWAS resource to enable in-depth query and exploration of published GWAS results. This resource uses the publicly available GWAS results annotated in the GRASP2 database. The GRASP2 database includes results from 2082 studies, 177 broad phenotype categories, and ~8.87 million SNP-phenotype associations. For each SNP in e-GRASP, we present information from the GRASP2 database for convenience as well as evolutionary information (e.g., rate and timespan). Users can, therefore, identify not only SNPs with highly significant phenotype-association P-values, but also SNPs that are highly replicated and/or occur at evolutionarily conserved sites that are likely to be functionally important. Additionally, we provide an evolutionary-adjusted SNP association ranking (E-rank) that uses cross-species evolutionary conservation scores and population allele frequencies to transform P-values in an effort to enhance the discovery of SNPs with a greater probability of biologically meaningful disease associations.
By adding an evolutionary dimension to the GWAS results available in the GRASP2 database, our e-GRASP resource will enable a more effective exploration of SNPs not only by the statistical significance of trait associations, but also by the number of studies in which associations have been replicated, and the evolutionary context of the associated mutations. Therefore, e-GRASP will be a valuable resource for aiding researchers in the identification of bona fide, reproducible genetic associations from GWAS results. This resource is freely available at http://www.mypeg.info/egrasp.
A two-stage genome-wide association study of sporadic amyotrophic lateral sclerosis.
Chiò, Adriano; Schymick, Jennifer C; Restagno, Gabriella; Scholz, Sonja W; Lombardo, Federica; Lai, Shiao-Lin; Mora, Gabriele; Fung, Hon-Chung; Britton, Angela; Arepalli, Sampath; Gibbs, J Raphael; Nalls, Michael; Berger, Stephen; Kwee, Lydia Coulter; Oddone, Eugene Z; Ding, Jinhui; Crews, Cynthia; Rafferty, Ian; Washecka, Nicole; Hernandez, Dena; Ferrucci, Luigi; Bandinelli, Stefania; Guralnik, Jack; Macciardi, Fabio; Torri, Federica; Lupoli, Sara; Chanock, Stephen J; Thomas, Gilles; Hunter, David J; Gieger, Christian; Wichmann, H Erich; Calvo, Andrea; Mutani, Roberto; Battistini, Stefania; Giannini, Fabio; Caponnetto, Claudia; Mancardi, Giovanni Luigi; La Bella, Vincenzo; Valentino, Francesca; Monsurrò, Maria Rosaria; Tedeschi, Gioacchino; Marinou, Kalliopi; Sabatelli, Mario; Conte, Amelia; Mandrioli, Jessica; Sola, Patrizia; Salvi, Fabrizio; Bartolomei, Ilaria; Siciliano, Gabriele; Carlesi, Cecilia; Orrell, Richard W; Talbot, Kevin; Simmons, Zachary; Connor, James; Pioro, Erik P; Dunkley, Travis; Stephan, Dietrich A; Kasperaviciute, Dalia; Fisher, Elizabeth M; Jabonka, Sibylle; Sendtner, Michael; Beck, Marcus; Bruijn, Lucie; Rothstein, Jeffrey; Schmidt, Silke; Singleton, Andrew; Hardy, John; Traynor, Bryan J
2009-04-15
The cause of sporadic amyotrophic lateral sclerosis (ALS) is largely unknown, but genetic factors are thought to play a significant role in determining susceptibility to motor neuron degeneration. To identify genetic variants altering risk of ALS, we undertook a two-stage genome-wide association study (GWAS): we followed our initial GWAS of 545 066 SNPs in 553 individuals with ALS and 2338 controls by testing the 7600 most associated SNPs from the first stage in three independent cohorts consisting of 2160 cases and 3008 controls. None of the SNPs selected for replication exceeded the Bonferroni threshold for significance. The two most significantly associated SNPs, rs2708909 and rs2708851 [odds ratio (OR) = 1.17 and 1.18, and P-values = 6.98 × 10−7 and 1.16 × 10−6], were located on chromosome 7p13.3 within a 175 kb linkage disequilibrium block containing the SUNC1, HUS1 and C7orf57 genes. These associations did not achieve genome-wide significance in the original cohort and failed to replicate in an additional independent cohort of 989 US cases and 327 controls (OR = 1.18 and 1.19, P-values = 0.08 and 0.06, respectively). Thus, we chose to cautiously interpret our data as hypothesis-generating requiring additional confirmation, especially as all previously reported loci for ALS have failed to replicate successfully. Indeed, the three loci (FGGY, ITPR2 and DPP6) identified in previous GWAS of sporadic ALS were not significantly associated with disease in our study. Our findings suggest that ALS is more genetically and clinically heterogeneous than previously recognized. Genotype data from our study have been made available online to facilitate such future endeavors. PMID:19193627
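As a rough illustration of the Bonferroni criterion mentioned in the abstract above, the sketch below assumes a family-wise alpha of 0.05 divided by the 545 066 first-stage SNPs. The abstract does not state how the threshold was actually computed, so both the alpha and the number of tests are assumptions, and the function name is hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumption: alpha = 0.05 and one test per first-stage SNP.
stage1_snps = 545_066
threshold = bonferroni_threshold(0.05, stage1_snps)

# P-values of the two top SNPs as reported in the abstract.
top_p_values = {"rs2708909": 6.98e-7, "rs2708851": 1.16e-6}

# True only if the SNP's P-value is below the corrected threshold.
passes = {snp: p < threshold for snp, p in top_p_values.items()}

print(f"threshold = {threshold:.3e}")
for snp, ok in passes.items():
    print(snp, "passes" if ok else "does not pass")
```

Under these assumed settings neither SNP falls below the corrected threshold, which agrees with the abstract's statement that the associations did not reach genome-wide significance.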
Workalemahu, Tsegaselassie; Enquobahrie, Daniel A; Gelaye, Bizu; Sanchez, Sixto E; Garcia, Pedro J; Tekola-Ayele, Fasil; Hajat, Anjum; Thornton, Timothy A; Ananth, Cande V; Williams, Michelle A
2018-06-01
Accumulating epidemiological evidence points to strong genetic susceptibility to placental abruption (PA). However, characterization of genes associated with PA remains incomplete. We conducted a genome-wide association study (GWAS) of PA and a meta-analysis of GWAS. Participants of the Placental Abruption Genetic Epidemiology (PAGE) study, a population-based case-control study of PA conducted in Lima, Peru, were genotyped using the Illumina HumanCore-24 BeadChip platform. Genotypes were imputed using the 1000 Genomes reference panel, and >4.9 million SNPs that passed quality control were analyzed. We performed a GWAS in PAGE participants (507 PA cases and 1090 controls) and a GWAS meta-analysis in 2512 participants (959 PA cases and 1553 controls) that included PAGE and the previously reported Peruvian Abruptio Placentae Epidemiology (PAPE) study. We fitted population stratification-adjusted logistic regression models and fixed-effects meta-analyses using inverse-variance weighting. Independent loci (linkage-disequilibrium<0.80) suggestively associated with PA (P-value<5e-5) included rs4148646 and rs2074311 in ABCC8, rs7249210, rs7250184, rs7249100 and rs10401828 in ZNF28, rs11133659 in CTNND2, and rs2074314 and rs35271178 near KCNJ11 in the PAGE GWAS. Similarly, independent loci suggestively associated with PA in the GWAS meta-analysis included rs76258369 near IRX1, and rs7094759 and rs12264492 in ADAM12. Functional analyses of these genes showed trophoblast-like cell interaction, as well as networks involved in endocrine system disorders, cardiovascular diseases, and cellular function. We identified several genetic loci and related functions that may play a role in PA risk. Understanding genetic factors underlying pathophysiological mechanisms of PA may facilitate prevention and early diagnostic efforts. Published by Elsevier Ltd.
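The fixed-effects, inverse-variance-weighted meta-analysis named in the abstract above can be sketched as follows. This is a generic textbook version, not the authors' pipeline: the function name is hypothetical, and the two per-study log odds ratios and standard errors are made-up numbers for illustration only.

```python
import math


def fixed_effects_meta(betas, ses):
    """Combine per-study effect estimates with inverse-variance weights.

    betas: per-study effect sizes (e.g. log odds ratios for one SNP)
    ses:   matching standard errors
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and equal length")
    weights = [1.0 / se ** 2 for se in ses]          # w_i = 1 / se_i^2
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    z = pooled_beta / pooled_se
    # Two-sided P-value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Hypothetical log odds ratios for one SNP in two studies.
beta, se, z, p = fixed_effects_meta(betas=[0.30, 0.20], ses=[0.10, 0.20])
print(f"pooled log OR = {beta:.3f}, SE = {se:.4f}, z = {z:.2f}, P = {p:.2e}")
```

The more precise study (smaller standard error) receives four times the weight of the other, so the pooled estimate sits closer to its effect size.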
Cowper-Sal lari, Richard; Cole, Michael D; Karagas, Margaret R; Lupien, Mathieu; Moore, Jason H
2011-01-01
The conceptual foundation of the genome-wide association study (GWAS) has advanced unchecked since its conception. A revision might seem premature as the potential of GWAS has not been fully realized. Multiple technical and practical limitations need to be overcome before GWAS can be fairly criticized. But with the completion of hundreds of studies and a deeper understanding of the genetic architecture of disease, warnings are being raised. The results compiled to date indicate that risk-associated variants lie predominantly in noncoding regions of the genome. Additionally, alternative methodologies are uncovering large and heterogeneous sets of rare variants underlying disease. The fear is that, even in its fulfillment, the current GWAS paradigm might be incapable of dissecting all kinds of phenotypes. In the following text, we review several initiatives that aim to overcome these limitations. The overarching theme of these studies is the inclusion of biological knowledge to both the analysis and interpretation of genotyping data. GWAS is uninformed of biology by design and although there is some virtue in its simplicity, it is also its most conspicuous deficiency. We propose a framework in which to integrate these novel approaches, both empirical and theoretical, in the form of a genome-wide regulatory network (GWRN). By processing experimental data into networks, emerging data types based on chromatin immunoprecipitation are made computationally tractable. This will give GWAS re-analysis efforts the most current and relevant substrates, and root them firmly on our knowledge of human disease. Copyright © 2010 John Wiley & Sons, Inc.
Bossini-Castillo, Lara; Martin, Jose-Ezequiel; Broen, Jasper; Gorlova, Olga; Simeón, Carmen P.; Beretta, Lorenzo; Vonk, Madelon C.; Luis Callejas, Jose; Castellví, Ivan; Carreira, Patricia; José García-Hernández, Francisco; Fernández Castro, Mónica; Coenen, Marieke J.H.; Riemekasten, Gabriela; Witte, Torsten; Hunzelmann, Nicolas; Kreuter, Alexander; Distler, Jörg H.W.; Koeleman, Bobby P.; Voskuyl, Alexandre E.; Schuerwegh, Annemie J.; Palm, Øyvind; Hesselstrand, Roger; Nordin, Annika; Airó, Paolo; Lunardi, Claudio; Scorza, Raffaella; Shiels, Paul; van Laar, Jacob M.; Herrick, Ariane; Worthington, Jane; Denton, Christopher; Tan, Filemon K.; Arnett, Frank C.; Agarwal, Sandeep K.; Assassi, Shervin; Fonseca, Carmen; Mayes, Maureen D.; Radstake, Timothy R.D.J.; Martin, Javier
2012-01-01
A single-nucleotide polymorphism (SNP) at the IL12RB2 locus showed a suggestive association signal in a previously published genome-wide association study (GWAS) in systemic sclerosis (SSc). Aiming to reveal the possible implication of the IL12RB2 gene in SSc, we conducted a follow-up study of this locus in different Caucasian cohorts. We analyzed 10 GWAS-genotyped SNPs in the IL12RB2 region (2309 SSc patients and 5161 controls). We then selected three SNPs (rs3790567, rs3790566 and rs924080) based on their significance level in the GWAS, for follow-up in an independent European cohort comprising 3344 SSc and 3848 controls. The most-associated SNP (rs3790567) was further tested in an independent cohort comprising 597 SSc patients and 1139 controls from the USA. After conditional logistic regression analysis of the GWAS data, we selected rs3790567 [PMH = 1.92 × 10−5, odds ratio (OR) = 1.19] as the genetic variant with the firmest independent association observed in the analyzed GWAS peak of association. After the first follow-up phase, only the association of rs3790567 was consistent (PMH = 4.84 × 10−3, OR = 1.12). The second follow-up phase confirmed this finding (Pχ2 = 2.82 × 10−4, OR = 1.34). After performing overall pooled-analysis of all the cohorts included in the present study, the association found for the rs3790567 SNP in the IL12RB2 gene region reached GWAS-level significant association (PMH = 2.82 × 10−9, OR = 1.17). Our data clearly support the IL12RB2 genetic association with SSc, and suggest a relevant role of the interleukin 12 signaling pathway in SSc pathogenesis. PMID:22076442
Wu, Mengmeng; Zeng, Wanwen; Liu, Wenqiang; Lv, Hairong; Chen, Ting; Jiang, Rui
2018-06-03
Genome-wide association studies (GWAS) have successfully discovered a number of disease-associated genetic variants in the past decade, providing an unprecedented opportunity for deciphering the genetic basis of human inherited diseases. However, it is still a challenging task to extract biological knowledge from the GWAS data, due to such issues as missing heritability and weak interpretability. Indeed, the fact that the majority of discovered loci fall into noncoding regions without clear links to genes has prevented the characterization of their functions and calls for a sophisticated approach to bridge genetic and genomic studies. To address this problem, network-based prioritization of candidate genes, which performs integrated analysis of gene networks with GWAS data, has emerged as a promising direction and attracted much attention. However, most existing methods overlook the sparse and noisy properties of gene networks and thus may lead to suboptimal performance. Motivated by this understanding, we proposed a novel method called REGENT for integrating multiple gene networks with GWAS data to prioritize candidate genes for complex diseases. We leveraged a technique called network representation learning to embed a gene network into a compact and robust feature space, and then designed a hierarchical statistical model to integrate features of multiple gene networks with GWAS data for the effective inference of genes associated with a disease of interest. We applied our method to six complex diseases and demonstrated the superior performance of REGENT over existing approaches in recovering known disease-associated genes. We further conducted a pathway analysis and showed the ability of REGENT to discover disease-associated pathways. We expect to see applications of our method to a broad spectrum of diseases for post-GWAS analysis. REGENT is freely available at https://github.com/wmmthu/REGENT. Copyright © 2018 Elsevier Inc. All rights reserved.
Genome-wide association studies in maize: praise and stargaze
USDA-ARS's Scientific Manuscript database
Genome-wide association study (GWAS) has appeared as a widespread strategy in decoding genotype-phenotype associations in many species thanks to technical advances in next-generation sequencing (NGS) applications. Maize is an ideal crop for GWAS and significant progress has been made in the last dec...
Schizophrenia interactome with 504 novel protein–protein interactions
Ganapathiraju, Madhavi K; Thahir, Mohamed; Handen, Adam; Sarkar, Saumendra N; Sweet, Robert A; Nimgaonkar, Vishwajit L; Loscher, Christine E; Bauer, Eileen M; Chaparala, Srilakshmi
2016-01-01
Genome-wide association studies (GWAS) of schizophrenia have revealed the role of rare and common genetic variants, but the functional effects of the risk variants remain to be understood. Protein interactome-based studies can facilitate the study of molecular mechanisms by which the risk genes relate to schizophrenia (SZ) genesis, but protein–protein interactions (PPIs) are unknown for many of the liability genes. We developed a computational model to discover PPIs, which is found to be highly accurate according to computational evaluations and experimental validations of selected PPIs. We present here 365 novel PPIs of liability genes identified by the SZ Working Group of the Psychiatric Genomics Consortium (PGC). Seventeen genes that had no previously known interactions have 57 novel interactions by our method. Among the new interactors are 19 drug targets that are targeted by 130 drugs. In addition, we computed 147 novel PPIs of 25 candidate genes investigated in the pre-GWAS era. While there is little overlap between the GWAS genes and the pre-GWAS genes, the interactomes reveal that they largely belong to the same pathways, thus reconciling the apparent disparities between the GWAS and prior gene association studies. The interactome, including 504 novel PPIs overall, could motivate other systems biology studies and trials with repurposed drugs. The PPIs are made available on a webserver, called Schizo-Pi, at http://severus.dbmi.pitt.edu/schizo-pi with advanced search capabilities. PMID:27336055
α-amanitin resistance in Drosophila melanogaster: A genome-wide association approach.
Mitchell, Chelsea L; Latuszek, Catrina E; Vogel, Kara R; Greenlund, Ian M; Hobmeier, Rebecca E; Ingram, Olivia K; Dufek, Shannon R; Pecore, Jared L; Nip, Felicia R; Johnson, Zachary J; Ji, Xiaohui; Wei, Hairong; Gailing, Oliver; Werner, Thomas
2017-01-01
We investigated the mechanisms of mushroom toxin resistance in the Drosophila Genetic Reference Panel (DGRP) fly lines, using genome-wide association studies (GWAS). While Drosophila melanogaster avoids mushrooms in nature, some lines are surprisingly resistant to α-amanitin, a toxin found solely in mushrooms. This resistance may represent a pre-adaptation, which might enable this species to invade the mushroom niche in the future. Although our previous microarray study had strongly suggested that pesticide-metabolizing detoxification genes confer α-amanitin resistance in a Taiwanese D. melanogaster line Ama-KTT, none of the traditional detoxification genes were among the top candidate genes resulting from the GWAS in the current study. Instead, we identified Megalin, Tequila, and widerborst as candidate genes underlying the α-amanitin resistance phenotype in the North American DGRP lines, all three of which are connected to the Target of Rapamycin (TOR) pathway. Both widerborst and Tequila are upstream regulators of TOR, and TOR is a key regulator of autophagy and Megalin-mediated endocytosis. We suggest that endocytosis and autophagy of α-amanitin, followed by lysosomal degradation of the toxin, is one of the mechanisms that confer α-amanitin resistance in the DGRP lines.
Wei, Wen-Hua; Bowes, John; Plant, Darren; Viatte, Sebastien; Yarwood, Annie; Massey, Jonathan; Worthington, Jane; Eyre, Stephen
2016-04-25
Genotypic variability based genome-wide association studies (vGWASs) can identify potentially interacting loci without prior knowledge of the interacting factors. We report a two-stage approach to make vGWAS applicable to diseases: first, using a mixed model approach to partition dichotomous phenotypes into additive risk and non-additive environmental residuals on the liability scale; and second, using Levene's (Brown-Forsythe) test to assess equality of the residual variances across genotype groups per marker. We found widespread significant (P < 2.5e-05) vGWAS signals within the major histocompatibility complex (MHC) across all three study cohorts of rheumatoid arthritis. We further identified 10 epistatic interactions between the vGWAS signals independent of the MHC additive effects, each with a weak effect but jointly explaining 1.9% of phenotypic variance. PTPN22 was also identified in the discovery cohort but replicated in only one independent cohort. Combining the three cohorts boosted power of vGWAS and additionally identified TYK2 and ANKRD55. Both PTPN22 and TYK2 had evidence of interactions reported elsewhere. We conclude that vGWAS can help discover interacting loci for complex diseases but requires large samples to find additional signals.
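The second stage described above, a Brown-Forsythe (median-centred Levene) test on residuals grouped by genotype, can be sketched with the standard library alone. This is a minimal illustration under stated assumptions, not the authors' code: the function name is hypothetical, the residuals are invented, and only the test statistic is computed. A P-value would come from an F distribution with (k - 1, N - k) degrees of freedom, for example via `scipy.stats.levene(..., center="median")` if SciPy is available.

```python
from statistics import mean, median


def brown_forsythe_statistic(groups):
    """Brown-Forsythe statistic W for equality of variances.

    groups: list of lists, one list of residuals per genotype group.
    Assumes at least two groups and non-zero within-group spread.
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    # Absolute deviations from each group's median.
    z = [[abs(y - median(g)) for y in g] for g in groups]
    z_means = [mean(zg) for zg in z]
    z_grand = sum(sum(zg) for zg in z) / n_total
    between = sum(len(zg) * (zm - z_grand) ** 2 for zg, zm in zip(z, z_means))
    within = sum((v - zm) ** 2 for zg, zm in zip(z, z_means) for v in zg)
    return (n_total - k) / (k - 1) * between / within


# Hypothetical liability-scale residuals for genotypes 0, 1 and 2 at one marker.
residuals_by_genotype = [
    [-0.1, 0.0, 0.1, 0.0],
    [-0.5, 0.0, 0.5, 0.0],
    [-1.0, 0.0, 1.0, 0.0],
]
w = brown_forsythe_statistic(residuals_by_genotype)
print(f"W = {w:.4f}")
```

Centring on the median rather than the mean is what distinguishes the Brown-Forsythe variant and makes the statistic less sensitive to skewed residuals.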
Anonymization of electronic medical records for validating genome-wide association studies
Loukides, Grigorios; Gkoulalas-Divanis, Aris; Malin, Bradley
2010-01-01
Genome-wide association studies (GWAS) facilitate the discovery of genotype–phenotype relations from population-based sequence databases, which is an integral facet of personalized medicine. The increasing adoption of electronic medical records allows large amounts of patients’ standardized clinical features to be combined with the genomic sequences of these patients and shared to support validation of GWAS findings and to enable novel discoveries. However, disseminating these data “as is” may lead to patient reidentification when genomic sequences are linked to resources that contain the corresponding patients’ identity information based on standardized clinical features. This work proposes an approach that provably prevents this type of data linkage and furnishes a result that helps support GWAS. Our approach automatically extracts potentially linkable clinical features and modifies them in a way that they can no longer be used to link a genomic sequence to a small number of patients, while preserving the associations between genomic sequences and specific sets of clinical features corresponding to GWAS-related diseases. Extensive experiments with real patient data derived from the Vanderbilt University Medical Center verify that our approach generates data that eliminate the threat of individual reidentification, while supporting GWAS validation and clinical case analysis tasks. PMID:20385806
A Conceptual Framework for Pharmacodynamic Genome-wide Association Studies in Pharmacogenomics
Wu, Rongling; Tong, Chunfa; Wang, Zhong; Mauger, David; Tantisira, Kelan; Szefler, Stanley J.; Chinchilli, Vernon M.; Israel, Elliot
2013-01-01
Genome-wide association studies (GWAS) have emerged as a powerful tool to identify loci that affect drug response or susceptibility to adverse drug reactions. However, current GWAS, based on a simple analysis of associations between genotype and phenotype, ignore the biochemical reactions of drug response, thus limiting the scope of inference about its genetic architecture. To facilitate the inference of GWAS in pharmacogenomics, we sought to undertake the mathematical integration of the pharmacodynamic process of drug reactions through computational models. By estimating and testing the genetic control of pharmacodynamic and pharmacokinetic parameters, this mechanistic approach not only enhances the biological and clinical relevance of significant genetic associations, but also improves the statistical power and robustness of gene detection. This report discusses the general principle and development of pharmacodynamics-based GWAS, highlights the practical use of this approach in addressing various pharmacogenomic problems, and suggests that this approach will be an important method to study the genetic architecture of drug responses or reactions. PMID:21920452
Liley, James; Wallace, Chris
2015-02-01
Genome-wide association studies (GWAS) have been successful in identifying single nucleotide polymorphisms (SNPs) associated with many traits and diseases. However, at existing sample sizes, these variants explain only part of the estimated heritability. Leverage of GWAS results from related phenotypes may improve detection without the need for larger datasets. The Bayesian conditional false discovery rate (cFDR) constitutes an upper bound on the expected false discovery rate (FDR) across a set of SNPs whose p values for two diseases are both less than two disease-specific thresholds. Calculation of the cFDR requires only summary statistics and has several advantages over traditional GWAS analysis. However, existing methods require distinct control samples between studies. Here, we extend the technique to allow for some or all controls to be shared, increasing applicability. Several different SNP sets can be defined with the same cFDR value, and we show that the expected FDR across the union of these sets may exceed expected FDR in any single set. We describe a procedure to establish an upper bound for the expected FDR among the union of such sets of SNPs. We apply our technique to pairwise analysis of p values from ten autoimmune diseases with variable sharing of controls, enabling discovery of 59 SNP-disease associations which do not reach GWAS significance after genomic control in individual datasets. Most of the SNPs we highlight have previously been confirmed using replication studies or larger GWAS, a useful validation of our technique; we report eight SNP-disease associations across five diseases not previously declared. Our technique extends and strengthens the previous algorithm, and establishes robust limits on the expected FDR. This approach can improve SNP detection in GWAS, and give insight into shared aetiology between phenotypically related conditions.
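To make the cFDR idea above concrete, the sketch below uses a simple empirical estimator often quoted for the independent-controls case: the principal-trait threshold multiplied by the number of SNPs passing the conditional-trait threshold, divided by the number passing both. Whether this matches the exact estimator in the paper is an assumption, it does not implement the shared-control extension the abstract describes, and the function name and p values are hypothetical.

```python
def empirical_cfdr(p_principal, p_conditional, p_i, p_j):
    """Empirical conditional FDR estimate at thresholds (p_i, p_j).

    p_principal:   p values for the trait of interest, one per SNP
    p_conditional: p values for the related trait, same SNP order
    Returns p_i * #{P_j <= p_j} / #{P_i <= p_i and P_j <= p_j}, capped at 1.
    """
    if len(p_principal) != len(p_conditional):
        raise ValueError("p value lists must have equal length")
    n_cond = sum(1 for q in p_conditional if q <= p_j)
    n_both = sum(
        1 for p, q in zip(p_principal, p_conditional) if p <= p_i and q <= p_j
    )
    if n_both == 0:
        raise ValueError("no SNP satisfies both thresholds")
    return min(1.0, p_i * n_cond / n_both)


# Hypothetical p values for five SNPs in two related diseases.
p_disease_a = [1e-6, 1e-3, 0.2, 0.5, 0.8]
p_disease_b = [1e-4, 0.01, 0.03, 0.6, 0.9]

cfdr = empirical_cfdr(p_disease_a, p_disease_b, p_i=1e-3, p_j=0.05)
print(f"estimated cFDR = {cfdr:.2e}")
```

Only the paired p values enter the calculation, which reflects the abstract's point that summary statistics alone are sufficient.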
PExFInS: An Integrative Post-GWAS Explorer for Functional Indels and SNPs
Cheng, Zhongshan; Chu, Hin; Fan, Yanhui; Li, Cun; Song, You-Qiang; Zhou, Jie; Yuen, Kwok-Yung
2015-01-01
Expression quantitative trait loci (eQTLs) mapping and linkage disequilibrium (LD) analysis have been widely employed to interpret findings of genome-wide association studies (GWAS). With the availability of deep sequencing data of 423 lymphoblastoid cell lines (LCLs) from six global populations and the microarray expression data, we performed eQTL analysis, identified more than 228 K SNP cis-eQTLs and 21 K indel cis-eQTLs and generated an LCL cis-eQTL database. We demonstrate that the percentages of population-shared and population-specific cis-eQTLs are comparable, while indel cis-eQTLs in the population-specific subsection make more contribution to gene expression variations than those in the population-shared subsection. We found that cis-eQTLs, especially the population-shared cis-eQTLs, are significantly enriched toward the transcription start site. Moreover, the National Human Genome Research Institute cataloged GWAS SNPs are enriched for LCL cis-eQTLs. Specifically, 32.8% of GWAS SNPs are LCL cis-eQTLs, among which 12.5% can be tagged by indel cis-eQTLs, suggesting the fundamental contribution of indel cis-eQTLs to GWAS association signals. To search for functional indels and SNPs tagging GWAS SNPs, a pipeline, Post-GWAS Explorer for Functional Indels and SNPs (PExFInS), has been developed, integrating LD analysis, functional annotation from public databases, cis-eQTL mapping with our LCL cis-eQTL database and other published cis-eQTL datasets. PMID:26612672
Genetic determinants of leucocyte telomere length in children: a neglected and challenging field.
Stathopoulou, Maria G; Petrelis, Alexandros M; Buxton, Jessica L; Froguel, Philippe; Blakemore, Alexandra I F; Visvikis-Siest, Sophie
2015-03-01
Telomere length is associated with a large range of human diseases. Genome-wide association studies (GWAS) have identified genetic variants that are associated with leucocyte telomere length (LTL). However, these studies are limited to adult populations. Nevertheless, childhood is a crucial period for the determination of LTL, and the assessment of age-specific genetic determinants, although neglected, could be of great importance. Our aim was to provide insights and preliminary results on genetic determinants of LTL in children. Healthy children (n = 322, age range = 6.75-17 years) with available GWAS data (Illumina Human CNV370-Duo array) were included. The LTL was measured using multiplex quantitative real-time polymerase chain reaction. Linear regression models adjusted for age, gender, parental age at child's birth, and body mass index were used to test the associations of LTL with polymorphisms identified in adult GWAS and to perform a discovery-only GWAS. The variants previously identified by GWAS in adults were not associated with LTL in our paediatric sample. This lack of association was not due to possible interactions with age or gene × gene interactions. Furthermore, a discovery-only GWAS approach demonstrated six novel variants that reached the level of suggestive association (P ≤ 5 × 10−5) and explain a high percentage of children's LTL. The study of genetic determinants of LTL in children may identify novel variants not previously identified in adults. Studies in large paediatric populations are needed for the confirmation of these results, possibly through a childhood consortium that could better handle the methodological challenges of the field of LTL genetic epidemiology. © 2015 John Wiley & Sons Ltd.
Sharma, Amitabh; Gulbahce, Natali; Pevzner, Samuel J.; Menche, Jörg; Ladenvall, Claes; Folkersen, Lasse; Eriksson, Per; Orho-Melander, Marju; Barabási, Albert-László
2013-01-01
Genome wide association studies (GWAS) identify susceptibility loci for complex traits, but do not identify particular genes of interest. Integration of functional and network information may help in overcoming this limitation and identifying new susceptibility loci. Using GWAS and comorbidity data, we present a network-based approach to predict candidate genes for lipid and lipoprotein traits. We apply a prediction pipeline incorporating interactome, co-expression, and comorbidity data to Global Lipids Genetics Consortium (GLGC) GWAS for four traits of interest, identifying phenotypically coherent modules. These modules provide insights regarding gene involvement in complex phenotypes with multiple susceptibility alleles and low effect sizes. To experimentally test our predictions, we selected four candidate genes and genotyped representative SNPs in the Malmö Diet and Cancer Cardiovascular Cohort. We found significant associations with LDL-C and total-cholesterol levels for a synonymous SNP (rs234706) in the cystathionine beta-synthase (CBS) gene (p = 1 × 10⁻⁵ and adjusted-p = 0.013, respectively). Further, liver samples taken from 206 patients revealed that patients with the minor allele of rs234706 had significant dysregulation of CBS (p = 0.04). Despite the known biological role of CBS in lipid metabolism, SNPs within the locus have not yet been identified in GWAS of lipoprotein traits. Thus, the GWAS-based Comorbidity Module (GCM) approach identifies candidate genes missed by GWAS studies, serving as a broadly applicable tool for the investigation of other complex disease phenotypes. PMID:23882023
Integration of mouse and human genome-wide association data identifies KCNIP4 as an asthma gene.
Himes, Blanca E; Sheppard, Keith; Berndt, Annerose; Leme, Adriana S; Myers, Rachel A; Gignoux, Christopher R; Levin, Albert M; Gauderman, W James; Yang, James J; Mathias, Rasika A; Romieu, Isabelle; Torgerson, Dara G; Roth, Lindsey A; Huntsman, Scott; Eng, Celeste; Klanderman, Barbara; Ziniti, John; Senter-Sylvia, Jody; Szefler, Stanley J; Lemanske, Robert F; Zeiger, Robert S; Strunk, Robert C; Martinez, Fernando D; Boushey, Homer; Chinchilli, Vernon M; Israel, Elliot; Mauger, David; Koppelman, Gerard H; Postma, Dirkje S; Nieuwenhuis, Maartje A E; Vonk, Judith M; Lima, John J; Irvin, Charles G; Peters, Stephen P; Kubo, Michiaki; Tamari, Mayumi; Nakamura, Yusuke; Litonjua, Augusto A; Tantisira, Kelan G; Raby, Benjamin A; Bleecker, Eugene R; Meyers, Deborah A; London, Stephanie J; Barnes, Kathleen C; Gilliland, Frank D; Williams, L Keoki; Burchard, Esteban G; Nicolae, Dan L; Ober, Carole; DeMeo, Dawn L; Silverman, Edwin K; Paigen, Beverly; Churchill, Gary; Shapiro, Steve D; Weiss, Scott T
2013-01-01
Asthma is a common chronic respiratory disease characterized by airway hyperresponsiveness (AHR). The genetics of asthma have been widely studied in mouse and human, and homologous genomic regions have been associated with mouse AHR and human asthma-related phenotypes. Our goal was to identify asthma-related genes by integrating AHR associations in mouse with human genome-wide association study (GWAS) data. We used Efficient Mixed Model Association (EMMA) analysis to conduct a GWAS of baseline AHR measures from males and females of 31 mouse strains. Genes near or containing SNPs with EMMA p-values <0.001 were selected for further study in human GWAS. The results of the previously reported EVE consortium asthma GWAS meta-analysis consisting of 12,958 diverse North American subjects from 9 study centers were used to select a subset of homologous genes with evidence of association with asthma in humans. Following validation attempts in three human asthma GWAS (i.e., Sepracor/LOCCS/LODO/Illumina, GABRIEL, DAG) and two human AHR GWAS (i.e., SHARP, DAG), the Kv channel interacting protein 4 (KCNIP4) gene was identified as nominally associated with both asthma and AHR at a gene- and SNP-level. In EVE, the smallest KCNIP4 association was at rs6833065 (P-value 2.9e-04), while the strongest associations for Sepracor/LOCCS/LODO/Illumina, GABRIEL, DAG were 1.5e-03, 1.0e-03, 3.1e-03 at rs7664617, rs4697177, rs4696975, respectively. At a SNP level, the strongest association across all asthma GWAS was at rs4697177 (P-value 1.1e-04). The smallest P-values for association with AHR were 2.3e-03 at rs11947661 in SHARP and 2.1e-03 at rs402802 in DAG. Functional studies are required to validate the potential involvement of KCNIP4 in modulating asthma susceptibility and/or AHR. Our results suggest that a useful approach to identify genes associated with human asthma is to leverage mouse AHR association data.
Efficiently Identifying Significant Associations in Genome-wide Association Studies
Eskin, Eleazar
2013-01-01
Over the past several years, genome-wide association studies (GWAS) have implicated hundreds of genes in common disease. More recently, the GWAS approach has been utilized to identify regions of the genome that harbor variation affecting gene expression or expression quantitative trait loci (eQTLs). Unlike GWAS applied to clinical traits, where only a handful of phenotypes are analyzed per study, in eQTL studies, tens of thousands of gene expression levels are measured, and the GWAS approach is applied to each gene expression level. This leads to computing billions of statistical tests and requires substantial computational resources, particularly when applying novel statistical methods such as mixed models. We introduce a novel two-stage testing procedure that identifies all of the significant associations more efficiently than testing all the single nucleotide polymorphisms (SNPs). In the first stage, a small number of informative SNPs, or proxies, across the genome are tested. Based on their observed associations, our approach locates the regions that may contain significant SNPs and only tests additional SNPs from those regions. We show through simulations and analysis of real GWAS datasets that the proposed two-stage procedure increases the computational speed by a factor of 10. Additionally, efficient implementation of our software increases the computational speed relative to the state-of-the-art testing approaches by a factor of 75. PMID:24033261
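The two-stage idea described above can be sketched in a few lines. This is an illustration only and not the authors' software: the function names, the fixed-size blocks, the choice of the middle SNP as each block's proxy, and the stage-one threshold are all assumptions made for the example, and a plain correlation test stands in for the mixed-model test. Unlike the published procedure, this naive version offers no guarantee that every significant SNP is recovered.

```python
import numpy as np
from scipy import stats

def assoc_pvalue(genotype, phenotype):
    """P-value of a simple linear association (Pearson correlation test)."""
    _, p_value = stats.pearsonr(genotype, phenotype)
    return p_value

def two_stage_scan(genotypes, phenotype, block_size=50,
                   proxy_alpha=0.05, final_alpha=5e-8):
    """Test one proxy SNP per block, then fully test only the flagged blocks.

    genotypes: (n_samples, n_snps) array of dosages.
    Returns (indices of significant SNPs, number of tests performed).
    """
    n_snps = genotypes.shape[1]
    hits, n_tests = [], 0
    for start in range(0, n_snps, block_size):
        stop = min(start + block_size, n_snps)
        proxy = (start + stop) // 2            # assumption: middle SNP acts as proxy
        n_tests += 1
        if assoc_pvalue(genotypes[:, proxy], phenotype) > proxy_alpha:
            continue                           # stage 1: block shows no signal, skip it
        for j in range(start, stop):           # stage 2: test every SNP in the block
            n_tests += 1
            if assoc_pvalue(genotypes[:, j], phenotype) < final_alpha:
                hits.append(j)
    return hits, n_tests
```

The saving comes from the skipped blocks: when most proxies show no signal, the number of tests is close to the number of blocks rather than the number of SNPs.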
Genome-wide association studies and epigenome-wide association studies go together in cancer control
Verma, Mukesh
2016-01-01
Completion of the human genome a decade ago laid the foundation for using genetic information to assess risk and identify individuals and populations that are likely to develop cancer, and for designing treatments based on a person's genetic profile (precision medicine). Genome-wide association studies (GWAS) completed during the past few years have identified risk-associated single nucleotide polymorphisms that can be used as screening tools in epidemiologic studies of a variety of tumor types. This led to the conduct of epigenome-wide association studies (EWAS). This article discusses the current status, challenges and research opportunities in GWAS and EWAS. Information gained from GWAS and EWAS has potential applications in cancer control and treatment. PMID:27079684
Hill, W David
2018-04-01
Intelligence and educational attainment are strongly genetically correlated. This relationship can be exploited by Multi-Trait Analysis of GWAS (MTAG) to add power to Genome-wide Association Studies (GWAS) of intelligence. MTAG allows the user to meta-analyze GWASs of different phenotypes, based on their genetic correlations, to identify associations specific to the trait of choice. An MTAG analysis using GWAS data sets on intelligence and education was conducted by Lam et al. (2017), who reported 70 loci that they described as 'trait specific' to intelligence. This article examines whether the analysis conducted by Lam et al. (2017) has resulted in genetic information about a phenotype that is more similar to education than intelligence.
Rapid scoring of genes in microbial pan-genome-wide association studies with Scoary.
Brynildsrud, Ola; Bohlin, Jon; Scheffer, Lonneke; Eldholm, Vegard
2016-11-25
Genome-wide association studies (GWAS) have become indispensable in human medicine and genomics, but very few have been carried out on bacteria. Here we introduce Scoary, an ultra-fast, easy-to-use, and widely applicable software tool that scores the components of the pan-genome for associations to observed phenotypic traits while accounting for population stratification, with minimal assumptions about evolutionary processes. We call our approach pan-GWAS to distinguish it from traditional, single nucleotide polymorphism (SNP)-based GWAS. Scoary is implemented in Python and is available under an open source GPLv3 license at https://github.com/AdmiralenOla/Scoary .
Inflammation in Alzheimer's Disease and Molecular Genetics: Recent Update.
Zhang, Zhi-Gang; Li, Yan; Ng, Cheung Toa; Song, You-Qiang
2015-10-01
Alzheimer's disease (AD) is a complex age-related neurodegenerative disorder of the central nervous system. Since the first description of AD in 1907, many hypotheses have been established to explain its causes. The inflammation theory is one of them. Pathological and biochemical studies of brains from AD individuals have provided solid evidence of the activation of inflammatory pathways. Furthermore, people on long-term anti-inflammatory medication have shown a reduced risk of developing the disease. After three decades of genetic study in AD, dozens of loci harboring genetic variants influencing inflammatory pathways in AD patients have been identified through genome-wide association studies (GWAS). The best-known GWAS risk factor involved in immune response and inflammation in AD development is probably the APOE ε4 allele. However, a growing number of other GWAS risk AD candidate genes in inflammation have recently been discovered. In the present study, we review inflammation in AD and immunity-associated GWAS risk genes such as HLA-DRB5/DRB1, INPP5D, MEF2C, CR1, CLU and TREM2.
An Adaptive Association Test for Multiple Phenotypes with GWAS Summary Statistics.
Kim, Junghi; Bai, Yun; Pan, Wei
2015-12-01
We study the problem of testing for single marker-multiple phenotype associations based on genome-wide association study (GWAS) summary statistics without access to individual-level genotype and phenotype data. For most published GWASs, because obtaining summary data is substantially easier than accessing individual-level phenotype and genotype data, while often multiple correlated traits have been collected, the problem studied here has become increasingly important. We propose a powerful adaptive test and compare its performance with some existing tests. We illustrate its applications to analyses of a meta-analyzed GWAS dataset with three blood lipid traits and another with sex-stratified anthropometric traits, and further demonstrate its potential power gain over some existing methods through realistic simulation studies. We start from the situation with only one set of (possibly meta-analyzed) genome-wide summary statistics, then extend the method to meta-analysis of multiple sets of genome-wide summary statistics, each from one GWAS. We expect the proposed test to be useful in practice as more powerful than or complementary to existing methods. © 2015 WILEY PERIODICALS, INC.
Chen, Guo-Bo; Lee, Sang Hong; Brion, Marie-Jo A; Montgomery, Grant W; Wray, Naomi R; Radford-Smith, Graham L; Visscher, Peter M
2014-09-01
As custom arrays are cheaper than generic GWAS arrays, larger sample size is achievable for gene discovery. Custom arrays can tag more variants through denser genotyping of SNPs at associated loci, but at the cost of losing genome-wide coverage. Balancing this trade-off is important for maximizing experimental designs. We quantified both the gain in captured SNP-heritability at known candidate regions and the loss due to imperfect genome-wide coverage for inflammatory bowel disease using immunochip (iChip) and imputed GWAS data on 61,251 and 38,550 samples, respectively. For Crohn's disease (CD), the iChip and GWAS data explained 19 and 26% of variation in liability, respectively, and SNPs in the densely genotyped iChip regions explained 13% of the SNP-heritability for both the iChip and GWAS data. For ulcerative colitis (UC), the iChip and GWAS data explained 15 and 19% of variation in liability, respectively, and the dense iChip regions explained 10 and 9% of the SNP-heritability in the iChip and the GWAS data. From bivariate analyses, estimates of the genetic correlation in risk between CD and UC were 0.75 (SE 0.017) and 0.62 (SE 0.042) for the iChip and GWAS data, respectively. We also quantified the SNP-heritability of genomic regions that did or did not contain the previous 163 GWAS hits for CD and UC, and SNP-heritability of the overlapping loci between the densely genotyped iChip regions and the 163 GWAS hits. For both diseases, over different genomic partitioning, the densely genotyped regions on the iChip tagged at least as much variation in liability as in the corresponding regions in the GWAS data; however, a certain amount of tagged SNP-heritability in the GWAS data was lost using the iChip due to the low coverage at unselected regions. These results imply that custom arrays with a GWAS backbone will facilitate more gene discovery, both at associated and novel loci. © The Author 2014. Published by Oxford University Press. All rights reserved.
Privacy-Preserving Data Exploration in Genome-Wide Association Studies.
Johnson, Aaron; Shmatikov, Vitaly
2013-08-01
Genome-wide association studies (GWAS) have become a popular method for analyzing sets of DNA sequences in order to discover the genetic basis of disease. Unfortunately, statistics published as the result of GWAS can be used to identify individuals participating in the study. To prevent privacy breaches, even previously published results have been removed from public databases, impeding researchers' access to the data and hindering collaborative research. Existing techniques for privacy-preserving GWAS focus on answering specific questions, such as correlations between a given pair of SNPs (DNA sequence variations). This does not fit the typical GWAS process, where the analyst may not know in advance which SNPs to consider and which statistical tests to use, how many SNPs are significant for a given dataset, etc. We present a set of practical, privacy-preserving data mining algorithms for GWAS datasets. Our framework supports exploratory data analysis, where the analyst does not know a priori how many and which SNPs to consider. We develop privacy-preserving algorithms for computing the number and location of SNPs that are significantly associated with the disease, the significance of any statistical test between a given SNP and the disease, any measure of correlation between SNPs, and the block structure of correlations. We evaluate our algorithms on real-world datasets and demonstrate that they produce significantly more accurate results than prior techniques while guaranteeing differential privacy.
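For readers unfamiliar with differential privacy, the generic Laplace mechanism gives a feel for how a single statistic can be released with noise. The sketch below is that textbook mechanism and not the algorithms of the paper above; `laplace_release` is a hypothetical name, and the sensitivity value is taken as given here even though deriving it for a real GWAS statistic (for example, the number of significant SNPs) is the hard part.

```python
import numpy as np

def laplace_release(true_value, sensitivity, epsilon, rng=None):
    """Release true_value plus Laplace noise with scale sensitivity / epsilon.

    sensitivity: assumed maximum change in the statistic when one
                 participant's data changes (must be derived separately).
    epsilon:     privacy budget; smaller values add more noise.
    """
    if epsilon <= 0 or sensitivity <= 0:
        raise ValueError("epsilon and sensitivity must be positive")
    rng = np.random.default_rng() if rng is None else rng
    noise = rng.laplace(loc=0.0, scale=sensitivity / epsilon)
    return float(true_value + noise)

# Made-up example: a count of 42 released with an assumed sensitivity of 1.
noisy_count = laplace_release(42, sensitivity=1.0, epsilon=0.5)
```

The trade-off is visible in the scale term: halving epsilon doubles the expected magnitude of the noise.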
Genome-wide association study of acute post-surgical pain in humans
Kim, Hyungsuk; Ramsay, Edward; Lee, Hyewon; Wahl, Sharon; Dionne, Raymond A
2009-01-01
Aims: Testing a relatively small genomic region with a few hundred SNPs provides limited information. Genome-wide association studies (GWAS) provide an opportunity to overcome the limitation of candidate gene association studies. Here, we report the results of a GWAS for the responses to an NSAID analgesic. Materials & methods: European Americans (60 females and 52 males) undergoing oral surgery were genotyped with Affymetrix 500K SNP assay. Additional SNP genotyping was performed from the gene in linkage disequilibrium with the candidate SNP revealed by the GWAS. Results: GWAS revealed a candidate SNP (rs2562456) associated with analgesic onset, which is in linkage disequilibrium with a gene encoding a zinc finger protein. Additional SNP genotyping of ZNF429 confirmed the association with analgesic onset in humans (p = 1.8 × 10⁻¹⁰, degrees of freedom = 103, F = 28.3). We also found candidate loci for the maximum post-operative pain rating (rs17122021, p = 6.9 × 10⁻⁷) and post-operative pain onset time (rs6693882, p = 2.1 × 10⁻⁶), however, correcting for multiple comparisons did not sustain these genetic associations. Conclusion: GWAS for acute clinical pain followed by additional SNP genotyping of a neighboring gene suggests that genetic variations in or near the loci encoding DNA binding proteins play a role in the individual variations in responses to analgesic drugs. PMID:19207018
Genome-wide association studies in pharmacogenetics research debate
Bailey, Kent R; Cheng, Cheng
2016-01-01
Will genome-wide association studies (GWAS) ‘work’ for pharmacogenetics research? This question was the topic of a staged debate, with pro and con sides, aimed to bring out the strengths and weaknesses of GWAS for pharmacogenetics studies. After a full day of seminars at the Fifth Statistical Analysis Workshop of the Pharmacogenetics Research Network, the lively debate was held – appropriately – at Goonies Comedy Club in Rochester (MN, USA). The pro side emphasized that the many GWAS successes for identifying genetic variants associated with disease risk show that it works; that the current genotyping platforms are efficient, with good imputation methods to fill in missing data; that its global assessment is always a success even if no significant associations are detected; and that genetic effects are likely to be large because humans have not evolved in a drug-therapy environment. By contrast, the con side emphasized that we have limited knowledge of the complexity of the genome; limited clinical phenotypes compromise studies; the likely multifactorial nature of drug response clouding the small genetic effects; and limitations of sample size and replication studies in pharmacogenetic studies. Lively and insightful discussions emphasized further research efforts that might benefit GWAS in pharmacogenetics. PMID:20235786
Espin-Garcia, Osvaldo; Craiu, Radu V; Bull, Shelley B
2018-02-01
We evaluate two-phase designs to follow up findings from genome-wide association studies (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation-maximization-based inference under a semiparametric maximum likelihood formulation tailored for post-GWAS inference. A GWAS-SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT-SNP-dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme-QT strata yields significant power improvements compared to marginal QT- or SNP-based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. © 2017 The Authors. Genetic Epidemiology Published by Wiley Periodicals, Inc.
The MR-Base platform supports systematic causal inference across the human phenome
Wade, Kaitlin H; Haberland, Valeriia; Baird, Denis; Laurin, Charles; Burgess, Stephen; Bowden, Jack; Langdon, Ryan; Tan, Vanessa Y; Yarmolinsky, James; Shihab, Hashem A; Timpson, Nicholas J; Evans, David M; Relton, Caroline; Martin, Richard M; Davey Smith, George
2018-01-01
Results from genome-wide association studies (GWAS) can be used to infer causal relationships between phenotypes, using a strategy known as 2-sample Mendelian randomization (2SMR) and bypassing the need for individual-level data. However, 2SMR methods are evolving rapidly and GWAS results are often insufficiently curated, undermining efficient implementation of the approach. We therefore developed MR-Base (http://www.mrbase.org): a platform that integrates a curated database of complete GWAS results (no restrictions according to statistical significance) with an application programming interface, web app and R packages that automate 2SMR. The software includes several sensitivity analyses for assessing the impact of horizontal pleiotropy and other violations of assumptions. The database currently comprises 11 billion single nucleotide polymorphism-trait associations from 1673 GWAS and is updated on a regular basis. Integrating data with software ensures more rigorous application of hypothesis-driven analyses and allows millions of potential causal relationships to be efficiently evaluated in phenome-wide association studies. PMID:29846171
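To make the 2SMR idea concrete, the following sketch computes the standard fixed-effect inverse-variance weighted (IVW) estimate from per-SNP summary statistics. It does not use or reproduce the MR-Base API or its R packages; the function name `ivw_estimate` and the numbers are invented for illustration, and the sketch assumes independent instruments and ignores uncertainty in the SNP-exposure estimates.

```python
import numpy as np

def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW causal estimate and its standard error.

    beta_exposure: SNP-exposure effect sizes (from one GWAS)
    beta_outcome:  SNP-outcome effect sizes (from a second GWAS)
    se_outcome:    standard errors of the SNP-outcome effects
    """
    bx = np.asarray(beta_exposure, dtype=float)
    by = np.asarray(beta_outcome, dtype=float)
    weights = 1.0 / np.asarray(se_outcome, dtype=float) ** 2
    denom = np.sum(weights * bx ** 2)
    estimate = np.sum(weights * bx * by) / denom
    std_error = np.sqrt(1.0 / denom)
    return float(estimate), float(std_error)

# Made-up summary statistics in which every SNP implies a causal effect of 0.5.
bx = [0.1, 0.2, 0.3]
by = [0.05, 0.10, 0.15]
se = [0.01, 0.02, 0.03]
effect, effect_se = ivw_estimate(bx, by, se)
```

Because only effect sizes and standard errors enter the calculation, no individual-level data are needed, which is the property the abstract highlights.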
Been, L F; Hatfield, J L; Shankar, A; Aston, C E; Ralhan, S; Wander, G S; Mehra, N K; Singh, J R; Mulvihill, J J; Sanghera, D K
2012-11-01
Two common variants (rs1387153, rs10830963) in MTNR1B have been reported to have independent effects on fasting blood glucose (FBG) levels with increased risk of type 2 diabetes (T2D) in recent genome-wide association studies (GWAS). In this investigation, we report the association of these two variants, and an additional variant (rs1374645) within the GWAS locus of MTNR1B, with FBG, 2h glucose, insulin resistance (HOMA IR), β-cell function (HOMA B), and T2D in our sample of Asian Sikhs from India. Our cohort comprised 2222 subjects [1201 T2D, 1021 controls]. None of these SNPs was associated with T2D in this cohort. Our data also could not confirm association of rs1387153 and rs10830963 with FBG phenotype. However, upon stratifying data according to body mass index (BMI) (low ≤ 25 kg/m² and high > 25 kg/m²) in normoglycemic subjects (n = 1021), rs1374645 revealed a strong association with low FBG levels in the low BMI group (β = -0.073, p = 0.002, Bonferroni p = 0.01) compared to the high BMI group (β = 0.015, p = 0.50). We also detected strong evidence of interaction between rs1374645 and BMI with respect to FBG levels (p = 0.002). Our data provide new information about the significant impact of another MTNR1B variant on FBG levels that appears to be modulated by BMI. Future confirmation on independent datasets and functional studies will be required to define the role of this variant in fasting glucose variation. Published by Elsevier B.V.
Protein Interaction Networks Reveal Novel Autism Risk Genes within GWAS Statistical Noise
Correia, Catarina; Oliveira, Guiomar; Vicente, Astrid M.
2014-01-01
Genome-wide association studies (GWAS) for Autism Spectrum Disorder (ASD) thus far met limited success in the identification of common risk variants, consistent with the notion that variants with small individual effects cannot be detected individually in single SNP analysis. To further capture disease risk gene information from ASD association studies, we applied a network-based strategy to the Autism Genome Project (AGP) and the Autism Genetics Resource Exchange GWAS datasets, combining family-based association data with Human Protein-Protein interaction (PPI) data. Our analysis showed that autism-associated proteins at higher than conventional levels of significance (P<0.1) directly interact more than random expectation and are involved in a limited number of interconnected biological processes, indicating that they are functionally related. The functionally coherent networks generated by this approach contain ASD-relevant disease biology, as demonstrated by an improved positive predictive value and sensitivity in retrieving known ASD candidate genes relative to the top associated genes from either GWAS, as well as a higher gene overlap between the two ASD datasets. Analysis of the intersection between the networks obtained from the two ASD GWAS and six unrelated disease datasets identified fourteen genes exclusively present in the ASD networks. These are mostly novel genes involved in abnormal nervous system phenotypes in animal models, and in fundamental biological processes previously implicated in ASD, such as axon guidance, cell adhesion or cytoskeleton organization. Overall, our results highlighted novel susceptibility genes previously hidden within GWAS statistical “noise” that warrant further analysis for causal variants. PMID:25409314
Genome-Wide Association Study for Indicator Traits of Sexual Precocity in Nellore Cattle
Irano, Natalia; de Camargo, Gregório Miguel Ferreira; Costa, Raphael Bermal; Terakado, Ana Paula Nascimento; Magalhães, Ana Fabrícia Braga; Silva, Rafael Medeiros de Oliveira; Dias, Marina Mortati; Bignardi, Annaiza Braga; Baldi, Fernando; Carvalheiro, Roberto; de Oliveira, Henrique Nunes; de Albuquerque, Lucia Galvão
2016-01-01
The objective of this study was to perform a genome-wide association study (GWAS) to detect chromosome regions associated with indicator traits of sexual precocity in Nellore cattle. Data from Nellore animals belonging to farms that participate in the DeltaGen® and Paint® animal breeding programs were used. The traits used in this study were the occurrence of early pregnancy (EP) and scrotal circumference (SC). Data from 72,675 females and 83,911 males with phenotypes were used; of these, 1,770 females and 1,680 males were genotyped. The SNP effects were estimated with a single-step procedure (WssGBLUP) and the observed phenotypes were used as dependent variables. All animals with available genotypes and phenotypes, in addition to those with only phenotypic information, were used. A single-trait animal model was applied to predict breeding values and the solutions of SNP effects were obtained from these breeding values. The results of GWAS are reported as the proportion of variance explained by windows with 150 adjacent SNPs. The 10 windows that explained the highest proportion of variance were identified. The results of this study indicate the polygenic nature of EP and SC, demonstrating that the indicator traits of sexual precocity studied here are probably controlled by many genes, including some of moderate effect. The 10 windows with large effects obtained for EP are located on chromosomes 5, 6, 7, 14, 18, 21 and 27, and together explained 7.91% of the total genetic variance. For SC, these windows are located on chromosomes 4, 8, 11, 13, 14, 19, 22 and 23, explaining 6.78% of total variance. The GWAS made it possible to identify chromosome regions associated with EP and SC. The identification of these regions contributes to a better understanding and evaluation of these traits, and allows candidate genes to be indicated for future investigation of causal mutations. PMID:27494397
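The window-based reporting described above can be sketched as follows. This is a simplified illustration and not the WssGBLUP software: the function name `window_variance_share`, the use of non-overlapping windows, and the toy data are assumptions. Each window's share is the variance of the breeding-value contribution from its SNPs divided by the variance of the total genomic value; the shares need not sum to 1 when windows are correlated.

```python
import numpy as np

def window_variance_share(genotypes, snp_effects, window=150):
    """Proportion of genetic variance explained by each window of adjacent SNPs.

    genotypes:   (n_animals, n_snps) matrix of allele dosages
    snp_effects: estimated effect of each SNP
    window:      number of adjacent SNPs per window (non-overlapping here,
                 which is an assumption of this sketch)
    """
    Z = np.asarray(genotypes, dtype=float)
    u = np.asarray(snp_effects, dtype=float)
    total_var = np.var(Z @ u)                      # variance of total genomic values
    shares = []
    for start in range(0, Z.shape[1], window):
        stop = start + window
        window_value = Z[:, start:stop] @ u[start:stop]
        shares.append(np.var(window_value) / total_var)
    return np.array(shares)

# Toy example: four animals, two uncorrelated SNPs, one SNP per window.
Z_toy = [[0, 0], [1, 0], [0, 1], [1, 1]]
u_toy = [1.0, 1.0]
shares_toy = window_variance_share(Z_toy, u_toy, window=1)
```

Ranking the resulting shares and keeping the largest ten would mirror the way the abstract reports its top windows.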
Bogenpohl, James W; Mignogna, Kristin M; Smith, Maren L; Miles, Michael F
2017-01-01
Complex behavioral traits, such as alcohol abuse, are caused by an interplay of genetic and environmental factors, producing deleterious functional adaptations in the central nervous system. The long-term behavioral consequences of such changes are of substantial cost to both the individual and society. Substantial progress has been made in the last two decades in understanding elements of brain mechanisms underlying responses to ethanol in animal models and risk factors for alcohol use disorder (AUD) in humans. However, treatments for AUD remain largely ineffective and few medications for this disease state have been licensed. Genome-wide genetic polymorphism analysis (GWAS) in humans, behavioral genetic studies in animal models and brain gene expression studies produced by microarrays or RNA-seq have the potential to produce nonbiased and novel insight into the underlying neurobiology of AUD. However, the complexity of such information, both statistical and informational, has slowed progress toward identifying new targets for intervention in AUD. This chapter describes one approach for integrating behavioral, genetic, and genomic information across animal model and human studies. The goal of this approach is to identify networks of genes functioning in the brain that are most relevant to the underlying mechanisms of a complex disease such as AUD. We illustrate an example of how genomic studies in animal models can be used to produce robust gene networks that have functional implications, and to integrate such animal model genomic data with human genetic studies such as GWAS for AUD. We describe several useful analysis tools for such studies: ComBAT, WGCNA, and EW_dmGWAS. The end result of this analysis is a ranking of gene networks and identification of their cognate hub genes, which might provide eventual targets for future therapeutic development. 
Furthermore, this combined approach may also improve our understanding of basic mechanisms underlying gene x environmental interactions affecting brain functioning in health and disease.
Bogenpohl, James W.; Mignogna, Kristin M.; Smith, Maren L.; Miles, Michael F.
2016-01-01
Complex behavioral traits, such as alcohol abuse, are caused by an interplay of genetic and environmental factors, producing deleterious functional adaptations in the central nervous system. The long-term behavioral consequences of such changes are of substantial cost to both the individual and society. Substantial progress has been made in the last two decades in understanding elements of brain mechanisms underlying responses to ethanol in animal models and risk factors for alcohol use disorder (AUD) in humans. However, treatments for AUD remain largely ineffective and few medications for this disease state have been licensed. Genome-wide genetic polymorphism analysis (GWAS) in humans, behavioral genetic studies in animal models and brain gene expression studies produced by microarrays or RNA-seq have the potential to produce non-biased and novel insight into the underlying neurobiology of AUD. However, the complexity of such information, both statistical and informational, has slowed progress toward identifying new targets for intervention in AUD. This chapter describes one approach for integrating behavioral, genetic, and genomic information across animal model and human studies. The goal of this approach is to identify networks of genes functioning in the brain that are most relevant to the underlying mechanisms of a complex disease such as AUD. We illustrate an example of how genomic studies in animal models can be used to produce robust gene networks that have functional implications, and to integrate such animal model genomic data with human genetic studies such as GWAS for AUD. We describe several useful analysis tools for such studies: ComBAT, WGCNA and EW_dmGWAS. The end result of this analysis is a ranking of gene networks and identification of their cognate hub genes, which might provide eventual targets for future therapeutic development. 
Furthermore, this combined approach may also improve our understanding of basic mechanisms underlying gene x environmental interactions affecting brain functioning in health and disease. PMID:27933543
Dolan, Siobhan M; Christiaens, Inge
2013-01-01
Preterm birth has the highest mortality and morbidity of all pregnancy complications. The burden of preterm birth on public health worldwide is enormous, yet there are few effective means to prevent a preterm delivery. To date, much of its etiology is unexplained, but genetic predisposition is thought to play a major role. In the upcoming year, the international Preterm Birth Genome Project (PGP) consortium plans to publish a large genome wide association study in early preterm birth. Genome-wide association studies (GWAS) are designed to identify common genetic variants that influence health and disease. Despite the many challenges that are involved, GWAS can be an important discovery tool, revealing genetic variations that are associated with preterm birth. It is highly unlikely that findings of a GWAS can be directly translated into clinical practice in the short run. Nonetheless, it will help us to better understand the etiology of preterm birth and the GWAS results will generate new hypotheses for further research, thus enhancing our understanding of preterm birth and informing prevention efforts in the long run. PMID:23445776
Dobbyn, Amanda; Huckins, Laura M; Boocock, James; Sloofman, Laura G; Glicksberg, Benjamin S; Giambartolomei, Claudia; Hoffman, Gabriel E; Perumal, Thanneer M; Girdhar, Kiran; Jiang, Yan; Raj, Towfique; Ruderfer, Douglas M; Kramer, Robin S; Pinto, Dalila; Akbarian, Schahram; Roussos, Panos; Domenici, Enrico; Devlin, Bernie; Sklar, Pamela; Stahl, Eli A; Sieberts, Solveig K
2018-06-07
Causal genes and variants within genome-wide association study (GWAS) loci can be identified by integrating GWAS statistics with expression quantitative trait loci (eQTL) and determining which variants underlie both GWAS and eQTL signals. Most analyses, however, consider only the marginal eQTL signal, rather than dissect this signal into multiple conditionally independent signals for each gene. Here we show that analyzing conditional eQTL signatures, which could be important under specific cellular or temporal contexts, leads to improved fine mapping of GWAS associations. Using genotypes and gene expression levels from post-mortem human brain samples (n = 467) reported by the CommonMind Consortium (CMC), we find that conditional eQTL are widespread; 63% of genes with primary eQTL also have conditional eQTL. In addition, genomic features associated with conditional eQTL are consistent with context-specific (e.g., tissue-, cell type-, or developmental time point-specific) regulation of gene expression. Integrating the 2014 Psychiatric Genomics Consortium schizophrenia (SCZ) GWAS and CMC primary and conditional eQTL data reveals 40 loci with strong evidence for co-localization (posterior probability > 0.8), including six loci with co-localization of conditional eQTL. Our co-localization analyses support previously reported genes, identify novel genes associated with schizophrenia risk, and provide specific hypotheses for their functional follow-up. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Gene- and pathway-based association tests for multiple traits with GWAS summary statistics.
Kwak, Il-Youp; Pan, Wei
2017-01-01
To identify novel genetic variants associated with complex traits and to shed new insights on underlying biology, in addition to the most popular single SNP-single trait association analysis, it would be useful to explore multiple correlated (intermediate) traits at the gene- or pathway-level by mining existing single GWAS or meta-analyzed GWAS data. For this purpose, we present an adaptive gene-based test and a pathway-based test for association analysis of multiple traits with GWAS summary statistics. The proposed tests are adaptive at both the SNP- and trait-levels; that is, they account for possibly varying association patterns (e.g. signal sparsity levels) across SNPs and traits, thus maintaining high power across a wide range of situations. Furthermore, the proposed methods are general: they can be applied to mixed types of traits, and to Z-statistics or P-values as summary statistics obtained from either a single GWAS or a meta-analysis of multiple GWAS. Our numerical studies with simulated and real data demonstrated the promising performance of the proposed methods. The methods are implemented in R package aSPU, freely and publicly available at: https://cran.r-project.org/web/packages/aSPU/ Contact: weip@biostat.umn.edu Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Nonsyndromic cleft palate: An association study at GWAS candidate loci in a multiethnic sample.
Ishorst, Nina; Francheschelli, Paola; Böhmer, Anne C; Khan, Mohammad Faisal J; Heilmann-Heimbach, Stefanie; Fricker, Nadine; Little, Julian; Steegers-Theunissen, Regine P M; Peterlin, Borut; Nowak, Stefanie; Martini, Markus; Kruse, Teresa; Dunsche, Anton; Kreusch, Thomas; Gölz, Lina; Aldhorae, Khalid; Halboub, Esam; Reutter, Heiko; Mossey, Peter; Nöthen, Markus M; Rubini, Michele; Ludwig, Kerstin U; Knapp, Michael; Mangold, Elisabeth
2018-06-01
Nonsyndromic cleft palate only (nsCPO) is a common and multifactorial form of orofacial clefting. In contrast to successes achieved for the other common form of orofacial clefting, that is, nonsyndromic cleft lip with/without cleft palate (nsCL/P), genome wide association studies (GWAS) of nsCPO have identified only one genome wide significant locus. The aim of the present study was to investigate whether common variants contribute to nsCPO and, if so, to identify novel risk loci. We genotyped 33 SNPs at 27 candidate loci from 2 previously published nsCPO GWAS in an independent multiethnic sample. It included: (i) a family-based sample of European ancestry (n = 212); and (ii) two case/control samples of Central European (n = 94/339) and Arabian ancestry (n = 38/231), respectively. A separate association analysis was performed for each genotyped dataset, and meta-analyses were performed. After association analysis and meta-analyses, none of the 33 SNPs showed genome-wide significance. Two variants showed nominally significant association in the imputed GWAS dataset and exhibited a further decrease in p-value in a European and an overall meta-analysis including imputed GWAS data, respectively (rs395572: P_MetaEU = 3.16 × 10(-4); rs6809420: P_MetaAll = 2.80 × 10(-4)). Our findings suggest that there is a limited contribution of common variants to nsCPO. However, the individual effect sizes might be too small for detection of further associations in the present sample sizes. Rare variants may play a more substantial role in nsCPO than in nsCL/P, for which GWAS of smaller sample sizes have identified genome-wide significant loci. Whole-exome/genome sequencing studies of nsCPO are now warranted. © 2018 Wiley Periodicals, Inc.
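The abstract above combines per-dataset association results by meta-analysis but does not say which model was used. As a rough illustration only, the following Python sketch shows a fixed-effect, inverse-variance-weighted meta-analysis, which is one common choice and is assumed here rather than taken from the paper. The function name and the effect sizes are hypothetical.

```python
import math
from statistics import NormalDist


def fixed_effect_meta(betas, standard_errors):
    """Inverse-variance-weighted fixed-effect meta-analysis.

    betas: per-study effect estimates (for example log odds ratios).
    standard_errors: matching per-study standard errors.
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if len(betas) != len(standard_errors) or not betas:
        raise ValueError("need one standard error per effect estimate")
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided p-value from the standard normal distribution.
    p = 2.0 * NormalDist().cdf(-abs(z))
    return pooled_beta, pooled_se, z, p


# Hypothetical log odds ratios for one SNP in three datasets (not study data).
beta, se, z, p = fixed_effect_meta([0.10, 0.15, 0.05], [0.05, 0.08, 0.04])
print(f"pooled beta={beta:.3f} se={se:.3f} z={z:.2f} p={p:.3g}")
```

The pooled standard error is never larger than the smallest input standard error, which is why a p-value can fall further in a meta-analysis even when no single dataset is significant on its own.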
The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants.
Fadista, João; Manning, Alisa K; Florez, Jose C; Groop, Leif
2016-08-01
Genome-wide association studies (GWAS) have long relied on proposed statistical significance thresholds to be able to differentiate true positives from false positives. Although the genome-wide significance P-value threshold of 5 × 10(-8) has become a standard for common-variant GWAS, it has not been updated to cope with the lower allele frequency spectrum used in many recent array-based GWAS studies and sequencing studies. Using a whole-genome- and -exome-sequencing data set of 2875 individuals of European ancestry from the Genetics of Type 2 Diabetes (GoT2D) project and a whole-exome-sequencing data set of 13 000 individuals from five ancestries from the GoT2D and T2D-GENES (Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples) projects, we describe guidelines for genome- and exome-wide association P-value thresholds needed to correct for multiple testing, explaining the impact of linkage disequilibrium thresholds for distinguishing independent variants, minor allele frequency and ancestry characteristics. We emphasize the advantage of studying recent genetic isolate populations when performing rare and low-frequency genetic association analyses, as the multiple testing burden is diminished due to higher genetic homogeneity.
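The 5 × 10(-8) threshold discussed above is conventionally explained as a Bonferroni correction of a 0.05 family-wise error rate over roughly one million independent common-variant tests. The short Python sketch below reproduces that arithmetic. The second call uses a hypothetical count of five million independent tests purely to show the direction of the effect when low-frequency variants are added; it is not a figure from the paper.

```python
def bonferroni_threshold(alpha: float, n_independent_tests: int) -> float:
    """Per-test P-value threshold that keeps the family-wise error rate at alpha."""
    if n_independent_tests <= 0:
        raise ValueError("n_independent_tests must be positive")
    return alpha / n_independent_tests


# Conventional assumption: about one million independent common variants.
common_variant_threshold = bonferroni_threshold(0.05, 1_000_000)

# Hypothetical larger testing burden once low-frequency variants are included.
low_frequency_threshold = bonferroni_threshold(0.05, 5_000_000)

print(f"{common_variant_threshold:.1e}")
print(f"{low_frequency_threshold:.1e}")
```

A more homogeneous population has fewer effectively independent variants, so the same calculation yields a less stringent threshold, which matches the abstract's point about genetic isolates.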
Overview of the Genetics of Alcohol Use Disorder
Tawa, Elisabeth A.; Hall, Samuel D.; Lohoff, Falk W.
2016-01-01
Aims: Alcohol Use Disorder (AUD) is a chronic psychiatric illness characterized by harmful drinking patterns leading to negative emotional, physical, and social ramifications. While the underlying pathophysiology of AUD is poorly understood, there is substantial evidence for a genetic component; however, identification of universal genetic risk variants for AUD has been difficult. Recent efforts in the search for AUD susceptibility genes will be reviewed in this article. Methods: In this review, we provide an overview of genetic studies on AUD, including twin studies, linkage studies, candidate gene studies, and genome-wide association studies (GWAS). Results: Several potential genetic susceptibility factors for AUD have been identified, but the genes of alcohol metabolism, alcohol dehydrogenase (ADH) and aldehyde dehydrogenase (ALDH), have been found to be protective against the development of AUD. GWAS have also identified a heterogeneous list of SNPs associated with AUD and alcohol-related phenotypes, emphasizing the complexity and heterogeneity of the disorder. In addition, many of these findings have small effect sizes when compared to alcohol metabolism genes, and biological relevance is often unknown. Conclusions: Although studies spanning multiple approaches have suggested a genetic basis for AUD, identification of the genetic risk variants has been challenging. Some promising results are emerging from GWAS studies; however, larger sample sizes are needed to improve GWAS results and resolution. As the field of genetics is rapidly developing, whole genome sequencing could soon become the new standard of interrogation of the genes and neurobiological pathways which contribute to the complex phenotype of AUD. Short summary: This review examines the genetic underpinnings of Alcohol Use Disorder (AUD), with an emphasis on GWAS approaches for identifying genetic risk variants.
The most promising results associated with AUD and alcohol-related phenotypes have included SNPs of the alcohol metabolism genes ADH and ALDH. PMID:27445363
Bioinformatics challenges for genome-wide association studies.
Moore, Jason H; Asselbergs, Folkert W; Williams, Scott M
2010-02-15
The sequencing of the human genome has made it possible to identify an informative set of >1 million single nucleotide polymorphisms (SNPs) across the genome that can be used to carry out genome-wide association studies (GWASs). The availability of massive amounts of GWAS data has necessitated the development of new biostatistical methods for quality control, imputation and analysis issues including multiple testing. This work has been successful and has enabled the discovery of new associations that have been replicated in multiple studies. However, it is now recognized that most SNPs discovered via GWAS have small effects on disease susceptibility and thus may not be suitable for improving health care through genetic testing. One likely explanation for the mixed results of GWAS is that the current biostatistical analysis paradigm is by design agnostic or unbiased in that it ignores all prior knowledge about disease pathobiology. Further, the linear modeling framework that is employed in GWAS often considers only one SNP at a time thus ignoring their genomic and environmental context. There is now a shift away from the biostatistical approach toward a more holistic approach that recognizes the complexity of the genotype-phenotype relationship that is characterized by significant heterogeneity and gene-gene and gene-environment interaction. We argue here that bioinformatics has an important role to play in addressing the complexity of the underlying genetic basis of common human diseases. The goal of this review is to identify and discuss those GWAS challenges that will require computational methods.
Gelernter, Joel; Sherva, Richard; Koesterer, Ryan; Almasy, Laura; Zhao, Hongyu; Kranzler, Henry R.; Farrer, Lindsay
2013-01-01
We report a GWAS for cocaine dependence (CD) in three sets of African- and European-American subjects (AAs and EAs, respectively), to identify pathways, genes, and alleles important in CD risk. The discovery GWAS dataset (n=5,697 subjects) was genotyped using the Illumina OmniQuad microarray (890,000 analyzed SNPs). Additional genotypes were imputed based on the 1000 Genomes reference panel. Top-ranked findings were evaluated by incorporating information from publicly available GWAS data from 4,063 subjects. Then, the most significant GWAS SNPs were genotyped in 2,549 independent subjects. We observed one genomewide-significant (GWS) result: rs7086629 at the FAM53B (“family with sequence similarity 53, member B”) locus. This was supported in both AAs and EAs; p-value (meta-analysis of all samples) = 4.28 × 10(-8). The gene maps to the same chromosomal region as the maximum peak we observed in a previous linkage study. NCOR2 (“nuclear receptor corepressor 2”) SNP rs150954431 was associated with p = 1.19 × 10(-9) in the EA discovery sample. SNP rs2456778, which maps to CDK1 (“cyclin-dependent kinase 1”), was associated with cocaine-induced paranoia in AAs in the discovery sample only (p = 4.68 × 10(-8)). This is the first study to identify risk variants for CD using GWAS. Our results implicate novel risk loci and provide insights into potential therapeutic and prevention strategies. PMID:23958962
Reprogramming neurodegeneration in the big data era.
Zhou, Lujia; Verstreken, Patrik
2018-02-01
Recent genome-wide association studies (GWAS) have identified numerous genetic risk variants for late-onset Alzheimer's disease (AD) and Parkinson's disease (PD). However, deciphering the functional consequences of GWAS data is challenging due to a lack of reliable model systems to study the genetic variants that are often of low penetrance and non-coding identities. Pluripotent stem cell (PSC) technologies offer unprecedented opportunities for molecular phenotyping of GWAS variants in human neurons and microglia. Moreover, rapid technological advances in whole-genome RNA-sequencing and epigenome mapping fuel comprehensive and unbiased investigations of molecular alterations in PSC-derived disease models. Here, we review and discuss how integrated studies that utilize PSC technologies and genome-wide approaches may bring new mechanistic insight into the pathogenesis of AD and PD. Copyright © 2018 Elsevier Ltd. All rights reserved.
Nieuwenhuis, Maartje A.; Siedlinski, Matteusz; van den Berge, Maarten; Granell, Raquel; Li, Xingnan; Niens, Marijke; van der Vlies, Pieter; Altmüller, Janine; Nürnberg, Peter; Kerkhof, Marjan; van Schayck, Onno C.; Riemersma, Ronald A.; van der Molen, Thys; de Monchy, Jan G.; Bossé, Yohan; Sandford, Andrew; Bruijnzeel-Koomen, Carla A.; van Wijk, Roy G.; ten Hacken, Nick H.; Timens, Wim; Boezen, H. Marike; Henderson, John; Kabesch, Michael; Vonk, Judith M.; Postma, Dirkje S.; Koppelman, Gerard H.
2016-01-01
Background: Genome wide association studies (GWAS) of asthma have identified single nucleotide polymorphisms (SNPs) that modestly increase the risk for asthma. This could be due to phenotypic heterogeneity of asthma. Bronchial hyperresponsiveness (BHR) is a phenotypic hallmark of asthma. We aim to identify susceptibility genes for asthma combined with BHR and analyse the presence of cis-eQTLs among replicated SNPs. Secondly, we compare the genetic association of SNPs previously associated with (doctor diagnosed) asthma to our GWAS of asthma with BHR. Methods: A GWAS was performed in 920 asthmatics with BHR and 980 controls. Top SNPs of our GWAS were analysed in four replication cohorts and lung cis-eQTL analysis was performed on replicated SNPs. We investigated association of SNPs previously associated with asthma in our data. Results: 368 SNPs were followed up for replication. Six SNPs in genes encoding ABI3BP, NAF1, MICA and the 17q21 locus replicated in one or more cohorts, with one locus (17q21) achieving genome wide significance after meta-analysis. Five out of 6 replicated SNPs regulated 35 gene transcripts in whole lung. Eight of 20 asthma associated SNPs from previous GWAS were significantly associated with asthma and BHR. Three SNPs, in IL-33 and GSDMB, showed larger effect sizes in our data compared to published literature. Conclusions: Combining GWAS with subsequent lung eQTL analysis revealed disease associated SNPs regulating lung mRNA expression levels of potential new asthma genes. Adding BHR to the asthma definition does not lead to an overall larger genetic effect size than analysing (doctor’s diagnosed) asthma. PMID:27439200
Mägi, Reedik; Suleimanov, Yury V; Clarke, Geraldine M; Kaakinen, Marika; Fischer, Krista; Prokopenko, Inga; Morris, Andrew P
2017-01-11
Genome-wide association studies (GWAS) of single nucleotide polymorphisms (SNPs) have been successful in identifying loci contributing genetic effects to a wide range of complex human diseases and quantitative traits. The traditional approach to GWAS analysis is to consider each phenotype separately, despite the fact that many diseases and quantitative traits are correlated with each other, and often measured in the same sample of individuals. Multivariate analyses of correlated phenotypes have been demonstrated, by simulation, to increase power to detect association with SNPs, and thus may enable improved detection of novel loci contributing to diseases and quantitative traits. We have developed the SCOPA software to enable GWAS analysis of multiple correlated phenotypes. The software implements "reverse regression" methodology, which treats the genotype of an individual at a SNP as the outcome and the phenotypes as predictors in a general linear model. SCOPA can be applied to quantitative traits and categorical phenotypes, and can accommodate imputed genotypes under a dosage model. The accompanying META-SCOPA software enables meta-analysis of association summary statistics from SCOPA across GWAS. Application of SCOPA to two GWAS of high- and low-density lipoprotein cholesterol, triglycerides and body mass index, and subsequent meta-analysis with META-SCOPA, highlighted stronger association signals than univariate phenotype analysis at established lipid and obesity loci. The META-SCOPA meta-analysis also revealed a novel signal of association at genome-wide significance for triglycerides mapping to GPC5 (lead SNP rs71427535, p = 1.1 × 10(-8)), which has not been reported in previous large-scale GWAS of lipid traits. The SCOPA and META-SCOPA software enable discovery and dissection of multiple phenotype association signals through implementation of a powerful reverse regression approach.
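To make the "reverse regression" idea above concrete, here is a minimal Python sketch that regresses genotype dosage at one SNP on several phenotypes by ordinary least squares and summarises the joint fit with an F-statistic. It illustrates the general idea only and is not the SCOPA implementation. The function names, the choice of an F-statistic as the joint test, and the toy data are all assumptions made for this example.

```python
def solve_linear_system(matrix, rhs):
    """Solve matrix * x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def reverse_regression(dosages, phenotypes):
    """Regress genotype dosage (outcome) on phenotypes (predictors).

    dosages: one dosage per individual (0 to 2, possibly imputed).
    phenotypes: one tuple of phenotype values per individual.
    Returns (coefficients including intercept, joint F-statistic).
    """
    n = len(dosages)
    design = [[1.0] + [float(v) for v in row] for row in phenotypes]
    k = len(design[0])
    # Normal equations: (X'X) beta = X'y.
    xtx = [[sum(design[i][p] * design[i][q] for i in range(n)) for q in range(k)]
           for p in range(k)]
    xty = [sum(design[i][p] * dosages[i] for i in range(n)) for p in range(k)]
    coefs = solve_linear_system(xtx, xty)
    fitted = [sum(c * x for c, x in zip(coefs, row)) for row in design]
    mean_dosage = sum(dosages) / n
    rss = sum((y - f) ** 2 for y, f in zip(dosages, fitted))
    tss = sum((y - mean_dosage) ** 2 for y in dosages)
    # F-statistic for all phenotypes jointly against the intercept-only model.
    f_stat = ((tss - rss) / (k - 1)) / (rss / (n - k))
    return coefs, f_stat


# Toy data (hypothetical): eight individuals, two correlated phenotypes.
toy_dosages = [0.0, 0.1, 1.0, 1.2, 2.0, 1.9, 0.9, 0.2]
toy_phenotypes = [(1.0, 0.5), (1.2, 0.1), (2.1, 0.7), (2.0, 0.2),
                  (3.2, 0.9), (2.9, 0.3), (1.8, 0.6), (0.9, 0.4)]
toy_coefs, toy_f = reverse_regression(toy_dosages, toy_phenotypes)
print(toy_coefs, toy_f)
```

Because the genotype is the outcome, adding another correlated phenotype only adds a column to the design matrix, so one model per SNP covers all traits at once.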
[Enlightenment from genome-wide association study to genetics of psoriasis].
ZHANG, Xue-jun
2009-07-01
Psoriasis is a common autoimmune and hyperproliferative skin disease, characterized by thick, silvery, scaly patches. Numerous family studies have provided compelling evidence of a genetic predisposition to psoriasis, although the inheritance pattern is unclear. However, few of these studies have achieved consistent results, except for the MHC locus, a problem frequently encountered in the investigation of complex disease. By using high-throughput techniques to genotype hundreds of thousands of single nucleotide polymorphisms and explore their relationship with phenotypes, genome-wide association studies (GWAS) have proven to be a powerful approach for screening the susceptibility genes (loci) of complex disease. Recently, three GWAS on psoriasis published in Nature Genetics have provided us with many novel clues concerning disease pathogenesis, in both immune and non-immune pathways. The MHC locus (HLA-Cw6 and other MHC variants), the major locus involved in the immune reactions of human immune disease, has consistently been shown to be associated with psoriasis, in both previous linkage studies and present GWAS. IL-12B and IL23R, the two non-MHC genes most strongly associated with psoriasis in the multiple studies performed so far, belong to the IL-12/23 cytokine pathway, which has complex biological activities, and should be of great importance in the pathogenesis of psoriasis. Recent clinical trials, in which anti-IL-12p40 antibodies were used for the treatment of psoriasis, have provided further evidence of the role of IL-12/23 in the pathophysiology of psoriasis and highlighted a new route of treatment for psoriasis. In 2008, we performed the first large GWAS in the Chinese population and identified a novel susceptibility locus within the late cornified envelope (LCE) gene cluster: LCE3A and LCE3D on chromosome 1q21, with conclusive evidence (rs4085613, p(combined) = 6.69 × 10(-30); odds ratio = 0.76).
Meanwhile, another group also identified a deletion comprising LCE3B and LCE3C in the LCE gene cluster, which is significantly associated with a risk of psoriasis in Spain, the Netherlands, Italy and the USA. Both of these independent studies provided substantial evidence that the LCE genes are involved in the pathogenesis of psoriasis. The LCE genes encode the stratum-corneum proteins of the cornified envelope, which plays an important role in epidermal terminal differentiation. Psoriasis is a disease of the interfollicular epidermis, and rapid keratinocyte proliferation may cause the production of parakeratotic keratinocytes in psoriatic skin and, thus, the formation of poorly adherent stratum corneum, which in turn results in the characteristic scale or flakes of psoriasis lesions. Although some of the highlighted genes are already targeted by effective psoriasis therapies, others could become future targets for treatments, especially the LCE genes, which could be very useful for unlocking new drug targets and tailored treatments for this painful, disfiguring skin disease. Meanwhile, larger samples and improved strategies for identifying other susceptibility variants for psoriasis, together with downstream functional studies to elucidate the underlying disease mechanisms, are also needed. Taken together, unremitting efforts in basic research on psoriasis will lead us to achieve better treatment and diagnosis of psoriasis in the near future.
Analysis and visualization of Arabidopsis thaliana GWAS using web 2.0 technologies.
Huang, Yu S; Horton, Matthew; Vilhjálmsson, Bjarni J; Seren, Umit; Meng, Dazhe; Meyer, Christopher; Ali Amer, Muhammad; Borevitz, Justin O; Bergelson, Joy; Nordborg, Magnus
2011-01-01
With large-scale genomic data becoming the norm in biological studies, the storing, integrating, viewing and searching of such data have become a major challenge. In this article, we describe the development of an Arabidopsis thaliana database that hosts the geographic information and genetic polymorphism data for over 6000 accessions and genome-wide association study (GWAS) results for 107 phenotypes representing the largest collection of Arabidopsis polymorphism data and GWAS results to date. Taking advantage of a series of the latest web 2.0 technologies, such as Ajax (Asynchronous JavaScript and XML), GWT (Google-Web-Toolkit), MVC (Model-View-Controller) web framework and Object Relationship Mapper, we have created a web-based application (web app) for the database, that offers an integrated and dynamic view of geographic information, genetic polymorphism and GWAS results. Essential search functionalities are incorporated into the web app to aid reverse genetics research. The database and its web app have proven to be a valuable resource to the Arabidopsis community. The whole framework serves as an example of how biological data, especially GWAS, can be presented and accessed through the web. In the end, we illustrate the potential to gain new insights through the web app by two examples, showcasing how it can be used to facilitate forward and reverse genetics research. Database URL: http://arabidopsis.usc.edu/
Biernacka, Joanna M.; Geske, Jennifer R.; Schneekloth, Terry D.; Frye, Mark A.; Cunningham, Julie M.; Choi, Doo-Sup; Tapp, Courtney L.; Lewis, Bradley R.; Drews, Maureen S.; L.Pietrzak, Tracy; Colby, Colin L.; Hall-Flavin, Daniel K.; Loukianova, Larissa L.; Heit, John A.; Mrazek, David A.; Karpyak, Victor M.
2013-01-01
Genome-wide association studies (GWAS) have revealed many single nucleotide polymorphisms (SNPs) associated with complex traits. Although these studies frequently fail to identify statistically significant associations, the top association signals from GWAS may be enriched for true associations. We therefore investigated the association of alcohol dependence with 43 SNPs selected from association signals in the first two published GWAS of alcoholism. Our analysis of 808 alcohol-dependent cases and 1,248 controls provided evidence of association of alcohol dependence with SNP rs1614972 in the ADH1C gene (unadjusted p = 0.0017). Because the GWAS study that originally reported association of alcohol dependence with this SNP [1] included only men, we also performed analyses in sex-specific strata. The results suggest that this SNP has a similar effect in both sexes (men: OR (95%CI) = 0.80 (0.66, 0.95); women: OR (95%CI) = 0.83 (0.66, 1.03)). We also observed marginal evidence of association of the rs1614972 minor allele with lower alcohol consumption in the non-alcoholic controls (p = 0.081), and independently in the alcohol-dependent cases (p = 0.046). Despite a number of potential differences between the samples investigated by the prior GWAS and the current study, data presented here provide additional support for the association of SNP rs1614972 in ADH1C with alcohol dependence and extend this finding by demonstrating association with consumption levels in both non-alcoholic and alcohol-dependent populations. Further studies should investigate the association of other polymorphisms in this gene with alcohol dependence and related alcohol-use phenotypes. PMID:23516558
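The sex-specific results above are reported as odds ratios with 95% confidence intervals. As a hedged illustration of how such an interval is formed and read, the Python sketch below computes an odds ratio and a Wald (Woolf) interval from a 2x2 allele-count table. The study may well have used a regression model instead, the function names are hypothetical, and the counts are invented for the example. Only the two intervals passed to the last helper come from the abstract.

```python
import math


def odds_ratio_with_ci(case_alt, case_ref, ctrl_alt, ctrl_ref, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 allele-count table.

    case_alt / case_ref: minor and major allele counts in cases.
    ctrl_alt / ctrl_ref: minor and major allele counts in controls.
    z: normal quantile for the interval (1.96 gives about 95% coverage).
    """
    odds_ratio = (case_alt * ctrl_ref) / (case_ref * ctrl_alt)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / case_alt + 1 / case_ref + 1 / ctrl_alt + 1 / ctrl_ref)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


def interval_excludes_one(lower, upper):
    """True when the interval lies entirely below or entirely above OR = 1."""
    return upper < 1.0 or lower > 1.0


# Hypothetical allele counts, not taken from the study.
or_estimate, ci_lower, ci_upper = odds_ratio_with_ci(100, 200, 150, 240)
print(f"OR={or_estimate:.2f} (95% CI {ci_lower:.2f}, {ci_upper:.2f})")

# Intervals as reported in the abstract for men and women.
men_excludes_one = interval_excludes_one(0.66, 0.95)
women_excludes_one = interval_excludes_one(0.66, 1.03)
print(men_excludes_one, women_excludes_one)
```

The point estimates for men and women are close (0.80 and 0.83), so the wider interval in women is consistent with the abstract's reading of a similar effect in both sexes, estimated with less precision in the smaller stratum.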
Walsh, Kyle M; Anderson, Erik; Hansen, Helen M; Decker, Paul A; Kosel, Matt L; Kollmeyer, Thomas; Rice, Terri; Zheng, Shichun; Xiao, Yuanyuan; Chang, Jeffrey S; McCoy, Lucie S; Bracci, Paige M; Wiemels, Joe L; Pico, Alexander R; Smirnov, Ivan; Lachance, Daniel H; Sicotte, Hugues; Eckel-Passow, Jeanette E; Wiencke, John K; Jenkins, Robert B; Wrensch, Margaret R
2013-02-01
Genomewide association studies (GWAS) and candidate-gene studies have implicated single-nucleotide polymorphisms (SNPs) in at least 45 different genes as putative glioma risk factors. Attempts to validate these associations have yielded variable results and few genetic risk factors have been consistently replicated. We conducted a case-control study of Caucasian glioma cases and controls from the University of California San Francisco (810 cases, 512 controls) and the Mayo Clinic (852 cases, 789 controls) in an attempt to replicate previously reported genetic risk factors for glioma. Sixty SNPs selected from the literature (eight from GWAS and 52 from candidate-gene studies) were successfully genotyped on an Illumina custom genotyping panel. Eight SNPs in/near seven different genes (TERT, EGFR, CCDC26, CDKN2A, PHLDB1, RTEL1, TP53) were significantly associated with glioma risk in the combined dataset (P < 0.05), with all associations in the same direction as in previous reports. Several SNP associations showed considerable differences across histologic subtype. All eight successfully replicated associations were first identified by GWAS, whereas none of the putative risk SNPs from candidate-gene studies was associated in the full case-control sample (all P values > 0.05). Although several confirmed associations are located near genes long known to be involved in gliomagenesis (e.g., EGFR, CDKN2A, TP53), these associations were first discovered by the GWAS approach and are in noncoding regions. These results highlight that the deficiencies of the candidate-gene approach lie in selecting both appropriate genes and relevant SNPs within these genes. © 2012 WILEY PERIODICALS, INC.
Al-Tassan, Nada A; Whiffin, Nicola; Hosking, Fay J; Palles, Claire; Farrington, Susan M; Dobbins, Sara E; Harris, Rebecca; Gorman, Maggie; Tenesa, Albert; Meyer, Brian F; Wakil, Salma M; Kinnersley, Ben; Campbell, Harry; Martin, Lynn; Smith, Christopher G; Idziaszczyk, Shelley; Barclay, Ella; Maughan, Timothy S; Kaplan, Richard; Kerr, Rachel; Kerr, David; Buchanan, Daniel D; Win, Aung Ko; Hopper, John; Jenkins, Mark; Lindor, Noralane M; Newcomb, Polly A; Gallinger, Steve; Conti, David; Schumacher, Fred; Casey, Graham; Dunlop, Malcolm G; Tomlinson, Ian P; Cheadle, Jeremy P; Houlston, Richard S
2015-05-20
Genome-wide association studies (GWAS) of colorectal cancer (CRC) have identified 23 susceptibility loci thus far. Analyses of previously conducted GWAS indicate additional risk loci are yet to be discovered. To identify novel CRC susceptibility loci, we conducted a new GWAS and performed a meta-analysis with five published GWAS (totalling 7,577 cases and 9,979 controls of European ancestry), imputing genotypes utilising the 1000 Genomes Project. The combined analysis identified new, significant associations with CRC at 1p36.2 marked by rs72647484 (minor allele frequency [MAF] = 0.09) near CDC42 and WNT4 (P = 1.21 × 10(-8), odds ratio [OR] = 1.21) and at 16q24.1 marked by rs16941835 (MAF = 0.21, P = 5.06 × 10(-8); OR = 1.15) within the long non-coding RNA (lncRNA) RP11-58A18.1 and ~500 kb from the nearest coding gene FOXL1. Additionally we identified a promising association at 10p13 with rs10904849 intronic to CUBN (MAF = 0.32, P = 7.01 × 10(-8); OR = 1.14). These findings provide further insights into the genetic and biological basis of inherited genetic susceptibility to CRC. Additionally, our analysis further demonstrates that imputation can be used to exploit GWAS data to identify novel disease-causing variants.
Chung, Dongjun; Kim, Hang J; Zhao, Hongyu
2017-02-01
Genome-wide association studies (GWAS) have identified tens of thousands of genetic variants associated with hundreds of phenotypes and diseases, which have provided clinical and medical benefits to patients with novel biomarkers and therapeutic targets. However, identification of risk variants associated with complex diseases remains challenging as they are often affected by many genetic variants with small or moderate effects. There has been accumulating evidence suggesting that different complex traits share common risk basis, namely pleiotropy. Recently, several statistical methods have been developed to improve statistical power to identify risk variants for complex traits through a joint analysis of multiple GWAS datasets by leveraging pleiotropy. While these methods were shown to improve statistical power for association mapping compared to separate analyses, they are still limited in the number of phenotypes that can be integrated. In order to address this challenge, in this paper, we propose a novel statistical framework, graph-GPA, to integrate a large number of GWAS datasets for multiple phenotypes using a hidden Markov random field approach. Application of graph-GPA to a joint analysis of GWAS datasets for 12 phenotypes shows that graph-GPA improves statistical power to identify risk variants compared to statistical methods based on smaller number of GWAS datasets. In addition, graph-GPA also promotes better understanding of genetic mechanisms shared among phenotypes, which can potentially be useful for the development of improved diagnosis and therapeutics. The R implementation of graph-GPA is currently available at https://dongjunchung.github.io/GGPA/.
Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder
ERIC Educational Resources Information Center
Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.
2010-01-01
Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…
Nested association mapping for dissecting complex traits using Peanut 58K SNP array
USDA-ARS's Scientific Manuscript database
Genome-wide association studies (GWAS) and linkage mapping have been the two most predominant strategies to dissect complex traits, but are limited by the occurrence of false positives reported for GWAS, and low resolution in the case of linkage analysis. This has led to the development of a joint a...
USDA-ARS's Scientific Manuscript database
Copy number variation (CNV) is an important type of genetic variation contributing to phenotypic differences among mammals and may serve as an alternative molecular marker to single nucleotide polymorphism (SNP) for genome-wide association study (GWAS). Recently, GWAS analysis using CNV has been app...
Transcriptional risk scores link GWAS to eQTLs and predict complications in Crohn's disease.
Marigorta, Urko M; Denson, Lee A; Hyams, Jeffrey S; Mondal, Kajari; Prince, Jarod; Walters, Thomas D; Griffiths, Anne; Noe, Joshua D; Crandall, Wallace V; Rosh, Joel R; Mack, David R; Kellermayer, Richard; Heyman, Melvin B; Baker, Susan S; Stephens, Michael C; Baldassano, Robert N; Markowitz, James F; Kim, Mi-Ok; Dubinsky, Marla C; Cho, Judy; Aronow, Bruce J; Kugathasan, Subra; Gibson, Greg
2017-10-01
Gene expression profiling can be used to uncover the mechanisms by which loci identified through genome-wide association studies (GWAS) contribute to pathology. Given that most GWAS hits are in putative regulatory regions and transcript abundance is physiologically closer to the phenotype of interest, we hypothesized that summation of risk-allele-associated gene expression, namely a transcriptional risk score (TRS), should provide accurate estimates of disease risk. We integrate summary-level GWAS and expression quantitative trait locus (eQTL) data with RNA-seq data from the RISK study, an inception cohort of pediatric Crohn's disease. We show that TRSs based on genes regulated by variants linked to inflammatory bowel disease (IBD) not only outperform genetic risk scores (GRSs) in distinguishing Crohn's disease from healthy samples, but also serve to identify patients who in time will progress to complicated disease. Our dissection of eQTL effects may be used to distinguish genes whose association with disease is through promotion versus protection, thereby linking statistical association to biological mechanism. The TRS approach constitutes a potential strategy for personalized medicine that enhances inference from static genotypic risk assessment.
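The abstract describes the transcriptional risk score as a summation of risk-allele-associated gene expression. A minimal sketch of that idea follows, assuming each gene's expression is standardised across samples and signed by the direction in which the risk allele moves expression. The gene names, the z-score standardisation and the unweighted sum are illustrative assumptions; the published score may be constructed differently.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardise one gene's expression across samples (needs non-constant values)."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def transcriptional_risk_scores(expression, risk_direction):
    """Sum signed, standardised expression over genes for each sample.

    expression:     gene -> list of per-sample transcript abundances
    risk_direction: gene -> +1 if the risk allele raises expression, -1 if it lowers it
    """
    genes = list(risk_direction)
    n_samples = len(expression[genes[0]])
    scores = [0.0] * n_samples
    for gene in genes:
        for i, z in enumerate(zscores(expression[gene])):
            scores[i] += risk_direction[gene] * z
    return scores


# Hypothetical genes and abundances for three samples.
expression = {"GENE_A": [1.0, 2.0, 3.0], "GENE_B": [3.0, 2.0, 1.0]}
risk_direction = {"GENE_A": +1, "GENE_B": -1}
print(transcriptional_risk_scores(expression, risk_direction))
```

Higher scores mark samples whose expression pattern resembles what the risk alleles would produce, whatever genotype the sample actually carries.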
Genomic Influences on Hyperuricemia and Gout.
Merriman, Tony
2017-08-01
Genome-wide association studies (GWAS) have identified nearly 30 loci associated with urate concentrations that also influence the subsequent risk of gout. The ABCG2 Q141K variant is highly likely to be causal and results in internalization of ABCG2, which can be rescued by drugs. Three other GWAS loci contain uric acid transporter genes, which are also highly likely to be causal. However, identification of causal genes at other urate loci is challenging. Finally, relatively little is known about the genetic control of progression from hyperuricemia to gout. Only 4 small GWAS have been published for gout. Copyright © 2017 Elsevier Inc. All rights reserved.
Grover, Sandeep; Del Greco M, Fabiola; Stein, Catherine M; Ziegler, Andreas
2017-01-01
Confounding and reverse causality have prevented us from drawing meaningful clinical interpretation even in well-powered observational studies. Confounding may be attributed to our inability to randomize the exposure variable in observational studies. Mendelian randomization (MR) is one approach to overcome confounding. It utilizes one or more genetic polymorphisms as a proxy for the exposure variable of interest. Polymorphisms are randomly distributed in a population, they are static throughout an individual's lifetime, and may thus help in inferring directionality in exposure-outcome associations. Genome-wide association studies (GWAS) or meta-analyses of GWAS are characterized by large sample sizes and the availability of many single nucleotide polymorphisms (SNPs), making GWAS-based MR an attractive approach. GWAS-based MR comes with specific challenges, including multiple causality. Despite shortcomings, it still remains one of the most powerful techniques for inferring causality. With MR still an evolving concept with complex statistical challenges, the literature is relatively scarce in terms of providing working examples incorporating real datasets. In this chapter, we provide a step-by-step guide for causal inference based on the principles of MR with a real dataset using both individual and summary data from unrelated individuals. We suggest best possible practices and give recommendations based on the current literature.
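To make the summary-data form of MR concrete, the sketch below computes a per-SNP Wald ratio and an inverse-variance weighted (IVW) estimate from SNP-exposure and SNP-outcome effects. These are standard textbook estimators and are not necessarily the ones used in the chapter. The input values are hypothetical, and the standard errors use a first-order approximation that ignores uncertainty in the SNP-exposure effects.

```python
import math


def wald_ratio(beta_exposure, beta_outcome, se_outcome):
    """Causal estimate from a single SNP: outcome effect divided by exposure effect."""
    estimate = beta_outcome / beta_exposure
    se = se_outcome / abs(beta_exposure)  # first-order approximation
    return estimate, se


def ivw_estimate(snps):
    """IVW estimate over (beta_exposure, beta_outcome, se_outcome) tuples.

    Equivalent to a weighted regression of outcome effects on exposure
    effects through the origin, with weights 1 / se_outcome**2.
    """
    numerator = sum(bx * by / sy ** 2 for bx, by, sy in snps)
    denominator = sum(bx ** 2 / sy ** 2 for bx, _, sy in snps)
    return numerator / denominator, math.sqrt(1.0 / denominator)


# Hypothetical summary statistics for three independent instruments.
snps = [(0.10, 0.052, 0.010), (0.20, 0.095, 0.020), (0.15, 0.080, 0.015)]
estimate, se = ivw_estimate(snps)
print(f"IVW causal estimate = {estimate:.3f} (SE {se:.3f})")
```

The IVW estimate is only interpretable as causal if every SNP satisfies the instrumental-variable assumptions. Pleiotropic SNPs bias it, which is why sensitivity analyses are usually run alongside it.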
Mieth, Bettina; Kloft, Marius; Rodríguez, Juan Antonio; Sonnenburg, Sören; Vobruba, Robin; Morcillo-Suárez, Carlos; Farré, Xavier; Marigorta, Urko M.; Fehr, Ernst; Dickhaus, Thorsten; Blanchard, Gilles; Schunk, Daniel; Navarro, Arcadi; Müller, Klaus-Robert
2016-01-01
The standard approach to the analysis of genome-wide association studies (GWAS) is based on testing each position in the genome individually for statistical significance of its association with the phenotype under investigation. To improve the analysis of GWAS, we propose a combination of machine learning and statistical testing that takes correlation structures within the set of SNPs under investigation in a mathematically well-controlled manner into account. The novel two-step algorithm, COMBI, first trains a support vector machine to determine a subset of candidate SNPs and then performs hypothesis tests for these SNPs together with an adequate threshold correction. Applying COMBI to data from a WTCCC study (2007) and measuring performance as replication by independent GWAS published within the 2008–2015 period, we show that our method outperforms ordinary raw p-value thresholding as well as other state-of-the-art methods. COMBI presents higher power and precision than the examined alternatives while yielding fewer false (i.e. non-replicated) and more true (i.e. replicated) discoveries when its results are validated on later GWAS studies. More than 80% of the discoveries made by COMBI upon WTCCC data have been validated by independent studies. Implementations of the COMBI method are available as a part of the GWASpi toolbox 2.0. PMID:27892471
Ascertainment bias from imputation methods evaluation in wheat.
Brandariz, Sofía P; González Reymúndez, Agustín; Lado, Bettina; Malosetti, Marcos; Garcia, Antonio Augusto Franco; Quincke, Martín; von Zitzewitz, Jarislav; Castro, Marina; Matus, Iván; Del Pozo, Alejandro; Castro, Ariel J; Gutiérrez, Lucía
2016-10-04
Whole-genome genotyping techniques like Genotyping-by-sequencing (GBS) are being used for genetic studies such as Genome-Wide Association (GWAS) and Genomewide Selection (GS), where different strategies for imputation have been developed. Nevertheless, imputation error may lead to poor performance (i.e. smaller power or higher false positive rate) when complete data is not required as it is for GWAS, and each marker is taken one at a time. The aim of this study was to compare the performance of GWAS analysis for Quantitative Trait Loci (QTL) of major and minor effect using different imputation methods when no reference panel is available in a wheat GBS panel. In this study, we compared the power and false positive rate of dissecting quantitative traits for imputed and not-imputed marker score matrices in: (1) a complete molecular marker barley panel array, and (2) a GBS wheat panel with missing data. We found that there is an ascertainment bias in imputation method comparisons. Simulating over a complete matrix and creating missing data at random proved that imputation methods have a poorer performance. Furthermore, we found that when QTL were simulated with imputed data, the imputation methods performed better than the not-imputed ones. On the other hand, when QTL were simulated with not-imputed data, the not-imputed method and one of the imputation methods performed better for dissecting quantitative traits. Moreover, larger differences between imputation methods were detected for QTL of major effect than QTL of minor effect. We also compared the different marker score matrices for GWAS analysis in a real wheat phenotype dataset, and we found minimal differences indicating that imputation did not improve the GWAS performance when a reference panel was not available. Poorer performance was found in GWAS analysis of a wheat GBS panel when an imputed marker score matrix was used and no reference panel was available.
Zbrowse: An interactive GWAS results browser
USDA-ARS's Scientific Manuscript database
The growing number of genotyped populations, the advent of high-throughput phenotyping techniques and the development of GWAS analysis software has rapidly accelerated the number of GWAS experimental results. Candidate gene discovery from these results files is often tedious, involving many manual s...
Yang, Wanneng; Guo, Zilong; Huang, Chenglong; Duan, Lingfeng; Chen, Guoxing; Jiang, Ni; Fang, Wei; Feng, Hui; Xie, Weibo; Lian, Xingming; Wang, Gongwei; Luo, Qingming; Zhang, Qifa; Liu, Qian; Xiong, Lizhong
2014-01-01
Even as the study of plant genomics rapidly develops through the use of high-throughput sequencing techniques, traditional plant phenotyping lags far behind. Here we develop a high-throughput rice phenotyping facility (HRPF) to monitor 13 traditional agronomic traits and 2 newly defined traits during the rice growth period. Using genome-wide association studies (GWAS) of the 15 traits, we identify 141 associated loci, 25 of which contain known genes such as the Green Revolution semi-dwarf gene, SD1. Based on a performance evaluation of the HRPF and GWAS results, we demonstrate that high-throughput phenotyping has the potential to replace traditional phenotyping techniques and can provide valuable gene identification information. The combination of the multifunctional phenotyping tools HRPF and GWAS provides deep insights into the genetic architecture of important traits. PMID:25295980
Wang, W; Huang, S; Hou, W; Liu, Y; Fan, Q; He, A; Wen, Y; Hao, J; Guo, X; Zhang, F
2017-10-01
Several genome-wide association studies (GWAS) of bone mineral density (BMD) have successfully identified multiple susceptibility genes, yet isolated susceptibility genes are often difficult to interpret biologically. The aim of this study was to unravel the genetic background of BMD at pathway level, by integrating BMD GWAS data with genome-wide expression quantitative trait loci (eQTLs) and methylation quantitative trait loci (meQTLs) data. Methods: We employed the GWAS datasets of BMD from the Genetic Factors for Osteoporosis Consortium (GEFOS), analysing patients' BMD. The areas studied included 32 735 femoral necks, 28 498 lumbar spines, and 8143 forearms. Genome-wide eQTLs (containing 923 021 eQTLs) and meQTLs (containing 683 152 unique methylation sites with local meQTLs) data sets were collected from recently published studies. Gene scores were first calculated by summary data-based Mendelian randomisation (SMR) software and meQTL-aligned GWAS results. Gene set enrichment analysis (GSEA) was then applied to identify BMD-associated gene sets with a predefined significance level of 0.05. We identified multiple gene sets associated with BMD in one or more regions, including relevant known biological gene sets such as the Reactome Circadian Clock (GSEA p-value = 1.0 × 10^-4 for LS and 2.7 × 10^-2 for femoral necks BMD in eQTLs-based GSEA) and insulin-like growth factor receptor binding (GSEA p-value = 5.0 × 10^-4 for femoral necks and 2.6 × 10^-2 for lumbar spines BMD in meQTLs-based GSEA). Our results provided novel clues for subsequent functional analysis of bone metabolism, and illustrated the benefit of integrating eQTLs and meQTLs data into pathway association analysis for genetic studies of complex human diseases. Cite this article: W. Wang, S. Huang, W. Hou, Y. Liu, Q. Fan, A. He, Y. Wen, J. Hao, X. Guo, F. Zhang. Integrative analysis of GWAS, eQTLs and meQTLs data suggests that multiple gene sets are associated with bone mineral density. Bone Joint Res 2017;6:572-576. © 2017 Wang et al.
Veturi, Yogasudha; Ritchie, Marylyn D
2018-01-01
Transcriptome-wide association studies (TWAS) have recently been employed as an approach that can draw upon the advantages of genome-wide association studies (GWAS) and gene expression studies to identify genes associated with complex traits. Unlike standard GWAS, summary level data suffices for TWAS and offers improved statistical power. Two popular TWAS methods include either (a) imputing the cis genetic component of gene expression from smaller sized studies (using multi-SNP prediction or MP) into much larger effective sample sizes afforded by GWAS - TWAS-MP or (b) using summary-based Mendelian randomization - TWAS-SMR. Although these methods have been effective at detecting functional variants, it remains unclear how extensive variability in the genetic architecture of complex traits and diseases impacts TWAS results. Our goal was to investigate the different scenarios under which these methods yielded enough power to detect significant expression-trait associations. In this study, we conducted extensive simulations based on 6000 randomly chosen, unrelated Caucasian males from Geisinger's MyCode population to compare the power to detect cis expression-trait associations (within 500 kb of a gene) using the above-described approaches. To test TWAS across varying genetic backgrounds we simulated gene expression and phenotype using different quantitative trait loci per gene and cis-expression /trait heritability under genetic models that differentiate the effect of causality from that of pleiotropy. For each gene, on a training set ranging from 100 to 1000 individuals, we either (a) estimated regression coefficients with gene expression as the response using five different methods: LASSO, elastic net, Bayesian LASSO, Bayesian spike-slab, and Bayesian ridge regression or (b) performed eQTL analysis. 
We then sampled with replacement 50,000, 150,000, and 300,000 individuals respectively from the testing set of the remaining 5000 individuals and conducted GWAS on each set. Subsequently, we integrated the GWAS summary statistics derived from the testing set with the weights (or eQTLs) derived from the training set to identify expression-trait associations using (a) TWAS-MP (b) TWAS-SMR (c) eQTL-based GWAS, or (d) standalone GWAS. Finally, we examined the power to detect functionally relevant genes using the different approaches under the considered simulation scenarios. In general, we observed great similarities among TWAS-MP methods although the Bayesian methods resulted in improved power in comparison to LASSO and elastic net as the trait architecture grew more complex while training sample sizes and expression heritability remained small. Finally, we observed high power under causality but very low to moderate power under pleiotropy.
Human genetics as a model for target validation: finding new therapies for diabetes.
Thomsen, Soren K; Gloyn, Anna L
2017-06-01
Type 2 diabetes is a global epidemic with major effects on healthcare expenditure and quality of life. Currently available treatments are inadequate for the prevention of comorbidities, yet progress towards new therapies remains slow. A major barrier is the insufficiency of traditional preclinical models for predicting drug efficacy and safety. Human genetics offers a complementary model to assess causal mechanisms for target validation. Genetic perturbations are 'experiments of nature' that provide a uniquely relevant window into the long-term effects of modulating specific targets. Here, we show that genetic discoveries over the past decades have accurately predicted (now known) therapeutic mechanisms for type 2 diabetes. These findings highlight the potential for use of human genetic variation for prospective target validation, and establish a framework for future applications. Studies into rare, monogenic forms of diabetes have also provided proof-of-principle for precision medicine, and the applicability of this paradigm to complex disease is discussed. Finally, we highlight some of the limitations that are relevant to the use of genome-wide association studies (GWAS) in the search for new therapies for diabetes. A key outstanding challenge is the translation of GWAS signals into disease biology and we outline possible solutions for tackling this experimental bottleneck.
Guo, Michael; Liu, Zun; Willen, Jessie; Shaw, Cameron P; Richard, Daniel; Jagoda, Evelyn; Doxey, Andrew C; Hirschhorn, Joel; Capellini, Terence D
2017-12-05
GWAS have identified hundreds of height-associated loci. However, determining causal mechanisms is challenging, especially since height-relevant tissues (e.g. growth plates) are difficult to study. To uncover mechanisms by which height GWAS variants function, we performed epigenetic profiling of murine femoral growth plates. The profiled open chromatin regions recapitulate known chondrocyte and skeletal biology, are enriched at height GWAS loci, particularly near differentially expressed growth plate genes, and enriched for binding motifs of transcription factors with roles in chondrocyte biology. At specific loci, our analyses identified compelling mechanisms for GWAS variants. For example, at CHSY1, we identified a candidate causal variant (rs9920291) overlapping an open chromatin region. Reporter assays demonstrated that rs9920291 shows allelic regulatory activity, and CRISPR/Cas9 targeting of human chondrocytes demonstrates that the region regulates CHSY1 expression. Thus, integrating biologically relevant epigenetic information (here, from growth plates) with genetic association results can identify biological mechanisms important for human growth.
Genetic and epigenetic markers in colorectal cancer screening: recent advances.
Singh, Manish Pratap; Rai, Sandhya; Suyal, Shradha; Singh, Sunil Kumar; Singh, Nand Kumar; Agarwal, Akash; Srivastava, Sameer
2017-07-01
Colorectal cancer (CRC) is a heterogenous disease which develops from benign intraepithelial lesions known as adenomas to malignant carcinomas. Acquired alterations in Wnt signaling, TGFβ, MAPK pathway genes and clonal propagation of altered cells are responsible for this transformation. Detection of adenomas or early stage cancer in asymptomatic patients and better prognostic and predictive markers is important for improving the clinical management of CRC. Area covered: In this review, the authors have evaluated the potential of genetic and epigenetic alterations as markers for early detection, prognosis and therapeutic predictive potential in the context of CRC. We have discussed molecular heterogeneity present in CRC and its correlation to prognosis and response to therapy. Expert commentary: Molecular marker based CRC screening methods still fail to gain trust of clinicians. Invasive screening methods, molecular heterogeneity, chemoresistance and low quality test samples are some key challenges which need to be addressed in the present context. New sequencing technologies and integrated omics data analysis of individual or population cohort results in GWAS. MPE studies following a GWAS could be future line of research to establish accurate correlations between CRC and its risk factors. This strategy would identify most reliable biomarkers for CRC screening and management.
Scalable privacy-preserving data sharing methodology for genome-wide association studies.
Yu, Fei; Fienberg, Stephen E; Slavković, Aleksandra B; Uhler, Caroline
2014-08-01
The protection of privacy of individual-level information in genome-wide association study (GWAS) databases has been a major concern of researchers following the publication of "an attack" on GWAS data by Homer et al. (2008). Traditional statistical methods for confidentiality and privacy protection of statistical databases do not scale well to deal with GWAS data, especially in terms of guarantees regarding protection from linkage to external information. The more recent concept of differential privacy, introduced by the cryptographic community, is an approach that provides a rigorous definition of privacy with meaningful privacy guarantees in the presence of arbitrary external information, although the guarantees may come at a serious price in terms of data utility. Building on such notions, Uhler et al. (2013) proposed new methods to release aggregate GWAS data without compromising an individual's privacy. We extend the methods developed in Uhler et al. (2013) for releasing differentially-private χ(2)-statistics by allowing for arbitrary number of cases and controls, and for releasing differentially-private allelic test statistics. We also provide a new interpretation by assuming the controls' data are known, which is a realistic assumption because some GWAS use publicly available data as controls. We assess the performance of the proposed methods through a risk-utility analysis on a real data set consisting of DNA samples collected by the Wellcome Trust Case Control Consortium and compare the methods with the differentially-private release mechanism proposed by Johnson and Shmatikov (2013). Copyright © 2014 Elsevier Inc. All rights reserved.
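The differentially private release of a test statistic mentioned above can be illustrated with the generic Laplace mechanism, which adds noise with scale sensitivity / epsilon to the statistic before publication. The sketch below is not the authors' method. In particular, the sensitivity of the χ²-statistic depends on the study design and is passed in as a parameter, and the value used in the example is a placeholder assumption.

```python
import random


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise as the difference of two exponential variates."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_release(statistic, sensitivity, epsilon, rng):
    """Laplace mechanism: perturb a statistic with noise of scale sensitivity / epsilon."""
    return statistic + laplace_noise(sensitivity / epsilon, rng)


rng = random.Random(2014)   # fixed seed only to make the example reproducible
chi_square = 31.7           # hypothetical chi-square statistic for one SNP
sensitivity = 4.0           # placeholder; the true value must be derived for the design
for epsilon in (0.1, 1.0, 10.0):
    released = private_release(chi_square, sensitivity, epsilon, rng)
    print(f"epsilon = {epsilon:>4}: released statistic = {released:.2f}")
```

Smaller epsilon gives a stronger privacy guarantee but noisier output, which is the risk-utility trade-off the abstract refers to. A production release would also need to guard against floating-point artefacts in the noise sampler.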
Genome-Wide Association Study of Multiple Sclerosis Confirms a Novel Locus at 5p13.1
Sanna, Serena; Gayán, Javier; Urcelay, Elena; Zara, Ilenia; Pitzalis, Maristella; Cavanillas, María L.; Arroyo, Rafael; Zoledziewska, Magdalena; Marrosu, Marisa; Fernández, Oscar; Leyva, Laura; Alcina, Antonio; Fedetz, Maria; Moreno-Rey, Concha; Velasco, Juan; Real, Luis M.; Ruiz-Peña, Juan Luis; Cucca, Francesco
2012-01-01
Multiple Sclerosis (MS) is the most common progressive and disabling neurological condition affecting young adults in the world today. From a genetic point of view, MS is a complex disorder resulting from the combination of genetic and non-genetic factors. We aimed to identify previously unidentified loci by conducting a new GWAS of MS in a sample of 296 MS cases and 801 controls from the Spanish population. Meta-analysis of our data in combination with previous GWAS was done. A total of 17 GWAS-significant SNPs, corresponding to three different loci, were identified: HLA, IL2RA, and 5p13.1. All three have been previously reported as GWAS-significant. We confirmed our observation in 5p13.1 for rs9292777 using two additional independent Spanish samples to make a total of 4912 MS cases and 7498 controls (OR_pooled = 0.84; 95% CI: 0.80–0.89; p = 1.36 × 10^-9). This SNP differs from the one reported within this locus in a recent GWAS. Although it is unclear whether both signals are tapping the same genetic association, it seems clear that this locus plays an important role in the pathogenesis of MS. PMID:22570697
Koller, Daniel L.; Ichikawa, Shoji; Lai, Dongbing; Padgett, Leah R.; Doheny, Kimberly F.; Pugh, Elizabeth; Paschall, Justin; Hui, Siu L.; Edenberg, Howard J.; Xuei, Xiaoling; Peacock, Munro; Econs, Michael J.; Foroud, Tatiana
2010-01-01
Context: Several genome-wide association studies (GWAS) have been performed to identify genes contributing to bone mineral density (BMD), typically in samples of elderly women and men. Objective: The objective of the study was to identify genes contributing to BMD in premenopausal women. Design: GWAS using the Illumina 610Quad array in premenopausal European-American (EA) women and replication of the top 50 single-nucleotide polymorphisms (SNPs) for two BMD measures in African-American (AA) women. Subjects: Subjects included 1524 premenopausal EA women aged 20–45 yr from 762 sibships and 669 AA premenopausal women aged 20–44 yr from 383 sibships. Interventions: There were no interventions. Main Outcome Measures: BMD was measured at the lumbar spine and femoral neck by dual-energy x-ray absorptiometry. Age- and weight-adjusted BMD values were tested for association with each SNP, with P values determined by permutation. Results: SNPs in CATSPERB on chromosome 14 provided evidence of association with femoral neck BMD (rs1298989, P = 2.7 × 10^-5; rs1285635, P = 3.0 × 10^-5) in the EA women, and some supporting evidence was also observed with these SNPs in the AA women (rs1285635, P = 0.003). Genes identified in other BMD GWAS studies, including IBSP and ADAMTS18, were also among the most significant findings in our GWAS. Conclusions: Evidence of association to several novel loci was detected in a GWAS of premenopausal EA women, and SNPs in one of these loci also provided supporting evidence in a sample of AA women. PMID:20164292
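The abstract states that P values were determined by permutation. As a rough illustration of the idea, the sketch below shuffles a quantitative phenotype against the genotypes of one SNP and counts how often the permuted association is at least as strong as the observed one. It treats individuals as exchangeable, which is a simplifying assumption. The study sampled sibships, so its actual permutation scheme would have had to respect family structure. All data values are hypothetical.

```python
import random


def covariance(x, y):
    """Population covariance of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)


def permutation_p_value(genotypes, phenotype, n_perm=2000, seed=0):
    """Two-sided permutation p-value for a genotype-phenotype association."""
    rng = random.Random(seed)
    observed = abs(covariance(genotypes, phenotype))
    shuffled = list(phenotype)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(covariance(genotypes, shuffled)) >= observed:
            hits += 1
    # The add-one correction avoids reporting an impossible p-value of zero.
    return (hits + 1) / (n_perm + 1)


# Hypothetical minor-allele counts and adjusted BMD values for 15 individuals.
genotypes = [0] * 5 + [1] * 5 + [2] * 5
phenotype = [g + 0.01 * i for i, g in enumerate(genotypes)]
print(permutation_p_value(genotypes, phenotype))
```

The smallest attainable p-value is 1 / (n_perm + 1), so resolving values near 10^-5 needs on the order of a hundred thousand permutations or more per SNP.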
Cellular dissection of psoriasis for transcriptome analyses and the post-GWAS era
2014-01-01
Background: Genome-scale studies of psoriasis have been used to identify genes of potential relevance to disease mechanisms. For many identified genes, however, the cell type mediating disease activity is uncertain, which has limited our ability to design gene functional studies based on genomic findings. Methods: We identified differentially expressed genes (DEGs) with altered expression in psoriasis lesions (n = 216 patients), as well as candidate genes near susceptibility loci from psoriasis GWAS studies. These gene sets were characterized based upon their expression across 10 cell types present in psoriasis lesions. Susceptibility-associated variation at intergenic (non-coding) loci was evaluated to identify sites of allele-specific transcription factor binding. Results: Half of DEGs showed highest expression in skin cells, although the dominant cell type differed between psoriasis-increased DEGs (keratinocytes, 35%) and psoriasis-decreased DEGs (fibroblasts, 33%). In contrast, psoriasis GWAS candidates tended to have highest expression in immune cells (71%), with a significant fraction showing maximal expression in neutrophils (24%, P < 0.001). By identifying candidate cell types for genes near susceptibility loci, we could identify and prioritize SNPs at which susceptibility variants are predicted to influence transcription factor binding. This led to the identification of potentially causal (non-coding) SNPs for which susceptibility variants influence binding of AP-1, NF-κB, IRF1, STAT3 and STAT4. Conclusions: These findings underscore the role of innate immunity in psoriasis and highlight neutrophils as a cell type linked with pathogenetic mechanisms. Assignment of candidate cell types to genes emerging from GWAS studies provides a first step towards functional analysis, and we have proposed an approach for generating hypotheses to explain GWAS hits at intergenic loci. PMID:24885462
López-Isac, Elena; Martín, Jose-Ezequiel; Assassi, Shervin; Simeón, Carmen P; Carreira, Patricia; Ortego-Centeno, Norberto; Freire, Mayka; Beltrán, Emma; Narváez, Javier; Alegre-Sancho, Juan J; Fernández-Gutiérrez, Benjamín; Balsa, Alejandro; Ortiz, Ana M; González-Gay, Miguel A; Beretta, Lorenzo; Santaniello, Alessandro; Bellocchi, Chiara; Lunardi, Claudio; Moroncini, Gianluca; Gabrielli, Armando; Witte, Torsten; Hunzelmann, Nicolas; Distler, Jörg HW; Riekemasten, Gabriella; van der Helm-van Mil, Annete H; de Vries-Bouwstra, Jeska; Magro-Checa, Cesar; Voskuyl, Alexandre E; Vonk, Madelon C; Molberg, Øyvind; Merriman, Tony; Hesselstrand, Roger; Nordin, Annika; Padyukov, Leonid; Herrick, Ariane; Eyre, Steve; Koeleman, Bobby PC; Denton, Christopher P; Fonseca, Carmen; Radstake, Timothy RDJ; Worthington, Jane; Mayes, Maureen D; Martín, Javier
2017-01-01
Objectives: Systemic sclerosis (SSc) and rheumatoid arthritis (RA) are autoimmune diseases that share clinical and immunological characteristics. To date, several shared SSc-RA loci have been identified independently. In this study, we aimed to systematically search for new common SSc-RA loci through an inter-disease meta-GWAS strategy. Methods: We performed a meta-analysis combining GWAS datasets of SSc and RA using a strategy that allowed identification of loci with both same-direction and opposing-direction allelic effects. The top single-nucleotide polymorphisms (SNPs) were followed-up in independent SSc and RA case-control cohorts. This allowed us to increase the sample size to a total of 8,830 SSc patients, 16,870 RA patients and 43,393 controls. Results: The cross-disease meta-analysis of the GWAS datasets identified several loci with nominal association signals (P-value < 5 × 10^-6), which also showed evidence of association in the disease-specific GWAS scan. These loci included several genomic regions not previously reported as shared loci, besides risk factors associated with both diseases in previous studies. The follow-up of the putatively new SSc-RA loci identified IRF4 as a shared risk factor for these two diseases (P_combined = 3.29 × 10^-12). In addition, the analysis of the biological relevance of the known SSc-RA shared loci pointed to the type I interferon and the interleukin 12 signaling pathways as the main common etiopathogenic factors. Conclusions: Our study has identified a novel shared locus, IRF4, for SSc and RA and highlighted the usefulness of cross-disease GWAS meta-analysis in the identification of common risk loci. PMID:27111665
Lloyd-Jones, Luke R; Robinson, Matthew R; Yang, Jian; Visscher, Peter M
2018-04-01
Genome-wide association studies (GWAS) have identified thousands of loci that are robustly associated with complex diseases. The use of linear mixed model (LMM) methodology for GWAS is becoming more prevalent due to its ability to control for population structure and cryptic relatedness and to increase power. The odds ratio (OR) is a common measure of the association of a disease with an exposure (e.g., a genetic variant) and is readily available from logistic regression. However, when the LMM is applied to all-or-none traits it provides estimates of genetic effects on the observed 0-1 scale, a different scale to that in logistic regression. This limits the comparability of results across studies, for example in a meta-analysis, and makes the interpretation of the magnitude of an effect from an LMM GWAS difficult. In this study, we derived transformations from the genetic effects estimated under the LMM to the OR that only rely on summary statistics. To test the proposed transformations, we used real genotypes from two large, publicly available data sets to simulate all-or-none phenotypes for a set of scenarios that differ in underlying model, disease prevalence, and heritability. Furthermore, we applied these transformations to GWAS summary statistics for type 2 diabetes generated from 108,042 individuals in the UK Biobank. In both simulation and real-data application, we observed very high concordance between the transformed OR from the LMM and either the simulated truth or estimates from logistic regression. The transformations derived and validated in this study improve the comparability of results from prospective and already performed LMM GWAS on complex diseases by providing a reliable transformation to a common comparative scale for the genetic effects. Copyright © 2018 by the Genetics Society of America.
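The scale difference can be made concrete with a simple first-order approximation. Linearizing the logistic function around the sample case fraction k gives a slope on the 0-1 scale of roughly log(OR) × k(1 − k), so log(OR) ≈ β / (k(1 − k)). The sketch below implements only this rough approximation, which is reasonable for small effects; it is not the set of transformations derived in the study, and the function name is hypothetical.

```python
import math

def lmm_beta_to_or(beta_lmm, case_fraction):
    """Approximate odds ratio from an effect estimated on the observed 0-1 scale.

    Assumes a small effect, so that the logistic curve is nearly linear
    around the sample case fraction.
    """
    if not 0.0 < case_fraction < 1.0:
        raise ValueError("case_fraction must lie strictly between 0 and 1")
    log_or = beta_lmm / (case_fraction * (1.0 - case_fraction))
    return math.exp(log_or)

# Hypothetical example: beta = 0.01 per allele in a balanced case-control sample.
print(lmm_beta_to_or(0.01, 0.5))
```

The approximation degrades for large effects, rare variants, and very unbalanced case-control ratios, which is where a more careful transformation matters.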
Genetics of common forms of heart failure: challenges and potential solutions.
Rau, Christoph D; Lusis, Aldons J; Wang, Yibin
2015-05-01
In contrast to many other human diseases, the use of genome-wide association studies (GWAS) to identify genes for heart failure (HF) has had limited success. We will discuss the underlying challenges as well as potential new approaches to understanding the genetics of common forms of HF. Recent research using intermediate phenotypes, more detailed and quantitative stratification of HF symptoms, founder populations and novel animal models has begun to allow researchers to make headway toward explaining the genetics underlying HF using GWAS techniques. By expanding analyses of HF to improved clinical traits, additional HF classifications and innovative model systems, the intractability of human HF GWAS should be ameliorated significantly.
Brookes, Keeley J; McConnell, George; Williams, Kirsty; Chaudhury, Sultan; Madhan, Gaganjit; Patel, Tulsi; Turley, Christopher; Guetta-Baranes, Tamar; Bras, Jose; Guerreiro, Rita; Hardy, John; Francis, Paul T; Morgan, Kevin
2018-06-08
The Brains for Dementia Research (BDR) project is a recently established longitudinal cohort which aims to provide brain tissue for research purposes from neuropathologically defined samples. Here we present the findings from our analysis of the 19 established GWAS index SNPs for Alzheimer's disease, in order to determine whether the BDR sample also shows association with these variants. A highly significant association of the APOE ε4 allele was identified (p = 3.99 × 10⁻¹²). Association tests for the 19 GWAS SNPs found that, although no SNP survived correction for multiple testing, nominally significant findings were detected and concordance with the Lambert et al. GWAS meta-analysis was observed.
Zheng, Jie; Erzurumluoglu, A Mesut; Elsworth, Benjamin L; Kemp, John P; Howe, Laurence; Haycock, Philip C; Hemani, Gibran; Tansey, Katherine; Laurin, Charles; Pourcain, Beate St; Warrington, Nicole M; Finucane, Hilary K; Price, Alkes L; Bulik-Sullivan, Brendan K; Anttila, Verneri; Paternoster, Lavinia; Gaunt, Tom R; Evans, David M; Neale, Benjamin M
2017-01-15
LD score regression is a reliable and efficient method of using genome-wide association study (GWAS) summary-level results data to estimate the SNP heritability of complex traits and diseases, partition this heritability into functional categories, and estimate the genetic correlation between different phenotypes. Because the method relies on summary level results data, LD score regression is computationally tractable even for very large sample sizes. However, publicly available GWAS summary-level data are typically stored in different databases and have different formats, making it difficult to apply LD score regression to estimate genetic correlations across many different traits simultaneously. In this manuscript, we describe LD Hub - a centralized database of summary-level GWAS results for 173 diseases/traits from different publicly available resources/consortia and a web interface that automates the LD score regression analysis pipeline. To demonstrate functionality and validate our software, we replicated previously reported LD score regression analyses of 49 traits/diseases using LD Hub; and estimated SNP heritability and the genetic correlation across the different phenotypes. We also present new results obtained by uploading a recent atopic dermatitis GWAS meta-analysis to examine the genetic correlation between the condition and other potentially related traits. In response to the growing availability of publicly accessible GWAS summary-level results data, our database and the accompanying web interface will ensure maximal uptake of the LD score regression methodology, provide a useful database for the public dissemination of GWAS results, and provide a method for easily screening hundreds of traits for overlapping genetic aetiologies. The web interface and instructions for using LD Hub are available at http://ldsc.broadinstitute.org/. Contact: jie.zheng@bristol.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
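The core of LD score regression can be sketched in a few lines. Under the standard model, the expected chi-square statistic of a SNP is approximately intercept + (N × h2 / M) × LD score, where N is the sample size and M the number of SNPs, so the slope of a regression of chi-square on LD score yields the SNP heritability h2. The sketch below uses plain unweighted least squares on noise-free synthetic numbers; the real software uses weighted regression and a block jackknife for standard errors, and all names and values here are hypothetical.

```python
def ols(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def ldsc_heritability(ld_scores, chi2, n_samples, n_snps):
    """Estimate SNP heritability from the slope of chi-square on LD score."""
    slope, intercept = ols(ld_scores, chi2)
    return slope * n_snps / n_samples, intercept

# Synthetic, noise-free data generated with h2 = 0.4 and intercept = 1.
N, M, H2 = 50_000, 1_000_000, 0.4
ld = [10.0, 50.0, 100.0, 200.0]
chi2 = [1.0 + N * H2 / M * score for score in ld]
print(ldsc_heritability(ld, chi2, N, M))
```

An intercept well above 1 is usually read as a sign of confounding such as population stratification, whereas inflation that scales with LD score is consistent with polygenic signal.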
Abe, Makiko; Ito, Hidemi; Oze, Isao; Nomura, Masatoshi; Ogawa, Yoshihiro; Matsuo, Keitaro
2017-12-01
Little is known about differences in genetic predisposition to colorectal cancer (CRC) between ethnicities, although many genetic variants associated with CRC have been identified. This study investigated whether adding SNPs identified in GWAS of East Asian populations could improve risk prediction in Japanese individuals, and explored the possible use of genetic risk groups as a tool for risk communication. A total of 558 patients with histologically verified CRC and 1116 first-visit outpatients were included in the derivation study, and 547 cases and 547 controls in the replication study. In each population, we evaluated a CRC risk prediction model that incorporated a genetic risk group based on SNPs from GWASs in European populations, and a similarly developed model that added SNPs from GWASs in East Asian populations. We examined whether adding the East Asian-specific SNPs would improve discrimination. Six SNPs (rs6983267, rs4779584, rs4444235, rs9929218, rs10936599, rs16969681) of 23 SNPs from European-based GWAS and five SNPs (rs704017, rs11196172, rs10774214, rs647161, rs2423279) of ten SNPs from Asian-based GWAS were selected for the CRC risk prediction model. Compared with the 6-SNP model, the 11-SNP model including the Asian GWAS SNPs showed improved discrimination in receiver operating characteristic analysis, and the improvement was statistically significant in both the derivation (P = 0.0039) and replication (P = 0.0018) studies. We estimated the cumulative risk of CRC using the genetic risk group based on the 11 SNPs and found that the cumulative risk at age 80 was approximately 13% in the high-risk group and 6% in the low-risk group. We thus constructed a more efficient CRC risk prediction model with 11 SNPs, including newly identified East Asian-based GWAS SNPs (rs704017, rs11196172, rs10774214, rs647161, rs2423279). Risk grouping based on the 11 SNPs showed a difference in lifetime CRC risk, which might be useful for effective individualized prevention in East Asian populations.
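Discrimination in a receiver operating characteristic analysis is usually summarized by the area under the curve (AUC), which equals the probability that a randomly chosen case scores higher than a randomly chosen control. The sketch below computes an unweighted risk-allele count and its AUC by direct pairwise comparison. It assumes a simple allele-count score, which may differ from the risk grouping used in the study, and the genotype data are invented for illustration.

```python
def risk_allele_count(genotypes):
    """Sum of risk-allele counts (0, 1 or 2 per SNP) for one individual."""
    return sum(genotypes)

def auc(case_scores, control_scores):
    """AUC as the fraction of case-control pairs ranked correctly (ties count 0.5)."""
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))

# Hypothetical genotypes at four SNPs for three cases and three controls.
cases = [[2, 1, 1, 2], [1, 1, 2, 1], [2, 2, 1, 1]]
controls = [[0, 1, 1, 1], [1, 0, 1, 0], [1, 1, 2, 1]]
case_scores = [risk_allele_count(g) for g in cases]
control_scores = [risk_allele_count(g) for g in controls]
print(auc(case_scores, control_scores))
```

Comparing a 6-SNP score against an 11-SNP score then amounts to computing the AUC for each and testing whether the difference is larger than expected by chance.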
USDA-ARS's Scientific Manuscript database
Copy number variation (CNV) is an important type of genetic variation contributing to phenotypic differences among mammals and may serve as an alternative molecular marker to single nucleotide polymorphism (SNP) for genome-wide association study (GWAS). Recently, GWAS analysis using CNV has been app...
Genome-wide association studies in Alzheimer's disease.
Bertram, Lars; Tanzi, Rudolph E
2009-10-15
Genome-wide association studies (GWAS) have gained considerable momentum over the last couple of years for the identification of novel complex disease genes. In the field of Alzheimer's disease (AD), there are currently eight published and two provisionally reported GWAS, highlighting over two dozen novel potential susceptibility loci beyond the well-established APOE association. On the basis of the data available at the time of this writing, the most compelling novel GWAS signal has been observed in GAB2 (GRB2-associated binding protein 2), followed by less consistently replicated signals in galanin-like peptide (GALP), piggyBac transposable element derived 1 (PGBD1), and tyrosine kinase, non-receptor 1 (TNK1). Furthermore, consistent replication has been recently announced for CLU (clusterin, also known as apolipoprotein J). Finally, there are at least three replicated loci in hitherto uncharacterized genomic intervals on chromosomes 14q32.13, 14q31.2 and 6q24.1, likely implicating the existence of novel AD genes in these regions. In this review, we will discuss the characteristics and potential relevance to pathogenesis of the outcomes of all currently available GWAS in AD. A particular emphasis will be laid on findings with independent data in favor of the original association.
Unsupervised text mining for assessing and augmenting GWAS results.
Ailem, Melissa; Role, François; Nadif, Mohamed; Demenais, Florence
2016-04-01
Text mining can assist in the analysis and interpretation of large-scale biomedical data, helping biologists to quickly and cheaply gain confirmation of hypothesized relationships between biological entities. We set this question in the context of genome-wide association studies (GWAS), an actively emerging field that has helped to identify many genes associated with multifactorial diseases. These studies make it possible to identify groups of genes associated with the same phenotype, but provide no information about the relationships between these genes. Therefore, our objective is to leverage unsupervised text mining techniques, using text-based cosine similarity comparisons and clustering applied to candidate and random gene vectors, in order to augment the GWAS results. We propose a generic framework, which we used to characterize the relationships between 10 genes reported to be associated with asthma by a previous GWAS. The results of this experiment showed that the similarities between these 10 genes were significantly stronger than would be expected by chance (one-sided p-value<0.01). The clustering of observed and randomly selected genes also made it possible to generate hypotheses about potential functional relationships between these genes and thus contributed to the discovery of new candidate genes for asthma. Copyright © 2016 Elsevier Inc. All rights reserved.
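The text-based cosine similarity at the centre of this approach can be sketched as follows. Each gene is represented by a bag-of-words vector built from text that mentions it, and two genes are compared by the cosine of the angle between their vectors. This is a minimal illustration using raw term counts; the weighting scheme and corpus used in the study are not specified here, and the gene descriptions below are invented.

```python
import math
from collections import Counter

def term_vector(text):
    """Bag-of-words vector: lower-cased whitespace tokens with their counts."""
    return Counter(text.lower().split())

def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(vec_a[t] * vec_b[t] for t in vec_a.keys() & vec_b.keys())
    norm_a = math.sqrt(sum(v * v for v in vec_a.values()))
    norm_b = math.sqrt(sum(v * v for v in vec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

# Hypothetical one-line text profiles for two genes.
gene_a = term_vector("airway inflammation cytokine signalling")
gene_b = term_vector("cytokine receptor airway epithelium")
print(cosine_similarity(gene_a, gene_b))
```

Comparing the similarities among the candidate genes with those among randomly drawn gene sets is what gives the one-sided test described in the abstract.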
GWAS meta-analysis and replication identifies three new susceptibility loci for ovarian cancer
Pharoah, Paul D. P.; Tsai, Ya-Yu; Ramus, Susan J.; Phelan, Catherine M.; Goode, Ellen L.; Lawrenson, Kate; Price, Melissa; Fridley, Brooke L.; Tyrer, Jonathan P.; Shen, Howard; Weber, Rachel; Karevan, Rod; Larson, Melissa C.; Song, Honglin; Tessier, Daniel C.; Bacot, François; Vincent, Daniel; Cunningham, Julie M.; Dennis, Joe; Dicks, Ed; Aben, Katja K.; Anton-Culver, Hoda; Antonenkova, Natalia; Armasu, Sebastian M.; Baglietto, Laura; Bandera, Elisa V.; Beckmann, Matthias W.; Birrer, Michael J.; Bloom, Greg; Bogdanova, Natalia; Brenton, James D.; Brinton, Louise A.; Brooks-Wilson, Angela; Brown, Robert; Butzow, Ralf; Campbell, Ian; Carney, Michael E; Carvalho, Renato S.; Chang-Claude, Jenny; Chen, Y. Anne; Chen, Zhihua; Chow, Wong-Ho; Cicek, Mine S.; Coetzee, Gerhard; Cook, Linda S.; Cramer, Daniel W.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Despierre, Evelyn; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Edwards, Robert; Ekici, Arif B.; Fasching, Peter A.; Fenstermacher, David; Flanagan, James; Gao, Yu-Tang; Garcia-Closas, Montserrat; Gentry-Maharaj, Aleksandra; Giles, Graham; Gjyshi, Anxhela; Gore, Martin; Gronwald, Jacek; Guo, Qi; Halle, Mari K; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hillemanns, Peter; Hoatlin, Maureen; Høgdall, Estrid; Høgdall, Claus K.; Hosono, Satoyo; Jakubowska, Anna; Jensen, Allan; Kalli, Kimberly R.; Karlan, Beth Y.; Kelemen, Linda E.; Kiemeney, Lambertus A.; Kjaer, Susanne Krüger; Konecny, Gottfried E.; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Nathan; Lee, Janet; Leminen, Arto; Lim, Boon Kiong; Lissowska, Jolanta; Lubiński, Jan; Lundvall, Lene; Lurie, Galina; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Nakanishi, Toru; Narod, Steven A.; Ness, Roberta B.; Nevanlinna, Heli; Nickels, Stefan; Noushmehr, Houtan; Odunsi, Kunle; Olson, 
Sara; Orlow, Irene; Paul, James; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jenny; Pike, Malcolm C; Poole, Elizabeth M; Qu, Xiaotao; Risch, Harvey A.; Rodriguez-Rodriguez, Lorna; Rossing, Mary Anne; Rudolph, Anja; Runnebaum, Ingo; Rzepecka, Iwona K; Salvesen, Helga B.; Schwaab, Ira; Severi, Gianluca; Shen, Hui; Shridhar, Vijayalakshmi; Shu, Xiao-Ou; Sieh, Weiva; Southey, Melissa C.; Spellman, Paul; Tajima, Kazuo; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Timorek, Agnieszka; Tworoger, Shelley S.; van Altena, Anne M.; Berg, David Van Den; Vergote, Ignace; Vierkant, Robert A.; Vitonis, Allison F.; Wang-Gohrke, Shan; Wentzensen, Nicolas; Whittemore, Alice S.; Wik, Elisabeth; Winterhoff, Boris; Woo, Yin Ling; Wu, Anna H; Yang, Hannah P.; Zheng, Wei; Ziogas, Argyrios; Zulkifli, Famida; Goodman, Marc T.; Hall, Per; Easton, Douglas F; Pearce, Celeste L; Berchuck, Andrew; Chenevix-Trench, Georgia; Iversen, Edwin; Monteiro, Alvaro N.A.; Gayther, Simon A.; Schildkraut, Joellen M.; Sellers, Thomas A.
2013-01-01
Genome wide association studies (GWAS) have identified four susceptibility loci for epithelial ovarian cancer (EOC) with another two loci being close to genome-wide significance. We pooled data from a GWAS conducted in North America with another GWAS from the United Kingdom. We selected the top 24,551 SNPs for inclusion on the iCOGS custom genotyping array. Follow-up genotyping was carried out in 18,174 cases and 26,134 controls from 43 studies from the Ovarian Cancer Association Consortium. We validated the two loci at 3q25 and 17q21 previously near genome-wide significance and identified three novel loci associated with risk; two loci associated with all EOC subtypes, at 8q21 (rs11782652, P = 5.5 × 10⁻⁹) and 10p12 (rs1243180; P = 1.8 × 10⁻⁸), and another locus specific to the serous subtype at 17q12 (rs757210; P = 8.1 × 10⁻¹⁰). An integrated molecular analysis of genes and regulatory regions at these loci provided evidence for functional mechanisms underlying susceptibility that implicates CHMP4C in the pathogenesis of ovarian cancer. PMID:23535730
GWAS meta-analysis and replication identifies three new susceptibility loci for ovarian cancer.
Pharoah, Paul D P; Tsai, Ya-Yu; Ramus, Susan J; Phelan, Catherine M; Goode, Ellen L; Lawrenson, Kate; Buckley, Melissa; Fridley, Brooke L; Tyrer, Jonathan P; Shen, Howard; Weber, Rachel; Karevan, Rod; Larson, Melissa C; Song, Honglin; Tessier, Daniel C; Bacot, François; Vincent, Daniel; Cunningham, Julie M; Dennis, Joe; Dicks, Ed; Aben, Katja K; Anton-Culver, Hoda; Antonenkova, Natalia; Armasu, Sebastian M; Baglietto, Laura; Bandera, Elisa V; Beckmann, Matthias W; Birrer, Michael J; Bloom, Greg; Bogdanova, Natalia; Brenton, James D; Brinton, Louise A; Brooks-Wilson, Angela; Brown, Robert; Butzow, Ralf; Campbell, Ian; Carney, Michael E; Carvalho, Renato S; Chang-Claude, Jenny; Chen, Y Anne; Chen, Zhihua; Chow, Wong-Ho; Cicek, Mine S; Coetzee, Gerhard; Cook, Linda S; Cramer, Daniel W; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Despierre, Evelyn; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Edwards, Robert; Ekici, Arif B; Fasching, Peter A; Fenstermacher, David; Flanagan, James; Gao, Yu-Tang; Garcia-Closas, Montserrat; Gentry-Maharaj, Aleksandra; Giles, Graham; Gjyshi, Anxhela; Gore, Martin; Gronwald, Jacek; Guo, Qi; Halle, Mari K; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hillemanns, Peter; Hoatlin, Maureen; Høgdall, Estrid; Høgdall, Claus K; Hosono, Satoyo; Jakubowska, Anna; Jensen, Allan; Kalli, Kimberly R; Karlan, Beth Y; Kelemen, Linda E; Kiemeney, Lambertus A; Kjaer, Susanne Krüger; Konecny, Gottfried E; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Nathan; Lee, Janet; Leminen, Arto; Lim, Boon Kiong; Lissowska, Jolanta; Lubiński, Jan; Lundvall, Lene; Lurie, Galina; Massuger, Leon F A G; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Nakanishi, Toru; Narod, Steven A; Ness, Roberta B; Nevanlinna, Heli; Nickels, Stefan; Noushmehr, Houtan; Odunsi, Kunle; Olson, Sara; Orlow, Irene; Paul, James; 
Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jenny; Pike, Malcolm C; Poole, Elizabeth M; Qu, Xiaotao; Risch, Harvey A; Rodriguez-Rodriguez, Lorna; Rossing, Mary Anne; Rudolph, Anja; Runnebaum, Ingo; Rzepecka, Iwona K; Salvesen, Helga B; Schwaab, Ira; Severi, Gianluca; Shen, Hui; Shridhar, Vijayalakshmi; Shu, Xiao-Ou; Sieh, Weiva; Southey, Melissa C; Spellman, Paul; Tajima, Kazuo; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tworoger, Shelley S; van Altena, Anne M; van den Berg, David; Vergote, Ignace; Vierkant, Robert A; Vitonis, Allison F; Wang-Gohrke, Shan; Wentzensen, Nicolas; Whittemore, Alice S; Wik, Elisabeth; Winterhoff, Boris; Woo, Yin Ling; Wu, Anna H; Yang, Hannah P; Zheng, Wei; Ziogas, Argyrios; Zulkifli, Famida; Goodman, Marc T; Hall, Per; Easton, Douglas F; Pearce, Celeste L; Berchuck, Andrew; Chenevix-Trench, Georgia; Iversen, Edwin; Monteiro, Alvaro N A; Gayther, Simon A; Schildkraut, Joellen M; Sellers, Thomas A
2013-04-01
Genome-wide association studies (GWAS) have identified four susceptibility loci for epithelial ovarian cancer (EOC), with another two suggestive loci reaching near genome-wide significance. We pooled data from a GWAS conducted in North America with another GWAS from the UK. We selected the top 24,551 SNPs for inclusion on the iCOGS custom genotyping array. We performed follow-up genotyping in 18,174 individuals with EOC (cases) and 26,134 controls from 43 studies from the Ovarian Cancer Association Consortium. We validated the two loci at 3q25 and 17q21 that were previously found to have associations close to genome-wide significance and identified three loci newly associated with risk: two loci associated with all EOC subtypes at 8q21 (rs11782652, P = 5.5 × 10⁻⁹) and 10p12 (rs1243180, P = 1.8 × 10⁻⁸) and another locus specific to the serous subtype at 17q12 (rs757210, P = 8.1 × 10⁻¹⁰). An integrated molecular analysis of genes and regulatory regions at these loci provided evidence for functional mechanisms underlying susceptibility and implicated CHMP4C in the pathogenesis of ovarian cancer.
Peprah, Emmanuel; Xu, Huichun; Tekola-Ayele, Fasil; Royal, Charmaine D.
2014-01-01
Genomic research is one of the tools for elucidating the pathogenesis of diseases of global health relevance, and paving the research dimension to clinical and public health translation. Recent advances in genomic research and technologies have increased our understanding of human diseases, genes associated with these disorders, and the relevant mechanisms. Genome-wide association studies (GWAS) have proliferated since the first studies were published several years ago, and have become an important tool in helping researchers comprehend human variation and the role genetic variants play in disease. However, the need to expand the diversity of populations in GWAS has become increasingly apparent as new knowledge is gained about genetic variation. Inclusion of diverse populations in genomic studies is critical to a more complete understanding of human variation and elucidation of the underpinnings of complex diseases. In this review, we summarize the available data on GWAS in recent-African ancestry populations within the western hemisphere (i.e. African Americans and peoples of the Caribbean) and continental African populations. Furthermore, we highlight ways in which genomic studies in populations of recent African ancestry have led to advances in the areas of malaria, HIV, prostate cancer, and other diseases. Finally, we discuss the advantages of conducting GWAS in recent African ancestry populations in the context of addressing existing and emerging global health conditions. PMID:25427668
Benner, Christian; Havulinna, Aki S; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ripatti, Samuli; Pirinen, Matti
2017-10-05
During the past few years, various novel statistical methods have been developed for fine-mapping with the use of summary statistics from genome-wide association studies (GWASs). Although these approaches require information about the linkage disequilibrium (LD) between variants, there has not been a comprehensive evaluation of how estimation of the LD structure from reference genotype panels performs in comparison with that from the original individual-level GWAS data. Using population genotype data from Finland and the UK Biobank, we show here that a reference panel of 1,000 individuals from the target population is adequate for a GWAS cohort of up to 10,000 individuals, whereas smaller panels, such as those from the 1000 Genomes Project, should be avoided. We also show, both theoretically and empirically, that the size of the reference panel needs to scale with the GWAS sample size; this has important consequences for the application of these methods in ongoing GWAS meta-analyses and large biobank studies. We conclude by providing software tools and by recommending practices for sharing LD information to more efficiently exploit summary statistics in genetics research. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Sardos, Julie; Rouard, Mathieu; Hueber, Yann; Cenci, Alberto; Hyma, Katie E.; van den Houwe, Ines; Hribova, Eva; Courtois, Brigitte; Roux, Nicolas
2016-01-01
Banana (Musa sp.) is a vegetatively propagated, low fertility, potentially hybrid and polyploid crop. These qualities make the breeding and targeted genetic improvement of this crop a difficult and long process. The Genome-Wide Association Study (GWAS) approach is becoming widely used in crop plants and has proven efficient to detecting candidate genes for traits of interest, especially in cereals. GWAS has not been applied yet to a vegetatively propagated crop. However, successful GWAS in banana would considerably help unravel the genomic basis of traits of interest and therefore speed up this crop improvement. We present here a dedicated panel of 105 accessions of banana, freely available upon request, and their corresponding GBS data. A set of 5,544 highly reliable markers revealed high levels of admixture in most accessions, except for a subset of 33 individuals from Papua. A GWAS on the seedless phenotype was then successfully applied to the panel. By applying the Mixed Linear Model corrected for both kinship and structure as implemented in TASSEL, we detected 13 candidate genomic regions in which we found a number of genes potentially linked with the seedless phenotype (i.e. parthenocarpy combined with female sterility). An additional GWAS performed on the unstructured Papuan subset composed of 33 accessions confirmed six of these regions as candidate. Out of both sets of analyses, one strong candidate gene for female sterility, a putative orthologous gene to Histidine Kinase CKI1, was identified. The results presented here confirmed the feasibility and potential of GWAS when applied to small sets of banana accessions, at least for traits underpinned by a few loci. As phenotyping in banana is extremely space and time-consuming, this latest finding is of particular importance in the context of banana improvement. PMID:27144345
An update on the genetics of hyperuricaemia and gout.
Major, Tanya J; Dalbeth, Nicola; Stahl, Eli A; Merriman, Tony R
2018-06-01
A central aspect of the pathogenesis of gout is elevated urate concentrations, which lead to the formation of monosodium urate crystals. The clinical features of gout result from an individual's immune response to these deposited crystals. Genome-wide association studies (GWAS) have confirmed the importance of urate excretion in the control of serum urate levels and the risk of gout and have identified the kidneys, the gut and the liver as sites of urate regulation. The genetic contribution to the progression from hyperuricaemia to gout remains relatively poorly understood, although genes encoding proteins that are involved in the NLRP3 (NOD-, LRR- and pyrin domain-containing 3) inflammasome pathway play a part. Genome-wide and targeted sequencing is beginning to identify uncommon population-specific variants that are associated with urate levels and gout. Mendelian randomization studies using urate-associated genetic variants as unconfounded surrogates for lifelong urate exposure have not supported claims that urate is causal for metabolic conditions that are comorbidities of hyperuricaemia and gout. Genetic studies have also identified genetic variants that predict responsiveness to therapies (for example, urate-lowering drugs) for treatment of hyperuricaemia. Future research should focus on large GWAS (that include asymptomatic hyperuricaemic individuals) and on increasing the use of whole-genome sequencing data to identify uncommon genetic variants with increased penetrance that might provide opportunities for clinical translation.
Hamad, Rita; Tuljapurkar, Shripad; Rehkopf, David H
2016-09-01
Shorter telomere length (TL) has been associated with stress and adverse socioeconomic conditions, yet U.S. blacks have longer TL than whites. The role of genetic versus environmental factors in explaining TL by race and socioeconomic position (SEP) remains unclear. We used data from the U.S. Health and Retirement Study (N=11,934) to test the hypothesis that there are differences in TL-associated SNPs by race and SEP. We constructed a TL polygenic risk score (PRS) and examined its association with race/ethnicity, educational attainment, assets, gender, and age. U.S. blacks were more likely to have a lower PRS for TL, as were older individuals and men. Racial differences in TL were statistically accounted for when controlling for population structure using genetic principal components. The GWAS-derived SNPs for TL, however, may not have consistent associations with TL across different racial/ethnic groups. This study showed that associations of race/ethnicity with TL differed when accounting for population stratification. The role of race/ethnicity for TL remains uncertain, however, as the genetic determinants of TL may differ by race/ethnicity. Future GWAS samples should include racially diverse participants to allow for better characterization of the determinants of TL in human populations. Copyright © 2016 Forschungsgesellschaft für Arbeitsphysiologie und Arbeitschutz e.V. Published by Elsevier B.V. All rights reserved.
Respiratory reviews in asthma 2013.
Kim, Tae-Hyung
2014-03-01
From January 2012 to March 2013, many articles of major clinical importance in asthma were published, based on large clinical trials or meta-analyses. The main subjects of these studies were new therapeutic plans based on asthma phenotype, and efficacy and safety issues regarding the current treatment guidelines. Regarding efficacy and safety, the major concerns were inhaled corticosteroid tapering strategies and the continued use of long-acting beta agonists. As new therapies, monoclonal antibodies and macrolide antibiotics targeted to inflammatory phenotypes have been under investigation, with promising preliminary results. Other issues concerned disease susceptibility and the genetic background of asthma, particularly for the "severe asthma" phenotype. In the era of genomics and pharmacogenetics, there have been extensive studies to identify candidate susceptibility genes based on the results of genome-wide association studies (GWAS). However, the genetic basis of severe asthma, which accounts for most of the mortality and medical costs, remains very unclear. Moreover, there have been some efforts to find genetic information that could predict disease progression, but with few significant results so far. In conclusion, there are new, ongoing developments in the phenotypic classification of asthma and in therapeutic strategies tailored to phenotypic variation. With more pharmacogenomic information and clear identification of the "severe asthma" group from GWAS data even before disease progression, more adequate and individualized therapeutic strategies could be realized in the future.
Piette, Elizabeth R; Moore, Jason H
2018-01-01
Machine learning methods and conventions are increasingly employed for the analysis of large, complex biomedical data sets, including genome-wide association studies (GWAS). Reproducibility of machine learning analyses of GWAS can be hampered by biological and statistical factors, particularly so for the investigation of non-additive genetic interactions. Application of traditional cross validation to a GWAS data set may result in poor consistency between the training and testing data set splits due to an imbalance of the interaction genotypes relative to the data as a whole. We propose a new cross validation method, proportional instance cross validation (PICV), that preserves the original distribution of an independent variable when splitting the data set into training and testing partitions. We apply PICV to simulated GWAS data with epistatic interactions of varying minor allele frequencies and prevalences and compare performance to that of a traditional cross validation procedure in which individuals are randomly allocated to training and testing partitions. Sensitivity and positive predictive value are significantly improved across all tested scenarios for PICV compared to traditional cross validation. We also apply PICV to GWAS data from a study of primary open-angle glaucoma to investigate a previously-reported interaction, which fails to significantly replicate; PICV however improves the consistency of testing and training results. Application of traditional machine learning procedures to biomedical data may require modifications to better suit intrinsic characteristics of the data, such as the potential for highly imbalanced genotype distributions in the case of epistasis detection. The reproducibility of genetic interaction findings can be improved by considering this variable imbalance in cross validation implementation, such as with PICV. This approach may be extended to problems in other domains in which imbalanced variable distributions are a concern.
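The central idea, keeping the distribution of one variable the same in every partition, can be sketched as a stratified fold assignment. The code below groups samples by the value of the chosen variable (for example a two-locus genotype combination) and deals each group out across the folds in turn. It is a generic stratification sketch under that assumption, not the authors' PICV implementation; the function name and genotype labels are hypothetical.

```python
import random
from collections import Counter, defaultdict

def proportional_folds(values, k, seed=0):
    """Assign sample indices to k folds, preserving the distribution of `values`."""
    rng = random.Random(seed)
    by_value = defaultdict(list)
    for index, value in enumerate(values):
        by_value[value].append(index)

    folds = [[] for _ in range(k)]
    offset = 0  # carries over between groups so fold sizes stay balanced
    for value in sorted(by_value):
        members = by_value[value]
        rng.shuffle(members)
        for position, index in enumerate(members):
            folds[(offset + position) % k].append(index)
        offset += len(members)
    return folds

# Hypothetical interaction genotypes: a common and a rare combination.
genotypes = ["AA/BB"] * 6 + ["Aa/Bb"] * 3
for fold in proportional_folds(genotypes, k=3):
    print(Counter(genotypes[i] for i in fold))
```

With purely random allocation, a rare genotype combination can end up concentrated in one partition or missing from another, which is the inconsistency the abstract describes.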
Evaluation of different sources of DNA for use in genome wide studies and forensic application.
Al Safar, Habiba S; Abidi, Fatima H; Khazanehdari, Kamal A; Dadour, Ian R; Tay, Guan K
2011-02-01
In the field of epidemiology, Genome-Wide Association Studies (GWAS) are commonly used to identify genetic predispositions of many human diseases. Large repositories housing biological specimens for clinical and genetic investigations have been established to store material and data for these studies. The logistics of specimen collection and sample storage can be onerous, and new strategies have to be explored. This study examines three different DNA sources (namely, degraded genomic DNA, amplified degraded genomic DNA and amplified extracted DNA from FTA card) for GWAS using the Illumina platform. No significant difference in call rate was detected between amplified degraded genomic DNA extracted from whole blood and amplified DNA retrieved from FTA™ cards. However, using unamplified-degraded genomic DNA reduced the call rate to a mean of 42.6% compared to amplified DNA extracted from FTA card (mean of 96.6%). This study establishes the utility of FTA™ cards as a viable storage matrix for cells from which DNA can be extracted to perform GWAS analysis.
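For readers unfamiliar with the metric, call rate is the fraction of attempted genotype calls that produced a genotype. The sketch below assumes missing calls are encoded as `None`; the function name and encoding are illustrative and are not taken from the study or from any Illumina software.

```python
def call_rate(genotypes, missing=None):
    """Return the fraction of genotype calls that are not missing.

    `genotypes` is a sequence of calls for one sample; a call equal to
    `missing` counts as a no-call (an encoding assumed for this example).
    """
    if not genotypes:
        raise ValueError("no genotypes supplied")
    called = sum(1 for g in genotypes if g != missing)
    return called / len(genotypes)


if __name__ == "__main__":
    sample = ["AA", "AB", None, "BB"]
    print(call_rate(sample))
```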
A Genome-Wide Association Study of the Human Metabolome in a Community-Based Cohort
Rhee, Eugene P.; Ho, Jennifer E.; Chen, Ming-Huei; Shen, Dongxiao; Cheng, Susan; Larson, Martin G.; Ghorbani, Anahita; Shi, Xu; Helenius, Iiro T.; O’Donnell, Christopher J.; Souza, Amanda L.; Deik, Amy; Pierce, Kerry A.; Bullock, Kevin; Walford, Geoffrey A.; Vasan, Ramachandran S.; Florez, Jose C.; Clish, Clary; Yeh, J.-R. Joanna; Wang, Thomas J.; Gerszten, Robert E.
2014-01-01
Because metabolites are hypothesized to play key roles as markers and effectors of cardio-metabolic diseases, recent studies have sought to annotate the genetic determinants of circulating metabolite levels. We report a genome-wide association study (GWAS) of 217 plasma metabolites, including >100 not measured in prior GWAS, in 2,076 participants of the Framingham Heart Study. For the majority of analytes, we find that estimated heritability explains >20% of inter-individual variation, and that variation attributable to heritable factors is greater than that attributable to clinical factors. Further, we identify 31 genetic loci associated with plasma metabolites, including 23 that have not previously been reported. Importantly, we include GWAS results for all surveyed metabolites, and demonstrate how this information highlights a role for AGXT2 in cholesterol ester and triacylglycerol metabolism. Thus, our study outlines the relative contributions of inherited and clinical factors on the plasma metabolome and provides a resource for metabolism research. PMID:23823483
Multi-trait analysis of genome-wide association summary statistics using MTAG.
Turley, Patrick; Walters, Raymond K; Maghzian, Omeed; Okbay, Aysu; Lee, James J; Fontana, Mark Alan; Nguyen-Viet, Tuan Anh; Wedow, Robbee; Zacher, Meghan; Furlotte, Nicholas A; Magnusson, Patrik; Oskarsson, Sven; Johannesson, Magnus; Visscher, Peter M; Laibson, David; Cesarini, David; Neale, Benjamin M; Benjamin, Daniel J
2018-02-01
We introduce multi-trait analysis of GWAS (MTAG), a method for joint analysis of summary statistics from genome-wide association studies (GWAS) of different traits, possibly from overlapping samples. We apply MTAG to summary statistics for depressive symptoms (N_eff = 354,862), neuroticism (N = 168,105), and subjective well-being (N = 388,538). Compared with the 32, 9, and 13 genome-wide significant loci identified in the single-trait GWAS (most of which are themselves novel), MTAG increases the number of associated loci to 64, 37, and 49, respectively. Moreover, association statistics from MTAG yield more informative bioinformatics analyses and increase the variance explained by polygenic scores by approximately 25%, matching theoretical expectations.
Roshandel, Delnaz; Gubitosi-Klug, Rose; Bull, Shelley B; Canty, Angelo J; Pezzolesi, Marcus G; King, George L; Keenan, Hillary A; Snell-Bergeon, Janet K; Maahs, David M; Klein, Ronald; Klein, Barbara E K; Orchard, Trevor J; Costacou, Tina; Weedon, Michael N; Oram, Richard A; Paterson, Andrew D
2018-05-01
The aim of this study was to identify genetic variants associated with beta cell function in type 1 diabetes, as measured by serum C-peptide levels, through meta-genome-wide association studies (meta-GWAS). We performed a meta-GWAS to combine the results from five studies in type 1 diabetes with cross-sectionally measured stimulated, fasting or random C-peptide levels, including 3479 European participants. The p values across studies were combined, taking into account sample size and direction of effect. We also performed separate meta-GWAS for stimulated (n = 1303), fasting (n = 2019) and random (n = 1497) C-peptide levels. In the meta-GWAS for stimulated/fasting/random C-peptide levels, a SNP on chromosome 1, rs559047 (Chr1:238753916, T>A, minor allele frequency [MAF] 0.24-0.26), was associated with C-peptide (p = 4.13 × 10^-8), meeting the genome-wide significance threshold (p < 5 × 10^-8). In the same meta-GWAS, a locus in the MHC region (rs9260151) was close to the genome-wide significance threshold (Chr6:29911030, C>T, MAF 0.07-0.10, p = 8.43 × 10^-8). In the stimulated C-peptide meta-GWAS, rs61211515 (Chr6:30100975, T/-, MAF 0.17-0.19) in the MHC region was associated with stimulated C-peptide (β [SE] = -0.39 [0.07], p = 9.72 × 10^-8). rs61211515 was also associated with the rate of stimulated C-peptide decline over time in a subset of individuals (n = 258) with annual repeated measures for up to 6 years (p = 0.02). In the meta-GWAS of random C-peptide, another MHC-region SNP, rs3135002 (Chr6:32668439, C>A, MAF 0.02-0.06), was associated with C-peptide (p = 3.49 × 10^-8). Conditional analyses suggested that the three identified variants in the MHC region were independent of each other. rs9260151 and rs3135002 have been associated with type 1 diabetes, whereas rs559047 and rs61211515 have not been associated with a risk of developing type 1 diabetes.
We identified a locus on chromosome 1 and multiple variants in the MHC region, at least some of which were distinct from type 1 diabetes risk loci, that were associated with C-peptide, suggesting partly non-overlapping mechanisms for the development and progression of type 1 diabetes. These associations need to be validated in independent populations. Further investigations could provide insights into mechanisms of beta cell loss and opportunities to preserve beta cell function.
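Combining p values while accounting for sample size and direction of effect is commonly done with a sample-size-weighted Z-score scheme. The sketch below assumes that scheme (two-sided p values, weights equal to the square root of each study's sample size); the abstract does not state the exact software or weighting used, and the function name and example numbers are illustrative.

```python
from math import copysign, sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def weighted_z_meta(p_values, directions, sample_sizes):
    """Combine per-study two-sided p values into one meta-analysis result.

    p_values     : two-sided p value from each study, each in (0, 1]
    directions   : +1 or -1, the sign of the effect in each study
    sample_sizes : number of participants in each study
    Returns (z_meta, p_meta). Assumed weighting: w_i = sqrt(n_i).
    """
    numerator = 0.0
    for p, direction, n in zip(p_values, directions, sample_sizes):
        # Convert the two-sided p value to a non-negative Z magnitude,
        # then attach the sign of the reported effect.
        z = -_STD_NORMAL.inv_cdf(p / 2.0)
        numerator += sqrt(n) * copysign(z, direction)

    # sqrt(sum of w_i^2) with w_i = sqrt(n_i) is sqrt(total sample size).
    z_meta = numerator / sqrt(sum(sample_sizes))
    p_meta = 2.0 * _STD_NORMAL.cdf(-abs(z_meta))
    return z_meta, p_meta


if __name__ == "__main__":
    # Made-up inputs, not data from the study.
    print(weighted_z_meta([0.01, 0.03, 0.20], [+1, +1, -1], [1300, 2000, 1500]))
```

Studies that agree in direction reinforce each other, while a study with the opposite sign pulls the combined Z score back toward zero in proportion to the square root of its size.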
Spindel, J E; Begum, H; Akdemir, D; Collard, B; Redoña, E; Jannink, J-L; McCouch, S
2016-01-01
To address the multiple challenges to food security posed by global climate change, population growth and rising incomes, plant breeders are developing new crop varieties that can enhance both agricultural productivity and environmental sustainability. Current breeding practices, however, are unable to keep pace with demand. Genomic selection (GS) is a new technique that helps accelerate the rate of genetic gain in breeding by using whole-genome data to predict the breeding value of offspring. Here, we describe a new GS model that combines RR-BLUP with markers fit as fixed effects selected from the results of a genome-wide-association study (GWAS) on the RR-BLUP training data. We term this model GS + de novo GWAS. In a breeding population of tropical rice, GS + de novo GWAS outperformed six other models for a variety of traits and in multiple environments. On the basis of these results, we propose an extended, two-part breeding design that can be used to efficiently integrate novel variation into elite breeding populations, thus expanding genetic diversity and enhancing the potential for sustainable productivity gains. PMID:26860200
Chang, Lun-Ching; Jamain, Stephane; Lin, Chien-Wei; Rujescu, Dan; Tseng, George C; Sibille, Etienne
2014-01-01
Large scale gene expression (transcriptome) analysis and genome-wide association studies (GWAS) for single nucleotide polymorphisms have generated a considerable amount of gene- and disease-related information, but heterogeneity and various sources of noise have limited the discovery of disease mechanisms. As systematic dataset integration is becoming essential, we developed methods and performed meta-clustering of gene coexpression links in 11 transcriptome studies from postmortem brains of human subjects with major depressive disorder (MDD) and non-psychiatric control subjects. We next sought enrichment in the top 50 meta-analyzed coexpression modules for genes otherwise identified by GWAS for various sets of disorders. One coexpression module of 88 genes was consistently and significantly associated with GWAS for MDD, other neuropsychiatric disorders and brain functions, and for medical illnesses with elevated clinical risk of depression, but not for other diseases. In support of the superior discriminative power of this novel approach, we observed no significant enrichment for GWAS-related genes in coexpression modules extracted from single studies or in meta-modules using gene expression data from non-psychiatric control subjects. Genes in the identified module encode proteins implicated in neuronal signaling and structure, including glutamate metabotropic receptors (GRM1, GRM7), GABA receptors (GABRA2, GABRA4), and neurotrophic and development-related proteins [BDNF, reelin (RELN), Ephrin receptors (EPHA3, EPHA5)]. These results are consistent with the current understanding of molecular mechanisms of MDD and provide a set of putative interacting molecular partners, potentially reflecting components of a functional module across cells and biological pathways that are synchronously recruited in MDD, other brain disorders and MDD-related illnesses. 
Collectively, this study demonstrates the importance of integrating transcriptome data, gene coexpression modules and GWAS results for providing novel and complementary approaches to investigate the molecular pathology of MDD and other complex brain disorders.
Novel genome-wide association study-based candidate loci for differentiated thyroid cancer risk.
Figlioli, Gisella; Köhler, Aleksandra; Chen, Bowang; Elisei, Rossella; Romei, Cristina; Cipollini, Monica; Cristaudo, Alfonso; Bambi, Franco; Paolicchi, Elisa; Hoffmann, Per; Herms, Stefan; Kalemba, Michał; Kula, Dorota; Pastor, Susana; Marcos, Ricard; Velázquez, Antonia; Jarząb, Barbara; Landi, Stefano; Hemminki, Kari; Försti, Asta; Gemignani, Federica
2014-10-01
Genome-wide association studies (GWASs) on differentiated thyroid cancer (DTC) have identified robust associations with single nucleotide polymorphisms (SNPs) at 9q22.33 (FOXE1), 14q13.3 (NKX2-1), and 2q35 (DIRC3). Our recently published GWAS suggested additional susceptibility loci specific for the high-incidence Italian population. The purpose of this study was to identify novel Italian-specific DTC risk variants based on our GWAS and to test them further in low-incidence populations. We investigated 45 SNPs selected from our GWAS first in an Italian population. SNPs that showed suggestive evidence of association were investigated in the Polish and Spanish cohorts. The combined analysis of the GWAS and the Italian replication study (2260 case patients and 2218 control subjects) provided strong evidence of association with rs10136427 near BATF (odds ratio [OR] = 1.40, P = 4.35 × 10^-7) and rs7267944 near DHX35 (OR = 1.39, P = 2.13 × 10^-8). A possible role in DTC susceptibility in the Italian populations was also found for rs13184587 (ARSB) (P = 8.54 × 10^-6) and rs1220597 (SPATA13) (P = 3.25 × 10^-6). Only the associations between rs10136427 and rs7267944 and DTC risk were replicated in the Polish and the Spanish populations with little evidence of population heterogeneity (GWAS and all replications combined, OR = 1.30, P = 9.30 × 10^-7 and OR = 1.32, P = 1.34 × 10^-8, respectively). In silico analyses provided new insights into the possible functional consequences of the SNPs that showed the strongest association with DTC. Our findings provide evidence for novel DTC susceptibility variants. Further studies are warranted to identify the specific genetic variants responsible for the observed associations and to functionally validate our in silico predictions.
Verma, Shefali S.; Hall, Molly A.; Goodloe, Robert J.; Berg, Richard L.; Carrell, Dave S.; Carlson, Christopher S.; Chen, Lin; Crosslin, David R.; Denny, Joshua C.; Jarvik, Gail; Li, Rongling; Linneman, James G.; Pathak, Jyoti; Peissig, Peggy; Rasmussen, Luke V.; Ramirez, Andrea H.; Wang, Xiaoming; Wilke, Russell A.; Wolf, Wendy A.; Torstenson, Eric S.; Turner, Stephen D.; McCarty, Catherine A.
2014-01-01
Purpose: Cataract is the leading cause of blindness in the world, and in the United States accounts for approximately 60% of Medicare costs related to vision. The purpose of this study was to identify genetic markers for age-related cataract through a genome-wide association study (GWAS). Methods: In the electronic medical records and genomics (eMERGE) network, we ran an electronic phenotyping algorithm on individuals in each of five sites with electronic medical records linked to DNA biobanks. We performed a GWAS using 530,101 SNPs from the Illumina 660W-Quad in a total of 7,397 individuals (5,503 cases and 1,894 controls). We also performed an age-at-diagnosis case-only analysis. Results: We identified several statistically significant associations with age-related cataract (45 SNPs) as well as age at diagnosis (44 SNPs). The 45 SNPs associated with cataract at p < 1 × 10^-5 are in several interesting genes, including ALDOB, MAP3K1, and MEF2C. All have potential biologic relationships with cataracts. Conclusions: This is the first genome-wide association study of age-related cataract, and several regions of interest have been identified. The eMERGE network has pioneered the exploration of genomic associations in biobanks linked to electronic health records, and this study is another example of the utility of such resources. Explorations of age-related cataract including validation and replication of the association results identified herein are needed in future studies. PMID:25352737
Multi-criteria decision making approaches for quality control of genome-wide association studies.
Malovini, Alberto; Rognoni, Carla; Puca, Annibale; Bellazzi, Riccardo
2009-03-01
Experimental errors in the genotyping phases of a Genome-Wide Association Study (GWAS) can lead to false positive findings and to spurious associations. An appropriate quality control phase could minimize the effects of these kinds of errors. Several filtering criteria can be used to perform quality control. Currently, no formal methods have been proposed for taking into account these criteria and the experimenter's preferences at the same time. In this paper we propose two strategies for setting appropriate genotyping rate thresholds for GWAS quality control. These two approaches are based on Multi-Criteria Decision Making theory. We have applied our method to a real dataset composed of 734 individuals affected by Arterial Hypertension (AH) and 486 nonagenarians without a history of AH. The proposed strategies appear to deal with GWAS quality control in a sound way, as they rationalize and make explicit the experimenter's choices, thus providing more reproducible results.
Espin‐Garcia, Osvaldo; Craiu, Radu V.
2017-01-01
We evaluate two-phase designs to follow up findings from genome-wide association studies (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation-maximization-based inference under a semiparametric maximum likelihood formulation tailored for post-GWAS inference. A GWAS-SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT-SNP-dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme-QT strata yields significant power improvements compared to marginal QT- or SNP-based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. PMID:29239496
Improved minimum cost and maximum power two stage genome-wide association study designs.
Stanhope, Stephen A; Skol, Andrew D
2012-01-01
In a two stage genome-wide association study (2S-GWAS), a sample of cases and controls is allocated into two groups, and genetic markers are analyzed sequentially with respect to these groups. For such studies, experimental design considerations have primarily focused on minimizing study cost as a function of the allocation of cases and controls to stages, subject to a constraint on the power to detect an associated marker. However, most treatments of this problem implicitly restrict the set of feasible designs to only those that allocate the same proportions of cases and controls to each stage. In this paper, we demonstrate that removing this restriction can improve the cost advantages demonstrated by previous 2S-GWAS designs by up to 40%. Additionally, we consider designs that maximize study power with respect to a cost constraint, and show that recalculated power maximizing designs can recover a substantial amount of the planned study power that might otherwise be lost if study funding is reduced. We provide open source software for calculating cost minimizing or power maximizing 2S-GWAS designs.
Brassinosteroid and gibberellin control of seedling traits in maize (Zea mays L.).
Hu, Songlin; Sanchez, Darlene L; Wang, Cuiling; Lipka, Alexander E; Yin, Yanhai; Gardner, Candice A C; Lübberstedt, Thomas
2017-10-01
In this study, we established two doubled haploid (DH) libraries with a total of 207 DH lines. We applied brassinosteroid (BR) and gibberellin (GA) inhibitors to all DH lines at seedling stage and measured seedling BR and GA inhibitor responses. Moreover, we evaluated field traits for each DH line (untreated). We conducted genome-wide association studies (GWAS) with 62,049 genome-wide SNPs to explore the genetic control of seedling traits by BR and GA. In addition, we correlated seedling-stage hormone inhibitor response with field traits. Large variation in BR and GA inhibitor response and field traits was observed across these DH lines. Seedling-stage BR and GA inhibitor response was significantly correlated with yield and flowering time. Using three different GWAS approaches to balance false positives and false negatives, multiple SNPs were discovered to be significantly associated with BR/GA inhibitor responses, with some localized within gene models. SNPs from gene model GRMZM2G013391 were associated with GA inhibitor response across all three GWAS models. This gene is expressed in roots and shoots and was shown to regulate GA signaling. These results show that BRs and GAs have a great impact on seedling growth. Gene models from GWAS results could be targets for seedling trait improvement. Copyright © 2017 Elsevier B.V. All rights reserved.
Genome-wide Pleiotropy Between Parkinson Disease and Autoimmune Diseases.
Witoelar, Aree; Jansen, Iris E; Wang, Yunpeng; Desikan, Rahul S; Gibbs, J Raphael; Blauwendraat, Cornelis; Thompson, Wesley K; Hernandez, Dena G; Djurovic, Srdjan; Schork, Andrew J; Bettella, Francesco; Ellinghaus, David; Franke, Andre; Lie, Benedicte A; McEvoy, Linda K; Karlsen, Tom H; Lesage, Suzanne; Morris, Huw R; Brice, Alexis; Wood, Nicholas W; Heutink, Peter; Hardy, John; Singleton, Andrew B; Dale, Anders M; Gasser, Thomas; Andreassen, Ole A; Sharma, Manu
2017-07-01
Recent genome-wide association studies (GWAS) and pathway analyses supported long-standing observations of an association between immune-mediated diseases and Parkinson disease (PD). The post-GWAS era provides an opportunity for cross-phenotype analyses between different complex phenotypes. The objective was to test the hypothesis that there are common genetic risk variants conveying risk of both PD and autoimmune diseases (ie, pleiotropy) and to identify new shared genetic variants and their pathways by applying a novel statistical framework in a genome-wide approach. Using the conjunction false discovery rate method, this study analyzed GWAS data from a selection of archetypal autoimmune diseases among 138 511 individuals of European ancestry and systemically investigated pleiotropy between PD and type 1 diabetes, Crohn disease, ulcerative colitis, rheumatoid arthritis, celiac disease, psoriasis, and multiple sclerosis. NeuroX data (6927 PD cases and 6108 controls) were used for replication. The study investigated the biological correlation between the top loci through protein-protein interaction and changes in the gene expression and methylation levels. The dates of the analysis were June 10, 2015, to March 4, 2017. The primary outcome was a list of novel loci and their pathways involved in PD and autoimmune diseases. Genome-wide conjunctional analysis identified 17 novel loci at false discovery rate less than 0.05 with overlap between PD and autoimmune diseases, including known PD loci adjacent to GAK, HLA-DRB5, LRRK2, and MAPT for rheumatoid arthritis, ulcerative colitis and Crohn disease. Replication confirmed the involvement of HLA, LRRK2, MAPT, TRIM10, and SETD1A in PD. Among the novel genes discovered, WNT3, KANSL1, CRHR1, BOLA2, and GUCY1A3 are within a protein-protein interaction network with known PD genes. A subset of novel loci was significantly associated with changes in methylation or expression levels of adjacent genes.
The study findings provide novel mechanistic insights into PD and autoimmune diseases and identify a common genetic pathway between these phenotypes. The results may have implications for future therapeutic trials involving anti-inflammatory agents.
Duell, Eric J.; Yu, Kai; Risch, Harvey A.; Olson, Sara H.; Kooperberg, Charles; Wolpin, Brian M.; Jiao, Li; Dong, Xiaoqun; Wheeler, Bill; Arslan, Alan A.; Bueno-de-Mesquita, H. Bas; Fuchs, Charles S.; Gallinger, Steven; Gross, Myron; Hartge, Patricia; Hoover, Robert N.; Holly, Elizabeth A.; Jacobs, Eric J.; Klein, Alison P.; LaCroix, Andrea; Mandelson, Margaret T.; Petersen, Gloria; Zheng, Wei; Agalliu, Ilir; Albanes, Demetrius; Boutron-Ruault, Marie-Christine; Bracci, Paige M.; Buring, Julie E.; Canzian, Federico; Chang, Kenneth; Chanock, Stephen J.; Cotterchio, Michelle; Gaziano, J.Michael; Giovannucci, Edward L.; Goggins, Michael; Hallmans, Göran; Hankinson, Susan E.; Hoffman Bolton, Judith A.; Hunter, David J.; Hutchinson, Amy; Jacobs, Kevin B.; Jenab, Mazda; Khaw, Kay-Tee; Kraft, Peter; Krogh, Vittorio; Kurtz, Robert C.; McWilliams, Robert R.; Mendelsohn, Julie B.; Patel, Alpa V.; Rabe, Kari G.; Riboli, Elio; Shu, Xiao-Ou; Tjønneland, Anne; Tobias, Geoffrey S.; Trichopoulos, Dimitrios; Virtamo, Jarmo; Visvanathan, Kala; Watters, Joanne; Yu, Herbert; Zeleniuch-Jacquotte, Anne; Stolzenberg-Solomon, Rachael Z.
2012-01-01
Four loci have been associated with pancreatic cancer through genome-wide association studies (GWAS). Pathway-based analysis of GWAS data is a complementary approach to identify groups of genes or biological pathways enriched with disease-associated single-nucleotide polymorphisms (SNPs) whose individual effect sizes may be too small to be detected by standard single-locus methods. We used the adaptive rank truncated product method in a pathway-based analysis of GWAS data from 3851 pancreatic cancer cases and 3934 control participants pooled from 12 cohort studies and 8 case–control studies (PanScan). We compiled 23 biological pathways hypothesized to be relevant to pancreatic cancer and observed a nominal association between pancreatic cancer and five pathways (P < 0.05), i.e. pancreatic development, Helicobacter pylori lacto/neolacto, hedgehog, Th1/Th2 immune response and apoptosis (P = 2.0 × 10^-6, 1.6 × 10^-5, 0.0019, 0.019 and 0.023, respectively). After excluding previously identified genes from the original GWAS in three pathways (NR5A2, ABO and SHH), the pancreatic development pathway remained significant (P = 8.3 × 10^-5), whereas the others did not. The most significant genes (P < 0.01) in the five pathways were NR5A2, HNF1A, HNF4G and PDX1 for pancreatic development; ABO for H. pylori lacto/neolacto; SHH for hedgehog; TGFBR2 and CCL18 for Th1/Th2 immune response and MAPK8 and BCL2L11 for apoptosis. Our results provide a link between inherited variation in genes important for pancreatic development and cancer and show that pathway-based approaches to analysis of GWAS data can yield important insights into the collective role of genetic risk variants in cancer. PMID:22523087
Graham, Deborah S Cunninghame; Pinder, Christopher L; Tombleson, Philip; Behrens, Timothy W; Martín, Javier; Fairfax, Benjamin P; Knight, Julian C; Chen, Lingyan; Replogle, Joseph; Syvänen, Ann-Christine; Rönnblom, Lars; Graham, Robert R; Wither, Joan E; Rioux, John D; Alarcón-Riquelme, Marta E; Vyse, Timothy J
2015-01-01
Systemic lupus erythematosus (SLE; OMIM 152700) is a genetically complex autoimmune disease characterized by loss of immune tolerance to nuclear and cell surface antigens. Previous genome-wide association studies (GWAS) had modest sample sizes, reducing their scope and reliability. Our study comprised 7,219 cases and 15,991 controls of European ancestry: a new GWAS, meta-analysis with a published GWAS and a replication study. We have mapped 43 susceptibility loci, including 10 novel associations. Assisted by dense genome coverage, imputation provided evidence for missense variants underpinning associations in eight genes. Other likely causal genes were established by examining associated alleles for cis-acting eQTL effects in a range of ex vivo immune cells. We found an over-representation (n=16) of transcription factors among SLE susceptibility genes. This supports the view that aberrantly regulated gene expression networks in multiple cell types in both the innate and adaptive immune response contribute to the risk of developing SLE. PMID:26502338
No association between telomere length-related loci and number of cutaneous nevi.
Li, Xin; Liang, Geyu; Du, Mengmeng; De Vivo, Immaculata; Nan, Hongmei
2016-12-13
Longer telomeres have been associated both with increased melanoma risk and increased nevus counts. Nevus count is one of the strongest risk factors for melanoma. Recent data showed that a genetic score derived from telomere length-related single nucleotide polymorphisms (SNPs) was strongly associated with melanoma risk; however, the relationships between these SNPs and the number of cutaneous nevi have not been investigated. We evaluated the associations between telomere length-related SNPs reported by previous genome-wide association studies (GWAS) and nevus counts among 15,955 participants of European ancestry in the Nurses' Health Study and Health Professionals Follow-up Study. None of the SNPs was associated with nevus counts, nor was the genetic score combining the dosage of alleles related to increased telomere length. The telomere length-related SNPs identified by published GWAS do not appear to play an important role in nevus formation. Genetic determinants of telomere length reported by GWAS do not explain the observed epidemiologic association between telomere length and nevus counts.
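A genetic score that combines allele dosages can be sketched as a (possibly weighted) sum over SNPs. The abstract does not say whether the score was weighted, so both variants are shown; the function name, SNP identifiers and weights below are hypothetical and are not taken from the study.

```python
def genetic_score(dosages, weights=None):
    """Sum the dosages of telomere-lengthening alleles for one participant.

    dosages : dict mapping SNP id -> number of copies (0 to 2, possibly
              fractional if imputed) of the allele linked to longer telomeres
    weights : optional dict mapping SNP id -> per-allele weight; if omitted,
              every SNP counts equally (an assumption for this sketch)
    """
    if weights is None:
        return sum(dosages.values())
    return sum(weights[snp] * dose for snp, dose in dosages.items())


if __name__ == "__main__":
    # Placeholder SNP names and weights, for illustration only.
    person = {"snp_a": 2, "snp_b": 1, "snp_c": 0}
    print(genetic_score(person))
    print(genetic_score(person, {"snp_a": 0.5, "snp_b": 0.2, "snp_c": 0.1}))
```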
The relationship between the human genome and microbiome comes into view
Goodrich, Julia K.; Davenport, Emily R.; Clark, Andrew G.; Ley, Ruth E.
2017-01-01
The microbiome’s involvement in health and disease, and the complexity of its composition and function, make it intriguing to consider human genetic factors that impact microbiome composition. Genes may influence health through their ability to promote a stable microbial community in the gut. Studies of heritability yield a consistent subset of microbes that are impacted by genes, but the use of genome-wide association studies (GWAS) to identify specific genetic variants associated with microbiota phenotypes has proven challenging. Processing microbiome datasets into traits to be modeled and reducing the burden of multiple testing are just some of the technical hurdles in microbiome GWAS. Studies to date are small by GWAS standards, making cross-study comparisons and validations particularly important in identifying authentic signals. Cross-study comparisons are hampered by differences in analytical approaches. Nevertheless, some consistent associations have emerged between populations, most notably between Bifidobacteria and the lactase non-persister genotype. These early successes open the way for the microbiome to be incorporated into studies that quantify interactions among genotype, environment, and the microbiome for predicting disease susceptibility. PMID:28934590
Peng, W-F; Xu, S-S; Ren, X; Lv, F-H; Xie, X-L; Zhao, Y-X; Zhang, M; Shen, Z-Q; Ren, Y-L; Gao, L; Shen, M; Kantanen, J; Li, M-H
2017-10-01
Genome-wide association studies (GWASs) have been widely applied in livestock to identify genes associated with traits of economic interest. Here, we conducted the first GWAS of the supernumerary nipple phenotype in Wadi sheep, a native Chinese sheep breed, based on Ovine Infinium HD SNP BeadChip genotypes in a total of 144 ewes (75 cases with four teats, including two normal and two supernumerary teats, and 69 control cases with two teats). We detected 63 significant SNPs at the chromosome-wise threshold. Additionally, one candidate region (chr1: 170.723-170.734 Mb) was identified by haplotype-based association tests, with one SNP (rs413490006) surrounding functional genes BBX and CD47 on chromosome 1 being commonly identified as significant by the two mentioned analyses. Moreover, Gene Ontology enrichment for the significant SNPs identified by the GWAS analysis was functionally clustered into the categories of receptor activity and synaptic membrane. In addition, pathway mapping revealed four promising pathways (Wnt, oxytocin, MAPK and axon guidance) involved in the development of the supernumerary nipple phenotype. Our results provide novel and important insights into the genetic mechanisms underlying the phenotype of supernumerary nipples in mammals, including humans. These findings may be useful for future breeding and genetics in sheep and other livestock. © 2017 Stichting International Foundation for Animal Genetics.
Polygenic dissection of diagnosis and clinical dimensions of bipolar disorder and schizophrenia.
Ruderfer, Douglas M; Fanous, Ayman H; Ripke, Stephan; McQuillin, Andrew; Amdur, Richard L; Gejman, Pablo V; O'Donovan, Michael C; Andreassen, Ole A; Djurovic, Srdjan; Hultman, Christina M; Kelsoe, John R; Jamain, Stephane; Landén, Mikael; Leboyer, Marion; Nimgaonkar, Vishwajit; Nurnberger, John; Smoller, Jordan W; Craddock, Nick; Corvin, Aiden; Sullivan, Patrick F; Holmans, Peter; Sklar, Pamela; Kendler, Kenneth S
2014-09-01
Bipolar disorder and schizophrenia are two often severe disorders with high heritabilities. Recent studies have demonstrated a large overlap of genetic risk loci between these disorders but diagnostic and molecular distinctions still remain. Here, we perform a combined genome-wide association study (GWAS) of 19 779 bipolar disorder (BP) and schizophrenia (SCZ) cases versus 19 423 controls, in addition to a direct comparison GWAS of 7129 SCZ cases versus 9252 BP cases. In our case-control analysis, we identify five previously identified regions reaching genome-wide significance (CACNA1C, IFI44L, MHC, TRANK1 and MAD1L1) and a novel locus near PIK3C2A. We create a polygenic risk score that is significantly different between BP and SCZ and show a significant correlation between a BP polygenic risk score and the clinical dimension of mania in SCZ patients. Our results indicate that first, combining diseases with similar genetic risk profiles improves power to detect shared risk loci and second, that future direct comparisons of BP and SCZ are likely to identify loci with significant differential effects. Identifying these loci should aid in the fundamental understanding of how these diseases differ biologically. These findings also indicate that combining clinical symptom dimensions and polygenic signatures could provide additional information that may someday be used clinically.
Amin Al Olama, Ali; Dadaev, Tokhir; Hazelett, Dennis J; Li, Qiuyan; Leongamornlert, Daniel; Saunders, Edward J; Stephens, Sarah; Cieza-Borrella, Clara; Whitmore, Ian; Benlloch Garcia, Sara; Giles, Graham G; Southey, Melissa C; Fitzgerald, Liesel; Gronberg, Henrik; Wiklund, Fredrik; Aly, Markus; Henderson, Brian E; Schumacher, Fredrick; Haiman, Christopher A; Schleutker, Johanna; Wahlfors, Tiina; Tammela, Teuvo L; Nordestgaard, Børge G; Key, Tim J; Travis, Ruth C; Neal, David E; Donovan, Jenny L; Hamdy, Freddie C; Pharoah, Paul; Pashayan, Nora; Khaw, Kay-Tee; Stanford, Janet L; Thibodeau, Stephen N; Mcdonnell, Shannon K; Schaid, Daniel J; Maier, Christiane; Vogel, Walther; Luedeke, Manuel; Herkommer, Kathleen; Kibel, Adam S; Cybulski, Cezary; Wokołorczyk, Dominika; Kluzniak, Wojciech; Cannon-Albright, Lisa; Brenner, Hermann; Butterbach, Katja; Arndt, Volker; Park, Jong Y; Sellers, Thomas; Lin, Hui-Yi; Slavov, Chavdar; Kaneva, Radka; Mitev, Vanio; Batra, Jyotsna; Clements, Judith A; Spurdle, Amanda; Teixeira, Manuel R; Paulo, Paula; Maia, Sofia; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Govindasami, Koveela; Guy, Michelle; Lophatonanon, Artitaya; Muir, Kenneth; Viñuela, Ana; Brown, Andrew A; Freedman, Mathew; Conti, David V; Easton, Douglas; Coetzee, Gerhard A; Eeles, Rosalind A; Kote-Jarai, Zsofia
2015-10-01
Genome-wide association studies (GWAS) have identified numerous common prostate cancer (PrCa) susceptibility loci. We have fine-mapped 64 GWAS regions known at the conclusion of the iCOGS study using large-scale genotyping and imputation in 25 723 PrCa cases and 26 274 controls of European ancestry. We detected evidence for multiple independent signals at 16 regions, 12 of which contained additional newly identified significant associations. A single signal comprising a spectrum of correlated variation was observed at 39 regions; 35 of which are now described by a novel more significantly associated lead SNP, while the originally reported variant remained as the lead SNP only in 4 regions. We also confirmed two association signals in Europeans that had been previously reported only in East-Asian GWAS. Based on statistical evidence and linkage disequilibrium (LD) structure, we have curated and narrowed down the list of the most likely candidate causal variants for each region. Functional annotation using data from ENCODE filtered for PrCa cell lines and eQTL analysis demonstrated significant enrichment for overlap with bio-features within this set. By incorporating the novel risk variants identified here alongside the refined data for existing association signals, we estimate that these loci now explain ∼38.9% of the familial relative risk of PrCa, an 8.9% improvement over the previously reported GWAS tag SNPs. This suggests that a significant fraction of the heritability of PrCa may have been hidden during the discovery phase of GWAS, in particular due to the presence of multiple independent signals within the same region. © The Author 2015. Published by Oxford University Press.
Dolejsi, Erich; Bodenstorfer, Bernhard; Frommlet, Florian
2014-01-01
The prevailing method of analyzing GWAS data is still to test each marker individually, although from a statistical point of view it is quite obvious that in case of complex traits such single marker tests are not ideal. Recently several model selection approaches for GWAS have been suggested, most of them based on LASSO-type procedures. Here we will discuss an alternative model selection approach which is based on a modification of the Bayesian Information Criterion (mBIC2) which was previously shown to have certain asymptotic optimality properties in terms of minimizing the misclassification error. Heuristic search strategies are introduced which attempt to find the model which minimizes mBIC2, and which are efficient enough to allow the analysis of GWAS data. Our approach is implemented in a software package called MOSGWA. Its performance in case control GWAS is compared with the two algorithms HLASSO and d-GWASelect, as well as with single marker tests, where we performed a simulation study based on real SNP data from the POPRES sample. Our results show that MOSGWA performs slightly better than HLASSO, where specifically for more complex models MOSGWA is more powerful with only a slight increase in Type I error. On the other hand according to our simulations GWASelect does not at all control the type I error when used to automatically determine the number of important SNPs. We also reanalyze the GWAS data from the Wellcome Trust Case-Control Consortium and compare the findings of the different procedures, where MOSGWA detects for complex diseases a number of interesting SNPs which are not found by other methods. PMID:25061809
iPat: intelligent prediction and association tool for genomic research.
Chen, Chunpeng James; Zhang, Zhiwu
2018-06-01
The ultimate goal of genomic research is to effectively predict phenotypes from genotypes so that medical management can improve human health and molecular breeding can increase agricultural production. Genomic prediction or selection (GS) plays a complementary role to genome-wide association studies (GWAS), which are the primary method to identify genes underlying phenotypes. Unfortunately, most computing tools cannot perform data analyses for both GWAS and GS. Furthermore, the majority of these tools are executed through a command-line interface (CLI), which requires programming skills. Non-programmers struggle to use them efficiently because of the steep learning curves and zero tolerance for data formats and mistakes when inputting keywords and parameters. To address these problems, this study developed a software package, named the Intelligent Prediction and Association Tool (iPat), with a user-friendly graphical user interface. With iPat, GWAS or GS can be performed using a pointing device to simply drag and/or click on graphical elements to specify input data files, choose input parameters and select analytical models. Models available to users include those implemented in third party CLI packages such as GAPIT, PLINK, FarmCPU, BLINK, rrBLUP and BGLR. Users can choose any data format and conduct analyses with any of these packages. File conversions are automatically conducted for specified input data and selected packages. A GWAS-assisted genomic prediction method was implemented to perform genomic prediction using any GWAS method such as FarmCPU. iPat was written in Java for adaptation to multiple operating systems including Windows, Mac and Linux. The iPat executable file, user manual, tutorials and example datasets are freely available at http://zzlab.net/iPat.
Smeland, Olav B; Wang, Yunpeng; Frei, Oleksandr; Li, Wen; Hibar, Derrek P; Franke, Barbara; Bettella, Francesco; Witoelar, Aree; Djurovic, Srdjan; Chen, Chi-Hua; Thompson, Paul M; Dale, Anders M; Andreassen, Ole A
2018-06-06
Schizophrenia (SCZ) is associated with differences in subcortical brain volumes and intracranial volume (ICV). However, little is known about the underlying etiology of these brain alterations. Here, we explored whether brain structure volumes and SCZ share genetic risk factors. Using conditional false discovery rate (FDR) analysis, we integrated genome-wide association study (GWAS) data on SCZ (n = 82315) and GWAS data on 7 subcortical brain volumes and ICV (n = 11840). By conditioning the FDR on overlapping associations, this statistical approach increases power to discover genetic loci. To assess the credibility of our approach, we studied the identified loci in larger GWAS samples on ICV (n = 26577) and hippocampal volume (n = 26814). We observed polygenic overlap between SCZ and volumes of hippocampus, putamen, and ICV. Based on conjunctional FDR < 0.05, we identified 2 loci shared between SCZ and ICV implicating genes FOXO3 (rs10457180) and ITIH4 (rs4687658), 2 loci shared between SCZ and hippocampal volume implicating SLC4A10 (rs4664442) and SPATS2L (rs1653290), and 2 loci shared between SCZ and volume of putamen implicating DCC (rs4632195) and DLG2 (rs11233632). The loci shared between SCZ and hippocampal volume or ICV had not reached significance in the primary GWAS on brain phenotypes. Proving our point of increased power, 2 loci did reach genome-wide significance with ICV (rs10457180) and hippocampal volume (rs4664442) in the larger GWAS. Three of the 6 identified loci are novel for SCZ. Altogether, the findings provide new insights into the relationship between SCZ and brain structure volumes, suggesting that their genetic architectures are not independent.
Trampush, J W; Yang, M L Z; Yu, J; Knowles, E; Davies, G; Liewald, D C; Starr, J M; Djurovic, S; Melle, I; Sundet, K; Christoforou, A; Reinvang, I; DeRosse, P; Lundervold, A J; Steen, V M; Espeseth, T; Räikkönen, K; Widen, E; Palotie, A; Eriksson, J G; Giegling, I; Konte, B; Roussos, P; Giakoumaki, S; Burdick, K E; Payton, A; Ollier, W; Horan, M; Chiba-Falek, O; Attix, D K; Need, A C; Cirulli, E T; Voineskos, A N; Stefanis, N C; Avramopoulos, D; Hatzimanolis, A; Arking, D E; Smyrnis, N; Bilder, R M; Freimer, N A; Cannon, T D; London, E; Poldrack, R A; Sabb, F W; Congdon, E; Conley, E D; Scult, M A; Dickinson, D; Straub, R E; Donohoe, G; Morris, D; Corvin, A; Gill, M; Hariri, A R; Weinberger, D R; Pendleton, N; Bitsios, P; Rujescu, D; Lahti, J; Le Hellard, S; Keller, M C; Andreassen, O A; Deary, I J; Glahn, D C; Malhotra, A K; Lencz, T
2017-01-01
The complex nature of human cognition has resulted in cognitive genomics lagging behind many other fields in terms of gene discovery using genome-wide association study (GWAS) methods. In an attempt to overcome these barriers, the current study utilized GWAS meta-analysis to examine the association of common genetic variation (~8M single-nucleotide polymorphisms (SNP) with minor allele frequency ⩾1%) to general cognitive function in a sample of 35 298 healthy individuals of European ancestry across 24 cohorts in the Cognitive Genomics Consortium (COGENT). In addition, we utilized individual SNP lookups and polygenic score analyses to identify genetic overlap with other relevant neurobehavioral phenotypes. Our primary GWAS meta-analysis identified two novel SNP loci (top SNPs: rs76114856 in the CENPO gene on chromosome 2 and rs6669072 near LOC105378853 on chromosome 1) associated with cognitive performance at the genome-wide significance level (P<5 × 10−8). Gene-based analysis identified an additional three Bonferroni-corrected significant loci at chromosomes 17q21.31, 17p13.1 and 1p13.3. Altogether, common variation across the genome resulted in a conservatively estimated SNP heritability of 21.5% (s.e.=0.01%) for general cognitive function. Integration with prior GWAS of cognitive performance and educational attainment yielded several additional significant loci. Finally, we found robust polygenic correlations between cognitive performance and educational attainment, several psychiatric disorders, birth length/weight and smoking behavior, as well as a novel genetic association to the personality trait of openness. These data provide new insight into the genetics of neurocognitive function with relevance to understanding the pathophysiology of neuropsychiatric illness. PMID:28093568
Fanous, Ayman H; Zhou, Baiyu; Aggen, Steven H; Bergen, Sarah E; Amdur, Richard L; Duan, Jubao; Sanders, Alan R; Shi, Jianxin; Mowry, Bryan J; Olincy, Ann; Amin, Farooq; Cloninger, C Robert; Silverman, Jeremy M; Buccola, Nancy G; Byerley, William F; Black, Donald W; Freedman, Robert; Dudbridge, Frank; Holmans, Peter A; Ripke, Stephan; Gejman, Pablo V; Kendler, Kenneth S; Levinson, Douglas F
2012-12-01
Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia. Based on the Lifetime Dimensions of Psychosis Scale ratings of 2,454 case subjects of European ancestry from the Molecular Genetics of Schizophrenia (MGS) sample, three symptom factors (positive, negative/disorganized, and mood) were identified with exploratory factor analysis. Quantitative scores for each factor from a confirmatory factor analysis were analyzed for association with 696,491 single-nucleotide polymorphisms (SNPs) using linear regression, with correction for age, sex, clinical site, and ancestry. Polygenic score analysis was carried out to determine whether case and comparison subjects in 16 Psychiatric GWAS Consortium (PGC) schizophrenia samples (excluding MGS samples) differed in scores computed by weighting their genotypes by MGS association test results for each symptom factor. No genome-wide significant associations were observed between SNPs and factor scores. Most of the SNPs producing the strongest evidence for association were in or near genes involved in neurodevelopment, neuroprotection, or neurotransmission, including genes playing a role in Mendelian CNS diseases, but no statistically significant effect was observed for any defined gene pathway. Finally, polygenic scores based on MGS GWAS results for the negative/disorganized factor were significantly different between case and comparison subjects in the PGC data set; for MGS subjects, negative/disorganized factor scores were correlated with polygenic scores generated using case-control GWAS results from the other PGC samples. 
The polygenic signal that has been observed in cross-sample analyses of schizophrenia GWAS data sets could be in part related to genetic effects on negative and disorganized symptoms (i.e., core features of chronic schizophrenia).
Deciphering the distance to antibiotic resistance for the pneumococcus using genome sequencing data
Mobegi, Fredrick M.; Cremers, Amelieke J. H.; de Jonge, Marien I.; Bentley, Stephen D.; van Hijum, Sacha A. F. T.; Zomer, Aldert
2017-01-01
Advances in genome sequencing technologies and genome-wide association studies (GWAS) have provided unprecedented insights into the molecular basis of microbial phenotypes and enabled the identification of the underlying genetic variants in real populations. However, utilization of genome sequencing in clinical phenotyping of bacteria is challenging due to the lack of reliable and accurate approaches. Here, we report a method for predicting microbial resistance patterns using genome sequencing data. We analyzed whole genome sequences of 1,680 Streptococcus pneumoniae isolates from four independent populations using GWAS and identified probable hotspots of genetic variation which correlate with phenotypes of resistance to essential classes of antibiotics. With the premise that accumulation of putative resistance-conferring SNPs, potentially in combination with specific resistance genes, precedes full resistance, we retrogressively surveyed the hotspot loci and quantified the number of SNPs and/or genes, which if accumulated would confer full resistance to an otherwise susceptible strain. We name this approach the ‘distance to resistance’. It can be used to identify the creep towards complete antibiotic resistance in bacteria using genome sequencing. This approach serves as a basis for the development of future sequencing-based methods for predicting resistance profiles of bacterial strains in hospital microbiology and public health settings. PMID:28205635
Advances in molecular genetic studies of attention deficit hyperactivity disorder in China
GAO, Qian; LIU, Lu; QIAN, Qiujin; WANG, Yufeng
2014-01-01
Attention deficit hyperactivity disorder (ADHD) is a common psychiatric condition in children worldwide that typically includes a combination of symptoms of inattention and hyperactivity/impulsivity. Genetic factors are believed to be important in the development and course of ADHD, so many candidate gene studies and genome-wide association studies (GWAS) have been conducted in search of the genetic mechanisms that cause or influence the condition. This review provides an overview of gene association and pharmacogenetic studies of ADHD from mainland China and elsewhere that use Han Chinese samples. To date, studies from China and elsewhere remain inconclusive, so future studies need to consider alternative analytic techniques and test new biological hypotheses about the relationship of neurotransmission and neurodevelopment to the onset and course of this disabling condition. PMID:25317006
Accounting for linkage disequilibrium in association analysis of diverse populations.
Charles, Bashira A; Shriner, Daniel; Rotimi, Charles N
2014-04-01
The National Human Genome Research Institute's catalog of published genome-wide association studies (GWAS) lists over 10,000 genetic variants collectively associated with over 800 human diseases or traits. Most of these GWAS have been conducted in European-ancestry populations. Findings gleaned from these studies have led to identification of disease-associated loci and biologic pathways involved in disease etiology. In multiple instances, these genomic findings have led to the development of novel medical therapies or evidence for prescribing a given drug as the appropriate treatment for a given individual beyond phenotypic appearances or socially defined constructs of race or ethnicity. Such findings have implications for populations throughout the globe and GWAS are increasingly being conducted in more diverse populations. A major challenge for investigators seeking to follow up genomic findings between diverse populations is discordant patterns of linkage disequilibrium (LD). We provide an overview of common measures of LD and opportunities for their use in novel methods designed to address challenges associated with following up GWAS conducted in European-ancestry populations in African-ancestry populations or, more generally, between populations with discordant LD patterns. We detail the strengths and weaknesses associated with different approaches. We also describe application of these strategies in follow-up studies of populations with concordant LD patterns (replication) or discordant LD patterns (transferability) as well as fine-mapping studies. We review application of these methods to a variety of traits and diseases. © 2014 WILEY PERIODICALS, INC.
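For readers unfamiliar with the "common measures of LD" this abstract refers to, the following sketch computes the three standard pairwise statistics (D, D' and r²) for two biallelic loci from haplotype and allele frequencies. The abstract does not list which measures the review covers, so treating these three as the relevant ones is an assumption; the function name `ld_measures` and the example frequencies are hypothetical.

```python
def ld_measures(p_ab, p_a, p_b):
    """Pairwise linkage disequilibrium statistics for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A at locus 1 and B at locus 2.
    p_a, p_b: frequencies of allele A and allele B.
    Returns (D, D_prime, r_squared).
    """
    # D: deviation of the observed haplotype frequency from independence.
    d = p_ab - p_a * p_b
    # D_max: largest |D| attainable given the allele frequencies and the sign of D.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    # r^2: squared correlation between the alleles at the two loci.
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r_squared = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r_squared


# Made-up frequencies for illustration only.
d, d_prime, r_squared = ld_measures(p_ab=0.4, p_a=0.5, p_b=0.5)
```

Because these statistics depend on population-specific haplotype frequencies, the same pair of SNPs can give a high r² in one population and a low r² in another, which is the discordance the abstract describes.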
Otto, Lars-Gernot; Mondal, Prodyut; Brassac, Jonathan; Preiss, Susanne; Degenhardt, Jörg; He, Sang; Reif, Jochen Christoph; Sharbel, Timothy Francis
2017-08-10
Chamomile (Matricaria recutita L.) has a long history of use in herbal medicine with various applications, and the flower heads contain numerous secondary metabolites which are medicinally active. In the major crop plants, next generation sequencing (NGS) approaches are intensely applied to exploit genetic resources, to develop genomic resources and to enhance breeding. Here, genotyping-by-sequencing (GBS) has been used in the non-model medicinal plant chamomile to evaluate the genetic structure of the cultivated varieties/populations, and to perform genome wide association study (GWAS) focusing on genes with large effect on flowering time and the medicinally important alpha-bisabolol content. GBS analysis allowed the identification of 6495 high-quality SNP-markers in our panel of 91 M. recutita plants from 33 origins (2-4 genotypes each) and 4 M. discoidea plants as outgroup, grown in the greenhouse in Gatersleben, Germany. M. recutita proved to be clearly distinct from the outgroup, as was demonstrated by different cluster and principal coordinate analyses using the SNP-markers. Chamomile genotypes from the same origin were mostly genetically similar. Model-based cluster analysis revealed one large group of tetraploid genotypes with low genetic differentiation including 39 plants from 14 origins. Tetraploids tended to display lower genetic diversity than diploids, probably reflecting their origin by artificial polyploidisation from only a limited set of genetic backgrounds. Analyses of flowering time demonstrated that diploids generally flowered earlier than tetraploids, and the analysis of alpha-bisabolol identified several tetraploid genotypes with a high content. GWAS identified highly significant (P < 0.01) SNPs for flowering time (9) and alpha-bisabolol (71). 
One sequence harbouring SNPs associated with flowering time was described to play a role in self-pollination in Arabidopsis thaliana, whereas four sequences harbouring SNPs associated with alpha-bisabolol were identified to be involved in plant biotic and abiotic stress response in various plants species. The first genomic resource for future applications to enhance breeding in chamomile was created, and analyses of diversity will facilitate the exploitation of these genetic resources. The GWAS data pave the way for future research towards the genetics underlying important traits in chamomile, the identification of marker-trait associations, and development of reliable markers for practical breeding.
Galesloot, Tessel E.; van Dijk, Freerk; Geurts-Moespot, Anneke J.; Girelli, Domenico; Kiemeney, Lambertus A. L. M.; Sweep, Fred C. G. J.; Swertz, Morris A.; van der Meer, Peter; Camaschella, Clara; Toniolo, Daniela; Vermeulen, Sita H.; van der Harst, Pim; Swinkels, Dorine W.
2016-01-01
Serum hepcidin concentration is regulated by iron status, inflammation, erythropoiesis and numerous other factors, but underlying processes are incompletely understood. We studied the association of common and rare single nucleotide variants (SNVs) with serum hepcidin in one Italian study and two large Dutch population-based studies. We genotyped common SNVs with genome-wide association study (GWAS) arrays and subsequently performed imputation using the 1000 Genomes reference panel. Cohort-specific GWAS were performed for log-transformed serum hepcidin, adjusted for age and gender, and results were combined in a fixed-effects meta-analysis (total N 6,096). Six top SNVs (p<5 × 10−6) were genotyped in 3,821 additional samples, but associations were not replicated. Furthermore, we meta-analyzed cohort-specific exome array association results of rare SNVs with serum hepcidin that were available for two of the three cohorts (total N 3,226), but no exome-wide significant signal (p<1.4 × 10−6) was identified. Gene-based meta-analyses revealed 19 genes that showed significant association with hepcidin. Our results suggest the absence of common SNVs and rare exonic SNVs explaining a large proportion of phenotypic variation in serum hepcidin. We recommend extension of our study once additional substantial cohorts with hepcidin measurements, GWAS and/or exome array data become available in order to increase power to identify variants that explain a smaller proportion of hepcidin variation. In addition, we encourage follow-up of the potentially interesting genes that resulted from the gene-based analysis of low-frequency and rare variants. PMID:27846281
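The fixed-effects meta-analysis step described in this abstract can be sketched as follows. The abstract does not state the weighting scheme, so the code assumes inverse-variance weighting, the usual choice for combining per-cohort effect estimates of a single variant. The function name `fixed_effects_meta` and the input values are hypothetical.

```python
import math


def fixed_effects_meta(betas, ses):
    """Inverse-variance-weighted fixed-effects meta-analysis for one variant.

    betas: per-cohort effect estimates.
    ses:   per-cohort standard errors (all > 0).
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and of equal length")
    # Each cohort is weighted by the inverse of its sampling variance.
    weights = [1.0 / (se * se) for se in ses]
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal null distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Two hypothetical cohorts: the more precise estimate dominates the pooled result.
pooled_beta, pooled_se, z, p = fixed_effects_meta([0.10, 0.20], [0.05, 0.10])
```

A fixed-effects model assumes every cohort estimates the same underlying effect; when cohorts are heterogeneous, a random-effects model is the usual alternative.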
Chen, Fang; He, Jing; Zhang, Jianqi; Chen, Gary K.; Thomas, Venetta; Ambrosone, Christine B.; Bandera, Elisa V.; Berndt, Sonja I.; Bernstein, Leslie; Blot, William J.; Cai, Qiuyin; Carpten, John; Casey, Graham; Chanock, Stephen J.; Cheng, Iona; Chu, Lisa; Deming, Sandra L.; Driver, W. Ryan; Goodman, Phyllis; Hayes, Richard B.; Hennis, Anselm J. M.; Hsing, Ann W.; Hu, Jennifer J.; Ingles, Sue A.; John, Esther M.; Kittles, Rick A.; Kolb, Suzanne; Leske, M. Cristina; Monroe, Kristine R.; Murphy, Adam; Nemesure, Barbara; Neslund-Dudas, Christine; Nyante, Sarah; Ostrander, Elaine A; Press, Michael F.; Rodriguez-Gil, Jorge L.; Rybicki, Ben A.; Schumacher, Fredrick; Stanford, Janet L.; Signorello, Lisa B.; Strom, Sara S.; Stevens, Victoria; Van Den Berg, David; Wang, Zhaoming; Witte, John S.; Wu, Suh-Yuh; Yamamura, Yuko; Zheng, Wei; Ziegler, Regina G.; Stram, Alexander H.; Kolonel, Laurence N.; Marchand, Loïc Le; Henderson, Brian E.; Haiman, Christopher A.; Stram, Daniel O.
2015-01-01
Height has an extremely polygenic pattern of inheritance. Genome-wide association studies (GWAS) have revealed hundreds of common variants that are associated with human height at genome-wide levels of significance. However, only a small fraction of phenotypic variation can be explained by the aggregate of these common variants. In a large study of African-American men and women (n = 14,419), we genotyped and analyzed 966,578 autosomal SNPs across the entire genome using a linear mixed model variance components approach implemented in the program GCTA (Yang et al Nat Genet 2010), and estimated an additive heritability of 44.7% (se: 3.7%) for this phenotype in a sample of evidently unrelated individuals. While this estimated value is similar to that given by Yang et al in their analyses, we remain concerned about two related issues: (1) whether in the complete absence of hidden relatedness, variance components methods have adequate power to estimate heritability when a very large number of SNPs are used in the analysis; and (2) whether estimation of heritability may be biased, in real studies, by low levels of residual hidden relatedness. We addressed the first question in a semi-analytic fashion by directly simulating the distribution of the score statistic for a test of zero heritability with and without low levels of relatedness. The second question was addressed by a very careful comparison of the behavior of estimated heritability for both observed (self-reported) height and simulated phenotypes compared to imputation R2 as a function of the number of SNPs used in the analysis. 
These simulations help to address the important question about whether today's GWAS SNPs will remain useful for imputing causal variants that are discovered using very large sample sizes in future studies of height, or whether the causal variants themselves will need to be genotyped de novo in order to build a prediction model that ultimately captures a large fraction of the variability of height, and by implication other complex phenotypes. Our overall conclusions are that when study sizes are quite large (5,000 or so) the additive heritability estimate for height is not apparently biased upwards using the linear mixed model; however there is evidence in our simulation that a very large number of causal variants (many thousands) each with very small effect on phenotypic variance will need to be discovered to fill the gap between the heritability explained by known versus unknown causal variants. We conclude that today's GWAS data will remain useful in the future for causal variant prediction, but that finding the causal variants that need to be predicted may be extremely laborious. PMID:26125186
A comprehensive survey of genetic variation in 20,691 subjects from four large cohorts
Loomis, Stephanie; Turman, Constance; Huang, Hongyan; Huang, Jinyan; Aschard, Hugues; Chan, Andrew T.; Choi, Hyon; Cornelis, Marilyn; Curhan, Gary; De Vivo, Immaculata; Eliassen, A. Heather; Fuchs, Charles; Gaziano, Michael; Hankinson, Susan E.; Hu, Frank; Jensen, Majken; Kang, Jae H.; Kabrhel, Christopher; Liang, Liming; Pasquale, Louis R.; Rimm, Eric; Stampfer, Meir J.; Tamimi, Rulla M.; Tworoger, Shelley S.; Wiggs, Janey L.; Hunter, David J.; Kraft, Peter
2017-01-01
The Nurses’ Health Study (NHS), Nurses’ Health Study II (NHSII), Health Professionals Follow Up Study (HPFS) and the Physicians Health Study (PHS) have collected detailed longitudinal data on multiple exposures and traits for approximately 310,000 study participants over the last 35 years. Over 160,000 study participants across the cohorts have donated a DNA sample and to date, 20,691 subjects have been genotyped as part of genome-wide association studies (GWAS) of twelve primary outcomes. However, these studies utilized six different GWAS arrays, making it difficult to conduct analyses of secondary phenotypes or share controls across studies. To allow for secondary analyses of these data, we have created three new datasets merged by platform family and performed imputation using a common reference panel, the 1,000 Genomes Phase I release. Here, we describe the methodology behind the data merging and imputation and present imputation quality statistics and association results from two GWAS of secondary phenotypes (body mass index (BMI) and venous thromboembolism (VTE)). We observed the strongest BMI association for the FTO SNP rs55872725 (β = 0.45, p = 3.48 × 10^-22), and using a significance level of p = 0.05, we replicated 19 out of 32 known BMI SNPs. For VTE, we observed the strongest association for the rs2040445 SNP (OR = 2.17, 95% CI: 1.79–2.63, p = 2.70 × 10^-15), located downstream of F5, and also observed significant associations for the known ABO and F11 regions. This pooled resource can be used to maximize power in GWAS of phenotypes collected across the cohorts and for studying gene-environment interactions as well as rare phenotypes and genotypes. PMID:28301549
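As a quick consistency check on summary statistics such as the VTE result above, the standard error of a log odds ratio can be backed out from its 95% confidence interval, which in turn gives a Wald z-score and a two-sided p-value. The sketch below assumes a symmetric Wald interval on the log scale; the function name is illustrative and does not come from the study.

```python
import math


def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover the log-OR standard error, z-score and two-sided p-value
    from an odds ratio and its 95% CI (assumes a Wald interval)."""
    log_or = math.log(odds_ratio)
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = log_or / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal tail area
    return se, z, p


# Rounded values reported for rs2040445 in the abstract.
se, z, p = wald_from_or_ci(2.17, 1.79, 2.63)
print(f"SE={se:.4f}, z={z:.2f}, p={p:.2e}")
```

With these rounded inputs the p-value comes out on the order of 10^-15, in line with the published figure; small differences are expected because the odds ratio and interval are reported to two decimals.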
Su, Junji; Li, Libei; Zhang, Chi; Wang, Caixiang; Gu, Lijiao; Wang, Hantao; Wei, Hengling; Liu, Qibao; Huang, Long; Yu, Shuxun
2018-06-01
Thirty significant associations between 22 SNPs and five plant architecture component traits in Chinese upland cotton were identified via GWAS. Four peak SNP loci located on chromosome D03 were simultaneously associated with multiple plant architecture component traits. A candidate gene, Gh_D03G0922, might be responsible for plant height in upland cotton. A compact plant architecture is increasingly required for mechanized harvesting processes in China. Therefore, cotton plant architecture is an important trait, and its components, such as plant height (PH), fruit branch length and fruit branch angle, affect the suitability of a cultivar for mechanized harvesting. To determine the genetic basis of cotton plant architecture, a genome-wide association study (GWAS) was performed using a panel composed of 355 accessions and 93,250 single nucleotide polymorphisms (SNPs) identified using the specific-locus amplified fragment sequencing method. Thirty significant associations between 22 SNPs and five plant architecture component traits were identified via GWAS. Most importantly, four peak SNP loci located on chromosome D03 were simultaneously associated with multiple plant architecture component traits, and these SNPs were harbored in one linkage disequilibrium block. Furthermore, 21 candidate genes for plant architecture were predicted in a 0.95-Mb region including the four peak SNPs. One of these genes (Gh_D03G0922) was near the significant SNP D03_31584163 (8.40 kb), and its Arabidopsis homologs contain MADS-box domains that might be involved in plant growth and development. qRT-PCR showed that the expression of Gh_D03G0922 was upregulated in the apical buds and young leaves of the short and compact cotton varieties, and virus-induced gene silencing (VIGS) showed that the silenced plants exhibited increased PH. These results indicate that Gh_D03G0922 is likely the candidate gene for PH in cotton.
The genetic variations and candidate genes identified in this study lay a foundation for cultivating moderately short and compact varieties in future Chinese cotton-breeding programs.
Integrated Post-GWAS Analysis Sheds New Light on the Disease Mechanisms of Schizophrenia
Lin, Jhih-Rong; Cai, Ying; Zhang, Quanwei; Zhang, Wen; Nogales-Cadenas, Rubén; Zhang, Zhengdong D.
2016-01-01
Schizophrenia is a severe mental disorder with a large genetic component. Recent genome-wide association studies (GWAS) have identified many schizophrenia-associated common variants. For most of the reported associations, however, the underlying biological mechanisms are not clear. The critical first step for their elucidation is to identify the most likely disease genes as the source of the association signals. Here, we describe a general computational framework of post-GWAS analysis for complex disease gene prioritization. We identify 132 putative schizophrenia risk genes in 76 risk regions spanning 120 schizophrenia-associated common variants, 78 of which have not been recognized as schizophrenia disease genes by previous GWAS. Even more significantly, 29 of them are outside the risk regions, likely under regulation of transcriptional regulatory elements contained therein. These putative schizophrenia risk genes are transcriptionally active in both brain and the immune system, and highly enriched among cellular pathways, consistent with leading pathophysiological hypotheses about the pathogenesis of schizophrenia. With their involvement in distinct biological processes, these putative schizophrenia risk genes, with different association strengths, show distinctive temporal expression patterns, and play specific biological roles during brain development. PMID:27754856
Deng, Yangqing; Pan, Wei
2018-06-01
Due to issues of practicality and confidentiality of genomic data sharing on a large scale, typically only meta- or mega-analyzed genome-wide association study (GWAS) summary data, not individual-level data, are publicly available. Reanalyses of such GWAS summary data for a wide range of applications have become more and more common and useful, which often require the use of an external reference panel with individual-level genotypic data to infer linkage disequilibrium (LD) among genetic variants. However, with a small sample size in only hundreds, as for the most popular 1000 Genomes Project European sample, estimation errors for LD are not negligible, leading to often dramatically increased numbers of false positives in subsequent analyses of GWAS summary data. To alleviate the problem in the context of association testing for a group of SNPs, we propose an alternative estimator of the covariance matrix with an idea similar to multiple imputation. We use numerical examples based on both simulated and real data to demonstrate the severe problem with the use of the 1000 Genomes Project reference panels, and the improved performance of our new approach. Copyright © 2018 by the Genetics Society of America.
Missing data imputation and haplotype phase inference for genome-wide association studies
Browning, Sharon R.
2009-01-01
Imputation of missing data and the use of haplotype-based association tests can improve the power of genome-wide association studies (GWAS). In this article, I review methods for haplotype inference and missing data imputation, and discuss their application to GWAS. I discuss common features of the best algorithms for haplotype phase inference and missing data imputation in large-scale data sets, as well as some important differences between classes of methods, and highlight the methods that provide the highest accuracy and fastest computational performance. PMID:18850115
Genetic Risk Variants for Social Anxiety
Stein, Murray B.; Chen, Chia-Yen; Jain, Sonia; Jensen, Kevin P.; He, Feng; Heeringa, Steven G.; Kessler, Ronald C.; Maihofer, Adam; Nock, Matthew K.; Ripke, Stephan; Sun, Xiaoying; Thomas, Michael L.; Ursano, Robert J.; Smoller, Jordan W.; Gelernter, Joel
2017-01-01
Social anxiety is a neurobehavioral trait characterized by fear and reticence in social situations. Twin studies have shown that social anxiety has a heritable basis, shared with neuroticism and extraversion, but genetic studies have yet to demonstrate robust risk variants. We conducted genomewide association analysis (GWAS) of subjects within the Army Study To Assess Risk and Resilience in Service members (Army STARRS) to (1) determine SNP-based heritability of social anxiety; (2) discern genetic risk loci for social anxiety; and (3) determine shared genetic risk with neuroticism and extraversion. GWAS were conducted within ancestral groups (EUR, AFR, LAT) using linear regression models for each of the 3 component studies in Army STARRS, and then meta-analyzed across studies. SNP-based heritability for social anxiety was significant (h²g = 0.12, p = 2.17 × 10^-4 in EUR). One meta-analytically genomewide significant locus was seen in each of EUR (rs708012, Chr 6: BP 36965970, p = 1.55 × 10^-8; beta = 0.073) and AFR (rs78924501, Chr 1: BP 88406905, p = 3.58 × 10^-8; beta = 0.265) samples. Social anxiety in Army STARRS was significantly genetically correlated (negatively) with extraversion (rg = -0.52, se = 0.22, p = 0.02) but not with neuroticism (rg = 0.05, se = 0.22, p = 0.81) or with an anxiety disorder factor score (rg = 0.02, se = 0.32, p = 0.94) from external GWAS meta-analyses. This first GWAS of social anxiety confirms a genetic basis for social anxiety, shared with extraversion but possibly less so with neuroticism. PMID:28224735
2012-01-01
Background Genome-wide association studies (GWAS) do not provide a full account of the heritability of genetic diseases, since gene-gene interactions, also known as epistasis, are not considered in single-locus GWAS. To address this problem, a considerable number of methods have been developed for identifying disease-associated gene-gene interactions. However, these methods typically fail to identify interacting markers that explain more of the disease heritability than single-locus GWAS, since many of the interactions significant for disease are obscured by uninformative marker interactions, e.g., linkage disequilibrium (LD). Results In this study, we present a novel SNP interaction prioritization algorithm, named iLOCi (Interacting Loci). This algorithm accounts for marker dependencies separately in case and control groups. Disease-associated interactions are then prioritized according to a novel ranking score calculated from the difference in marker dependencies for every possible pair between case and control groups. The analysis of a typical GWAS dataset can be completed in less than a day on a standard workstation with parallel processing capability. The proposed framework was validated using simulated data and applied to real GWAS datasets using the Wellcome Trust Case Control Consortium (WTCCC) data. The results from simulated data showed the ability of iLOCi to identify various types of gene-gene interactions, especially high-order interactions. From the WTCCC data, we found that among the top-ranked interacting SNP pairs, several mapped to genes previously known to be associated with disease and, interestingly, others mapped to previously unreported genes with biologically related roles. Conclusion iLOCi is a powerful tool for uncovering true disease interacting markers and thus can provide a more complete understanding of the genetic basis underlying complex disease. The program is available for download at http://www4a.biotec.or.th/GI/tools/iloci. PMID:23281813
Age at menarche and age at natural menopause in East Asian women: a genome-wide association study.
Shi, Jiajun; Zhang, Ben; Choi, Ji-Yeob; Gao, Yu-Tang; Li, Huaixing; Lu, Wei; Long, Jirong; Kang, Daehee; Xiang, Yong-Bing; Wen, Wanqing; Park, Sue K; Ye, Xingwang; Noh, Dong-Young; Zheng, Ying; Wang, Yiqin; Chung, Seokang; Lin, Xu; Cai, Qiuyin; Shu, Xiao-Ou
2016-12-01
Age at menarche (AM) and age at natural menopause (ANM) are complex traits with a high heritability. Abnormal timing of menarche or menopause is associated with a reduced span of fertility and risk for several age-related diseases including breast, endometrial and ovarian cancer, cardiovascular disease, and osteoporosis. To identify novel genetic loci for AM or ANM in East Asian women and to replicate previously identified loci primarily in women of European ancestry by genome-wide association studies (GWASs), we conducted a two-stage GWAS. Stage I aimed to discover promising novel AM and ANM loci using GWAS data of 8073 women from Shanghai, China. The Stage II replication study used the data from another Chinese GWAS (n = 1230 for AM and n = 1458 for ANM), a Korean GWAS (n = 4215 for AM and n = 1739 for ANM), and de novo genotyping of 2877 additional Chinese women. Previous GWAS-identified loci for AM and ANM were also evaluated. We identified two suggestive menarcheal age loci tagged by rs79195475 at 10q21.3 (beta = -0.118 years, P = 3.4 × 10^-6) and rs1023935 at 4p15.1 (beta = -0.145 years, P = 4.9 × 10^-6) and one menopausal age locus tagged by rs3818134 at 22q12.2 (beta = -0.276 years, P = 8.8 × 10^-6). These suggestive loci warrant further validation in independent populations. Although limited by low statistical power, we replicated 19 of the 98 menarche loci and 5 of the 20 menopause loci previously identified in women of European ancestry in East Asian women, suggesting a shared genetic architecture for these two traits across populations.
van Leeuwen, Elisabeth M; Sabo, Aniko; Bis, Joshua C; Huffman, Jennifer E; Manichaikul, Ani; Smith, Albert V; Feitosa, Mary F; Demissie, Serkalem; Joshi, Peter K; Duan, Qing; Marten, Jonathan; van Klinken, Jan B; Surakka, Ida; Nolte, Ilja M; Zhang, Weihua; Mbarek, Hamdi; Li-Gao, Ruifang; Trompet, Stella; Verweij, Niek; Evangelou, Evangelos; Lyytikäinen, Leo-Pekka; Tayo, Bamidele O; Deelen, Joris; van der Most, Peter J; van der Laan, Sander W; Arking, Dan E; Morrison, Alanna; Dehghan, Abbas; Franco, Oscar H; Hofman, Albert; Rivadeneira, Fernando; Sijbrands, Eric J; Uitterlinden, Andre G; Mychaleckyj, Josyf C; Campbell, Archie; Hocking, Lynne J; Padmanabhan, Sandosh; Brody, Jennifer A; Rice, Kenneth M; White, Charles C; Harris, Tamara; Isaacs, Aaron; Campbell, Harry; Lange, Leslie A; Rudan, Igor; Kolcic, Ivana; Navarro, Pau; Zemunik, Tatijana; Salomaa, Veikko; Kooner, Angad S; Kooner, Jaspal S; Lehne, Benjamin; Scott, William R; Tan, Sian-Tsung; de Geus, Eco J; Milaneschi, Yuri; Penninx, Brenda W J H; Willemsen, Gonneke; de Mutsert, Renée; Ford, Ian; Gansevoort, Ron T; Segura-Lepe, Marcelo P; Raitakari, Olli T; Viikari, Jorma S; Nikus, Kjell; Forrester, Terrence; McKenzie, Colin A; de Craen, Anton J M; de Ruijter, Hester M; Pasterkamp, Gerard; Snieder, Harold; Oldehinkel, Albertine J; Slagboom, P Eline; Cooper, Richard S; Kähönen, Mika; Lehtimäki, Terho; Elliott, Paul; van der Harst, Pim; Jukema, J Wouter; Mook-Kanamori, Dennis O; Boomsma, Dorret I; Chambers, John C; Swertz, Morris; Ripatti, Samuli; Willems van Dijk, Ko; Vitart, Veronique; Polasek, Ozren; Hayward, Caroline; Wilson, James G; Wilson, James F; Gudnason, Vilmundur; Rich, Stephen S; Psaty, Bruce M; Borecki, Ingrid B; Boerwinkle, Eric; Rotter, Jerome I; Cupples, L Adrienne; van Duijn, Cornelia M
2016-01-01
Background So far, more than 170 loci have been associated with circulating lipid levels through genome-wide association studies (GWAS). These associations are largely driven by common variants, their function is often not known, and many are likely to be markers for the causal variants. In this study we aimed to identify more new rare and low-frequency functional variants associated with circulating lipid levels. Methods We used the 1000 Genomes Project as a reference panel for the imputations of GWAS data from ∼60 000 individuals in the discovery stage and ∼90 000 samples in the replication stage. Results Our study resulted in the identification of five new associations with circulating lipid levels at four loci. All four loci are within genes that can be linked biologically to lipid metabolism. One of the variants, rs116843064, is a damaging missense variant within the ANGPTL4 gene. Conclusions This study illustrates that GWAS with high-scale imputation may still help us unravel the biological mechanism behind circulating lipid levels. PMID:27036123
Zhang, Qingrun; Long, Quan; Ott, Jurg
2014-06-01
Identifying gene-gene interactions is a hot topic in genome-wide association studies. Two fundamental challenges are: (1) how to smartly identify combinations of variants that may be associated with the trait from the astronomical number of all possible combinations; and (2) how to test epistatic interaction when all potential combinations are available. We developed AprioriGWAS, which brings two innovations. (1) Based on Apriori, a successful method in the field of Frequent Itemset Mining (FIM) in which a pattern growth strategy is leveraged to effectively and accurately reduce the search space, AprioriGWAS can efficiently identify genetically associated genotype patterns. (2) To test the hypotheses of epistasis, we adopt a new conditional permutation procedure to obtain reliable statistical inference of Pearson's chi-square test for the [Formula: see text] contingency table generated by associated variants. By applying AprioriGWAS to age-related macular degeneration (AMD) data, we found that: (1) angiopoietin 1 (ANGPT1) and four retinal genes interact with Complement Factor H (CFH). (2) GO term "glycosaminoglycan biosynthetic process" was enriched in AMD interacting genes. The epistatic interactions newly found by AprioriGWAS on AMD data are likely true interactions, since genes interacting with CFH are retinal genes, and GO term enrichment also verified that interaction between glycosaminoglycans (GAGs) and CFH plays an important role in disease pathology of AMD. By applying AprioriGWAS to bipolar disorder in WTCCC data, we found that variants without marginal effect show significant interactions, for example multiple-SNP genotype patterns inside the genes GABRB2 and GRIA1 (AMPA subunit 1 receptor gene). AMPARs are found in many parts of the brain and are the most commonly found receptor in the nervous system. GABRB2 mediates the fastest inhibitory synaptic transmission in the central nervous system.
Multiple lines of evidence support the relevance of GRIA1 and GABRB2 to mental disorders.
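The search-space reduction that Apriori provides can be illustrated with a generic frequent itemset miner over genotype "items". The sketch below is a plain, level-wise Apriori and not the AprioriGWAS code: the item encoding as (SNP, genotype) pairs, the function name, and the toy data are all assumptions, and the conditional permutation test described in the abstract is not included.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: count} for every itemset present in at least
    min_support transactions, built level by level."""
    transactions = [frozenset(t) for t in transactions]
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(frequent)
    k = 2
    while frequent:
        prev = set(frequent)
        candidates = set()
        for a, b in combinations(prev, 2):
            union = a | b
            # Apriori pruning: every (k-1)-subset must itself be frequent.
            if len(union) == k and all(
                frozenset(sub) in prev for sub in combinations(union, k - 1)
            ):
                candidates.add(union)
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        frequent = {s: c for s, c in counts.items() if c >= min_support}
        result.update(frequent)
        k += 1
    return result


# Toy "case" genotypes, one set of (SNP, genotype) items per individual.
cases = [
    {("snp1", "AA"), ("snp2", "GG"), ("snp3", "CT")},
    {("snp1", "AA"), ("snp2", "GG"), ("snp3", "CC")},
    {("snp1", "AA"), ("snp2", "GT"), ("snp3", "CT")},
    {("snp1", "AG"), ("snp2", "GG"), ("snp3", "CT")},
]
patterns = apriori(cases, min_support=2)
```

Because a genotype pattern can only be frequent if all of its sub-patterns are frequent, most multi-SNP combinations are never counted at all, which is what makes the enumeration tractable.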
The 19q12 bladder cancer GWAS signal: association with cyclin E function and aggressive disease
Fu, Yi-Ping; Kohaar, Indu; Moore, Lee E.; Lenz, Petra; Figueroa, Jonine D.; Tang, Wei; Porter-Gill, Patricia; Chatterjee, Nilanjan; Scott-Johnson, Alexandra; Garcia-Closas, Montserrat; Muchmore, Brian; Baris, Dalsu; Paquin, Ashley; Ylaya, Kris; Schwenn, Molly; Apolo, Andrea B.; Karagas, Margaret R.; Tarway, McAnthony; Johnson, Alison; Mumy, Adam; Schned, Alan; Guedez, Liliana; Jones, Michael A.; Kida, Masatoshi; Monawar Hosain, GM; Malats, Nuria; Kogevinas, Manolis; Tardon, Adonina; Serra, Consol; Carrato, Alfredo; Garcia-Closas, Reina; Lloreta, Josep; Wu, Xifeng; Purdue, Mark; Andriole, Gerald L.; Grubb, Robert L.; Black, Amanda; Landi, Maria T.; Caporaso, Neil E.; Vineis, Paolo; Siddiq, Afshan; Bueno-de-Mesquita, H. Bas; Trichopoulos, Dimitrios; Ljungberg, Börje; Severi, Gianluca; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth C.; Tjønneland, Anne; Brennan, Paul; Chang-Claude, Jenny; Riboli, Elio; Prescott, Jennifer; Chen, Constance; De Vivo, Immaculata; Govannucci, Edward; Hunter, David; Kraft, Peter; Lindstrom, Sara; Gapstur, Susan M.; Jacobs, Eric J.; Diver, W. Ryan; Albanes, Demetrius; Weinstein, Stephanie J.; Virtamo, Jarmo; Kooperberg, Charles; Hohensee, Chancellor; Rodabough, Rebecca J.; Cortessis, Victoria K.; Conti, David V.; Gago-Dominguez, Manuela; Stern, Mariana C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Haiman, Christopher A.; Cussenot, Olivier; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Porru, Stefano; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Grossman, H. Barton; Wang, Zhaoming; Deng, Xiang; Chung, Charles C.; Hutchinson, Amy; Burdette, Laurie; Wheeler, William; Fraumeni, Joseph; Chanock, Stephen J.; Hewitt, Stephen M.; Silverman, Debra T.; Rothman, Nathaniel; Prokunina-Olsson, Ludmila
2014-01-01
A genome-wide association study (GWAS) of bladder cancer identified a genetic marker rs8102137 within the 19q12 region as a novel susceptibility variant. This marker is located upstream of the CCNE1 gene, which encodes cyclin E, a cell cycle protein. We performed genetic fine mapping analysis of the CCNE1 region using data from two bladder cancer GWAS (5,942 cases and 10,857 controls). We found that the original GWAS marker rs8102137 represents a group of 47 linked SNPs (with r² ≥ 0.7) associated with increased bladder cancer risk. From this group we selected a functional promoter variant rs7257330, which showed strong allele-specific binding of nuclear proteins in several cell lines. In both GWAS, rs7257330 was associated only with aggressive bladder cancer, with a combined per-allele odds ratio (OR) = 1.18 (95% CI = 1.09-1.27, p = 4.67 × 10^-5) vs. OR = 1.01 (95% CI = 0.93-1.10, p = 0.79) for non-aggressive disease, with p = 0.0015 for case-only analysis. Cyclin E protein expression analyzed in 265 bladder tumors was increased in aggressive tumors (p = 0.013) and, independently, with each rs7257330-A risk allele (p-trend = 0.024). Over-expression of recombinant cyclin E in cell lines caused significant acceleration of cell cycle. In conclusion, we defined the 19q12 signal as the first GWAS signal specific for aggressive bladder cancer. Molecular mechanisms of this genetic association may be related to cyclin E over-expression and alteration of cell cycle in carriers of CCNE1 risk variants. In combination with established bladder cancer risk factors and other somatic and germline genetic markers, the CCNE1 variants could be useful for inclusion into bladder cancer risk prediction models. PMID:25320178
Building a biomedical cyberinfrastructure for collaborative research.
Schad, Peter A; Mobley, Lee Rivers; Hamilton, Carol M
2011-05-01
For the potential power of genome-wide association studies (GWAS) and translational medicine to be realized, the biomedical research community must adopt standard measures, vocabularies, and systems to establish an extensible biomedical cyberinfrastructure. Incorporating standard measures will greatly facilitate combining and comparing studies via meta-analysis. Incorporating consensus-based and well-established measures into various studies should reduce the variability across studies due to attributes of measurement, making findings across studies more comparable. This article describes two well-established consensus-based approaches to identifying standard measures and systems: PhenX (consensus measures for phenotypes and eXposures), and the Open Geospatial Consortium (OGC). NIH support for these efforts has produced the PhenX Toolkit, an assembled catalog of standard measures for use in GWAS and other large-scale genomic research efforts, and the RTI Spatial Impact Factor Database (SIFD), a comprehensive repository of geo-referenced variables and extensive meta-data that conforms to OGC standards. The need for coordinated development of cyberinfrastructure to support measures and systems that enhance collaboration and data interoperability is clear; this paper includes a discussion of standard protocols for ensuring data compatibility and interoperability. Adopting a cyberinfrastructure that includes standard measures and vocabularies, and open-source systems architecture, such as the two well-established systems discussed here, will enhance the potential of future biomedical and translational research. Establishing and maintaining the cyberinfrastructure will require a fundamental change in the way researchers think about study design, collaboration, and data storage and analysis. Copyright © 2011 American Journal of Preventive Medicine. Published by Elsevier Inc. All rights reserved.
Building a Biomedical Cyberinfrastructure for Collaborative Research
Schad, Peter A.; Mobley, Lee Rivers; Hamilton, Carol M.
2018-01-01
For the potential power of genome-wide association studies (GWAS) and translational medicine to be realized, the biomedical research community must adopt standard measures, vocabularies, and systems to establish an extensible biomedical cyberinfrastructure. Incorporating standard measures will greatly facilitate combining and comparing studies via meta-analysis, which is a means for deriving larger populations, needed for increased statistical power to detect less apparent and more complex associations (gene-environment interactions and polygenic gene-gene interactions). Incorporating consensus-based and well-established measures into various studies should reduce the variability across studies due to attributes of measurement, making findings across studies more comparable. This article describes two consensus-based approaches to establishing standard measures and systems: PhenX (consensus measures for Phenotypes and eXposures), and the Open Geospatial Consortium (OGC). National Institutes of Health support for these efforts has produced the PhenX Toolkit, an assembled catalog of standard measures for use in GWAS and other large-scale genomic research efforts, and the RTI Spatial Impact Factor Database (SIFD), a comprehensive repository of georeferenced variables and extensive metadata that conforms to OGC standards. The need for coordinated development of cyberinfrastructure to support collaboration and data interoperability is clear, and we discuss standard protocols for ensuring data compatibility and interoperability. Adopting a cyberinfrastructure that includes standard measures, vocabularies, and open-source systems architecture will enhance the potential of future biomedical and translational research. Establishing and maintaining the cyberinfrastructure will require a fundamental change in the way researchers think about study design, collaboration, and data storage and analysis. PMID:21521587
[Genome-wide association study for adolescent idiopathic scoliosis].
Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro
2016-04-01
Adolescent idiopathic scoliosis (AIS) is a polygenic disease. Genome-wide association studies (GWASs) have been performed for many polygenic diseases. For AIS, we conducted a GWAS and identified the first AIS locus near LBX1. After this discovery, we extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS.
Bonfiglio, F; Henström, M; Nag, A; Hadizadeh, F; Zheng, T; Cenit, M C; Tigchelaar, E; Williams, F; Reznichenko, A; Ek, W E; Rivera, N V; Homuth, G; Aghdassi, A A; Kacprowski, T; Männikkö, M; Karhunen, V; Bujanda, L; Rafter, J; Wijmenga, C; Ronkainen, J; Hysi, P; Zhernakova, A; D'Amato, M
2018-04-19
Irritable bowel syndrome (IBS) shows genetic predisposition; however, large-scale, well-powered gene mapping studies are lacking. We sought to exploit existing genetic (genotype) and epidemiological (questionnaire) data from a series of population-based cohorts for IBS genome-wide association studies (GWAS) and their meta-analysis. Based on questionnaire data compatible with Rome III Criteria, we identified a total of 1335 IBS cases and 9768 asymptomatic individuals from 5 independent European genotyped cohorts. Individual GWAS were carried out with sex-adjusted logistic regression under an additive model, followed by meta-analysis using the inverse variance method. Functional annotation of significant results was obtained via a computational pipeline exploiting ontology and interaction networks, and tissue-specific and gene set enrichment analyses. Suggestive GWAS signals (P ≤ 5.0 × 10^-6) were detected for 7 genomic regions, harboring 64 candidate genes that may affect IBS risk via functional or expression changes. Functional annotation of this gene set convincingly (best FDR-corrected P = 3.1 × 10^-10) highlighted regulation of ion channel activity as the most plausible pathway affecting IBS risk. Our results confirm the feasibility of population-based studies for gene-discovery efforts in IBS, identify risk genes and loci to be prioritized in independent follow-ups, and pinpoint ion channels as important players and potential therapeutic targets warranting further investigation. © 2018 John Wiley & Sons Ltd.
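The inverse variance method mentioned above combines per-cohort effect estimates by weighting each one by the reciprocal of its squared standard error. The following is a minimal sketch assuming a fixed-effect model and a single SNP; the function name and the per-cohort numbers are made up for illustration and are not taken from the study.

```python
import math


def inverse_variance_meta(betas, ses):
    """Fixed-effect inverse-variance-weighted meta-analysis of per-cohort
    effect estimates (e.g. log odds ratios) for one SNP."""
    weights = [1.0 / (se * se) for se in ses]
    total_weight = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    se = math.sqrt(1.0 / total_weight)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return beta, se, z, p


# Hypothetical log odds ratios and standard errors from three cohorts.
cohort_betas = [0.20, 0.10, 0.30]
cohort_ses = [0.10, 0.10, 0.20]
beta, se, z, p = inverse_variance_meta(cohort_betas, cohort_ses)
print(f"beta={beta:.4f}, se={se:.4f}, z={z:.2f}, p={p:.3g}")
```

Cohorts with smaller standard errors dominate the pooled estimate, so the less precise third cohort moves the combined effect only slightly.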
CONAN: copy number variation analysis software for genome-wide association studies
2010-01-01
Background Genome-wide association studies (GWAS) based on single nucleotide polymorphisms (SNPs) revolutionized our perception of the genetic regulation of complex traits and diseases. Copy number variations (CNVs) promise to shed additional light on the genetic basis of monogenic as well as complex diseases and phenotypes. Indeed, the number of detected associations between CNVs and certain phenotypes is constantly increasing. However, while several software packages support the determination of CNVs from SNP chip data, the downstream statistical inference of CNV-phenotype associations is still subject to complicated and inefficient in-house solutions, thus strongly limiting the performance of GWAS based on CNVs. Results CONAN is a freely available client-server software solution which provides an intuitive graphical user interface for categorizing, analyzing and associating CNVs with phenotypes. Moreover, CONAN assists the evaluation process by visualizing detected associations via Manhattan plots in order to enable a rapid identification of genome-wide significant CNV regions. Various file formats including the information on CNVs in population samples are supported as input data. Conclusions CONAN facilitates the performance of GWAS based on CNVs and the visual analysis of calculated results. CONAN provides a rapid, valid and straightforward software solution to identify genetic variation underlying the 'missing' heritability for complex traits that remains unexplained by recent GWAS. The freely available software can be downloaded at http://genepi-conan.i-med.ac.at. PMID:20546565
Langlois, Christine; Abadi, Arkan; Peralta-Romero, Jesus; Alyass, Akram; Suarez, Fernando; Gomez-Zamudio, Jaime; Burguete-Garcia, Ana I.; Yazdi, Fereshteh T.; Cruz, Miguel; Meyre, David
2016-01-01
Genome wide association studies (GWAS) have identified single-nucleotide polymorphisms (SNPs) that are associated with fasting plasma glucose (FPG) in adult European populations. The contribution of these SNPs to FPG in non-Europeans and children is unclear. We studied the association of 15 GWAS SNPs and a genotype score (GS) with FPG and 7 metabolic traits in 1,421 Mexican children and adolescents from Mexico City. Genotyping of the 15 SNPs was performed using TaqMan Open Array. We used multivariate linear regression models adjusted for age, sex, body mass index standard deviation score, and recruitment center. We identified significant associations between 3 SNPs (G6PC2 (rs560887), GCKR (rs1260326), MTNR1B (rs10830963)), the GS and FPG level. The FPG risk alleles of 11 out of the 15 SNPs (73.3%) displayed significant or non-significant beta values for FPG directionally consistent with those reported in adult European GWAS. The risk allele frequencies for 11 of 15 (73.3%) SNPs differed significantly in Mexican children and adolescents compared to European adults from the 1000G Project, but no significant enrichment in FPG risk alleles was observed in the Mexican population. Our data support a partial transferability of European GWAS FPG association signals in children and adolescents from the admixed Mexican population. PMID:27782183
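A genotype score of the kind used above is commonly built by counting risk alleles across the selected SNPs, optionally weighting each SNP by its reported effect size. The abstract does not state which construction was used, so the sketch below assumes the simple count and offers weighting as an option; the function name and example values are hypothetical.

```python
def genotype_score(risk_allele_counts, weights=None):
    """Sum risk alleles across SNPs for one individual.

    risk_allele_counts: 0, 1 or 2 copies of the risk allele per SNP.
    weights: optional per-SNP effect sizes for a weighted score.
    """
    for count in risk_allele_counts:
        if count not in (0, 1, 2):
            raise ValueError("allele counts must be 0, 1 or 2")
    if weights is None:
        return float(sum(risk_allele_counts))
    if len(weights) != len(risk_allele_counts):
        raise ValueError("weights and counts must have the same length")
    return sum(w * c for w, c in zip(weights, risk_allele_counts))


# Hypothetical individual typed at four SNPs.
unweighted = genotype_score([2, 1, 0, 1])
weighted = genotype_score([2, 1], weights=[0.10, 0.05])
```

The resulting score can then be entered as a single predictor in the same adjusted linear regression used for the individual SNPs.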
Pooled Genome-Wide Analysis to Identify Novel Risk Loci for Pediatric Allergic Asthma
Ricci, Giampaolo; Astolfi, Annalisa; Remondini, Daniel; Cipriani, Francesca; Formica, Serena; Dondi, Arianna; Pession, Andrea
2011-01-01
Background Genome-wide association studies of pooled DNA samples were shown to be a valuable tool to identify candidate SNPs associated with a phenotype. No such study has so far been applied to childhood allergic asthma, even though the high complexity of asthma genetics makes it an appropriate field in which to explore the potential of the pooled GWAS approach. Methodology/Principal Findings We performed a pooled GWAS and individual genotyping in 269 children with allergic respiratory diseases, comparing allergic children with and without asthma. We used a modular approach to identify the most significant loci associated with asthma by combining silhouette statistics and physical distance method with cluster-adapted thresholding. We found 97% concordance between pooled GWAS and individual genotyping, with 36 out of 37 top-scoring SNPs significant at individual genotyping level. The most significant SNP is located inside the coding sequence of C5, an already identified asthma susceptibility gene, while the other loci regulate functions that are relevant to bronchial physiopathology, such as immune- or inflammation-mediated mechanisms and airway smooth muscle contraction. Integration with gene expression data showed that almost half of the putative susceptibility genes are differentially expressed in experimental asthma mouse models. Conclusion/Significance Combined silhouette statistics and cluster-adapted physical distance threshold analysis of pooled GWAS data is an efficient method to identify candidate SNPs associated with asthma development in an allergic pediatric population. PMID:21359210
Wei, Lijuan; Qu, Cunmin; Xu, Xinfu; Lu, Kun; Qian, Wei; Li, Jiana; Li, Maoteng; Liu, Liezhao
2015-01-01
A stable yellow-seeded variety is the breeding goal for obtaining the ideal rapeseed (Brassica napus L.) plant, and the amount of acid detergent lignin (ADL) in the seeds and the hull content (HC) are often used as yellow-seeded rapeseed screening indices. In this study, a genome-wide association analysis of 520 accessions was performed using the Q + K model with a total of 31,839 single-nucleotide polymorphism (SNP) sites. As a result, three significant associations on the B. napus chromosomes A05, A09, and C05 were detected for seed ADL content. The peak SNPs were within 9.27, 14.22, and 20.86 kb of the key genes BnaA.PAL4, BnaA.CAD2/BnaA.CAD3, and BnaC.CCR1, respectively. Further analyses were performed on the major locus of A05, which was also detected in the seed HC examination. A comparison of our genome-wide association study (GWAS) results and previous linkage mappings revealed a common chromosomal region on A09, which indicates that GWAS can be used as a powerful complementary strategy for dissecting complex traits in B. napus. Genomic selection (GS) utilizing the significant SNP markers based on the GWAS results exhibited increased predictive ability, indicating that the predictive ability of a given model can be substantially improved by using GWAS and GS. PMID:26673885
Multi-Criteria Decision Making Approaches for Quality Control of Genome-Wide Association Studies
Malovini, Alberto; Rognoni, Carla; Puca, Annibale; Bellazzi, Riccardo
2009-01-01
Experimental errors in the genotyping phases of a Genome-Wide Association Study (GWAS) can lead to false positive findings and to spurious associations. An appropriate quality control phase could minimize the effects of such errors. Several filtering criteria can be used to perform quality control. Currently, no formal methods have been proposed for taking into account these criteria and the experimenter’s preferences at the same time. In this paper we propose two strategies for setting appropriate genotyping rate thresholds for GWAS quality control. These two approaches are based on Multi-Criteria Decision Making theory. We have applied our method to a real dataset composed of 734 individuals affected by Arterial Hypertension (AH) and 486 nonagenarians without history of AH. The proposed strategies appear to deal with GWAS quality control in a sound way, as they rationalize and make explicit the experimenter’s choices, thus providing more reproducible results. PMID:21347174
Genome-wide association studies of obesity and metabolic syndrome.
Fall, Tove; Ingelsson, Erik
2014-01-25
Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was mapped in 2007 to a gene whose function was unknown at the time. The association with this gene, now known as fat mass and obesity associated (FTO), has been replicated repeatedly in several ethnicities, and the gene affects obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Almli, Lynn M; Duncan, Richard; Feng, Hao; Ghosh, Debashis; Binder, Elisabeth B; Bradley, Bekh; Ressler, Kerry J; Conneely, Karen N; Epstein, Michael P
2014-12-01
Genetic association studies of psychiatric outcomes often consider interactions with environmental exposures and, in particular, apply tests that jointly consider gene and gene-environment interaction effects for analysis. Using a genome-wide association study (GWAS) of posttraumatic stress disorder (PTSD), we report that heteroscedasticity (defined as variability in outcome that differs by the value of the environmental exposure) can invalidate traditional joint tests of gene and gene-environment interaction. Our objectives were to identify the cause of bias in traditional joint tests of gene and gene-environment interaction in a PTSD GWAS and to determine whether proposed robust joint tests are insensitive to this problem. The PTSD GWAS data set consisted of 3359 individuals (978 men and 2381 women) from the Grady Trauma Project (GTP), a cohort study from Atlanta, Georgia. The GTP performed genome-wide genotyping of participants and collected environmental exposures using the Childhood Trauma Questionnaire and Trauma Experiences Inventory. We performed joint interaction testing of the Beck Depression Inventory and modified PTSD Symptom Scale in the GTP GWAS. We assessed systematic bias in our interaction analyses using quantile-quantile plots and genome-wide inflation factors. Application of the traditional joint interaction test to the GTP GWAS yielded systematic inflation across different outcomes and environmental exposures (inflation-factor estimates ranging from 1.07 to 1.21), whereas application of the robust joint test to the same data set yielded no such inflation (inflation-factor estimates ranging from 1.01 to 1.02). Simulated data further revealed that the robust joint test is valid in different heteroscedasticity models, whereas the traditional joint test is invalid. The robust joint test also has power similar to the traditional joint test when heteroscedasticity is not an issue.
We believe the robust joint test should be used in candidate-gene studies and GWASs of psychiatric outcomes that consider environmental interactions. To make the procedure useful for applied investigators, we created a software tool that can be called from the popular PLINK package for analysis.
Logue, Mark W; Amstadter, Ananda B; Baker, Dewleen G; Duncan, Laramie; Koenen, Karestan C; Liberzon, Israel; Miller, Mark W; Morey, Rajendra A; Nievergelt, Caroline M; Ressler, Kerry J; Smith, Alicia K; Smoller, Jordan W; Stein, Murray B; Sumner, Jennifer A; Uddin, Monica
2015-01-01
The development of posttraumatic stress disorder (PTSD) is influenced by genetic factors. Although there have been some replicated candidates, the identification of risk variants for PTSD has lagged behind genetic research of other psychiatric disorders such as schizophrenia, autism, and bipolar disorder. Psychiatric genetics has moved beyond examination of specific candidate genes in favor of the genome-wide association study (GWAS) strategy of very large numbers of samples, which allows for the discovery of previously unsuspected genes and molecular pathways. The successes of genetic studies of schizophrenia and bipolar disorder have been aided by the formation of a large-scale GWAS consortium: the Psychiatric Genomics Consortium (PGC). In contrast, only a handful of GWAS of PTSD have appeared in the literature to date. Here we describe the formation of a group dedicated to large-scale study of PTSD genetics: the PGC-PTSD. The PGC-PTSD faces challenges related to the contingency on trauma exposure and the large degree of ancestral genetic diversity within and across participating studies. Using the PGC analysis pipeline supplemented by analyses tailored to address these challenges, we anticipate that our first large-scale GWAS of PTSD will comprise over 10 000 cases and 30 000 trauma-exposed controls. Following in the footsteps of our PGC forerunners, this collaboration—of a scope that is unprecedented in the field of traumatic stress—will lead the search for replicable genetic associations and new insights into the biological underpinnings of PTSD. PMID:25904361
Hamidi Hay, E; Roberts, A
2017-04-01
Longevity is a highly important trait for the efficiency of beef cattle production. The objective of this study was to evaluate the genomic prediction of longevity and identify genomic regions associated with this trait. The data used in this study consisted of 547 Composite Gene Combination cows (1/2 Red Angus, 1/4 Charolais, 1/4 Tarentaise) born from 2002 to 2011 and genotyped with the Illumina BovineSNP50 BeadChip. Three models were used to assess genomic prediction: Bayes A, Bayes B and GBLUP using a genomic relationship matrix. To identify genomic regions associated with longevity, 2 approaches were adopted: single-marker genome-wide association and a Bayesian approach using GenSel software. The genomic prediction accuracy was low: 0.28, 0.25, and 0.22 for Bayes A, Bayes B and GBLUP, respectively. The single-marker genome-wide association study (GWAS) identified 5 loci with a P-value less than 0.05 after false discovery correction: UA-IFASA-7571 on chromosome 19 (58.03 Mb), ARS-BFGL-BAC-15059 on BTA1 (28.8 Mb), ARS-BFGL-NGS-104159 on BTA3 (29.4 Mb), ARS-BFGL-NGS-32882 on BTA9 (104.07 Mb) and ARS-BFGL-NGS-32883 on BTA25 (33.77 Mb). The Bayesian GWAS yielded 4 genomic regions overlapping with the single-marker GWAS results. The region with the highest percentage of genomic variance (3.73%) was detected on chromosome 19. Both GWAS approaches adopted in this study showed evidence for association with various chromosomal locations.
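The abstract above does not say which false discovery correction was applied to the single-marker P-values. Assuming a Benjamini-Hochberg adjustment, one common choice, a minimal Python sketch of the step could look as follows. The function name and the P-values are hypothetical and are not taken from the study.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order.

    Assumption: the study's "false discovery correction" is of this kind;
    the abstract does not name the procedure.
    """
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical marker p-values, invented for illustration only.
example_p = [0.001, 0.008, 0.039, 0.041, 0.60]
example_q = benjamini_hochberg(example_p)
# Markers whose adjusted value falls below 0.05.
significant = [i for i, q in enumerate(example_q) if q < 0.05]
```

With a genome-wide marker panel the same function would be applied to the full vector of single-marker P-values before reporting loci.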
Genome-wide Association Study for Ovarian Cancer Susceptibility using Pooled DNA
Lu, Yi; Chen, Xiaoqing; Beesley, Jonathan; Johnatty, Sharon E.; deFazio, Anna; Lambrechts, Sandrina; Lambrechts, Diether; Despierre, Evelyn; Vergotes, Ignace; Chang-Claude, Jenny; Hein, Rebecca; Nickels, Stefan; Wang-Gohrke, Shan; Dörk, Thilo; Dürst, Matthias; Antonenkova, Natalia; Bogdanova, Natalia; Goodman, Marc T.; Lurie, Galina; Wilkens, Lynne R.; Carney, Michael E.; Butzow, Ralf; Nevanlinna, Heli; Heikkinen, Tuomas; Leminen, Arto; Kiemeney, Lambertus A.; Massuger, Leon F.A.G.; van Altena, Anne M.; Aben, Katja K.; Kjaer, Susanne Krüger; Høgdall, Estrid; Jensen, Allan; Brooks-Wilson, Angela; Le, Nhu; Cook, Linda; Earp, Madalene; Kelemen, Linda; Easton, Douglas; Pharoah, Paul; Song, Honglin; Tyrer, Jonathan; Ramus, Susan; Menon, Usha; Gentry-Maharaj, Alexandra; Gayther, Simon A.; Bandera, Elisa V.; Olson, Sara H.; Orlow, Irene; Rodriguez-Rodriguez, Lorna
2013-01-01
Recent genome-wide association studies (GWAS) have identified four low-penetrance ovarian cancer susceptibility loci. We hypothesized that further moderate or low penetrance variants exist among the subset of SNPs not well tagged by the genotyping arrays used in the previous studies which would account for some of the remaining risk. We therefore conducted a time- and cost-effective stage 1 GWAS on 342 invasive serous cases and 643 controls genotyped on pooled DNA using the high density Illumina 1M-Duo array. We followed up 20 of the most significantly associated SNPs, which are not well tagged by the lower density arrays used by the published GWAS, and genotyped them on individual DNA. Most of the top 20 SNPs were clearly validated by individually genotyping the samples used in the pools. However, none of the 20 SNPs replicated when tested for association in a much larger stage 2 set of 4,651 cases and 6,966 controls from the Ovarian Cancer Association Consortium. Given that most of the top 20 SNPs from pooling were validated in the same samples by individual genotyping, the lack of replication is likely to be due to the relatively small sample size in our stage 1 GWAS rather than due to problems with the pooling approach. We conclude that there are unlikely to be any moderate or large effects on ovarian cancer risk untagged by the less dense arrays. However, our study lacked power to make clear statements on the existence of hitherto untagged small effect variants. PMID:22794196
GWASinlps: Nonlocal prior based iterative SNP selection tool for genome-wide association studies.
Sanyal, Nilotpal; Lo, Min-Tzu; Kauppi, Karolina; Djurovic, Srdjan; Andreassen, Ole A; Johnson, Valen E; Chen, Chi-Hua
2018-06-19
Multiple marker analysis of the genome-wide association study (GWAS) data has gained ample attention in recent years. However, because of the ultra high-dimensionality of GWAS data, such analysis is challenging. Frequently used penalized regression methods often lead to large number of false positives, whereas Bayesian methods are computationally very expensive. Motivated to ameliorate these issues simultaneously, we consider the novel approach of using nonlocal priors in an iterative variable selection framework. We develop a variable selection method, named, iterative nonlocal prior based selection for GWAS, or GWASinlps, that combines, in an iterative variable selection framework, the computational efficiency of the screen-and-select approach based on some association learning and the parsimonious uncertainty quantification provided by the use of nonlocal priors. The hallmark of our method is the introduction of 'structured screen-and-select' strategy, that considers hierarchical screening, which is not only based on response-predictor associations, but also based on response-response associations, and concatenates variable selection within that hierarchy. Extensive simulation studies with SNPs having realistic linkage disequilibrium structures demonstrate the advantages of our computationally efficient method compared to several frequentist and Bayesian variable selection methods, in terms of true positive rate, false discovery rate, mean squared error, and effect size estimation error. Further, we provide empirical power analysis useful for study design. Finally, a real GWAS data application was considered with human height as phenotype. An R-package for implementing the GWASinlps method is available at https://cran.r-project.org/web/packages/GWASinlps/index.html. Supplementary data are available at Bioinformatics online.
Genome-wide association study (GWAS) for molar-incisor hypomineralization (MIH).
Kühnisch, Jan; Thiering, Elisabeth; Heitmüller, Daniela; Tiesler, Carla M T; Grallert, Harald; Heinrich-Weltzien, Roswitha; Hickel, Reinhard; Heinrich, Joachim
2014-01-01
This genome-wide association study (GWAS) investigated the relationship between molar-incisor hypomineralization (MIH) and possible genetic loci. Clinical and genetic data from the 10-year follow-up of 668 children from the Munich GINI-plus and LISA-plus birth cohort studies were analyzed. The dental examinations included the diagnosis of MIH according to the criteria of the European Academy of Paediatric Dentistry (EAPD). Children with MIH were categorized as those with a minimum of one hypomineralized first permanent molar. A GWAS was implemented following a quality-control step and an additive genetic effect was assumed. A total of 2,013,491 single-nucleotide polymorphisms (SNPs) were available for analysis. Rs13058467, which is located near the SCUBE1 gene on chromosome 22 (p < 3.72E-7), was identified as a possible locus linked to MIH when using a threshold of p value <1E-6. After considering the limitations of the present study (e.g., limited sample size and lack of an independent replication sample), it can be concluded that (1) replication analyses in an independent cohort study are strongly recommended and (2) large-scale and well-powered studies are needed to investigate a possible genetic link to MIH.
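To put the reported threshold of p < 1E-6 in context, the short Python sketch below compares it with a Bonferroni threshold computed for the 2,013,491 SNPs mentioned above. The comparison is an illustration added here, not an analysis reported by the authors, and the choice of alpha = 0.05 is an assumption.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


n_snps = 2_013_491          # number of SNPs analysed, from the abstract
reported_p_bound = 3.72e-7  # upper bound on the p-value reported for rs13058467

# Assumed family-wise alpha of 0.05; roughly 2.48e-8 for this many tests.
strict = bonferroni_threshold(0.05, n_snps)

passes_suggestive = reported_p_bound < 1e-6    # threshold used in the abstract
passes_bonferroni = reported_p_bound < strict  # stricter illustrative threshold
```

Under these assumptions the signal clears the 1E-6 threshold but not the Bonferroni one, which is consistent with the authors describing the locus as "possible" and calling for replication.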
Evaluation of 19 susceptibility loci of breast cancer in women of African ancestry
Huo, Dezheng; Zheng, Yonglan; Ogundiran, Temidayo O.; Adebamowo, Clement; Nathanson, Katherine L.; Domchek, Susan M.; Rebbeck, Timothy R.; Simon, Michael S.; John, Esther M.; Hennis, Anselm; Nemesure, Barbara; Wu, Suh-Yuh; Leske, M.Cristina; Ambs, Stefan; Niu, Qun; Zhang, Jing; Cox, Nancy J.; Olopade, Olufunmilayo I.
2012-01-01
Multiple breast cancer susceptibility loci have been identified in genome-wide association studies (GWAS) in populations of European and Asian ancestry using array chips optimized for populations of European ancestry. It is important to examine whether these loci are associated with breast cancer risk in women of African ancestry. We evaluated 25 single nucleotide polymorphisms (SNPs) at 19 loci in a pooled case–control study of breast cancer, which included 1509 cases and 1383 controls. Cases and controls were enrolled in Nigeria, Barbados and the USA; all women were of African ancestry. We found significant associations for three SNPs, which were in the same direction and of similar magnitude as those reported in previous fine-mapping studies in women of African ancestry. The allelic odds ratios were 1.24 [95% confidence interval (CI): 1.04–1.47; P = 0.018] for the rs2981578-G allele (10q26/FGFR2), 1.34 (95% CI: 1.10–1.63; P = 0.0035) for the rs9397435-G allele (6q25) and 1.12 (95% CI: 1.00–1.25; P = 0.04) for the rs3104793-C allele (16q12). Although a significant association was observed for an additional index SNP (rs3817198), it was in the opposite direction to prior GWAS studies. In conclusion, this study highlights the complexity of applying current GWAS findings across racial/ethnic groups, as none of GWAS-identified index SNPs could be replicated in women of African ancestry. Further fine-mapping studies in women of African ancestry will be needed to reveal additional and causal variants for breast cancer. PMID:22357627
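Allelic odds ratios with 95% confidence intervals, like those quoted above, can be approximated from a 2 x 2 table of allele counts. The Python sketch below uses the standard log odds ratio (Woolf) interval. It is a simplified illustration: the abstract does not state the estimation method, a pooled study would typically use an adjusted model, and the counts shown are hypothetical.

```python
import math


def allelic_odds_ratio(case_risk, case_other, control_risk, control_other, z=1.96):
    """Odds ratio and Wald (Woolf) confidence interval from allele counts.

    z=1.96 gives an approximate 95% interval.
    """
    odds_ratio = (case_risk * control_other) / (case_other * control_risk)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(
        1 / case_risk + 1 / case_other + 1 / control_risk + 1 / control_other
    )
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical allele counts, not taken from the study.
or_estimate, ci_low, ci_high = allelic_odds_ratio(600, 2400, 500, 2500)
```

An interval whose lower bound lies above 1.0, as in this toy table, corresponds to a nominally significant risk-increasing allele.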
Saykin, Andrew J.; Shen, Li; Foroud, Tatiana M.; Potkin, Steven G.; Swaminathan, Shanker; Kim, Sungeun; Risacher, Shannon L.; Nho, Kwangsik; Huentelman, Matthew J.; Craig, David W.; Thompson, Paul M.; Stein, Jason L.; Moore, Jason H.; Farrer, Lindsay A.; Green, Robert C.; Bertram, Lars; Jack, Clifford R.; Weiner, Michael W.
2010-01-01
The role of the Alzheimer’s Disease Neuroimaging Initiative Genetics Core is to facilitate the investigation of genetic influences on disease onset and trajectory as reflected in structural, functional, and molecular imaging changes; fluid biomarkers; and cognitive status. Major goals include (1) blood sample processing, genotyping, and dissemination, (2) genome-wide association studies (GWAS) of longitudinal phenotypic data, and (3) providing a central resource, point of contact and planning group for genetics within Alzheimer’s Disease Neuroimaging Initiative. Genome-wide array data have been publicly released and updated, and several neuroimaging GWAS have recently been reported examining baseline magnetic resonance imaging measures as quantitative phenotypes. Other preliminary investigations include copy number variation in mild cognitive impairment and Alzheimer’s disease and GWAS of baseline cerebrospinal fluid biomarkers and longitudinal changes on magnetic resonance imaging. Blood collection for RNA studies is a new direction. Genetic studies of longitudinal phenotypes hold promise for elucidating disease mechanisms and risk, development of therapeutic strategies, and refining selection criteria for clinical trials. PMID:20451875
Analyzing Association Mapping in Pedigree-Based GWAS Using a Penalized Multitrait Mixed Model
Liu, Jin; Yang, Can; Shi, Xingjie; Li, Cong; Huang, Jian; Zhao, Hongyu; Ma, Shuangge
2017-01-01
Genome-wide association studies (GWAS) have led to the identification of many genetic variants associated with complex diseases in the past 10 years. Penalization methods, with significant numerical and statistical advantages, have been extensively adopted in analyzing GWAS. This study has been partly motivated by the analysis of Genetic Analysis Workshop (GAW) 18 data, which have two notable characteristics. First, the subjects are from a small number of pedigrees and hence related. Second, for each subject, multiple correlated traits have been measured. Most of the existing penalization methods assume independence between subjects and traits and can be suboptimal. There are a few methods in the literature based on mixed modeling that can accommodate correlations. However, they cannot fully accommodate the two types of correlations while conducting effective marker selection. In this study, we develop a penalized multitrait mixed modeling approach. It accommodates the two different types of correlations and includes several existing methods as special cases. Effective penalization is adopted for marker selection. Simulation demonstrates its satisfactory performance. The GAW 18 data are analyzed using the proposed method. PMID:27247027
Hong, Kyung-Won; Min, Haesook; Heo, Byeong-Mun; Joo, Seong Eun; Kim, Sung Soo; Kim, Yeonjung
2012-06-01
Increased pulse pressure (PP) and decreased mean arterial pressure (MAP) are strong prognostic predictors of adverse cardiovascular events. Recently, the International Consortium for Blood Pressure Genome-Wide Association Studies (ICBP-GWAS) reported eight loci that influenced PP and MAP. The ICBP-GWAS examined 51 cohorts, comprising 122 671 individuals of European ancestry, and identified eight SNPs: five that governed PP and three that controlled MAP. Six of these loci were novel. To replicate these newly identified loci and compare the genetic architecture of PP and MAP between European and Asian populations, we conducted a meta-analysis of the eight SNPs combining data from ICBP and general population-based Korean cohorts. Two SNPs (rs13002573 (FIGN) and rs871606 (CHIC2)) for PP and two SNPs (rs1446468 (FIGN) and rs319690 (MAP4)) for MAP were replicated in Koreans. Although our GWAS found only moderate association, the findings prompt us to propose that a similar genetic architecture governs PP and MAP in Asians and Europeans. However, further studies in other Asian populations will be needed to confirm this possibility.
seXY: a tool for sex inference from genotype arrays.
Qian, David C; Busam, Jonathan A; Xiao, Xiangjun; O'Mara, Tracy A; Eeles, Rosalind A; Schumacher, Frederick R; Phelan, Catherine M; Amos, Christopher I
2017-02-15
Checking concordance between reported sex and genotype-inferred sex is a crucial quality control measure in genome-wide association studies (GWAS). However, limited insights exist regarding the true accuracy of software that infers sex from genotype array data. We present seXY, a logistic regression model trained on both X chromosome heterozygosity and Y chromosome missingness, that consistently demonstrated >99.5% sex inference accuracy in cross-validation for 889 males and 5,361 females enrolled in prostate cancer and ovarian cancer GWAS. Compared to PLINK, one of the most popular tools for sex inference in GWAS that assesses only X chromosome heterozygosity, seXY achieved marginally better male classification and 3% more accurate female classification. Availability: https://github.com/Christopher-Amos-Lab/seXY. Contact: Christopher.I.Amos@dartmouth.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
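The two predictors named in the abstract, X chromosome heterozygosity and Y chromosome missingness, can be sketched in a few lines of Python. This is not the seXY implementation: the genotype coding (0/1/2 allele counts with None for a missing call), the function names, and the logistic coefficients are all assumptions chosen for illustration, and the coefficients are placeholders rather than fitted values.

```python
import math


def x_heterozygosity(x_genotypes):
    """Fraction of non-missing X-chromosome calls that are heterozygous (coded 1)."""
    called = [g for g in x_genotypes if g is not None]
    if not called:
        return 0.0
    return sum(1 for g in called if g == 1) / len(called)


def y_missingness(y_genotypes):
    """Fraction of Y-chromosome calls that are missing (coded None)."""
    if not y_genotypes:
        return 1.0
    return sum(1 for g in y_genotypes if g is None) / len(y_genotypes)


def probability_female(x_het, y_miss, intercept=-6.0, beta_het=20.0, beta_miss=8.0):
    """Logistic model on the two features; coefficients are illustrative placeholders."""
    linear = intercept + beta_het * x_het + beta_miss * y_miss
    return 1.0 / (1.0 + math.exp(-linear))


# Toy samples: no X heterozygosity and complete Y calls versus the opposite pattern.
male_like = probability_female(
    x_heterozygosity([0, 2, 0, 2, 2, 0]), y_missingness([0, 2, 2, 0])
)
female_like = probability_female(
    x_heterozygosity([0, 1, 1, 2, 1, 0]), y_missingness([None, None, None, 0])
)
```

Using both features means a sample with low X heterozygosity but mostly missing Y calls can still be flagged as likely female, which is the kind of case a heterozygosity-only check may misclassify.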
Chen, Zhijian; Craiu, Radu V; Bull, Shelley B
2014-11-01
In focused studies designed to follow up associations detected in a genome-wide association study (GWAS), investigators can proceed to fine-map a genomic region by targeted sequencing or dense genotyping of all variants in the region, aiming to identify a functional sequence variant. For the analysis of a quantitative trait, we consider a Bayesian approach to fine-mapping study design that incorporates stratification according to a promising GWAS tag SNP in the same region. Improved cost-efficiency can be achieved when the fine-mapping phase incorporates a two-stage design, with identification of a smaller set of more promising variants in a subsample taken in stage 1, followed by their evaluation in an independent stage 2 subsample. To avoid the potential negative impact of genetic model misspecification on inference we incorporate genetic model selection based on posterior probabilities for each competing model. Our simulation study shows that, compared to simple random sampling that ignores genetic information from GWAS, tag-SNP-based stratified sample allocation methods reduce the number of variants continuing to stage 2 and are more likely to promote the functional sequence variant into confirmation studies. © 2014 WILEY PERIODICALS, INC.
Genome-wide association mapping of quantitative traits in a breeding population of sugarcane.
Racedo, Josefina; Gutiérrez, Lucía; Perera, María Francisca; Ostengo, Santiago; Pardo, Esteban Mariano; Cuenya, María Inés; Welin, Bjorn; Castagnaro, Atilio Pedro
2016-06-24
Molecular markers associated with relevant agronomic traits could significantly reduce the time and cost involved in developing new sugarcane varieties. Previous sugarcane genome-wide association analyses (GWAS) have found few molecular markers associated with relevant traits at plant-cane stage. The aim of this study was to establish an appropriate GWAS to find molecular markers associated with yield related traits consistent across harvesting seasons in a breeding population. Sugarcane clones were genotyped with DArT (Diversity Array Technology) and TRAP (Target Region Amplified Polymorphism) markers, and evaluated for cane yield (CY) and sugar content (SC) at two locations during three successive crop cycles. GWAS mapping was applied within a novel mixed-model framework accounting for population structure with Principal Component Analysis scores as random component. A total of 43 markers significantly associated with CY in plant-cane, 42 in first ratoon, and 41 in second ratoon were detected. Out of these markers, 20 were associated with CY in 2 years. Additionally, 38 significant associations for SC were detected in plant-cane, 34 in first ratoon, and 47 in second ratoon. For SC, one marker-trait association was found significant for the 3 years of the study, while twelve markers presented association for 2 years. In the multi-QTL model several markers with large allelic substitution effect were found. Sequences of four DArT markers showed high similitude and e-value with coding sequences of Sorghum bicolor, confirming the high gene microlinearity between sorghum and sugarcane. In contrast with other sugarcane GWAS studies reported earlier, the novel methodology to analyze multi-QTLs through successive crop cycles used in the present study allowed us to find several markers associated with relevant traits. 
Combining existing phenotypic trial data and genotypic DArT and TRAP marker characterizations within a GWAS approach including population structure as random covariates may prove to be highly successful. Moreover, sequences of DArT markers associated with the traits of interest were aligned in chromosomal regions where sorghum QTLs have previously been reported. This approach could be a valuable tool to assist the improvement of sugarcane and to better meet the sugarcane demand projected for the upcoming decades.
Correcting Systematic Inflation in Genetic Association Tests That Consider Interaction Effects
Almli, Lynn M.; Duncan, Richard; Feng, Hao; Ghosh, Debashis; Binder, Elisabeth B.; Bradley, Bekh; Ressler, Kerry J.; Conneely, Karen N.; Epstein, Michael P.
2015-01-01
IMPORTANCE Genetic association studies of psychiatric outcomes often consider interactions with environmental exposures and, in particular, apply tests that jointly consider gene and gene-environment interaction effects for analysis. Using a genome-wide association study (GWAS) of posttraumatic stress disorder (PTSD), we report that heteroscedasticity (defined as variability in outcome that differs by the value of the environmental exposure) can invalidate traditional joint tests of gene and gene-environment interaction. OBJECTIVES To identify the cause of bias in traditional joint tests of gene and gene-environment interaction in a PTSD GWAS and determine whether proposed robust joint tests are insensitive to this problem. DESIGN, SETTING, AND PARTICIPANTS The PTSD GWAS data set consisted of 3359 individuals (978 men and 2381 women) from the Grady Trauma Project (GTP), a cohort study from Atlanta, Georgia. The GTP performed genome-wide genotyping of participants and collected environmental exposures using the Childhood Trauma Questionnaire and Trauma Experiences Inventory. MAIN OUTCOMES AND MEASURES We performed joint interaction testing of the Beck Depression Inventory and modified PTSD Symptom Scale in the GTP GWAS. We assessed systematic bias in our interaction analyses using quantile-quantile plots and genome-wide inflation factors. RESULTS Application of the traditional joint interaction test to the GTP GWAS yielded systematic inflation across different outcomes and environmental exposures (inflation-factor estimates ranging from 1.07 to 1.21), whereas application of the robust joint test to the same data set yielded no such inflation (inflation-factor estimates ranging from 1.01 to 1.02). Simulated data further revealed that the robust joint test is valid in different heteroscedasticity models, whereas the traditional joint test is invalid. The robust joint test also has power similar to the traditional joint test when heteroscedasticity is not an issue. 
CONCLUSIONS AND RELEVANCE We believe the robust joint test should be used in candidate-gene studies and GWASs of psychiatric outcomes that consider environmental interactions. To make the procedure useful for applied investigators, we created a software tool that can be called from the popular PLINK package for analysis. PMID:25354142
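The genome-wide inflation factors quoted above (for example 1.07 to 1.21 versus 1.01 to 1.02) are commonly computed as the median observed test statistic divided by its expected median under the null. The abstract does not give the exact formula used, so the Python sketch below assumes the widely used convention of converting each p-value to a 1-degree-of-freedom chi-square quantile. The function name and the simulated p-values are hypothetical.

```python
from statistics import NormalDist, median

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def inflation_factor(pvalues):
    """Genomic inflation factor (lambda) from p-values.

    Each p-value is mapped to the 1-df chi-square quantile it corresponds to,
    using chi2 = z**2 with z = Phi^-1(1 - p/2). P-values must lie in (0, 1].
    """
    normal = NormalDist()
    chi2 = [normal.inv_cdf(1.0 - p / 2.0) ** 2 for p in pvalues]
    return median(chi2) / CHI2_1DF_MEDIAN


# Evenly spaced p-values mimic a well-calibrated test (lambda close to 1).
calibrated = [(i + 0.5) / 1000 for i in range(1000)]
lambda_calibrated = inflation_factor(calibrated)

# Shrinking the p-values mimics systematic inflation (lambda above 1).
inflated = [p ** 1.2 for p in calibrated]
lambda_inflated = inflation_factor(inflated)
```

Values near 1 indicate a well-calibrated test, while values noticeably above 1, as reported for the traditional joint test, indicate systematic inflation.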
Trans-ethnic meta-analysis of genome-wide association studies for Hirschsprung disease.
Tang, Clara Sze-Man; Gui, Hongsheng; Kapoor, Ashish; Kim, Jeong-Hyun; Luzón-Toro, Berta; Pelet, Anna; Burzynski, Grzegorz; Lantieri, Francesca; So, Man-Ting; Berrios, Courtney; Shin, Hyoung Doo; Fernández, Raquel M; Le, Thuy-Linh; Verheij, Joke B G M; Matera, Ivana; Cherny, Stacey S; Nandakumar, Priyanka; Cheong, Hyun Sub; Antiñolo, Guillermo; Amiel, Jeanne; Seo, Jeong-Meen; Kim, Dae-Yeon; Oh, Jung-Tak; Lyonnet, Stanislas; Borrego, Salud; Ceccherini, Isabella; Hofstra, Robert M W; Chakravarti, Aravinda; Kim, Hyun-Young; Sham, Pak Chung; Tam, Paul K H; Garcia-Barceló, Maria-Mercè
2016-12-01
Hirschsprung disease (HSCR) is the most common cause of neonatal intestinal obstruction. It is characterized by the absence of ganglia in the nerve plexuses of the lower gastrointestinal tract. So far, three common disease-susceptibility variants at the RET, SEMA3 and NRG1 loci have been detected through genome-wide association studies (GWAS) in Europeans and Asians to understand its genetic etiologies. Here we present a trans-ethnic meta-analysis of 507 HSCR cases and 1191 controls, combining all published GWAS results on HSCR to fine-map these loci and narrow down the putatively causal variants to 99% credible sets. We also demonstrate that the effects of RET and NRG1 are universal across European and Asian ancestries. In contrast, we detected a European-specific association of a low-frequency variant, rs80227144, in SEMA3 [odds ratio (OR) = 5.2, P = 4.7 × 10⁻¹⁰]. Conditional analyses on the lead SNPs revealed a secondary association signal, corresponding to an Asian-specific, low-frequency missense variant encoding RET p.Asp489Asn (rs9282834, conditional OR = 20.3, conditional P = 4.1 × 10⁻¹⁴). When in trans with the RET intron 1 enhancer risk allele, rs9282834 increases the risk of HSCR from 1.1 to 26.7. Overall, our study provides further insights into the genetic architecture of HSCR and has profound implications for future study designs. © The Author 2016. Published by Oxford University Press. All rights reserved.
Squarcina, L; Houenou, J; Altamura, A C; Soares, J; Brambilla, P
2017-10-15
Diffusion tensor imaging (DTI) studies, which allow the in-vivo investigation of brain tissue integrity, have shown that bipolar disorder (BD) patients present signs of white matter dysconnectivity. In parallel, genome-wide association studies (GWAS) identified several risk genetic variants for BD. In this mini-review, we summarized DTI studies coupling tract-based spatial statistics (TBSS), a reliable technique exploring white matter axon bundles, and genetics in BD. We performed a bibliographic search on PUBMED, using the search terms "TBSS", "genetics", "genome", "genes", "polymorphism", "bipolar disorder". Ten studies met these inclusion criteria. ANK3 and ZNF804A polymorphisms have shown the most consistent results, with the risk alleles showing abnormal white matter integrity in patients with BD. Current studies are limited by the investigation of single SNPs in small and chronically treated samples. Most considered TBSS-DTI studies found associations between decreased white matter integrity and genetic risk variants. These results suggest an involvement of dysmyelination in the pathogenesis of BD. The combination of TBSS with genotyping can be powerful to unveil the role of white matter in BD, in conjunction with risk genes. Future DTI studies should combine TBSS and GWAS in large populations of drug-free or minimally treated patients with BD at the onset of the disease. Copyright © 2017 Elsevier B.V. All rights reserved.
Bensen, Jeannette T; Xu, Zongli; Smith, Gary J; Mohler, James L; Fontham, Elizabeth T H; Taylor, Jack A
2013-01-01
Genome-wide association studies have established a number of replicated single nucleotide polymorphisms (SNPs) for susceptibility to prostate cancer (CaP), but it is unclear whether these susceptibility SNPs are also associated with disease aggressiveness. This study evaluates whether such replication SNPs or other candidate SNPs are associated with CaP aggressiveness in African-American (AA) and European-American (EA) men. A 1,536 SNP panel which included 34 genome-wide association study (GWAS) replication SNPs, 38 flanking SNPs, a set of ancestry informative markers, and SNPs in candidate genes and other areas was genotyped in 1,060 AA and 1,087 EA men with incident CaP from the North Carolina-Louisiana Prostate Cancer Project (PCaP). Tests for association were conducted using ordinal logistic regression with a log-additive genotype model and a 3-category CaP aggressiveness variable. Four GWAS replication SNPs (rs2660753, rs13254738, rs10090154, rs2735839) and seven flanking SNPs were associated with CaP aggressiveness (P < 0.05) in three genomic regions: one at 3p12 (EA), seven at 8q24 (5 AA, 2 EA), and three at 19q13 at the kallikrein-related peptidase 3 (KLK3) locus (two AA, one AA and EA). The KLK3 SNPs also were associated with serum prostate-specific antigen (PSA) levels in AA (P < 0.001) but not in EA. A number of the other SNPs showed some evidence of association but none met study-wide significance levels after adjusting for multiple comparisons. Some replicated GWAS susceptibility SNPs may play a role in CaP aggressiveness. However, like susceptibility, these associations are not consistent between racial groups. Copyright © 2012 Wiley Periodicals, Inc.
Bensen, Jeannette T.; Xu, Zongli; Smith, Gary J.; Mohler, James L.; Fontham, Elizabeth T.H.; Taylor, Jack A.
2012-01-01
BACKGROUND Genome-wide association studies have established a number of replicated single nucleotide polymorphisms (SNPs) for susceptibility to prostate cancer (CaP), but it is unclear whether these susceptibility SNPs are also associated with disease aggressiveness. This study evaluates whether such replication SNPs or other candidate SNPs are associated with CaP aggressiveness in African-American (AA) and European-American (EA) men. METHODS A 1,536 SNP panel which included 34 genome-wide association study (GWAS) replication SNPs, 38 flanking SNPs, a set of ancestry informative markers, and SNPs in candidate genes and other areas was genotyped in 1,060 AA and 1,087 EA men with incident CaP from the North Carolina-Louisiana Prostate Cancer Project (PCaP). Tests for association were conducted using ordinal logistic regression with a log-additive genotype model and a 3-category CaP aggressiveness variable. RESULTS 4 GWAS replication SNPs (rs2660753, rs13254738, rs10090154, rs2735839) and 7 flanking SNPs were associated with CaP aggressiveness (P < 0.05) in 3 genomic regions: one at 3p12 (EA), 7 at 8q24 (5 AA, 2 EA), and 3 at 19q13 at the kallikrein-related peptidase 3 (KLK3) locus (2 AA, 1 AA and EA). The KLK3 SNPs also were associated with serum prostate-specific antigen (PSA) levels in AA (P < 0.001) but not in EA. A number of the other SNPs showed some evidence of association but none met study-wide significance levels after adjusting for multiple comparisons. CONCLUSIONS Some replicated GWAS susceptibility SNPs may play a role in CaP aggressiveness. However, like susceptibility, these associations are not consistent between racial groups. PMID:22549899
Zhang, Chenan; Chen, Lin S; Gao, Jianjun; Roy, Shantanu; Shinkle, Justin; Sabarinathan, Mekala; Tong, Lin; Ahmed, Alauddin; Islam, Tariqul; Rakibuz-Zaman, Muhammad; Sarwar, Golam; Shahriar, Hasan; Rahman, Mahfuzar; Yunus, Mohammad; Jasmine, Farzana; Kibriya, Muhammad G; Ahsan, Habibul; Pierce, Brandon L
2018-01-01
Background Leucocyte telomere length (TL) is a potential biomarker of ageing and risk for age-related disease. Leucocyte TL is heritable and shows substantial differences by race/ethnicity. Recent genome-wide association studies (GWAS) report ~10 loci harbouring SNPs associated with leucocyte TL, but these studies focus primarily on populations of European ancestry. Objective This study aims to enhance our understanding of genetic determinants of TL across populations. Methods We performed a GWAS of TL using data on 5075 Bangladeshi adults. We measured TL using one of two technologies (qPCR or a Luminex-based method) and used standardised variables as TL phenotypes. Results Our results replicate previously reported associations in the TERC and TERT regions (P=2.2×10⁻⁸ and P=6.4×10⁻⁶, respectively). We observed a novel association signal in the RTEL1 gene (intronic SNP rs2297439; P=2.82×10⁻⁷) that is independent of previously reported TL-associated SNPs in this region. The minor allele for rs2297439 is common in South Asian populations (≥0.25) but at lower frequencies in other populations (eg, 0.07 in Northern Europeans). Among the eight other previously reported association signals, all were directionally consistent with our study, but only rs8105767 (ZNF208) was nominally significant (P=0.003). SNP-based heritability estimates were as high as 44% when analysing close relatives but much lower when analysing distant relatives only. Conclusions In this first GWAS of TL in a South Asian population, we replicate some, but not all, of the loci reported in prior GWAS of individuals of European ancestry, and we identify a novel second association signal at the RTEL1 locus. PMID:29151059
Intergenic disease-associated regions are abundant in novel transcripts.
Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E
2017-12-28
Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.
Thorwarth, Patrick; Yousef, Eltohamy A A; Schmid, Karl J
2018-02-02
Genetic resources are an important source of genetic variation for plant breeding. Genome-wide association studies (GWAS) and genomic prediction greatly facilitate the analysis and utilization of useful genetic diversity for improving complex phenotypic traits in crop plants. We explored the potential of GWAS and genomic prediction for improving curd-related traits in cauliflower ( Brassica oleracea var. botrytis ) by combining 174 randomly selected cauliflower gene bank accessions from two different gene banks. The collection was genotyped with genotyping-by-sequencing (GBS) and phenotyped for six curd-related traits at two locations and three growing seasons. A GWAS analysis based on 120,693 single-nucleotide polymorphisms identified a total of 24 significant associations for curd-related traits. The potential for genomic prediction was assessed with a genomic best linear unbiased prediction model and BayesB. Prediction abilities ranged from 0.10 to 0.66 for different traits and did not differ between prediction methods. Imputation of missing genotypes only slightly improved prediction ability. Our results demonstrate that GWAS and genomic prediction in combination with GBS and phenotyping of highly heritable traits can be used to identify useful quantitative trait loci and genotypes among genetically diverse gene bank material for subsequent utilization as genetic resources in cauliflower breeding. Copyright © 2018 Thorwarth et al.
Human brain arousal in the resting state: a genome-wide association study.
Jawinski, Philippe; Kirsten, Holger; Sander, Christian; Spada, Janek; Ulke, Christine; Huang, Jue; Burkhardt, Ralph; Scholz, Markus; Hensch, Tilman; Hegerl, Ulrich
2018-04-27
Arousal affects cognition, emotion, and behavior and has been implicated in the etiology of psychiatric disorders. Although environmental conditions substantially contribute to the level of arousal, stable interindividual characteristics are well-established and a genetic basis has been suggested. Here we investigated the molecular genetics of brain arousal in the resting state by conducting a genome-wide association study (GWAS). We selected N = 1877 participants from the population-based LIFE-Adult cohort. Participants underwent a 20-min eyes-closed resting state EEG, which was analyzed using the computerized VIGALL 2.1 (Vigilance Algorithm Leipzig). At the SNP-level, GWAS analyses revealed no genome-wide significant locus (p < 5E-8), although seven loci were suggestive (p < 1E-6). The strongest hit was an expression quantitative trait locus (eQTL) of TMEM159 (lead-SNP: rs79472635, p = 5.49E-8). Importantly, at the gene-level, GWAS analyses revealed significant evidence for TMEM159 (p = 0.013, Bonferroni-corrected). By mapping our SNPs to the GWAS results from the Psychiatric Genomics Consortium, we found that all corresponding markers of TMEM159 showed nominally significant associations with Major Depressive Disorder (MDD; 0.006 ≤ p ≤ 0.011). More specifically, variants associated with high arousal levels have previously been linked to an increased risk for MDD. In line with this, the MetaXcan database suggests increased expression levels of TMEM159 in MDD, as well as Autism Spectrum Disorder, and Alzheimer's Disease. Furthermore, our pathway analyses provided evidence for a role of sodium/calcium exchangers in resting state arousal. In conclusion, the present GWAS identifies TMEM159 as a novel candidate gene which may modulate the risk for psychiatric disorders through arousal mechanisms. Our results also encourage the elaboration of the previously reported interrelations between ion-channel modulators, sleep-wake behavior, and psychiatric disorders.
Bostrom, Meredith A.; Kao, W.H. Linda; Li, Man; Abboud, Hanna E.; Adler, Sharon G.; Iyengar, Sudha K.; Kimmel, Paul L.; Hanson, Robert L.; Nicholas, Susanne B.; Rasooly, Rebekah S.; Sedor, John R.; Coresh, Josef; Kohn, Orly F.; Leehey, David J.; Thornley-Brown, Denyse; Bottinger, Erwin P.; Lipkowitz, Michael S.; Meoni, Lucy A.; Klag, Michael J.; Lu, Lingyi; Hicks, Pamela J.; Langefeld, Carl D.; Parekh, Rulan S.; Bowden, Donald W.; Freedman, Barry I.
2011-01-01
Background African Americans (AAs) have increased susceptibility to non-diabetic nephropathy relative to European Americans. Study Design Follow-up of a pooled genome-wide association study (GWAS) in AA dialysis patients with nondiabetic nephropathy; novel gene-gene interaction analyses. Setting & Participants Wake Forest sample: 962 AA nondiabetic nephropathy cases; 931 non-nephropathy controls. Replication sample: 668 Family Investigation of Nephropathy and Diabetes (FIND) AA nondiabetic nephropathy cases; 804 non-nephropathy controls. Predictors Individual genotyping of top 1420 pooled GWAS-associated single nucleotide polymorphisms (SNPs) and 54 SNPs in six nephropathy susceptibility genes. Outcomes APOL1 genetic association and additional candidate susceptibility loci interacting with, or independently from, APOL1. Results The strongest GWAS associations included two non-coding APOL1 SNPs, rs2239785 (odds ratio [OR], 0.33; dominant; p = 5.9 × 10⁻²⁴) and rs136148 (OR, 0.54; additive; p = 1.1 × 10⁻⁷) with replication in FIND (p = 5.0 × 10⁻²¹ and 1.9 × 10⁻⁵, respectively). Rs2239785 remained significantly associated after controlling for the APOL1 G1 and G2 coding variants. Additional top hits included a CFH SNP (OR from meta-analysis in above 3367 AA cases and controls, 0.81; additive; p = 6.8 × 10⁻⁴). The 1420 SNPs were tested for interaction with APOL1 G1 and G2 variants. Several interactive SNPs were detected, the most significant was rs16854341 in the podocin gene (NPHS2) (p = 0.0001). Limitations Non-pooled GWAS have not been performed in AA nondiabetic nephropathy. Conclusions This follow-up of a pooled GWAS provides additional and independent evidence that APOL1 variants contribute to nondiabetic nephropathy in AAs and identified additional associated and interactive non-diabetic nephropathy susceptibility genes. PMID:22119407
SUSCEPTIBILITY LOCI FOR UMBILICAL HERNIA IN SWINE DETECTED BY GENOME-WIDE ASSOCIATION.
Liao, X J; Lia, L; Zhang, Z Y; Long, Y; Yang, B; Ruan, G R; Su, Y; Ai, H S; Zhang, W C; Deng, W Y; Xiao, S J; Ren, J; Ding, N S; Huang, L S
2015-10-01
Umbilical hernia (UH) is a complex disorder caused by both genetic and environmental factors. UH brings animal welfare problems and severe economic loss to the pig industry. The genetic basis of UH remains poorly understood. The high-density 60K porcine SNP array enables the rapid application of genome-wide association study (GWAS) to identify genetic loci for phenotypic traits at genome-wide scale in pigs. The objective of this research was to identify susceptibility loci for swine umbilical hernia using the GWAS approach. We genotyped 478 piglets from 142 families representing three Western commercial breeds with the Illumina PorcineSNP60 BeadChip. Significant SNPs were then detected by GWAS using ROADTRIPS (Robust Association-Detection Test for Related Individuals with Population Substructure) software, based on a Bonferroni-corrected threshold (P = 1.67E-06) or suggestive threshold (P = 3.34E-05) and false discovery rate (FDR = 0.05). After quality control, 29,924 qualified SNPs and 472 piglets were used for GWAS. Two suggestive loci predisposing to pig UH were identified in the Duroc population, at 44.25 Mb on SSC2 (rs81358018, P = 3.34E-06, FDR = 0.049933) and at 45.90 Mb on SSC17 (rs81479278, P = 3.30E-06, FDR = 0.049933). No SNP was significantly associated with pig UH in either the Landrace or the Large White population. Furthermore, we carried out a meta-analysis in the combined pure-breed population containing all 472 piglets. rs81479278 (P = 1.16E-06, FDR = 0.022475) was associated with pig UH at the genome-wide significance level. SRC was characterized as a plausible candidate gene for susceptibility to pig UH according to its genomic position and biological functions. To our knowledge, this study gives the first description of GWAS identifying susceptibility loci for umbilical hernia in pigs. Our findings provide deeper insights into the genetic architecture of umbilical hernia in pigs.
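The two thresholds reported above are numerically consistent with the common convention of dividing by the number of SNPs tested (29,924 after quality control): 0.05/N for Bonferroni-corrected significance and 1/N for the suggestive level. The abstract does not spell out these formulas, so the following Python sketch is an inference from the reported numbers, and the function names are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold controlling family-wise error at alpha."""
    return alpha / n_tests


def suggestive_threshold(n_tests):
    """Suggestive threshold: one expected false positive per genome scan."""
    return 1.0 / n_tests


n_snps = 29924  # SNPs retained after quality control, as stated in the abstract
print(f"Bonferroni: {bonferroni_threshold(0.05, n_snps):.2E}")
print(f"Suggestive: {suggestive_threshold(n_snps):.2E}")
```

Under this reading, the two Duroc loci (P = 3.34E-06 and P = 3.30E-06) fall between the two thresholds, which matches their description as suggestive, while the meta-analysis result for rs81479278 (P = 1.16E-06) passes the stricter one.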
Sekula, Peggy; Li, Yong; Stanescu, Horia C; Wuttke, Matthias; Ekici, Arif B; Bockenhauer, Detlef; Walz, Gerd; Powis, Stephen H; Kielstein, Jan T; Brenchley, Paul; Eckardt, Kai-Uwe; Kronenberg, Florian; Kleta, Robert; Köttgen, Anna
2017-02-01
Membranous nephropathy (MN) is a common cause of nephrotic syndrome in adults. Previous genome-wide association studies (GWAS) of 300 000 genotyped variants identified MN-associated loci at HLA-DQA1 and PLA2R1. We used a combined approach of genotype imputation, GWAS, human leucocyte antigen (HLA) imputation and extension to other aetiologies of chronic kidney disease (CKD) to investigate genetic MN risk variants more comprehensively. GWAS using 9 million high-quality imputed genotypes and classical HLA alleles were conducted for 323 MN European-ancestry cases and 345 controls. Additionally, 4960 patients with different CKD aetiologies in the German Chronic Kidney Disease (GCKD) study were genotyped for risk variants at HLA-DQA1 and PLA2R1. In GWAS, lead variants in known loci [rs9272729, HLA-DQA1, odds ratio (OR) = 7.3 per risk allele, P = 5.9 × 10⁻²⁷ and rs17830558, PLA2R1, OR = 2.2, P = 1.9 × 10⁻⁸] were significantly associated with MN. No novel signals emerged in GWAS of X-chromosomal variants or in sex-specific analyses. Classical HLA alleles (DRB1*0301-DQA1*0501-DQB1*0201 haplotype) were associated with MN but provided little additional information beyond rs9272729. Associations were replicated in 137 GCKD patients with MN (HLA-DQA1: P = 6.4 × 10⁻²⁴; PLA2R1: P = 5.0 × 10⁻⁴). MN risk increased steeply for patients with high-risk genotype combinations (OR > 79). While genetic variation in PLA2R1 exclusively associated with MN across 19 CKD aetiologies, the HLA-DQA1 risk allele was also associated with lupus nephritis (P = 2.8 × 10⁻⁶), type 1 diabetic nephropathy (P = 6.9 × 10⁻⁵) and focal segmental glomerulosclerosis (P = 5.1 × 10⁻⁵), but not with immunoglobulin A nephropathy. PLA2R1 and HLA-DQA1 are the predominant risk loci for MN detected by GWAS. While HLA-DQA1 risk variants show an association with other CKD aetiologies, PLA2R1 variants are specific to MN. © The Author 2016. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
Shu, Xiang; Purdue, Mark P; Ye, Yuanqing; Tu, Huakang; Wood, Christopher G; Tannir, Nizar M; Wang, Zhaoming; Albanes, Demetrius; Gapstur, Susan M; Stevens, Victoria L; Rothman, Nathaniel; Chanock, Stephen J; Wu, Xifeng
2017-09-01
Background: Obesity is an established risk factor for renal cell carcinoma (RCC). Although genome-wide association studies (GWAS) of RCC have identified several susceptibility loci, additional variants might be missed due to the highly conservative selection. Methods: We conducted a multiphase study utilizing three independent genome-wide scans at MD Anderson Cancer Center (MDA RCC GWAS and MDA RCC OncoArray) and National Cancer Institute (NCI RCC GWAS), which consisted of a total of 3,530 cases and 5,714 controls, to investigate genetic variations in obesity-related genes and RCC risk. Results: In the discovery phase, 32,946 SNPs located at ±10 kb of 2,001 obesity-related genes were extracted from MDA RCC GWAS and analyzed using multivariable logistic regression. Proxies (R² > 0.8) were searched or imputation was performed if SNPs were not directly genotyped in the validation sets. Twenty-one SNPs with P < 0.05 in both MDA RCC GWAS and NCI RCC GWAS were subsequently evaluated in MDA RCC OncoArray. In the overall meta-analysis, significant (P < 0.05) associations with RCC risk were observed for SNP mapping to IL1RAPL2 [rs10521506-G: OR_meta = 0.87 (0.81-0.93), P_meta = 2.33 × 10⁻⁵], PLIN2 [rs2229536-A: OR_meta = 0.87 (0.81-0.93), P_meta = 2.33 × 10⁻⁵], SMAD3 [rs4601989-A: OR_meta = 0.86 (0.80-0.93), P_meta = 2.71 × 10⁻⁴], MED13L [rs10850596-A: OR_meta = 1.14 (1.07-1.23), P_meta = 1.50 × 10⁻⁴], and TSC1 [rs3761840-G: OR_meta = 0.90 (0.85-0.97), P_meta = 2.47 × 10⁻³]. We did not observe any significant cis-expression quantitative trait loci effect for these SNPs in the TCGA KIRC data. Conclusions: Taken together, we found that genetic variation of obesity-related genes could influence RCC susceptibility. Impact: The five identified loci may provide new insights into disease etiology that reveal importance of obesity-related genes in RCC development. Cancer Epidemiol Biomarkers Prev; 26(9); 1436-42. ©2017 AACR.
Graham, Hillary T; Rotroff, Daniel M; Marvel, Skylar W; Buse, John B; Havener, Tammy M; Wilson, Alyson G; Wagner, Michael J; Motsinger-Reif, Alison A
2016-01-01
Given the high costs of conducting a drug-response trial, researchers are now aiming to use retrospective analyses to conduct genome-wide association studies (GWAS) to identify underlying genetic contributions to drug-response variation. To prevent confounding in a GWAS of drug response, it is necessary to account for concomitant medications, defined as any medication taken concurrently with the primary medication being investigated. We use data from the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial in order to implement a novel scoring procedure for incorporating concomitant medication information into a linear regression model in preparation for GWAS. In order to accomplish this, two primary medications were selected, thiazolidinediones and metformin, because of the widespread use of these medications and the large sample sizes available within the ACCORD trial. A third medication, fenofibrate, along with a known confounding medication, statin, was chosen as a proof-of-principle for the scoring procedure. Previous studies have identified SNP rs7412 as being associated with statin response. Here we hypothesize that including the score for statin as a covariate in the GWAS model will correct for confounding of statin and yield a change in association at rs7412. The response of the confounded signal was successfully diminished from p = 3.19 × 10⁻⁷ to p = 1.76 × 10⁻⁵ by accounting for statin using the scoring procedure presented here. This approach provides the ability for researchers to account for concomitant medications in complex trial designs where monotherapy treatment regimens are not available.
Controlling the Rate of GWAS False Discoveries
Brzyski, Damian; Peterson, Christine B.; Sobczyk, Piotr; Candès, Emmanuel J.; Bogdan, Malgorzata; Sabatti, Chiara
2017-01-01
With the rise of both the number and the complexity of traits of interest, control of the false discovery rate (FDR) in genetic association studies has become an increasingly appealing and accepted target for multiple comparison adjustment. While a number of robust FDR-controlling strategies exist, the nature of this error rate is intimately tied to the precise way in which discoveries are counted, and the performance of FDR-controlling procedures is satisfactory only if there is a one-to-one correspondence between what scientists describe as unique discoveries and the number of rejected hypotheses. The presence of linkage disequilibrium between markers in genome-wide association studies (GWAS) often leads researchers to consider the signal associated to multiple neighboring SNPs as indicating the existence of a single genomic locus with possible influence on the phenotype. This a posteriori aggregation of rejected hypotheses results in inflation of the relevant FDR. We propose a novel approach to FDR control that is based on prescreening to identify the level of resolution of distinct hypotheses. We show how FDR-controlling strategies can be adapted to account for this initial selection both with theoretical results and simulations that mimic the dependence structure to be expected in GWAS. We demonstrate that our approach is versatile and useful when the data are analyzed using both tests based on single markers and multiple regression. We provide an R package that allows practitioners to apply our procedure on standard GWAS format data, and illustrate its performance on lipid traits in the North Finland Birth Cohort 66 cohort study. PMID:27784720
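For orientation, the baseline that FDR-controlling strategies of this kind build on is the standard Benjamini-Hochberg step-up procedure. The Python sketch below shows only that textbook procedure applied to a flat list of P values; it is not the resolution-adjusted approach proposed in the abstract, it does not reproduce the authors' R package, and the function name `benjamini_hochberg` is hypothetical.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Standard Benjamini-Hochberg step-up procedure.

    Returns a list of booleans, one per input P value, marking which
    hypotheses are rejected at target FDR level q.
    """
    m = len(p_values)
    # Indices of the hypotheses, ordered from smallest to largest P value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k with p_(k) <= k * q / m.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * q / m:
            k_max = rank
    # Reject every hypothesis ranked at or below k_max (the step-up rule).
    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected
```

Each rejection here counts one SNP as one discovery. The abstract's point is that when neighboring SNPs in linkage disequilibrium are later merged into a single locus, the FDR among loci can exceed the nominal level q that this per-SNP procedure targets.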
Controlling the Rate of GWAS False Discoveries.
Brzyski, Damian; Peterson, Christine B; Sobczyk, Piotr; Candès, Emmanuel J; Bogdan, Malgorzata; Sabatti, Chiara
2017-01-01
With the rise of both the number and the complexity of traits of interest, control of the false discovery rate (FDR) in genetic association studies has become an increasingly appealing and accepted target for multiple comparison adjustment. While a number of robust FDR-controlling strategies exist, the nature of this error rate is intimately tied to the precise way in which discoveries are counted, and the performance of FDR-controlling procedures is satisfactory only if there is a one-to-one correspondence between what scientists describe as unique discoveries and the number of rejected hypotheses. The presence of linkage disequilibrium between markers in genome-wide association studies (GWAS) often leads researchers to consider the signal associated to multiple neighboring SNPs as indicating the existence of a single genomic locus with possible influence on the phenotype. This a posteriori aggregation of rejected hypotheses results in inflation of the relevant FDR. We propose a novel approach to FDR control that is based on prescreening to identify the level of resolution of distinct hypotheses. We show how FDR-controlling strategies can be adapted to account for this initial selection both with theoretical results and simulations that mimic the dependence structure to be expected in GWAS. We demonstrate that our approach is versatile and useful when the data are analyzed using both tests based on single markers and multiple regression. We provide an R package that allows practitioners to apply our procedure on standard GWAS format data, and illustrate its performance on lipid traits in the North Finland Birth Cohort 66 cohort study. Copyright © 2017 by the Genetics Society of America.
Integrative Genomics Reveals Novel Molecular Pathways and Gene Networks for Coronary Artery Disease
Mäkinen, Ville-Petteri; Civelek, Mete; Meng, Qingying; Zhang, Bin; Zhu, Jun; Levian, Candace; Huan, Tianxiao; Segrè, Ayellet V.; Ghosh, Sujoy; Vivar, Juan; Nikpay, Majid; Stewart, Alexandre F. R.; Nelson, Christopher P.; Willenborg, Christina; Erdmann, Jeanette; Blakenberg, Stefan; O'Donnell, Christopher J.; März, Winfried; Laaksonen, Reijo; Epstein, Stephen E.; Kathiresan, Sekar; Shah, Svati H.; Hazen, Stanley L.; Reilly, Muredach P.; Lusis, Aldons J.; Samani, Nilesh J.; Schunkert, Heribert; Quertermous, Thomas; McPherson, Ruth; Yang, Xia; Assimes, Themistocles L.
2014-01-01
The majority of the heritability of coronary artery disease (CAD) remains unexplained, despite recent successes of genome-wide association studies (GWAS) in identifying novel susceptibility loci. Integrating functional genomic data from a variety of sources with a large-scale meta-analysis of CAD GWAS may facilitate the identification of novel biological processes and genes involved in CAD, as well as clarify the causal relationships of established processes. Towards this end, we integrated 14 GWAS from the CARDIoGRAM Consortium and two additional GWAS from the Ottawa Heart Institute (25,491 cases and 66,819 controls) with 1) genetics of gene expression studies of CAD-relevant tissues in humans, 2) metabolic and signaling pathways from public databases, and 3) data-driven, tissue-specific gene networks from a multitude of human and mouse experiments. We not only detected CAD-associated gene networks of lipid metabolism, coagulation, immunity, and additional networks with no clear functional annotation, but also revealed key driver genes for each CAD network based on the topology of the gene regulatory networks. In particular, we found a gene network involved in antigen processing to be strongly associated with CAD. The key driver genes of this network included glyoxalase I (GLO1) and peptidylprolyl isomerase I (PPIL1), which we verified as regulatory by siRNA experiments in human aortic endothelial cells. Our results suggest genetic influences on a diverse set of both known and novel biological processes that contribute to CAD risk. The key driver genes for these networks highlight potential novel targets for further mechanistic studies and therapeutic interventions. PMID:25033284
Genome-Wide Association of the Laboratory-Based Nicotine Metabolite Ratio in Three Ancestries.
Baurley, James W; Edlund, Christopher K; Pardamean, Carissa I; Conti, David V; Krasnow, Ruth; Javitz, Harold S; Hops, Hyman; Swan, Gary E; Benowitz, Neal L; Bergen, Andrew W
2016-09-01
Metabolic enzyme variation and other patient and environmental characteristics influence smoking behaviors, treatment success, and risk of related disease. Population-specific variation in metabolic genes contributes to challenges in developing and optimizing pharmacogenetic interventions. We applied a custom genome-wide genotyping array for addiction research (Smokescreen), to three laboratory-based studies of nicotine metabolism with oral or venous administration of labeled nicotine and cotinine, to model nicotine metabolism in multiple populations. The trans-3'-hydroxycotinine/cotinine ratio, the nicotine metabolite ratio (NMR), was the nicotine metabolism measure analyzed. Three hundred twelve individuals of self-identified European, African, and Asian American ancestry were genotyped and included in ancestry-specific genome-wide association scans (GWAS) and a meta-GWAS analysis of the NMR. We modeled natural-log transformed NMR with covariates: principal components of genetic ancestry, age, sex, body mass index, and smoking status. African and Asian American NMRs were statistically significantly (P values ≤ 5E-5) lower than European American NMRs. Meta-GWAS analysis identified 36 genome-wide significant variants over a 43 kilobase pair region at CYP2A6 with minimum P = 2.46E-18 at rs12459249, proximal to CYP2A6. Additional minima were located in intron 4 (rs56113850, P = 6.61E-18) and in the CYP2A6-CYP2A7 intergenic region (rs34226463, P = 1.45E-12). Most (34/36) genome-wide significant variants suggested reduced CYP2A6 activity; functional mechanisms were identified and tested in knowledge-bases. Conditional analysis resulted in intergenic variants of possible interest (P values < 5E-5). This meta-GWAS of the NMR identifies CYP2A6 variants, replicates the top-ranked single nucleotide polymorphism from a recent Finnish meta-GWAS of the NMR, identifies functional mechanisms, and provides pan-continental population biomarkers for nicotine metabolism. 
This multiple ancestry meta-GWAS of the laboratory study-based NMR provides novel evidence and replication for genome-wide association of CYP2A6 single nucleotide and insertion-deletion polymorphisms. We identify three regions of genome-wide significance: proximal, intronic, and distal to CYP2A6. We replicate the top-ranking single nucleotide polymorphism from a recent GWAS of the NMR in Finnish smokers, identify a functional mechanism for this intronic variant from in silico analyses of RNA-seq data that is consistent with CYP2A6 expression measured in postmortem lung and liver, and provide additional support for the intergenic region between CYP2A6 and CYP2A7. © The Author 2016. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco.
Genome-Wide Association of the Laboratory-Based Nicotine Metabolite Ratio in Three Ancestries
Baurley, James W.; Edlund, Christopher K.; Pardamean, Carissa I.; Conti, David V.; Krasnow, Ruth; Javitz, Harold S.; Hops, Hyman; Swan, Gary E.; Benowitz, Neal L.
2016-01-01
Introduction: Metabolic enzyme variation and other patient and environmental characteristics influence smoking behaviors, treatment success, and risk of related disease. Population-specific variation in metabolic genes contributes to challenges in developing and optimizing pharmacogenetic interventions. We applied a custom genome-wide genotyping array for addiction research (Smokescreen), to three laboratory-based studies of nicotine metabolism with oral or venous administration of labeled nicotine and cotinine, to model nicotine metabolism in multiple populations. The trans-3′-hydroxycotinine/cotinine ratio, the nicotine metabolite ratio (NMR), was the nicotine metabolism measure analyzed. Methods: Three hundred twelve individuals of self-identified European, African, and Asian American ancestry were genotyped and included in ancestry-specific genome-wide association scans (GWAS) and a meta-GWAS analysis of the NMR. We modeled natural-log transformed NMR with covariates: principal components of genetic ancestry, age, sex, body mass index, and smoking status. Results: African and Asian American NMRs were statistically significantly (P values ≤ 5E-5) lower than European American NMRs. Meta-GWAS analysis identified 36 genome-wide significant variants over a 43 kilobase pair region at CYP2A6 with minimum P = 2.46E-18 at rs12459249, proximal to CYP2A6. Additional minima were located in intron 4 (rs56113850, P = 6.61E-18) and in the CYP2A6-CYP2A7 intergenic region (rs34226463, P = 1.45E-12). Most (34/36) genome-wide significant variants suggested reduced CYP2A6 activity; functional mechanisms were identified and tested in knowledge-bases. Conditional analysis resulted in intergenic variants of possible interest (P values < 5E-5). 
Conclusions: This meta-GWAS of the NMR identifies CYP2A6 variants, replicates the top-ranked single nucleotide polymorphism from a recent Finnish meta-GWAS of the NMR, identifies functional mechanisms, and provides pan-continental population biomarkers for nicotine metabolism. Implications: This multiple ancestry meta-GWAS of the laboratory study-based NMR provides novel evidence and replication for genome-wide association of CYP2A6 single nucleotide and insertion–deletion polymorphisms. We identify three regions of genome-wide significance: proximal, intronic, and distal to CYP2A6. We replicate the top-ranking single nucleotide polymorphism from a recent GWAS of the NMR in Finnish smokers, identify a functional mechanism for this intronic variant from in silico analyses of RNA-seq data that is consistent with CYP2A6 expression measured in postmortem lung and liver, and provide additional support for the intergenic region between CYP2A6 and CYP2A7. PMID:27113016
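The abstract above pools ancestry-specific GWAS results into a meta-GWAS but does not say how the estimates were combined. As a minimal sketch, assuming a fixed-effect inverse-variance-weighted model (a common choice, not confirmed by the source), per-ancestry effect estimates for one variant could be pooled as follows. The function name and the input numbers are hypothetical and are not taken from the study.

```python
import math

def inverse_variance_meta(betas, ses):
    """Fixed-effect inverse-variance meta-analysis for a single variant.

    betas: per-study effect estimates (e.g., on natural-log NMR)
    ses:   matching standard errors
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    weights = [1.0 / se ** 2 for se in ses]          # precision weights
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    z = pooled_beta / pooled_se
    # Two-sided P value from the standard normal distribution
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p

if __name__ == "__main__":
    # Hypothetical estimates for three ancestry-specific scans (illustration only)
    print(inverse_variance_meta([-0.30, -0.25, -0.35], [0.08, 0.10, 0.12]))
```

Studies with smaller standard errors receive more weight, so the pooled estimate is dominated by the most precise ancestry-specific scan.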
Zhang, Dong; Kong, Wenqian; Robertson, Jon; Goff, Valorie H; Epps, Ethan; Kerr, Alexandra; Mills, Gabriel; Cromwell, Jay; Lugin, Yelena; Phillips, Christine; Paterson, Andrew H
2015-04-19
Domestication has played an important role in shaping characteristics of the inflorescence and plant height in cultivated cereals. Taking advantage of meta-analysis of QTLs, phylogenetic analyses in 502 diverse sorghum accessions, GWAS in a sorghum association panel (n = 354) and comparative data, we provide insight into the genetic basis of the domestication traits in sorghum and rice. We performed genome-wide association studies (GWAS) on 6 traits related to inflorescence morphology and 6 traits related to plant height in sorghum, comparing the genomic regions implicated in these traits by GWAS and QTL mapping, respectively. In a search for signatures of selection, we identify genomic regions that may contribute to sorghum domestication regarding plant height, flowering time and pericarp color. Comparative studies across taxa show functionally conserved 'hotspots' in sorghum and rice for awn presence and pericarp color that do not appear to reflect corresponding single genes but may indicate co-regulated clusters of genes. We also reveal homoeologous regions retaining similar functions for plant height and flowering time since genome duplication an estimated 70 million years ago or more in a common ancestor of cereals. In most such homoeologous QTL pairs, only one QTL interval exhibits strong selection signals in modern sorghum. Intersections among QTL, GWAS and comparative data advance knowledge of genetic determinants of inflorescence and plant height components in sorghum, and add new dimensions to comparisons between sorghum and rice.
A simulation study of gene-by-environment interactions in GWAS implies ample hidden effects
Marigorta, Urko M.; Gibson, Greg
2014-01-01
The switch to a modern lifestyle in recent decades has coincided with a rapid increase in the prevalence of obesity and other diseases. These shifts in prevalence could be explained by the release of genetic susceptibility for disease in the form of gene-by-environment (GxE) interactions. Yet, the detection of interaction effects requires large sample sizes, little replication has been reported, and a few studies have demonstrated environmental effects only after summing the risk of GWAS alleles into genetic risk scores (GRSxE). We performed extensive simulations of a quantitative trait controlled by 2500 causal variants to assess the feasibility of detecting gene-by-environment interactions in the context of GWAS. The simulated individuals were assigned either to an ancestral or a modern setting that alters the phenotype by increasing the effect size by 1.05–2-fold at a varying fraction of perturbed SNPs (from 1 to 20%). We report two main results. First, for a wide range of realistic scenarios, highly significant GRSxE is detected despite the absence of individual genotype GxE evidence at the contributing loci. Second, an increase in phenotypic variance after environmental perturbation reduces the power to discover susceptibility variants by GWAS in mixed cohorts with individuals from both ancestral and modern environments. We conclude that a pervasive presence of gene-by-environment effects can remain hidden even though it contributes to the genetic architecture of complex traits. PMID:25101110
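The simulation design described in this abstract can be illustrated with a much smaller toy version. The sketch below is not the authors' code. It assumes 200 causal SNPs instead of 2500, allele frequency 0.5 at every SNP, equal-magnitude effects with random sign, and Gaussian noise. It keeps the source's upper settings of a 2-fold effect increase at 20% of SNPs in the modern environment. All function and parameter names are hypothetical.

```python
import random

def simulate(n_per_env=2000, n_snps=200, frac_perturbed=0.2, fold=2.0,
             noise_sd=5.0, seed=1):
    """Return a list of (environment, grs, phenotype) tuples.

    environment 0 = ancestral, 1 = modern. In the modern environment the
    effect size is multiplied by `fold` at a random subset of SNPs.
    """
    rng = random.Random(seed)
    betas = [rng.choice((-1.0, 1.0)) for _ in range(n_snps)]   # assumed effects
    perturbed = set(rng.sample(range(n_snps), round(frac_perturbed * n_snps)))
    rows = []
    for env in (0, 1):
        for _ in range(n_per_env):
            # Genotype = number of effect alleles (0, 1, or 2) at each SNP
            geno = [(rng.random() < 0.5) + (rng.random() < 0.5)
                    for _ in range(n_snps)]
            # Genetic risk score built from ancestral effect sizes
            grs = sum(b * g for b, g in zip(betas, geno))
            extra = 0.0
            if env == 1:
                extra = sum((fold - 1.0) * betas[i] * geno[i] for i in perturbed)
            rows.append((env, grs, grs + extra + rng.gauss(0.0, noise_sd)))
    return rows

def slope(xs, ys):
    """Ordinary least-squares slope of ys on xs."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx

if __name__ == "__main__":
    rows = simulate()
    for env in (0, 1):
        xs = [g for e, g, _ in rows if e == env]
        ys = [y for e, _, y in rows if e == env]
        print("environment", env, "slope of phenotype on GRS:", slope(xs, ys))
```

A steeper slope of phenotype on the score in the modern group than in the ancestral group is the GRSxE signal. In this toy setup each perturbed SNP contributes only a small share of that difference, which is in line with the abstract's point that the aggregate interaction can be detectable when single-SNP interactions are not.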
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Dong; Kong, Wenqian; Robertson, Jon
Domestication has played an important role in shaping characteristics of the inflorescence and plant height in cultivated cereals. Taking advantage of meta-analysis of QTLs, phylogenetic analyses in 502 diverse sorghum accessions, GWAS in a sorghum association panel (n = 354) and comparative data, we provide insight into the genetic basis of the domestication traits in sorghum and rice. We performed genome-wide association studies (GWAS) on 6 traits related to inflorescence morphology and 6 traits related to plant height in sorghum, comparing the genomic regions implicated in these traits by GWAS and QTL mapping, respectively. In a search for signatures of selection, we identify genomic regions that may contribute to sorghum domestication regarding plant height, flowering time and pericarp color. Comparative studies across taxa show functionally conserved ‘hotspots’ in sorghum and rice for awn presence and pericarp color that do not appear to reflect corresponding single genes but may indicate co-regulated clusters of genes. We also reveal homoeologous regions retaining similar functions for plant height and flowering time since genome duplication an estimated 70 million years ago or more in a common ancestor of cereals. In most such homoeologous QTL pairs, only one QTL interval exhibits strong selection signals in modern sorghum. Intersections among QTL, GWAS and comparative data advance knowledge of genetic determinants of inflorescence and plant height components in sorghum, and add new dimensions to comparisons between sorghum and rice.
Zhang, Dong; Kong, Wenqian; Robertson, Jon; ...
2015-12-01
Domestication has played an important role in shaping characteristics of the inflorescence and plant height in cultivated cereals. Taking advantage of meta-analysis of QTLs, phylogenetic analyses in 502 diverse sorghum accessions, GWAS in a sorghum association panel (n = 354) and comparative data, we provide insight into the genetic basis of the domestication traits in sorghum and rice. We performed genome-wide association studies (GWAS) on 6 traits related to inflorescence morphology and 6 traits related to plant height in sorghum, comparing the genomic regions implicated in these traits by GWAS and QTL mapping, respectively. In a search for signatures of selection, we identify genomic regions that may contribute to sorghum domestication regarding plant height, flowering time and pericarp color. Comparative studies across taxa show functionally conserved ‘hotspots’ in sorghum and rice for awn presence and pericarp color that do not appear to reflect corresponding single genes but may indicate co-regulated clusters of genes. We also reveal homoeologous regions retaining similar functions for plant height and flowering time since genome duplication an estimated 70 million years ago or more in a common ancestor of cereals. In most such homoeologous QTL pairs, only one QTL interval exhibits strong selection signals in modern sorghum. Intersections among QTL, GWAS and comparative data advance knowledge of genetic determinants of inflorescence and plant height components in sorghum, and add new dimensions to comparisons between sorghum and rice.
Perez-Andreu, Virginia; Roberts, Kathryn G; Xu, Heng; Smith, Colton; Zhang, Hui; Yang, Wenjian; Harvey, Richard C; Payne-Turner, Debbie; Devidas, Meenakshi; Cheng, I-Ming; Carroll, William L; Heerema, Nyla A; Carroll, Andrew J; Raetz, Elizabeth A; Gastier-Foster, Julie M; Marcucci, Guido; Bloomfield, Clara D; Mrózek, Krzysztof; Kohlschmidt, Jessica; Stock, Wendy; Kornblau, Steven M; Konopleva, Marina; Paietta, Elisabeth; Rowe, Jacob M; Luger, Selina M; Tallman, Martin S; Dean, Michael; Burchard, Esteban G; Torgerson, Dara G; Yue, Feng; Wang, Yanli; Pui, Ching-Hon; Jeha, Sima; Relling, Mary V; Evans, William E; Gerhard, Daniela S; Loh, Mignon L; Willman, Cheryl L; Hunger, Stephen P; Mullighan, Charles G; Yang, Jun J
2015-01-22
Acute lymphoblastic leukemia (ALL) in adolescents and young adults (AYA) is characterized by distinct presenting features and inferior prognosis compared with pediatric ALL. We performed a genome-wide association study (GWAS) to comprehensively identify inherited genetic variants associated with susceptibility to AYA ALL. In the discovery GWAS, we compared genotype frequency at 635,297 single nucleotide polymorphisms (SNPs) in 308 AYA ALL cases and 6,661 non-ALL controls by using a logistic regression model with genetic ancestry as a covariate. SNPs that reached P ≤ 5 × 10^-8 in GWAS were tested in an independent cohort of 162 AYA ALL cases and 5,755 non-ALL controls. We identified a single genome-wide significant susceptibility locus in GATA3: rs3824662, odds ratio (OR), 1.77 (P = 2.8 × 10^-10) and rs3781093, OR, 1.73 (P = 3.2 × 10^-9). These findings were validated in the replication cohort. The risk allele at rs3824662 was most frequent in Philadelphia chromosome (Ph)-like ALL but also conferred susceptibility to non-Ph-like ALL in AYAs. In 1,827 non-selected ALL cases, the risk allele frequency at this SNP was positively correlated with age at diagnosis (P = 6.29 × 10^-11). Our results from this first GWAS of AYA ALL susceptibility point to unique biology underlying leukemogenesis and potentially distinct disease etiology by age group.
Valluru, Ravi; Reynolds, Matthew P; Davies, William J; Sukumaran, Sivakumar
2017-04-01
The gaseous phytohormone ethylene plays an important role in spike development in wheat (Triticum aestivum). However, the genotypic variation and the genomic regions governing spike ethylene (SET) production in wheat under long-term heat stress remain unexplored. We investigated genotypic variation in the production of SET and its relationship with spike dry weight (SDW) in 130 diverse wheat elite lines and landraces under heat-stressed field conditions. We employed an Illumina iSelect 90K single nucleotide polymorphism (SNP) genotyping array to identify the genetic loci for SET and SDW through a genome-wide association study (GWAS) in a subset of the Wheat Association Mapping Initiative (WAMI) panel. The SET and SDW exhibited appreciable genotypic variation among wheat genotypes at the anthesis stage. There was a strong negative correlation between SET and SDW. The GWAS uncovered five and 32 significant SNPs for SET, and 22 and 142 significant SNPs for SDW, in glasshouse and field conditions, respectively. Some of these SNPs closely localized to the SNPs for plant height, suggesting close associations between plant height and spike-related traits. The phenotypic and genetic elucidation of SET and its relationship with SDW supports future efforts toward gene discovery and breeding wheat cultivars with reduced ethylene effects on yield under heat stress. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Establishing the role of rare coding variants in known Parkinson's disease risk loci.
Jansen, Iris E; Gibbs, J Raphael; Nalls, Mike A; Price, T Ryan; Lubbe, Steven; van Rooij, Jeroen; Uitterlinden, André G; Kraaij, Robert; Williams, Nigel M; Brice, Alexis; Hardy, John; Wood, Nicholas W; Morris, Huw R; Gasser, Thomas; Singleton, Andrew B; Heutink, Peter; Sharma, Manu
2017-11-01
Many common genetic factors have been identified to contribute to Parkinson's disease (PD) susceptibility, improving our understanding of the related underlying biological mechanisms. The involvement of rarer variants in these loci has been poorly studied. Using International Parkinson's Disease Genomics Consortium data sets, we performed a comprehensive study to determine the impact of rare variants in 23 previously published genome-wide association studies (GWAS) loci in PD. We applied Prix fixe to select the putative causal genes underneath the GWAS peaks, which was based on underlying functional similarities. The Sequence Kernel Association Test was used to analyze the joint effect of rare, common, or both types of variants on PD susceptibility. All genes were tested simultaneously as a gene set and each gene individually. We observed a moderate association of common variants, confirming the involvement of the known PD risk loci within our genetic data sets. Focusing on rare variants, we identified additional association signals for LRRK2, STBD1, and SPATA19. Our study suggests an involvement of rare variants within several putatively causal genes underneath previously identified PD GWAS peaks. Copyright © 2017 Elsevier Inc. All rights reserved.
Lee, Myoungsook; Kwon, Dae Young; Kim, Myung-Sunny; Choi, Chong Ran; Park, Mi-Young; Kim, Ae-Jung
2016-02-01
This is the first study to identify common genetic factors associated with basal metabolic rate (BMR) and body mass index (BMI) in overweight and obese Korean women. It is intended as a basis for future research on interactions between obesity genes and BMR. The experimental design was 2 by 2, with BMR and BMI as the variables. A genome-wide association study (GWAS) of single nucleotide polymorphisms (SNPs) compared overweight and obese women (BMI > 23 kg/m^2) with normal-weight women, and women with low BMR (< 1426.3 kcal/day) with women with high BMR. A total of 140 SNPs reached formal genome-wide statistical significance in this study (P < 1 × 10^-4). Energy intake was estimated using the 24-h recall method for three days, and questionnaires on family history, a medical examination, and physical activities were conducted. We found that two NRG3 gene SNPs in the 10q23.1 chromosomal region were highly associated with BMR (rs10786764; P = 8.0 × 10^-7, rs1040675; 2.3 × 10^-6) and BMI (rs10786764; P = 2.5 × 10^-5, rs10786764; 6.57 × 10^-5). The other genes related to BMI (HSD52, TMA16, MARCH1, NRG1, NRXN3, and STK4) yielded P < 10 × 10^-4. Five new loci associated with BMR and BMI, including NRG3, OR8U8, BCL2L2-PABPN1, PABPN1, and SLC22A17, were identified in obese Korean women (P < 1 × 10^-4). In the questionnaire investigation, significant differences were found in the number of starvation periods per week, family history of stomach cancer, coffee intake, and trial of weight control in each group. We discovered several common BMR- and BMI-related genes using GWAS. Although most of these newly established loci were not previously associated with obesity, they may provide new insights into body weight regulation. Our findings of five common genes associated with BMR and BMI in Koreans will serve as a reference for replication and validation of future studies on the metabolic rate.
He, Liang; Zhbannikov, Ilya; Arbeev, Konstantin G; Yashin, Anatoliy I; Kulminski, Alexander M
2017-11-01
Unraveling the underlying biological mechanisms or pathways behind the effects of genetic variations on complex diseases remains one of the major challenges in the post-GWAS (where GWAS is genome-wide association study) era. To further explore the relationship between genetic variations, biomarkers, and diseases for elucidating underlying pathological mechanism, a huge effort has been placed on examining pleiotropic and gene-environmental interaction effects. We propose a novel genetic stochastic process model (GSPM) that can be applied to GWAS and jointly investigate the genetic effects on longitudinally measured biomarkers and risks of diseases. This model is characterized by more profound biological interpretation and takes into account the dynamics of biomarkers during follow-up when investigating the hazards of a disease. We illustrate the rationale and evaluate the performance of the proposed model through two GWAS. One is to detect single nucleotide polymorphisms (SNPs) having interaction effects on type 2 diabetes (T2D) with body mass index (BMI) and the other is to detect SNPs affecting the optimal BMI level for protecting from T2D. We identified multiple SNPs that showed interaction effects with BMI on T2D, including a novel SNP rs11757677 in the CDKAL1 gene (P = 5.77 × 10^-7). We also found a SNP rs1551133 located on 2q14.2 that reversed the effect of BMI on T2D (P = 6.70 × 10^-7). In conclusion, the proposed GSPM provides a promising and useful tool in GWAS of longitudinal data for interrogating pleiotropic and interaction effects to gain more insights into the relationship between genes, quantitative biomarkers, and risks of complex diseases. © 2017 WILEY PERIODICALS, INC.
Markunas, Christina A; Johnson, Eric O; Hancock, Dana B
2017-07-01
Genome-wide association study (GWAS)-identified variants are enriched for functional elements. However, we have limited knowledge of how functional enrichment may differ by disease/trait and tissue type. We tested a broad set of eight functional elements for enrichment among GWAS-identified SNPs (p < 5×10^-8) from the NHGRI-EBI Catalog across seven disease/trait categories: cancer, cardiovascular disease, diabetes, autoimmune disease, psychiatric disease, neurological disease, and anthropometric traits. SNPs were annotated using HaploReg for the eight functional elements across any tissue: DNase sites, expression quantitative trait loci (eQTL), sequence conservation, enhancers, promoters, missense variants, sequence motifs, and protein binding sites. In addition, tissue-specific annotations were considered for brain vs. blood. Disease/trait SNPs were compared to a control set of 4809 SNPs matched to the GWAS SNPs (N = 1639) on allele frequency, gene density, distance to nearest gene, and linkage disequilibrium at ~3:1 ratio. Enrichment analyses were conducted using logistic regression, with Bonferroni correction. Overall, a significant enrichment was observed for all functional elements, except sequence motifs. Missense SNPs showed the strongest magnitude of enrichment. eQTLs were the only functional element significantly enriched across all diseases/traits. Magnitudes of enrichment were generally similar across diseases/traits, where enrichment was statistically significant. Blood vs. brain tissue effects on enrichment were dependent on disease/trait and functional element (e.g., cardiovascular disease: eQTLs P_TissueDifference = 1.28 × 10^-6 vs. enhancers P_TissueDifference = 0.94). Identifying disease/trait-relevant functional elements and tissue types could provide new insight into the underlying biology, by guiding a priori GWAS analyses (e.g., brain enhancer elements for psychiatric disease) or facilitating post hoc interpretation.
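The enrichment testing described above (logistic regression with Bonferroni correction) can be sketched in simplified form. The example assumes a single binary annotation with no additional covariates. In that case the odds ratio from a logistic regression of GWAS-versus-control status on the annotation equals the cross-product ratio of the 2×2 table. The counts, P values, and function names are hypothetical and do not come from the study.

```python
def odds_ratio(gwas_with, gwas_without, control_with, control_without):
    """Cross-product odds ratio for carrying an annotation (GWAS vs. control SNPs)."""
    return (gwas_with * control_without) / (gwas_without * control_with)

def bonferroni_significant(p_values, alpha=0.05):
    """Flag each test as significant at the Bonferroni threshold alpha / n_tests."""
    threshold = alpha / len(p_values)
    return [p <= threshold for p in p_values]

if __name__ == "__main__":
    # Hypothetical counts: 30 of 100 GWAS SNPs vs. 10 of 100 control SNPs annotated
    print(odds_ratio(30, 70, 10, 90))
    # Hypothetical P values for three functional elements
    print(bonferroni_significant([0.001, 0.02, 0.04]))
```

An odds ratio above 1 indicates that the annotation is more common among GWAS-identified SNPs than among matched controls. The Bonferroni step divides the significance level by the number of tests performed.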
Network-Guided GWAS Improves Identification of Genes Affecting Free Amino Acids.
Angelovici, Ruthie; Batushansky, Albert; Deason, Nicholas; Gonzalez-Jorge, Sabrina; Gore, Michael A; Fait, Aaron; DellaPenna, Dean
2017-01-01
Amino acids are essential for proper growth and development in plants. Amino acids serve as building blocks for proteins but also are important for responses to stress and the biosynthesis of numerous essential compounds. In seed, the pool of free amino acids (FAAs) also contributes to alternative energy, desiccation, and seed vigor; thus, manipulating FAA levels can significantly impact a seed's nutritional qualities. While genome-wide association studies (GWAS) on branched-chain amino acids have identified some regulatory genes controlling seed FAAs, the genetic regulation of FAA levels, composition, and homeostasis in seeds remains mostly unresolved. Hence, we performed GWAS on 18 FAAs from a 313-ecotype Arabidopsis (Arabidopsis thaliana) association panel. Specifically, GWAS was performed on 98 traits derived from known amino acid metabolic pathways (approach 1) and then on 92 traits generated from an unbiased correlation-based metabolic network analysis (approach 2), and the results were compared. The latter approach facilitated the discovery of additional novel metabolic interactions and single-nucleotide polymorphism-trait associations not identified by the former approach. The most prominent network-guided GWAS signal was for a histidine (His)-related trait in a region containing two genes: a cationic amino acid transporter (CAT4) and a polynucleotide phosphorylase resistant to inhibition with fosmidomycin. A reverse genetics approach confirmed CAT4 to be responsible for the natural variation of His-related traits across the association panel. Given that His is a semiessential amino acid and a potent metal chelator, CAT4 orthologs could be considered as candidate genes for seed quality biofortification in crop plants. © 2017 American Society of Plant Biologists. All Rights Reserved.
Gupta, Aditi; Juyal, Garima; Sood, Ajit; Midha, Vandana; Yamazaki, Keiko; Vich Vila, Arnau; Esaki, Motohiro; Matsui, Toshiyuki; Takahashi, Atsushi; Kubo, Michiaki; Weersma, Rinse K; Thelma, B K
2017-01-01
The first ever genome-wide association study (GWAS) of ulcerative colitis in the genetically distinct north Indian population identified two novel genes, namely CFB and SLC44A4. Considering their biological relevance, we investigated allelic/genetic heterogeneity in these genes among ulcerative colitis cohorts of north Indian, Japanese and Dutch origin using high-density ImmunoChip case–control genotype data. Comparative linkage disequilibrium profiling and test of association were performed. Of the 28 CFB SNPs, similar strength of association was observed for rs4151657 (novel ulcerative colitis GWAS SNP) in north Indians (P=1.73 × 10^-10) and Japanese (P=2.02 × 10^-12) but not in the Dutch. Further, a three-marker haplotype was shared between north Indians and Japanese (P<10^-8), but a different five-marker haplotype was associated (P=2.07 × 10^-6) in the Dutch. Of the 22 SLC44A4 SNPs, rs2736428 (novel ulcerative colitis GWAS SNP) was found significantly associated in north Indians (P=4.94 × 10^-10) and Japanese (P=3.37 × 10^-9), but not among the Dutch. These results suggest (i) apparent allelic heterogeneity in CFB and genetic heterogeneity in SLC44A4 across different ethnic groups; (ii) shared ulcerative colitis genetic etiological factors among Asians; and finally (iii) that re-exploration of GWAS findings together with high-density genotyping/sequencing and trans-ethnic fine mapping approaches may help identify shared and population-specific risk variants and help explain missing disease heritability. PMID:27759029
Gala, Manish; Abecasis, Goncalo; Bezieau, Stephane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Carlson, Christopher S.; Casey, Graham; Chang-Claude, Jenny; Conti, David V.; Curtis, Keith R.; Duggan, David; Gallinger, Steven; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Jenkins, Mark A.; Küry, Sébastien; Le Marchand, Loic; Leal, Suzanne M.; Newcomb, Polly A.; Nickerson, Deborah A.; Potter, John D.; Schoen, Robert E.; Schumacher, Fredrick R.; Seminara, Daniela; Slattery, Martha L.; Hsu, Li; Chan, Andrew T.; White, Emily; Berndt, Sonja I.; Peters, Ulrike
2016-01-01
Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) associated with colorectal cancer risk. These SNPs may tag correlated variants with biological importance. Fine-mapping around GWAS loci can facilitate detection of functional candidates and additional independent risk variants. We analyzed 11,900 cases and 14,311 controls in the Genetics and Epidemiology of Colorectal Cancer Consortium and the Colon Cancer Family Registry. To fine-map genomic regions containing all known common risk variants, we imputed high-density genetic data from the 1000 Genomes Project. We tested single-variant associations with colorectal tumor risk for all variants spanning genomic regions 250-kb upstream or downstream of 31 GWAS-identified SNPs (index SNPs). We queried the University of California, Santa Cruz Genome Browser to examine evidence for biological function. Index SNPs did not show the strongest association signals with colorectal tumor risk in their respective genomic regions. Bioinformatics analysis of SNPs showing smaller P-values in each region revealed 21 functional candidates in 12 loci (5q31.1, 8q24, 11q13.4, 11q23, 12p13.32, 12q24.21, 14q22.2, 15q13, 18q21, 19q13.1, 20p12.3, and 20q13.33). We did not observe evidence of additional independent association signals in GWAS-identified regions. Our results support the utility of integrating data from comprehensive fine-mapping with expanding publicly available genomic databases to help clarify GWAS associations and identify functional candidates that warrant more onerous laboratory follow-up. Such efforts may aid the eventual discovery of disease-causing variant(s). PMID:27379672
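The regional scan described above tests all variants within 250 kb upstream or downstream of each index SNP and asks whether any of them shows a smaller P value than the index SNP. A minimal sketch of that windowing step follows. It assumes all variants lie on one chromosome and are given as simple dictionaries. The variant identifiers, positions, and P values are hypothetical.

```python
def best_in_window(index_pos, variants, window_bp=250_000):
    """Return the variant with the smallest P within +/- window_bp of index_pos.

    variants: iterable of dicts with keys "id", "pos" (base pairs), and "p".
    Returns None when no variant falls inside the window.
    """
    nearby = [v for v in variants if abs(v["pos"] - index_pos) <= window_bp]
    return min(nearby, key=lambda v: v["p"]) if nearby else None

if __name__ == "__main__":
    variants = [
        {"id": "index_snp", "pos": 1_000_000, "p": 1e-6},
        {"id": "var_a", "pos": 1_120_000, "p": 3e-8},   # inside the window
        {"id": "var_b", "pos": 1_400_000, "p": 1e-12},  # 400 kb away, excluded
    ]
    top = best_in_window(1_000_000, variants)
    print(top["id"])
```

A regional top variant that differs from the index SNP, as in this toy input, corresponds to the abstract's observation that index SNPs did not show the strongest signals in their regions.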
Windle, Michael; Mrug, Sylvie
2015-01-01
Research in molecular genetics has generally focused on genome-wide association studies (GWAS) and exploratory candidate gene and candidate gene-environment (G × E) studies. In this article it is proposed that hypothesis-driven and biologically informed research provides a complementary approach to GWAS to advance pressing research questions about G × E relations that are of public health relevance. Prior research studies and developmental and evolutionary theory were used to guide hypothesis testing of G × E relationships in this study. The study investigated whether the oxytocin polymorphism, rs53576, moderated the relationship between parental divorce during adolescence and depression symptoms in young adulthood. Oxytocin is a neuropeptide that has been related to the regulation of complex social cognition and behaviors such as empathy, attachment, and nurturance. We hypothesized that the GG polymorphism would be associated with more depressive symptoms following parental divorce, and that this effect would be stronger in females than males. The sample consisted of 340 individuals who participated in a longitudinal study with data used both from adolescence and young adulthood. Findings using prospective follow-up and autoregressive change models supported the hypothesized relationships. Young adult females who had experienced parental divorce during adolescence and had the GG oxytocin genotype reported almost twice as many depressive symptoms relative to young adult females who also experienced parental divorce during adolescence but had the AA or AG genotype. This pattern was not indicated among males. Findings were discussed with regard to how molecular genetic factors in combination with environmental stressors, such as parental divorce, framed within a developmental framework may facilitate the future study of G × E relationships in the parental divorce-child adjustment literature and contribute to a prevention science perspective.
Windle, Michael; Mrug, Sylvie
2015-01-01
Research in molecular genetics has generally focused on genome-wide association studies (GWAS) and exploratory candidate gene and candidate gene–environment (G × E) studies. In this article it is proposed that hypothesis-driven and biologically informed research provides a complementary approach to GWAS to advance pressing research questions about G × E relations that are of public health relevance. Prior research studies and developmental and evolutionary theory were used to guide hypothesis testing of G × E relationships in this study. The study investigated whether the oxytocin polymorphism, rs53576, moderated the relationship between parental divorce during adolescence and depression symptoms in young adulthood. Oxytocin is a neuropeptide that has been related to the regulation of complex social cognition and behaviors such as empathy, attachment, and nurturance. We hypothesized that the GG polymorphism would be associated with more depressive symptoms following parental divorce, and that this effect would be stronger in females than males. The sample consisted of 340 individuals who participated in a longitudinal study with data used both from adolescence and young adulthood. Findings using prospective follow-up and autoregressive change models supported the hypothesized relationships. Young adult females who had experienced parental divorce during adolescence and had the GG oxytocin genotype reported almost twice as many depressive symptoms relative to young adult females who also experienced parental divorce during adolescence but had the AA or AG genotype. This pattern was not indicated among males. Findings were discussed with regard to how molecular genetic factors in combination with environmental stressors, such as parental divorce, framed within a developmental framework may facilitate the future study of G × E relationships in the parental divorce-child adjustment literature and contribute to a prevention science perspective. PMID:26441708
Postmus, Iris; Trompet, Stella; Deshmukh, Harshal A.; Barnes, Michael R.; Li, Xiaohui; Warren, Helen R.; Chasman, Daniel I.; Zhou, Kaixin; Arsenault, Benoit J.; Donnelly, Louise A.; Wiggins, Kerri L.; Avery, Christy L.; Griffin, Paula; Feng, QiPing; Taylor, Kent D.; Li, Guo; Evans, Daniel S.; Smith, Albert V.; de Keyser, Catherine E.; Johnson, Andrew D.; de Craen, Anton J. M.; Stott, David J.; Buckley, Brendan M.; Ford, Ian; Westendorp, Rudi G. J.; Eline Slagboom, P.; Sattar, Naveed; Munroe, Patricia B.; Sever, Peter; Poulter, Neil; Stanton, Alice; Shields, Denis C.; O’Brien, Eoin; Shaw-Hawkins, Sue; Ida Chen, Y.-D.; Nickerson, Deborah A.; Smith, Joshua D.; Pierre Dubé, Marie; Matthijs Boekholdt, S.; Kees Hovingh, G.; Kastelein, John J. P.; McKeigue, Paul M.; Betteridge, John; Neil, Andrew; Durrington, Paul N.; Doney, Alex; Carr, Fiona; Morris, Andrew; McCarthy, Mark I.; Groop, Leif; Ahlqvist, Emma; Bis, Joshua C.; Rice, Kenneth; Smith, Nicholas L.; Lumley, Thomas; Whitsel, Eric A.; Stürmer, Til; Boerwinkle, Eric; Ngwa, Julius S.; O’Donnell, Christopher J.; Vasan, Ramachandran S.; Wei, Wei-Qi; Wilke, Russell A.; Liu, Ching-Ti; Sun, Fangui; Guo, Xiuqing; Heckbert, Susan R; Post, Wendy; Sotoodehnia, Nona; Arnold, Alice M.; Stafford, Jeanette M.; Ding, Jingzhong; Herrington, David M.; Kritchevsky, Stephen B.; Eiriksdottir, Gudny; Launer, Leonore J.; Harris, Tamara B.; Chu, Audrey Y.; Giulianini, Franco; MacFadyen, Jean G.; Barratt, Bryan J.; Nyberg, Fredrik; Stricker, Bruno H.; Uitterlinden, André G.; Hofman, Albert; Rivadeneira, Fernando; Emilsson, Valur; Franco, Oscar H.; Ridker, Paul M.; Gudnason, Vilmundur; Liu, Yongmei; Denny, Joshua C.; Ballantyne, Christie M.; Rotter, Jerome I.; Adrienne Cupples, L.; Psaty, Bruce M.; Palmer, Colin N. 
A.; Tardif, Jean-Claude; Colhoun, Helen M.; Hitman, Graham; Krauss, Ronald M.; Wouter Jukema, J; Caulfield, Mark J.; Donnelly, Peter; Barroso, Ines; Blackwell, Jenefer M.; Bramon, Elvira; Brown, Matthew A.; Casas, Juan P.; Corvin, Aiden; Deloukas, Panos; Duncanson, Audrey; Jankowski, Janusz; Markus, Hugh S.; Mathew, Christopher G.; Palmer, Colin N. A.; Plomin, Robert; Rautanen, Anna; Sawcer, Stephen J.; Trembath, Richard C.; Viswanathan, Ananth C.; Wood, Nicholas W.; Spencer, Chris C. A.; Band, Gavin; Bellenguez, Céline; Freeman, Colin; Hellenthal, Garrett; Giannoulatou, Eleni; Pirinen, Matti; Pearson, Richard; Strange, Amy; Su, Zhan; Vukcevic, Damjan; Donnelly, Peter; Langford, Cordelia; Hunt, Sarah E.; Edkins, Sarah; Gwilliam, Rhian; Blackburn, Hannah; Bumpstead, Suzannah J.; Dronov, Serge; Gillman, Matthew; Gray, Emma; Hammond, Naomi; Jayakumar, Alagurevathi; McCann, Owen T.; Liddle, Jennifer; Potter, Simon C.; Ravindrarajah, Radhi; Ricketts, Michelle; Waller, Matthew; Weston, Paul; Widaa, Sara; Whittaker, Pamela; Barroso, Ines; Deloukas, Panos; Mathew, Christopher G.; Blackwell, Jenefer M.; Brown, Matthew A.; Corvin, Aiden; McCarthy, Mark I.; Spencer, Chris C. A.
2014-01-01
Statins effectively lower LDL cholesterol levels in large studies and the observed interindividual response variability may be partially explained by genetic variation. Here we perform a pharmacogenetic meta-analysis of genome-wide association studies (GWAS) in studies addressing the LDL cholesterol response to statins, including up to 18,596 statin-treated subjects. We validate the most promising signals in a further 22,318 statin recipients and identify two loci, SORT1/CELSR2/PSRC1 and SLCO1B1, not previously identified in GWAS. Moreover, we confirm the previously described associations with APOE and LPA. Our findings advance the understanding of the pharmacogenetic architecture of statin response. PMID:25350695
Pasaniuc, Bogdan; Zaitlen, Noah; Lettre, Guillaume; Chen, Gary K; Tandon, Arti; Kao, W H Linda; Ruczinski, Ingo; Fornage, Myriam; Siscovick, David S; Zhu, Xiaofeng; Larkin, Emma; Lange, Leslie A; Cupples, L Adrienne; Yang, Qiong; Akylbekova, Ermeg L; Musani, Solomon K; Divers, Jasmin; Mychaleckyj, Joe; Li, Mingyao; Papanicolaou, George J; Millikan, Robert C; Ambrosone, Christine B; John, Esther M; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J; Ziegler, Regina G; Nyante, Sarah J; Bandera, Elisa V; Ingles, Sue A; Press, Michael F; Chanock, Stephen J; Deming, Sandra L; Rodriguez-Gil, Jorge L; Palmer, Cameron D; Buxbaum, Sarah; Ekunwe, Lynette; Hirschhorn, Joel N; Henderson, Brian E; Myers, Simon; Haiman, Christopher A; Reich, David; Patterson, Nick; Wilson, James G; Price, Alkes L
2011-04-01
While genome-wide association studies (GWAS) have primarily examined populations of European ancestry, more recent studies often involve additional populations, including admixed populations such as African Americans and Latinos. In admixed populations, linkage disequilibrium (LD) exists both at a fine scale in ancestral populations and at a coarse scale (admixture-LD) due to chromosomal segments of distinct ancestry. Disease association statistics in admixed populations have previously considered SNP association (LD mapping) or admixture association (mapping by admixture-LD), but not both. Here, we introduce a new statistical framework for combining SNP and admixture association in case-control studies, as well as methods for local ancestry-aware imputation. We illustrate the gain in statistical power achieved by these methods by analyzing data of 6,209 unrelated African Americans from the CARe project genotyped on the Affymetrix 6.0 chip, in conjunction with both simulated and real phenotypes, as well as by analyzing the FGFR2 locus using breast cancer GWAS data from 5,761 African-American women. We show that, at typed SNPs, our method yields an 8% increase in statistical power for finding disease risk loci compared to the power achieved by standard methods in case-control studies. At imputed SNPs, we observe an 11% increase in statistical power for mapping disease loci when our local ancestry-aware imputation framework and the new scoring statistic are jointly employed. Finally, we show that our method increases statistical power in regions harboring the causal SNP in the case when the causal SNP is untyped and cannot be imputed. Our methods and our publicly available software are broadly applicable to GWAS in admixed populations.
Chen, Zhuo; Tao, Sha; Gao, Yong; Zhang, Ju; Hu, Yanling; Mo, Linjian; Kim, Seong-Tae; Yang, Xiaobo; Tan, Aihua; Zhang, Haiying; Qin, Xue; Li, Li; Wu, Yongming; Zhang, Shijun; Zheng, S Lilly; Xu, Jianfeng; Mo, Zengnan; Sun, Jielin
2013-12-01
Sex hormones and gonadotropins exert a wide variety of effects in physiological and pathological processes. Accumulated evidence shows a strong heritable component of circulating concentrations of these hormones. Recently, several genome-wide association studies (GWASs) conducted in Caucasians have identified multiple loci that influence serum levels of sex hormones. However, the genetic determinants remain unknown in Chinese populations. In this study, we aimed to identify genetic variants associated with major sex hormones and gonadotropins, including testosterone, oestradiol, follicle-stimulating hormone (FSH), luteinising hormone (LH) and sex hormone binding globulin (SHBG), in a Chinese population. A two-stage GWAS was conducted in a total of 3495 healthy Chinese men (1999 subjects in the GWAS discovery stage and 1496 in the confirmation stage). We identified a novel genetic region at 15q21.2 (rs2414095 in CYP19A1), which was associated with oestradiol and FSH in the Chinese population at a genome-wide significant level (p=6.54×10⁻³¹ and 1.59×10⁻¹⁶, respectively). Another single nucleotide polymorphism in the CYP19A1 gene was significantly associated with oestradiol level (rs2445762, p=7.75×10⁻²⁸). In addition, we confirmed the previous GWAS-identified locus at 17p13.1 for testosterone (rs2075230, p=1.13×10⁻⁸) and SHBG level (rs2075230, p=4.75×10⁻¹⁹) in the Chinese population. This study is the first GWAS investigation of genetic determinants of FSH and LH. The identification of novel susceptibility loci may provide more biological implications for the synthesis and metabolism of these hormones. More importantly, the confirmation of the genetic loci for testosterone and SHBG suggests common genetic components shared among different ethnicities.
Alqudah, Ahmad M.; Sharma, Rajiv; Pasam, Raj K.; Graner, Andreas; Kilian, Benjamin; Schnurbusch, Thorsten
2014-01-01
Heading time is a complex trait, and natural variation in photoperiod responses is a major factor controlling time to heading, adaptation and grain yield. In barley, previous heading time studies have been mainly conducted under field conditions to measure total days to heading. We followed a novel approach and studied the natural variation of time to heading in a world-wide spring barley collection (218 accessions), comprising 95 photoperiod-sensitive accessions (Ppd-H1) and 123 accessions with reduced photoperiod sensitivity (ppd-H1) to long-day (LD) conditions, by dissecting pre-anthesis development into four major stages and sub-phases. The study was conducted under greenhouse (GH) conditions (LD; 16/8 h; ∼20/∼16°C day/night). Genotyping was performed using a genome-wide high-density 9K single nucleotide polymorphism (SNP) chip which assayed 7842 SNPs. We used the barley physical map to identify candidate genes underlying genome-wide association scans (GWAS). GWAS for pre-anthesis stages/sub-phases in each photoperiod group provided great power for partitioning genetic effects on floral initiation and heading time. In addition to major genes known to regulate heading time under field conditions, several novel QTL with medium to high effects, including new QTL having major effects on developmental stages/sub-phases, were found to be associated in this study. For example, highly associated SNPs tagged the physical regions around HvCO1 (barley CONSTANS1) and BFL (BARLEY FLORICAULA/LEAFY) genes. Based upon our GWAS analysis, we propose a new genetic network model for each photoperiod group, which includes several newly identified genes, such as several HvCO-like genes, belonging to different heading time pathways in barley. PMID:25420105
Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.
Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao
2016-11-30
Genome-wide association studies (GWAS) have been successful in identifying obesity risk genes through single-variant association analysis. In this study, we designed a stepwise analysis strategy to identify multi-variant effects on obesity risk among candidate genes. Our analyses focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. Single-variant association tests showed that no SNP reached significance after multiple-testing adjustment. The subsequent gene-gene interaction analysis, which focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of genetic risk scores showed that 4 multi-variant associations had significant additive effects of risk SNPs; and haplotype association tests showed that all multi-variant associations contained one or several combinations of particular alleles or haplotypes associated with increased obesity risk. Our study showed that obesity risk genes generate multi-variant effects, which can be additive or non-linear interactions, and that multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes.
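The multiple-testing adjustment mentioned in the abstract above can be illustrated with a minimal sketch. It assumes a Bonferroni correction at a family-wise alpha of 0.05; the abstract does not name the adjustment method actually used, and the function name, SNP labels and p-values below are hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# 657 is the SNP count given in the abstract; alpha = 0.05 is an assumption.
threshold = bonferroni_threshold(0.05, 657)

# Hypothetical unadjusted p-values, for illustration only (not study data).
p_values = {"snp_a": 0.0004, "snp_b": 0.03, "snp_c": 0.00005}

# A SNP is declared significant only if its p-value is below the threshold.
significant = sorted(name for name, p in p_values.items() if p < threshold)
```

Under these assumptions the per-SNP threshold is about 7.6×10⁻⁵, so a SNP with an unadjusted p-value of 0.0004 would pass a nominal 0.05 cut-off yet fail the adjusted one.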
Multi-Trait GWAS and New Candidate Genes Annotation for Growth Curve Parameters in Brahman Cattle
Crispim, Aline Camporez; Kelly, Matthew John; Guimarães, Simone Eliza Facioni; e Silva, Fabyano Fonseca; Fortes, Marina Rufino Salinas; Wenceslau, Raphael Rocha; Moore, Stephen
2015-01-01
Understanding the genetic architecture of beef cattle growth cannot be limited simply to the genome-wide association study (GWAS) for body weight at any specific ages, but should be extended to a more general purpose by considering the whole growth trajectory over time using a growth curve approach. For such an approach, the parameters that are used to describe growth curves were treated as phenotypes under a GWAS model. Data from 1,255 Brahman cattle that were weighed at birth, 6, 12, 15, 18, and 24 months of age were analyzed. Parameter estimates, such as mature weight (A) and maturity rate (K) from nonlinear models are utilized as substitutes for the original body weights for the GWAS analysis. We chose the best nonlinear model to describe the weight-age data, and the estimated parameters were used as phenotypes in a multi-trait GWAS. Our aims were to identify and characterize associated SNP markers to indicate SNP-derived candidate genes and annotate their function as related to growth processes in beef cattle. The Brody model presented the best goodness of fit, and the heritability values for the parameter estimates for mature weight (A) and maturity rate (K) were 0.23 and 0.32, respectively, proving that these traits can be a feasible alternative when the objective is to change the shape of growth curves within genetic improvement programs. The genetic correlation between A and K was -0.84, indicating that animals with lower mature body weights reached that weight at younger ages. One hundred and sixty seven (167) and two hundred and sixty two (262) significant SNPs were associated with A and K, respectively. The annotated genes closest to the most significant SNPs for A had direct biological functions related to muscle development (RAB28), myogenic induction (BTG1), fetal growth (IL2), and body weights (APEX2); K genes were functionally associated with body weight, body height, average daily gain (TMEM18), and skeletal muscle development (SMN1). 
Candidate genes emerging from this GWAS may inform the search for causative mutations that could underpin genomic breeding for improved growth rates. PMID:26445451
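As a rough illustration of the growth-curve parameters used as phenotypes above, the sketch below evaluates the Brody function in its common textbook form, W(t) = A(1 − B·e^(−Kt)), at the weighing ages listed in the abstract. The parameter values are hypothetical, the integration constant B is not reported in the abstract, and the authors' exact parameterization may differ.

```python
import math


def brody_weight(age_months: float, A: float, B: float, K: float) -> float:
    """Brody growth curve: A is mature weight, K is maturity rate,
    B is an integration constant related to birth weight."""
    return A * (1.0 - B * math.exp(-K * age_months))


# Weighing ages from the abstract (birth, 6, 12, 15, 18 and 24 months).
ages = [0, 6, 12, 15, 18, 24]

# Hypothetical parameter values for one animal (not estimates from the study).
A, B, K = 500.0, 0.93, 0.08

predicted = [brody_weight(t, A, B, K) for t in ages]
```

In this form the curve rises monotonically toward A, and a larger K means the animal approaches its mature weight at a younger age, which is how A and K can serve as two summary phenotypes for a whole weight trajectory.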
Moffitt, Terrie E; Baker, Timothy B; Biddle, Andrea K; Evans, James P; Harrington, HonaLee; Houts, Renate; Meier, Madeline; Sugden, Karen; Williams, Benjamin; Poulton, Richie; Caspi, Avshalom
2013-01-01
OBJECTIVE: To test how genomic loci identified in genome-wide association studies (GWAS) influence the developmental progression of smoking behavior. DESIGN: A 38-year prospective longitudinal study of a representative birth cohort. SETTING: The Dunedin Multidisciplinary Health and Development Study, New Zealand. PARTICIPANTS: N=1037 male and female study members. MAIN EXPOSURES: We assessed genetic risk with a multi-locus genetic risk score (GRS). The GRS was composed of single-nucleotide polymorphisms identified in three meta-analyses of GWAS of smoking quantity phenotypes. OUTCOME MEASURES: Smoking initiation, conversion to daily smoking, progression to heavy smoking, nicotine dependence (Fagerstrom Test of Nicotine Dependence), and cessation difficulties were evaluated at eight assessments spanning ages 11-38 years. RESULTS: Genetic risk score was unrelated to smoking initiation. However, individuals at higher genetic risk were more likely to convert to daily smoking as teenagers, progressed more rapidly from smoking initiation to heavy smoking, persisted longer in smoking heavily, developed nicotine dependence more frequently, were more reliant on smoking to cope with stress, and were more likely to fail in their cessation attempts. Further analysis revealed that two adolescent developmental phenotypes (early conversion to daily smoking and rapid progression to heavy smoking) mediated associations between the genetic risk score and mature phenotypes of persistent heavy smoking, nicotine dependence, and cessation failure. The genetic risk score predicted smoking risk over and above family history. CONCLUSIONS: Initiatives that disrupt the developmental progression of smoking behavior among adolescents may mitigate genetic risks for developing adult smoking problems. Future genetic research may maximize discovery potential by focusing on smoking behavior soon after smoking initiation and by studying young smokers. PMID:23536134
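A multi-locus genetic risk score of the kind described above is commonly computed as a sum of risk-allele counts across SNPs, optionally weighted by per-SNP effect sizes. The following is a minimal sketch under that assumption; the abstract does not state whether the score was weighted, and the SNP labels, genotypes and weights are hypothetical.

```python
from typing import Dict, Optional


def genetic_risk_score(
    allele_counts: Dict[str, int],
    weights: Optional[Dict[str, float]] = None,
) -> float:
    """Sum risk-allele counts (0, 1 or 2 per SNP), optionally weighted.

    With weights=None every SNP contributes equally (unweighted score).
    """
    score = 0.0
    for snp, count in allele_counts.items():
        if count not in (0, 1, 2):
            raise ValueError(f"allele count for {snp} must be 0, 1 or 2")
        weight = 1.0 if weights is None else weights[snp]
        score += weight * count
    return score


# Hypothetical genotype for one person: risk-allele count per SNP.
person = {"snp_1": 2, "snp_2": 0, "snp_3": 1}

# Hypothetical per-allele effect sizes (for example, from a meta-analysis).
effect_sizes = {"snp_1": 0.10, "snp_2": 0.25, "snp_3": 0.05}

unweighted = genetic_risk_score(person)
weighted = genetic_risk_score(person, effect_sizes)
```

The unweighted version treats every risk allele alike, while the weighted version lets SNPs with larger reported effects count for more; which of the two a given study uses is a design choice that has to be checked in its methods.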
The influence of genetic variation on late toxicities in childhood cancer survivors: A review.
Clemens, E; van der Kooi, A L F; Broer, L; van Dulmen-den Broeder, E; Visscher, H; Kremer, L; Tissing, W; Loonen, J; Ronckers, C M; Pluijm, S M F; Neggers, S J C M M; Zolk, O; Langer, T; Zehnhoff-Dinnesen, A Am; Wilson, C L; Hudson, M M; Carleton, B; Laven, J S E; Uitterlinden, A G; van den Heuvel-Eibrink, M M
2018-06-01
The variability in late toxicities among childhood cancer survivors (CCS) is only partially explained by treatment and baseline patient characteristics. Inter-individual variability in the association between treatment exposure and risk of late toxicity suggests that genetic variation possibly modifies this association. We reviewed the available literature on genetic susceptibility of late toxicity after childhood cancer treatment related to components of metabolic syndrome, bone mineral density, gonadal impairment and hearing impairment. A systematic literature search was performed, using Embase, Cochrane Library, Google Scholar, MEDLINE, and Web of Science databases. Eligible publications included all English language reports of candidate gene studies and genome wide association studies (GWAS) that aimed to identify genetic risk factors associated with the four late toxicities, defined as toxicity present after end of treatment. Twenty-seven articles were identified, including 26 candidate gene studies: metabolic syndrome (n = 6); BMD (n = 6); gonadal impairment (n = 2); hearing impairment (n = 12) and one GWAS (metabolic syndrome). Eighty percent of the genetic studies on late toxicity after childhood cancer had relatively small sample sizes (n < 200), leading to insufficient power, and lacked adjustment for multiple comparisons. Only four (4/26 = 15%) candidate gene studies had their findings validated in independent replication cohorts as part of their own report. Genetic susceptibility associations are not consistent or not replicated and therefore, currently no evidence-based recommendations can be made for hearing impairment, gonadal impairment, bone mineral density impairment and metabolic syndrome in CCS. To advance knowledge related to genetic variation influencing late toxicities among CCS, future studies need adequate power, independent cohorts for replication, harmonization of disease outcomes and sample collections, and (international) collaboration. 
Epidemiology, genetics, and subtyping of preserved ratio impaired spirometry (PRISm) in COPDGene.
Wan, Emily S; Castaldi, Peter J; Cho, Michael H; Hokanson, John E; Regan, Elizabeth A; Make, Barry J; Beaty, Terri H; Han, MeiLan K; Curtis, Jeffrey L; Curran-Everett, Douglas; Lynch, David A; DeMeo, Dawn L; Crapo, James D; Silverman, Edwin K
2014-08-06
Preserved Ratio Impaired Spirometry (PRISm), defined as a reduced FEV1 in the setting of a preserved FEV1/FVC ratio, is highly prevalent and is associated with increased respiratory symptoms, systemic inflammation, and mortality. Studies investigating quantitative chest tomographic features, genetic associations, and subtypes in PRISm subjects have not been reported. Data from current and former smokers enrolled in COPDGene (n = 10,192), an observational, cross-sectional study which recruited subjects aged 45-80 with ≥10 pack years of smoking, were analyzed. To identify epidemiological and radiographic predictors of PRISm, we performed univariate and multivariate analyses comparing PRISm subjects both to control subjects with normal spirometry and to subjects with COPD. To investigate common genetic predictors of PRISm, we performed a genome-wide association study (GWAS). To explore potential subgroups within PRISm, we performed unsupervised k-means clustering. The prevalence of PRISm in COPDGene is 12.3%. Increased dyspnea, reduced 6-minute walk distance, increased percent emphysema and decreased total lung capacity, as well as increased segmental bronchial wall area percentage were significant predictors (p-value <0.05) of PRISm status when compared to control subjects in multivariate models. Although no common genetic variants were identified on GWAS testing, a significant association with Klinefelter's syndrome (47,XXY) was observed (p-value < 0.001). Subgroups identified through k-means clustering include a putative "COPD-subtype", "Restrictive-subtype", and a highly symptomatic "Metabolic-subtype". PRISm subjects are clinically and genetically heterogeneous. Future investigations into the pathophysiological mechanisms behind and potential treatment options for subgroups within PRISm are warranted. Clinicaltrials.gov Identifier: NCT00608764.
Recent developments in the genetics of ADHD.
Grimm, Oliver; Kittel-Schneider, Sarah; Reif, Andreas
2018-05-02
Attention deficit hyperactivity disorder (ADHD) is a developmental psychiatric disorder which affects children and adults. ADHD is one of the psychiatric disorders with the strongest genetic basis according to familial, twin and SNP-based epidemiological studies. In this review, we provide an update of recent insights into the genetic basis of ADHD. We discuss recent progress from genome-wide association studies (GWAS) looking at common variants as well as rare copy number variations (CNVs). New analyses of gene groups, so-called functional ontologies, provide some insight into the gene networks affected, pointing to the role of neurodevelopmentally expressed gene networks. Bioinformatic methods such as functional enrichment analysis and protein-protein network analysis are used to highlight biological processes of likely relevance to the aetiology of ADHD. Additionally, CNVs seem to map onto important pathways implicated in synaptic signalling and neurodevelopment. While some candidate gene associations, e.g. of neurotransmitter receptors and signalling, have been replicated, they do not seem to explain significant variance in recent GWAS. We discuss insights from recent case-control SNP-GWAS which yielded genome-wide significant SNPs in ADHD.
Kulbrock, Maike; Lehner, Stefanie; Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar
2013-01-01
Equine recurrent uveitis (ERU) is a common eye disease affecting 3-15% of the horse population. A genome-wide association study (GWAS) using the Illumina equine SNP50 bead chip was performed to identify loci conferring risk to ERU. The sample included a total of 144 German warmblood horses. A GWAS showed a significant single nucleotide polymorphism (SNP) on horse chromosome (ECA) 20 at 49.3 Mb, with IL-17A and IL-17F being the closest genes. This locus explained 23% of the phenotypic variance for ERU. A GWAS taking into account the severity of ERU revealed a SNP on ECA18 near the crystallin gene cluster CRYGA-CRYGF. For both genomic regions on ECA18 and 20, significantly associated haplotypes containing the genome-wide significant SNPs could be demonstrated. In conclusion, our results are indicative of a genetic component regulating the possible critical role of IL-17A and IL-17F in the pathogenesis of ERU. The associated SNP on ECA18 may be indicative of cataract formation in the course of ERU.
Identification of ten variants associated with risk of estrogen-receptor-negative breast cancer
Milne, Roger L; Kuchenbaecker, Karoline B; Michailidou, Kyriaki; Beesley, Jonathan; Kar, Siddhartha; Lindström, Sara; Hui, Shirley; Lemaçon, Audrey; Soucy, Penny; Dennis, Joe; Jiang, Xia; Rostamianfar, Asha; Finucane, Hilary; Bolla, Manjeet K; McGuffog, Lesley; Wang, Qin; Aalfs, Cora M; Adams, Marcia; Adlard, Julian; Agata, Simona; Ahmed, Shahana; Ahsan, Habibul; Aittomäki, Kristiina; Al-Ejeh, Fares; Allen, Jamie; Ambrosone, Christine B; Amos, Christopher I; Andrulis, Irene L; Anton-Culver, Hoda; Antonenkova, Natalia N; Arndt, Volker; Arnold, Norbert; Aronson, Kristan J; Auber, Bernd; Auer, Paul L; Ausems, Margreet G E M; Azzollini, Jacopo; Bacot, François; Balmaña, Judith; Barile, Monica; Barjhoux, Laure; Barkardottir, Rosa B; Barrdahl, Myrto; Barnes, Daniel; Barrowdale, Daniel; Baynes, Caroline; Beckmann, Matthias W; Benitez, Javier; Bermisheva, Marina; Bernstein, Leslie; Bignon, Yves-Jean; Blazer, Kathleen R; Blok, Marinus J; Blomqvist, Carl; Blot, William; Bobolis, Kristie; Boeckx, Bram; Bogdanova, Natalia V; Bojesen, Anders; Bojesen, Stig E; Bonanni, Bernardo; Børresen-Dale, Anne-Lise; Bozsik, Aniko; Bradbury, Angela R; Brand, Judith S; Brauch, Hiltrud; Brenner, Hermann; Bressac-de Paillerets, Brigitte; Brewer, Carole; Brinton, Louise; Broberg, Per; Brooks-Wilson, Angela; Brunet, Joan; Brüning, Thomas; Burwinkel, Barbara; Buys, Saundra S; Byun, Jinyoung; Cai, Qiuyin; Caldés, Trinidad; Caligo, Maria A; Campbell, Ian; Canzian, Federico; Caron, Olivier; Carracedo, Angel; Carter, Brian D; Castelao, J Esteban; Castera, Laurent; Caux-Moncoutier, Virginie; Chan, Salina B; Chang-Claude, Jenny; Chanock, Stephen J; Chen, Xiaoqing; Cheng, Ting-Yuan David; Chiquette, Jocelyne; Christiansen, Hans; Claes, Kathleen B M; Clarke, Christine L; Conner, Thomas; Conroy, Don M; Cook, Jackie; Cordina-Duverger, Emilie; Cornelissen, Sten; Coupier, Isabelle; Cox, Angela; Cox, David G; Cross, Simon S; Cuk, Katarina; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Damiola, Francesca; 
Darabi, Hatef; Davidson, Rosemarie; De Leeneer, Kim; Devilee, Peter; Dicks, Ed; Diez, Orland; Ding, Yuan Chun; Ditsch, Nina; Doheny, Kimberly F; Domchek, Susan M; Dorfling, Cecilia M; Dörk, Thilo; dos-Santos-Silva, Isabel; Dubois, Stéphane; Dugué, Pierre-Antoine; Dumont, Martine; Dunning, Alison M; Durcan, Lorraine; Dwek, Miriam; Dworniczak, Bernd; Eccles, Diana; Eeles, Ros; Ehrencrona, Hans; Eilber, Ursula; Ejlertsen, Bent; Ekici, Arif B; Engel, Christoph; Eriksson, Mikael; Fachal, Laura; Faivre, Laurence; Fasching, Peter A; Faust, Ulrike; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Foulkes, William D; Friedman, Eitan; Fritschi, Lin; Frost, Debra; Gabrielson, Marike; Gaddam, Pragna; Gammon, Marilie D; Ganz, Patricia A; Gapstur, Susan M; Garber, Judy; Garcia-Barberan, Vanesa; García-Sáenz, José A; Gaudet, Mia M; Gauthier-Villars, Marion; Gehrig, Andrea; Georgoulias, Vassilios; Gerdes, Anne-Marie; Giles, Graham G; Glendon, Gord; Godwin, Andrew K; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Goodfellow, Paul; Greene, Mark H; Grip, Mervi; Gronwald, Jacek; Grundy, Anne; Gschwantler-Kaulich, Daphne; Guénel, Pascal; Guo, Qi; Haeberle, Lothar; Hahnen, Eric; Haiman, Christopher A; Håkansson, Niclas; Hallberg, Emily; Hamann, Ute; Hamel, Nathalie; Hankinson, Susan; Hansen, Thomas V O; Harrington, Patricia; Hart, Steven N; Hartikainen, Jaana M; Healey, Catherine S; Hein, Alexander; Helbig, Sonja; Henderson, Alex; Heyworth, Jane; Hicks, Belynda; Hillemanns, Peter; Hodgson, Shirley; Hogervorst, Frans B; Hollestelle, Antoinette; Hooning, Maartje J; Hoover, Bob; Hopper, John L; Hu, Chunling; Huang, Guanmengqian; Hulick, Peter J; Humphreys, Keith; Hunter, David J; Imyanitov, Evgeny N; Isaacs, Claudine; Iwasaki, Motoki; Izatt, Louise; Jakubowska, Anna; James, Paul; Janavicius, Ramunas; Janni, Wolfgang; Jensen, Uffe Birk; John, Esther M; Johnson, Nichola; Jones, Kristine; Jones, Michael; Jukkola-Vuorinen, Arja; Kaaks, Rudolf; Kabisch, 
Maria; Kaczmarek, Katarzyna; Kang, Daehee; Kast, Karin; Keeman, Renske; Kerin, Michael J; Kets, Carolien M; Keupers, Machteld; Khan, Sofia; Khusnutdinova, Elza; Kiiski, Johanna I; Kim, Sung-Won; Knight, Julia A; Konstantopoulou, Irene; Kosma, Veli-Matti; Kristensen, Vessela N; Kruse, Torben A; Kwong, Ava; Lænkholm, Anne-Vibeke; Laitman, Yael; Lalloo, Fiona; Lambrechts, Diether; Landsman, Keren; Lasset, Christine; Lazaro, Conxi; Le Marchand, Loic; Lecarpentier, Julie; Lee, Andrew; Lee, Eunjung; Lee, Jong Won; Lee, Min Hyuk; Lejbkowicz, Flavio; Lesueur, Fabienne; Li, Jingmei; Lilyquist, Jenna; Lincoln, Anne; Lindblom, Annika; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Long, Jirong; Loud, Jennifer T; Lubinski, Jan; Luccarini, Craig; Lush, Michael; MacInnis, Robert J; Maishman, Tom; Makalic, Enes; Kostovska, Ivana Maleva; Malone, Kathleen E; Manoukian, Siranoush; Manson, JoAnn E; Margolin, Sara; Martens, John W M; Martinez, Maria Elena; Matsuo, Keitaro; Mavroudis, Dimitrios; Mazoyer, Sylvie; McLean, Catriona; Meijers-Heijboer, Hanne; Menéndez, Primitiva; Meyer, Jeffery; Miao, Hui; Miller, Austin; Miller, Nicola; Mitchell, Gillian; Montagna, Marco; Muir, Kenneth; Mulligan, Anna Marie; Mulot, Claire; Nadesan, Sue; Nathanson, Katherine L; Neuhausen, Susan L; Nevanlinna, Heli; Nevelsteen, Ines; Niederacher, Dieter; Nielsen, Sune F; Nordestgaard, Børge G; Norman, Aaron; Nussbaum, Robert L; Olah, Edith; Olopade, Olufunmilayo I; Olson, Janet E; Olswold, Curtis; Ong, Kai-ren; Oosterwijk, Jan C; Orr, Nick; Osorio, Ana; Pankratz, V Shane; Papi, Laura; Park-Simon, Tjoung-Won; Paulsson-Karlsson, Ylva; Lloyd, Rachel; Pedersen, Inge Søkilde; Peissel, Bernard; Peixoto, Ana; Perez, Jose I A; Peterlongo, Paolo; Peto, Julian; Pfeiler, Georg; Phelan, Catherine M; Pinchev, Mila; Plaseska-Karanfilska, Dijana; Poppe, Bruce; Porteous, Mary E; Prentice, Ross; Presneau, Nadege; Prokofieva, Darya; Pugh, Elizabeth; Pujana, Miquel Angel; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; 
Rahman, Nazneen; Rantala, Johanna; Rappaport-Fuerhauser, Christine; Rennert, Gad; Rennert, Hedy S; Rhenius, Valerie; Rhiem, Kerstin; Richardson, Andrea; Rodriguez, Gustavo C; Romero, Atocha; Romm, Jane; Rookus, Matti A; Rudolph, Anja; Ruediger, Thomas; Saloustros, Emmanouil; Sanders, Joyce; Sandler, Dale P; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Daniel F; Schoemaker, Minouk J; Schumacher, Fredrick; Schürmann, Peter; Schwentner, Lukas; Scott, Christopher; Scott, Rodney J; Seal, Sheila; Senter, Leigha; Seynaeve, Caroline; Shah, Mitul; Sharma, Priyanka; Shen, Chen-Yang; Sheng, Xin; Shimelis, Hermela; Shrubsole, Martha J; Shu, Xiao-Ou; Side, Lucy E; Singer, Christian F; Sohn, Christof; Southey, Melissa C; Spinelli, John J; Spurdle, Amanda B; Stegmaier, Christa; Stoppa-Lyonnet, Dominique; Sukiennicki, Grzegorz; Surowy, Harald; Sutter, Christian; Swerdlow, Anthony; Szabo, Csilla I; Tamimi, Rulla M; Tan, Yen Y; Taylor, Jack A; Tejada, Maria-Isabel; Tengström, Maria; Teo, Soo H; Terry, Mary B; Tessier, Daniel C; Teulé, Alex; Thöne, Kathrin; Thull, Darcy L; Tibiletti, Maria Grazia; Tihomirova, Laima; Tischkowitz, Marc; Toland, Amanda E; Tollenaar, Rob A E M; Tomlinson, Ian; Tong, Ling; Torres, Diana; Tranchant, Martine; Truong, Thérèse; Tucker, Kathy; Tung, Nadine; Tyrer, Jonathan; Ulmer, Hans-Ulrich; Vachon, Celine; van Asperen, Christi J; Van Den Berg, David; van den Ouweland, Ans M W; van Rensburg, Elizabeth J; Varesco, Liliana; Varon-Mateeva, Raymonda; Vega, Ana; Viel, Alessandra; Vijai, Joseph; Vincent, Daniel; Vollenweider, Jason; Walker, Lisa; Wang, Zhaoming; Wang-Gohrke, Shan; Wappenschmidt, Barbara; Weinberg, Clarice R; Weitzel, Jeffrey N; Wendt, Camilla; Wesseling, Jelle; Whittemore, Alice S; Wijnen, Juul T; Willett, Walter; Winqvist, Robert; Wolk, Alicja; Wu, Anna H; Xia, Lucy; Yang, Xiaohong R; Yannoukakos, Drakoulis; Zaffaroni, Daniela; Zheng, Wei; Zhu, Bin; Ziogas, Argyrios; Ziv, Elad; Zorn, Kristin K; Gago-Dominguez, Manuela; Mannermaa, Arto; 
Olsson, Håkan; Teixeira, Manuel R; Stone, Jennifer; Offit, Kenneth; Ottini, Laura; Park, Sue K; Thomassen, Mads; Hall, Per; Meindl, Alfons; Schmutzler, Rita K; Droit, Arnaud; Bader, Gary D; Pharoah, Paul D P; Couch, Fergus J; Easton, Douglas F; Kraft, Peter; Chenevix-Trench, Georgia; García-Closas, Montserrat; Schmidt, Marjanka K; Antoniou, Antonis C; Simard, Jacques
2018-01-01
Most common breast cancer susceptibility variants have been identified through genome-wide association studies (GWAS) of predominantly estrogen receptor (ER)-positive disease. We conducted a GWAS using 21,468 ER-negative cases and 100,594 controls combined with 18,908 BRCA1 mutation carriers (9,414 with breast cancer), all of European origin. We identified independent associations at P < 5 × 10−8 with ten variants at nine new loci. At P < 0.05, we replicated associations with 10 of 11 variants previously reported in ER-negative disease or BRCA1 mutation carrier GWAS and observed consistent associations with ER-negative disease for 105 susceptibility variants identified by other studies. These 125 variants explain approximately 14% of the familial risk of this breast cancer subtype. There was high genetic correlation (0.72) between risk of ER-negative breast cancer and breast cancer risk for BRCA1 mutation carriers. These findings may lead to improved risk prediction and inform further fine-mapping and functional work to better understand the biological basis of ER-negative breast cancer. PMID:29058716
Identification of ten variants associated with risk of estrogen-receptor-negative breast cancer.
Milne, Roger L; Kuchenbaecker, Karoline B; Michailidou, Kyriaki; Beesley, Jonathan; Kar, Siddhartha; Lindström, Sara; Hui, Shirley; Lemaçon, Audrey; Soucy, Penny; Dennis, Joe; Jiang, Xia; Rostamianfar, Asha; Finucane, Hilary; Bolla, Manjeet K; McGuffog, Lesley; Wang, Qin; Aalfs, Cora M; Adams, Marcia; Adlard, Julian; Agata, Simona; Ahmed, Shahana; Ahsan, Habibul; Aittomäki, Kristiina; Al-Ejeh, Fares; Allen, Jamie; Ambrosone, Christine B; Amos, Christopher I; Andrulis, Irene L; Anton-Culver, Hoda; Antonenkova, Natalia N; Arndt, Volker; Arnold, Norbert; Aronson, Kristan J; Auber, Bernd; Auer, Paul L; Ausems, Margreet G E M; Azzollini, Jacopo; Bacot, François; Balmaña, Judith; Barile, Monica; Barjhoux, Laure; Barkardottir, Rosa B; Barrdahl, Myrto; Barnes, Daniel; Barrowdale, Daniel; Baynes, Caroline; Beckmann, Matthias W; Benitez, Javier; Bermisheva, Marina; Bernstein, Leslie; Bignon, Yves-Jean; Blazer, Kathleen R; Blok, Marinus J; Blomqvist, Carl; Blot, William; Bobolis, Kristie; Boeckx, Bram; Bogdanova, Natalia V; Bojesen, Anders; Bojesen, Stig E; Bonanni, Bernardo; Børresen-Dale, Anne-Lise; Bozsik, Aniko; Bradbury, Angela R; Brand, Judith S; Brauch, Hiltrud; Brenner, Hermann; Bressac-de Paillerets, Brigitte; Brewer, Carole; Brinton, Louise; Broberg, Per; Brooks-Wilson, Angela; Brunet, Joan; Brüning, Thomas; Burwinkel, Barbara; Buys, Saundra S; Byun, Jinyoung; Cai, Qiuyin; Caldés, Trinidad; Caligo, Maria A; Campbell, Ian; Canzian, Federico; Caron, Olivier; Carracedo, Angel; Carter, Brian D; Castelao, J Esteban; Castera, Laurent; Caux-Moncoutier, Virginie; Chan, Salina B; Chang-Claude, Jenny; Chanock, Stephen J; Chen, Xiaoqing; Cheng, Ting-Yuan David; Chiquette, Jocelyne; Christiansen, Hans; Claes, Kathleen B M; Clarke, Christine L; Conner, Thomas; Conroy, Don M; Cook, Jackie; Cordina-Duverger, Emilie; Cornelissen, Sten; Coupier, Isabelle; Cox, Angela; Cox, David G; Cross, Simon S; Cuk, Katarina; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Damiola, Francesca; 
Darabi, Hatef; Davidson, Rosemarie; De Leeneer, Kim; Devilee, Peter; Dicks, Ed; Diez, Orland; Ding, Yuan Chun; Ditsch, Nina; Doheny, Kimberly F; Domchek, Susan M; Dorfling, Cecilia M; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dubois, Stéphane; Dugué, Pierre-Antoine; Dumont, Martine; Dunning, Alison M; Durcan, Lorraine; Dwek, Miriam; Dworniczak, Bernd; Eccles, Diana; Eeles, Ros; Ehrencrona, Hans; Eilber, Ursula; Ejlertsen, Bent; Ekici, Arif B; Eliassen, A Heather; Engel, Christoph; Eriksson, Mikael; Fachal, Laura; Faivre, Laurence; Fasching, Peter A; Faust, Ulrike; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Foulkes, William D; Friedman, Eitan; Fritschi, Lin; Frost, Debra; Gabrielson, Marike; Gaddam, Pragna; Gammon, Marilie D; Ganz, Patricia A; Gapstur, Susan M; Garber, Judy; Garcia-Barberan, Vanesa; García-Sáenz, José A; Gaudet, Mia M; Gauthier-Villars, Marion; Gehrig, Andrea; Georgoulias, Vassilios; Gerdes, Anne-Marie; Giles, Graham G; Glendon, Gord; Godwin, Andrew K; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Goodfellow, Paul; Greene, Mark H; Alnæs, Grethe I Grenaker; Grip, Mervi; Gronwald, Jacek; Grundy, Anne; Gschwantler-Kaulich, Daphne; Guénel, Pascal; Guo, Qi; Haeberle, Lothar; Hahnen, Eric; Haiman, Christopher A; Håkansson, Niclas; Hallberg, Emily; Hamann, Ute; Hamel, Nathalie; Hankinson, Susan; Hansen, Thomas V O; Harrington, Patricia; Hart, Steven N; Hartikainen, Jaana M; Healey, Catherine S; Hein, Alexander; Helbig, Sonja; Henderson, Alex; Heyworth, Jane; Hicks, Belynda; Hillemanns, Peter; Hodgson, Shirley; Hogervorst, Frans B; Hollestelle, Antoinette; Hooning, Maartje J; Hoover, Bob; Hopper, John L; Hu, Chunling; Huang, Guanmengqian; Hulick, Peter J; Humphreys, Keith; Hunter, David J; Imyanitov, Evgeny N; Isaacs, Claudine; Iwasaki, Motoki; Izatt, Louise; Jakubowska, Anna; James, Paul; Janavicius, Ramunas; Janni, Wolfgang; Jensen, Uffe Birk; John, Esther M; Johnson, Nichola; Jones, Kristine; Jones, Michael; 
Jukkola-Vuorinen, Arja; Kaaks, Rudolf; Kabisch, Maria; Kaczmarek, Katarzyna; Kang, Daehee; Kast, Karin; Keeman, Renske; Kerin, Michael J; Kets, Carolien M; Keupers, Machteld; Khan, Sofia; Khusnutdinova, Elza; Kiiski, Johanna I; Kim, Sung-Won; Knight, Julia A; Konstantopoulou, Irene; Kosma, Veli-Matti; Kristensen, Vessela N; Kruse, Torben A; Kwong, Ava; Lænkholm, Anne-Vibeke; Laitman, Yael; Lalloo, Fiona; Lambrechts, Diether; Landsman, Keren; Lasset, Christine; Lazaro, Conxi; Le Marchand, Loic; Lecarpentier, Julie; Lee, Andrew; Lee, Eunjung; Lee, Jong Won; Lee, Min Hyuk; Lejbkowicz, Flavio; Lesueur, Fabienne; Li, Jingmei; Lilyquist, Jenna; Lincoln, Anne; Lindblom, Annika; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Long, Jirong; Loud, Jennifer T; Lubinski, Jan; Luccarini, Craig; Lush, Michael; MacInnis, Robert J; Maishman, Tom; Makalic, Enes; Kostovska, Ivana Maleva; Malone, Kathleen E; Manoukian, Siranoush; Manson, JoAnn E; Margolin, Sara; Martens, John W M; Martinez, Maria Elena; Matsuo, Keitaro; Mavroudis, Dimitrios; Mazoyer, Sylvie; McLean, Catriona; Meijers-Heijboer, Hanne; Menéndez, Primitiva; Meyer, Jeffery; Miao, Hui; Miller, Austin; Miller, Nicola; Mitchell, Gillian; Montagna, Marco; Muir, Kenneth; Mulligan, Anna Marie; Mulot, Claire; Nadesan, Sue; Nathanson, Katherine L; Neuhausen, Susan L; Nevanlinna, Heli; Nevelsteen, Ines; Niederacher, Dieter; Nielsen, Sune F; Nordestgaard, Børge G; Norman, Aaron; Nussbaum, Robert L; Olah, Edith; Olopade, Olufunmilayo I; Olson, Janet E; Olswold, Curtis; Ong, Kai-Ren; Oosterwijk, Jan C; Orr, Nick; Osorio, Ana; Pankratz, V Shane; Papi, Laura; Park-Simon, Tjoung-Won; Paulsson-Karlsson, Ylva; Lloyd, Rachel; Pedersen, Inge Søkilde; Peissel, Bernard; Peixoto, Ana; Perez, Jose I A; Peterlongo, Paolo; Peto, Julian; Pfeiler, Georg; Phelan, Catherine M; Pinchev, Mila; Plaseska-Karanfilska, Dijana; Poppe, Bruce; Porteous, Mary E; Prentice, Ross; Presneau, Nadege; Prokofieva, Darya; Pugh, Elizabeth; Pujana, Miquel Angel; 
Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rantala, Johanna; Rappaport-Fuerhauser, Christine; Rennert, Gad; Rennert, Hedy S; Rhenius, Valerie; Rhiem, Kerstin; Richardson, Andrea; Rodriguez, Gustavo C; Romero, Atocha; Romm, Jane; Rookus, Matti A; Rudolph, Anja; Ruediger, Thomas; Saloustros, Emmanouil; Sanders, Joyce; Sandler, Dale P; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Daniel F; Schoemaker, Minouk J; Schumacher, Fredrick; Schürmann, Peter; Schwentner, Lukas; Scott, Christopher; Scott, Rodney J; Seal, Sheila; Senter, Leigha; Seynaeve, Caroline; Shah, Mitul; Sharma, Priyanka; Shen, Chen-Yang; Sheng, Xin; Shimelis, Hermela; Shrubsole, Martha J; Shu, Xiao-Ou; Side, Lucy E; Singer, Christian F; Sohn, Christof; Southey, Melissa C; Spinelli, John J; Spurdle, Amanda B; Stegmaier, Christa; Stoppa-Lyonnet, Dominique; Sukiennicki, Grzegorz; Surowy, Harald; Sutter, Christian; Swerdlow, Anthony; Szabo, Csilla I; Tamimi, Rulla M; Tan, Yen Y; Taylor, Jack A; Tejada, Maria-Isabel; Tengström, Maria; Teo, Soo H; Terry, Mary B; Tessier, Daniel C; Teulé, Alex; Thöne, Kathrin; Thull, Darcy L; Tibiletti, Maria Grazia; Tihomirova, Laima; Tischkowitz, Marc; Toland, Amanda E; Tollenaar, Rob A E M; Tomlinson, Ian; Tong, Ling; Torres, Diana; Tranchant, Martine; Truong, Thérèse; Tucker, Kathy; Tung, Nadine; Tyrer, Jonathan; Ulmer, Hans-Ulrich; Vachon, Celine; van Asperen, Christi J; Van Den Berg, David; van den Ouweland, Ans M W; van Rensburg, Elizabeth J; Varesco, Liliana; Varon-Mateeva, Raymonda; Vega, Ana; Viel, Alessandra; Vijai, Joseph; Vincent, Daniel; Vollenweider, Jason; Walker, Lisa; Wang, Zhaoming; Wang-Gohrke, Shan; Wappenschmidt, Barbara; Weinberg, Clarice R; Weitzel, Jeffrey N; Wendt, Camilla; Wesseling, Jelle; Whittemore, Alice S; Wijnen, Juul T; Willett, Walter; Winqvist, Robert; Wolk, Alicja; Wu, Anna H; Xia, Lucy; Yang, Xiaohong R; Yannoukakos, Drakoulis; Zaffaroni, Daniela; Zheng, Wei; Zhu, Bin; Ziogas, Argyrios; Ziv, Elad; Zorn, Kristin K; 
Gago-Dominguez, Manuela; Mannermaa, Arto; Olsson, Håkan; Teixeira, Manuel R; Stone, Jennifer; Offit, Kenneth; Ottini, Laura; Park, Sue K; Thomassen, Mads; Hall, Per; Meindl, Alfons; Schmutzler, Rita K; Droit, Arnaud; Bader, Gary D; Pharoah, Paul D P; Couch, Fergus J; Easton, Douglas F; Kraft, Peter; Chenevix-Trench, Georgia; García-Closas, Montserrat; Schmidt, Marjanka K; Antoniou, Antonis C; Simard, Jacques
2017-12-01
Most common breast cancer susceptibility variants have been identified through genome-wide association studies (GWAS) of predominantly estrogen receptor (ER)-positive disease. We conducted a GWAS using 21,468 ER-negative cases and 100,594 controls combined with 18,908 BRCA1 mutation carriers (9,414 with breast cancer), all of European origin. We identified independent associations at P < 5 × 10−8 with ten variants at nine new loci. At P < 0.05, we replicated associations with 10 of 11 variants previously reported in ER-negative disease or BRCA1 mutation carrier GWAS and observed consistent associations with ER-negative disease for 105 susceptibility variants identified by other studies. These 125 variants explain approximately 16% of the familial risk of this breast cancer subtype. There was high genetic correlation (0.72) between risk of ER-negative breast cancer and breast cancer risk for BRCA1 mutation carriers. These findings may lead to improved risk prediction and inform further fine-mapping and functional work to better understand the biological basis of ER-negative breast cancer.
Convergent evidence from systematic analysis of GWAS revealed genetic basis of esophageal cancer.
Gao, Xue-Xin; Gao, Lei; Wang, Jiu-Qiang; Qu, Su-Su; Qu, Yue; Sun, Hong-Lei; Liu, Si-Dang; Shang, Ying-Li
2016-07-12
Recent genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with risk of esophageal cancer (EC). However, investigation of the genetic basis from the perspective of systematic biology and integrative genomics remains scarce. In this study, we explored the genetic basis of EC based on GWAS data and implemented a series of bioinformatics methods including functional annotation, expression quantitative trait loci (eQTL) analysis, pathway enrichment analysis and pathway grouped network analysis. Two hundred and thirteen risk SNPs were identified, of which 44 SNPs were found to have significantly differential gene expression in esophageal tissues by eQTL analysis. By pathway enrichment analysis, 170 risk genes mapped by risk SNPs were enriched into 38 significant GO terms and 17 significant KEGG pathways, which were significantly grouped into 9 sub-networks by pathway grouped network analysis. The 9 groups of interconnected pathways were mainly involved with muscle cell proliferation, cellular response to interleukin-6, cell adhesion molecules, and ethanol oxidation, which might participate in the development of EC. Our findings provide genetic evidence and new insight for exploring the molecular mechanisms of EC.
Raffler, Johannes; Friedrich, Nele; Arnold, Matthias; Kacprowski, Tim; Rueedi, Rico; Altmaier, Elisabeth; Bergmann, Sven; Budde, Kathrin; Gieger, Christian; Homuth, Georg; Pietzner, Maik; Römisch-Margl, Werner; Strauch, Konstantin; Völzke, Henry; Waldenberger, Melanie; Wallaschofski, Henri; Nauck, Matthias; Völker, Uwe; Kastenmüller, Gabi; Suhre, Karsten
2015-01-01
Genome-wide association studies with metabolic traits (mGWAS) uncovered many genetic variants that influence human metabolism. These genetically influenced metabotypes (GIMs) contribute to our metabolic individuality, our capacity to respond to environmental challenges, and our susceptibility to specific diseases. While metabolic homeostasis in blood is a well investigated topic in large mGWAS with over 150 known loci, metabolic detoxification through urinary excretion has only been addressed by few small mGWAS with only 11 associated loci so far. Here we report the largest mGWAS to date, combining targeted and non-targeted 1H NMR analysis of urine samples from 3,861 participants of the SHIP-0 cohort and 1,691 subjects of the KORA F4 cohort. We identified and replicated 22 loci with significant associations with urinary traits, 15 of which are new (HIBCH, CPS1, AGXT, XYLB, TKT, ETNPPL, SLC6A19, DMGDH, SLC36A2, GLDC, SLC6A13, ACSM3, SLC5A11, PNMT, SLC13A3). Two-thirds of the urinary loci also have a metabolite association in blood. For all but one of the 6 loci where significant associations target the same metabolite in blood and urine, the genetic effects have the same direction in both fluids. In contrast, for the SLC5A11 locus, we found increased levels of myo-inositol in urine whereas mGWAS in blood reported decreased levels for the same genetic variant. This might indicate less effective re-absorption of myo-inositol in the kidneys of carriers. In summary, our study more than doubles the number of known loci that influence urinary phenotypes. It thus allows novel insights into the relationship between blood homeostasis and its regulation through excretion. The newly discovered loci also include variants previously linked to chronic kidney disease (CPS1, SLC6A13), pulmonary hypertension (CPS1), and ischemic stroke (XYLB). 
By establishing connections from gene to disease via metabolic traits our results provide novel hypotheses about molecular mechanisms involved in the etiology of diseases. PMID:26352407
A genome-wide association study of seed protein and oil content in soybean
2014-01-01
Background Association analysis is an alternative to conventional family-based methods to detect the location of gene(s) or quantitative trait loci (QTL) and provides relatively high resolution in terms of defining the genome position of a gene or QTL. Seed protein and oil concentration are quantitative traits which are determined by the interaction among many genes with small to moderate genetic effects and their interaction with the environment. In this study, a genome-wide association study (GWAS) was performed to identify quantitative trait loci (QTL) controlling seed protein and oil concentration in 298 soybean germplasm accessions exhibiting a wide range of seed protein and oil content. Results A total of 55,159 single nucleotide polymorphisms (SNPs) were genotyped using various methods including Illumina Infinium and GoldenGate assays and 31,954 markers with minor allele frequency >0.10 were used to estimate linkage disequilibrium (LD) in heterochromatic and euchromatic regions. In euchromatic regions, the mean LD (r2) rapidly declined to 0.2 within 360 Kbp, whereas the mean LD declined to 0.2 at 9,600 Kbp in heterochromatic regions. The GWAS results identified 40 SNPs in 17 different genomic regions significantly associated with seed protein. Of these, the five SNPs with the highest associations and seven adjacent SNPs were located in the 27.6-30.0 Mbp region of Gm20. A major seed protein QTL has been previously mapped to the same location and potential candidate genes have recently been identified in this region. The GWAS results also detected 25 SNPs in 13 different genomic regions associated with seed oil. Of these markers, seven SNPs had a significant association with both protein and oil. Conclusions This research indicated that GWAS not only identified most of the previously reported QTL controlling seed protein and oil, but also resulted in narrower genomic regions than the regions reported as containing these QTL. 
The narrower GWAS-defined genome regions will allow more precise marker-assisted allele selection and will expedite positional cloning of the causal gene(s). PMID:24382143
A genome-wide association study of seed protein and oil content in soybean.
Hwang, Eun-Young; Song, Qijian; Jia, Gaofeng; Specht, James E; Hyten, David L; Costa, Jose; Cregan, Perry B
2014-01-02
Association analysis is an alternative to conventional family-based methods to detect the location of gene(s) or quantitative trait loci (QTL) and provides relatively high resolution in terms of defining the genome position of a gene or QTL. Seed protein and oil concentration are quantitative traits which are determined by the interaction among many genes with small to moderate genetic effects and their interaction with the environment. In this study, a genome-wide association study (GWAS) was performed to identify quantitative trait loci (QTL) controlling seed protein and oil concentration in 298 soybean germplasm accessions exhibiting a wide range of seed protein and oil content. A total of 55,159 single nucleotide polymorphisms (SNPs) were genotyped using various methods including Illumina Infinium and GoldenGate assays and 31,954 markers with minor allele frequency >0.10 were used to estimate linkage disequilibrium (LD) in heterochromatic and euchromatic regions. In euchromatic regions, the mean LD (r2) rapidly declined to 0.2 within 360 Kbp, whereas the mean LD declined to 0.2 at 9,600 Kbp in heterochromatic regions. The GWAS results identified 40 SNPs in 17 different genomic regions significantly associated with seed protein. Of these, the five SNPs with the highest associations and seven adjacent SNPs were located in the 27.6-30.0 Mbp region of Gm20. A major seed protein QTL has been previously mapped to the same location and potential candidate genes have recently been identified in this region. The GWAS results also detected 25 SNPs in 13 different genomic regions associated with seed oil. Of these markers, seven SNPs had a significant association with both protein and oil. This research indicated that GWAS not only identified most of the previously reported QTL controlling seed protein and oil, but also resulted in narrower genomic regions than the regions reported as containing these QTL. 
The narrower GWAS-defined genome regions will allow more precise marker-assisted allele selection and will expedite positional cloning of the causal gene(s).
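The LD statistic quoted in this abstract (r2) can be illustrated with a small sketch. It assumes phased haplotypes for two biallelic markers with alleles coded 0/1; the function name `ld_r2` and the toy data are hypothetical and are not taken from the study.

```python
def ld_r2(haplotypes):
    """Squared correlation (r^2) between two biallelic markers.

    haplotypes: list of (a, b) tuples, one per phased haplotype,
    with each allele coded 0 or 1 (an assumed encoding).
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n          # frequency of allele 1 at marker A
    p_b = sum(b for _, b in haplotypes) / n          # frequency of allele 1 at marker B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                             # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both markers must be polymorphic")
    return d * d / denom


# Toy data: complete LD versus no LD.
linked = [(0, 0), (1, 1)] * 10
unlinked = [(0, 0), (0, 1), (1, 0), (1, 1)] * 5
print(ld_r2(linked), ld_r2(unlinked))
```

An LD decay curve of the kind summarised above would average such r2 values over marker pairs binned by physical distance.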
Pe’er, Itsik
2017-01-01
Genome-wide association studies (GWAS) have identified hundreds of SNPs responsible for variation in human quantitative traits. However, genome-wide-significant associations often fail to replicate across independent cohorts, seemingly at odds with their strong effects in discovery cohorts. This limited success of replication raises pervasive questions about the utility of the GWAS field. We identify all 332 studies of quantitative traits from the NHGRI-EBI GWAS Database with attempted replication. We find that the majority of studies provide insufficient data to evaluate replication rates. The remaining papers replicate significantly worse than expected (p < 10−14), even when adjusting for regression-to-the-mean of effect size between discovery and replication cohorts, termed the Winner’s Curse (p < 10−16). We show this is due in part to misreporting replication cohort size as a maximum number, rather than a per-locus one. In 39 studies accurately reporting per-locus cohort size for attempted replication of 707 loci in samples with similar ancestry, replication rate matched expectation (predicted 458, observed 457, p = 0.94). In contrast, ancestry differences between replication and discovery (13 studies, 385 loci) cause the most highly-powered decile of loci to replicate worse than expected, due to differences in linkage disequilibrium. PMID:28715421
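The comparison of predicted and observed replication counts can be sketched as a sum of per-locus power. This is a simplified illustration, not the authors' procedure. It assumes each locus has an effect estimate already corrected for the Winner's Curse, a replication standard error that reflects the per-locus sample size, and a one-sided test in the discovery direction. All names are hypothetical.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def replication_power(beta, se_rep, alpha=0.05):
    """Chance that a one-sided test at level alpha detects an effect of
    size beta in a replication cohort with standard error se_rep."""
    z_crit = _STD_NORMAL.inv_cdf(1 - alpha)
    return _STD_NORMAL.cdf(abs(beta) / se_rep - z_crit)


def expected_replications(loci, alpha=0.05):
    """Expected number of replicating loci; loci is a list of
    (corrected_beta, replication_se) pairs."""
    return sum(replication_power(b, s, alpha) for b, s in loci)


# Made-up loci: the standard error shrinks roughly as 1/sqrt(n), so
# overstating the per-locus sample size overstates the expected count.
toy_loci = [(0.10, 0.03), (0.05, 0.03), (0.02, 0.03)]
print(expected_replications(toy_loci))
```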
Meta-analysis and genome-wide interpretation of genetic susceptibility to drug addiction
2011-01-01
Background Classical genetic studies provide strong evidence for heritable contributions to susceptibility to developing dependence on addictive substances. Candidate gene and genome-wide association studies (GWAS) have sought genes, chromosomal regions and allelic variants likely to contribute to susceptibility to drug addiction. Results Here, we performed a meta-analysis of addiction candidate gene association studies and GWAS to investigate possible functional mechanisms associated with addiction susceptibility. From meta-data retrieved from 212 publications on candidate gene association studies and 5 GWAS reports, we linked a total of 843 haplotypes to addiction susceptibility. We mapped the SNPs in these haplotypes to functional and regulatory elements in the genome and estimated the magnitude of the contributions of different molecular mechanisms to their effects on addiction susceptibility. In addition to SNPs in coding regions, these data suggest that haplotypes in gene regulatory regions may also contribute to addiction susceptibility. When we compared the lists of genes identified by association studies and those identified by molecular biological studies of drug-regulated genes, we observed significantly higher participation in the same gene interaction networks than expected by chance, despite little overlap between the two gene lists. Conclusions These results appear to offer new insights into the genetic factors underlying drug addiction. PMID:21999673
BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters.
Huang, Hailiang; Tata, Sandeep; Prill, Robert J
2013-01-01
Computational workloads for genome-wide association studies (GWAS) are growing in scale and complexity, outpacing the capabilities of single-threaded software designed for personal computers. The BlueSNP R package implements GWAS statistical tests in the R programming language and executes the calculations across computer clusters configured with Apache Hadoop, a de facto standard framework for distributed data processing using the MapReduce formalism. BlueSNP makes computationally intensive analyses, such as estimating empirical p-values via data permutation, and searching for expression quantitative trait loci over thousands of genes, feasible for large genotype-phenotype datasets. http://github.com/ibm-bioinformatics/bluesnp
Lin, Ying-Ju; Liao, Wen-Ling; Wang, Chung-Hsing; Tsai, Li-Ping; Tang, Chih-Hsin; Chen, Chien-Hsiun; Wu, Jer-Yuarn; Liang, Wen-Miin; Hsieh, Ai-Ru; Cheng, Chi-Fung; Chen, Jin-Hua; Chien, Wen-Kuei; Lin, Ting-Hsu; Wu, Chia-Ming; Liao, Chiu-Chu; Huang, Shao-Mei; Tsai, Fuu-Jen
2017-07-25
Human height can be described as a classical and inherited trait model. Genome-wide association studies (GWAS) have revealed susceptible loci and provided insights into the polygenic nature of human height. Familial short stature (FSS) represents a suitable trait for investigating short stature genetics because disease associations with short stature have been ruled out in this case. In addition, FSS is caused only by genetically inherited factors. In this study, we explored the correlations of FSS risk with the genetic loci associated with human height in previous GWAS, alone and cumulatively. We systematically evaluated 34 known human height single nucleotide polymorphisms (SNPs) in relation to FSS in the additive model (p < 0.00005). A cumulative effect was observed: the odds ratios gradually increased with increasing genetic risk score quartiles (p < 0.001; Cochran-Armitage trend test). Six affected genes (ZBTB38, ZNF638, LCORL, CABLES1, CDK10, and TSEN15) are located in the nucleus and have been implicated in embryonic, organismal, and tissue development. In conclusion, our study suggests that 13 human height GWAS-identified SNPs are associated with FSS risk both alone and cumulatively.
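The trend test named in this abstract can be sketched as follows. This is the textbook Cochran-Armitage statistic for a 2 × k table with equally spaced scores and a normal approximation. The quartile counts are invented for illustration and are not the study's data.

```python
from math import erfc, sqrt


def cochran_armitage(cases, controls, scores=None):
    """Cochran-Armitage trend test across ordered groups.

    cases, controls: counts per ordered group (e.g. risk score quartiles).
    scores: group scores; defaults to 0, 1, 2, ...
    Returns (z, two_sided_p) using the normal approximation.
    """
    if scores is None:
        scores = list(range(len(cases)))
    totals = [r + c for r, c in zip(cases, controls)]
    n = sum(totals)
    p = sum(cases) / n                                # overall case fraction
    t = sum(s * (r - m * p) for s, r, m in zip(scores, cases, totals))
    s1 = sum(m * s for s, m in zip(scores, totals))
    s2 = sum(m * s * s for s, m in zip(scores, totals))
    var = p * (1 - p) * (s2 - s1 * s1 / n)
    z = t / sqrt(var)
    return z, erfc(abs(z) / sqrt(2))


# Hypothetical counts: the case fraction rises across four quartiles.
z, p_value = cochran_armitage([10, 20, 30, 40], [40, 30, 20, 10])
print(z, p_value)
```

A positive z indicates that the proportion of cases rises with the group score, which is the pattern described for the genetic risk score quartiles.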
Mahmoudpour, Seyed Hamidreza; Veluchamy, Abirami; Siddiqui, Moneeza Kalhan; Asselbergs, Folkert W.; Souverein, Patrick C.; de Keyser, Catherine E.; Hofman, Albert; Lang, Chim C.; Doney, Alexander SF.; Stricker, Bruno H.; de Boer, Anthonius; Maitland-van der Zee, Anke-Hilse; Palmer, Colin NA.
2016-01-01
Objectives To identify SNPs associated with switching from an ACE-inhibitor to an angiotensin receptor blocker (ARB). Methods Two cohorts of patients starting ACE-inhibitors were identified within the Rotterdam Study in the Netherlands and the GoDARTS study in Scotland. Cases were intolerant subjects who switched from an ACE-inhibitor to an ARB; controls were subjects who used ACE-inhibitors continuously for at least 2 years and did not switch. GWAS using an additive model was run in these sets and results were meta-analysed using GWAMA. Results 972 cases out of 5,161 ACE-inhibitor starters were identified. 8 SNPs within 4 genes reached the GWAS significance level (P<5×10−8) in the meta-analysis (RBFOX3, GABRG2, SH2B1 and MBOAT1). The strongest associated SNP was located in an intron of RBFOX3, which encodes an RNA binding protein (rs2061538: MAF=0.16, OR=1.52 [95% CI: 1.32-1.76], p=6.2×10−9). Conclusions These results indicate that genetic variation in the abovementioned genes may increase the risk of ACE-inhibitor-induced adverse reactions. PMID:28030426
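As a worked check of how an odds ratio, its confidence interval and a p-value relate, the sketch below recovers an approximate Wald z-score from the figures reported for rs2061538. It assumes a symmetric interval on the log-odds scale. Because the published interval is rounded, the recomputed p-value is only expected to land near the reported one, not to match it.

```python
from math import erfc, log, sqrt


def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_ci=1.959964):
    """Approximate standard error, z-score and two-sided p-value from an
    odds ratio and its 95% confidence interval (z_ci is the 97.5th
    percentile of the standard normal distribution)."""
    se = (log(ci_high) - log(ci_low)) / (2 * z_ci)   # SE of log(OR)
    z = log(odds_ratio) / se
    p = erfc(abs(z) / sqrt(2))                       # two-sided normal p-value
    return se, z, p


se, z, p = wald_from_or_ci(1.52, 1.32, 1.76)
print(se, z, p)
```

With these rounded inputs the z-score is a little under 6 and the p-value falls below the 5×10−8 genome-wide threshold.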
Pathway-Based Genome-Wide Association Studies for Two Meat Production Traits in Simmental Cattle.
Fan, Huizhong; Wu, Yang; Zhou, Xiaojing; Xia, Jiangwei; Zhang, Wengang; Song, Yuxin; Liu, Fei; Chen, Yan; Zhang, Lupei; Gao, Xue; Gao, Huijiang; Li, Junya
2015-12-17
Most single nucleotide polymorphisms (SNPs) detected by genome-wide association studies (GWAS) explain only a small fraction of phenotypic variation. Pathway-based GWAS were proposed to improve the proportion of genes for some human complex traits that could be explained by enriching a mass of SNPs within genetic groups. However, few attempts have been made to describe the quantitative traits in domestic animals. In this study, we used a dataset with approximately 7,700,000 SNPs from 807 Simmental cattle and analyzed live weight and longissimus muscle area using a modified pathway-based GWAS method to orthogonalise the highly linked SNPs within each gene using principal component analysis (PCA). As a result, of the 262 biological pathways of cattle collected from the KEGG database, the gamma aminobutyric acid (GABA)ergic synapse pathway and the non-alcoholic fatty liver disease (NAFLD) pathway were significantly associated with the two traits analyzed. The GABAergic synapse pathway was biologically applicable to the traits analyzed because of its roles in feed intake and weight gain. The proposed method had high statistical power and a low false discovery rate, compared to those of the smallest P-value and SNP set enrichment analysis methods.
Imamura, Minako; Takahashi, Atsushi; Yamauchi, Toshimasa; Hara, Kazuo; Yasuda, Kazuki; Grarup, Niels; Zhao, Wei; Wang, Xu; Huerta-Chagoya, Alicia; Hu, Cheng; Moon, Sanghoon; Long, Jirong; Kwak, Soo Heon; Rasheed, Asif; Saxena, Richa; Ma, Ronald C. W.; Okada, Yukinori; Iwata, Minoru; Hosoe, Jun; Shojima, Nobuhiro; Iwasaki, Minaka; Fujita, Hayato; Suzuki, Ken; Danesh, John; Jørgensen, Torben; Jørgensen, Marit E.; Witte, Daniel R.; Brandslund, Ivan; Christensen, Cramer; Hansen, Torben; Mercader, Josep M.; Flannick, Jason; Moreno-Macías, Hortensia; Burtt, Noël P.; Zhang, Rong; Kim, Young Jin; Zheng, Wei; Singh, Jai Rup; Tam, Claudia H. T.; Hirose, Hiroshi; Maegawa, Hiroshi; Ito, Chikako; Kaku, Kohei; Watada, Hirotaka; Tanaka, Yasushi; Tobe, Kazuyuki; Kawamori, Ryuzo; Kubo, Michiaki; Cho, Yoon Shin; Chan, Juliana C. N.; Sanghera, Dharambir; Frossard, Philippe; Park, Kyong Soo; Shu, Xiao-Ou; Kim, Bong-Jo; Florez, Jose C.; Tusié-Luna, Teresa; Jia, Weiping; Tai, E Shyong; Pedersen, Oluf; Saleheen, Danish; Maeda, Shiro; Kadowaki, Takashi
2016-01-01
Genome-wide association studies (GWAS) have identified more than 80 susceptibility loci for type 2 diabetes (T2D), but most of its heritability still remains to be elucidated. In this study, we conducted a meta-analysis of GWAS for T2D in the Japanese population. Combined data from discovery and subsequent validation analyses (23,399 T2D cases and 31,722 controls) identify 7 new loci with genome-wide significance (P<5 × 10−8), rs1116357 near CCDC85A, rs147538848 in FAM60A, rs1575972 near DMRTA1, rs9309245 near ASB3, rs67156297 near ATP8B2, rs7107784 near MIR4686 and rs67839313 near INAFM2. Of these, the association of 4 loci with T2D is replicated in multi-ethnic populations other than Japanese (up to 65,936 T2Ds and 158,030 controls, P<0.007). These results indicate that expansion of single ethnic GWAS is still useful to identify novel susceptibility loci to complex traits not only for ethnicity-specific loci but also for common loci across different ethnicities. PMID:26818947
An efficient empirical Bayes method for genomewide association studies.
Wang, Q; Wei, J; Pan, Y; Xu, S
2016-08-01
The linear mixed model (LMM) is one of the most popular methods for genomewide association studies (GWAS). Numerous forms of LMM have been developed; however, there are two major issues in GWAS that have not been fully addressed before. The two issues are (i) the genomic background noise and (ii) low statistical power after Bonferroni correction. We proposed an empirical Bayes (EB) method by assigning each marker effect a normal prior distribution, resulting in shrinkage estimates of marker effects. We found that such a shrinkage approach can selectively shrink marker effects and reduce the noise level to zero for the majority of non-associated markers. In the meantime, the EB method allows us to use an 'effective number of tests' to perform Bonferroni correction for multiple tests. Simulation studies for both human and pig data showed that the EB method can significantly increase statistical power compared with the widely used exact GWAS methods, such as GEMMA and FaST-LMM-Select. Real data analyses in human breast cancer identified improved detection signals for markers previously known to be associated with breast cancer. We therefore believe that the EB method is a valuable tool for identifying the genetic basis of complex traits. © 2015 Blackwell Verlag GmbH.
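The idea of shrinking marker effects under a normal prior can be shown with a deliberately simplified sketch. It is the textbook normal-normal posterior mean with a method-of-moments prior variance, not the mixed-model estimator of the paper. The function names and numbers are hypothetical.

```python
def estimate_prior_variance(estimates, std_errors):
    """Method-of-moments estimate of the prior variance of marker effects:
    mean squared estimate minus mean sampling variance, floored at zero."""
    n = len(estimates)
    mean_sq = sum(b * b for b in estimates) / n
    mean_noise = sum(s * s for s in std_errors) / n
    return max(0.0, mean_sq - mean_noise)


def shrink(estimate, std_error, prior_var):
    """Posterior mean of a marker effect under a N(0, prior_var) prior
    when the estimate has sampling variance std_error ** 2."""
    weight = prior_var / (prior_var + std_error ** 2)
    return weight * estimate


# Made-up marker effects: noisy estimates are pulled toward zero.
effects = [2.0, -2.0, 0.3, -0.1]
errors = [1.0, 1.0, 1.0, 1.0]
tau2 = estimate_prior_variance(effects, errors)
print([shrink(b, s, tau2) for b, s in zip(effects, errors)])
```

The smaller the estimated prior variance is relative to the sampling variance, the more strongly each estimate is pulled toward zero.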
Single nucleotide variations: Biological impact and theoretical interpretation
Katsonis, Panagiotis; Koire, Amanda; Wilson, Stephen Joseph; Hsu, Teng-Kuei; Lua, Rhonald C; Wilkins, Angela Dawn; Lichtarge, Olivier
2014-01-01
Genome-wide association studies (GWAS) and whole-exome sequencing (WES) generate massive amounts of genomic variant information, and a major challenge is to identify which variations drive disease or contribute to phenotypic traits. Because the majority of known disease-causing mutations are exonic non-synonymous single nucleotide variations (nsSNVs), most studies focus on whether these nsSNVs affect protein function. Computational studies show that the impact of nsSNVs on protein function reflects sequence homology and structural information and predict the impact through statistical methods, machine learning techniques, or models of protein evolution. Here, we review impact prediction methods and discuss their underlying principles, their advantages and limitations, and how they compare to and complement one another. Finally, we present current applications and future directions for these methods in biological research and medical genetics. PMID:25234433
Ghazarian, Armen A.; Simonds, Naoko I.; Bennett, Kelly; Pimentel, Camilla B.; Ellison, Gary L.; Gillanders, Elizabeth M.; Schully, Sheri D.; Mechanic, Leah E.
2013-01-01
Background: Genetic and environmental factors jointly influence cancer risk. The National Institutes of Health (NIH) has made the study of gene-environment (GxE) interactions a research priority since the year 2000. Methods: To assess the current status of GxE research in cancer, we analyzed the extramural grant portfolio of the National Cancer Institute (NCI) from Fiscal Years 2007 to 2009. Publications attributed to selected grants were also evaluated. Results: From the 1,106 research grants identified in our portfolio analysis, a random sample of 450 grants (40%) was selected for data abstraction; of these, 147 (33%) were considered relevant. The most common cancer type was breast (20%, n=29), followed by lymphoproliferative (10%, n=14), colorectal (9%, n=13), melanoma/other skin (9%, n=13), and lung/upper aero-digestive tract (8%, n=12) cancers. The majority of grants were studies of candidate genes (68%, n=100) compared to genome-wide association studies (GWAS) (8%, n=12). Approximately one third studied environmental exposures categorized as energy balance (37%, n=54) or drugs/treatment (29%, n=43). From the 147 relevant grants, 108 publications classified as GxE or pharmacogenomic were identified. These publications were linked to 37 of the 147 grant applications (25%). Conclusion: The findings from our portfolio analysis suggest that GxE studies are concentrated in specific areas. There is room for investments in other aspects of GxE research, including, but not limited to, developing alternative approaches to exposure assessment, broadening the spectrum of cancer types investigated, and performing GxE within GWAS. Impact: This portfolio analysis provides a cross-sectional review of NCI support for GxE research in cancer. PMID:23462918
Ghazarian, Armen A; Simonds, Naoko I; Bennett, Kelly; Pimentel, Camilla B; Ellison, Gary L; Gillanders, Elizabeth M; Schully, Sheri D; Mechanic, Leah E
2013-04-01
Genetic and environmental factors jointly influence cancer risk. The NIH has made the study of gene-environment (GxE) interactions a research priority since the year 2000. To assess the current status of GxE research in cancer, we analyzed the extramural grant portfolio of the National Cancer Institute (NCI) from Fiscal Years 2007 to 2009. Publications attributed to selected grants were also evaluated. From the 1,106 research grants identified in our portfolio analysis, a random sample of 450 grants (40%) was selected for data abstraction; of these, 147 (33%) were considered relevant. The most common cancer type was breast (20%, n = 29), followed by lymphoproliferative (10%, n = 14), colorectal (9%, n = 13), melanoma/other skin (9%, n = 13), and lung/upper aerodigestive tract (8%, n = 12) cancers. The majority of grants were studies of candidate genes (68%, n = 100) compared with genome-wide association studies (GWAS) (8%, n = 12). Approximately one-third studied environmental exposures categorized as energy balance (37%, n = 54) or drugs/treatment (29%, n = 43). From the 147 relevant grants, 108 publications classified as GxE or pharmacogenomic were identified. These publications were linked to 37 of the 147 grant applications (25%). The findings from our portfolio analysis suggest that GxE studies are concentrated in specific areas. There is room for investments in other aspects of GxE research, including, but not limited to developing alternative approaches to exposure assessment, broadening the spectrum of cancer types investigated, and conducting GxE within GWAS. This portfolio analysis provides a cross-sectional review of NCI support for GxE research in cancer.
Fang, Lingzhao; Sahana, Goutam; Su, Guosheng; Yu, Ying; Zhang, Shengli; Lund, Mogens Sandø; Sørensen, Peter
2017-01-01
Connecting genome-wide association study (GWAS) to biological mechanisms underlying complex traits is a major challenge. Mastitis resistance and milk production are complex traits of economic importance in the dairy sector and are associated with intra-mammary infection (IMI). Here, we integrated IMI-relevant RNA-Seq data from Holstein cattle and sequence-based GWAS data from three dairy cattle breeds (i.e., Holstein, Nordic red cattle, and Jersey) to explore the genetic basis of mastitis resistance and milk production using post-GWAS analyses and a genomic feature linear mixed model. At 24 h post-IMI, genes responsive to IMI in the mammary gland were preferentially enriched for genetic variants associated with mastitis resistance rather than milk production. Response genes in the liver were mainly enriched for variants associated with mastitis resistance at an early time point (3 h) post-IMI, whereas responsive genes at later stages were enriched for associated variants with milk production. The up- and down-regulated genes were enriched for associated variants with mastitis resistance and milk production, respectively. The patterns were consistent across breeds, indicating that different breeds shared similarities in the genetic basis of these traits. Our approaches provide a framework for integrating multiple layers of data to understand the genetic architecture underlying complex traits. PMID:28358110
1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function
Gorski, Mathias; van der Most, Peter J.; Teumer, Alexander; Chu, Audrey Y.; Li, Man; Mijatovic, Vladan; Nolte, Ilja M.; Cocca, Massimiliano; Taliun, Daniel; Gomez, Felicia; Li, Yong; Tayo, Bamidele; Tin, Adrienne; Feitosa, Mary F.; Aspelund, Thor; Attia, John; Biffar, Reiner; Bochud, Murielle; Boerwinkle, Eric; Borecki, Ingrid; Bottinger, Erwin P.; Chen, Ming-Huei; Chouraki, Vincent; Ciullo, Marina; Coresh, Josef; Cornelis, Marilyn C.; Curhan, Gary C.; d’Adamo, Adamo Pio; Dehghan, Abbas; Dengler, Laura; Ding, Jingzhong; Eiriksdottir, Gudny; Endlich, Karlhans; Enroth, Stefan; Esko, Tõnu; Franco, Oscar H.; Gasparini, Paolo; Gieger, Christian; Girotto, Giorgia; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Hancock, Stephen J.; Harris, Tamara B.; Helmer, Catherine; Höllerer, Simon; Hofer, Edith; Hofman, Albert; Holliday, Elizabeth G.; Homuth, Georg; Hu, Frank B.; Huth, Cornelia; Hutri-Kähönen, Nina; Hwang, Shih-Jen; Imboden, Medea; Johansson, Åsa; Kähönen, Mika; König, Wolfgang; Kramer, Holly; Krämer, Bernhard K.; Kumar, Ashish; Kutalik, Zoltan; Lambert, Jean-Charles; Launer, Lenore J.; Lehtimäki, Terho; de Borst, Martin; Navis, Gerjan; Swertz, Morris; Liu, Yongmei; Lohman, Kurt; Loos, Ruth J. F.; Lu, Yingchang; Lyytikäinen, Leo-Pekka; McEvoy, Mark A.; Meisinger, Christa; Meitinger, Thomas; Metspalu, Andres; Metzger, Marie; Mihailov, Evelin; Mitchell, Paul; Nauck, Matthias; Oldehinkel, Albertine J.; Olden, Matthias; WJH Penninx, Brenda; Pistis, Giorgio; Pramstaller, Peter P.; Probst-Hensch, Nicole; Raitakari, Olli T.; Rettig, Rainer; Ridker, Paul M.; Rivadeneira, Fernando; Robino, Antonietta; Rosas, Sylvia E.; Ruderfer, Douglas; Ruggiero, Daniela; Saba, Yasaman; Sala, Cinzia; Schmidt, Helena; Schmidt, Reinhold; Scott, Rodney J.; Sedaghat, Sanaz; Smith, Albert V.; Sorice, Rossella; Stengel, Benedicte; Stracke, Sylvia; Strauch, Konstantin; Toniolo, Daniela; Uitterlinden, Andre G.; Ulivi, Sheila; Viikari, Jorma S.; Völker, Uwe; Vollenweider, Peter; Völzke, Henry; Vuckovic, Dragana; Waldenberger, Melanie; Jin Wang, Jie; Yang, Qiong; Chasman, Daniel I.; Tromp, Gerard; Snieder, Harold; Heid, Iris M.; Fox, Caroline S.; Köttgen, Anna; Pattaro, Cristian; Böger, Carsten A.; Fuchsberger, Christian
2017-01-01
HapMap imputed genome-wide association studies (GWAS) have revealed >50 loci at which common variants with minor allele frequency >5% are associated with kidney function. GWAS using more complete reference sets for imputation, such as those from The 1000 Genomes project, promise to identify novel loci that have been missed by previous efforts. To investigate the value of such a more complete variant catalog, we conducted a GWAS meta-analysis of kidney function based on the estimated glomerular filtration rate (eGFR) in 110,517 European ancestry participants using 1000 Genomes imputed data. We identified 10 novel loci with p-value < 5 × 10^-8 previously missed by HapMap-based GWAS. Six of these loci (HOXD8, ARL15, PIK3R1, EYA4, ASTN2, and EPB41L3) are tagged by common SNPs unique to the 1000 Genomes reference panel. Using pathway analysis, we identified 39 significant (FDR < 0.05) genes and 127 significantly (FDR < 0.05) enriched gene sets, which were missed by our previous analyses. Among those, the 10 identified novel genes are part of pathways of kidney development, carbohydrate metabolism, cardiac septum development and glucose metabolism. These results highlight the utility of re-imputing from denser reference panels, until whole-genome sequencing becomes feasible in large samples. PMID:28452372
1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function.
Gorski, Mathias; van der Most, Peter J; Teumer, Alexander; Chu, Audrey Y; Li, Man; Mijatovic, Vladan; Nolte, Ilja M; Cocca, Massimiliano; Taliun, Daniel; Gomez, Felicia; Li, Yong; Tayo, Bamidele; Tin, Adrienne; Feitosa, Mary F; Aspelund, Thor; Attia, John; Biffar, Reiner; Bochud, Murielle; Boerwinkle, Eric; Borecki, Ingrid; Bottinger, Erwin P; Chen, Ming-Huei; Chouraki, Vincent; Ciullo, Marina; Coresh, Josef; Cornelis, Marilyn C; Curhan, Gary C; d'Adamo, Adamo Pio; Dehghan, Abbas; Dengler, Laura; Ding, Jingzhong; Eiriksdottir, Gudny; Endlich, Karlhans; Enroth, Stefan; Esko, Tõnu; Franco, Oscar H; Gasparini, Paolo; Gieger, Christian; Girotto, Giorgia; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Hancock, Stephen J; Harris, Tamara B; Helmer, Catherine; Höllerer, Simon; Hofer, Edith; Hofman, Albert; Holliday, Elizabeth G; Homuth, Georg; Hu, Frank B; Huth, Cornelia; Hutri-Kähönen, Nina; Hwang, Shih-Jen; Imboden, Medea; Johansson, Åsa; Kähönen, Mika; König, Wolfgang; Kramer, Holly; Krämer, Bernhard K; Kumar, Ashish; Kutalik, Zoltan; Lambert, Jean-Charles; Launer, Lenore J; Lehtimäki, Terho; de Borst, Martin; Navis, Gerjan; Swertz, Morris; Liu, Yongmei; Lohman, Kurt; Loos, Ruth J F; Lu, Yingchang; Lyytikäinen, Leo-Pekka; McEvoy, Mark A; Meisinger, Christa; Meitinger, Thomas; Metspalu, Andres; Metzger, Marie; Mihailov, Evelin; Mitchell, Paul; Nauck, Matthias; Oldehinkel, Albertine J; Olden, Matthias; Wjh Penninx, Brenda; Pistis, Giorgio; Pramstaller, Peter P; Probst-Hensch, Nicole; Raitakari, Olli T; Rettig, Rainer; Ridker, Paul M; Rivadeneira, Fernando; Robino, Antonietta; Rosas, Sylvia E; Ruderfer, Douglas; Ruggiero, Daniela; Saba, Yasaman; Sala, Cinzia; Schmidt, Helena; Schmidt, Reinhold; Scott, Rodney J; Sedaghat, Sanaz; Smith, Albert V; Sorice, Rossella; Stengel, Benedicte; Stracke, Sylvia; Strauch, Konstantin; Toniolo, Daniela; Uitterlinden, Andre G; Ulivi, Sheila; Viikari, Jorma S; Völker, Uwe; Vollenweider, Peter; Völzke, Henry; Vuckovic, Dragana; Waldenberger, Melanie; Jin Wang, Jie; Yang, Qiong; Chasman, Daniel I; Tromp, Gerard; Snieder, Harold; Heid, Iris M; Fox, Caroline S; Köttgen, Anna; Pattaro, Cristian; Böger, Carsten A; Fuchsberger, Christian
2017-04-28
HapMap imputed genome-wide association studies (GWAS) have revealed >50 loci at which common variants with minor allele frequency >5% are associated with kidney function. GWAS using more complete reference sets for imputation, such as those from The 1000 Genomes project, promise to identify novel loci that have been missed by previous efforts. To investigate the value of such a more complete variant catalog, we conducted a GWAS meta-analysis of kidney function based on the estimated glomerular filtration rate (eGFR) in 110,517 European ancestry participants using 1000 Genomes imputed data. We identified 10 novel loci with p-value < 5 × 10^-8 previously missed by HapMap-based GWAS. Six of these loci (HOXD8, ARL15, PIK3R1, EYA4, ASTN2, and EPB41L3) are tagged by common SNPs unique to the 1000 Genomes reference panel. Using pathway analysis, we identified 39 significant (FDR < 0.05) genes and 127 significantly (FDR < 0.05) enriched gene sets, which were missed by our previous analyses. Among those, the 10 identified novel genes are part of pathways of kidney development, carbohydrate metabolism, cardiac septum development and glucose metabolism. These results highlight the utility of re-imputing from denser reference panels, until whole-genome sequencing becomes feasible in large samples.
Cannon, Maren E.; Duan, Qing; Wu, Ying; Zeynalzadeh, Monica; Xu, Zheng; Kangas, Antti J.; Soininen, Pasi; Ala-Korpela, Mika; Civelek, Mete; Lusis, Aldons J.; Kuusisto, Johanna; Collins, Francis S.; Boehnke, Michael; Tang, Hua; Laakso, Markku; Li, Yun; Mohlke, Karen L.
2017-01-01
Recent genome-wide association studies (GWAS) have identified variants associated with high-density lipoprotein cholesterol (HDL-C) located in or near the ANGPTL8 gene. Given the extensive sharing of GWAS loci across populations, we hypothesized that at least one shared variant at this locus affects HDL-C. The HDL-C–associated variants are coincident with expression quantitative trait loci for ANGPTL8 and DOCK6 in subcutaneous adipose tissue; however, only ANGPTL8 expression levels are associated with HDL-C levels. We identified a 400-bp promoter region of ANGPTL8 and enhancer regions within 5 kb that contribute to regulating expression in liver and adipose. To identify variants functionally responsible for the HDL-C association, we performed fine-mapping analyses and selected 13 candidate variants that overlap putative regulatory regions to test for allelic differences in regulatory function. Of these variants, rs12463177-G increased transcriptional activity (1.5-fold, P = 0.004) and showed differential protein binding. Six additional variants (rs17699089, rs200788077, rs56322906, rs3760782, rs737337, and rs3745683) showed evidence of allelic differences in transcriptional activity and/or protein binding. Taken together, these data suggest a regulatory mechanism at the ANGPTL8 HDL-C GWAS locus involving tissue-selective expression and at least one functional variant. PMID:28754724
Bauchet, Guillaume; Grenier, Stéphane; Samson, Nicolas; Bonnet, Julien; Grivet, Laurent; Causse, Mathilde
2017-05-01
A panel of 300 tomato accessions including breeding materials was built and characterized with >11,000 SNPs. A population structure in six subgroups was identified. Strong heterogeneity in linkage disequilibrium and recombination landscape among groups and chromosomes was shown. GWAS identified several associations for fruit weight, earliness and plant growth. Genome-wide association studies (GWAS) have become a method of choice in quantitative trait dissection. First limited to highly polymorphic and outcrossing species, GWAS is now applied in horticultural crops, notably in tomato. Until now, GWAS in tomato has been performed on panels of heirloom and wild accessions. Using modern breeding materials would be of direct interest for breeding purposes. To implement GWAS on a large panel of 300 tomato accessions including 168 breeding lines, this study assessed the genetic diversity and linkage disequilibrium decay, revealed the population structure and performed a GWA experiment. Genetic diversity and population structure analyses were based on molecular markers (>11,000 SNPs) covering the whole genome. Six genetic subgroups were revealed and associated with traits of agronomical interest, such as fruit weight and disease resistance. Estimates of linkage disequilibrium highlighted the heterogeneity of its decay among genetic subgroups. Haplotype definition allowed a fine characterization of the groups and their recombination landscape, revealing the patterns of admixture along the genome. Selection footprints showed results in congruence with introgressions. Taken together, all these elements refined our knowledge of the genetic material included in this panel and allowed the identification of several associations for fruit weight, plant growth and earliness, deciphering the genetic architecture of these complex traits and identifying several new loci useful for tomato breeding.
Contrasting results from GWAS and QTL mapping on wing length in great reed warblers.
Hansson, Bengt; Sigeman, Hanna; Stervander, Martin; Tarka, Maja; Ponnikas, Suvi; Strandh, Maria; Westerdahl, Helena; Hasselquist, Dennis
2018-04-15
A major goal in evolutionary biology is to understand the genetic basis of adaptive traits. In migratory birds, wing morphology is such a trait. Our previous work on the great reed warbler (Acrocephalus arundinaceus) shows that wing length is highly heritable and under sexually antagonistic selection. Moreover, a quantitative trait locus (QTL) mapping analysis detected a pronounced QTL for wing length on chromosome 2, suggesting that wing morphology is partly controlled by genes with large effects. Here, we re-evaluate the genetic basis of wing length in great reed warblers using a genome-wide association study (GWAS) approach based on restriction site-associated DNA sequencing (RADseq) data. We use GWAS models that account for relatedness between individuals and include covariates (sex, age and tarsus length). The resulting association landscape was flat with no peaks on chromosome 2 or elsewhere, which is in line with expectations for polygenic traits. Analysis of the distribution of p-values did not reveal biases, and the inflation factor was low. Effect sizes were, however, not uniformly distributed on some chromosomes, and the Z chromosome had weaker associations than autosomes. The level of linkage disequilibrium (LD) in the population decayed to background levels within c. 1 kbp. There could be several reasons why our QTL study and GWAS gave contrasting results, including differences in how associations are modelled (cosegregation in pedigree vs. LD associations), how covariates are accounted for in the models, type of marker used (multi- vs. biallelic), difference in power or a combination of these. Our study highlights that the genetic architecture even of highly heritable traits is difficult to characterize in wild populations. © 2018 John Wiley & Sons Ltd.
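The inflation factor mentioned in this abstract is commonly computed as the median observed 1-df chi-square statistic divided by its expected median under the null. The sketch below shows that common definition using only the Python standard library; it is an illustration on synthetic p-values, not the authors' pipeline, and the function name is a hypothetical choice.

```python
from statistics import NormalDist, median

_STD_NORMAL = NormalDist()
# Median of a chi-square distribution with 1 degree of freedom (about 0.455).
_CHI2_1DF_MEDIAN = _STD_NORMAL.inv_cdf(0.75) ** 2

def genomic_inflation(p_values):
    """Genomic inflation factor (lambda) from two-sided, 1-df test p-values in (0, 1)."""
    chi2 = [_STD_NORMAL.inv_cdf(1.0 - p / 2.0) ** 2 for p in p_values]
    return median(chi2) / _CHI2_1DF_MEDIAN

# Evenly spaced p-values mimic a well-calibrated null distribution (synthetic data).
n = 1000
null_p = [(i + 0.5) / n for i in range(n)]
print(round(genomic_inflation(null_p), 3))
```

Values close to 1 suggest little systematic inflation of test statistics; values well above 1 usually prompt a check for unmodelled relatedness or population structure.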
Qian, David C.; Byun, Jinyoung; Han, Younghun; Greene, Casey S.; Field, John K.; Hung, Rayjean J.; Brhane, Yonathan; Mclaughlin, John R.; Fehringer, Gordon; Landi, Maria Teresa; Rosenberger, Albert; Bickeböller, Heike; Malhotra, Jyoti; Risch, Angela; Heinrich, Joachim; Hunter, David J.; Henderson, Brian E.; Haiman, Christopher A.; Schumacher, Fredrick R.; Eeles, Rosalind A.; Easton, Douglas F.; Seminara, Daniela; Amos, Christopher I.
2015-01-01
Results from genome-wide association studies (GWAS) have indicated that strong single-gene effects are the exception, not the rule, for most diseases. We assessed the joint effects of germline genetic variations through a pathway-based approach that considers the tissue-specific contexts of GWAS findings. From GWAS meta-analyses of lung cancer (12 160 cases/16 838 controls), breast cancer (15 748 cases/18 084 controls) and prostate cancer (14 160 cases/12 724 controls) in individuals of European ancestry, we determined the tissue-specific interaction networks of proteins expressed from genes that are likely to be affected by disease-associated variants. Reactome pathways exhibiting enrichment of proteins from each network were compared across the cancers. Our results show that pathways associated with all three cancers tend to be broad cellular processes required for growth and survival. Significant examples include the nerve growth factor (P = 7.86 × 10^-33), epidermal growth factor (P = 1.18 × 10^-31) and fibroblast growth factor (P = 2.47 × 10^-31) signaling pathways. However, within these shared pathways, the genes that influence risk largely differ by cancer. Pathways found to be unique for a single cancer focus on more specific cellular functions, such as interleukin signaling in lung cancer (P = 1.69 × 10^-15), apoptosis initiation by Bad in breast cancer (P = 3.14 × 10^-9) and cellular responses to hypoxia in prostate cancer (P = 2.14 × 10^-9). We present the largest comparative cross-cancer pathway analysis of GWAS to date. Our approach can also be applied to the study of inherited mechanisms underlying risk across multiple diseases in general. PMID:26483192
Eleftherohorinou, Hariklia; Hoggart, Clive J; Wright, Victoria J; Levin, Michael; Coin, Lachlan J M
2011-09-01
Rheumatoid arthritis (RA) is the commonest chronic, systemic, inflammatory disorder, affecting ∼1% of the world population. It has a strong genetic component and a growing number of associated genes have been discovered in genome-wide association studies (GWAS), which nevertheless only account for 23% of the total genetic risk. We aimed to identify additional susceptibility loci through the analysis of GWAS in the context of biological function. We bridge the gap between pathway and gene-oriented analyses of GWAS by introducing a pathway-driven gene stability-selection methodology that identifies potential causal genes in the top-associated disease pathways that may be driving the pathway association signals. We analysed the WTCCC and the NARAC studies of ∼5000 and ∼2000 subjects, respectively. We examined 700 pathways comprising ∼8000 genes. Ranking pathways by significance revealed that the NARAC top-ranked ∼6% lay within the top 10% of WTCCC. Gene selection on those pathways identified 58 genes in WTCCC and 61 in NARAC; 21 of those were common (P(overlap) < 10^-21), of which 16 were novel discoveries. Among the identified genes, we validated 10 known RA associations in WTCCC and 13 in NARAC, not discovered using single-SNP approaches on the same data. Gene ontology functional enrichment analysis on the identified genes showed significant over-representation of signalling activity (P < 10^-29) in both studies. Our findings suggest a novel model of RA genetic predisposition, which involves cell-membrane receptors and genes in second messenger signalling systems, in addition to genes that regulate immune responses, which have been the focus of interest previously.
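To give a feel for why an overlap of 21 genes between lists of 58 and 61 is so unlikely by chance, the sketch below computes a hypergeometric tail probability. The abstract does not state how P(overlap) was calculated, so both the choice of test and the gene universe of 8000 (taken from the "∼8000 genes" figure) are assumptions made here for illustration.

```python
from math import comb

def overlap_p_value(universe: int, set_a: int, set_b: int, shared: int) -> float:
    """P(overlap >= shared) when set_b genes are drawn at random from the universe."""
    upper = min(set_a, set_b)
    # Count the draws of set_b genes that hit at least `shared` genes of set_a.
    favourable = sum(
        comb(set_a, k) * comb(universe - set_a, set_b - k)
        for k in range(shared, upper + 1)
    )
    return favourable / comb(universe, set_b)

# Gene counts from the abstract; the universe size is an assumption.
p = overlap_p_value(universe=8000, set_a=58, set_b=61, shared=21)
print(f"P(overlap >= 21) = {p:.1e}")
```

With an expected chance overlap of well under one gene (58 × 61 / 8000), the tail probability under these assumptions is far below the bound quoted in the abstract.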
Chen, D T; Jiang, X; Akula, N; Shugart, Y Y; Wendland, J R; Steele, C J M; Kassem, L; Park, J-H; Chatterjee, N; Jamain, S; Cheng, A; Leboyer, M; Muglia, P; Schulze, T G; Cichon, S; Nöthen, M M; Rietschel, M; McMahon, F J; Farmer, A; McGuffin, P; Craig, I; Lewis, C; Hosang, G; Cohen-Woods, S; Vincent, J B; Kennedy, J L; Strauss, J
2013-02-01
Meta-analyses of bipolar disorder (BD) genome-wide association studies (GWAS) have identified several genome-wide significant signals in European-ancestry samples, but so far account for little of the inherited risk. We performed a meta-analysis of ∼750,000 high-quality genetic markers on a combined sample of ∼14,000 subjects of European and Asian-ancestry (phase I). The most significant findings were further tested in an extended sample of ∼17,700 cases and controls (phase II). The results suggest novel association findings near the genes TRANK1 (LBA1), LMAN2L and PTGFR. In phase I, the most significant single nucleotide polymorphism (SNP), rs9834970 near TRANK1, was significant at the P = 2.4 × 10^-11 level, with no heterogeneity. Supportive evidence for prior association findings near ANK3 and a locus on chromosome 3p21.1 was also observed. The phase II results were similar, although the heterogeneity test became significant for several SNPs. On the basis of these results and other established risk loci, we used the method developed by Park et al. to estimate the number, and the effect size distribution, of BD risk loci that could still be found by GWAS methods. We estimate that >63,000 case-control samples would be needed to identify the ∼105 BD risk loci discoverable by GWAS, and that these will together explain <6% of the inherited risk. These results support previous GWAS findings and identify three new candidate genes for BD. Further studies are needed to replicate these findings and may potentially lead to identification of functional variants. Sample size will remain a limiting factor in the discovery of common alleles associated with BD.
Kulminski, Alexander M.; Culminskaya, Irina; Arbeev, Konstantin G.; Arbeeva, Liubov; Ukraintseva, Svetlana V.; Stallard, Eric; Wu, Deqing; Yashin, Anatoliy I.
2015-01-01
Insights into genetic origin of diseases and related traits could substantially impact strategies for improving human health. The results of genome-wide association studies (GWAS) are often positioned as discoveries of unconditional risk alleles of complex health traits. We re-analyzed the associations of single nucleotide polymorphisms (SNPs) associated with total cholesterol (TC) in a large-scale GWAS meta-analysis. We focused on three generations of genotyped participants of the Framingham Heart Study (FHS). We show that the effects of all ten directly-genotyped SNPs were clustered in different FHS generations and/or birth cohorts in a sex-specific or sex-unspecific manner. The sample size and procedure-therapeutic issues play, at most, a minor role in this clustering. An important result was clustering of significant associations with the strongest effects in the youngest, or 3rd Generation, cohort. These results imply that an assumption of unconditional connections of these SNPs with TC is generally implausible and that a demographic perspective can substantially improve GWAS efficiency. The analyses of genetic effects in age-matched samples suggest a role of environmental and age-related mechanisms in the associations of different SNPs with TC. Analysis of the literature supports systemic roles for genes for these SNPs beyond those related to lipid metabolism. Our analyses reveal strong antagonistic effects of rs2479409 (the PCSK9 gene) that cautions strategies aimed at targeting this gene in the next generation of lipid drugs. Our results suggest that standard GWAS strategies need to be advanced in order to appropriately address the problem of genetic susceptibility to complex traits that is imperative for translation to health care. PMID:26295473
Complex Disease Endotypes and Implications for GWAS and Exposomics
Presentation Type: Symposia. Symposium Title: Human Exposome Discovery and Disease Investigation. Abstract Title: Complex Disease Endotypes and Implications for GWAS and Exposomics. Authors: Stephen W. Edwards, David M. Reif, Elaine Cohen Hubaf, ClarLynda Williams-DeVa...
Hu, Ting; Darabos, Christian; Cricco, Maria E.; Kong, Emily; Moore, Jason H.
2014-01-01
The large volume of GWAS data poses great computational challenges for analyzing genetic interactions associated with common human diseases. We propose a computational framework for characterizing epistatic interactions among large sets of genetic attributes in GWAS data. We build the human phenotype network (HPN) and focus it around a disease of interest. In this study, we use the GLAUGEN glaucoma GWAS dataset and apply the HPN as a biological knowledge-based filter to prioritize genetic variants. Then, we use the statistical epistasis network (SEN) to identify a significant connected network of pairwise epistatic interactions among the prioritized SNPs. These clearly highlight the complex genetic basis of glaucoma. Furthermore, we identify key SNPs by quantifying structural network characteristics. Through functional annotation of these key SNPs using Biofilter, a software tool that accesses multiple publicly available human genetic data sources, we find supporting biomedical evidence linking glaucoma to an array of genetic diseases, proving our concept. We conclude by suggesting hypotheses for a better understanding of the disease. PMID:25592582
Hicks, Chindo; Kumar, Ranjit; Pannuti, Antonio; Miele, Lucio
2012-01-01
Variable response and resistance to tamoxifen treatment in breast cancer patients remains a major clinical problem. To determine whether genes and biological pathways containing SNPs associated with risk for breast cancer are dysregulated in response to tamoxifen treatment, we performed analysis combining information from 43 genome-wide association studies with gene expression data from 298 ER(+) breast cancer patients treated with tamoxifen and 125 ER(+) controls. We identified 95 genes which distinguished tamoxifen treated patients from controls. Additionally, we identified 54 genes which stratified tamoxifen treated patients into two distinct groups. We identified biological pathways containing SNPs associated with risk for breast cancer, which were dysregulated in response to tamoxifen treatment. Key pathways identified included the apoptosis, P53, NFkB, DNA repair and cell cycle pathways. Combining GWAS with transcription profiling provides a unified approach for associating GWAS findings with response to drug treatment and identification of potential drug targets.
Assessment of Parkinson’s disease risk loci in Greece
Kara, Eleanna; Xiromerisiou, Georgia; Spanaki, Cleanthe; Bozi, Maria; Koutsis, Georgios; Panas, Marios; Dardiotis, Efthimios; Ralli, Styliani; Bras, Jose; Letson, Christopher; Edsall, Connor; Pliner, Hannah; Arepali, Sampath; Kalinderi, Kallirhoe; Fidani, Liana; Bostanjopoulou, Sevasti; Keller, Margaux F; Wood, Nicholas W; Hardy, John; Houlden, Henry; Stefanis, Leonidas; Plaitakis, Andreas; Hernandez, Dena; Hadjigeorgiou, Georgios M; Nalls, Mike A; Singleton, Andrew B
2013-01-01
Genome wide association studies (GWAS) have been shown to be a powerful approach to identify risk loci for neurodegenerative diseases. Recent GWAS in Parkinson’s disease (PD) have been successful in identifying numerous risk variants pointing to novel pathways potentially implicated in the pathogenesis of PD. Contributing to these GWAS efforts, we performed genotyping of previously identified risk alleles in PD patients and controls from Greece. We showed that previously published risk profiles for Northern European and American populations are also applicable to the Greek population. In addition, while we were largely underpowered to detect individual associations we replicated 5 of 32 previously published risk variants with nominal p-values <0.05. Genome-wide complex trait analysis (GCTA) revealed that known risk loci explain disease risk in 1.27% of Greek PD patients. Collectively, these results indicate that there is likely a substantial genetic component to PD in Greece similarly to other worldwide populations that remains to be discovered. PMID:24080174
Gene-Gene and Gene-Environment Interactions in Ulcerative Colitis
Wang, Ming-Hsi; Fiocchi, Claudio; Zhu, Xiaofeng; Ripke, Stephan; Kamboh, M. Ilyas; Rebert, Nancy; Duerr, Richard H.; Achkar, Jean-Paul
2014-01-01
Genome-wide association studies (GWAS) have identified at least 133 ulcerative colitis (UC) associated loci. The role of genetic factors in clinical practice is not clearly defined. The relevance of genetic variants to disease pathogenesis is still uncertain because gene-gene and gene-environment interactions have not been characterized. We examined the predictive value of combining the 133 UC risk loci with genetic interactions in an ongoing inflammatory bowel disease (IBD) GWAS. The Wellcome Trust Case-Control Consortium (WTCCC) IBD GWAS was used as a replication cohort. We applied logic regression (LR), a novel adaptive regression methodology, to search for high order interactions. Exploratory genotype correlations with UC sub-phenotypes (extent of disease, need of surgery, age of onset, extra-intestinal manifestations and primary sclerosing cholangitis (PSC)) were conducted. The combination of 133 UC loci yielded good UC risk predictability (area under the curve [AUC] of 0.86). A higher cumulative allele score predicted higher UC risk. Through LR, several lines of evidence for genetic interactions were identified and successfully replicated in the WTCCC cohort. The genetic interactions combined with the gene-smoking interaction significantly improved predictability in the model (AUC, from 0.86 to 0.89, P=3.26E-05). Explained UC variance increased from 37% to 42% after adding the interaction terms. A within-case analysis found a suggestive genetic association with PSC. Our study demonstrates that the LR methodology allows the identification and replication of high order genetic interactions in UC GWAS datasets. UC risk can be predicted by the 133 loci, and prediction is improved by adding gene-gene and gene-environment interactions. PMID:24241240
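The AUC figures in this abstract can be read as the probability that a randomly chosen case has a higher risk score than a randomly chosen control. The sketch below computes that quantity directly from pairwise comparisons; the scores are hypothetical allele counts invented for illustration, and the abstract does not describe how the study itself computed AUC.

```python
def auc(case_scores, control_scores):
    """Probability that a random case outscores a random control (ties count 0.5)."""
    wins = 0.0
    for c in case_scores:
        for k in control_scores:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))

# Hypothetical cumulative risk-allele counts per person (not data from the study).
cases = [3, 4, 5]
controls = [1, 2, 4]
print(round(auc(cases, controls), 3))
```

An AUC of 0.5 corresponds to a score that does not separate cases from controls, and 1.0 to perfect separation.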
Sonah, Humira; O'Donoughue, Louise; Cober, Elroy; Rajcan, Istvan; Belzile, François
2015-02-01
Soya bean is a major source of edible oil and protein for human consumption as well as animal feed. Understanding the genetic basis of different traits in soya bean will provide important insights for improving breeding strategies for this crop. A genome-wide association study (GWAS) was conducted to accelerate molecular breeding for the improvement of agronomic traits in soya bean. A genotyping-by-sequencing (GBS) approach was used to provide dense genome-wide marker coverage (>47,000 SNPs) for a panel of 304 short-season soya bean lines. A subset of 139 lines, representative of the diversity among these, was characterized phenotypically for eight traits under six environments (3 sites × 2 years). Marker coverage proved sufficient to ensure highly significant associations between the genes known to control simple traits (flower, hilum and pubescence colour) and flanking SNPs. Between one and eight genomic loci associated with more complex traits (maturity, plant height, seed weight, seed oil and protein) were also identified. Importantly, most of these GWAS loci were located within genomic regions identified by previously reported quantitative trait locus (QTL) for these traits. In some cases, the reported QTLs were also successfully validated by additional QTL mapping in a biparental population. This study demonstrates that integrating GBS and GWAS can be used as a powerful complementary approach to classical biparental mapping for dissecting complex traits in soya bean. © 2014 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Abdulkadir, Mohamed; Londono, Douglas; Gordon, Derek; Fernandez, Thomas V; Brown, Lawrence W; Cheon, Keun-Ah; Coffey, Barbara J; Elzerman, Lonneke; Fremer, Carolin; Fründt, Odette; Garcia-Delgar, Blanca; Gilbert, Donald L; Grice, Dorothy E; Hedderly, Tammy; Heyman, Isobel; Hong, Hyun Ju; Huyser, Chaim; Ibanez-Gomez, Laura; Jakubovski, Ewgeni; Kim, Young Key; Kim, Young Shin; Koh, Yun-Joo; Kook, Sodahm; Kuperman, Samuel; Leventhal, Bennett; Ludolph, Andrea G; Madruga-Garrido, Marcos; Maras, Athanasios; Mir, Pablo; Morer, Astrid; Müller-Vahl, Kirsten; Münchau, Alexander; Murphy, Tara L; Plessen, Kerstin J; Roessner, Veit; Shin, Eun-Young; Song, Dong-Ho; Song, Jungeun; Tübing, Jennifer; van den Ban, Els; Visscher, Frank; Wanderer, Sina; Woods, Martin; Zinner, Samuel H; King, Robert A; Tischfield, Jay A; Heiman, Gary A; Hoekstra, Pieter J; Dietrich, Andrea
2018-04-01
Genetic studies in Tourette syndrome (TS) are characterized by scattered and poorly replicated findings. We aimed to replicate findings from candidate gene and genome-wide association studies (GWAS). Our cohort included 465 probands with chronic tic disorder (93% TS) and both parents from 412 families (some probands were siblings). We assessed 75 single nucleotide polymorphisms (SNPs) in 465 parent-child trios; 117 additional SNPs in 211 trios; and 4 additional SNPs in 254 trios. We performed SNP and gene-based transmission disequilibrium tests and compared nominally significant SNP results with those from a large independent case-control cohort. After quality control 71 SNPs were available in 371 trios; 112 SNPs in 179 trios; and 3 SNPs in 192 trios. 17 were candidate SNPs implicated in TS and 2 were implicated in obsessive-compulsive disorder (OCD) or autism spectrum disorder (ASD); 142 were tagging SNPs from eight monoamine neurotransmitter-related genes (including dopamine and serotonin); 10 were top SNPs from TS GWAS; and 13 top SNPs from attention-deficit/hyperactivity disorder, OCD, or ASD GWAS. None of the SNPs or genes reached significance after adjustment for multiple testing. We observed nominal significance for the candidate SNPs rs3744161 (TBCD) and rs4565946 (TPH2) and for five tagging SNPs; none of these showed significance in the independent cohort. Also, SLC1A1 in our gene-based analysis and two TS GWAS SNPs showed nominal significance, rs11603305 (intergenic) and rs621942 (PICALM). We found no convincing support for previously implicated genetic polymorphisms. Targeted re-sequencing should fully appreciate the relevance of candidate genes.
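The transmission disequilibrium test used in the abstract above can be illustrated with a minimal sketch of the classic McNemar-type statistic, which compares how often heterozygous parents transmit versus do not transmit a given allele to affected offspring. The function names and counts are hypothetical; the study's actual software and gene-based extension are not described here.

```python
import math


def tdt_statistic(transmitted, untransmitted):
    """McNemar-type TDT chi-square (1 df) from heterozygous-parent transmissions."""
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative transmissions")
    return (transmitted - untransmitted) ** 2 / total


def chi2_1df_pvalue(stat):
    """Upper-tail p-value of a 1-df chi-square, using P(|Z| > sqrt(stat))."""
    return math.erfc(math.sqrt(stat / 2.0))


# Invented counts: allele transmitted 60 times, not transmitted 40 times.
stat = tdt_statistic(60, 40)
print(stat, chi2_1df_pvalue(stat))
```

A nominal p-value from such a test would still need adjustment for the number of SNPs tested, as the abstract notes.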
Relevance of genetic relationship in GWAS and genomic prediction.
Pereira, Helcio Duarte; Soriano Viana, José Marcelo; Andrade, Andréa Carla Bastos; Fonseca E Silva, Fabyano; Paes, Geísa Pinheiro
2018-02-01
The objective of this study was to analyze the relevance of relationship information on the identification of low heritability quantitative trait loci (QTLs) from a genome-wide association study (GWAS) and on the genomic prediction of complex traits in human, animal and cross-pollinating populations. The simulation-based data sets included 50 samples of 1000 individuals of seven populations derived from a common population with linkage disequilibrium. The populations had non-inbred and inbred progeny structure (50 to 200) with varying numbers of members (5 to 20). The individuals were genotyped for 10,000 single nucleotide polymorphisms (SNPs) and phenotyped for a quantitative trait controlled by 10 QTLs and 90 minor genes showing dominance. The SNP density was 0.1 cM and the narrow sense heritability was 25%. The QTL heritabilities ranged from 1.1 to 2.9%. We applied mixed model approaches for both GWAS and genomic prediction using pedigree-based and genomic relationship matrices. For GWAS, the observed false discovery rate was kept below the significance level of 5%, the power of detection for the low heritability QTLs ranged from 14 to 50%, and the average bias between significant SNPs and a QTL ranged from less than 0.01 to 0.23 cM. The QTL detection power was consistently higher using the genomic relationship matrix. Regardless of population and training set size, genomic prediction provided higher prediction accuracy for complex traits than pedigree-based prediction. The accuracy of genomic prediction when there is relatedness between individuals in the training set and the reference population is much higher than the value for unrelated individuals.
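The abstract above relies on genomic relationship matrices but does not say how they were built. As one common possibility, and purely as an assumption here, the sketch below uses the VanRaden-style construction: center allele counts by twice the allele frequency, then scale the cross-products by 2Σp(1−p). The function name and toy genotypes are hypothetical.

```python
def genomic_relationship_matrix(genotypes):
    """VanRaden-style G matrix (an assumed method, not confirmed by the abstract).

    genotypes: list of individuals, each a list of allele counts (0, 1 or 2) per SNP.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Allele frequency of the counted allele at each SNP.
    freqs = [sum(ind[j] for ind in genotypes) / (2.0 * n) for j in range(m)]
    scale = 2.0 * sum(p * (1.0 - p) for p in freqs)
    if scale == 0.0:
        raise ValueError("all SNPs are monomorphic")
    # Center each genotype by its expected value 2p.
    centered = [[ind[j] - 2.0 * freqs[j] for j in range(m)] for ind in genotypes]
    return [
        [sum(a * b for a, b in zip(centered[i], centered[k])) / scale for k in range(n)]
        for i in range(n)
    ]


# Three toy individuals genotyped at three SNPs (invented data).
G = genomic_relationship_matrix([[0, 1, 2], [2, 1, 0], [1, 1, 1]])
for row in G:
    print(row)
```

In practice the allele frequencies would usually come from a base population rather than the genotyped sample itself; the sample-based frequencies above are a simplification.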
Lack of replication of previous autism spectrum disorder GWAS hits in European populations.
Torrico, Bàrbara; Chiocchetti, Andreas G; Bacchelli, Elena; Trabetti, Elisabetta; Hervás, Amaia; Franke, Barbara; Buitelaar, Jan K; Rommelse, Nanda; Yousaf, Afsheen; Duketis, Eftichia; Freitag, Christine M; Caballero-Andaluz, Rafaela; Martinez-Mir, Amalia; Scholl, Francisco G; Ribasés, Marta; Battaglia, Agatino; Malerba, Giovanni; Delorme, Richard; Benabou, Marion; Maestrini, Elena; Bourgeron, Thomas; Cormand, Bru; Toma, Claudio
2017-02-01
Common variants contribute significantly to the genetics of autism spectrum disorder (ASD), although the identification of individual risk polymorphisms remains elusive due to their small effect sizes and the limited sample sizes available for association studies. During the last decade several genome-wide association studies (GWAS) have enabled the detection of a few plausible risk variants. The three main studies are family-based and pointed to SEMA5A (rs10513025), MACROD2 (rs4141463) and MSNP1 (rs4307059). In our study we attempted to replicate these GWAS hits using a case-control association study in five European populations of ASD patients and gender-matched controls, all Caucasians. Results showed no association of individual variants with ASD in any of the population groups considered or in the combined European sample. We performed a meta-analysis across five European populations for rs10513025 (1,904 ASD cases and 2,674 controls), seven European populations for rs4141463 (2,855 ASD cases and 36,177 controls) and five European populations for rs4307059 (2,347 ASD cases and 2,764 controls). The results showed an odds ratio (OR) of 1.05 (95% CI = 0.84-1.32) for rs10513025, 1.0002 (95% CI = 0.93-1.08) for rs4141463 and 1.01 (95% CI = 0.92-1.1) for rs4307059, with no significant P-values (rs10513025, P = 0.73; rs4141463, P = 0.95; rs4307059, P = 0.9). No association was found when we considered only high functioning autism (HFA), genders separately or only multiplex families. Ongoing GWAS projects with larger ASD cohorts will help clarify the role of common variation in the disorder and will likely identify risk variants of modest effect not detected previously. Autism Res 2017, 10: 202-211. © 2016 International Society for Autism Research, Wiley Periodicals, Inc.
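The odds ratios and confidence intervals in the abstract above can be related to a test statistic with a short sketch. It assumes a symmetric Wald interval on the log-odds scale, which the abstract does not confirm. Because the published values are rounded and the authors' meta-analysis model is not given, a p-value recomputed this way need not equal the reported one. The function name is hypothetical.

```python
import math


def z_and_p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover SE, z and two-sided p from an OR and its 95% CI.

    Assumes the CI is exp(log(OR) +/- z_crit * SE), i.e. a Wald interval.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return se, z, p


# Values for rs10513025 as reported in the abstract.
se, z, p = z_and_p_from_or_ci(1.05, 0.84, 1.32)
print(se, z, p)
```

The useful check is qualitative: a confidence interval that spans 1 corresponds to a p-value above 0.05 under this approximation.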
Howard, Jeremy T; Jiao, Shihui; Tiezzi, Francesco; Huang, Yijian; Gray, Kent A; Maltecca, Christian
2015-05-30
Feed intake and growth are economically important traits in swine production. Previous genome wide association studies (GWAS) have utilized average daily gain or daily feed intake to identify regions that impact growth and feed intake across time. The use of longitudinal models in GWAS, such as random regression, allows SNPs having a heterogeneous effect across the trajectory to be characterized. The objective of this study is therefore to conduct a single step GWAS (ssGWAS) on the animal polynomial coefficients for feed intake and growth. Corrected daily feed intake (DFIAdj) and average daily weight measurements (DBWAvg) on 8981 (n=525,240 observations) and 5643 (n=283,607 observations) animals were utilized in a random regression model using Legendre polynomials (order=2) and a relationship matrix that included genotyped and un-genotyped animals. A ssGWAS was conducted on the animal polynomial coefficients (intercept, linear and quadratic) for animals with genotypes (DFIAdj: n=855; DBWAvg: n=590). Regions were characterized based on the variance of 10-SNP sliding windows GEBV (WGEBV). A bootstrap analysis (n=1000) was conducted to declare significance. Heritability estimates across the trajectory ranged from 0.34 to 0.52 for DBWAvg and from 0.07 to 0.23 for DFIAdj. Genetic correlations across age classes were large and positive for both DBWAvg and DFIAdj, albeit age classes at the beginning had a small to moderate genetic correlation with age classes towards the end of the trajectory for both traits. The WGEBV variance explained by significant regions (P<0.001) for each polynomial coefficient ranged from 0.2 to 0.9% for DBWAvg and from 0.3 to 1.01% for DFIAdj. The WGEBV variance explained by significant regions for the trajectory was 1.54 and 1.95% for DBWAvg and DFIAdj, respectively. For both traits, candidate genes were identified with functions related to metabolite and energy homeostasis, glucose and insulin signaling and behavior.
We have identified regions of the genome that have an impact on the intercept, linear and quadratic terms for DBWAvg and DFIAdj. These results provide preliminary evidence that individual growth and feed intake trajectories are impacted by different regions of the genome at different times.
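The random regression model in the abstract above describes each animal's trajectory with intercept, linear and quadratic Legendre coefficients. The sketch below shows how such coefficients map to a value at a given age, using unnormalized Legendre polynomials on age rescaled to [-1, 1]. The function names, the age range and the coefficients are hypothetical. Some animal-breeding software applies normalized polynomials instead, which would change the scale of the coefficients.

```python
def standardize_age(age, age_min, age_max):
    """Rescale age linearly so that age_min maps to -1 and age_max to +1."""
    if age_max <= age_min:
        raise ValueError("age_max must exceed age_min")
    return -1.0 + 2.0 * (age - age_min) / (age_max - age_min)


def legendre_basis(x):
    """Unnormalized Legendre polynomials P0, P1, P2 evaluated at x in [-1, 1]."""
    return [1.0, x, 0.5 * (3.0 * x * x - 1.0)]


def trajectory_value(coefficients, age, age_min, age_max):
    """Value of the trajectory at `age` given (intercept, linear, quadratic) terms."""
    x = standardize_age(age, age_min, age_max)
    return sum(c * b for c, b in zip(coefficients, legendre_basis(x)))


# Invented coefficients for one animal over a hypothetical 70 to 170 day test period.
animal_coefficients = [2.0, 0.5, -0.1]
for day in (70, 120, 170):
    print(day, trajectory_value(animal_coefficients, day, 70, 170))
```

Because the three basis functions weight ages differently, a genomic region affecting only the linear or quadratic coefficient changes the shape of the trajectory rather than its overall level.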
Polygenic risk and the development and course of asthma: Evidence from a 4-decade longitudinal study
Belsky, DW; Sears, MR; Hancox, RJ; Harrington, HL; Houts, R; Moffitt, TE; Sugden, K; Williams, B; Poulton, R; Caspi, A
2013-01-01
BACKGROUND Genome-wide association studies (GWAS) have discovered loci that predispose to asthma. To integrate these new discoveries with emerging models of asthma pathobiology, research is needed to test how genetic discoveries relate to developmental and biological characteristics of asthma. METHODS We derived a multi-locus profile of genetic risk from published GWAS of asthma case status. We then tested associations between this “genetic risk score” and developmental and biological characteristics of asthma in a population-based long-running birth cohort, the Dunedin Longitudinal Study (n=1,037). We evaluated asthma onset, persistence, atopy, airway hyperresponsiveness, incompletely reversible airflow obstruction, and asthma-related school and work absenteeism and hospitalization during 9 prospective assessments spanning ages 9–38 years, when 95% of surviving cohort members were seen. INTERPRETATION Cohort members at higher genetic risk experienced asthma onset earlier in life (HR=1.12 [1.01–1.26]). Childhood-onset asthma cases at higher genetic risk were more likely to become life-course-persistent asthma cases (RR=1.36 [1.14–1.63]). Asthma cases at higher genetic risk more often manifested atopy (RR=1.07 [1.01–1.14]), airway hyperresponsiveness (RR=1.16 [1.03–1.32]), and incompletely reversible airflow obstruction (RR=1.28 [1.04–1.57]). They were also more likely to miss school or work due to asthma (IRR=1.38 [1.02–1.86]) and to be hospitalized with breathing problems (HR=1.38 [1.07–1.79]). Genotypic information about asthma risk was independent of and additive to information derived from cohort members’ family histories of asthma. 
CONCLUSIONS Findings from this population study confirm that GWAS-discoveries for asthma associate with a childhood-onset phenotype and advance asthma genetics beyond the original GWAS-discoveries in three ways: (1) We show that genetic risks predict which childhood-onset asthma cases remit and which become life-course-persistent cases, although these predictions are not sufficiently sensitive or specific to support immediate clinical translation; (2) We elucidate a biological profile of the asthma that arises from these genetic risks: asthma characterized by atopy and airway hyperresponsiveness and leading to incompletely reversible airflow obstruction; and (3) We describe the real-life impact of GWAS-discoveries by quantifying genetic associations with missed school and work and hospitalization. PMID:24429243
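A multi-locus "genetic risk score" of the kind described in the abstract above is typically a sum of risk-allele counts, optionally weighted by per-SNP effect sizes, and then standardized across the cohort. The abstract does not say whether weights were used, so both variants below are assumptions. All names and numbers are hypothetical.

```python
import math


def genetic_risk_score(risk_allele_counts, weights=None):
    """Sum of risk-allele counts (0, 1 or 2 per SNP), optionally weighted."""
    if weights is None:
        weights = [1.0] * len(risk_allele_counts)
    if len(weights) != len(risk_allele_counts):
        raise ValueError("one weight per SNP is required")
    return sum(count * weight for count, weight in zip(risk_allele_counts, weights))


def standardize(scores):
    """Convert raw scores to z-scores using the sample mean and sample SD."""
    n = len(scores)
    mean = sum(scores) / n
    sd = math.sqrt(sum((s - mean) ** 2 for s in scores) / (n - 1))
    if sd == 0.0:
        raise ValueError("scores have zero variance")
    return [(s - mean) / sd for s in scores]


# Three invented individuals genotyped at four hypothetical risk SNPs.
cohort = [[0, 1, 2, 1], [1, 1, 0, 0], [2, 2, 1, 1]]
raw_scores = [genetic_risk_score(person) for person in cohort]
print(raw_scores, standardize(raw_scores))
```

Standardizing the score lets effect estimates be read per standard deviation of genetic risk, although whether the abstract's ratios are expressed that way is not stated.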
2013-01-01
Background The apparent effect of a single nucleotide polymorphism (SNP) on phenotype depends on the linkage disequilibrium (LD) between the SNP and a quantitative trait locus (QTL). However, the phase of LD between a SNP and a QTL may differ between Bos indicus and Bos taurus because they diverged at least one hundred thousand years ago. Here, we test the hypothesis that the apparent effect of a SNP on a quantitative trait depends on whether the SNP allele is inherited from a Bos taurus or Bos indicus ancestor. Methods Phenotype data on one or more traits and SNP genotype data for 10 181 cattle from Bos taurus, Bos indicus and composite breeds were used. All animals had genotypes for 729 068 SNPs (real or imputed). Chromosome segments were classified as originating from B. indicus or B. taurus on the basis of the haplotype of SNP alleles they contained. Consequently, SNP alleles were classified according to their sub-species origin. Three models were used for the association study: (1) conventional GWAS (genome-wide association study), fitting a single SNP effect regardless of subspecies origin, (2) interaction GWAS, fitting an interaction between SNP and subspecies-origin, and (3) best variable GWAS, fitting the most significant combination of SNP and sub-species origin. Results Fitting an interaction between SNP and subspecies origin resulted in more significant SNPs (i.e. more power) than a conventional GWAS. Thus, the effect of a SNP depends on the subspecies that the allele originates from. Also, most QTL segregated in only one subspecies, suggesting that many mutations that affect the traits studied occurred after divergence of the subspecies or the mutation became fixed or was lost in one of the subspecies. Conclusions The results imply that GWAS and genomic selection could gain power by distinguishing SNP alleles based on their subspecies origin, and that only few QTL segregate in both B. indicus and B. taurus cattle. 
Thus, the QTL that segregate in current populations likely resulted from mutations that occurred in one of the subspecies and can have both positive and negative effects on the traits. There was no evidence that selection has increased the frequency of alleles that increase body weight. PMID:24168700
Xu, Jinfeng; Yuan, Ao; Zheng, Gang
2012-01-01
Summary In the analysis of case-control genetic association, the trend test and Pearson’s test are the two most commonly used tests. In genome-wide association studies (GWAS), Bayes factor is a useful tool to support significant p-values, and a better measure than p-value when results are compared across studies with different sample sizes. When reporting the p-value of the trend test, we propose a Bayes factor directly based on the trend test. To improve the power to detect association under recessive or dominant genetic models, we propose a Bayes factor based on the trend test and incorporating Hardy-Weinberg disequilibrium in cases. When the true model is unknown, or both the trend test and Pearson’s test or other robust tests are applied in genome-wide scans, we propose a joint Bayes factor, combining the previous two Bayes factors. All three Bayes factors studied in this paper have closed forms and are easy to compute without integrations, so they can be reported along with p-values, especially in GWAS. We discuss how to use each of them and how to specify priors. Simulation studies and applications to three GWAS are provided to illustrate their usefulness to detect non-additive gene susceptibility in practice. PMID:22607017
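The trend test that the Bayes factors above build on is commonly the Cochran-Armitage test on a 2×3 genotype table. The sketch below computes its 1-df chi-square with additive scores (0, 1, 2). It illustrates the frequentist test only, not the Bayes factors proposed in the paper. The function name and counts are hypothetical.

```python
import math


def trend_test(case_counts, control_counts, scores=(0.0, 1.0, 2.0)):
    """Cochran-Armitage trend chi-square (1 df) and p-value.

    case_counts / control_counts: genotype counts for 0, 1 and 2 copies of an allele.
    """
    col_totals = [r + s for r, s in zip(case_counts, control_counts)]
    n_cases = sum(case_counts)
    n_controls = sum(control_counts)
    n_total = n_cases + n_controls
    sum_xr = sum(x * r for x, r in zip(scores, case_counts))
    sum_xn = sum(x * n for x, n in zip(scores, col_totals))
    sum_x2n = sum(x * x * n for x, n in zip(scores, col_totals))
    numerator = n_total * (n_total * sum_xr - n_cases * sum_xn) ** 2
    denominator = n_cases * n_controls * (n_total * sum_x2n - sum_xn ** 2)
    if denominator == 0:
        raise ValueError("table is degenerate (no variation in genotype or status)")
    stat = numerator / denominator
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


# Invented genotype counts for cases and controls.
print(trend_test([10, 20, 30], [30, 20, 10]))
```

Changing `scores` to (0, 0, 1) or (0, 1, 1) gives the recessive and dominant versions of the test, which is the setting where the abstract argues that additional information helps.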
Genome-wide association study identifies multiple loci associated with bladder cancer risk
Figueroa, Jonine D.; Ye, Yuanqing; Siddiq, Afshan; Garcia-Closas, Montserrat; Chatterjee, Nilanjan; Prokunina-Olsson, Ludmila; Cortessis, Victoria K.; Kooperberg, Charles; Cussenot, Olivier; Benhamou, Simone; Prescott, Jennifer; Porru, Stefano; Dinney, Colin P.; Malats, Núria; Baris, Dalsu; Purdue, Mark; Jacobs, Eric J.; Albanes, Demetrius; Wang, Zhaoming; Deng, Xiang; Chung, Charles C.; Tang, Wei; Bas Bueno-de-Mesquita, H.; Trichopoulos, Dimitrios; Ljungberg, Börje; Clavel-Chapelon, Françoise; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth; Tjønneland, Anne; Brenan, Paul; Chang-Claude, Jenny; Riboli, Elio; Conti, David; Gago-Dominguez, Manuela; Stern, Mariana C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Hohensee, Chancellor; Rodabough, Rebecca; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Chen, Constance; De Vivo, Immaculata; Giovannucci, Edward; Hunter, David J.; Kraft, Peter; Lindstrom, Sara; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Kamat, Ashish M.; Lerner, Seth P.; Barton Grossman, H.; Lin, Jie; Gu, Jian; Pu, Xia; Hutchinson, Amy; Burdette, Laurie; Wheeler, William; Kogevinas, Manolis; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Lloreta, Josep; Schwenn, Molly; Karagas, Margaret R.; Johnson, Alison; Schned, Alan; Armenti, Karla R.; Hosain, G.M.; Andriole, Gerald; Grubb, Robert; Black, Amanda; Ryan Diver, W.; Gapstur, Susan M.; Weinstein, Stephanie J.; Virtamo, Jarmo; Haiman, Chris A.; Landi, Maria T.; Caporaso, Neil; Fraumeni, Joseph F.; Vineis, Paolo; Wu, Xifeng; Silverman, Debra T.; Chanock, Stephen; Rothman, Nathaniel
2014-01-01
Candidate gene and genome-wide association studies (GWAS) have identified 11 independent susceptibility loci associated with bladder cancer risk. To discover additional risk variants, we conducted a new GWAS of 2422 bladder cancer cases and 5751 controls, followed by a meta-analysis with two independently published bladder cancer GWAS, resulting in a combined analysis of 6911 cases and 11 814 controls of European descent. TaqMan genotyping of 13 promising single nucleotide polymorphisms with P < 1 × 10−5 was pursued in a follow-up set of 801 cases and 1307 controls. Two new loci achieved genome-wide statistical significance: rs10936599 on 3q26.2 (P = 4.53 × 10−9) and rs907611 on 11p15.5 (P = 4.11 × 10−8). Two notable loci were also identified that approached genome-wide statistical significance: rs6104690 on 20p12.2 (P = 7.13 × 10−7) and rs4510656 on 6p22.3 (P = 6.98 × 10−7); these require further studies for confirmation. In conclusion, our study has identified new susceptibility alleles for bladder cancer risk that require fine-mapping and laboratory investigation, which could further understanding into the biological underpinnings of bladder carcinogenesis. PMID:24163127
Understanding the pharmacogenetics of selective serotonin reuptake inhibitors.
Fabbri, Chiara; Minarini, Alessandro; Niitsu, Tomihisa; Serretti, Alessandro
2014-08-01
The genetic background of antidepressant response represents a unique opportunity to identify biological markers of treatment outcome. Encouraging results alternating with inconsistent findings made antidepressant pharmacogenetics a stimulating but often discouraging field that requires careful discussion about cumulative evidence and methodological issues. The present review discusses both known and less replicated genes that have been implicated in selective serotonin reuptake inhibitors (SSRIs) efficacy and side effects. Candidate genes studies and genome-wide association studies (GWAS) were collected through MEDLINE database search (articles published till January 2014). Further, GWAS signals localized in promising genetic regions according to candidate gene studies are reported in order to assess the general comparability of results obtained through these two types of pharmacogenetic studies. Finally, a pathway enrichment approach is applied to the top genes (those harboring SNPs with p < 0.0001) outlined by previous GWAS in order to identify possible molecular mechanisms involved in SSRI effect. In order to improve the understanding of SSRI pharmacogenetics, the present review discusses the proposal of moving from the analysis of individual polymorphisms to genes and molecular pathways, and from the separation across different methodological approaches to their combination. Efforts in this direction are justified by the recent evidence of a favorable cost-utility of gene-guided antidepressant treatment.
Allele-Skewed DNA Modification in the Brain: Relevance to a Schizophrenia GWAS
Gagliano, Sarah A.; Ptak, Carolyn; Mak, Denise Y.F.; Shamsi, Mehrdad; Oh, Gabriel; Knight, Joanne; Boutros, Paul C.; Petronis, Arturas
2016-01-01
Numerous recent studies have suggested that phenotypic effects of DNA sequence variants can be mediated or modulated by their epigenetic marks, such as allele-skewed DNA modification (ASM). Using Affymetrix SNP microarrays, we performed a comprehensive search of ASM effects in human post-mortem brain and sperm samples (total n = 256) from individuals with major psychosis and control individuals. Depending on the phenotypic category of the brain samples, 1.4%–7.5% of interrogated SNPs exhibited ASM effects. Next, we investigated ASM in the context of genetic studies of schizophrenia and detected that brain ASM SNPs were significantly overrepresented among sub-threshold SNPs from a schizophrenia genome-wide association study (GWAS). Brain ASM SNPs showed a much stronger enrichment in a schizophrenia GWAS than in 17 large GWASs of non-psychiatric diseases and traits, arguing that ASM effects are at least partially tissue specific. Studies of germline and control brain ASM SNPs supported a causal association between ASM and schizophrenia. Finally, significantly higher proportions of ASM SNPs than of non-ASM SNPs were detected at loci exhibiting epigenetic signatures of enhancers and promoters, and they were overrepresented within transcription factor binding regions and DNase I hypersensitive sites. All of these findings collectively indicate that ASM SNPs should be prioritized in follow-up GWASs. PMID:27087318
ITGB5 and AGFG1 variants are associated with severity of airway responsiveness.
Himes, Blanca E; Qiu, Weiliang; Klanderman, Barbara; Ziniti, John; Senter-Sylvia, Jody; Szefler, Stanley J; Lemanske, Robert F; Zeiger, Robert S; Strunk, Robert C; Martinez, Fernando D; Boushey, Homer; Chinchilli, Vernon M; Israel, Elliot; Mauger, David; Koppelman, Gerard H; Nieuwenhuis, Maartje A E; Postma, Dirkje S; Vonk, Judith M; Rafaels, Nicholas; Hansel, Nadia N; Barnes, Kathleen; Raby, Benjamin; Tantisira, Kelan G; Weiss, Scott T
2013-08-28
Airway hyperresponsiveness (AHR), a primary characteristic of asthma, involves increased airway smooth muscle contractility in response to certain exposures. We sought to determine whether common genetic variants were associated with AHR severity. A genome-wide association study (GWAS) of AHR, quantified as the natural log of the dosage of methacholine causing a 20% drop in FEV1, was performed with 994 non-Hispanic white asthmatic subjects from three drug clinical trials: CAMP, CARE, and ACRN. Genotyping was performed on Affymetrix 6.0 arrays, and imputed data based on HapMap Phase 2, was used to measure the association of SNPs with AHR using a linear regression model. Replication of primary findings was attempted in 650 white subjects from DAG, and 3,354 white subjects from LHS. Evidence that the top SNPs were eQTL of their respective genes was sought using expression data available for 419 white CAMP subjects. The top primary GWAS associations were in rs848788 (P-value 7.2E-07) and rs6731443 (P-value 2.5E-06), located within the ITGB5 and AGFG1 genes, respectively. The AGFG1 result replicated at a nominally significant level in one independent population (LHS P-value 0.012), and the SNP had a nominally significant unadjusted P-value (0.0067) for being an eQTL of AGFG1. Based on current knowledge of ITGB5 and AGFG1, our results suggest that variants within these genes may be involved in modulating AHR. Future functional studies are required to confirm that our associations represent true biologically significant findings.
Li, C; Sun, D; Zhang, S; Liu, L; Alim, M A; Zhang, Q
2016-08-01
The stearoyl-CoA desaturase (delta-9-desaturase) gene encodes a key enzyme in the cellular biosynthesis of monounsaturated fatty acids. In our initial genome-wide association study (GWAS) of Chinese Holstein cows, 19 SNPs fell in a 1.8-Mb region (20.3-22.1 Mb) on chromosome 26 underlying the SCD gene and were highly significantly associated with C14:1 or C14 index. The aim of this study was to verify whether the SCD gene has significant genetic effects on milk fatty acid composition in dairy cattle. By resequencing the entire coding region of the bovine SCD gene, a total of six variations were identified, including three coding variations (g.10153G>A, g.10213T>C and g.10329C>T) and three intronic variations (g.6926A>G, g.8646G>A and g.16158G>C). The SNP in exon 3, g.10329C>T, was predicted to result in an amino acid replacement from alanine (GCG) to valine (GTG) in the SCD protein. An association study for 16 milk fatty acids using 346 Chinese Holstein cows with accurate phenotypes and genotypes was performed using the mixed animal model with the PROC MIXED procedure in SAS 9.2. All six detected SNPs were revealed to be associated with six medium- and long-chain unsaturated fatty acids (P = 0.0457 to P < 0.0001), specifically for C14:1 and C14 index (P = 0.0005 to P < 0.0001). Subsequently, strong linkage disequilibrium (D' = 0.88-1.00) was observed among all six SNPs in SCD and the five SNPs (rs41623887, rs109923480, rs42090224, rs42092174 and rs42091426) within the 1.8-Mb region identified in our previous GWAS, indicating that the significant association of the SCD gene with milk fatty acid content traits reduced the observed significant 1.8-Mb chromosome region in GWAS. Haplotype-based analysis revealed significant associations of the haplotypes encompassing the six SCD SNPs and one SNP (rs109923480) in a GWAS with C14:1, C14 index, C16:1 and C16 index (P = 0.0011 to P < 0.0001).
In summary, our findings provide replicate evidence for our previous GWAS and demonstrate that variants in the SCD gene are significantly associated with milk fatty acid composition in dairy cattle, which provides clear evidence for an increased understanding of milk fatty acid synthesis and enhances opportunities to improve milk-fat composition in dairy cattle. © 2016 Stichting International Foundation for Animal Genetics.
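The D' values quoted in the abstract above measure linkage disequilibrium between pairs of SNPs. A minimal sketch of the standard calculation from haplotype and allele frequencies is shown below. It assumes phased haplotype frequencies are already available, whereas real analyses usually estimate them from genotypes. The function name and frequencies are hypothetical.

```python
def d_prime(p_ab, p_a, p_b):
    """Absolute normalized LD coefficient |D'| for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B.
    p_a, p_b: frequencies of allele A and allele B.
    """
    d = p_ab - p_a * p_b
    if d == 0.0:
        return 0.0
    if d > 0:
        d_max = min(p_a * (1.0 - p_b), (1.0 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1.0 - p_a) * (1.0 - p_b))
    if d_max == 0.0:
        raise ValueError("at least one locus is monomorphic")
    return abs(d) / d_max


# Invented frequencies: the two alleles always travel together.
print(d_prime(0.5, 0.5, 0.5))
# Invented frequencies: the two loci are independent.
print(d_prime(0.25, 0.5, 0.5))
```

A D' near 1 means at most three of the four possible haplotypes are observed, which is consistent with the newly found SCD variants tagging the same signal as the earlier GWAS SNPs.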
Mapping of Gene Expression Reveals CYP27A1 as a Susceptibility Gene for Sporadic ALS
van Rheenen, Wouter; Franke, Lude; Jansen, Ritsert C.; van Es, Michael A.; van Vught, Paul W. J.; Blauw, Hylke M.; Groen, Ewout J. N.; Horvath, Steve; Estrada, Karol; Rivadeneira, Fernando; Hofman, Albert; Uitterlinden, Andre G.; Robberecht, Wim; Andersen, Peter M.; Melki, Judith; Meininger, Vincent; Hardiman, Orla; Landers, John E.; Brown, Robert H.; Shatunov, Aleksey; Shaw, Christopher E.; Leigh, P. Nigel; Al-Chalabi, Ammar; Ophoff, Roel A.
2012-01-01
Amyotrophic lateral sclerosis (ALS) is a progressive, neurodegenerative disease characterized by loss of upper and lower motor neurons. ALS is considered to be a complex trait and genome-wide association studies (GWAS) have implicated a few susceptibility loci. However, many more causal loci remain to be discovered. Since it has been shown that genetic variants associated with complex traits are more likely to be eQTLs than frequency-matched variants from GWAS platforms, we conducted a two-stage genome-wide screening for eQTLs associated with ALS. In addition, we applied an eQTL analysis to finemap association loci. Expression profiles using peripheral blood of 323 sporadic ALS patients and 413 controls were mapped to genome-wide genotyping data. Subsequently, data from a two-stage GWAS (3,568 patients and 10,163 controls) were used to prioritize eQTLs identified in the first stage (162 ALS, 207 controls). These prioritized eQTLs were carried forward to the second sample with both gene-expression and genotyping data (161 ALS, 206 controls). Replicated eQTL SNPs were then tested for association in the second-stage GWAS data to find SNPs associated with disease, that survived correction for multiple testing. We thus identified twelve cis eQTLs with nominally significant associations in the second-stage GWAS data. Eight SNP-transcript pairs of highest significance (lowest p = 1.27×10−51) withstood multiple-testing correction in the second stage and modulated CYP27A1 gene expression. Additionally, we show that C9orf72 appears to be the only gene in the 9p21.2 locus that is regulated in cis, showing the potential of this approach in identifying causative genes in association loci in ALS. This study has identified candidate genes for sporadic ALS, most notably CYP27A1. Mutations in CYP27A1 are causal to cerebrotendinous xanthomatosis which can present as a clinical mimic of ALS with progressive upper motor neuron loss, making it a plausible susceptibility gene for ALS. 
PMID:22509407
Zhang, Chunyan; Wang, Zhiquan; Bruce, Heather; Kemp, Robert Alan; Charagu, Patrick; Miar, Younes; Yang, Tianfu; Plastow, Graham
2015-04-07
Improving meat quality is a high priority for the pork industry to satisfy consumers' preferences. GWAS have become a state-of-the-art approach to genetically improve economically important traits. However, GWAS focused on pork quality are still relatively rare. Six genomic regions were shown to affect loin pH and Minolta colour a* and b* on both loin and ham through GWAS in 1943 crossbred commercial pigs. Five of them, located on Sus scrofa chromosome (SSC) 1, SSC5, SSC9, SSC16 and SSCX, were associated with meat colour. However, the most promising region was detected on SSC15 spanning 133-134 Mb which explained 3.51% - 17.06% of genetic variance for five measurements of pH and colour. Three SNPs (ASGA0070625, MARC0083357 and MARC0039273) in very strong LD were considered most likely to account for the effects in this region. ASGA0070625 is located in intron 2 of ZNF142, and the other two markers are close to PRKAG3, STK36, TTLL7 and CDK5R2. After fitting MARC0083357 (the closest SNP to PRKAG3) as a fixed factor, six SNPs still remained significant for at least one trait. Four of them are intragenic with ARPC2, TMBIM1, NRAMP1 and VIL1, while the remaining two are close to RUFY4 and CDK5R2. The gene network constructed demonstrated strong connections of these genes with two major hubs of PRKAG3 and UBC in the super-pathways of cell-to-cell signaling and interaction, cellular function and maintenance. All these pathways play important roles in maintaining the integral architecture and functionality of muscle cells facing the dramatic changes that occur after exsanguination, which is in agreement with the GWAS results found in this study. There may be other markers and/or genes in this region besides PRKAG3 that have an important effect on pH and colour. The potential markers and their interactions with PRKAG3 require further investigation.
Demirci, F. Yesim; Wang, Xingbin; Kelly, Jennifer A.; Morris, David L.; Barmada, M. Michael; Feingold, Eleanor; Kao, Amy H.; Sivils, Kathy L.; Bernatsky, Sasha; Pineau, Christian; Clarke, Ann; Ramsey-Goldman, Rosalind; Vyse, Timothy J.; Gaffney, Patrick M.; Manzi, Susan; Kamboh, M. Ilyas
2016-01-01
Objective: Genome-wide association studies (GWASs) in individuals of European ancestry identified a number of systemic lupus erythematosus (SLE) susceptibility loci using earlier versions of high-density genotyping platforms. Follow-up studies on suggestive GWAS regions using larger samples and more markers identified additional SLE loci in European-descent subjects. Here we report the results of a multi-stage study that we performed to identify novel SLE loci. Methods: In Stage 1, we conducted a new GWAS of SLE in a North American case-control sample of European ancestry (n=1,166) genotyped on Affymetrix Genome-Wide Human SNP Array 6.0. In Stage 2, we further investigated top new suggestive GWAS hits by in silico evaluation and meta-analysis using an additional dataset of European-descent subjects (>2,500 individuals), followed by replication of top meta-analysis findings in another dataset of European-descent subjects (>10,000 individuals) in Stage 3. Results: As expected, our GWAS revealed the most significant associations at the major histocompatibility complex locus (6p21), which easily surpassed the genome-wide significance threshold (P<5×10−8). Several other SLE signals/loci previously implicated in Caucasians and/or Asians were also supported in the Stage 1 discovery sample, and the strongest signals were observed at 2q32/STAT4 (P=3.6×10−7) and at 8p23/BLK (P=8.1×10−6). Stage 2 meta-analyses identified a new genome-wide significant SLE locus at 12q12 (meta P=3.1×10−8), which was replicated in Stage 3. Conclusion: Our multi-stage study identified and replicated a new SLE locus that warrants further follow-up in additional studies. Publicly available databases suggest that this new SLE signal falls within a functionally relevant genomic region and near biologically important genes. PMID:26316170
Delgado, Dayana A; Zhang, Chenan; Chen, Lin S; Gao, Jianjun; Roy, Shantanu; Shinkle, Justin; Sabarinathan, Mekala; Argos, Maria; Tong, Lin; Ahmed, Alauddin; Islam, Tariqul; Rakibuz-Zaman, Muhammad; Sarwar, Golam; Shahriar, Hasan; Rahman, Mahfuzar; Yunus, Mohammad; Jasmine, Farzana; Kibriya, Muhammad G; Ahsan, Habibul; Pierce, Brandon L
2018-01-01
Leucocyte telomere length (TL) is a potential biomarker of ageing and risk for age-related disease. Leucocyte TL is heritable and shows substantial differences by race/ethnicity. Recent genome-wide association studies (GWAS) report ~10 loci harbouring SNPs associated with leucocyte TL, but these studies focus primarily on populations of European ancestry. This study aims to enhance our understanding of genetic determinants of TL across populations. We performed a GWAS of TL using data on 5075 Bangladeshi adults. We measured TL using one of two technologies (qPCR or a Luminex-based method) and used standardised variables as TL phenotypes. Our results replicate previously reported associations in the TERC and TERT regions (P=2.2×10−8 and P=6.4×10−6, respectively). We observed a novel association signal in the RTEL1 gene (intronic SNP rs2297439; P=2.82×10−7) that is independent of previously reported TL-associated SNPs in this region. The minor allele for rs2297439 is common in South Asian populations (≥0.25) but at lower frequencies in other populations (eg, 0.07 in Northern Europeans). Among the eight other previously reported association signals, all were directionally consistent with our study, but only rs8105767 (ZNF208) was nominally significant (P=0.003). SNP-based heritability estimates were as high as 44% when analysing close relatives but much lower when analysing distant relatives only. In this first GWAS of TL in a South Asian population, we replicate some, but not all, of the loci reported in prior GWAS of individuals of European ancestry, and we identify a novel second association signal at the RTEL1 locus. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
LD Score Regression Distinguishes Confounding from Polygenicity in Genome-Wide Association Studies
Bulik-Sullivan, Brendan K.; Loh, Po-Ru; Finucane, Hilary; Ripke, Stephan; Yang, Jian; Patterson, Nick; Daly, Mark J.; Price, Alkes L.; Neale, Benjamin M.
2015-01-01
Both polygenicity (i.e., many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of test statistic inflation in many GWAS of large sample size. PMID:25642630
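The regression described in this abstract can be sketched in a few lines. Under the LD Score regression model, the expected chi-square statistic of SNP j is E[chi2_j] = N*h2*l_j/M + N*a + 1, where l_j is the LD Score of the SNP, N the sample size, M the number of SNPs, h2 the SNP heritability and a the contribution of confounding. The slope therefore reflects polygenic signal and the intercept (minus one) reflects confounding. The data below are simulated, and the unweighted least-squares fit and all variable names are illustrative assumptions; the published method uses a weighted regression with jackknife standard errors.

```python
import numpy as np

def ld_score_regression(chi2, ld_scores, n_samples, n_snps):
    """Unweighted least-squares fit of chi-square statistics on LD Scores.

    Returns (h2_estimate, intercept). This is a simplification of the
    published method, which weights SNPs and uses a block jackknife.
    """
    design = np.column_stack([np.ones_like(ld_scores), ld_scores])
    coef, *_ = np.linalg.lstsq(design, chi2, rcond=None)
    intercept, slope = coef
    # The model slope equals N * h2 / M, so rescale it to recover h2.
    return slope * n_snps / n_samples, intercept

# Simulated, hypothetical data (not taken from the paper).
rng = np.random.default_rng(0)
n_snps, n_samples = 200_000, 20_000
h2_true, intercept_true = 0.5, 1.05  # intercept = 1 + N*a
ld_scores = rng.gamma(shape=2.0, scale=50.0, size=n_snps)
expected_chi2 = n_samples * h2_true * ld_scores / n_snps + intercept_true
# Each statistic is its expectation times a 1-df chi-square draw.
chi2 = expected_chi2 * rng.chisquare(df=1, size=n_snps)

h2_hat, intercept_hat = ld_score_regression(chi2, ld_scores, n_samples, n_snps)
print(f"h2 estimate: {h2_hat:.3f}, intercept: {intercept_hat:.3f}")
```

In this toy setting the mean chi-square is well above 1, yet the fitted intercept stays close to 1.05, which is the sense in which the intercept separates confounding from polygenic inflation.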
Genome-wide association studies on HIV susceptibility, pathogenesis and pharmacogenomics
2012-01-01
Susceptibility to HIV-1 and the clinical course after infection show substantial heterogeneity between individuals. Part of this variability can be attributed to host genetic variation. Initial candidate gene studies have revealed interesting host factors that influence HIV infection, replication and pathogenesis. Recently, genome-wide association studies (GWAS) were utilized for unbiased searches at a genome-wide level to discover novel genetic factors and pathways involved in HIV-1 infection. This review gives an overview of findings from the GWAS performed on HIV infection, within different cohorts, with variable patient and phenotype selection. Furthermore, novel techniques and strategies in research that might contribute to the complete understanding of virus-host interactions and their role in the pathogenesis of HIV infection are discussed. PMID:22920050
Austin, Melissa A.; Hair, Marilyn S.; Fullerton, Stephanie M.
2012-01-01
Scientific research has shifted from studies conducted by single investigators to the creation of large consortia. Genetic epidemiologists, for example, now collaborate extensively for genome-wide association studies (GWAS). The effect has been a stream of confirmed disease-gene associations. However, effects on human subjects oversight, data-sharing, publication and authorship practices, research organization and productivity, and intellectual property remain to be examined. The aim of this analysis was to identify all research consortia that had published the results of a GWAS analysis since 2005, characterize them, determine which have publicly accessible guidelines for research practices, and summarize the policies in these guidelines. A review of the National Human Genome Research Institute’s Catalog of Published Genome-Wide Association Studies identified 55 GWAS consortia as of April 1, 2011. These consortia were comprised of individual investigators, research centers, studies, or other consortia and studied 48 different diseases or traits. Only 14 (25%) were found to have publicly accessible research guidelines on consortia websites. The available guidelines provide information on organization, governance, and research protocols; half address institutional review board approval. Details of publication, authorship, data-sharing, and intellectual property vary considerably. Wider access to consortia guidelines is needed to establish appropriate research standards with broad applicability to emerging forms of large-scale collaboration. PMID:22491085
Multi-ethnic genome-wide association study identifies novel locus for type 2 diabetes susceptibility
Cook, James P; Morris, Andrew P
2016-01-01
Genome-wide association studies (GWAS) have traditionally been undertaken in homogeneous populations from the same ancestry group. However, with the increasing availability of GWAS in large-scale multi-ethnic cohorts, we have evaluated a framework for detecting association of genetic variants with complex traits, allowing for population structure, and developed a powerful test of heterogeneity in allelic effects between ancestry groups. We have applied the methodology to identify and characterise loci associated with susceptibility to type 2 diabetes (T2D) using GWAS data from the Resource for Genetic Epidemiology on Adult Health and Aging, a large multi-ethnic population-based cohort, created for investigating the genetic and environmental basis of age-related diseases. We identified a novel locus for T2D susceptibility at genome-wide significance (P<5 × 10−8) that maps to TOMM40-APOE, a region previously implicated in lipid metabolism and Alzheimer's disease. We have also confirmed previous reports that single-nucleotide polymorphisms at the TCF7L2 locus demonstrate the greatest extent of heterogeneity in allelic effects between ethnic groups, with the lowest risk observed in populations of East Asian ancestry. PMID:27189021
Pleiotropic analysis of cancer risk loci on esophageal adenocarcinoma risk
Lee, Eunjung; Stram, Daniel O.; Ek, Weronica E.; Onstad, Lynn E; MacGregor, Stuart; Gharahkhani, Puya; Ye, Weimin; Lagergren, Jesper; Shaheen, Nicholas J.; Murray, Liam J.; Hardie, Laura J; Gammon, Marilie D.; Chow, Wong-Ho; Risch, Harvey A.; Corley, Douglas A.; Levine, David M; Whiteman, David C.; Bernstein, Leslie; Bird, Nigel C.; Vaughan, Thomas L.; Wu, Anna H.
2015-01-01
Background: Several cancer-associated loci identified from genome-wide association studies (GWAS) have been associated with risks of multiple cancer sites, suggesting pleiotropic effects. We investigated whether GWAS-identified risk variants for other common cancers are associated with risk of esophageal adenocarcinoma (EA) or its precursor, Barrett's esophagus (BE). Methods: We examined the associations between risks of EA and BE and 387 single nucleotide polymorphisms (SNPs) that have been associated with risks of other cancers, by using genotype imputation data on 2,163 control participants and 3,885 (1,501 EA and 2,384 BE) case patients from the Barrett's and Esophageal Adenocarcinoma Genetic Susceptibility Study, and investigated effect modification by smoking history, body mass index (BMI), and reflux/heartburn. Results: After correcting for multiple testing, none of the tested 387 SNPs were statistically significantly associated with risk of EA or BE. No evidence of effect modification by smoking, BMI, or reflux/heartburn was observed. Conclusions: Genetic risk variants for common cancers identified from GWAS appear not to be associated with risks of EA or BE. Impact: To our knowledge, this is the first investigation of pleiotropic genetic associations with risks of EA and BE. PMID:26364162
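The abstract does not say which multiple-testing correction was applied. As a rough illustration only, assuming a Bonferroni correction at a family-wise error rate of 0.05, the per-SNP threshold for 387 tests can be compared with the conventional genome-wide threshold, which corresponds to about one million independent tests. The function name and the choice of alpha are assumptions made for this sketch.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests

# 387 candidate SNPs, as in the abstract; alpha = 0.05 is an assumption.
candidate_threshold = bonferroni_threshold(0.05, 387)

# Conventional genome-wide threshold: 0.05 over ~1,000,000 independent tests.
genome_wide_threshold = bonferroni_threshold(0.05, 1_000_000)

print(f"387-SNP threshold:     {candidate_threshold:.2e}")
print(f"genome-wide threshold: {genome_wide_threshold:.0e}")
```

Testing a pre-selected set of variants therefore tolerates much larger P values than a genome-wide scan, which is one reason candidate-set designs like this one are used.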
Genome-wide association study yields variants at 20p12.2 that associate with urinary bladder cancer.
Rafnar, Thorunn; Sulem, Patrick; Thorleifsson, Gudmar; Vermeulen, Sita H; Helgason, Hannes; Saemundsdottir, Jona; Gudjonsson, Sigurjon A; Sigurdsson, Asgeir; Stacey, Simon N; Gudmundsson, Julius; Johannsdottir, Hrefna; Alexiusdottir, Kristin; Petursdottir, Vigdis; Nikulasson, Sigfus; Geirsson, Gudmundur; Jonsson, Thorvaldur; Aben, Katja K H; Grotenhuis, Anne J; Verhaegh, Gerald W; Dudek, Aleksandra M; Witjes, J Alfred; van der Heijden, Antoine G; Vrieling, Alina; Galesloot, Tessel E; De Juan, Ana; Panadero, Angeles; Rivera, Fernando; Hurst, Carolyn; Bishop, D Timothy; Sak, Sei C; Choudhury, Ananya; Teo, Mark T W; Arici, Cecilia; Carta, Angela; Toninelli, Elena; de Verdier, Petra; Rudnai, Peter; Gurzau, Eugene; Koppova, Kvetoslava; van der Keur, Kirstin A; Lurkin, Irene; Goossens, Mieke; Kellen, Eliane; Guarrera, Simonetta; Russo, Alessia; Critelli, Rossana; Sacerdote, Carlotta; Vineis, Paolo; Krucker, Clémentine; Zeegers, Maurice P; Gerullis, Holger; Ovsiannikov, Daniel; Volkert, Frank; Hengstler, Jan G; Selinski, Silvia; Magnusson, Olafur T; Masson, Gisli; Kong, Augustine; Gudbjartsson, Daniel; Lindblom, Annika; Zwarthoff, Ellen; Porru, Stefano; Golka, Klaus; Buntinx, Frank; Matullo, Giuseppe; Kumar, Rajiv; Mayordomo, José I; Steineck, D Gunnar; Kiltie, Anne E; Jonsson, Eirikur; Radvanyi, François; Knowles, Margaret A; Thorsteinsdottir, Unnur; Kiemeney, Lambertus A; Stefansson, Kari
2014-10-15
Genome-wide association studies (GWAS) of urinary bladder cancer (UBC) have yielded common variants at 12 loci that associate with risk of the disease. We report here the results of a GWAS of UBC including 1670 UBC cases and 90 180 controls, followed by replication analysis in additional 5266 UBC cases and 10 456 controls. We tested a dataset containing 34.2 million variants, generated by imputation based on whole-genome sequencing of 2230 Icelanders. Several correlated variants at 20p12, represented by rs62185668, show genome-wide significant association with UBC after combining discovery and replication results (OR = 1.19, P = 1.5 × 10(-11) for rs62185668-A, minor allele frequency = 23.6%). The variants are located in a non-coding region approximately 300 kb upstream from the JAG1 gene, an important component of the Notch signaling pathways that may be oncogenic or tumor suppressive in several forms of cancer. Our results add to the growing number of UBC risk variants discovered through GWAS. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Kulbrock, Maike; Lehner, Stefanie; Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar
2013-01-01
Equine recurrent uveitis (ERU) is a common eye disease affecting 3–15% of the horse population. A genome-wide association study (GWAS) using the Illumina equine SNP50 bead chip was performed to identify loci conferring risk to ERU. The sample included a total of 144 German warmblood horses. The GWAS showed a significant single nucleotide polymorphism (SNP) on horse chromosome (ECA) 20 at 49.3 Mb, with IL-17A and IL-17F being the closest genes. This locus explained 23% of the phenotypic variance for ERU. A GWAS taking into account the severity of ERU revealed a SNP on ECA18 near the crystallin gene cluster CRYGA-CRYGF. For both genomic regions on ECA18 and 20, significantly associated haplotypes containing the genome-wide significant SNPs could be demonstrated. In conclusion, our results are indicative of a genetic component regulating the possible critical role of IL-17A and IL-17F in the pathogenesis of ERU. The associated SNP on ECA18 may be indicative of cataract formation in the course of ERU. PMID:23977091
Age-related macular degeneration: genome-wide association studies to translation.
Black, James R M; Clark, Simon J
2016-04-01
In recent years, genome-wide association studies (GWAS), which are able to analyze the contribution to disease of genetic variations that are common within a population, have attracted considerable investment. Despite identifying genetic variants for many conditions, they have been criticized for yielding data with minimal clinical utility. However, in this regard, age-related macular degeneration (AMD), the most common form of blindness in the Western world, is a striking exception. Through GWAS, common genetic variants at a number of loci have been discovered. Two loci in particular, including genes of the complement cascade on chromosome 1 and the ARMS2/HTRA1 genes on chromosome 10, have been shown to convey significantly increased susceptibility to developing AMD. Today, although it is possible to screen individuals for a genetic predisposition to the disease, effective interventional strategies for those at risk of developing AMD are scarce. Ongoing research in this area is nonetheless promising. After providing brief overviews of AMD and common disease genetics, we outline the main recent advances in the understanding of AMD, particularly those made through GWAS. Finally, the true merit of these findings and their current and potential translational value is examined. Genet Med 18(4), 283-289.
Yang, Cheng-Hong; Chuang, Li-Yeh; Lin, Yu-Da
2017-08-01
Detecting epistatic interactions in genome-wide association studies (GWAS) is a computational challenge. The huge number of single-nucleotide polymorphism (SNP) combinations prevents some powerful algorithms from being applied to detect potential epistasis in large-scale SNP datasets. We propose a new algorithm, termed DECMDR, which combines the differential evolution (DE) algorithm with classification-based multifactor-dimensionality reduction (CMDR). DECMDR uses CMDR as a fitness measure to evaluate candidate solutions in the DE process when scanning for potential statistical epistasis in GWAS. The results indicated that DECMDR outperforms existing algorithms in terms of detection success rate on large simulated datasets and on real data obtained from the Wellcome Trust Case Control Consortium. In the running time comparison, DECMDR could efficiently apply CMDR to detect significant associations between cases and controls amongst all possible SNP combinations in GWAS. DECMDR is freely available at https://goo.gl/p9sLuJ. Contact: chuang@isu.edu.tw or e0955767257@yahoo.com.tw. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
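To give a sense of the scale that makes exhaustive epistasis scans impractical, the number of k-way SNP combinations is the binomial coefficient C(n, k). The SNP count used below is a hypothetical round figure, not one taken from the study, and the function name is introduced only for this illustration.

```python
from math import comb

def n_snp_combinations(n_snps: int, order: int) -> int:
    """Number of distinct SNP sets of the given interaction order."""
    return comb(n_snps, order)

n_snps = 500_000  # hypothetical genotyping-array size

pairs = n_snp_combinations(n_snps, 2)
triples = n_snp_combinations(n_snps, 3)

print(f"pairwise combinations:  {pairs:,}")
print(f"three-way combinations: {triples:.3e}")
```

Pairwise tests already number in the hundreds of billions at this array size, which is why heuristic searches such as differential evolution are used in place of full enumeration.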
A genome-wide association study of breast cancer in women of African ancestry
Chen, Fang; Chen, Gary K.; Stram, Daniel O.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Palmer, Julie R.; Hu, Jennifer J.; Rebbeck, Tim R.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Ruiz-Narvaez, Edward A.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; DeMichele, Angela; Chanock, Stephen J.; Blot, William; Signorello, Lisa; Cai, Qiuyin; Li, Guoliang; Long, Jirong; Huo, Dezheng; Zheng, Yonglan; Cox, Nancy J.; Olopade, Olufunmilayo I.; Ogundiran, Temidayo O.; Adebamowo, Clement; Nathanson, Katherine L.; Domchek, Susan M.; Simon, Michael S.; Hennis, Anselm; Nemesure, Barbara; Wu, Suh-Yuh; Leske, M. Cristina; Ambs, Stefan; Hutter, Carolyn M.; Young, Alicia; Kooperberg, Charles; Peters, Ulrike; Rhie, Suhn K.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Christopher A.
2013-01-01
Genome-wide association studies (GWAS) in diverse populations are needed to reveal variants that are more common and/or limited to defined populations. We conducted a GWAS of breast cancer in women of African ancestry, with genotyping of > 1,000,000 SNPs in 3,153 African American cases and 2,831 controls, and replication testing of the top 66 associations in an additional 3,607 breast cancer cases and 11,330 controls of African ancestry. Two of the 66 SNPs replicated (p < 0.05) in stage 2, which reached statistical significance levels of 10−6 and 10−5 in the stage 1 and 2 combined analysis (rs4322600 at chromosome 14q31: OR = 1.18, p = 4.3×10−6; rs10510333 at chromosome 3p26: OR = 1.15, p = 1.5×10−5). These suggestive risk loci have not been identified in previous GWAS in other populations and will need to be examined in additional samples. Identification of novel risk variants for breast cancer in women of African ancestry will demand testing of a substantially larger set of markers from stage 1 in a larger replication sample. PMID:22923054
Cook, James P; Mahajan, Anubha; Morris, Andrew P
2017-02-01
Linear mixed models are increasingly used for the analysis of genome-wide association studies (GWAS) of binary phenotypes because they can efficiently and robustly account for population stratification and relatedness through inclusion of random effects for a genetic relationship matrix. However, the utility of linear (mixed) models in the context of meta-analysis of GWAS of binary phenotypes has not been previously explored. In this investigation, we present simulations to compare the performance of linear and logistic regression models under alternative weighting schemes in a fixed-effects meta-analysis framework, considering designs that incorporate variable case-control imbalance, confounding factors and population stratification. Our results demonstrate that linear models can be used for meta-analysis of GWAS of binary phenotypes, without loss of power, even in the presence of extreme case-control imbalance, provided that one of the following schemes is used: (i) effective sample size weighting of Z-scores or (ii) inverse-variance weighting of allelic effect sizes after conversion onto the log-odds scale. Our conclusions thus provide essential recommendations for the development of robust protocols for meta-analysis of binary phenotypes with linear models.
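The two weighting schemes named in this abstract can be sketched as follows. The abstract does not give formulas, so the sketch assumes two common conventions: an effective sample size of 4 / (1/N_cases + 1/N_controls), and conversion of a linear-model effect to the log-odds scale by dividing by phi * (1 - phi), where phi is the case fraction. All function names and input numbers are hypothetical.

```python
import math

def effective_n(n_cases: float, n_controls: float) -> float:
    """Effective sample size of a case-control study (assumed convention)."""
    return 4.0 / (1.0 / n_cases + 1.0 / n_controls)

def z_score_meta(z_scores, n_effs):
    """Scheme (i): Z-scores weighted by the square root of effective N."""
    weights = [math.sqrt(n) for n in n_effs]
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    return numerator / math.sqrt(sum(w * w for w in weights))

def to_log_odds(beta_linear: float, se_linear: float, case_fraction: float):
    """Approximate conversion of a linear-model effect to the log-odds scale."""
    scale = case_fraction * (1.0 - case_fraction)
    return beta_linear / scale, se_linear / scale

def inverse_variance_meta(betas, ses):
    """Scheme (ii): fixed-effects inverse-variance weighted estimate and SE."""
    weights = [1.0 / se ** 2 for se in ses]
    beta = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    return beta, math.sqrt(1.0 / sum(weights))

# Hypothetical studies: (cases, controls, linear beta, linear SE).
studies = [(500, 9_500, 0.004, 0.0015), (2_000, 2_000, 0.020, 0.0080)]

n_effs = [effective_n(ca, co) for ca, co, _, _ in studies]
z_meta = z_score_meta([b / se for _, _, b, se in studies], n_effs)

converted = [to_log_odds(b, se, ca / (ca + co)) for ca, co, b, se in studies]
beta_meta, se_meta = inverse_variance_meta(
    [b for b, _ in converted], [se for _, se in converted]
)
print(f"Z (scheme i): {z_meta:.2f}")
print(f"log-OR (scheme ii): {beta_meta:.3f} +/- {se_meta:.3f}")
```

The first hypothetical study is heavily imbalanced (5% cases), so its effective sample size is far below its total of 10,000; that down-weighting is the purpose of scheme (i).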
Evaluating genetic risk for prostate cancer among Japanese and Latinos
Cheng, Iona; Chen, Gary K.; Nakagawa, Hidewaki; He, Jing; Wan, Peggy; Laurie, Cathy; Shen, Jess; Sheng, Xin; Pooler, Loreall C.; Crenshaw, Andrew T.; Mirel, Daniel B.; Takahashi, Atsushi; Kubo, Michiaki; Nakamura, Yusuke; Al Olama, Ali Amin; Benlloch, Sara; Donovan, Jenny L.; Guy, Michelle; Hamdy, Freddie C.; Kote-Jarai, Zsofia; Neal, David E.; Wilkens, Lynne R.; Monroe, Kristine R.; Stram, Daniel O.; Muir, Kenneth; Eeles, Rosalind A.; Easton, Douglas F.; Kolonel, Laurence N.; Henderson, Brian E.; Le Marchand, Loïc; Haiman, Christopher A.
2012-01-01
Background: There have been few genome-wide association studies (GWAS) of prostate cancer among diverse populations. To search for novel prostate cancer risk variants, we conducted GWAS of prostate cancer in Japanese and Latinos. In addition, we tested prostate cancer risk variants and developed genetic risk models of prostate cancer for Japanese and Latinos. Methods: Our first stage GWAS of prostate cancer included Japanese (cases/controls=1,033/1,042) and Latino (cases/controls=1,043/1,057) from the Multiethnic Cohort. Significant associations from stage 1 (P < 1.0×10−4) were examined in silico in GWAS of prostate cancer (stage 2) in Japanese (cases/controls=1,583/3,386) and Europeans (cases/controls=1,854/1,894). Results: No novel stage 1 SNPs outside of known risk regions reached genome-wide significance. For Japanese, in stage 1, the most notable putative novel association was seen with 10 SNPs (P<8.0×10−6) at chromosome 2q33; however, this was not replicated in stage 2. For Latinos, the most significant association was observed with rs17023900 at the known 3p12 risk locus (stage 1: OR=1.45; P=7.01×10−5 and stage 2: OR=1.58; P=3.05×10−7). The majority of the established risk variants for prostate cancer, 79% and 88%, were positively associated with prostate cancer in Japanese and Latinos (stage 1), respectively. The cumulative effects of these variants significantly influence prostate cancer risk (OR per allele=1.10; P = 2.71×10−25 and OR=1.07; P = 1.02×10−16 for Japanese and Latinos, respectively). Conclusion and Impact: Our GWAS of prostate cancer did not identify novel genome-wide significant variants. However, our findings demonstrate that established risk variants for prostate cancer significantly contribute to risk among Japanese and Latinos. PMID:22923026
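As a worked illustration of the per-allele odds ratios reported above, assume a log-additive allele-count risk score in which every additional risk allele multiplies the odds by the per-allele OR. The abstract does not state the model, so this is an assumption, and the difference of 10 risk alleles is a hypothetical choice.

```python
def relative_odds(or_per_allele: float, extra_alleles: int) -> float:
    """Odds ratio between two men who differ by `extra_alleles` risk alleles,
    assuming each allele multiplies the odds by the same factor."""
    return or_per_allele ** extra_alleles

extra = 10  # hypothetical difference in risk-allele count

japanese = relative_odds(1.10, extra)  # per-allele OR from the abstract
latino = relative_odds(1.07, extra)    # per-allele OR from the abstract

print(f"Japanese: {japanese:.2f}-fold odds")  # about 2.59
print(f"Latinos:  {latino:.2f}-fold odds")    # about 1.97
```

Small per-allele effects compound, which is why modest odds ratios can still separate individuals at the extremes of a risk score.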
Mahajan, Anubha; Sim, Xueling; Ng, Hui Jin; Manning, Alisa; Rivas, Manuel A.; Highland, Heather M.; Locke, Adam E.; Grarup, Niels; Im, Hae Kyung; Cingolani, Pablo; Flannick, Jason; Fontanillas, Pierre; Fuchsberger, Christian; Gaulton, Kyle J.; Teslovich, Tanya M.; Rayner, N. William; Robertson, Neil R.; Beer, Nicola L.; Rundle, Jana K.; Bork-Jensen, Jette; Ladenvall, Claes; Blancher, Christine; Buck, David; Buck, Gemma; Burtt, Noël P.; Gabriel, Stacey; Gjesing, Anette P.; Groves, Christopher J.; Hollensted, Mette; Huyghe, Jeroen R.; Jackson, Anne U.; Jun, Goo; Justesen, Johanne Marie; Mangino, Massimo; Murphy, Jacquelyn; Neville, Matt; Onofrio, Robert; Small, Kerrin S.; Stringham, Heather M.; Syvänen, Ann-Christine; Trakalo, Joseph; Abecasis, Goncalo; Bell, Graeme I.; Blangero, John; Cox, Nancy J.; Duggirala, Ravindranath; Hanis, Craig L.; Seielstad, Mark; Wilson, James G.; Christensen, Cramer; Brandslund, Ivan; Rauramaa, Rainer; Surdulescu, Gabriela L.; Doney, Alex S. F.; Lannfelt, Lars; Linneberg, Allan; Isomaa, Bo; Tuomi, Tiinamaija; Jørgensen, Marit E.; Jørgensen, Torben; Kuusisto, Johanna; Uusitupa, Matti; Salomaa, Veikko; Spector, Timothy D.; Morris, Andrew D.; Palmer, Colin N. A.; Collins, Francis S.; Mohlke, Karen L.; Bergman, Richard N.; Ingelsson, Erik; Lind, Lars; Tuomilehto, Jaakko; Hansen, Torben; Watanabe, Richard M.; Prokopenko, Inga; Dupuis, Josee; Karpe, Fredrik; Groop, Leif; Laakso, Markku; Pedersen, Oluf; Florez, Jose C.; Morris, Andrew P.; Altshuler, David; Meigs, James B.; Boehnke, Michael; McCarthy, Mark I.; Lindgren, Cecilia M.; Gloyn, Anna L.
2015-01-01
Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5×10−7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights. PMID:25625282
Evaluating genetic risk for prostate cancer among Japanese and Latinos.
Cheng, Iona; Chen, Gary K; Nakagawa, Hidewaki; He, Jing; Wan, Peggy; Laurie, Cathy C; Shen, Jess; Sheng, Xin; Pooler, Loreall C; Crenshaw, Andrew T; Mirel, Daniel B; Takahashi, Atsushi; Kubo, Michiaki; Nakamura, Yusuke; Al Olama, Ali Amin; Benlloch, Sara; Donovan, Jenny L; Guy, Michelle; Hamdy, Freddie C; Kote-Jarai, Zsofia; Neal, David E; Wilkens, Lynne R; Monroe, Kristine R; Stram, Daniel O; Muir, Kenneth; Eeles, Rosalind A; Easton, Douglas F; Kolonel, Laurence N; Henderson, Brian E; Le Marchand, Loïc; Haiman, Christopher A
2012-11-01
There have been few genome-wide association studies (GWAS) of prostate cancer among diverse populations. To search for novel prostate cancer risk variants, we conducted GWAS of prostate cancer in Japanese and Latinos. In addition, we tested prostate cancer risk variants and developed genetic risk models of prostate cancer for Japanese and Latinos. Our first-stage GWAS of prostate cancer included Japanese (cases/controls = 1,033/1,042) and Latino (cases/controls = 1,043/1,057) from the Multiethnic Cohort (MEC). Significant associations from stage I (P < 1.0 × 10(-4)) were examined in silico in GWAS of prostate cancer (stage II) in Japanese (cases/controls = 1,583/3,386) and Europeans (cases/controls = 1,854/1,894). No novel stage I single-nucleotide polymorphism (SNP) outside of known risk regions reached genome-wide significance. For Japanese, in stage I, the most notable putative novel association was seen with 10 SNPs (P ≤ 8.0 × 10(-6)) at chromosome 2q33; however, this was not replicated in stage II. For Latinos, the most significant association was observed with rs17023900 at the known 3p12 risk locus (stage I: OR = 1.45; P = 7.01 × 10(-5) and stage II: OR = 1.58; P = 3.05 × 10(-7)). The majority of the established risk variants for prostate cancer, 79% and 88%, were positively associated with prostate cancer in Japanese and Latinos (stage I), respectively. The cumulative effects of these variants significantly influence prostate cancer risk (OR per allele = 1.10; P = 2.71 × 10(-25) and OR = 1.07; P = 1.02 × 10(-16) for Japanese and Latinos, respectively). Our GWAS of prostate cancer did not identify novel genome-wide significant variants. However, our findings show that established risk variants for prostate cancer significantly contribute to risk among Japanese and Latinos. ©2012 AACR.
Sun, Chengming; Wang, Benqi; Yan, Lei; Hu, Kaining; Liu, Sheng; Zhou, Yongming; Guan, Chunyun; Zhang, Zhenqian; Li, Jiana; Zhang, Jiefu; Chen, Song; Wen, Jing; Ma, Chaozhi; Tu, Jinxing; Shen, Jinxiong; Fu, Tingdong; Yi, Bin
2016-01-01
Plant height is a key morphological trait of rapeseed. In this study, we measured plant height of a rapeseed population across six environments. This population contains 476 inbred lines representing the major Chinese rapeseed genepool and 44 lines from other countries. The 60K Brassica Infinium® SNP array was utilized to genotype the association panel. A genome-wide association study (GWAS) was performed via three methods, including a robust, novel, nonparametric Anderson-Darling (A-D) test. Consequently, 68 loci were identified as significantly associated with plant height (P < 5.22 × 10⁻⁵), and more than 70% of the loci (48) overlapped the confidence intervals of reported QTLs from nine mapping populations. Moreover, 24 GWAS loci were detected with selective sweep signals, which reflected the signatures of historical semi-dwarf breeding. In the linkage disequilibrium (LD) decay range up- and downstream of 65 loci (r² > 0.1), we found plausible candidates orthologous to the documented Arabidopsis genes involved in height regulation. One significant association found by GWAS colocalized with the established height locus BnRGA in rapeseed. Our results provide insights into the genetic basis of plant height in rapeseed and may facilitate marker-based breeding.
Renin-Angiotensin System Gene Variants and Type 2 Diabetes Mellitus: Influence of Angiotensinogen
Joyce-Tan, Siew Mei; Zain, Shamsul Mohd; Abdul Sattar, Munavvar Zubaid; Abdullah, Nor Azizan
2016-01-01
Genome-wide association studies (GWAS) have been successfully used to identify variants associated with diseases including type 2 diabetes mellitus (T2DM). However, some variants are excluded from GWAS to limit the penalty of multiple hypothesis testing. Thus, the candidate gene approach remains useful even in the GWAS era. This study attempted to assess whether genetic variations in the renin-angiotensin system (RAS) and their gene interactions are associated with T2DM risk. We genotyped 290 T2DM patients and 267 controls for variants in three genes of the RAS, namely, angiotensin converting enzyme (ACE), angiotensinogen (AGT), and angiotensin II type 1 receptor (AGTR1). There were significant differences in allele frequencies between cases and controls for AGT variants (P = 0.05) but not for ACE and AGTR1. Haplotype TCG of the AGT was associated with increased risk of T2DM (OR 1.92, 95% CI 1.15–3.20, permuted P = 0.012); however, no evidence of significant gene-gene interactions was seen. Nonetheless, our analysis revealed that the AGT variants were independently associated with T2DM. Thus, this study suggests that genetic variants of the RAS can modestly influence the T2DM risk. PMID:26682227
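As a reading aid for the haplotype result above (OR 1.92, 95% CI 1.15–3.20), the following minimal sketch shows how a reported odds ratio and its confidence interval can be converted back into a standard error and an approximate Wald p-value. This is an illustration under a normal approximation on the log-odds scale, not the permutation procedure the abstract reports; the function name `wald_summary` is hypothetical.

```python
import math


def wald_summary(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Recover SE, z and a two-sided p-value from an OR and its 95% CI.

    Assumes the CI is symmetric on the log scale (Wald interval).
    """
    log_or = math.log(odds_ratio)
    # Width of the CI on the log scale equals 2 * z_crit * SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = log_or / se
    # Two-sided tail probability of the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p


# Values taken from the abstract; the output is only an approximation.
se, z, p = wald_summary(1.92, 1.15, 3.20)
```

Under these assumptions the approximate p-value lands near 0.01, the same order of magnitude as the permuted P in the abstract, although the two are computed differently.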
Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M
2012-01-01
Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provides guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.
Pathway-Based Kernel Boosting for the Analysis of Genome-Wide Association Studies
Manitz, Juliane; Burger, Patricia; Amos, Christopher I.; Chang-Claude, Jenny; Wichmann, Heinz-Erich; Kneib, Thomas; Bickeböller, Heike
2017-01-01
The analysis of genome-wide association studies (GWAS) benefits from the investigation of biologically meaningful gene sets, such as gene-interaction networks (pathways). We propose an extension to a successful kernel-based pathway analysis approach by integrating kernel functions into a powerful algorithmic framework for variable selection, to enable investigation of multiple pathways simultaneously. We employ genetic similarity kernels from the logistic kernel machine test (LKMT) as base-learners in a boosting algorithm. A model to explain case-control status is created iteratively by selecting pathways that improve its prediction ability. We evaluated our method in simulation studies adopting 50 pathways for different sample sizes and genetic effect strengths. Additionally, we included an exemplary application of kernel boosting to a rheumatoid arthritis and a lung cancer dataset. Simulations indicate that kernel boosting outperforms the LKMT in certain genetic scenarios. Applications to GWAS data on rheumatoid arthritis and lung cancer resulted in sparse models which were based on pathways interpretable in a clinical sense. Kernel boosting is highly flexible in terms of considered variables and overcomes the problem of multiple testing. Additionally, it enables the prediction of clinical outcomes. Thus, kernel boosting constitutes a new, powerful tool in the analysis of GWAS data and towards the understanding of biological processes involved in disease susceptibility. PMID:28785300
Pathway-Based Kernel Boosting for the Analysis of Genome-Wide Association Studies.
Friedrichs, Stefanie; Manitz, Juliane; Burger, Patricia; Amos, Christopher I; Risch, Angela; Chang-Claude, Jenny; Wichmann, Heinz-Erich; Kneib, Thomas; Bickeböller, Heike; Hofner, Benjamin
2017-01-01
The analysis of genome-wide association studies (GWAS) benefits from the investigation of biologically meaningful gene sets, such as gene-interaction networks (pathways). We propose an extension to a successful kernel-based pathway analysis approach by integrating kernel functions into a powerful algorithmic framework for variable selection, to enable investigation of multiple pathways simultaneously. We employ genetic similarity kernels from the logistic kernel machine test (LKMT) as base-learners in a boosting algorithm. A model to explain case-control status is created iteratively by selecting pathways that improve its prediction ability. We evaluated our method in simulation studies adopting 50 pathways for different sample sizes and genetic effect strengths. Additionally, we included an exemplary application of kernel boosting to a rheumatoid arthritis and a lung cancer dataset. Simulations indicate that kernel boosting outperforms the LKMT in certain genetic scenarios. Applications to GWAS data on rheumatoid arthritis and lung cancer resulted in sparse models which were based on pathways interpretable in a clinical sense. Kernel boosting is highly flexible in terms of considered variables and overcomes the problem of multiple testing. Additionally, it enables the prediction of clinical outcomes. Thus, kernel boosting constitutes a new, powerful tool in the analysis of GWAS data and towards the understanding of biological processes involved in disease susceptibility.
Nature vs. nurture in human sociality: multi-level genomic analyses of social conformity.
Chen, Biqing; Zhu, Zijian; Wang, Yingying; Ding, Xiaohu; Guo, Xiaobo; He, Mingguang; Fang, Wan; Zhou, Qin; Zhou, Shanbi; Lei, Han; Huang, Ailong; Chen, Tingmei; Ni, Dongsheng; Gu, Yuping; Liu, Jianing; Rao, Yi
2018-05-01
Social conformity is fundamental to human societies and has been studied for more than six decades, but our understanding of its mechanisms remains limited. Individual differences in conformity have been attributed to social and cultural environmental influences, but not to genes. Here we demonstrate a genetic contribution to conformity after analyzing 1,140 twins and single-nucleotide polymorphism (SNP)-based studies of 2,130 young adults. A two-step genome-wide association study (GWAS) revealed replicable associations in 9 genomic loci, and a meta-analysis of three GWAS with a sample size of ~2,600 further confirmed one locus, corresponding to the NAV3 (Neuron Navigator 3) gene which encodes a protein important for axon outgrowth and guidance. Further multi-level (haplotype, gene, pathway) GWAS strongly associated genes including NAV3, PTPRD (protein tyrosine phosphatase receptor type D), ARL10 (ADP ribosylation factor-like GTPase 10), and CTNND2 (catenin delta 2), with conformity. Magnetic resonance imaging of 64 subjects shows correlation of activation or structural features of brain regions with the SNPs of these genes, supporting their functional significance. Our results suggest potential moderate genetic influence on conformity, implicate several specific genetic elements in conformity and will facilitate further research on cellular and molecular mechanisms underlying human conformity.
Pérez-Palma, Eduardo; Bustos, Bernabé I; Villamán, Camilo F; Alarcón, Marcelo A; Avila, Miguel E; Ugarte, Giorgia D; Reyes, Ariel E; Opazo, Carlos; De Ferrari, Giancarlo V
2014-01-01
Genome-wide association studies (GWAS) have successfully identified several risk loci for Alzheimer's disease (AD). Nonetheless, these loci do not explain the entire susceptibility of the disease, suggesting that other genetic contributions remain to be identified. Here, we performed a meta-analysis combining data of 4,569 individuals (2,540 cases and 2,029 healthy controls) derived from three publicly available GWAS in AD and replicated a broad genomic region (>248,000 bp) associated with the disease near the APOE/TOMM40 locus on chromosome 19. To detect minor effect size contributions that could help to explain the remaining genetic risk, we conducted network-based pathway analyses either by extracting gene-wise p-values (GW), defined as the single strongest association signal within a gene, or by calculating a more stringent gene-based association p-value using the extended Simes (GATES) procedure. Comparison of these strategies revealed that ontological sub-networks (SNs) involved in glutamate signaling were significantly overrepresented in AD (p < 2.7×10⁻¹¹, p < 1.9×10⁻¹¹; GW and GATES, respectively). Notably, glutamate signaling SNs were also found to be significantly overrepresented (p < 5.1×10⁻⁸) in the Alzheimer's Disease Neuroimaging Initiative (ADNI) study, which was used as a targeted replication sample. Interestingly, components of the glutamate signaling SNs are coordinately expressed in disease-related tissues, which are tightly related to known pathological hallmarks of AD. Our findings suggest that genetic variation within glutamate signaling contributes to the remaining genetic risk of AD and support the notion that functional biological networks should be targeted in future therapies aimed to prevent or treat this devastating neurological disorder.
Aebi, Marcel; van Donkelaar, Marjolein M J; Poelmans, Geert; Buitelaar, Jan K; Sonuga-Barke, Edmund J S; Stringaris, Argyris; Consortium, Image; Faraone, Stephen V; Franke, Barbara; Steinhausen, Hans-Christoph; van Hulzen, Kimm J E
2016-07-01
Oppositional defiant disorder (ODD) is a frequent psychiatric disorder seen in children and adolescents with attention-deficit-hyperactivity disorder (ADHD). ODD is also a common antecedent to both affective disorders and aggressive behaviors. Although the heritability of ODD has been estimated to be around 0.60, there has been little research into the molecular genetics of ODD. The present study examined the association of irritable and defiant/vindictive dimensions and categorical subtypes of ODD (based on latent class analyses) with previously described specific polymorphisms (DRD4 exon3 VNTR, 5-HTTLPR, and seven OXTR SNPs) as well as with dopamine, serotonin, and oxytocin genes and pathways in a clinical sample of children and adolescents with ADHD. In addition, we performed a multivariate genome-wide association study (GWAS) of the aforementioned ODD dimensions and subtypes. Apart from adjusting the analyses for age and sex, we controlled for "parental ability to cope with disruptive behavior." None of the hypothesis-driven analyses revealed a significant association with ODD dimensions and subtypes. Inadequate parenting behavior was significantly associated with all ODD dimensions and subtypes, most strongly with defiant/vindictive behaviors. In addition, the GWAS did not result in genome-wide significant findings but bioinformatics and literature analyses revealed that the proteins encoded by 28 of the 53 top-ranked genes functionally interact in a molecular landscape centered around Beta-catenin signaling and involved in the regulation of neurite outgrowth. Our findings provide new insights into the molecular basis of ODD and inform future genetic studies of oppositional behavior. © 2015 The Authors. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics Published by Wiley Periodicals, Inc.
Wild, Philipp S.; Felix, Janine F.; Schillert, Arne; Chen, Ming-Huei; Leening, Maarten J.G.; Völker, Uwe; Großmann, Vera; Brody, Jennifer A.; Irvin, Marguerite R.; Shah, Sanjiv J.; Pramana, Setia; Lieb, Wolfgang; Schmidt, Reinhold; Stanton, Alice V.; Malzahn, Dörthe; Lyytikäinen, Leo-Pekka; Tiller, Daniel; Smith, J. Gustav; Di Tullio, Marco R.; Musani, Solomon K.; Morrison, Alanna C.; Pers, Tune H.; Morley, Michael; Kleber, Marcus E.; Aragam, Jayashri; Bis, Joshua C.; Bisping, Egbert; Broeckel, Ulrich; Cheng, Susan; Deckers, Jaap W.; Del Greco M, Fabiola; Edelmann, Frank; Fornage, Myriam; Franke, Lude; Friedrich, Nele; Harris, Tamara B.; Hofer, Edith; Hofman, Albert; Huang, Jie; Hughes, Alun D.; Kähönen, Mika; investigators, KNHI; Kruppa, Jochen; Lackner, Karl J.; Lannfelt, Lars; Laskowski, Rafael; Launer, Lenore J.; Lindgren, Cecilia M.; Loley, Christina; Mayet, Jamil; Medenwald, Daniel; Morris, Andrew P.; Müller, Christian; Müller-Nurasyid, Martina; Nappo, Stefania; Nilsson, Peter M.; Nuding, Sebastian; Nutile, Teresa; Peters, Annette; Pfeufer, Arne; Pietzner, Diana; Pramstaller, Peter P.; Raitakari, Olli T.; Rice, Kenneth M.; Rotter, Jerome I.; Ruohonen, Saku T.; Sacco, Ralph L.; Samdarshi, Tandaw E.; Sharp, Andrew S.P.; Shields, Denis C.; Sorice, Rossella; Sotoodehnia, Nona; Stricker, Bruno H.; Surendran, Praveen; Töglhofer, Anna M.; Uitterlinden, André G.; Völzke, Henry; Ziegler, Andreas; Münzel, Thomas; März, Winfried; Cappola, Thomas P.; Hirschhorn, Joel N.; Mitchell, Gary F.; Smith, Nicholas L.; Fox, Ervin R.; Dueker, Nicole D.; Jaddoe, Vincent W.V.; Melander, Olle; Lehtimäki, Terho; Ciullo, Marina; Hicks, Andrew A.; Lind, Lars; Gudnason, Vilmundur; Pieske, Burkert; Barron, Anthony J.; Zweiker, Robert; Schunkert, Heribert; Ingelsson, Erik; Liu, Kiang; Arnett, Donna K.; Psaty, Bruce M.; Blankenberg, Stefan; Larson, Martin G.; Felix, Stephan B.; Franco, Oscar H.; Zeller, Tanja; Vasan, Ramachandran S.; Dörr, Marcus
2017-01-01
BACKGROUND. Understanding the genetic architecture of cardiac structure and function may help to prevent and treat heart disease. This investigation sought to identify common genetic variations associated with inter-individual variability in cardiac structure and function. METHODS. A GWAS meta-analysis of echocardiographic traits was performed, including 46,533 individuals from 30 studies (EchoGen consortium). The analysis included 16 traits of left ventricular (LV) structure, and systolic and diastolic function. RESULTS. The discovery analysis included 21 cohorts for structural and systolic function traits (n = 32,212) and 17 cohorts for diastolic function traits (n = 21,852). Replication was performed in 5 cohorts (n = 14,321) and 6 cohorts (n = 16,308), respectively. Besides 5 previously reported loci, the combined meta-analysis identified 10 additional genome-wide significant SNPs: rs12541595 near MTSS1 and rs10774625 in ATXN2 for LV end-diastolic internal dimension; rs806322 near KCNRG, rs4765663 in CACNA1C, rs6702619 near PALMD, rs7127129 in TMEM16A, rs11207426 near FGGY, rs17608766 in GOSR2, and rs17696696 in CFDP1 for aortic root diameter; and rs12440869 in IQCH for Doppler transmitral A-wave peak velocity. Findings were in part validated in other cohorts and in GWAS of related disease traits. The genetic loci showed associations with putative signaling pathways, and with gene expression in whole blood, monocytes, and myocardial tissue. CONCLUSION. The additional genetic loci identified in this large meta-analysis of cardiac structure and function provide insights into the underlying genetic architecture of cardiac structure and warrant follow-up in future functional studies. FUNDING. For detailed information per study, see Acknowledgments. PMID:28394258
Polygenic risk of Alzheimer disease is associated with early- and late-life processes.
Mormino, Elizabeth C; Sperling, Reisa A; Holmes, Avram J; Buckner, Randy L; De Jager, Philip L; Smoller, Jordan W; Sabuncu, Mert R
2016-08-02
To examine associations between aggregate genetic risk and Alzheimer disease (AD) markers in stages preceding the clinical symptoms of dementia using data from 2 large observational cohort studies. We computed polygenic risk scores (PGRS) using summary statistics from the International Genomics of Alzheimer's Project genome-wide association study of AD. Associations between PGRS and AD markers (cognitive decline, clinical progression, hippocampus volume, and β-amyloid) were assessed within older participants without dementia. Associations between PGRS and hippocampus volume were additionally examined within healthy younger participants (age 18-35 years). Within participants without dementia, elevated PGRS was associated with worse memory (p = 0.002) and smaller hippocampus (p = 0.002) at baseline, as well as greater longitudinal cognitive decline (memory: p = 0.0005, executive function: p = 0.01) and clinical progression (p < 0.00001). High PGRS was associated with AD-like levels of β-amyloid burden as measured with florbetapir PET (p = 0.03) but did not reach statistical significance for CSF β-amyloid (p = 0.11). Within the younger group, higher PGRS was associated with smaller hippocampus volume (p = 0.05). This pattern was evident when examining a PGRS that included many loci below the genome-wide association study (GWAS)-level significance threshold (16,123 single nucleotide polymorphisms), but not when PGRS was restricted to GWAS-level significant loci (18 single nucleotide polymorphisms). Effects related to common genetic risk loci distributed throughout the genome are detectable among individuals without dementia. The influence of this genetic risk may begin in early life and make an individual more susceptible to cognitive impairment in late life. Future refinement of polygenic risk scores may help identify individuals at risk for AD dementia. © 2016 American Academy of Neurology.
Villamán, Camilo F.; Alarcón, Marcelo A.; Avila, Miguel E.; Ugarte, Giorgia D.; Reyes, Ariel E.; Opazo, Carlos; De Ferrari, Giancarlo V.
2014-01-01
Genome-wide association studies (GWAS) have successfully identified several risk loci for Alzheimer's disease (AD). Nonetheless, these loci do not explain the entire susceptibility of the disease, suggesting that other genetic contributions remain to be identified. Here, we performed a meta-analysis combining data of 4,569 individuals (2,540 cases and 2,029 healthy controls) derived from three publicly available GWAS in AD and replicated a broad genomic region (>248,000 bp) associated with the disease near the APOE/TOMM40 locus on chromosome 19. To detect minor effect size contributions that could help to explain the remaining genetic risk, we conducted network-based pathway analyses either by extracting gene-wise p-values (GW), defined as the single strongest association signal within a gene, or by calculating a more stringent gene-based association p-value using the extended Simes (GATES) procedure. Comparison of these strategies revealed that ontological sub-networks (SNs) involved in glutamate signaling were significantly overrepresented in AD (p < 2.7×10⁻¹¹, p < 1.9×10⁻¹¹; GW and GATES, respectively). Notably, glutamate signaling SNs were also found to be significantly overrepresented (p < 5.1×10⁻⁸) in the Alzheimer's Disease Neuroimaging Initiative (ADNI) study, which was used as a targeted replication sample. Interestingly, components of the glutamate signaling SNs are coordinately expressed in disease-related tissues, which are tightly related to known pathological hallmarks of AD. Our findings suggest that genetic variation within glutamate signaling contributes to the remaining genetic risk of AD and support the notion that functional biological networks should be targeted in future therapies aimed to prevent or treat this devastating neurological disorder. PMID:24755620
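The two gene-level summaries contrasted in the abstract above can be illustrated with a small sketch. The gene-wise p-value is simply the smallest SNP p-value in the gene. The second function implements the classical Simes combination, which is the starting point of GATES; it does not include the linkage-disequilibrium correction (effective number of tests) that the extended procedure adds, so it should be read as a simplified stand-in. The function names and example p-values are hypothetical.

```python
def gene_wise_p(snp_pvalues):
    """Gene-wise p-value: the single strongest SNP signal within the gene."""
    return min(snp_pvalues)


def simes_p(snp_pvalues):
    """Classical Simes combined p-value for one gene.

    With m ordered p-values p(1) <= ... <= p(m), the combined value is
    min over j of m * p(j) / j. GATES replaces m and j with LD-adjusted
    effective numbers of tests, which is NOT done here.
    """
    ordered = sorted(snp_pvalues)
    m = len(ordered)
    return min(m * p / rank for rank, p in enumerate(ordered, start=1))


# Hypothetical SNP p-values for one gene.
example = [0.01, 0.04, 0.03]
gw = gene_wise_p(example)
simes = simes_p(example)
```

For these made-up values the Simes result is larger than the minimum p-value, which is consistent with the abstract's description of the gene-based approach as the more stringent of the two.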
van Donkelaar, Marjolein M. J.; Poelmans, Geert; Buitelaar, Jan K.; Sonuga‐Barke, Edmund J. S.; Stringaris, Argyris; consortium, IMAGE; Faraone, Stephen V.; Franke, Barbara; Steinhausen, Hans‐Christoph; van Hulzen, Kimm J. E.
2015-01-01
Oppositional defiant disorder (ODD) is a frequent psychiatric disorder seen in children and adolescents with attention‐deficit‐hyperactivity disorder (ADHD). ODD is also a common antecedent to both affective disorders and aggressive behaviors. Although the heritability of ODD has been estimated to be around 0.60, there has been little research into the molecular genetics of ODD. The present study examined the association of irritable and defiant/vindictive dimensions and categorical subtypes of ODD (based on latent class analyses) with previously described specific polymorphisms (DRD4 exon3 VNTR, 5‐HTTLPR, and seven OXTR SNPs) as well as with dopamine, serotonin, and oxytocin genes and pathways in a clinical sample of children and adolescents with ADHD. In addition, we performed a multivariate genome‐wide association study (GWAS) of the aforementioned ODD dimensions and subtypes. Apart from adjusting the analyses for age and sex, we controlled for “parental ability to cope with disruptive behavior.” None of the hypothesis‐driven analyses revealed a significant association with ODD dimensions and subtypes. Inadequate parenting behavior was significantly associated with all ODD dimensions and subtypes, most strongly with defiant/vindictive behaviors. In addition, the GWAS did not result in genome‐wide significant findings but bioinformatics and literature analyses revealed that the proteins encoded by 28 of the 53 top‐ranked genes functionally interact in a molecular landscape centered around Beta‐catenin signaling and involved in the regulation of neurite outgrowth. Our findings provide new insights into the molecular basis of ODD and inform future genetic studies of oppositional behavior. © 2015 The Authors. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics Published by Wiley Periodicals, Inc. PMID:26184070
Saowaphak, P; Duangjinda, M; Plaengkaeo, S; Suwannasing, R; Boonkum, W
2017-06-29
In this study, we estimated the genetic parameters and identified the putative quantitative trait loci (QTL) associated with the length of productive life (LPL), days open (DO), and 305-day milk yield for the first lactation (FM305) of crossbred Holstein dairy cattle. Data comprising 4,739 records collected between 1986 and 2004 were used to estimate the variance-covariance components using the multiple-trait animal linear mixed models based on the average information restricted maximum likelihood (AI-REML) algorithm. Thirty-six animals were genotyped using the Illumina BovineSNP50 Bead Chip [>50,000 single nucleotide polymorphisms (SNPs)] to identify the putative QTL in a genome-wide association study. The heritability of the production trait FM305 was 0.25 and that of the functional traits, LPL and DO, was low (0.10 and 0.06, respectively). The genetic correlation estimates demonstrated favorable negative correlations between LPL and DO (-0.02). However, we observed a favorable positive correlation between FM305 and LPL (0.43) and an unfavorable positive correlation between FM305 and DO (0.1). The GWAS results indicated that 23 QTLs on bovine chromosomes 1, 4, 5, 8, 15, 26, and X were associated with the traits of interest, and the putative QTL regions were identified within seven genes (SYT1, DOCK11, KLHL13, IL13RA1, PRKG1, GNA14, and LRRC4C). In conclusion, the heritability estimates of the LPL and DO were low. Therefore, the approach of multiple-trait selection indexes should be applied, and the QTL identified here should be considered for use in marker-assisted selection in the future.
Polygenic risk of Alzheimer disease is associated with early- and late-life processes
Sperling, Reisa A.; Holmes, Avram J.; Buckner, Randy L.; De Jager, Philip L.; Smoller, Jordan W.; Sabuncu, Mert R.
2016-01-01
Objective: To examine associations between aggregate genetic risk and Alzheimer disease (AD) markers in stages preceding the clinical symptoms of dementia using data from 2 large observational cohort studies. Methods: We computed polygenic risk scores (PGRS) using summary statistics from the International Genomics of Alzheimer's Project genome-wide association study of AD. Associations between PGRS and AD markers (cognitive decline, clinical progression, hippocampus volume, and β-amyloid) were assessed within older participants without dementia. Associations between PGRS and hippocampus volume were additionally examined within healthy younger participants (age 18–35 years). Results: Within participants without dementia, elevated PGRS was associated with worse memory (p = 0.002) and smaller hippocampus (p = 0.002) at baseline, as well as greater longitudinal cognitive decline (memory: p = 0.0005, executive function: p = 0.01) and clinical progression (p < 0.00001). High PGRS was associated with AD-like levels of β-amyloid burden as measured with florbetapir PET (p = 0.03) but did not reach statistical significance for CSF β-amyloid (p = 0.11). Within the younger group, higher PGRS was associated with smaller hippocampus volume (p = 0.05). This pattern was evident when examining a PGRS that included many loci below the genome-wide association study (GWAS)–level significance threshold (16,123 single nucleotide polymorphisms), but not when PGRS was restricted to GWAS-level significant loci (18 single nucleotide polymorphisms). Conclusions: Effects related to common genetic risk loci distributed throughout the genome are detectable among individuals without dementia. The influence of this genetic risk may begin in early life and make an individual more susceptible to cognitive impairment in late life. Future refinement of polygenic risk scores may help identify individuals at risk for AD dementia. PMID:27385740
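The contrast drawn above between a score built from many sub-threshold loci and one restricted to GWAS-significant loci can be sketched as follows. This is a generic weighted-sum polygenic score, assuming per-SNP effect sizes (for example log odds ratios) and p-values from GWAS summary statistics; it is not the authors' pipeline, and the function name, inputs, and thresholds are hypothetical.

```python
def polygenic_risk_score(dosages, weights, pvalues, p_threshold):
    """Weighted sum of risk-allele dosages over SNPs passing a p-value cutoff.

    dosages: risk-allele counts (0, 1 or 2) for one individual
    weights: per-SNP effect sizes from GWAS summary statistics
    pvalues: per-SNP GWAS p-values used to select which SNPs enter the score
    """
    return sum(
        dosage * weight
        for dosage, weight, p in zip(dosages, weights, pvalues)
        if p <= p_threshold
    )


# Made-up data for one individual and three SNPs.
dosages = [2, 1, 0]
weights = [0.5, -0.2, 0.3]
pvalues = [1e-9, 0.01, 0.2]

strict = polygenic_risk_score(dosages, weights, pvalues, 5e-8)   # GWAS-significant SNPs only
liberal = polygenic_risk_score(dosages, weights, pvalues, 0.05)  # also includes sub-threshold SNPs
```

Relaxing the threshold changes which SNPs contribute, which is why the two kinds of score in the abstract can show different associations with the same outcome.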
Pathak, Jyotishman; Kiefer, Richard C.; Chute, Christopher G.
2012-01-01
The ability to conduct genome-wide association studies (GWAS) has enabled new exploration of how genetic variations contribute to health and disease etiology. One of the key requirements to perform GWAS is the identification of subject cohorts with accurate classification of disease phenotypes. In this work, we study how emerging Semantic Web technologies can be applied in conjunction with clinical data stored in electronic health records (EHRs) to accurately identify subjects with specific diseases for inclusion in cohort studies. In particular, we demonstrate the role of using Resource Description Framework (RDF) for representing EHR data and enabling federated querying and inferencing via standardized Web protocols for identifying subjects with Diabetes Mellitus. Our study highlights the potential of using Web-scale data federation approaches to execute complex queries. PMID:22779040
Gonzalez-Pena, Dianelys; Gao, Guangtu; Baranski, Matthew; Moen, Thomas; Cleveland, Beth M; Kenney, P Brett; Vallejo, Roger L; Palti, Yniv; Leeds, Timothy D
2016-01-01
Fillet yield (FY, %) is an economically-important trait in rainbow trout aquaculture that affects production efficiency. Despite that, FY has received little attention in breeding programs because it is difficult to measure on a large number of fish and cannot be directly measured on breeding candidates. The recent development of a high-density SNP array for rainbow trout has provided the needed tool for studying the underlying genetic architecture of this trait. A genome-wide association study (GWAS) was conducted for FY, body weight at 10 (BW10) and 13 (BW13) months post-hatching, head-off carcass weight (CAR), and fillet weight (FW) in a pedigreed rainbow trout population selectively bred for improved growth performance. The GWAS analysis was performed using the weighted single-step GBLUP method (wssGWAS). Phenotypic records of 1447 fish (1.5 kg at harvest) from 299 full-sib families in three successive generations, of which 875 fish from 196 full-sib families were genotyped, were used in the GWAS analysis. A total of 38,107 polymorphic SNPs were analyzed in a univariate model with hatch year and harvest group as fixed effects, harvest weight as a continuous covariate, and animal and common environment as random effects. A new linkage map was developed to create windows of 20 adjacent SNPs for use in the GWAS. The two windows with largest effect for FY and FW were located on chromosome Omy9 and explained only 1.0-1.5% of genetic variance, thus suggesting a polygenic architecture affected by multiple loci with small effects in this population. One window on Omy5 explained 1.4 and 1.0% of the genetic variance for BW10 and BW13, respectively. Three windows located on Omy27, Omy17, and Omy9 (same window detected for FY) explained 1.7, 1.7, and 1.0%, respectively, of genetic variance for CAR. Among the detected 100 SNPs, 55% were located directly in genes (introns and exons). Nucleotide sequences of intragenic SNPs were blasted to the Mus musculus genome to create a putative gene network. The network suggests that differences in the ability to maintain a proliferative and renewable population of myogenic precursor cells may affect variation in growth and fillet yield in rainbow trout.
Gonzalez-Pena, Dianelys; Gao, Guangtu; Baranski, Matthew; Moen, Thomas; Cleveland, Beth M.; Kenney, P. Brett; Vallejo, Roger L.; Palti, Yniv; Leeds, Timothy D.
2016-01-01
Fillet yield (FY, %) is an economically-important trait in rainbow trout aquaculture that affects production efficiency. Despite that, FY has received little attention in breeding programs because it is difficult to measure on a large number of fish and cannot be directly measured on breeding candidates. The recent development of a high-density SNP array for rainbow trout has provided the needed tool for studying the underlying genetic architecture of this trait. A genome-wide association study (GWAS) was conducted for FY, body weight at 10 (BW10) and 13 (BW13) months post-hatching, head-off carcass weight (CAR), and fillet weight (FW) in a pedigreed rainbow trout population selectively bred for improved growth performance. The GWAS analysis was performed using the weighted single-step GBLUP method (wssGWAS). Phenotypic records of 1447 fish (1.5 kg at harvest) from 299 full-sib families in three successive generations, of which 875 fish from 196 full-sib families were genotyped, were used in the GWAS analysis. A total of 38,107 polymorphic SNPs were analyzed in a univariate model with hatch year and harvest group as fixed effects, harvest weight as a continuous covariate, and animal and common environment as random effects. A new linkage map was developed to create windows of 20 adjacent SNPs for use in the GWAS. The two windows with largest effect for FY and FW were located on chromosome Omy9 and explained only 1.0–1.5% of genetic variance, thus suggesting a polygenic architecture affected by multiple loci with small effects in this population. One window on Omy5 explained 1.4 and 1.0% of the genetic variance for BW10 and BW13, respectively. Three windows located on Omy27, Omy17, and Omy9 (same window detected for FY) explained 1.7, 1.7, and 1.0%, respectively, of genetic variance for CAR. Among the detected 100 SNPs, 55% were located directly in genes (introns and exons). Nucleotide sequences of intragenic SNPs were blasted to the Mus musculus genome to create a putative gene network. The network suggests that differences in the ability to maintain a proliferative and renewable population of myogenic precursor cells may affect variation in growth and fillet yield in rainbow trout. PMID:27920797
Hemoglobin genetics: recent contributions of GWAS and gene editing
Smith, Elenoe C.; Orkin, Stuart H.
2016-01-01
The β-hemoglobinopathies are inherited disorders resulting from altered coding potential or expression of the adult β-globin gene. Impaired expression of β-globin reduces adult hemoglobin (α2β2) production, the hallmark of β-thalassemia. A single-base mutation at codon 6 leads to formation of HbS (α2βS2) and sickle cell disease. While the basis of these diseases is known, therapy remains largely supportive. Bone marrow transplantation is the only curative therapy. Patients with elevated levels of fetal hemoglobin (HbF, α2γ2) as adults exhibit reduced symptoms and enhanced survival. The β-globin gene locus is a paradigm of cell- and developmental stage-specific regulation. Although the principal erythroid cell transcription factors are known, mechanisms responsible for silencing of the γ-globin gene were obscure until application of genome-wide association studies (GWAS). Here, we review findings in the field. GWAS identified BCL11A as a candidate negative regulator of γ-globin expression. Subsequent studies have established BCL11A as a quantitative repressor. GWAS-related single-nucleotide polymorphisms lie within an essential erythroid enhancer of the BCL11A gene. Disruption of a discrete region within the enhancer reduces BCL11A expression and induces HbF expression, providing the basis for gene therapy using gene editing tools. A recently identified, second silencing factor, leukemia/lymphoma-related factor/Pokemon, shares features with BCL11A, including interaction with the nucleosome remodeling deacetylase repressive complex. These findings suggest involvement of a common pathway for HbF silencing. In addition, we discuss other factors that may be involved in γ-globin gene silencing and their potential manipulation for therapeutic benefit in treating the β-hemoglobinopathies. PMID:27340226
Mägi, Reedik; Horikoshi, Momoko; Sofer, Tamar; Mahajan, Anubha; Kitajima, Hidetoshi; Franceschini, Nora; McCarthy, Mark I.; Morris, Andrew P.
2017-01-01
Trans-ethnic meta-analysis of genome-wide association studies (GWAS) across diverse populations can increase power to detect complex trait loci when the underlying causal variants are shared between ancestry groups. However, heterogeneity in allelic effects between GWAS at these loci can occur that is correlated with ancestry. Here, a novel approach is presented to detect SNP association and quantify the extent of heterogeneity in allelic effects that is correlated with ancestry. We employ trans-ethnic meta-regression to model allelic effects as a function of axes of genetic variation, derived from a matrix of mean pairwise allele frequency differences between GWAS, and implemented in the MR-MEGA software. Through detailed simulations, we demonstrate increased power to detect association for MR-MEGA over fixed- and random-effects meta-analysis across a range of scenarios of heterogeneity in allelic effects between ethnic groups. We also demonstrate improved fine-mapping resolution, in loci containing a single causal variant, compared to these meta-analysis approaches and PAINTOR, and equivalent performance to MANTRA at reduced computational cost. Application of MR-MEGA to trans-ethnic GWAS of kidney function in 71,461 individuals indicates stronger signals of association than fixed-effects meta-analysis when heterogeneity in allelic effects is correlated with ancestry. Application of MR-MEGA to fine-mapping four type 2 diabetes susceptibility loci in 22,086 cases and 42,539 controls highlights: (i) strong evidence for heterogeneity in allelic effects that is correlated with ancestry only at the index SNP for the association signal at the CDKAL1 locus; and (ii) 99% credible sets with six or fewer variants for five distinct association signals. PMID:28911207
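As a point of reference for the fixed-effects baseline that the abstract compares against, the following is a minimal sketch of inverse-variance weighted fixed-effects meta-analysis with Cochran's Q as a heterogeneity statistic. It is not the MR-MEGA meta-regression itself. The function name and the per-study effect sizes are hypothetical and chosen only for illustration.

```python
import math


def fixed_effects_meta(betas, ses):
    """Inverse-variance weighted fixed-effects estimate plus Cochran's Q."""
    weights = [1.0 / se ** 2 for se in ses]          # w_i = 1 / SE_i^2
    total_w = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total_w
    se = math.sqrt(1.0 / total_w)                    # SE of the pooled estimate
    # Cochran's Q: weighted squared deviations from the pooled estimate
    q = sum(w * (b - beta) ** 2 for w, b in zip(weights, betas))
    return beta, se, q


# Hypothetical per-study log odds ratios and standard errors (not from the paper).
betas = [0.10, 0.12, -0.05]
ses = [0.03, 0.04, 0.05]
beta, se, q = fixed_effects_meta(betas, ses)
print(f"pooled beta={beta:.4f}, se={se:.4f}, Q={q:.2f} on {len(betas) - 1} df")
```

A large Q relative to its degrees of freedom signals heterogeneity in allelic effects. The abstract's contribution is to model the part of that heterogeneity that tracks ancestry.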
Combined linkage and association analyses identify a novel locus for obesity near PROX1 in Asians.
Kim, Hyun-Jin; Yoo, Yun Joo; Ju, Young Seok; Lee, Seungbok; Cho, Sung-Il; Sung, Joohon; Kim, Jong-Il; Seo, Jeong-Sun
2013-11-01
Although genome-wide association studies (GWAS) have substantially contributed to understanding genetic architecture, unidentified variants for complex traits remain an issue. One efficient approach is to improve the power of a GWAS scan by weighting P values with prior linkage signals. Our objective was to identify novel candidates for obesity in Asian populations by using gene-mapping strategies that combine linkage and association analyses. To obtain linkage information for body mass index (BMI) and waist circumference (WC), we performed a multipoint genome-wide linkage study in an isolated Mongolian sample of 1,049 individuals from 74 families. Next, a family-based GWAS, which integrates within- and between-family components, was performed using the genotype data of 756 individuals of the Mongolian sample, and P values for association were weighted using linkage information obtained previously. For both BMI (LOD = 3.3) and WC (LOD = 2.6), the highest linkage peak was discovered at chromosome 10q11.22. In family-based GWAS combined with linkage information, six single-nucleotide polymorphisms (SNPs) for BMI and five SNPs for WC reached a significant level of association (linkage weighted P < 1 × 10(-5)). Of these, only one of the SNPs associated with WC (rs1704198) was replicated in 327 Korean families comprising 1,301 individuals. This SNP was located in the proximity of the prospero-related homeobox 1 (PROX1) gene, the function of which was validated previously in a mouse model. Our powerful strategic analysis enabled the discovery of a novel candidate gene, PROX1, associated with WC in an Asian population. Copyright © 2012 The Obesity Society.
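The idea of weighting association P values by prior linkage evidence can be sketched as follows. The abstract does not give the weighting function, so the mapping from LOD score to weight below is a hypothetical placeholder. The only general principle used is that weights are scaled to average 1 and each P value is divided by its weight.

```python
def linkage_weighted_pvalues(pvalues, lod_scores):
    """Divide each P value by a linkage-derived weight whose mean is 1."""
    # Hypothetical weight function: 1 + LOD (negative LODs treated as 0).
    raw = [1.0 + max(lod, 0.0) for lod in lod_scores]
    mean_raw = sum(raw) / len(raw)
    weights = [r / mean_raw for r in raw]            # normalise so mean weight = 1
    return [min(1.0, p / w) for p, w in zip(pvalues, weights)]


# Two SNPs with the same raw P value; only the first lies under a linkage peak.
weighted = linkage_weighted_pvalues([1e-4, 1e-4], [3.0, 0.0])
print(weighted)
```

SNPs under a linkage peak are up-weighted and the rest are down-weighted, so the overall multiple-testing budget is redistributed rather than enlarged.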
Genetic characteristics of inflammatory bowel disease in a Japanese population.
Fuyuno, Yuta; Yamazaki, Keiko; Takahashi, Atsushi; Esaki, Motohiro; Kawaguchi, Takaaki; Takazoe, Masakazu; Matsumoto, Takayuki; Matsui, Toshiyuki; Tanaka, Hiroki; Motoya, Satoshi; Suzuki, Yasuo; Kiyohara, Yutaka; Kitazono, Takanari; Kubo, Michiaki
2016-07-01
Crohn's disease (CD) and ulcerative colitis (UC) are two major forms of inflammatory bowel disease (IBD). Meta-analyses of genome-wide association studies (GWAS) have identified 163 susceptibility loci for IBD among European populations; however, there is limited information for IBD susceptibility in a Japanese population. We performed a GWAS using imputed genotypes of 743 IBD patients (372 with CD and 371 with UC) and 3321 controls. Using 100 tag single-nucleotide polymorphisms (SNPs) (P < 5 × 10(-5)), a replication study was conducted with an independent set of 1310 IBD patients (949 with CD and 361 with UC) and 4163 controls. In addition, 163 SNPs identified by a European IBD GWAS were genotyped, and genetic backgrounds were compared between the Japanese and European populations. In the IBD GWAS, two East Asia-specific IBD susceptibility loci were identified in the Japanese population: ATG16L2-FCHSD2 and SLC25A15-ELF1-WBP4. Among 163 reported SNPs in European IBD patients, significant associations were confirmed in 18 (8 CD-specific, 4 UC-specific, and 6 IBD-shared). In Japanese CD patients, genes in the Th17-IL23 pathway showed stronger genetic effects, whereas the association of genes in the autophagy pathway was limited. The associations of genes in the epithelial barrier and the Th17-IL23R pathways were similar in the Japanese and European UC populations. We confirmed two IBD susceptibility loci as common for CD and UC, and East Asian-specific. The genetic architecture in UC appeared to be similar between Europeans and East Asians, but may have some differences in CD.
Park, Sung Hee; Lee, Ji Young; Kim, Sangsoo
2011-01-01
Current Genome-Wide Association Studies (GWAS) are performed in a single trait framework without considering genetic correlations between important disease traits. Hence, such GWAS are limited in their ability to discover genetic risk factors with pleiotropic effects. This work reports a novel data mining approach to discover patterns of multiple phenotypic associations over 52 anthropometric and biochemical traits in KARE and a new analytical scheme for GWAS of multivariate phenotypes defined by the discovered patterns. This methodology was applied to the GWAS for the multivariate phenotype highLDLhighTG derived from the predicted patterns of the phenotypic associations. The patterns of the phenotypic associations were informative to draw relations between plasma lipid levels with bone mineral density and a cluster of common traits (obesity, hypertension, insulin resistance) related to Metabolic Syndrome (MS). A total of 15 SNPs in six genes (PAK7, C20orf103, NRIP1, BCL2, TRPM3, and NAV1) were identified for significant associations with highLDLhighTG. Noteworthy findings were that the significant associations included a missense mutation (PAK7:R335P), a frameshift mutation (C20orf103) and SNPs in splicing sites (TRPM3). The six genes corresponded to rat and mouse quantitative trait loci (QTLs) that had shown associations with the common traits such as the well characterized MS and even tumor susceptibility. Our findings suggest that the six genes may play important roles in the pleiotropic effects on lipid metabolism and the MS, which increase the risk of Type 2 Diabetes and cardiovascular disease. The use of the multivariate phenotypes can be advantageous in identifying genetic risk factors, accounting for the pleiotropic effects when the multivariate phenotypes have a common etiological pathway.
Nagao, Yumiko; Nishida, Nao; Toyo-Oka, Licht; Kawaguchi, Atsushi; Amoroso, Antonio; Carrozzo, Marco; Sata, Michio; Mizokami, Masashi; Tokunaga, Katsushi; Tanaka, Yasuhito
2017-06-01
There is a close relationship between hepatitis C virus (HCV) infection and lichen planus, a chronic inflammatory mucocutaneous disease. We performed a genome-wide association study (GWAS) to identify genetic variants associated with HCV-related lichen planus. We conducted a GWAS of 261 patients with HCV infection treated at a tertiary medical center in Japan from October 2007 through January 2013; a total of 71 had lichen planus and 190 had normal oral mucosa. We validated our findings in a GWAS of 38 patients with HCV-associated lichen planus and 7 HCV-infected patients with normal oral mucosa treated at a medical center in Italy. Single-nucleotide polymorphisms in NRP2 (rs884000) and IGFBP4 (rs538399) were associated with risk of HCV-associated lichen planus (P < 1 × 10(-4)). We also found an association between a single-nucleotide polymorphism in the HLA-DR/DQ genes (rs9461799) and susceptibility to HCV-associated lichen planus. The odds ratios for the minor alleles of rs884000, rs538399, and rs9461799 were 3.25 (95% confidence interval, 1.95-5.41), 0.40 (95% confidence interval, 0.25-0.63), and 2.15 (95% confidence interval, 1.41-3.28), respectively. In a GWAS of Japanese patients with HCV infection, we replicated associations between previously reported polymorphisms in HLA class II genes and risk for lichen planus. We also identified single-nucleotide polymorphisms in NRP2 and IGFBP4 loci that increase and reduce risk of lichen planus, respectively. These genetic variants might be used to identify patients with HCV infection who are at risk for lichen planus. Copyright © 2017 AGA Institute. Published by Elsevier Inc. All rights reserved.
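A reported odds ratio and its 95% confidence interval can be converted back to an approximate Wald z statistic and two-sided P value, which is a quick way to sanity-check figures such as those above. The sketch below assumes the interval is symmetric on the log scale (a Wald interval). The function name is hypothetical, and results are approximate because the published values are rounded.

```python
import math


def z_and_p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover an approximate Wald z and two-sided P from an OR and its 95% CI."""
    # The CI spans 2 * z_crit standard errors on the log-odds scale.
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))   # two-sided normal tail probability
    return z, p


# Odds ratios and intervals as reported in the abstract.
for rsid, or_, lo, hi in [("rs884000", 3.25, 1.95, 5.41),
                          ("rs538399", 0.40, 0.25, 0.63),
                          ("rs9461799", 2.15, 1.41, 3.28)]:
    z, p = z_and_p_from_or_ci(or_, lo, hi)
    print(f"{rsid}: z={z:.2f}, approx two-sided P={p:.1e}")
```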
Melo, Thaise P; Takada, Luciana; Baldi, Fernando; Oliveira, Henrique N; Dias, Marina M; Neves, Haroldo H R; Schenkel, Flavio S; Albuquerque, Lucia G; Carvalheiro, Roberto
2016-06-21
QTL mapping through genome-wide association studies (GWAS) is challenging, especially in the case of low heritability complex traits and when few animals possess genotypic and phenotypic information. When most of the phenotypic information is from non-genotyped animals, GWAS can be performed using the weighted single-step GBLUP (WssGBLUP) method, which permits combining all available information, even that of non-genotyped animals. However, it is not clear to what extent phenotypic information from non-genotyped animals increases the power of QTL detection, and whether factors such as the extent of linkage disequilibrium (LD) in the population and weighting SNPs in WssGBLUP affect the importance of using information from non-genotyped animals in GWAS. These questions were investigated in this study using real and simulated data. Analysis of real data showed that the use of phenotypes of non-genotyped animals affected SNP effect estimates and, consequently, QTL mapping. Despite some coincidence, the most important genomic regions identified by the analyses, either using or ignoring phenotypes of non-genotyped animals, were not the same. The simulation results indicated that the inclusion of all available phenotypic information, even that of non-genotyped animals, tends to improve QTL detection for low heritability complex traits. For populations with low levels of LD, this trend of improvement was less pronounced. Stronger shrinkage on SNPs explaining lower variance was not necessarily associated with better QTL mapping. The use of phenotypic information from non-genotyped animals in GWAS may improve the ability to detect QTL for low heritability complex traits, especially in populations in which the level of LD is high.
Evangelou, Marina; Smyth, Deborah J; Fortune, Mary D; Burren, Oliver S; Walker, Neil M; Guo, Hui; Onengut-Gumuscu, Suna; Chen, Wei-Min; Concannon, Patrick; Rich, Stephen S; Todd, John A; Wallace, Chris
2014-01-01
Pathway analysis can complement point-wise single nucleotide polymorphism (SNP) analysis in exploring genomewide association study (GWAS) data to identify specific disease-associated genes that can be candidate causal genes. We propose a straightforward methodology that can be used for conducting a gene-based pathway analysis using summary GWAS statistics in combination with widely available reference genotype data. We used this method to perform a gene-based pathway analysis of a type 1 diabetes (T1D) meta-analysis GWAS (of 7,514 cases and 9,045 controls). An important feature of the conducted analysis is the removal of the major histocompatibility complex gene region, the major genetic risk factor for T1D. Thirty-one of the 1,583 (2%) tested pathways were identified to be enriched for association with T1D at a 5% false discovery rate. We analyzed these 31 pathways and their genes to identify SNPs in or near these pathway genes that showed potentially novel association with T1D and attempted to replicate the association of 22 SNPs in additional samples. Replication P-values were skewed () with 12 of the 22 SNPs showing . Support, including replication evidence, was obtained for nine T1D associated variants in genes ITGB7 (rs11170466, ), NRP1 (rs722988, ), BAD (rs694739, ), CTSB (rs1296023, ), FYN (rs11964650, ), UBE2G1 (rs9906760, ), MAP3K14 (rs17759555, ), ITGB1 (rs1557150, ), and IL7R (rs1445898, ). The proposed methodology can be applied to other GWAS datasets for which only summary level data are available. PMID:25371288
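Selecting pathways at a 5% false discovery rate can be illustrated with the Benjamini-Hochberg step-up procedure. The abstract does not say which FDR procedure was used, so treating it as Benjamini-Hochberg is an assumption. The function name and the P values below are hypothetical.

```python
def benjamini_hochberg(pvalues, fdr=0.05):
    """Return the indices of hypotheses rejected by the BH step-up procedure."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])   # indices by ascending P
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        # Keep the largest rank k whose P value satisfies p_(k) <= fdr * k / m.
        if pvalues[idx] <= fdr * rank / m:
            cutoff_rank = rank
    return sorted(order[:cutoff_rank])


# Hypothetical pathway-level P values.
pathway_p = [0.001, 0.04, 0.03, 0.5]
print(benjamini_hochberg(pathway_p))
```

In the study's setting the input would be one P value per tested pathway (1,583 in total), and the returned indices would be the pathways carried forward for SNP-level follow-up.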
Paziewska, Agnieszka; Cukrowska, Bozena; Dabrowska, Michalina; Goryca, Krzysztof; Piatkowska, Magdalena; Kluska, Anna; Mikula, Michal; Karczmarski, Jakub; Oralewska, Beata; Rybak, Anna; Socha, Jerzy; Balabas, Aneta; Zeber-Lubecka, Natalia; Ambrozkiewicz, Filip; Konopka, Ewa; Trojanowska, Ilona; Zagroba, Malgorzata; Szperl, Malgorzata; Ostrowski, Jerzy
2015-01-01
Assessment of non-HLA variants alongside standard HLA testing was previously shown to improve the identification of potential coeliac disease (CD) patients. We intended to identify new genetic variants associated with CD in the Polish population that would improve CD risk prediction when used alongside HLA haplotype analysis. DNA samples of 336 CD patients and 264 unrelated healthy controls were used to create DNA pools for a genome wide association study (GWAS). GWAS findings were validated with individual HLA tag single nucleotide polymorphism (SNP) typing of 473 patients and 714 healthy controls. Association analysis using four HLA-tagging SNPs showed that, as was found in other populations, positive predicting genotypes (HLA-DQ2.5/DQ2.5, HLA-DQ2.5/DQ2.2, and HLA-DQ2.5/DQ8) were found at higher frequencies in CD patients than in healthy control individuals in the Polish population. Both CD-associated SNPs discovered by GWAS were found in the CD susceptibility region, confirming the previously-determined association of the major histocompatibility (MHC) region with CD pathogenesis. The two most significant SNPs from the GWAS were rs9272346 (HLA-dependent; localized within 1 kb of DQA1) and rs3130484 (HLA-independent; mapped to MSH5). Specificity of CD prediction using the four HLA-tagging SNPs achieved 92.9%, but sensitivity was only 45.5%. However, when a testing combination of the HLA-tagging SNPs and the MSH5 SNP was used, specificity decreased to 80%, and sensitivity increased to 74%. This study confirmed that improvement of CD risk prediction sensitivity could be achieved by including non-HLA SNPs alongside HLA SNPs in genetic testing.
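The trade-off between sensitivity and specificity reported above follows directly from the confusion-matrix definitions. The sketch below shows the arithmetic. The counts are hypothetical round numbers, not the study's data, which the abstract does not give.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    sensitivity = tp / (tp + fn)   # fraction of true patients flagged by the test
    specificity = tn / (tn + fp)   # fraction of healthy controls correctly cleared
    return sensitivity, specificity


# Hypothetical counts for a stricter and a more permissive genotype rule.
strict = sensitivity_specificity(tp=45, fn=55, tn=93, fp=7)
permissive = sensitivity_specificity(tp=74, fn=26, tn=80, fp=20)
print(f"strict:     sensitivity={strict[0]:.2f}, specificity={strict[1]:.2f}")
print(f"permissive: sensitivity={permissive[0]:.2f}, specificity={permissive[1]:.2f}")
```

Adding a further risk marker to a rule that flags anyone carrying any risk genotype can only increase the number flagged, which raises sensitivity and lowers specificity.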
Discovery and characterization of two new stem rust resistance genes in Aegilops sharonensis.
Yu, Guotai; Champouret, Nicolas; Steuernagel, Burkhard; Olivera, Pablo D; Simmons, Jamie; Williams, Cole; Johnson, Ryan; Moscou, Matthew J; Hernández-Pinzón, Inmaculada; Green, Phon; Sela, Hanan; Millet, Eitan; Jones, Jonathan D G; Ward, Eric R; Steffenson, Brian J; Wulff, Brande B H
2017-06-01
We identified two novel wheat stem rust resistance genes, Sr-1644-1Sh and Sr-1644-5Sh, in Aegilops sharonensis that are effective against widely virulent African races of the wheat stem rust pathogen. Stem rust is one of the most important diseases of wheat in the world. When single stem rust resistance (Sr) genes are deployed in wheat, they are often rapidly overcome by the pathogen. To this end, we initiated a search for novel sources of resistance in diverse wheat relatives and identified the wild goatgrass species Aegilops sharonensis (Sharon goatgrass) as a rich reservoir of resistance to wheat stem rust. The objectives of this study were to discover and map novel Sr genes in Ae. sharonensis and to explore the possibility of identifying new Sr genes by genome-wide association study (GWAS). We developed two biparental populations between resistant and susceptible accessions of Ae. sharonensis and performed QTL and linkage analysis. In an F6 recombinant inbred line and an F2 population, two genes were identified that mapped to the short arm of chromosome 1Ssh, designated as Sr-1644-1Sh, and the long arm of chromosome 5Ssh, designated as Sr-1644-5Sh. The gene Sr-1644-1Sh confers a high level of resistance to race TTKSK (a member of the Ug99 race group), while the gene Sr-1644-5Sh conditions strong resistance to TRTTF, another widely virulent race found in Yemen. Additionally, GWAS was conducted on 125 diverse Ae. sharonensis accessions for stem rust resistance. The gene Sr-1644-1Sh was detected by GWAS, while Sr-1644-5Sh was not detected, indicating that the effectiveness of GWAS might be affected by marker density, population structure, low allele frequency and other factors.
Bigdeli, Tim B.; Ripke, Stephan; Bacanu, Silviu-Alin; Lee, Sang Hong; Wray, Naomi R.; Gejman, Pablo V.; Rietschel, Marcella; Cichon, Sven; St Clair, David; Corvin, Aiden; Kirov, George; McQuillin, Andrew; Gurling, Hugh; Rujescu, Dan; Andreassen, Ole A.; Werge, Thomas; Blackwood, Douglas H.R.; Pato, Carlos N.; Pato, Michele T.; Malhotra, Anil K.; O’Donovan, Michael C.; Kendler, Kenneth S.; Fanous, Ayman H.
2018-01-01
Genome-wide association studies (GWAS) of schizophrenia have yielded more than 100 common susceptibility variants, and strongly support a substantial polygenic contribution of a large number of small allelic effects. It has been hypothesized that familial schizophrenia is largely a consequence of inherited rather than environmental factors. We investigated the extent to which familiality of schizophrenia is associated with enrichment for common risk variants detectable in a large GWAS. We analyzed single nucleotide polymorphism (SNP) data for cases reporting a family history of psychotic illness (N = 978), cases reporting no such family history (N = 4,503), and unscreened controls (N = 8,285) from the Psychiatric Genomics Consortium (PGC1) study of schizophrenia. We used a multinomial logistic regression approach with model-fitting to detect allelic effects specific to either family history subgroup. We also considered a polygenic model, in which we tested whether family history positive subjects carried more schizophrenia risk alleles than family history negative subjects, on average. Several individual SNPs attained suggestive but not genome-wide significant association with either family history subgroup. Comparison of genome-wide polygenic risk scores based on GWAS summary statistics indicated a significant enrichment for SNP effects among family history positive compared to family history negative cases (Nagelkerke’s R² = 0.0021; P = 0.00331; P-value threshold <0.4). Estimates of variability in disease liability attributable to the aggregate effect of genome-wide SNPs were significantly greater for family history positive compared to family history negative cases (0.32 and 0.22, respectively; P = 0.031). We found suggestive evidence of allelic effects detectable in large GWAS of schizophrenia that might be specific to particular family history subgroups. However, consideration of a polygenic risk score indicated a significant enrichment among family history positive cases for common allelic effects. Familial illness might, therefore, represent a more heritable form of schizophrenia, as suggested by previous epidemiological studies. PMID:26663532
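A polygenic risk score of the kind compared here is, in its simplest form, a weighted sum of risk-allele counts over SNPs whose discovery P value falls below a chosen threshold. The sketch below assumes that simple form, without linkage-disequilibrium pruning or score normalisation. The function name and the toy data are hypothetical, and the default threshold mirrors the "<0.4" quoted in the abstract.

```python
def polygenic_risk_score(dosages, betas, pvalues, p_threshold=0.4):
    """Sum risk-allele dosages weighted by effect size for SNPs with P < threshold."""
    return sum(d * b
               for d, b, p in zip(dosages, betas, pvalues)
               if p < p_threshold)


# Toy example: three SNPs for one individual (allele counts of 0, 1 or 2).
dosages = [2, 1, 0]
betas = [0.1, 0.2, 0.3]        # hypothetical log odds ratios from summary statistics
pvalues = [0.01, 0.5, 0.2]     # hypothetical discovery P values
print(polygenic_risk_score(dosages, betas, pvalues))
```

Scores computed this way for each case can then be compared between family history positive and negative groups, for example by logistic regression of group membership on the score.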
Meng, Shan; He, Jianbo; Zhao, Tuanjie; Xing, Guangnan; Li, Yan; Yang, Shouping; Lu, Jiangjie; Wang, Yufeng; Gai, Junyi
2016-08-01
Using an innovative GWAS procedure in the CSLRP, 44 QTL with 199 alleles, contributing 72.2% of SIFC variation, were detected and organized into a QTL-allele matrix for cross design and gene annotation. The seed isoflavone content (SIFC) of soybeans is of great importance to health care. The Chinese soybean landrace population (CSLRP) as a genetic reservoir was studied for its whole-genome quantitative trait loci (QTL) system of the SIFC using an innovative restricted two-stage multi-locus genome-wide association study procedure (RTM-GWAS). A sample of 366 landraces was tested under four environments and sequenced using the RAD-seq (restriction-site-associated DNA sequencing) technique to obtain 116,769 single nucleotide polymorphisms (SNPs), which were then organized into 29,119 SNP linkage disequilibrium blocks (SNPLDBs) for GWAS. The 44 detected QTL with 199 alleles on 16 chromosomes (explaining 72.2% of the total phenotypic variation), together with the allele effects (92 positive and 107 negative) of the CSLRP, were organized into a QTL-allele matrix showing the SIFC population genetic structure. Differentiation among eco-regions in SIFC was found beyond that shown by genome-wide markers. All accessions comprised both positive and negative alleles, implying a great potential for recombination within the population. The optimal crosses were predicted from the matrices, showing transgressive potentials in the CSLRP. From the detected QTL system, 55 candidate genes related to 11 biological processes were χ²-tested as an SIFC candidate gene system. The present study explored the genome-wide SIFC QTL/gene system with the innovative RTM-GWAS and found the potentials of the QTL-allele matrix in optimal cross design and population genetic and genomic studies, which may have provided a solution to match the breeding by design strategy at both QTL and gene levels in breeding programs.
Gan, Wei; Walters, Robin G; Holmes, Michael V; Bragg, Fiona; Millwood, Iona Y; Banasik, Karina; Chen, Yiping; Du, Huaidong; Iona, Andri; Mahajan, Anubha; Yang, Ling; Bian, Zheng; Guo, Yu; Clarke, Robert J; Li, Liming; McCarthy, Mark I; Chen, Zhengming
2016-07-01
Genome-wide association studies (GWAS) have discovered many risk variants for type 2 diabetes. However, estimates of the contributions of risk variants to type 2 diabetes predisposition are often based on highly selected case-control samples, and reliable estimates of population-level effect sizes are missing, especially in non-European populations. The individual and cumulative effects of 59 established type 2 diabetes risk loci were measured in a population-based China Kadoorie Biobank (CKB) study of 93,000 Chinese adults, including >7,100 diabetes cases. Association signals were directionally consistent between CKB and the original discovery GWAS: of 56 variants passing quality control, 48 showed the same direction of effect (binomial test, p = 2.3 × 10(-8)). We observed a consistent overall trend towards lower risk variant effect sizes in CKB than in case-control samples of GWAS meta-analyses (mean 19-22% decrease in log odds, p ≤ 0.0048), likely to reflect correction of both 'winner's curse' and spectrum bias effects. The association with risk of diabetes of a genetic risk score, based on lead variants at 25 loci considered to act through beta cell function, demonstrated significant interactions with several measures of adiposity (BMI, waist circumference [WC], WHR and percentage body fat [PBF]; all p interaction < 1 × 10(-4)), with a greater effect being observed in leaner adults. Our study provides further evidence of shared genetic architecture for type 2 diabetes between Europeans and East Asians. It also indicates that even very large GWAS meta-analyses may be vulnerable to substantial inflation of effect size estimates, compared with those observed in large-scale population-based cohort studies. Details of how to access China Kadoorie Biobank data and details of the data release schedule are available from www.ckbiobank.org/site/Data+Access .
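The directional-consistency result (48 of 56 variants with the same direction of effect) can be checked with an exact binomial sign test under the null that each direction is equally likely. The sketch below computes the one-sided upper tail, which comes out at roughly 2.3 × 10(-8) and so agrees with the reported value. That the authors used a one-sided test is an inference from this agreement, not something the abstract states.

```python
from math import comb


def binomial_upper_tail(successes, trials, prob=0.5):
    """Exact P(X >= successes) for X ~ Binomial(trials, prob)."""
    return sum(comb(trials, k) * prob ** k * (1 - prob) ** (trials - k)
               for k in range(successes, trials + 1))


p = binomial_upper_tail(48, 56)
print(f"one-sided P(X >= 48 | n = 56, p = 0.5) = {p:.2e}")
```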
Identification of a Bipolar Disorder Vulnerable Gene CHDH at 3p21.1.
Chang, Hong; Li, Lingyi; Peng, Tao; Grigoroiu-Serbanescu, Maria; Bergen, Sarah E; Landén, Mikael; Hultman, Christina M; Forstner, Andreas J; Strohmaier, Jana; Hecker, Julian; Schulze, Thomas G; Müller-Myhsok, Bertram; Reif, Andreas; Mitchell, Philip B; Martin, Nicholas G; Cichon, Sven; Nöthen, Markus M; Jamain, Stéphane; Leboyer, Marion; Bellivier, Frank; Etain, Bruno; Kahn, Jean-Pierre; Henry, Chantal; Rietschel, Marcella; Xiao, Xiao; Li, Ming
2017-09-01
Genome-wide analysis (GWA) is an effective strategy to discover extreme effects surpassing genome-wide significant levels in studying complex disorders; however, when sample size is limited, the true effects may fail to achieve genome-wide significance. In such cases, there may be authentic results among the pools of nominal candidates, and an alternative approach is to consider nominal candidates that are replicable across different samples. Here, we found that mRNA expression of the choline dehydrogenase gene (CHDH) was uniformly upregulated in the brains of bipolar disorder (BPD) patients compared with healthy controls across different studies. Follow-up genetic analyses of CHDH variants in multiple independent clinical datasets (including 11,564 cases and 17,686 controls) identified a risk SNP rs9836592 showing consistent associations with BPD (P meta = 5.72 × 10(-4)), and the risk allele indicated an increased CHDH expression in multiple neuronal tissues (lowest P = 6.70 × 10(-16)). These converging results may identify a nominal but true BPD susceptibility gene CHDH. Further exploratory analysis revealed suggestive associations of rs9836592 with childhood intelligence (P = 0.044) and educational attainment (P = 0.0039), a "proxy phenotype" of general cognitive abilities. Intriguingly, the CHDH gene is located at chromosome 3p21.1, a risk region implicated in previous BPD genome-wide association studies (GWAS), but CHDH lies outside of the core GWAS linkage disequilibrium (LD) region, and our studied SNP rs9836592 is ∼1.2 Mb 3' downstream of the previous GWAS loci (e.g., rs2251219) with no LD between them; thus, the association observed here is unlikely to be a reflection of previous GWAS signals. In summary, our results imply that CHDH may play a previously unknown role in the etiology of BPD and also highlight the informative value of integrating gene expression and genetic code in advancing our understanding of its biological basis.
Batra, Jyotsna; Lose, Felicity; O'Mara, Tracy; Marquart, Louise; Stephens, Carson; Alexander, Kimberly; Srinivasan, Srilakshmi; Eeles, Rosalind A.; Easton, Douglas F.; Olama, Ali Amin Al; Kote-Jarai, Zsofia; Guy, Michelle; Muir, Kenneth; Lophatananon, Artitaya; Rahman, Aneela A.; Neal, David E.; Hamdy, Freddie C.; Donovan, Jenny L.; Chambers, Suzanne; Gardiner, Robert A.; Aitken, Joanne; Yaxley, John; Kedda, Mary-Anne
2011-01-01
Background: Kallikrein 15 (KLK15)/Prostinogen is a plausible candidate for prostate cancer susceptibility. Elevated KLK15 expression has been reported in prostate cancer and it has been described as an unfavorable prognostic marker for the disease. Objectives: We performed a comprehensive analysis of association of variants in the KLK15 gene with prostate cancer risk and aggressiveness by genotyping tagSNPs, as well as putative functional SNPs identified by extensive bioinformatics analysis. Methods and Data Sources: Twelve out of 22 SNPs, selected on the basis of linkage disequilibrium pattern, were analyzed in an Australian sample of 1,011 histologically verified prostate cancer cases and 1,405 ethnically matched controls. Replication was sought from two existing genome wide association studies (GWAS): the Cancer Genetic Markers of Susceptibility (CGEMS) project and a UK GWAS study. Results: Two KLK15 SNPs, rs2659053 and rs3745522, showed evidence of association (p < 0.05) but were not present on the GWAS platforms. KLK15 SNP rs2659056 was found to be associated with prostate cancer aggressiveness and showed evidence of association in a replication cohort of 5,051 patients from the UK, Australia, and the CGEMS dataset of US samples. A highly significant association with Gleason score was observed when the data were combined from these three studies with an Odds Ratio (OR) of 0.85 (95% CI = 0.77–0.93; p = 2.7 × 10(-4)). The rs2659056 SNP is predicted to alter binding of the RORalpha transcription factor, which has a role in the control of cell growth and differentiation and has been suggested to control the metastatic behavior of prostate cancer cells. Conclusions: Our findings suggest a role for KLK15 genetic variation in the etiology of prostate cancer among men of European ancestry, although further studies in very large sample sets are necessary to confirm effect sizes. PMID:22132073
Yu, Kai; Chin, Yoon-Ming; Lou, Pei-Jen; Hsu, Wan-Lun; McKay, James D.; Chen, Chien-Jen; Chang, Yu-Sun; Chen, Li-Zhen; Chen, Ming-Yuan; Cui, Qian; Feng, Fu-Tuo; Feng, Qi-Shen; Guo, Yun-Miao; Jia, Wei-Hua; Khoo, Alan Soo-Beng; Liu, Wen-Sheng; Mo, Hao-Yuan; Pua, Kin-Choo; Teo, Soo-Hwang; Tse, Ka-Po; Xia, Yun-Fei; Zhang, Hongxin; Zhou, Gang-Qiao; Liu, Jian-Jun; Zeng, Yi-Xin; Hildesheim, Allan
2015-01-01
Background: Genetic loci within the major histocompatibility complex (MHC) have been associated with nasopharyngeal carcinoma (NPC), an Epstein-Barr virus (EBV)-associated cancer, in several GWAS. Results outside this region have varied. Methods: We conducted a meta-analysis of four NPC GWAS among Chinese individuals (2,152 cases; 3,740 controls). Forty-three noteworthy findings outside the MHC region were identified and targeted for replication in a pooled analysis of 4 independent case-control studies across 3 regions in Asia (4,716 cases; 5,379 controls). A meta-analysis that combined results from the initial GWAS and replication studies was performed. Results: In the combined meta-analysis, rs31489, located within the CLPTM1L/TERT region on chromosome 5p15.33, was strongly associated with NPC (OR = 0.81; p = 6.3 × 10(-13)). Our results also provide support for associations reported from published NPC GWAS: rs6774494 (p = 1.5 × 10(-12); located in the MECOM gene region), rs9510787 (p = 5.0 × 10(-10); located in the TNFRSF19 gene region), and rs1412829/rs4977756/rs1063192 (p = 2.8 × 10(-8), p = 7.0 × 10(-7), and p = 8.4 × 10(-7), respectively; located in the CDKN2A/B gene region). Conclusion: We have identified a novel association between genetic variation in the CLPTM1L/TERT region and NPC. Supporting our finding, rs31489 and other SNPs in this region have been reported to be associated with multiple cancer sites, candidate-based studies have reported associations between polymorphisms in this region and NPC, the TERT gene is important for telomere maintenance and has been reported to be over-expressed in NPC, and an EBV protein expressed in NPC (LMP1) modulates TERT expression/telomerase activity. Impact: Our finding suggests that factors involved in telomere length maintenance are involved in NPC pathogenesis. PMID:26545403
Staley, James R; Jones, Edmund; Kaptoge, Stephen; Butterworth, Adam S; Sweeting, Michael J; Wood, Angela M; Howson, Joanna M M
2017-06-01
Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP-disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease.
Lam, Max; Trampush, Joey W; Yu, Jin; Knowles, Emma; Davies, Gail; Liewald, David C; Starr, John M; Djurovic, Srdjan; Melle, Ingrid; Sundet, Kjetil; Christoforou, Andrea; Reinvang, Ivar; DeRosse, Pamela; Lundervold, Astri J; Steen, Vidar M; Espeseth, Thomas; Räikkönen, Katri; Widen, Elisabeth; Palotie, Aarno; Eriksson, Johan G; Giegling, Ina; Konte, Bettina; Roussos, Panos; Giakoumaki, Stella; Burdick, Katherine E; Payton, Antony; Ollier, William; Chiba-Falek, Ornit; Attix, Deborah K; Need, Anna C; Cirulli, Elizabeth T; Voineskos, Aristotle N; Stefanis, Nikos C; Avramopoulos, Dimitrios; Hatzimanolis, Alex; Arking, Dan E; Smyrnis, Nikolaos; Bilder, Robert M; Freimer, Nelson A; Cannon, Tyrone D; London, Edythe; Poldrack, Russell A; Sabb, Fred W; Congdon, Eliza; Conley, Emily Drabant; Scult, Matthew A; Dickinson, Dwight; Straub, Richard E; Donohoe, Gary; Morris, Derek; Corvin, Aiden; Gill, Michael; Hariri, Ahmad R; Weinberger, Daniel R; Pendleton, Neil; Bitsios, Panos; Rujescu, Dan; Lahti, Jari; Le Hellard, Stephanie; Keller, Matthew C; Andreassen, Ole A; Deary, Ian J; Glahn, David C; Malhotra, Anil K; Lencz, Todd
2017-11-28
Here, we present a large (n = 107,207) genome-wide association study (GWAS) of general cognitive ability ("g"), further enhanced by combining results with a large-scale GWAS of educational attainment. We identified 70 independent genomic loci associated with general cognitive ability. Results showed significant enrichment for genes causing Mendelian disorders with an intellectual disability phenotype. Competitive pathway analysis implicated the biological processes of neurogenesis and synaptic regulation, as well as the gene targets of two pharmacologic agents: cinnarizine, a T-type calcium channel blocker, and LY97241, a potassium channel inhibitor. Transcriptome-wide and epigenome-wide analysis revealed that the implicated loci were enriched for genes expressed across all brain regions (most strongly in the cerebellum). Enrichment was exclusive to genes expressed in neurons but not oligodendrocytes or astrocytes. Finally, we report genetic correlations between cognitive ability and disparate phenotypes including psychiatric disorders, several autoimmune disorders, longevity, and maternal age at first birth. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Local Genetic Correlation Gives Insights into the Shared Genetic Architecture of Complex Traits.
Shi, Huwenbo; Mancuso, Nicholas; Spendlove, Sarah; Pasaniuc, Bogdan
2017-11-02
Although genetic correlations between complex traits provide valuable insights into epidemiological and etiological studies, a precise quantification of which genomic regions disproportionately contribute to the genome-wide correlation is currently lacking. Here, we introduce ρ-HESS, a technique to quantify the correlation between pairs of traits due to genetic variation at a small region in the genome. Our approach requires GWAS summary data only and makes no distributional assumption on the causal variant effect sizes while accounting for linkage disequilibrium (LD) and overlapping GWAS samples. We analyzed large-scale GWAS summary data across 36 quantitative traits, and identified 25 genomic regions that contribute significantly to the genetic correlation among these traits. Notably, we find 6 genomic regions that contribute to the genetic correlation of 10 pairs of traits that show negligible genome-wide correlation, further showcasing the power of local genetic correlation analyses. Finally, we report the distribution of local genetic correlations across the genome for 55 pairs of traits that show putative causal relationships. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
GlobAl Distribution of GEnetic Traits (GADGET) web server: polygenic trait scores worldwide.
Chande, Aroon T; Wang, Lu; Rishishwar, Lavanya; Conley, Andrew B; Norris, Emily T; Valderrama-Aguirre, Augusto; Jordan, I King
2018-05-18
Human populations from around the world show striking phenotypic variation across a wide variety of traits. Genome-wide association studies (GWAS) are used to uncover genetic variants that influence the expression of heritable human traits; accordingly, population-specific distributions of GWAS-implicated variants may shed light on the genetic basis of human phenotypic diversity. With this in mind, we developed the GlobAl Distribution of GEnetic Traits web server (GADGET http://gadget.biosci.gatech.edu). The GADGET web server provides users with a dynamic visual platform for exploring the relationship between worldwide genetic diversity and the genetic architecture underlying numerous human phenotypes. GADGET integrates trait-implicated single nucleotide polymorphisms (SNPs) from GWAS, with population genetic data from the 1000 Genomes Project, to calculate genome-wide polygenic trait scores (PTS) for 818 phenotypes in 2504 individual genomes. Population-specific distributions of PTS are shown for 26 human populations across 5 continental population groups, with traits ordered based on the extent of variation observed among populations. Users of GADGET can also upload custom trait SNP sets to visualize global PTS distributions for their own traits of interest.
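As a rough illustration of what a polygenic trait score can look like, the sketch below sums effect-allele counts over a set of trait-associated SNPs for each individual and averages the result per population. The unweighted sum, the function names, and the toy SNP identifiers and genotypes are all assumptions made for illustration; the abstract does not state the exact formula GADGET uses.

```python
from typing import Dict, Iterable

def polygenic_trait_score(allele_counts: Dict[str, int], trait_snps: Iterable[str]) -> int:
    """Sum effect-allele counts (0, 1 or 2 per SNP) over the trait SNPs.

    SNPs missing from the individual's genotype record are skipped.
    """
    return sum(allele_counts[snp] for snp in trait_snps if snp in allele_counts)

def mean_score(individuals: Iterable[Dict[str, int]], trait_snps: Iterable[str]) -> float:
    """Average score across the individuals of one population."""
    snps = list(trait_snps)
    scores = [polygenic_trait_score(ind, snps) for ind in individuals]
    return sum(scores) / len(scores)

# Hypothetical SNP identifiers and genotypes, for illustration only.
trait_snps = ["snp_a", "snp_b", "snp_c"]
population = [
    {"snp_a": 2, "snp_b": 1, "snp_c": 0},
    {"snp_a": 0, "snp_b": 1},
]
```

Comparing `mean_score` across populations would give the kind of population-specific distribution summary the abstract describes, under the stated simplifying assumptions.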
Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.
Fang, Hongyan; Zhang, Hong; Yang, Yaning
2016-07-01
Genome-wide association studies (GWAS) have achieved great success in identifying genetic variants, but the nature of GWAS determines their inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power to detect rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way of identifying susceptible genes. Rare variants typically have low minor allele frequencies, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods. © 2016 John Wiley & Sons Ltd/University College London.
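The Poisson approximation mentioned above can be checked numerically. The sketch below is not the PAST statistic itself; it only compares the exact distribution of the pooled minor allele count with a Poisson distribution of matching mean. It assumes independent variants and independent alleles within individuals, and the minor allele frequencies and sample size are hypothetical.

```python
import math

def poisson_pmf(k: int, lam: float) -> float:
    """Poisson probability of observing exactly k events with mean lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

def exact_total_pmf(mafs, n_individuals: int, max_k: int):
    """Exact pmf (for counts 0..max_k) of the pooled minor allele count.

    Each variant contributes 2 * n_individuals independent Bernoulli draws
    with success probability equal to its minor allele frequency.
    """
    pmf = [1.0] + [0.0] * max_k
    for p in mafs:
        for _ in range(2 * n_individuals):
            new = [0.0] * (max_k + 1)
            for k, mass in enumerate(pmf):
                new[k] += mass * (1.0 - p)
                if k + 1 <= max_k:
                    new[k + 1] += mass * p
            pmf = new
    return pmf

# Hypothetical rare-variant frequencies and sample size.
mafs = [0.001, 0.002, 0.0015]
n = 100
lam = 2 * n * sum(mafs)  # expected pooled minor allele count
exact = exact_total_pmf(mafs, n, max_k=10)
approx = [poisson_pmf(k, lam) for k in range(11)]
```

With frequencies this low, the two lists agree closely for every count; the agreement degrades as the minor allele frequencies grow.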
Genetic architecture for susceptibility to gout in the KARE cohort study.
Shin, Jimin; Kim, Younyoung; Kong, Minyoung; Lee, Chaeyoung
2012-06-01
This study aimed to identify functional associations of cis-regulatory regions with gout susceptibility using data resulting from a genome-wide association study (GWAS), and to show a genetic architecture for gout with interaction effects among genes within each of the identified functions. The GWAS was conducted with 8314 control subjects and 520 patients with gout in the Korea Association REsource cohort. However, no individual nucleotide variant reached significance in the GWAS after Bonferroni correction for multiple testing (P>1.42 × 10(-7)). Genomic regions enrichment analysis was employed to identify functional associations of cis-regulatory regions. This analysis revealed several biological processes associated with gout susceptibility, and they were quite different from those associated with serum uric acid level. Epistasis for susceptibility to gout was estimated using entropy decomposition with selected genes within each biological process identified by the genomic regions enrichment analysis. Some epistatic effects among nucleotide sequence variants for gout susceptibility were found to be larger than their individual effects. This study provided the first evidence that genetic factors for gout susceptibility greatly differed from those for serum uric acid level, which may suggest that research endeavors for identifying genetic factors for gout susceptibility should not depend heavily on the pathogenesis of uric acid. Interaction effects between genes should be examined to explain a large portion of phenotypic variability for gout susceptibility.
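The Bonferroni threshold quoted above can be related to the number of tests with a short calculation. This sketch assumes a family-wise significance level of 0.05, which the abstract does not state, so the implied number of tests is only an inference under that assumption.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under Bonferroni correction."""
    return alpha / n_tests

def implied_number_of_tests(alpha: float, threshold: float) -> int:
    """Number of tests that would produce the given Bonferroni threshold."""
    return round(alpha / threshold)

ALPHA = 0.05                 # assumed family-wise error rate (not given in the abstract)
REPORTED_THRESHOLD = 1.42e-7  # threshold quoted in the abstract

n_tests = implied_number_of_tests(ALPHA, REPORTED_THRESHOLD)
```

Under the 0.05 assumption, the quoted threshold corresponds to roughly 350,000 tested variants.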
2012-01-01
Background We performed a genome-wide association study (GWAS) to identify common risk variants for schizophrenia. Methods The discovery scan included 1606 patients and 1794 controls from Ireland, using 6,212,339 directly genotyped or imputed single nucleotide polymorphisms (SNPs). A subset of this sample (270 cases and 860 controls) was subsequently included in the Psychiatric GWAS Consortium-schizophrenia GWAS meta-analysis. Results One hundred eight SNPs were taken forward for replication in an independent sample of 13,195 cases and 31,021 control subjects. The most significant associations, corrected for genomic inflation, were in discovery (rs204999, p combined = 1.34 × 10(-9)) and in combined samples (rs2523722, p combined = 2.88 × 10(-16)); both mapped to the major histocompatibility complex (MHC) region. We imputed classical human leukocyte antigen (HLA) alleles at the locus; the most significant finding was with HLA-C*01:02. This association was distinct from the top SNP signal. The HLA alleles DRB1*03:01 and B*08:01 were protective, replicating a previous study. Conclusions This study provides further support for involvement of MHC class I molecules in schizophrenia. We found evidence of association with previously reported risk alleles at the TCF4, VRK2, and ZNF804A loci. PMID:22883433
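The abstract does not say how the correction for genomic inflation was carried out. One common approach is genomic control, sketched below as an assumption: the inflation factor is the median observed 1-df chi-square statistic divided by the median expected under the null, and each statistic is divided by that factor. The function names and input values are hypothetical.

```python
import statistics

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364

def genomic_inflation_factor(chi2_stats) -> float:
    """Lambda: observed median test statistic over the null median."""
    return statistics.median(chi2_stats) / CHI2_1DF_MEDIAN

def correct_for_inflation(chi2_stats):
    """Divide each statistic by lambda; never inflate statistics when lambda < 1."""
    lam = max(genomic_inflation_factor(chi2_stats), 1.0)
    return [x / lam for x in chi2_stats]

# Hypothetical association statistics, for illustration only.
observed = [0.2, 0.5004, 1.3]
lam = genomic_inflation_factor(observed)
```

Clamping lambda at 1 is a conventional design choice so that a deflated scan is left unchanged.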
Goudey, Benjamin; Abedini, Mani; Hopper, John L; Inouye, Michael; Makalic, Enes; Schmidt, Daniel F; Wagner, John; Zhou, Zeyu; Zobel, Justin; Reumann, Matthias
2015-01-01
Genome-wide association studies (GWAS) are a common approach for systematic discovery of single nucleotide polymorphisms (SNPs) which are associated with a given disease. Univariate analysis approaches commonly employed may miss important SNP associations that only appear through multivariate analysis in complex diseases. However, multivariate SNP analysis is currently limited by its inherent computational complexity. In this work, we present a computational framework that harnesses supercomputers. Based on our results, we estimate a three-way interaction analysis on 1.1 million SNP GWAS data requiring over 5.8 years on the full "Avoca" IBM Blue Gene/Q installation at the Victorian Life Sciences Computation Initiative. This is hundreds of times faster than estimates for other CPU based methods and four times faster than runtimes estimated for GPU methods, indicating how the improvement in the level of hardware applied to interaction analysis may alter the types of analysis that can be performed. Furthermore, the same analysis would take under 3 months on the currently largest IBM Blue Gene/Q supercomputer "Sequoia" at the Lawrence Livermore National Laboratory assuming linear scaling is maintained as our results suggest. Given that the implementation used in this study can be further optimised, this runtime means it is becoming feasible to carry out exhaustive analysis of higher order interaction studies on large modern GWAS.
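To see why exhaustive three-way interaction analysis is so expensive, it helps to count the SNP triples involved. The sketch below computes that count for 1.1 million SNPs and shows how a runtime estimate changes under the linear-scaling assumption the abstract mentions. The speed-up factor passed to the scaling function is a hypothetical input, not a figure from the study.

```python
import math

def n_interaction_tests(n_snps: int, order: int) -> int:
    """Number of distinct SNP combinations of the given interaction order."""
    return math.comb(n_snps, order)

def runtime_under_linear_scaling(base_runtime: float, speedup: float) -> float:
    """Runtime on a larger machine, assuming runtime falls in proportion to speedup."""
    return base_runtime / speedup

# Number of three-way combinations among 1.1 million SNPs (about 2.2e17).
triples = n_interaction_tests(1_100_000, 3)
pairs = n_interaction_tests(1_100_000, 2)
```

Moving from pairs to triples multiplies the number of tests by several hundred thousand, which is why hardware scale dominates feasibility.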
Pardo, Luba M; Piras, Giovanna; Asproni, Rosanna; van der Gaag, Kristiaan J; Gabbas, Attilio; Ruiz-Linares, Andres; de Knijff, Peter; Monne, Maria; Rizzu, Patrizia; Heutink, Peter
2012-09-01
Sardinia has been used for genetic studies because of its historical isolation, genetic homogeneity and increased prevalence of certain rare diseases. Controversy remains concerning the genetic substructure and the extent of genetic homogeneity, which has implications for the design of genome-wide association studies (GWAS). We revisited this issue by examining the genetic make-up of a sample from North-East Sardinia using a dense set of autosomal, Y chromosome and mitochondrial markers to assess the potential of the sample for GWAS and fine mapping studies. We genotyped individuals for 500K single-nucleotide polymorphisms, Y chromosome markers and sequenced the mitochondrial hypervariable (HVI-HVII) regions. We identified major haplogroups and compared these with other populations. We estimated linkage disequilibrium (LD) and haplotype diversity across autosomal markers, and compared these with other populations. Our results show that within Sardinia there is no major population substructure and thus it can be considered a genetically homogenous population. We did not find substantial differences in the extent of LD in Sardinians compared with other populations. However, we showed that at least 9% of genomic regions in Sardinians differed in LD structure, which is helpful for identifying functional variants using fine mapping. We concluded that Sardinia is a powerful setting for genetic studies including GWAS and other mapping approaches.
Merriman, Tony R; Choi, Hyon K; Dalbeth, Nicola
2014-05-01
Gout results from deposition of monosodium urate (MSU) crystals. Elevated serum urate concentrations (hyperuricemia) are not sufficient for the development of disease. Genome-wide association studies (GWAS) have identified 28 loci controlling serum urate levels. The largest genetic effects are seen in genes involved in the renal excretion of uric acid, with others being involved in glycolysis. Whereas much is understood about the genetic control of serum urate levels, little is known about the genetic control of inflammatory responses to MSU crystals. Extending knowledge in this area depends on recruitment of large, clinically ascertained gout sample sets suitable for GWAS. Copyright © 2014 Elsevier Inc. All rights reserved.
USDA-ARS's Scientific Manuscript database
Fine-mapping of causal variants is becoming feasible for complex traits in livestock GWAS, as an increasing number of animals are sequenced. Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on small reference populations of sequenced animals. ...
"Good Work Awards:" Effects on Children's Families. Technical Report #12.
ERIC Educational Resources Information Center
Chun, Sherlyn; Mays, Violet
This brief report describes parental reaction to a reinforcement strategy used with children in the Kamehameha Early Education Program (KEEP). Staff members report that "Good Work Awards" (GWAs) are viewed favorably by mothers of students. GWAs are dittoed notes sent home with children when they have met a minimum criterion for daily…
Turuspekov, Yerlan; Baibulatova, Aida; Yermekbayev, Kanat; Tokhetova, Laura; Chudinov, Vladimir; Sereda, Grigoriy; Ganal, Martin; Griffiths, Simon; Abugalieva, Saule
2017-11-14
Spring wheat is the largest agricultural crop grown in Kazakhstan with an annual sowing area of 12 million hectares in 2016. Annually, the country harvests around 15 million tons of high quality grain. Despite environmental stress factors it is predicted that the use of new technologies may lead to increases in productivity from current levels of 1.5 to up to 3 tons per hectare. One way of improving wheat productivity is by the application of new genomic oriented approaches in plant breeding projects. Genome wide association studies (GWAS) are emerging as powerful tools for the understanding of the inheritance of complex traits via utilization of high throughput genotyping technologies and phenotypic assessments of plant collections. In this study, phenotyping and genotyping data on 194 spring wheat accessions from Kazakhstan, Russia, Europe, and CIMMYT were assessed for the identification of marker-trait associations (MTA) of agronomic traits by using GWAS. Field trials in Northern, Central and Southern regions of Kazakhstan using 194 spring wheat accessions revealed strong correlations of yield with booting date, plant height, biomass, number of spikes per plant, and number of kernels per spike. The accessions from Europe and CIMMYT showed high breeding potential for Southern and Central regions of the country in comparison with the performance of the local varieties. The GGE biplot method, using average yield per plant, suggested a clear separation of accessions into their three breeding origins in relationship to the three environments in which they were evaluated. The genetic variation in the three groups of accessions was further studied using 3245 polymorphic SNP (single nucleotide polymorphism) markers. The application of Principal Coordinate analysis clearly grouped the 194 accessions into three clades according to their breeding origins. GWAS on data from nine field trials allowed the identification of 114 MTAs for 12 different agronomic traits. 
Field evaluation of foreign germplasm revealed its poor yield performance in Northern Kazakhstan, which is the main wheat growing region in the country. However, it was found that EU and CIMMYT germplasm has high breeding potential to improve yield performance in Central and Southern regions. The use of Principal Coordinate analysis clearly separated the panel into three distinct groups according to their breeding origin. GWAS based on use of the TASSEL 5.0 package allowed the identification of 114 MTAs for twelve agronomic traits. The study identifies a network of key genes for improvement of yield productivity in wheat growing regions of Kazakhstan.
Song, Minsun; Wheeler, William; Caporaso, Neil E; Landi, Maria Teresa; Chatterjee, Nilanjan
2018-03-01
Genome-wide association studies (GWAS) are now routinely imputed for untyped single nucleotide polymorphisms (SNPs) based on various powerful statistical algorithms for imputation trained on reference datasets. The use of predicted allele counts for imputed SNPs as the dosage variable is known to produce valid score test for genetic association. In this paper, we investigate how to best handle imputed SNPs in various modern complex tests for genetic associations incorporating gene-environment interactions. We focus on case-control association studies where inference for an underlying logistic regression model can be performed using alternative methods that rely on varying degree on an assumption of gene-environment independence in the underlying population. As increasingly large-scale GWAS are being performed through consortia effort where it is preferable to share only summary-level information across studies, we also describe simple mechanisms for implementing score tests based on standard meta-analysis of "one-step" maximum-likelihood estimates across studies. Applications of the methods in simulation studies and a dataset from GWAS of lung cancer illustrate ability of the proposed methods to maintain type-I error rates for the underlying testing procedures. For analysis of imputed SNPs, similar to typed SNPs, the retrospective methods can lead to considerable efficiency gain for modeling of gene-environment interactions under the assumption of gene-environment independence. Methods are made available for public use through CGEN R software package. © 2017 WILEY PERIODICALS, INC.
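The dosage variable for an imputed SNP is the predicted (expected) allele count given the imputed genotype probabilities. A minimal sketch of that calculation follows; the function name and the example probabilities are hypothetical, and the code does not reproduce any CGEN interface.

```python
def expected_dosage(p_hom_ref: float, p_het: float, p_hom_alt: float) -> float:
    """Expected count of the alternate allele from imputed genotype probabilities.

    The three probabilities are for 0, 1 and 2 copies of the allele and
    should sum to 1 (within rounding).
    """
    total = p_hom_ref + p_het + p_hom_alt
    if abs(total - 1.0) > 1e-6:
        raise ValueError("genotype probabilities must sum to 1")
    return 0.0 * p_hom_ref + 1.0 * p_het + 2.0 * p_hom_alt

# Hypothetical imputation output for one individual at one SNP.
dosage = expected_dosage(0.1, 0.3, 0.6)
```

A directly genotyped SNP is the special case where one probability equals 1, so the dosage is exactly 0, 1 or 2.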
de Tayrac, Marie; Roth, Marie-Paule; Jouanolle, Anne-Marie; Coppin, Hélène; le Gac, Gérald; Piperno, Alberto; Férec, Claude; Pelucchi, Sara; Scotet, Virginie; Bardou-Jacquet, Edouard; Ropert, Martine; Bouvet, Régis; Génin, Emmanuelle; Mosser, Jean; Deugnier, Yves
2015-03-01
Hereditary hemochromatosis (HH) is the most common form of genetic iron loading disease. It is mainly related to the homozygous C282Y/C282Y mutation in the HFE gene that is, however, a necessary but not a sufficient condition to develop clinical and even biochemical HH. This suggests that modifier genes are likely involved in the expressivity of the disease. Our aim was to identify such modifier genes. We performed a genome-wide association study (GWAS) using DNA collected from 474 unrelated C282Y homozygotes. Associations were examined for both quantitative iron burden indices and clinical outcomes with 534,213 single nucleotide polymorphisms (SNP) genotypes, with replication analyses in an independent sample of 748 C282Y homozygotes from four different European centres. One SNP met genome-wide statistical significance for association with transferrin concentration (rs3811647, GWAS p value of 7×10(-9) and replication p value of 5×10(-13)). This SNP, located within intron 11 of the TF gene, had a pleiotropic effect on serum iron (GWAS p value of 4.9×10(-6) and replication p value of 3.2×10(-6)). Both serum transferrin and iron levels were associated with serum ferritin levels, amount of iron removed and global clinical stage (p<0.01). Serum iron levels were also associated with fibrosis stage (p<0.0001). This GWAS, the largest one performed so far in unselected HFE-associated HH (HFE-HH) patients, identified the rs3811647 polymorphism in the TF gene as the only SNP significantly associated with iron metabolism through serum transferrin and iron levels. Because these two outcomes were clearly associated with the biochemical and clinical expression of the disease, an indirect link between the rs3811647 polymorphism and the phenotypic presentation of HFE-HH is likely. Copyright © 2014 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
The GTPase Activating Rap/RanGAP Domain-Like 1 Gene Is Associated with Chicken Reproductive Traits
Shen, Xu; Zeng, Hua; Xie, Liang; He, Jun; Li, Jian; Xie, Xiujuan; Luo, Chenglong; Xu, Haiping; Zhou, Min; Nie, Qinghua; Zhang, Xiquan
2012-01-01
Background Abundant evidence indicates that chicken reproduction is strictly regulated by the hypothalamic-pituitary-gonad (HPG) axis, and the genes included in the HPG axis have been studied extensively. However, the question remains as to whether any other genes outside of the HPG system are involved in regulating chicken reproduction. The present study was aimed to identify, on a genome-wide level, novel genes associated with chicken reproductive traits. Methodology/Principal Finding Suppressive subtractive hybridization (SSH), genome-wide association study (GWAS), and gene-centric GWAS were used to identify novel genes underlying chicken reproduction. Single marker-trait association analysis with a large population and allelic frequency spectrum analysis were used to confirm the effects of candidate genes. Using two full-sib Ningdu Sanhuang (NDH) chickens, GARNL1 was identified as a candidate gene involved in chicken broodiness by SSH analysis. Its expression levels in the hypothalamus and pituitary were significantly higher in brooding chickens than in non-brooding chickens. GWAS analysis with a NDH two tail sample showed that 2802 SNPs were significantly associated with egg number at 300 d of age (EN300). Among the 2802 SNPs, 2 SNPs composed a block overlapping the GARNL1 gene. The gene-centric GWAS analysis with another two tail sample of NDH showed that GARNL1 was strongly associated with EN300 and age at first egg (AFE). Single marker-trait association analysis in 1301 female NDH chickens confirmed that variation in this gene was related to EN300 and AFE. The allelic frequency spectrum of the SNP rs15700989 among 5 different populations supported the above associations. Western blotting, RT-PCR, and qPCR were used to analyze alternative splicing of the GARNL1 gene. RT-PCR detected 5 transcripts and revealed that the transcript, which has a 141 bp insertion, was expressed in a tissue-specific manner. 
Conclusions/Significance Our findings demonstrate that the GARNL1 gene contributes to chicken reproductive traits. PMID:22496769
Cericola, Fabio; Jahoor, Ahmed; Orabi, Jihad; Andersen, Jeppe R; Janss, Luc L; Jensen, Just
2017-01-01
Wheat breeding programs generate a large amount of variation which cannot be completely explored because of limited phenotyping throughput. Genomic prediction (GP) has been proposed as a new tool that provides breeding value estimates without the need to phenotype all the material produced, only a subset of it called the training population (TP). However, genotyping of all the accessions under analysis is needed and, therefore, optimizing TP size and genotyping strategy is pivotal to implementing GP in commercial breeding schemes. Here, we explored the optimum TP size and we integrated pedigree records and genome wide association studies (GWAS) results to optimize the genotyping strategy. A total of 988 advanced wheat breeding lines were genotyped with the Illumina 15K SNPs wheat chip and phenotyped across several years and locations for yield, lodging, and starch content. Cross-validation using the largest possible TP size and all the SNPs available after editing (~11k) yielded predictive abilities (rGP) ranging between 0.5 and 0.6. To explore the TP size, rGP were computed using progressively smaller TPs. These exercises showed that a TP of around 700 lines was enough to yield the highest observed rGP. Moreover, rGP were calculated by randomly reducing the number of SNPs. This showed that around 1K markers were enough to reach the highest observed rGP. GWAS was used to identify markers associated with the traits analyzed. A GWAS-based selection of SNPs resulted in increased rGP when compared with random selection, and a few hundred SNPs were sufficient to obtain the highest observed rGP. For each of these scenarios, advantages of adding the pedigree information were shown. Our results indicate that moderate TP sizes were enough to yield high rGP and that pedigree information and GWAS results can be used to greatly optimize the genotyping strategy.
A hidden two-locus disease association pattern in genome-wide association studies
2011-01-01
Background Recent association analyses in genome-wide association studies (GWAS) mainly focus on single-locus association tests (marginal tests) and two-locus interaction detection. These analysis methods have provided strong evidence of associations between genetic variants and complex diseases. However, there exists a type of association pattern, which often occurs within local regions in the genome and is unlikely to be detected by either marginal tests or interaction tests. This association pattern involves a group of correlated single-nucleotide polymorphisms (SNPs). The correlation among SNPs can lead to weak marginal effects, and interaction does not play a role in this association pattern. This phenomenon is due to the existence of unfaithfulness: the marginal effects of correlated SNPs do not express their significant joint effects faithfully due to correlation cancellation. Results In this paper, we develop a computational method to detect this association pattern masked by unfaithfulness. We have applied our method to analyze seven data sets from the Wellcome Trust Case Control Consortium (WTCCC). The analysis for each data set takes about one week to finish the examination of all pairs of SNPs. Based on the empirical results from these real data, we show that this type of association masked by unfaithfulness widely exists in GWAS. Conclusions These newly identified associations enrich the discoveries of GWAS, which may provide new insights both in the analysis of tagSNPs and in the experimental design of GWAS. Since these associations may be easily missed by existing analysis tools, we can only connect some of them to publicly available findings from other association studies. As independent data sets are limited at this moment, we also have difficulty replicating these findings. More biological implications need further investigation. Availability The software is freely available at http://bioinformatics.ust.hk/hidden_pattern_finder.zip.
PMID:21569557
Ahsan, Muhammad; Ek, Weronica E.; Karlsson, Torgny; Gyllensten, Ulf
2017-01-01
Associations between epigenetic alterations and disease status have been identified for many diseases. However, there is no strong evidence that epigenetic alterations are directly causal for disease pathogenesis. In this study, we combined SNP and DNA methylation data with measurements of protein biomarkers for cancer, inflammation or cardiovascular disease, to investigate the relative contribution of genetic and epigenetic variation on biomarker levels. A total of 121 protein biomarkers were measured and analyzed in relation to DNA methylation at 470,000 genomic positions and to over 10 million SNPs. We performed epigenome-wide association study (EWAS) and genome-wide association study (GWAS) analyses, and integrated biomarker, DNA methylation and SNP data using between 698 and 1033 samples depending on data availability for the different analyses. We identified 124 and 45 loci (Bonferroni adjusted P < 0.05) with effect sizes up to 0.22 standard units’ change per 1% change in DNA methylation levels and up to four standard units’ change per copy of the effective allele in the EWAS and GWAS respectively. Most GWAS loci were cis-regulatory whereas most EWAS loci were located in trans. Eleven EWAS loci were associated with multiple biomarkers, including one in NLRC5 associated with CXCL11, CXCL9, IL-12, and IL-18 levels. All EWAS signals that overlapped with a GWAS locus were driven by underlying genetic variants and three EWAS signals were confounded by smoking. While some cis-regulatory SNPs for biomarkers appeared to have an effect also on DNA methylation levels, cis-regulatory SNPs for DNA methylation were not observed to affect biomarker levels. We present associations between protein biomarker and DNA methylation levels at numerous loci in the genome. The associations are likely to reflect the underlying pattern of genetic variants, specific environmental exposures, or represent secondary effects to the pathogenesis of disease. PMID:28915241
Ellinghaus, David; Folseraas, Trine; Holm, Kristian; Ellinghaus, Eva; Melum, Espen; Balschun, Tobias; Laerdahl, Jon K; Shiryaev, Alexey; Gotthardt, Daniel N; Weismüller, Tobias J; Schramm, Christoph; Wittig, Michael; Bergquist, Annika; Björnsson, Einar; Marschall, Hanns-Ulrich; Vatn, Morten; Teufel, Andreas; Rust, Christian; Gieger, Christian; Wichmann, H-Erich; Runz, Heiko; Sterneck, Martina; Rupp, Christian; Braun, Felix; Weersma, Rinse K; Wijmenga, Cisca; Ponsioen, Cyriel Y; Mathew, Christopher G; Rutgeerts, Paul; Vermeire, Séverine; Schrumpf, Erik; Hov, Johannes R; Manns, Michael P; Boberg, Kirsten M; Schreiber, Stefan; Franke, Andre; Karlsen, Tom H
2013-09-01
Approximately 60%-80% of patients with primary sclerosing cholangitis (PSC) have concurrent ulcerative colitis (UC). Previous genome-wide association studies (GWAS) in PSC have detected a number of susceptibility loci that also show associations in UC and other immune-mediated diseases. We aimed to systematically compare genetic associations in PSC with genotype data in UC patients in order to detect new susceptibility loci for PSC. We performed combined analyses of GWAS for PSC and UC comprising 392 PSC cases, 987 UC cases, and 2,977 controls and followed up top association signals in an additional 1,012 PSC cases, 4,444 UC cases, and 11,659 controls. We discovered novel genome-wide significant associations with PSC at 2q37 [rs3749171 at G-protein-coupled receptor 35 (GPR35); P = 3.0 × 10^-9 in the overall study population, combined odds ratio [OR] and 95% confidence interval [CI] of 1.39 (1.24-1.55)] and at 18q21 [rs1452787 at transcription factor 4 (TCF4); P = 2.61 × 10^-8, OR (95% CI) = 0.75 (0.68-0.83)]. In addition, several suggestive PSC associations were detected. The GPR35 rs3749171 is a missense single nucleotide polymorphism resulting in a shift from threonine to methionine. Structural modeling showed that rs3749171 is located in the third transmembrane helix of GPR35 and could possibly alter efficiency of signaling through the GPR35 receptor. By refining the analysis of a PSC GWAS by parallel assessments in a UC GWAS, we were able to detect two novel risk loci at genome-wide significance levels. GPR35 shows associations in both UC and PSC, whereas TCF4 represents a PSC risk locus not associated with UC. Both loci may represent previously unexplored aspects of PSC pathogenesis. Copyright © 2012 American Association for the Study of Liver Diseases.
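For readers unfamiliar with how an odds ratio and its 95% confidence interval are obtained, here is a minimal Python sketch using the textbook Wald interval on a 2x2 table of allele counts. This is an assumption made for illustration: the abstract does not say how its ORs were estimated (a regression model is more likely in practice), and the counts below are invented.

```python
import math


def odds_ratio_with_ci(case_alt, case_ref, ctrl_alt, ctrl_ref, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 allele-count table."""
    odds_ratio = (case_alt * ctrl_ref) / (case_ref * ctrl_alt)
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log_or = math.sqrt(1 / case_alt + 1 / case_ref + 1 / ctrl_alt + 1 / ctrl_ref)
    log_or = math.log(odds_ratio)
    lower = math.exp(log_or - z * se_log_or)
    upper = math.exp(log_or + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical allele counts, not taken from the study.
or_value, ci_low, ci_high = odds_ratio_with_ci(20, 10, 10, 20)
```

An interval that excludes 1, as both reported intervals do, indicates an association at the chosen confidence level.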
Re-Ranking Sequencing Variants in the Post-GWAS Era for Accurate Causal Variant Identification
Faye, Laura L.; Machiela, Mitchell J.; Kraft, Peter; Bull, Shelley B.; Sun, Lei
2013-01-01
Next generation sequencing has dramatically increased our ability to localize disease-causing variants by providing base-pair level information at costs increasingly feasible for the large sample sizes required to detect complex-trait associations. Yet, identification of causal variants within an established region of association remains a challenge. Counter-intuitively, certain factors that increase power to detect an associated region can decrease power to localize the causal variant. First, combining GWAS with imputation or low coverage sequencing to achieve the large sample sizes required for high power can have the unintended effect of producing differential genotyping error among SNPs. This tends to bias the relative evidence for association toward better genotyped SNPs. Second, re-use of GWAS data for fine-mapping exploits previous findings to ensure genome-wide significance in GWAS-associated regions. However, using GWAS findings to inform fine-mapping analysis can bias evidence away from the causal SNP toward the tag SNP and SNPs in high LD with the tag. Together these factors can reduce power to localize the causal SNP by more than half. Other strategies commonly employed to increase power to detect association, namely increasing sample size and using higher density genotyping arrays, can, in certain common scenarios, actually exacerbate these effects and further decrease power to localize causal variants. We develop a re-ranking procedure that accounts for these adverse effects and substantially improves the accuracy of causal SNP identification, often doubling the probability that the causal SNP is top-ranked. Application to the NCI BPC3 aggressive prostate cancer GWAS with imputation meta-analysis identified a new top SNP at 2 of 3 associated loci and several additional possible causal SNPs at these loci that may have otherwise been overlooked. This method is simple to implement using R scripts provided on the author's website. PMID:23950724
Characterizing Genetic Risk at Known Prostate Cancer Susceptibility Loci in African Americans
Haiman, Christopher A.; Chen, Gary K.; Blot, William J.; Strom, Sara S.; Berndt, Sonja I.; Kittles, Rick A.; Rybicki, Benjamin A.; Isaacs, William B.; Ingles, Sue A.; Stanford, Janet L.; Diver, W. Ryan; Witte, John S.; Chanock, Stephen J.; Kolb, Suzanne; Signorello, Lisa B.; Yamamura, Yuko; Neslund-Dudas, Christine; Thun, Michael J.; Murphy, Adam; Casey, Graham; Sheng, Xin; Wan, Peggy; Pooler, Loreall C.; Monroe, Kristine R.; Waters, Kevin M.; Le Marchand, Loic; Kolonel, Laurence N.; Stram, Daniel O.; Henderson, Brian E.
2011-01-01
GWAS of prostate cancer have been remarkably successful in revealing common genetic variants and novel biological pathways that are linked with its etiology. A more complete understanding of inherited susceptibility to prostate cancer in the general population will come from continuing such discovery efforts and from testing known risk alleles in diverse racial and ethnic groups. In this large study of prostate cancer in African American men (3,425 prostate cancer cases and 3,290 controls), we tested 49 risk variants located in 28 genomic regions identified through GWAS in men of European and Asian descent, and we replicated associations (at p≤0.05) with roughly half of these markers. Through fine-mapping, we identified nearby markers in many regions that better define associations in African Americans. At 8q24, we found 9 variants (p≤6×10^-4) that best capture risk of prostate cancer in African Americans, many of which are more common in men of African than European descent. The markers found to be associated with risk at each locus improved risk modeling in African Americans (per allele OR = 1.17) over the alleles reported in the original GWAS (OR = 1.08). In summary, in this detailed analysis of the prostate cancer risk loci reported from GWAS, we have validated and improved upon markers of risk in some regions that better define the association with prostate cancer in African Americans. Our findings with variants at 8q24 also reinforce the importance of this region as a major risk locus for prostate cancer in men of African ancestry. PMID:21637779
Zhang, Mingming; Mu, Hongbo; Shang, Zhenwei; Kang, Kai; Lv, Hongchao; Duan, Lian; Li, Jin; Chen, Xinren; Teng, Yanbo; Jiang, Yongshuai; Zhang, Ruijie
2017-01-06
Parkinson's disease (PD) is the second most common neurodegenerative disease. PD is generally believed to be influenced by both genetic and environmental factors, but its precise pathogenesis is unknown to date. In this study, we performed a pathway analysis based on genome-wide association study (GWAS) data to detect risk pathways of PD in three GWAS datasets. We first mapped all SNP markers to autosomal genes in each GWAS dataset. Then, we evaluated gene risk values using the minimum P-value of the tagSNPs. We took a pathway as a unit to identify the risk pathways based on the cumulative risks of the genes in the pathway. Finally, we combined the analysis results of the three datasets to detect the high-risk pathways associated with PD. Five pathways were shared by all three datasets, and a further five pathways were shared by two datasets. Most of these pathways are associated with the nervous system. Five pathways had been reported to be PD-related pathways in the previous literature. Our findings also implied that there was a close association between immune response and PD. Continued investigation of these pathways will further help us explain the pathogenesis of PD. Copyright © 2016. Published by Elsevier Ltd.
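The gene-level step described above (assigning each gene the minimum P value of the SNPs mapped to it) can be sketched in Python as follows. The function name, SNP identifiers, and gene labels are hypothetical, and the pathway-level "cumulative risk" step is left out because the abstract does not specify how it was computed.

```python
def gene_min_p(snp_p_values, snp_to_gene):
    """Assign each gene the smallest P value among the SNPs mapped to it."""
    gene_scores = {}
    for snp, p in snp_p_values.items():
        gene = snp_to_gene.get(snp)
        if gene is None:
            continue  # SNP does not map to any gene; skip it
        if gene not in gene_scores or p < gene_scores[gene]:
            gene_scores[gene] = p
    return gene_scores


# Made-up identifiers and P values, for illustration only.
p_values = {"snp1": 0.03, "snp2": 0.0004, "snp3": 0.2, "snp4": 0.01}
mapping = {"snp1": "GENE_A", "snp2": "GENE_A", "snp3": "GENE_B"}
scores = gene_min_p(p_values, mapping)
```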
Litchfield, K; Shipley, J; Turnbull, C
2015-01-01
Testicular germ cell tumour (TGCT) is the most common cause of cancer in young men (aged 15-45 years) in many populations. Multiple genome-wide association studies (GWAS) of TGCT have now been conducted, yielding over 25 disease-associated single-nucleotide polymorphisms (SNPs) at 19 independent loci. The genes at these loci have provided rich biological and genetic insight into possible mechanisms underlying testicular germ cell oncogenesis. In this review, we summarize these mechanisms, which can be grouped into five distinct categories: KIT/KITLG signalling, other pathways of male germ cell development/differentiation, telomerase function, microtubule assembly and DNA damage repair. The TGCT risk markers identified through GWAS include individual SNPs carrying per allele odds ratios (OR) in excess of 2.5. These ORs are among the highest reported in GWAS of any cancer type, hence suggesting a potential clinical utility in risk determination. Here, we present an analysis of such an approach, using polygenic risk scores to calculate the combined effect of all risk loci on overall TGCT risk, and discuss how a potential screening strategy may fit within a broader clinical context. © 2015 American Society of Andrology and European Academy of Andrology.
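A polygenic risk score of the kind mentioned above is commonly computed as a sum of risk-allele counts weighted by log odds ratios. The minimal Python sketch below assumes that additive log-OR model, which the abstract does not spell out; the SNP labels and odds ratios are invented for illustration.

```python
import math


def polygenic_risk_score(dosages, odds_ratios):
    """Sum of risk-allele counts (0, 1 or 2) weighted by log per-allele OR."""
    # SNPs missing from `dosages` are treated as carrying zero risk alleles.
    return sum(dosages.get(snp, 0) * math.log(or_) for snp, or_ in odds_ratios.items())


# Hypothetical per-allele odds ratios and one individual's genotype.
example_ors = {"snpA": 2.0, "snpB": 1.5}
example_dosages = {"snpA": 1, "snpB": 2}
score = polygenic_risk_score(example_dosages, example_ors)
relative_odds = math.exp(score)  # odds relative to carrying no risk alleles
```

Under this model, one copy of the first allele and two of the second multiply the baseline odds by 2.0 × 1.5 × 1.5.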
Implications of discoveries from genome-wide association studies in current cardiovascular practice
Jeemon, Panniyammakal; Pettigrew, Kerry; Sainsbury, Christopher; Prabhakaran, Dorairaj; Padmanabhan, Sandosh
2011-01-01
Genome-wide association studies (GWAS) have identified several genetic variants associated with coronary heart disease (CHD), and variations in plasma lipoproteins and blood pressure (BP). Loci corresponding to CDKN2A/CDKN2B/ANRIL, MTHFD1L, CELSR2, PSRC1 and SORT1 genes have been associated with CHD, and TMEM57, DOCK7, CELSR2, APOB, ABCG5, HMGCR, TRIB1, FADS2/S3, LDLR, NCAN and TOMM40-APOE with total cholesterol. Similarly, CELSR2-PSRC1-SORT1, PCSK9, APOB, HMGCR, NCAN-CILP2-PBX4, LDLR, TOMM40-APOE, and APOC1-APOE are associated with variations in low-density lipoprotein cholesterol levels. Altogether, forty, forty three and twenty loci have been associated with high-density lipoprotein cholesterol, triglycerides and BP phenotypes, respectively. Some of these identified loci are common for all the traits, some do not map to functional genes, and some are located in genes that encode for proteins not previously known to be involved in the biological pathway of the trait. GWAS have been successful at identifying new and unexpected genetic loci common to diseases and traits, thus rapidly providing key novel insights into disease biology. Since genotype information is fixed, with minimum biological variability, it is useful in early life risk prediction. However, these variants explain only a small proportion of the observed variance of these traits. Therefore, the utility of genetic determinants in assessing risk at later stages of life has limited immediate clinical impact. The future application of genetic screening will be in identifying risk groups early in life to direct targeted preventive measures. PMID:21860704
The low single nucleotide polymorphism heritability of plasma and saliva cortisol levels.
Neumann, Alexander; Direk, Nese; Crawford, Andrew A; Mirza, Saira; Adams, Hieab; Bolton, Jennifer; Hayward, Caroline; Strachan, David P; Payne, Erin K; Smith, Jennifer A; Milaneschi, Yuri; Penninx, Brenda; Hottenga, Jouke J; de Geus, Eco; Oldehinkel, Albertine J; van der Most, Peter J; de Rijke, Yolanda; Walker, Brian R; Tiemeier, Henning
2017-11-01
Cortisol is an important stress hormone affected by a variety of biological and environmental factors, such as the circadian rhythm, exercise and psychological stress. Cortisol is mostly measured using blood or saliva samples. A number of genetic variants have been found to contribute to cortisol levels with these methods. While the effects of several specific single genetic variants are known, the joint genome-wide contribution to cortisol levels is unclear. Our aim was to estimate the amount of cortisol variance explained by common single nucleotide polymorphisms, i.e. the SNP heritability, using a variety of cortisol measures, cohorts and analysis approaches. We analyzed morning plasma (n=5705) and saliva levels (n=1717), as well as diurnal saliva levels (n=1541), in the Rotterdam Study using genomic restricted maximum likelihood estimation. Additionally, linkage disequilibrium score regression was fitted on the results of genome-wide association studies (GWAS) performed by the CORNET consortium on morning plasma cortisol (n=12,597) and saliva cortisol (n=7703). No significant SNP heritability was detected for any cortisol measure, sample or analysis approach. Point estimates ranged from 0% to 9%. Morning plasma cortisol in the CORNET cohorts, the sample with the most power, had a 6% [95%CI: 0-13%] SNP heritability. The results consistently suggest a low SNP heritability of these acute and short-term measures of cortisol. The low SNP heritability may reflect the substantial environmental and, in particular, situational component of these cortisol measures. Future GWAS will require very large sample sizes. Alternatively, more long-term cortisol measures such as hair cortisol samples are needed to discover further genetic pathways regulating cortisol concentrations. Copyright © 2017 Elsevier Ltd. All rights reserved.
Dissecting Vancomycin-Intermediate Resistance in Staphylococcus aureus Using Genome-Wide Association
Alam, Md Tauqeer; Petit, Robert A.; Crispell, Emily K.; Thornton, Timothy A.; Conneely, Karen N.; Jiang, Yunxuan; Satola, Sarah W.; Read, Timothy D.
2014-01-01
Vancomycin-intermediate Staphylococcus aureus (VISA) is currently defined as having minimal inhibitory concentration (MIC) of 4–8 µg/ml. VISA evolves through changes in multiple genetic loci with at least 16 candidate genes identified in clinical and in vitro-selected VISA strains. We report a whole-genome comparative analysis of 49 vancomycin-sensitive S. aureus and 26 VISA strains. Resistance to vancomycin was determined by broth microdilution, Etest, and population analysis profile-area under the curve (PAP-AUC). Genome-wide association studies (GWAS) of 55,977 single-nucleotide polymorphisms identified in one or more strains found one highly significant association (P = 8.78E-08) between a nonsynonymous mutation at codon 481 (H481) of the rpoB gene and increased vancomycin MIC. Additionally, we used a database of public S. aureus genome sequences to identify rare mutations in candidate genes associated with VISA. On the basis of these data, we proposed a preliminary model called ECM+RMCG for the VISA phenotype as a benchmark for future efforts. The model predicted VISA based on the presence of a rare mutation in a set of candidate genes (walKR, vraSR, graSR, and agrA) and/or three previously experimentally verified mutations (including the rpoB H481 locus) with an accuracy of 81% and a sensitivity of 73%. Further, the level of resistance measured by both Etest and PAP-AUC regressed positively with the number of mutations present in a strain. This study demonstrated 1) the power of GWAS for identifying common genetic variants associated with antibiotic resistance in bacteria and 2) that rare mutations in candidate genes, identified using large genomic data sets, can also be associated with resistance phenotypes. PMID:24787619
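The presence-based rule described above, together with the accuracy and sensitivity used to evaluate it, can be sketched in Python as follows. This is only an illustrative reading of the abstract, not the authors' ECM+RMCG implementation: the function names, the boolean flag for verified mutations, and the toy strains (including the placeholder gene "geneX") are all hypothetical.

```python
CANDIDATE_GENES = {"walKR", "vraSR", "graSR", "agrA"}  # gene set named in the abstract


def predict_visa(rare_mutation_genes, has_verified_mutation):
    """Call a strain VISA if it has a rare candidate-gene mutation or a verified one."""
    return bool(CANDIDATE_GENES & set(rare_mutation_genes)) or has_verified_mutation


def accuracy_and_sensitivity(predicted, actual):
    """Return (accuracy, sensitivity); assumes at least one truly positive strain."""
    tp = sum(p and a for p, a in zip(predicted, actual))
    tn = sum((not p) and (not a) for p, a in zip(predicted, actual))
    fn = sum((not p) and a for p, a in zip(predicted, actual))
    return (tp + tn) / len(actual), tp / (tp + fn)


# Toy strains: (genes carrying rare mutations, verified mutation present?)
strains = [({"walKR"}, False), (set(), True), (set(), False), ({"geneX"}, False)]
truth = [True, False, False, True]  # invented labels, not study data
calls = [predict_visa(genes, flag) for genes, flag in strains]
acc, sens = accuracy_and_sensitivity(calls, truth)
```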
A genome-wide investigation of food addiction.
Cornelis, Marilyn C; Flint, Alan; Field, Alison E; Kraft, Peter; Han, Jiali; Rimm, Eric B; van Dam, Rob M
2016-06-01
Evidence of parallels between drug addiction and eating behavior continues to accumulate. Genetic studies of addictive substances have yielded a number of susceptibility loci that point to common higher order genetic pathways underlying addiction. It was hypothesized that a genome-wide association study (GWAS) of food addiction would yield significant enrichment in genes and pathways linked to addiction. A GWAS of food addiction, determined by the modified Yale Food Addiction Scale (mYFAS), was conducted among 9,314 women of European ancestry, and results for enrichment of single-nucleotide polymorphisms (SNPs) (n = 44), genes (n = 238), and pathways (n = 11) implicated in drug addiction were examined. Two loci met GW-significance (P < 2.5 × 10^-8) mapping to 17q21.31 and 11q13.4 that harbor genes with no obvious roles in eating behavior. GW results were significantly enriched for gene members of the MAPK signaling pathway (P = 0.02). No candidate SNP or gene for drug addiction was significantly associated with food addiction after correction for multiple testing. In the first GWAS of mYFAS, suggestive loci worthy of further follow-up were identified, but limited support was provided for shared genetic underpinnings of food addiction and drug addiction. The latter might be due to limited study power and knowledge of the genetics of drug addiction. © 2016 The Obesity Society.
Hou, Liping; Bergen, Sarah E.; Akula, Nirmala; Song, Jie; Hultman, Christina M.; Landén, Mikael; Adli, Mazda; Alda, Martin; Ardau, Raffaella; Arias, Bárbara; Aubry, Jean-Michel; Backlund, Lena; Badner, Judith A.; Barrett, Thomas B.; Bauer, Michael; Baune, Bernhard T.; Bellivier, Frank; Benabarre, Antonio; Bengesser, Susanne; Berrettini, Wade H.; Bhattacharjee, Abesh Kumar; Biernacka, Joanna M.; Birner, Armin; Bloss, Cinnamon S.; Brichant-Petitjean, Clara; Bui, Elise T.; Byerley, William; Cervantes, Pablo; Chillotti, Caterina; Cichon, Sven; Colom, Francesc; Coryell, William; Craig, David W.; Cruceanu, Cristiana; Czerski, Piotr M.; Davis, Tony; Dayer, Alexandre; Degenhardt, Franziska; Del Zompo, Maria; DePaulo, J. Raymond; Edenberg, Howard J.; Étain, Bruno; Falkai, Peter; Foroud, Tatiana; Forstner, Andreas J.; Frisén, Louise; Frye, Mark A.; Fullerton, Janice M.; Gard, Sébastien; Garnham, Julie S.; Gershon, Elliot S.; Goes, Fernando S.; Greenwood, Tiffany A.; Grigoroiu-Serbanescu, Maria; Hauser, Joanna; Heilbronner, Urs; Heilmann-Heimbach, Stefanie; Herms, Stefan; Hipolito, Maria; Hitturlingappa, Shashi; Hoffmann, Per; Hofmann, Andrea; Jamain, Stephane; Jiménez, Esther; Kahn, Jean-Pierre; Kassem, Layla; Kelsoe, John R.; Kittel-Schneider, Sarah; Kliwicki, Sebastian; Koller, Daniel L.; König, Barbara; Lackner, Nina; Laje, Gonzalo; Lang, Maren; Lavebratt, Catharina; Lawson, William B.; Leboyer, Marion; Leckband, Susan G.; Liu, Chunyu; Maaser, Anna; Mahon, Pamela B.; Maier, Wolfgang; Maj, Mario; Manchia, Mirko; Martinsson, Lina; McCarthy, Michael J.; McElroy, Susan L.; McInnis, Melvin G.; McKinney, Rebecca; Mitchell, Philip B.; Mitjans, Marina; Mondimore, Francis M.; Monteleone, Palmiero; Mühleisen, Thomas W.; Nievergelt, Caroline M.; Nöthen, Markus M.; Novák, Tomas; Nurnberger, John I.; Nwulia, Evaristus A.; Ösby, Urban; Pfennig, Andrea; Potash, James B.; Propping, Peter; Reif, Andreas; Reininghaus, Eva; Rice, John; Rietschel, Marcella; Rouleau, Guy A.; Rybakowski, 
Janusz K.; Schalling, Martin; Scheftner, William A.; Schofield, Peter R.; Schork, Nicholas J.; Schulze, Thomas G.; Schumacher, Johannes; Schweizer, Barbara W.; Severino, Giovanni; Shekhtman, Tatyana; Shilling, Paul D.; Simhandl, Christian; Slaney, Claire M.; Smith, Erin N.; Squassina, Alessio; Stamm, Thomas; Stopkova, Pavla; Streit, Fabian; Strohmaier, Jana; Szelinger, Szabolcs; Tighe, Sarah K.; Tortorella, Alfonso; Turecki, Gustavo; Vieta, Eduard; Volkert, Julia; Witt, Stephanie H.; Wright, Adam; Zandi, Peter P.; Zhang, Peng; Zollner, Sebastian; McMahon, Francis J.
2016-01-01
Bipolar disorder (BD) is a genetically complex mental illness characterized by severe oscillations of mood and behaviour. Genome-wide association studies (GWAS) have identified several risk loci that together account for a small portion of the heritability. To identify additional risk loci, we performed a two-stage meta-analysis of >9 million genetic variants in 9,784 bipolar disorder patients and 30,471 controls, the largest GWAS of BD to date. In this study, to increase power we used ∼2,000 lithium-treated cases with a long-term diagnosis of BD from the Consortium on Lithium Genetics, excess controls, and analytic methods optimized for markers on the X-chromosome. In addition to four known loci, results revealed genome-wide significant associations at two novel loci: an intergenic region on 9p21.3 (rs12553324, P = 5.87 × 10^-9; odds ratio (OR) = 1.12) and markers within ERBB2 (rs2517959, P = 4.53 × 10^-9; OR = 1.13). No significant X-chromosome associations were detected and X-linked markers explained very little BD heritability. The results add to a growing list of common autosomal variants involved in BD and illustrate the power of comparing well-characterized cases to an excess of controls in GWAS. PMID:27329760
Genome-wide and gene-based association implicates FRMD6 in Alzheimer disease.
Hong, Mun-Gwan; Reynolds, Chandra A; Feldman, Adina L; Kallin, Mikael; Lambert, Jean-Charles; Amouyel, Philippe; Ingelsson, Erik; Pedersen, Nancy L; Prince, Jonathan A
2012-03-01
Genome-wide association studies (GWAS) that allow for allelic heterogeneity may facilitate the discovery of novel genes not detectable by models that require replication of a single variant site. One strategy to accomplish this is to focus on genes rather than markers as units of association, and so potentially capture a spectrum of causal alleles that differ across populations. Here, we conducted a GWAS of Alzheimer disease (AD) in 2,586 Swedes and performed gene-based meta-analysis with three additional studies from France, Canada, and the United States, in total encompassing 4,259 cases and 8,284 controls. Implementing a newly designed gene-based algorithm, we identified two loci apart from the region around APOE that achieved study-wide significance in combined samples, the strongest finding being for FRMD6 on chromosome 14q (P = 2.6 × 10^-14) and a weaker signal for NARS2 that is immediately adjacent to GAB2 on chromosome 11q (P = 7.8 × 10^-9). Ontology-based pathway analyses revealed significant enrichment of genes involved in glycosylation. Results suggest that gene-based approaches that accommodate allelic heterogeneity in GWAS can provide a complementary avenue for gene discovery and may help to explain a portion of the missing heritability not detectable with single nucleotide polymorphisms (SNPs) derived from marker-specific meta-analysis. © 2011 Wiley Periodicals, Inc.
Genome-wide association study of rust traits in orchardgrass using SLAF-seq technology.
Zeng, Bing; Yan, Haidong; Liu, Xinchun; Zang, Wenjing; Zhang, Ailing; Zhou, Sifan; Huang, Linkai; Liu, Jinping
2017-01-01
Orchardgrass (Dactylis glomerata L.) is a well-known perennial forage species, but rust diseases cause serious reductions in its yield and quality, and the genetic mechanisms of rust resistance in orchardgrass are not well understood. In this study, a genome-wide association study (GWAS) was performed using specific-locus amplified fragment sequencing (SLAF-seq) technology in orchardgrass. A total of 2,334,889 SLAF tags were generated to produce 2,309,777 SNPs. ADMIXTURE analysis revealed unstructured subpopulations for 33 accessions, indicating that this orchardgrass population could be used for association analysis. Linkage disequilibrium (LD) analysis revealed an average r^2 of 0.4 across all SNP pairs, indicating a high extent of LD in these samples. Through GWAS, a total of 4,604 SNPs were found to be significantly (P < 0.01) associated with the rust trait. The bulk analysis identified 5,211 SNPs related to the rust trait. Two candidate genes, cytochrome P450 and prolamin, were implicated in disease resistance through prediction of functional genes surrounding each high-quality SNP (P < 0.01) associated with rust traits based on GWAS analysis and bulk analysis. The large number of SNPs associated with rust traits and these two candidate genes may provide the basis for further research on rust resistance mechanisms and marker-assisted selection (MAS) for rust-resistant lineages.
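The pairwise LD statistic reported above can be approximated from unphased genotypes as the squared Pearson correlation between allele counts at two SNPs. The Python sketch below makes that assumption (the abstract does not say how r^2 was computed), and the genotype vectors are invented.

```python
def ld_r2(genotypes_a, genotypes_b):
    """Squared Pearson correlation between allele counts (0, 1, 2) at two SNPs."""
    n = len(genotypes_a)
    mean_a = sum(genotypes_a) / n
    mean_b = sum(genotypes_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(genotypes_a, genotypes_b))
    var_a = sum((a - mean_a) ** 2 for a in genotypes_a)
    var_b = sum((b - mean_b) ** 2 for b in genotypes_b)
    # Monomorphic SNPs (zero variance) are assumed to have been filtered out.
    return cov * cov / (var_a * var_b)


# Invented genotype vectors for four individuals.
perfect_ld = ld_r2([0, 1, 2, 1], [2, 1, 0, 1])   # alleles perfectly anti-correlated
no_ld = ld_r2([0, 0, 2, 2], [0, 2, 0, 2])        # alleles vary independently
```

Averaging this quantity over SNP pairs gives a summary comparable in spirit to the average r^2 quoted above.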
Gene-Environment Interactions in Asthma: Genetic and Epigenetic Effects.
Lee, Jong-Uk; Kim, Jeong Dong; Park, Choon-Sik
2015-07-01
Over the past three decades, a large number of genetic studies have been aimed at finding genetic variants associated with the risk of asthma, applying various genetic and genomic approaches including linkage analysis, candidate gene polymorphism studies, and genome-wide association studies (GWAS). However, contrary to general expectation, even single nucleotide polymorphisms (SNPs) discovered by GWAS failed to fully explain the heritability of asthma. Thus, application of rare allele polymorphisms in well defined phenotypes and clarification of environmental factors have been suggested to overcome the problem of 'missing' heritability. Such factors include allergens, cigarette smoke, air pollutants, and infectious agents during pre- and post-natal periods. The first and simplest interaction between a gene and the environment is a candidate interaction of both a well known gene and environmental factor in a direct physical or chemical interaction such as between CD14 and endotoxin or between HLA and allergens. Several GWAS have found environmental interactions with occupational asthma, aspirin exacerbated respiratory disease, tobacco smoke-related airway dysfunction, and farm-related atopic diseases. As one of the mechanisms behind gene-environment interaction is epigenetics, a few studies on DNA CpG methylation have been reported on subphenotypes of asthma, pitching the exciting idea that it may be possible to intervene at the junction between the genome and the environment. Epigenetic studies are starting to include data from clinical samples, which will make them another powerful tool for research on gene-environment interactions in asthma.
Mansour, Hader A; Talkowski, Michael E; Wood, Joel; Chowdari, Kodavali V; McClain, Lora; Prasad, Konasale; Montrose, Debra; Fagiolini, Andrea; Friedman, Edward S; Allen, Michael H; Bowden, Charles L; Calabrese, Joseph; El-Mallakh, Rif S; Escamilla, Michael; Faraone, Stephen V; Fossey, Mark D; Gyulai, Laszlo; Loftis, Jennifer M; Hauser, Peter; Ketter, Terence A; Marangell, Lauren B; Miklowitz, David J; Nierenberg, Andrew A; Patel, Jayendra; Sachs, Gary S; Sklar, Pamela; Smoller, Jordan W; Laird, Nan; Keshavan, Matcheri; Thase, Michael E; Axelson, David; Birmaher, Boris; Lewis, David; Monk, Tim; Frank, Ellen; Kupfer, David J; Devlin, Bernie; Nimgaonkar, Vishwajit L
2012-01-01
Objective: Published studies suggest associations between circadian gene polymorphisms and bipolar I disorder (BPI), as well as schizoaffective disorder (SZA) and schizophrenia (SZ). The results are plausible, based on prior studies of circadian abnormalities. As replications have not been attempted uniformly, we evaluated representative, common polymorphisms in all three disorders. Methods: We assayed 276 publicly available ‘tag’ single nucleotide polymorphisms (SNPs) at 21 circadian genes among 523 patients with BPI, 527 patients with SZ/SZA, and 477 screened adult controls. Detected associations were evaluated in relation to two published genome-wide association studies (GWAS). Results: Using gene-based tests, suggestive associations were noted between EGR3 and BPI (p = 0.017), and between NPAS2 and SZ/SZA (p = 0.034). Three SNPs were associated with both sets of disorders (NPAS2: rs13025524 and rs11123857; RORB: rs10491929; p < 0.05). None of the associations remained significant following corrections for multiple comparisons. Approximately 15% of the analyzed SNPs overlapped with an independent study that conducted GWAS for BPI; suggestive overlap between the GWAS analyses and ours was noted at ARNTL. Conclusions: Several suggestive, novel associations were detected with circadian genes and BPI and SZ/SZA, but the present analyses do not support associations with common polymorphisms that confer risk with odds ratios greater than 1.5. Additional analyses using adequately powered samples are warranted to further evaluate these results. PMID:19839995
TEAM: efficient two-locus epistasis tests in human genome-wide association study.
Zhang, Xiang; Huang, Shunping; Zou, Fei; Wang, Wei
2010-06-15
As a promising tool for identifying genetic markers underlying phenotypic differences, genome-wide association study (GWAS) has been extensively investigated in recent years. In GWAS, detecting epistasis (or gene-gene interaction) is preferable over single locus study since many diseases are known to be complex traits. A brute force search is infeasible for epistasis detection at the genome-wide scale because of the intensive computational burden. Existing epistasis detection algorithms are designed for datasets consisting of homozygous markers and small sample sizes. In human studies, however, the genotype may be heterozygous, and the number of individuals can be up to thousands. Thus, existing methods are not readily applicable to human datasets. In this article, we propose an efficient algorithm, TEAM, which significantly speeds up epistasis detection for human GWAS. Our algorithm is exhaustive, i.e. it does not ignore any epistatic interaction. Utilizing the minimum spanning tree structure, the algorithm incrementally updates the contingency tables for epistatic tests without scanning all individuals. Our algorithm has broader applicability and is more efficient than existing methods for large-sample studies. It supports any statistical test that is based on contingency tables, and enables control of both family-wise error rate and false discovery rate. Extensive experiments show that our algorithm only needs to examine a small portion of the individuals to update the contingency tables, and it achieves at least an order of magnitude speed up over the brute force approach.
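To make the notion of a contingency-table test for one SNP pair concrete, the Python sketch below builds the table by scanning every individual and computes a Pearson chi-square statistic. This is the brute-force baseline that the abstract contrasts with, not the TEAM algorithm itself (the minimum-spanning-tree update is not shown); function names and data are hypothetical.

```python
from collections import Counter


def two_locus_table(snp_a, snp_b, phenotype):
    """Count individuals in each (genotype_a, genotype_b, phenotype) cell."""
    return Counter(zip(snp_a, snp_b, phenotype))


def chi_square(table):
    """Pearson chi-square of genotype combination versus phenotype."""
    total = sum(table.values())
    combo_totals = Counter()   # keyed by (genotype_a, genotype_b)
    pheno_totals = Counter()   # keyed by phenotype
    for (ga, gb, ph), n in table.items():
        combo_totals[(ga, gb)] += n
        pheno_totals[ph] += n
    stat = 0.0
    for (ga, gb), row_n in combo_totals.items():
        for ph, col_n in pheno_totals.items():
            expected = row_n * col_n / total
            observed = table.get((ga, gb, ph), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


# Four invented individuals: genotype pair perfectly predicts case status.
table = two_locus_table([0, 0, 1, 1], [0, 0, 1, 1], [0, 0, 1, 1])
stat = chi_square(table)
```

Repeating this scan for every SNP pair is what makes the exhaustive search expensive, which is the cost the incremental table updates are meant to avoid.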
SQC: secure quality control for meta-analysis of genome-wide association studies.
Huang, Zhicong; Lin, Huang; Fellay, Jacques; Kutalik, Zoltán; Hubaux, Jean-Pierre
2017-08-01
Due to the limited power of small-scale genome-wide association studies (GWAS), researchers tend to collaborate and establish a larger consortium in order to perform large-scale GWAS. Genome-wide association meta-analysis (GWAMA) is a statistical tool that aims to synthesize results from multiple independent studies to increase the statistical power and reduce false-positive findings of GWAS. However, it has been demonstrated that the aggregate data of individual studies are subject to inference attacks, hence privacy concerns arise when researchers share study data in GWAMA. In this article, we propose a secure quality control (SQC) protocol, which enables checking the quality of data in a privacy-preserving way without revealing sensitive information to a potential adversary. SQC employs state-of-the-art cryptographic and statistical techniques for privacy protection. We implement the solution in a meta-analysis pipeline with real data to demonstrate the efficiency and scalability on commodity machines. The distributed execution of SQC on a cluster of 128 cores for one million genetic variants takes less than one hour, which is a modest cost considering the 10-month time span usually observed for the completion of the QC procedure that includes timing of logistics. SQC is implemented in Java and is publicly available at https://github.com/acs6610987/secureqc. Contact: jean-pierre.hubaux@epfl.ch. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved.
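As background on how a meta-analysis synthesizes per-study results, the Python sketch below shows a fixed-effect, inverse-variance weighted combination of effect estimates for a single variant. This is a common choice assumed here for illustration; the abstract does not state which model is used, and the sketch has nothing to do with the cryptographic SQC protocol itself. The effect sizes are invented.

```python
import math


def inverse_variance_meta(betas, standard_errors):
    """Fixed-effect meta-analysis: pooled effect, its standard error, and z score."""
    weights = [1.0 / (se * se) for se in standard_errors]  # precision of each study
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled_beta, pooled_se, pooled_beta / pooled_se


# Two hypothetical studies reporting the same variant.
beta, se, z = inverse_variance_meta([0.1, 0.3], [0.1, 0.1])
```

The pooled standard error is smaller than either study's own, which is the gain in power that motivates consortium-scale analysis.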
Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng
2017-08-01
Neuroticism is a fundamental personality trait with significant genetic determinants. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data from genome-wide association study (GWAS) and expression quantitative trait locus (eQTL) studies. GWAS summary data were derived from published studies of neuroticism involving a total of 170,906 subjects. An eQTL dataset containing 927,753 eQTLs was obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted with summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism-associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of the GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10^-10), MGC57346 (p value=6.92×10^-7), BLK (p value=1.01×10^-6), XKR6 (p value=1.11×10^-6), C17ORF69 (p value=1.12×10^-6) and KIAA1267 (p value=4.00×10^-6). Gene set enrichment analysis detected a significant association for the Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.
Ng, Maggie C Y; Graff, Mariaelisa; Lu, Yingchang; Justice, Anne E; Mudgal, Poorva; Liu, Ching-Ti; Young, Kristin; Yanek, Lisa R; Feitosa, Mary F; Wojczynski, Mary K; Rand, Kristin; Brody, Jennifer A; Cade, Brian E; Dimitrov, Latchezar; Duan, Qing; Guo, Xiuqing; Lange, Leslie A; Nalls, Michael A; Okut, Hayrettin; Tajuddin, Salman M; Tayo, Bamidele O; Vedantam, Sailaja; Bradfield, Jonathan P; Chen, Guanjie; Chen, Wei-Min; Chesi, Alessandra; Irvin, Marguerite R; Padhukasahasram, Badri; Smith, Jennifer A; Zheng, Wei; Allison, Matthew A; Ambrosone, Christine B; Bandera, Elisa V; Bartz, Traci M; Berndt, Sonja I; Bernstein, Leslie; Blot, William J; Bottinger, Erwin P; Carpten, John; Chanock, Stephen J; Chen, Yii-Der Ida; Conti, David V; Cooper, Richard S; Fornage, Myriam; Freedman, Barry I; Garcia, Melissa; Goodman, Phyllis J; Hsu, Yu-Han H; Hu, Jennifer; Huff, Chad D; Ingles, Sue A; John, Esther M; Kittles, Rick; Klein, Eric; Li, Jin; McKnight, Barbara; Nayak, Uma; Nemesure, Barbara; Ogunniyi, Adesola; Olshan, Andrew; Press, Michael F; Rohde, Rebecca; Rybicki, Benjamin A; Salako, Babatunde; Sanderson, Maureen; Shao, Yaming; Siscovick, David S; Stanford, Janet L; Stevens, Victoria L; Stram, Alex; Strom, Sara S; Vaidya, Dhananjay; Witte, John S; Yao, Jie; Zhu, Xiaofeng; Ziegler, Regina G; Zonderman, Alan B; Adeyemo, Adebowale; Ambs, Stefan; Cushman, Mary; Faul, Jessica D; Hakonarson, Hakon; Levin, Albert M; Nathanson, Katherine L; Ware, Erin B; Weir, David R; Zhao, Wei; Zhi, Degui; Arnett, Donna K; Grant, Struan F A; Kardia, Sharon L R; Oloapde, Olufunmilayo I; Rao, D C; Rotimi, Charles N; Sale, Michele M; Williams, L Keoki; Zemel, Babette S; Becker, Diane M; Borecki, Ingrid B; Evans, Michele K; Harris, Tamara B; Hirschhorn, Joel N; Li, Yun; Patel, Sanjay R; Psaty, Bruce M; Rotter, Jerome I; Wilson, James G; Bowden, Donald W; Cupples, L Adrienne; Haiman, Christopher A; Loos, Ruth J F; North, Kari E
2017-04-01
Genome-wide association studies (GWAS) have identified >300 loci associated with measures of adiposity including body mass index (BMI) and waist-to-hip ratio (adjusted for BMI, WHRadjBMI), but few have been identified through screening of the African ancestry genomes. We performed large scale meta-analyses and replications in up to 52,895 individuals for BMI and up to 23,095 individuals for WHRadjBMI from the African Ancestry Anthropometry Genetics Consortium (AAAGC) using 1000 Genomes phase 1 imputed GWAS to improve coverage of both common and low frequency variants in the low linkage disequilibrium African ancestry genomes. In the sex-combined analyses, we identified one novel locus (TCF7L2/HABP2) for WHRadjBMI and eight previously established loci at P < 5×10(-8): seven for BMI, and one for WHRadjBMI in African ancestry individuals. An additional novel locus (SPRYD7/DLEU2) was identified for WHRadjBMI when combined with European GWAS. In the sex-stratified analyses, we identified three novel loci for BMI (INTS10/LPL and MLC1 in men, IRX4/IRX2 in women) and four for WHRadjBMI (SSX2IP, CASC8, PDE3B and ZDHHC1/HSD11B2 in women) in individuals of African ancestry or both African and European ancestry. For four of the novel variants, the minor allele frequency was low (<5%). In the trans-ethnic fine mapping of 47 BMI loci and 27 WHRadjBMI loci that were locus-wide significant (P < 0.05 adjusted for effective number of variants per locus) from the African ancestry sex-combined and sex-stratified analyses, 26 BMI loci and 17 WHRadjBMI loci contained ≤ 20 variants in the credible sets that jointly account for 99% posterior probability of driving the associations. The lead variants in 13 of these loci had a high probability of being causal.
As compared to our previous HapMap imputed GWAS for BMI and WHRadjBMI including up to 71,412 and 27,350 African ancestry individuals, respectively, our results suggest that 1000 Genomes imputation showed modest improvement in identifying GWAS loci including low frequency variants. Trans-ethnic meta-analyses further improved fine mapping of putative causal variants in loci shared between the African and European ancestry populations.
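The notion of a credible set that jointly accounts for 99% posterior probability can be sketched as below. This is a minimal illustration, not the consortium's pipeline: it assumes a single causal variant per locus, equal prior probability for every variant, and Wakefield-style approximate Bayes factors with a hypothetical prior variance of 0.04. All function and variant names are invented for the example.

```python
import math


def log_abf(beta, se, prior_var=0.04):
    """Log approximate Bayes factor in favour of association for one variant.

    prior_var is an assumed prior variance of the true effect size.
    """
    v = se * se
    shrink = prior_var / (v + prior_var)
    z2 = (beta / se) ** 2
    return 0.5 * math.log(1.0 - shrink) + 0.5 * z2 * shrink


def credible_set(log_bfs, coverage=0.99):
    """Smallest set of variants whose posterior probabilities sum to >= coverage.

    log_bfs: dict mapping variant id -> log Bayes factor.
    Returns (list of variant ids, dict of posterior probabilities).
    """
    top = max(log_bfs.values())
    # Subtract the maximum before exponentiating to avoid overflow.
    weights = {k: math.exp(v - top) for k, v in log_bfs.items()}
    total = sum(weights.values())
    post = {k: w / total for k, w in weights.items()}
    chosen, cumulative = [], 0.0
    for variant in sorted(post, key=post.get, reverse=True):
        chosen.append(variant)
        cumulative += post[variant]
        if cumulative >= coverage:
            break
    return chosen, post


if __name__ == "__main__":
    stats = {"snp_a": (0.50, 0.10), "snp_b": (0.20, 0.10), "snp_c": (0.05, 0.10)}
    lbf = {k: log_abf(b, s) for k, (b, s) in stats.items()}
    print(credible_set(lbf))
```

A smaller credible set means the data narrow the association down to fewer candidate variants, which is why shorter linkage disequilibrium blocks help fine mapping.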
Veerkamp, Roel F; Bouwman, Aniek C; Schrooten, Chris; Calus, Mario P L
2016-12-01
Whole-genome sequence data is expected to capture genetic variation more completely than common genotyping panels. Our objective was to compare the proportion of variance explained and the accuracy of genomic prediction by using imputed sequence data or preselected SNPs from a genome-wide association study (GWAS) with imputed whole-genome sequence data. Phenotypes were available for 5503 Holstein-Friesian bulls. Genotypes were imputed up to whole-genome sequence (13,789,029 segregating DNA variants) by using run 4 of the 1000 bull genomes project. The program GCTA was used to perform GWAS for protein yield (PY), somatic cell score (SCS) and interval from first to last insemination (IFL). From the GWAS, subsets of variants were selected and genomic relationship matrices (GRM) were used to estimate the variance explained in 2087 validation animals and to evaluate the genomic prediction ability. Finally, two GRM were fitted together in several models to evaluate the effect of selected variants that were in competition with all the other variants. The GRM based on full sequence data explained only marginally more genetic variation than that based on common SNP panels: for PY, SCS and IFL, genomic heritability improved from 0.81 to 0.83, 0.83 to 0.87 and 0.69 to 0.72, respectively. Sequence data also helped to identify more variants linked to quantitative trait loci and resulted in clearer GWAS peaks across the genome. The proportion of total variance explained by the selected variants combined in a GRM was considerably smaller than that explained by all variants (less than 0.31 for all traits). When selected variants were used, accuracy of genomic predictions decreased and bias increased. Although 35 to 42 variants were detected that together explained 13 to 19% of the total variance (18 to 23% of the genetic variance) when fitted alone, there was no advantage in using dense sequence information for genomic prediction in the Holstein data used in our study. 
Detection and selection of variants within a single breed are difficult due to long-range linkage disequilibrium. Stringent selection of variants resulted in more biased genomic predictions, although this might be due to the training population being the same dataset from which the selected variants were identified.
Witt, S H; Streit, F; Jungkunz, M; Frank, J; Awasthi, S; Reinbold, C S; Treutlein, J; Degenhardt, F; Forstner, A J; Heilmann-Heimbach, S; Dietl, L; Schwarze, C E; Schendel, D; Strohmaier, J; Abdellaoui, A; Adolfsson, R; Air, T M; Akil, H; Alda, M; Alliey-Rodriguez, N; Andreassen, O A; Babadjanova, G; Bass, N J; Bauer, M; Baune, B T; Bellivier, F; Bergen, S; Bethell, A; Biernacka, J M; Blackwood, D H R; Boks, M P; Boomsma, D I; Børglum, A D; Borrmann-Hassenbach, M; Brennan, P; Budde, M; Buttenschøn, H N; Byrne, E M; Cervantes, P; Clarke, T-K; Craddock, N; Cruceanu, C; Curtis, D; Czerski, P M; Dannlowski, U; Davis, T; de Geus, E J C; Di Florio, A; Djurovic, S; Domenici, E; Edenberg, H J; Etain, B; Fischer, S B; Forty, L; Fraser, C; Frye, M A; Fullerton, J M; Gade, K; Gershon, E S; Giegling, I; Gordon, S D; Gordon-Smith, K; Grabe, H J; Green, E K; Greenwood, T A; Grigoroiu-Serbanescu, M; Guzman-Parra, J; Hall, L S; Hamshere, M; Hauser, J; Hautzinger, M; Heilbronner, U; Herms, S; Hitturlingappa, S; Hoffmann, P; Holmans, P; Hottenga, J-J; Jamain, S; Jones, I; Jones, L A; Juréus, A; Kahn, R S; Kammerer-Ciernioch, J; Kirov, G; Kittel-Schneider, S; Kloiber, S; Knott, S V; Kogevinas, M; Landén, M; Leber, M; Leboyer, M; Li, Q S; Lissowska, J; Lucae, S; Martin, N G; Mayoral-Cleries, F; McElroy, S L; McIntosh, A M; McKay, J D; McQuillin, A; Medland, S E; Middeldorp, C M; Milaneschi, Y; Mitchell, P B; Montgomery, G W; Morken, G; Mors, O; Mühleisen, T W; Müller-Myhsok, B; Myers, R M; Nievergelt, C M; Nurnberger, J I; O'Donovan, M C; Loohuis, L M O; Ophoff, R; Oruc, L; Owen, M J; Paciga, S A; Penninx, B W J H; Perry, A; Pfennig, A; Potash, J B; Preisig, M; Reif, A; Rivas, F; Rouleau, G A; Schofield, P R; Schulze, T G; Schwarz, M; Scott, L; Sinnamon, G C B; Stahl, E A; Strauss, J; Turecki, G; Van der Auwera, S; Vedder, H; Vincent, J B; Willemsen, G; Witt, C C; Wray, N R; Xi, H S; Tadic, A; Dahmen, N; Schott, B H; Cichon, S; Nöthen, M M; Ripke, S; Mobascher, A; Rujescu, D; Lieb, K; 
Roepke, S; Schmahl, C; Bohus, M; Rietschel, M
2017-06-20
Borderline personality disorder (BOR) is determined by environmental and genetic factors, and characterized by affective instability and impulsivity, diagnostic symptoms also observed in manic phases of bipolar disorder (BIP). Up to 20% of BIP patients show comorbidity with BOR. This report describes the first case-control genome-wide association study (GWAS) of BOR, performed in one of the largest BOR patient samples worldwide. The focus of our analysis was (i) to detect genes and gene sets involved in BOR and (ii) to investigate the genetic overlap with BIP. As there is considerable genetic overlap between BIP, major depression (MDD) and schizophrenia (SCZ) and a high comorbidity of BOR and MDD, we also analyzed the genetic overlap of BOR with SCZ and MDD. GWAS, gene-based tests and gene-set analyses were performed in 998 BOR patients and 1545 controls. Linkage disequilibrium score regression was used to detect the genetic overlap between BOR and these disorders. Single marker analysis revealed no significant association after correction for multiple testing. Gene-based analysis yielded two significant genes: DPYD (P=4.42 × 10(-7)) and PKP4 (P=8.67 × 10(-7)); and gene-set analysis yielded a significant finding for exocytosis (GO:0006887, P(FDR)=0.019; FDR, false discovery rate). Prior studies have implicated DPYD, PKP4 and exocytosis in BIP and SCZ. The most notable finding of the present study was the genetic overlap of BOR with BIP (r(g)=0.28 [P=2.99 × 10(-3)]), SCZ (r(g)=0.34 [P=4.37 × 10(-5)]) and MDD (r(g)=0.57 [P=1.04 × 10(-3)]). We believe our study is the first to demonstrate that BOR overlaps with BIP, MDD and SCZ on the genetic level. Whether this is confined to transdiagnostic clinical symptoms should be examined in future studies.
Wang, Xianshu; Pankratz, V Shane; Fredericksen, Zachary; Tarrell, Robert; Karaus, Mary; McGuffog, Lesley; Pharaoh, Paul D P; Ponder, Bruce A J; Dunning, Alison M; Peock, Susan; Cook, Margaret; Oliver, Clare; Frost, Debra; Sinilnikova, Olga M; Stoppa-Lyonnet, Dominique; Mazoyer, Sylvie; Houdayer, Claude; Hogervorst, Frans B L; Hooning, Maartje J; Ligtenberg, Marjolijn J; Spurdle, Amanda; Chenevix-Trench, Georgia; Schmutzler, Rita K; Wappenschmidt, Barbara; Engel, Christoph; Meindl, Alfons; Domchek, Susan M; Nathanson, Katherine L; Rebbeck, Timothy R; Singer, Christian F; Gschwantler-Kaulich, Daphne; Dressler, Catherina; Fink, Anneliese; Szabo, Csilla I; Zikan, Michal; Foretova, Lenka; Claes, Kathleen; Thomas, Gilles; Hoover, Robert N; Hunter, David J; Chanock, Stephen J; Easton, Douglas F; Antoniou, Antonis C; Couch, Fergus J
2010-07-15
Recent studies have identified single nucleotide polymorphisms (SNPs) that significantly modify breast cancer risk in BRCA1 and BRCA2 mutation carriers. Since these risk modifiers were originally identified as genetic risk factors for breast cancer in genome-wide association studies (GWASs), additional risk modifiers for BRCA1 and BRCA2 may be identified from promising signals discovered in breast cancer GWAS. A total of 350 SNPs identified as candidate breast cancer risk factors (P < 1 x 10(-3)) in two breast cancer GWAS studies were genotyped in 3451 BRCA1 and 2006 BRCA2 mutation carriers from nine centers. Associations with breast cancer risk were assessed using Cox models weighted for penetrance. Eight SNPs in BRCA1 carriers and 12 SNPs in BRCA2 carriers, representing an enrichment over the number expected, were significantly associated with breast cancer risk (P(trend) < 0.01). The minor alleles of rs6138178 in SNRPB and rs6602595 in CAMK1D displayed the strongest associations in BRCA1 carriers (HR = 0.78, 95% CI: 0.69-0.90, P(trend) = 3.6 x 10(-4) and HR = 1.25, 95% CI: 1.10-1.41, P(trend) = 4.2 x 10(-4)), whereas rs9393597 in LOC134997 and rs12652447 in FBXL7 showed the strongest associations in BRCA2 carriers (HR = 1.55, 95% CI: 1.25-1.92, P(trend) = 6 x 10(-5) and HR = 1.37, 95% CI: 1.16-1.62, P(trend) = 1.7 x 10(-4)). The magnitude and direction of the associations were consistent with the original GWAS. In subsequent risk assessment studies, the loci appeared to interact multiplicatively for breast cancer risk in BRCA1 and BRCA2 carriers. Promising candidate SNPs from GWAS were identified as modifiers of breast cancer risk in BRCA1 and BRCA2 carriers. Upon further validation, these SNPs together with other genetic and environmental factors may improve breast cancer risk assessment in these populations.
[Progress in genetic research of human height].
Chen, Kaixu; Wang, Weilan; Zhang, Fuchun; Zheng, Xiufen
2015-08-01
It is well known that both environmental and genetic factors contribute to adult height variation in the general population. However, heritability studies have shown that the variation in height is more affected by genetic factors. Height is a typical polygenic trait which has been studied by traditional linkage analysis and association analysis to identify common DNA sequence variation associated with height, but progress has been slow. More recently, with the development of genotyping and DNA sequencing technologies, tremendous achievements have been made in genetic research of human height. Hundreds of single nucleotide polymorphisms (SNPs) associated with human height have been identified and validated with the application of genome-wide association studies (GWAS) methodology, which deepens our understanding of the genetics of human growth and development and also provides a theoretical basis and reference for studying other complex human traits. In this review, we summarize recent progress in genetic research of human height and discuss problems and prospects in this research area, which may provide some insights into future genetic studies of human height.
Imaging genetics of schizophrenia in the post-GWAS era.
Arslan, Ayla
2018-01-03
Imaging genetics is a research methodology studying the effect of genetic variation on brain structure, function, behavior, and risk for psychopathology. Since the early 2000s, imaging genetics has been increasingly used in the research of schizophrenia (SZ). SZ is a severe mental disorder whose underlying neurobiology is not precisely known; however, new genetic and neurobiological data are opening new avenues. The accumulating data of genome wide association studies (GWAS) continuously decode SZ risk genes. Global neuroimaging consortia produce collections of brain phenotypes from tens of thousands of people. In this context, imaging genetics will be strategically important both for the validation and discovery of SZ related findings. Thus, the study of GWAS supported risk variants as candidate genes to validate by neuroimaging is one trend. The study of epigenetic differences in relation to variations of brain phenotypes and the study of large scale multivariate analysis of genome wide and brain wide associations are other trends. While these studies hold great potential for understanding the neurobiology of SZ, reproducibility is a major challenge, which requires standardization of study designs and compensation for methodological limitations such as sensitivity and specificity. On the other hand, advancements of neuroimaging, optical and electron microscopy along with the use of genetically encoded fluorescent probes and robust statistical approaches will not only catalyze integrative methodologies but also help to better design imaging genetics studies. In this invited paper, I will discuss the current perspective of imaging genetics and emerging opportunities of SZ research. Copyright © 2017 Elsevier Inc. All rights reserved.
Ryu, Dongchan; Ryu, Jihye; Lee, Chaeyoung
2016-05-01
A genome-wide association study (GWAS) was conducted to examine genetic associations of common autosomal nucleotide variants with sex in a Korean population with 4183 males and 4659 females. Nine genetic association signals were identified in four intragenic and five intergenic regions (P<5 × 10(-8)). Further analysis with an independent data set confirmed two intragenic association signals in the genes encoding protein phosphatase 1, regulatory subunit 12B (PPP1R12B, intron 12, rs1819043) and dynein, axonemal, heavy chain 11 (DNAH11, intron 61, rs10255013), which are directly involved in the reproductive system. This study revealed autosomal genetic variants associated with sex ratio by GWAS for the first time. This implies that genetic variants in proximity to the association signals may influence sex-specific selection and contribute to sex ratio variation. Further studies are required to reveal the mechanisms underlying sex-specific selection.
Host genetics of HIV acquisition and viral control.
Shea, Patrick R; Shianna, Kevin V; Carrington, Mary; Goldstein, David B
2013-01-01
Since the discovery of HIV as the cause of AIDS, numerous insights have been gained from studies of its natural history and epidemiology. It has become clear that there are substantial interindividual differences in the risk of HIV acquisition and course of disease. Meanwhile, the field of human genetics has undergone a series of rapid transitions that have fundamentally altered the approach to studying HIV host genetics. We aim to describe the field as it has transitioned from the era of candidate-gene studies and the era of genome-wide association studies (GWAS) to its current state in the infancy of comprehensive sequencing. In some ways the field has come full circle, having evolved from being driven almost exclusively by our knowledge of immunology, to a bias-free GWAS approach, to a point where our ability to catalogue human variation far outstrips our ability to biologically interpret it.
A meta-analysis of genome-wide association studies of asthma in Puerto Ricans
Yan, Qi; Brehm, John; Pino-Yanes, Maria; Forno, Erick; Lin, Jerome; Oh, Sam S.; Acosta-Perez, Edna; Laurie, Cathy C.; Cloutier, Michelle M.; Raby, Benjamin A.; Stilp, Adrienne M.; Sofer, Tamar; Hu, Donglei; Huntsman, Scott; Eng, Celeste S.; Conomos, Matthew P.; Rastogi, Deepa; Rice, Kenneth; Canino, Glorisa; Chen, Wei; Barr, R. Graham; Burchard, Esteban G.; Celedón, Juan C.
2017-01-01
Rationale: No genome-wide association study (GWAS) of asthma has been conducted in Puerto Ricans. Objective: To identify susceptibility genetic variants for asthma in Puerto Ricans. Methods: We conducted a meta-analysis of GWAS of asthma, including Puerto Rican participants from: GALA I-II, the Hartford-Puerto Rico Study, and the Hispanic Community Health Study. Moreover, we examined whether susceptibility loci identified in previous meta-analyses of GWAS are associated with asthma in Puerto Ricans. Results: The only locus to achieve a genome-wide significant association with asthma in an analysis of 2,144 cases and 2,893 controls was chromosome 17q21, as evidenced by our top SNP, rs907092 (OR = 0.71, P = 1.2 × 10(-12)) on IKZF3. Similar to findings in non-Puerto Ricans, SNPs in genes in the same LD block as IKZF3 (e.g. ZPBP2, ORMDL3 and GSDMB) were also significantly associated with asthma in Puerto Ricans. With regard to results from a meta-analysis in Europeans, we replicated findings for the SNP at GSDMB, but not for SNPs in any other genes. On the other hand, we replicated results from a meta-analysis of North American populations for SNPs in IL1RL1, TSLP and GSDMB but not for IL33. Conclusions: Common variants on chromosome 17q21 have the greatest effects on asthma in Puerto Ricans, a high-risk ethnic group. PMID:28461288
Morgan, Thomas M; House, John A; Cresci, Sharon; Jones, Philip; Allayee, Hooman; Hazen, Stanley L; Patel, Yesha; Patel, Riyaz S; Eapen, Danny J; Waddy, Salina P; Quyyumi, Arshed A; Kleber, Marcus E; März, Winfried; Winkelmann, Bernhard R; Boehm, Bernhard O; Krumholz, Harlan M; Spertus, John A
2011-09-29
Genome-wide association studies (GWAS) have identified new candidate genes for the occurrence of acute coronary syndrome (ACS), but possible effects of such genes on survival following ACS have yet to be investigated. We examined 95 polymorphisms in 69 distinct gene regions identified in a GWAS for premature myocardial infarction for their association with post-ACS mortality among 811 whites recruited from university-affiliated hospitals in Kansas City, Missouri. We then sought replication of a positive genetic association in a large, racially diverse cohort of myocardial infarction patients (N = 2284) using Kaplan-Meier survival analyses and Cox regression to adjust for relevant covariates. Finally, we investigated the apparent association further in 6086 additional coronary artery disease patients. After Cox adjustment for other ACS risk factors, of 95 SNPs tested in 811 whites only the association with the rs6922269 in MTHFD1L was statistically significant, with a 2.6-fold mortality hazard (P = 0.007). The recessive A/A genotype was of borderline significance in an age- and race-adjusted analysis of the entire combined cohort (N = 3095; P = 0.052), but this finding was not confirmed in independent cohorts (N = 6086). We found no support for the hypothesis that the GWAS-identified variants in this study substantially alter the probability of post-ACS survival. Large-scale, collaborative, genome-wide studies may be required in order to detect genetic variants that are robustly associated with survival in patients with coronary artery disease.
Genome-Wide Association Study of Erosive Tooth Wear in a Finnish Cohort.
Alaraudanjoki, Viivi Karoliina; Koivisto, Salla; Pesonen, Paula; Männikkö, Minna; Leinonen, Jukka; Tjäderhane, Leo; Laitala, Marja-Liisa; Lussi, Adrian; Anttonen, Vuokko Anna-Marketta
2018-06-13
Erosive tooth wear is defined as irreversible loss of dental tissues due to intrinsic or extrinsic acids, exacerbated by mechanical forces. Recent studies have suggested a higher prevalence of erosive tooth wear in males, as well as a genetic contribution to susceptibility to erosive tooth wear. Our aim was to examine erosive tooth wear by performing a genome-wide association study (GWAS) in a sample of the Northern Finland Birth Cohort 1966 (n = 1,962). Erosive tooth wear was assessed clinically using the basic erosive wear examination. A GWAS was performed for the whole sample as well as separately for males and females. We identified one genome-wide significant signal (rs11681214) in the GWAS of the whole sample near the genes PXDN and MYT1L. When the sample was stratified by sex, the strongest genome-wide significant signals were observed in or near the genes FGFR1, C8orf86, CDH4, SCD5, F2R, and ING1. Additionally, multiple suggestive association signals were detected in all GWASs performed. Many of the signals were in or near the genes putatively related to oral environment or tooth development, and some were near the regions considered to be associated with dental caries, such as 2p24, 4q21, and 13q33. Replications of these associations in other samples, as well as experimental studies to determine the biological functions of associated genetic variants, are needed. © 2018 S. Karger AG, Basel.
Genetic variants near MLST8 and DHX57 affect the epigenetic age of the cerebellum
Lu, Ake T.; Hannon, Eilis; Levine, Morgan E.; Hao, Ke; Crimmins, Eileen M.; Lunnon, Katie; Kozlenkov, Alexey; Mill, Jonathan; Dracheva, Stella; Horvath, Steve
2016-02-01
DNA methylation (DNAm) levels lend themselves to defining an epigenetic biomarker of aging known as the 'epigenetic clock'. Our genome-wide association study (GWAS) of cerebellar epigenetic age acceleration identifies five significant (P<5.0 × 10(-8)) SNPs in two loci: 2p22.1 (inside gene DHX57) and 16p13.3 near gene MLST8 (a subunit of mTOR complex 1 and 2). We find that the SNP in 16p13.3 has a cis-acting effect on the expression levels of MLST8 (P=6.9 × 10(-18)) in most brain regions. In cerebellar samples, the SNP in 2p22.1 has a cis-effect on DHX57 (P=4.4 × 10(-5)). Gene sets found by our GWAS analysis of cerebellar age acceleration exhibit significant overlap with those of Alzheimer's disease (P=4.4 × 10(-15)), age-related macular degeneration (P=6.4 × 10(-6)), and Parkinson's disease (P=2.6 × 10(-4)). Overall, our results demonstrate the utility of a new paradigm for understanding aging and age-related diseases: it will be fruitful to use epigenetic tissue age as an endophenotype in GWAS.
Alharbi, Khalid Khalaf; Ali Khan, Imran; Alotaibi, Mohammad Abdullah; Saud Aloyaid, Abdullah; Al-Basheer, Haifa Abdulaziz; Alghamdi, Naelah Abdullah; Al-Baradie, Raid Saleem; Al-Sulaiman, A M
2018-01-01
Stroke is a multifactorial, heterogeneous and heritable disorder and is considered one of the major diseases. Prior reports have used a variety of study designs, including genome-wide association studies (GWAS), replication, case-control, cross-sectional and meta-analysis studies, yet a diagnostic marker is still lacking worldwide. Few studies have been carried out in the Saudi population, and we aimed to investigate the association of single nucleotide polymorphisms (SNPs) identified through GWAS and meta-analysis studies with stroke in Saudi patients. In this case-control study, we selected 207 cases and 207 controls, balanced by gender, at King Saud University Hospital in the capital city of Saudi Arabia. A peripheral blood sample (5 mL) was collected in two different vacutainers: 3 mL of coagulated blood was used for lipid analysis (biochemical tests) and 2 mL was used for DNA analysis (molecular tests). Genomic DNA was extracted from the collected blood samples, specific primers were designed for the selected SNPs (SORT1 rs646218 and OLR1 rs11053646), PCR-RFLP was performed, and randomly selected samples were sequenced to cross-check the results. The rs646218 and rs11053646 polymorphisms were significantly associated with stroke in the allele, genotype and dominant models, both in crude odds ratio (OR) estimates and in multiple logistic regression analysis (p < 0.05). Correlation between lipid profile and genotypes confirmed a significant relation between triglycerides and the rs646218 and rs11053646 polymorphisms. The rs11053646 polymorphism was also correlated with HDLC (p = 0.04). Genotypes were compared between cases and controls within each sex (males vs. males and females vs. females); among male subjects, the rs11053646 polymorphism was associated with the dominant model and heterozygote genotypes (p < 0.05).
The results of the current study confirm that the SORT1 and OLR1 SNPs are associated with stroke in the Saudi population. These results agree with prior results documented through GWAS and meta-analyses. However, studies in other ethnic populations are still needed.
Bei, Jin-Xin; Su, Wen-Hui; Ng, Ching-Ching; Yu, Kai; Chin, Yoon-Ming; Lou, Pei-Jen; Hsu, Wan-Lun; McKay, James D; Chen, Chien-Jen; Chang, Yu-Sun; Chen, Li-Zhen; Chen, Ming-Yuan; Cui, Qian; Feng, Fu-Tuo; Feng, Qi-Shen; Guo, Yun-Miao; Jia, Wei-Hua; Khoo, Alan Soo-Beng; Liu, Wen-Sheng; Mo, Hao-Yuan; Pua, Kin-Choo; Teo, Soo-Hwang; Tse, Ka-Po; Xia, Yun-Fei; Zhang, Hongxin; Zhou, Gang-Qiao; Liu, Jian-Jun; Zeng, Yi-Xin; Hildesheim, Allan
2016-01-01
Genetic loci within the major histocompatibility complex (MHC) have been associated with nasopharyngeal carcinoma (NPC), an Epstein-Barr virus (EBV)-associated cancer, in several GWAS. Results outside this region have varied. We conducted a meta-analysis of four NPC GWAS among Chinese individuals (2,152 cases; 3,740 controls). Forty-three noteworthy findings outside the MHC region were identified and targeted for replication in a pooled analysis of four independent case-control studies across three regions in Asia (4,716 cases; 5,379 controls). A meta-analysis that combined results from the initial GWAS and replication studies was performed. In the combined meta-analysis, rs31489, located within the CLPTM1L/TERT region on chromosome 5p15.33, was strongly associated with NPC (OR = 0.81; P = 6.3 × 10(-13)). Our results also provide support for associations reported from published NPC GWAS: rs6774494 (P = 1.5 × 10(-12); located in the MECOM gene region), rs9510787 (P = 5.0 × 10(-10); located in the TNFRSF19 gene region), and rs1412829/rs4977756/rs1063192 (P = 2.8 × 10(-8), P = 7.0 × 10(-7), and P = 8.4 × 10(-7), respectively; located in the CDKN2A/B gene region). We have identified a novel association between genetic variation in the CLPTM1L/TERT region and NPC. Supporting our finding, rs31489 and other SNPs in this region have been reported to be associated with multiple cancer sites, candidate-based studies have reported associations between polymorphisms in this region and NPC, the TERT gene has been shown to be important for telomere maintenance and has been reported to be overexpressed in NPC, and an EBV protein expressed in NPC (LMP1) has been reported to modulate TERT expression/telomerase activity. Our finding suggests that factors involved in telomere length maintenance are involved in NPC pathogenesis. ©2015 American Association for Cancer Research.
2013-01-01
Background: The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus. Most methods widely implemented for such analyses are vulnerable to several key demographic factors, deliver poor statistical power for detecting genuine associations and have a high false positive rate. Here, we present a likelihood-based statistical approach that properly accounts for the non-random nature of case–control samples with regard to the genotypic distribution at the loci in the populations under study, and that confers flexibility to test for genetic association in the presence of different confounding factors such as population structure and non-randomness of samples. Results: We implemented this novel method together with several popular methods from the GWAS literature to re-analyze recently published Parkinson’s disease (PD) case–control samples. The real data analysis and computer simulation show that, compared with its rivals, the new method confers not only significantly improved statistical power for detecting associations but also robustness to the difficulties stemming from non-random sampling and genetic structure. In particular, the new method detected 44 significant SNPs within 25 chromosomal regions of size < 1 Mb, but only 6 SNPs in two of these regions were previously detected by the trend-test-based methods. It discovered two SNPs located 1.18 Mb and 0.18 Mb from the PD candidates, FGF20 and PARK8, without invoking false positive risk. Conclusions: We developed a novel likelihood-based method that provides adequate estimation of LD and other population model parameters by using case and control samples, eases the integration of these samples from multiple genetically divergent populations, and thus confers statistically robust and powerful analyses of GWAS.
On the basis of simulation studies and analysis of real datasets, we demonstrated significant improvement of the new method over the non-parametric trend test, which is the most widely implemented test in the GWAS literature. PMID:23394771
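For reference, the trend test used as the baseline above is commonly the Cochran-Armitage test for trend. The sketch below assumes additive scores (0, 1, 2) for the three genotypes; the function name and the genotype counts are hypothetical and are not taken from the Parkinson's disease data.

```python
import math


def cochran_armitage_trend(cases, controls, scores=(0, 1, 2)):
    """Cochran-Armitage trend test on genotype counts.

    cases, controls: counts for the three genotypes (aa, Aa, AA).
    scores: genotype weights; (0, 1, 2) assumes an additive model.
    Returns (chi-square statistic with 1 df, p-value).
    """
    n_cases, n_controls = sum(cases), sum(controls)
    n_total = n_cases + n_controls
    col = [r + s for r, s in zip(cases, controls)]  # genotype totals
    # Weighted difference between case and control genotype counts.
    t = sum(w * (r * n_controls - s * n_cases)
            for w, r, s in zip(scores, cases, controls))
    var = sum(w * w * c * (n_total - c) for w, c in zip(scores, col))
    for i in range(len(col)):
        for j in range(i + 1, len(col)):
            var -= 2.0 * scores[i] * scores[j] * col[i] * col[j]
    var *= n_cases * n_controls / n_total
    if var <= 0.0:
        raise ValueError("trend variance is zero; the variant is not informative")
    chi2 = t * t / var
    p = math.erfc(math.sqrt(chi2 / 2.0))  # chi-square upper tail, 1 df
    return chi2, p


if __name__ == "__main__":
    print(cochran_armitage_trend(cases=[10, 20, 30], controls=[30, 20, 10]))
```

The test makes no allowance for population structure or non-random sampling, which is the weakness the likelihood-based method is designed to address.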
Methods for meta-analysis of multiple traits using GWAS summary statistics.
Ray, Debashree; Boehnke, Michael
2018-03-01
Genome-wide association studies (GWAS) for complex diseases have focused primarily on single-trait analyses for disease status and disease-related quantitative traits. For example, GWAS on risk factors for coronary artery disease analyze genetic associations of plasma lipids such as total cholesterol, LDL-cholesterol, HDL-cholesterol, and triglycerides (TGs) separately. However, traits are often correlated and a joint analysis may yield increased statistical power for association over multiple univariate analyses. Recently several multivariate methods have been proposed that require individual-level data. Here, we develop metaUSAT (where USAT is unified score-based association test), a novel unified association test of a single genetic variant with multiple traits that uses only summary statistics from existing GWAS. Although the existing methods either perform well when most correlated traits are affected by the genetic variant in the same direction or are powerful when only a few of the correlated traits are associated, metaUSAT is designed to be robust to the association structure of correlated traits. metaUSAT does not require individual-level data and can test genetic associations of categorical and/or continuous traits. One can also use metaUSAT to analyze a single trait over multiple studies, appropriately accounting for overlapping samples, if any. metaUSAT provides an approximate asymptotic P-value for association and is computationally efficient for implementation at a genome-wide level. Simulation experiments show that metaUSAT maintains proper type-I error at low error levels. It has similar and sometimes greater power to detect association across a wide array of scenarios compared to existing methods, which are usually powerful for some specific association scenarios only. 
When applied to plasma lipids summary data from the METSIM and the T2D-GENES studies, metaUSAT detected genome-wide significant loci beyond the ones identified by univariate analyses. Evidence from larger studies suggests that the variants additionally detected by our test are, indeed, associated with lipid levels in humans. In summary, metaUSAT can provide novel insights into the genetic architecture of a common disease or traits. © 2017 WILEY PERIODICALS, INC.
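To illustrate why a joint test on summary statistics can behave differently from separate univariate tests, here is a small sketch of a two-trait omnibus chi-square statistic, z' R^{-1} z, computed from single-trait z-scores. This is not metaUSAT itself; it is a simplified stand-in, and the function name, the z-scores, and the correlation `r` (assumed to be the correlation of the z-scores under the null) are all hypothetical.

```python
import math

def two_trait_omnibus(z1, z2, r):
    """Chi-square (2 df) test of z' R^{-1} z for two correlated traits.

    z1, z2: single-trait GWAS z-scores for one variant.
    r: assumed correlation of the z-scores under the null (|r| < 1).
    """
    # Closed form of z' R^{-1} z for R = [[1, r], [r, 1]].
    stat = (z1 * z1 - 2 * r * z1 * z2 + z2 * z2) / (1 - r * r)
    p = math.exp(-stat / 2)   # exact survival function of chi-square with 2 df
    return stat, p

same_dir = two_trait_omnibus(3.0, 3.0, 0.5)   # effects agree in sign
opp_dir = two_trait_omnibus(3.0, -3.0, 0.5)   # effects disagree in sign
```

With positively correlated traits, the same pair of z-scores gives a much larger statistic when the effects point in opposite directions, which is one reason the power of a multi-trait test depends on the association structure, as the abstract notes.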
Pyun, Jung-A; Kim, Sunshin; Cho, Nam H; Koh, InSong; Lee, Jong-Young; Shin, Chol; Kwack, KyuBum
2014-05-01
The aim of this study was to identify polymorphisms and gene-gene interactions that are significantly associated with age at menarche and age at menopause in a Korean population. A total of 3,452 and 1,827 women participated in studies of age at menarche and age at natural menopause, respectively. Linear regression analyses adjusted for residence area were used to perform genome-wide association studies (GWAS), candidate gene association studies, and interactions between the candidate genes for age at menarche and age at natural menopause. In GWAS, four single nucleotide polymorphisms (SNPs; rs7528241, rs1324329, rs11597068, and rs6495785) were strongly associated with age at natural menopause (lowest P = 9.66 × 10). However, GWAS of age at menarche did not reveal any strong associations. In candidate gene association studies, SNPs with P < 0.01 were selected to test their synergistic interactions. For age at natural menopause, there was a significant interaction between intronic SNPs on ADAM metallopeptidase with thrombospondin type I motif 9 (ADAMTS9) and SMAD family member 3 (SMAD3) genes (P = 9.52 × 10). For age at menarche, there were three significant interactions between three intronic SNPs on follicle-stimulating hormone receptor (FSHR) gene and one SNP located at the 3' flanking region of insulin-like growth factor 2 receptor (IGF2R) gene (lowest P = 1.95 × 10). Novel SNPs and synergistic interactions between candidate genes are significantly associated with age at menarche and age at natural menopause in a Korean population.
Genetic susceptibility loci for subtypes of breast cancer in an African American population
Palmer, Julie R.; Ruiz-Narvaez, Edward A.; Rotimi, Charles N.; Cupples, L. Adrienne; Cozier, Yvette C.; Adams-Campbell, Lucile L.; Rosenberg, Lynn
2012-01-01
Background Most genome-wide association scans (GWAS) have been carried out in European ancestry populations; no risk variants for breast cancer have been identified solely from African ancestry GWAS data. Few GWAS hits have replicated in African ancestry populations. Methods In a nested case-control study of breast cancer in the Black Women’s Health Study (1,199 cases/1,948 controls), we evaluated index SNPs in 21 loci from GWAS of European or Asian ancestry populations, overall, in subtypes defined by estrogen (ER) and progesterone (PR) receptor status (ER+/PR+, n=336; ER−/PR−, n=229), and in triple-negative breast cancer (TNBC, N=81). To evaluate the contribution of genetic factors to population differences in breast cancer subtype, we also examined global percent African ancestry. Results Index SNPs in five loci were replicated, including three associated with ER−/PR− breast cancer (TERT rs10069690 in 5p15.33, rs704010 in 10q22.3, and rs8170 in 19p13.11): per allele odds ratios were 1.29 (95% confidence interval (CI) 1.04–1.59), p=0.02, 1.52 (95% CI 1.12–2.08), p=0.01, and 1.30 (95% CI 1.01–1.68), p=0.04, respectively. Stronger associations were observed for TNBC. Furthermore, cases in the highest quintile of percent African ancestry were three times more likely to have TNBC than ER+/PR+ cancer. Conclusions These findings provide the first confirmation of the TNBC SNP rs8170 in an African ancestry population, and independent confirmation of the TERT ER− SNP. Further, the risk of developing ER− breast cancer, particularly TNBC, increased with increasing proportion of global African ancestry. Impact The findings demonstrate the importance of genetic factors in the disproportionately high occurrence of TNBC in African American women. PMID:23136140
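The per-allele odds ratios, confidence intervals, and p-values reported above can be checked against one another. The sketch below assumes the intervals are Wald intervals on the log-odds scale (an assumption, since the abstract does not say how they were computed) and recovers the implied two-sided p-value; the function name is hypothetical.

```python
import math

def p_from_or_ci(odds_ratio, lower, upper, z_crit=1.959964):
    """Two-sided Wald p-value implied by an odds ratio and its 95% CI."""
    log_or = math.log(odds_ratio)
    # A 95% Wald interval spans 2 * z_crit standard errors on the log scale.
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = log_or / se
    return math.erfc(abs(z) / math.sqrt(2))

# Odds ratios and intervals as reported in the abstract.
reported = {
    "rs10069690": (1.29, 1.04, 1.59),
    "rs704010": (1.52, 1.12, 2.08),
    "rs8170": (1.30, 1.01, 1.68),
}
p_values = {snp: p_from_or_ci(*vals) for snp, vals in reported.items()}
```

Under that assumption the implied p-values come out close to the reported 0.02, 0.01, and 0.04, so the three reported quantities are mutually consistent.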
Wang, Quanxiu; Zhao, Hu; Jiang, Junpeng; Xu, Jiuyue; Xie, Weibo; Fu, Xiangkui; Liu, Chang; He, Yuqing; Wang, Gongwei
2017-01-01
The photoprotective processes conferred by nonphotochemical quenching (NPQ) serve fundamental roles in maintaining plant fitness and sustainable yield. So far, few loci have been reported to be involved in natural variation of NPQ capacity in rice (Oryza sativa), and the extents of variation explored are very limited. Here we conducted a genome-wide association study (GWAS) for NPQ capacity using a diverse worldwide collection of 529 O. sativa accessions. A total of 33 significant association loci were identified. To check the validity of the GWAS signals, three F2 mapping populations with parents selected from the association panel were constructed and assayed. All QTLs detected in mapping populations could correspond to at least one GWAS signal, indicating the GWAS results were quite reliable. OsPsbS1 was repeatedly detected and explained more than 40% of the variation in the whole association population in two years, and demonstrated to be a common major QTL in all three mapping populations derived from inter-group crosses. We revealed 43 single nucleotide polymorphisms (SNPs) and 7 insertions and deletions (InDels) within a 6,997-bp DNA fragment of OsPsbS1, but found no non-synonymous SNPs or InDels in the coding region, indicating the PsbS1 protein sequence is highly conserved. Haplotypes with the 2,674-bp insertion in the promoter region exhibited significantly higher NPQ values and higher expression levels of OsPsbS1. The OsPsbS1 RNAi plants and CRISPR/Cas9 mutants exhibited drastically decreased NPQ values. OsPsbS1 had specific and high-level expression in green tissues of rice. However, we didn't find significant function for OsPsbS2, the other rice PsbS homologue. Manipulation of the significant loci or candidate genes identified may enhance photoprotection and improve photosynthesis and yield in rice. PMID:29081789
Gupta, Mayetri; Cheung, Ching-Lung; Hsu, Yi-Hsiang; Demissie, Serkalem; Cupples, L Adrienne; Kiel, Douglas P; Karasik, David
2011-06-01
Genome-wide association studies (GWAS) using high-density genotyping platforms offer an unbiased strategy to identify new candidate genes for osteoporosis. It is imperative to be able to clearly distinguish signal from noise by focusing on the best phenotype in a genetic study. We performed GWAS of multiple phenotypes associated with fractures [bone mineral density (BMD), bone quantitative ultrasound (QUS), bone geometry, and muscle mass] with approximately 433,000 single-nucleotide polymorphisms (SNPs) and created a database of resulting associations. We performed analysis of GWAS data from 23 phenotypes by a novel modification of a block clustering algorithm followed by gene-set enrichment analysis. A data matrix of standardized regression coefficients was partitioned along both axes--SNPs and phenotypes. Each partition represents a distinct cluster of SNPs that have similar effects over a particular set of phenotypes. Application of this method to our data shows several SNP-phenotype connections. We found a strong cluster of association coefficients of high magnitude for 10 traits (BMD at several skeletal sites, ultrasound measures, cross-sectional bone area, and section modulus of femoral neck and shaft). These clustered traits were highly genetically correlated. Gene-set enrichment analyses indicated the augmentation of genes that cluster with the 10 osteoporosis-related traits in pathways such as aldosterone signaling in epithelial cells, role of osteoblasts, osteoclasts, and chondrocytes in rheumatoid arthritis, and Parkinson signaling. In addition to several known candidate genes, we also identified PRKCH and SCNN1B as potential candidate genes for multiple bone traits. In conclusion, our mining of GWAS results revealed the similarity of association results between bone strength phenotypes that may be attributed to pleiotropic effects of genes. 
This knowledge may prove helpful in identifying novel genes and pathways that underlie several correlated phenotypes, as well as in deciphering genetic and phenotypic modularity underlying osteoporosis risk. Copyright © 2011 American Society for Bone and Mineral Research.
Yu, Dongmei; Mathews, Carol A; Scharf, Jeremiah M; Neale, Benjamin M; Davis, Lea K; Gamazon, Eric R; Derks, Eske M; Evans, Patrick; Edlund, Christopher K; Crane, Jacquelyn; Fagerness, Jesen A; Osiecki, Lisa; Gallagher, Patience; Gerber, Gloria; Haddad, Stephen; Illmann, Cornelia; McGrath, Lauren M; Mayerfeld, Catherine; Arepalli, Sampath; Barlassina, Cristina; Barr, Cathy L; Bellodi, Laura; Benarroch, Fortu; Berrió, Gabriel Bedoya; Bienvenu, O Joseph; Black, Donald W; Bloch, Michael H; Brentani, Helena; Bruun, Ruth D; Budman, Cathy L; Camarena, Beatriz; Campbell, Desmond D; Cappi, Carolina; Silgado, Julio C Cardona; Cavallini, Maria C; Chavira, Denise A; Chouinard, Sylvain; Cook, Edwin H; Cookson, M R; Coric, Vladimir; Cullen, Bernadette; Cusi, Daniele; Delorme, Richard; Denys, Damiaan; Dion, Yves; Eapen, Valsama; Egberts, Karin; Falkai, Peter; Fernandez, Thomas; Fournier, Eduardo; Garrido, Helena; Geller, Daniel; Gilbert, Donald L; Girard, Simon L; Grabe, Hans J; Grados, Marco A; Greenberg, Benjamin D; Gross-Tsur, Varda; Grünblatt, Edna; Hardy, John; Heiman, Gary A; Hemmings, Sian M J; Herrera, Luis D; Hezel, Dianne M; Hoekstra, Pieter J; Jankovic, Joseph; Kennedy, James L; King, Robert A; Konkashbaev, Anuar I; Kremeyer, Barbara; Kurlan, Roger; Lanzagorta, Nuria; Leboyer, Marion; Leckman, James F; Lennertz, Leonhard; Liu, Chunyu; Lochner, Christine; Lowe, Thomas L; Lupoli, Sara; Macciardi, Fabio; Maier, Wolfgang; Manunta, Paolo; Marconi, Maurizio; McCracken, James T; Mesa Restrepo, Sandra C; Moessner, Rainald; Moorjani, Priya; Morgan, Jubel; Muller, Heike; Murphy, Dennis L; Naarden, Allan L; Nurmi, Erika; Ochoa, William Cornejo; Ophoff, Roel A; Pakstis, Andrew J; Pato, Michele T; Pato, Carlos N; Piacentini, John; Pittenger, Christopher; Pollak, Yehuda; Rauch, Scott L; Renner, Tobias; Reus, Victor I; Richter, Margaret A; Riddle, Mark A; Robertson, Mary M; Romero, Roxana; Rosário, Maria C; Rosenberg, David; Ruhrmann, Stephan; Sabatti, Chiara; Salvi, Erika; Sampaio, 
Aline S; Samuels, Jack; Sandor, Paul; Service, Susan K; Sheppard, Brooke; Singer, Harvey S; Smit, Jan H; Stein, Dan J; Strengman, Eric; Tischfield, Jay A; Turiel, Maurizio; Valencia Duarte, Ana V; Vallada, Homero; Veenstra-VanderWeele, Jeremy; Walitza, Susanne; Wang, Ying; Weale, Mike; Weiss, Robert; Wendland, Jens R; Westenberg, Herman G M; Shugart, Yin Yao; Hounie, Ana G; Miguel, Euripedes C; Nicolini, Humberto; Wagner, Michael; Ruiz-Linares, Andres; Cath, Danielle C; McMahon, William; Posthuma, Danielle; Oostra, Ben A; Nestadt, Gerald; Rouleau, Guy A; Purcell, Shaun; Jenike, Michael A; Heutink, Peter; Hanna, Gregory L; Conti, David V; Arnold, Paul D; Freimer, Nelson B; Stewart, S Evelyn; Knowles, James A; Cox, Nancy J; Pauls, David L
2015-01-01
Obsessive-compulsive disorder (OCD) and Tourette's syndrome are highly heritable neurodevelopmental disorders that are thought to share genetic risk factors. However, the identification of definitive susceptibility genes for these etiologically complex disorders remains elusive. The authors report a combined genome-wide association study (GWAS) of Tourette's syndrome and OCD. The authors conducted a GWAS in 2,723 cases (1,310 with OCD, 834 with Tourette's syndrome, 579 with OCD plus Tourette's syndrome/chronic tics), 5,667 ancestry-matched controls, and 290 OCD parent-child trios. GWAS summary statistics were examined for enrichment of functional variants associated with gene expression levels in brain regions. Polygenic score analyses were conducted to investigate the genetic architecture within and across the two disorders. Although no individual single-nucleotide polymorphisms (SNPs) achieved genome-wide significance, the GWAS signals were enriched for SNPs strongly associated with variations in brain gene expression levels (expression quantitative trait loci, or eQTLs), suggesting the presence of true functional variants that contribute to risk of these disorders. Polygenic score analyses identified a significant polygenic component for OCD (p=2×10^-4), predicting 3.2% of the phenotypic variance in an independent data set. In contrast, Tourette's syndrome had a smaller, nonsignificant polygenic component, predicting only 0.6% of the phenotypic variance (p=0.06). No significant polygenic signal was detected across the two disorders, although the sample is likely underpowered to detect a modest shared signal. Furthermore, the OCD polygenic signal was significantly attenuated when cases with both OCD and co-occurring Tourette's syndrome/chronic tics were included in the analysis (p=0.01). Previous work has shown that Tourette's syndrome and OCD have some degree of shared genetic variation. 
However, the data from this study suggest that there are also distinct components to the genetic architectures of these two disorders. Furthermore, OCD with co-occurring Tourette's syndrome/chronic tics may have different underlying genetic susceptibility compared with OCD alone.
Earp, Madalene A.; Kelemen, Linda E.; Magliocco, Anthony M.; Swenerton, Kenneth D.; Chenevix–Trench, Georgia; Lu, Yi; Hein, Alexander; Ekici, Arif B.; Beckmann, Matthias W.; Fasching, Peter A.; Lambrechts, Diether; Despierre, Evelyn; Vergote, Ignace; Lambrechts, Sandrina; Doherty, Jennifer A.; Rossing, Mary Anne; Chang-Claude, Jenny; Rudolph, Anja; Friel, Grace; Moysich, Kirsten B.; Odunsi, Kunle; Sucheston-Campbell, Lara; Lurie, Galina; Goodman, Marc T.; Carney, Michael E.; Thompson, Pamela J.; Runnebaum, Ingo B.; Dürst, Matthias; Hillemanns, Peter; Dörk, Thilo; Antonenkova, Natalia; Bogdanova, Natalia; Leminen, Arto; Nevanlinna, Heli; Pelttari, Liisa M.; Butzow, Ralf; Bunker, Clareann H.; Modugno, Francesmary; Edwards, Robert P.; Ness, Roberta B.; du Bois, Andreas; Heitz, Florian; Schwaab, Ira; Harter, Philipp; Karlan, Beth Y.; Walsh, Christine; Lester, Jenny; Jensen, Allan; Kjær, Susanne K.; Høgdall, Claus K.; Høgdall, Estrid; Lundvall, Lene; Sellers, Thomas A.; Fridley, Brooke L.; Goode, Ellen L.; Cunningham, Julie M.; Vierkant, Robert A.; Giles, Graham G.; Baglietto, Laura; Severi, Gianluca; Southey, Melissa C.; Liang, Dong; Wu, Xifeng; Lu, Karen; Hildebrandt, Michelle A.T.; Levine, Douglas A.; Bisogna, Maria; Schildkraut, Joellen M.; Iversen, Edwin S.; Weber, Rachel Palmieri; Berchuck, Andrew; Cramer, Daniel W.; Terry, Kathryn L.; Poole, Elizabeth M.; Tworoger, Shelley S.; Bandera, Elisa V.; Chandran, Urmila; Orlow, Irene; Olson, Sara H.; Wik, Elisabeth; Salvesen, Helga B.; Bjorge, Line; Halle, Mari K.; van Altena, Anne M.; Aben, Katja K.H.; Kiemeney, Lambertus A.; Massuger, Leon F.A.G.; Pejovic, Tanja; Bean, Yukie T.; Cybulski, Cezary; Gronwald, Jacek; Lubinski, Jan; Wentzensen, Nicolas; Brinton, Louise A.; Lissowska, Jolanta; Garcia–Closas, Montserrat; Dicks, Ed; Dennis, Joe; Easton, Douglas F.; Song, Honglin; Tyrer, Jonathan P.; Pharoah, Paul D. 
P.; Eccles, Diana; Campbell, Ian G.; Whittemore, Alice S.; McGuire, Valerie; Sieh, Weiva; Rothstein, Joseph H.; Flanagan, James M.; Paul, James; Brown, Robert; Phelan, Catherine M.; Risch, Harvey A.; McLaughlin, John R.; Narod, Steven A.; Ziogas, Argyrios; Anton-Culver, Hoda; Gentry-Maharaj, Aleksandra; Menon, Usha; Gayther, Simon A.; Ramus, Susan J.; Wu, Anna H.; Pearce, Celeste L.; Pike, Malcolm C.; Dansonka-Mieszkowska, Agnieszka; Rzepecka, Iwona K; Szafron, Lukasz M; Kupryjanczyk, Jolanta; Cook, Linda S.; Le, Nhu D.; Brooks–Wilson, Angela
2014-01-01
Epithelial ovarian cancer (EOC) is a heterogeneous cancer with both genetic and environmental risk factors. Variants influencing the risk of developing the less-common EOC subtypes have not been fully investigated. We performed a genome-wide association study (GWAS) of EOC according to subtype by pooling genomic DNA from 545 cases and 398 controls of European descent, and testing for allelic associations. We evaluated for replication 188 variants from the GWAS (56 variants for mucinous, 55 for endometrioid and clear cell, 53 for low malignant potential (LMP) serous, and 24 for invasive serous EOC), selected using pre-defined criteria. Genotypes from 13,188 cases and 23,164 controls of European descent were used to perform unconditional logistic regression under the log-additive genetic model; odds ratios (OR) and 95% confidence intervals are reported. Nine variants tagging 6 loci were associated with subtype-specific EOC risk at P<0.05, and had an OR that agreed in direction of effect with the GWAS results. Several of these variants are in or near genes with a biological rationale for conferring EOC risk, including ZFP36L1 and RAD51B for mucinous EOC (rs17106154, OR=1.17, P=0.029, n=1,483 cases), GRB10 for endometrioid and clear cell EOC (rs2190503, P=0.014, n=2,903 cases), and C22orf26/BPIL2 for LMP serous EOC (rs9609538, OR=0.86, P=0.0043, n=892 cases). In analyses that included the 75 GWAS samples, the association between rs9609538 (OR=0.84, P=0.0007) and LMP serous EOC risk remained statistically significant at P<0.0012 adjusted for multiple testing. Replication in additional samples will be important to verify these results for the less-common EOC subtypes. PMID:24190013
Elliott, Katherine S; Chapman, Kay; Day-Williams, Aaron; Panoutsopoulou, Kalliope; Southam, Lorraine; Lindgren, Cecilia M; Arden, Nigel; Aslam, Nadim; Birrell, Fraser; Carluke, Ian; Carr, Andrew; Deloukas, Panos; Doherty, Michael; Loughlin, John; McCaskie, Andrew; Ollier, William E R; Rai, Ashok; Ralston, Stuart; Reed, Mike R; Spector, Timothy D; Valdes, Ana M; Wallis, Gillian A; Wilkinson, Mark; Zeggini, Eleftheria
2013-06-01
Obesity as measured by body mass index (BMI) is one of the major risk factors for osteoarthritis. In addition, genetic overlap has been reported between osteoarthritis and normal adult height variation. We investigated whether this relationship is due to a shared genetic aetiology on a genome-wide scale. We compared genetic association summary statistics (effect size, p value) for BMI and height from the GIANT consortium genome-wide association study (GWAS) with genetic association summary statistics from the arcOGEN consortium osteoarthritis GWAS. Significance was evaluated by permutation. Replication of osteoarthritis association of the highlighted signals was investigated in an independent dataset. Phenotypic information of height and BMI was accounted for in a separate analysis using osteoarthritis-free controls. We found significant overlap between osteoarthritis and height (p=3.3×10^-5 for signals with p≤0.05) when the GIANT and arcOGEN GWAS were compared. For signals with p≤0.001 we found 17 shared signals between osteoarthritis and height and four between osteoarthritis and BMI. However, only one of the height or BMI signals that had shown evidence of association with osteoarthritis in the arcOGEN GWAS was also associated with osteoarthritis in the independent dataset: rs12149832, within the FTO gene (combined p=2.3×10^-5). As expected, this signal was attenuated when we adjusted for BMI. We found a significant excess of shared signals between both osteoarthritis and height and osteoarthritis and BMI, suggestive of a common genetic aetiology. However, only one signal showed association with osteoarthritis when followed up in a new dataset.
Bernatsky, S; Easton, D F; Dunning, A; Michailidou, K; Ramsey-Goldman, R; Gordon, C; Clarke, A E; Foulkes, W
2012-07-01
Recent work has demonstrated an important decrease in breast cancers for women with systemic lupus erythematosus (SLE). The reason behind this phenomenon is unknown. Our purpose was to explore whether the single nucleotide polymorphisms (SNPs) predisposing to SLE might be protective against breast cancer (in women in the general population). We focused on loci relevant to 10 SNPs associated with SLE (with a p value of <10^-9). We determined whether we could establish a decreased frequency of these SNPs in breast cancer cases versus controls, within the general population. To do this we used a large breast cancer genome-wide association study (GWAS) dataset, involving 3,659 breast cancer cases and 4,897 controls. These subjects were all primarily of European ancestry. The population-based GWAS breast cancer data we examined suggested little evidence for important associations between breast cancer and SLE-related SNPs. Within the general population GWAS data, a cytosine (C) nucleotide substitution at rs9888739 (on chromosome 16p11.2) showed a very weak inverse association with breast cancer. The odds ratio (OR) for the rs9888739-C allele was 0.907551 (p value 0.049899) in the GWAS breast cancer sample, compared to controls. There was a slightly stronger, positive, association with breast cancer for rs6445975-G (guanine) on chromosome 3p14.3, with a breast cancer OR of 1.0911 (p value 0.0097). Within this large breast cancer dataset, we did not demonstrate important associations with 10 lupus-associated SNPs. If decreased breast cancer risk in SLE is influenced by genetic profiles, this may be due to complex interactions and/or epigenetic factors.
Elliott, Katherine S; Chapman, Kay; Day-Williams, Aaron; Panoutsopoulou, Kalliope; Southam, Lorraine; Lindgren, Cecilia M; Arden, Nigel; Aslam, Nadim; Birrell, Fraser; Carluke, Ian; Carr, Andrew; Deloukas, Panos; Doherty, Michael; Loughlin, John; McCaskie, Andrew; Ollier, William E R; Rai, Ashok; Ralston, Stuart; Reed, Mike R; Spector, Timothy D; Valdes, Ana M; Wallis, Gillian A; Wilkinson, Mark; Zeggini, Eleftheria
2013-01-01
Objectives Obesity as measured by body mass index (BMI) is one of the major risk factors for osteoarthritis. In addition, genetic overlap has been reported between osteoarthritis and normal adult height variation. We investigated whether this relationship is due to a shared genetic aetiology on a genome-wide scale. Methods We compared genetic association summary statistics (effect size, p value) for BMI and height from the GIANT consortium genome-wide association study (GWAS) with genetic association summary statistics from the arcOGEN consortium osteoarthritis GWAS. Significance was evaluated by permutation. Replication of osteoarthritis association of the highlighted signals was investigated in an independent dataset. Phenotypic information of height and BMI was accounted for in a separate analysis using osteoarthritis-free controls. Results We found significant overlap between osteoarthritis and height (p=3.3×10^-5 for signals with p≤0.05) when the GIANT and arcOGEN GWAS were compared. For signals with p≤0.001 we found 17 shared signals between osteoarthritis and height and four between osteoarthritis and BMI. However, only one of the height or BMI signals that had shown evidence of association with osteoarthritis in the arcOGEN GWAS was also associated with osteoarthritis in the independent dataset: rs12149832, within the FTO gene (combined p=2.3×10^-5). As expected, this signal was attenuated when we adjusted for BMI. Conclusions We found a significant excess of shared signals between both osteoarthritis and height and osteoarthritis and BMI, suggestive of a common genetic aetiology. However, only one signal showed association with osteoarthritis when followed up in a new dataset. PMID:22956599
A Population Genetic Signal of Polygenic Adaptation
Berg, Jeremy J.; Coop, Graham
2014-01-01
Adaptation in response to selection on polygenic phenotypes may occur via subtle allele frequency shifts at many loci. Current population genomic techniques are not well posed to identify such signals. In the past decade, detailed knowledge about the specific loci underlying polygenic traits has begun to emerge from genome-wide association studies (GWAS). Here we combine this knowledge from GWAS with robust population genetic modeling to identify traits that may have been influenced by local adaptation. We exploit the fact that GWAS provide an estimate of the additive effect size of many loci to estimate the mean additive genetic value for a given phenotype across many populations as simple weighted sums of allele frequencies. We use a general model of neutral genetic value drift for an arbitrary number of populations with an arbitrary relatedness structure. Based on this model, we develop methods for detecting unusually strong correlations between genetic values and specific environmental variables, as well as a generalization of QST/FST comparisons to test for over-dispersion of genetic values among populations. Finally, we lay out a framework to identify the individual populations or groups of populations that contribute to the signal of over-dispersion. These tests have considerably greater power than their single-locus equivalents because they look for positive covariance between like-effect alleles, and they also significantly outperform methods that do not account for population structure. We apply our tests to the Human Genome Diversity Panel (HGDP) dataset using GWAS data for height, skin pigmentation, type 2 diabetes, body mass index, and two inflammatory bowel disease datasets. This analysis uncovers a number of putative signals of local adaptation, and we discuss the biological interpretation and caveats of these results. PMID:25102153
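The "simple weighted sums of allele frequencies" mentioned in this abstract can be sketched as follows. The example assumes a diploid organism (hence the factor of 2) and purely additive effects; the function name, effect sizes, and population frequencies are hypothetical and are not taken from the paper's data.

```python
def genetic_values(effects, freqs_by_pop):
    """Estimated mean additive genetic value per population.

    effects: GWAS additive effect size of the trait-increasing allele per locus.
    freqs_by_pop: mapping of population name to that allele's frequency per locus.
    The factor 2 assumes diploid individuals (an assumption of this sketch).
    """
    values = {}
    for pop, freqs in freqs_by_pop.items():
        if len(freqs) != len(effects):
            raise ValueError(f"{pop}: expected {len(effects)} frequencies")
        values[pop] = 2 * sum(a * p for a, p in zip(effects, freqs))
    return values

# Hypothetical two-locus example.
values = genetic_values(
    effects=[0.5, -0.25],
    freqs_by_pop={"pop_A": [0.5, 0.5], "pop_B": [0.25, 1.0]},
)
```

The tests described in the abstract then ask whether such values differ among populations more than a neutral drift model with the observed relatedness structure would predict; that step is not shown here.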
Baker, Lauren A.; Kirkpatrick, Brian; Rosa, Guilherme J. M.; Gianola, Daniel; Valente, Bruno; Sumner, Julia P.; Baltzer, Wendy; Hao, Zhengling; Binversie, Emily E.; Volstad, Nicola; Piazza, Alexander; Sample, Susannah J.
2017-01-01
Anterior cruciate ligament (ACL) rupture is a common condition that can be devastating and life changing, particularly in young adults. A non-contact mechanism is typical. Second ACL ruptures, through rupture of the contralateral ACL or rupture of a graft repair, are also common. Risk of rupture is increased in females. ACL rupture is also common in dogs. Disease prevalence exceeds 5% in several dog breeds, ~100-fold higher than in human beings. We provide insight into the genetic etiology of ACL rupture by genome-wide association study (GWAS) in a high-risk breed using 98 case and 139 control Labrador Retrievers. We identified 129 single nucleotide polymorphisms (SNPs) within 99 risk loci. Associated loci (P<5E-04) explained approximately half of phenotypic variance in the ACL rupture trait. Two of these loci were located in uncharacterized or non-coding regions of the genome. A chromosome 24 locus containing nine genes with diverse functions met genome-wide significance (P = 3.63E-06). GWAS pathways were enriched for c-type lectins, a gene set that includes aggrecan, a gene set encoding antimicrobial proteins, and a gene set encoding membrane transport proteins with a variety of physiological functions. Genotypic risk estimated for each dog based on the risk contributed by each GWAS locus showed clear separation of ACL rupture cases and controls. Power analysis of the GWAS data set estimated that ~172 loci explain the genetic contribution to ACL rupture in the Labrador Retriever. Heritability was estimated at 0.48. We conclude ACL rupture is a moderately heritable, highly polygenic complex trait. Our results implicate c-type lectin pathways in ACL homeostasis. PMID:28379989
Prospecting sugarcane resistance to Sugarcane yellow leaf virus by genome-wide association.
Debibakas, S; Rocher, S; Garsmeur, O; Toubi, L; Roques, D; D'Hont, A; Hoarau, J-Y; Daugrois, J H
2014-08-01
Using GWAS approaches, we detected independent markers of resistance to a vector-borne virus disease in sugarcane. Based on comparative genomics, several candidate genes potentially involved in virus/aphid/plant interactions were pinpointed. Yellow leaf of sugarcane is an emerging viral disease whose causal agent is a Polerovirus, Sugarcane yellow leaf virus (SCYLV), which is transmitted by aphids. To identify quantitative trait loci controlling resistance to yellow leaf that are of direct relevance for breeding, we undertook a genome-wide association study (GWAS) on a sugarcane cultivar panel (n = 189) representative of current breeding germplasm. This panel was fingerprinted with 3,949 polymorphic markers (DArT and AFLP). The panel was phenotyped for SCYLV infection in leaves and stalks in two trials for two crop cycles, under the natural disease pressure prevalent in Guadeloupe. Mixed linear models including co-factors representing population structure fixed effects and pairwise kinship random effects provided efficient control of the risk of inflated type-I error at a genome-wide level. Six independent markers were significantly associated with the SCYLV resistance phenotype. Individually, these markers explained between 9 and 14% of the disease variation of the cultivar panel. Their frequency in the panel was relatively low (8-20%). Among them, two markers were detected repeatedly across the GWAS exercises based on the different disease resistance parameters. These two markers could be aligned by BLAST to the Sorghum bicolor genome, and candidate genes potentially involved in plant-aphid or plant-virus interactions were localized in the vicinity of the sorghum homologs of the sugarcane markers. Our results illustrate the potential of GWAS approaches to prospect among sugarcane germplasm for accessions likely bearing resistance alleles of significant effect useful in breeding programs.
Aromatase Inhibitor-Associated Bone Fractures: A Case-Cohort GWAS and Functional Genomics
Liu, Mohan; Goss, Paul E.; Ingle, James N.; Kubo, Michiaki; Furukawa, Yoichi; Batzler, Anthony; Jenkins, Gregory D.; Carlson, Erin E.; Nakamura, Yusuke; Schaid, Daniel J.; Chapman, Judy-Anne W.; Shepherd, Lois E.; Ellis, Matthew J.; Khosla, Sundeep; Wang, Liewei
2014-01-01
Bone fractures are a major consequence of osteoporosis. There is a direct relationship between serum estrogen concentrations and osteoporosis risk. Aromatase inhibitors (AIs) greatly decrease serum estrogen levels in postmenopausal women, and increased incidence of fractures is a side effect of AI therapy. We performed a discovery case-cohort genome-wide association study (GWAS) using samples from 1071 patients, 231 cases and 840 controls, enrolled in the MA.27 breast cancer AI trial to identify genetic factors involved in AI-related fractures, followed by functional genomic validation. Association analyses identified 20 GWAS single nucleotide polymorphism (SNP) signals with P < 5E-06. After removal of signals in gene deserts and those composed entirely of imputed SNPs, we applied a functional validation “decision cascade” that resulted in validation of the CTSZ-SLMO2-ATP5E, TRAM2-TMEM14A, and MAP4K4 genes. These genes all displayed estradiol (E2)-dependent induction in human fetal osteoblasts transfected with estrogen receptor-α, and their knockdown altered the expression of known osteoporosis-related genes. These same genes also displayed SNP-dependent variation in E2 induction that paralleled the SNP-dependent induction of known osteoporosis genes, such as osteoprotegerin. In summary, our case-cohort GWAS identified SNPs in or near CTSZ-SLMO2-ATP5E, TRAM2-TMEM14A, and MAP4K4 that were associated with risk for bone fracture in estrogen receptor-positive breast cancer patients treated with AIs. These genes displayed E2-dependent induction, their knockdown altered the expression of genes related to osteoporosis, and they displayed SNP genotype-dependent variation in E2 induction. These observations may lead to the identification of novel mechanisms associated with fracture risk in postmenopausal women treated with AIs. PMID:25148458
Next Generation Analytic Tools for Large Scale Genetic Epidemiology Studies of Complex Diseases
Mechanic, Leah E.; Chen, Huann-Sheng; Amos, Christopher I.; Chatterjee, Nilanjan; Cox, Nancy J.; Divi, Rao L.; Fan, Ruzong; Harris, Emily L.; Jacobs, Kevin; Kraft, Peter; Leal, Suzanne M.; McAllister, Kimberly; Moore, Jason H.; Paltoo, Dina N.; Province, Michael A.; Ramos, Erin M.; Ritchie, Marylyn D.; Roeder, Kathryn; Schaid, Daniel J.; Stephens, Matthew; Thomas, Duncan C.; Weinberg, Clarice R.; Witte, John S.; Zhang, Shunpu; Zöllner, Sebastian; Feuer, Eric J.; Gillanders, Elizabeth M.
2012-01-01
Over the past several years, genome-wide association studies (GWAS) have succeeded in identifying hundreds of genetic markers associated with common diseases. However, most of these markers confer relatively small increments of risk and explain only a small proportion of familial clustering. To identify obstacles to future progress in genetic epidemiology research and provide recommendations to NIH for overcoming these barriers, the National Cancer Institute sponsored a workshop entitled “Next Generation Analytic Tools for Large-Scale Genetic Epidemiology Studies of Complex Diseases” on September 15–16, 2010. The goal of the workshop was to facilitate discussions on (1) statistical strategies and methods to efficiently identify genetic and environmental factors contributing to the risk of complex disease; and (2) how to develop, apply, and evaluate these strategies for the design, analysis, and interpretation of large-scale complex disease association studies in order to guide NIH in setting the future agenda in this area of research. The workshop was organized as a series of short presentations covering scientific (gene-gene and gene-environment interaction, complex phenotypes, and rare variants and next generation sequencing) and methodological (simulation modeling and computational resources and data management) topic areas. Specific needs to advance the field were identified during each session and are summarized. PMID:22147673
Clarke, T-K; Adams, M J; Davies, G; Howard, D M; Hall, L S; Padmanabhan, S; Murray, A D; Smith, B H; Campbell, A; Hayward, C; Porteous, D J; Deary, I J; McIntosh, A M
2017-01-01
Alcohol consumption has been linked to over 200 diseases and is responsible for over 5% of the global disease burden. Well-known genetic variants in alcohol metabolizing genes, for example, ALDH2 and ADH1B, are strongly associated with alcohol consumption but have limited impact in European populations where they are found at low frequency. We performed a genome-wide association study (GWAS) of self-reported alcohol consumption in 112 117 individuals in the UK Biobank (UKB) sample of white British individuals. We report significant genome-wide associations at 14 loci. These include single-nucleotide polymorphisms (SNPs) in alcohol metabolizing genes (ADH1B/ADH1C/ADH5) and two loci in KLB, a gene recently associated with alcohol consumption. We also identify SNPs at novel loci including GCKR, CADM2 and FAM69C. Gene-based analyses found significant associations with genes implicated in the neurobiology of substance use (DRD2, PDE4B). GCTA analyses found a significant SNP-based heritability of self-reported alcohol consumption of 13% (se=0.01). Sex-specific analyses found largely overlapping GWAS loci and the genetic correlation (rG) between male and female alcohol consumption was 0.90 (s.e.=0.09, P-value=7.16 × 10−23). Using LD score regression, genetic overlap was found between alcohol consumption and years of schooling (rG=0.18, s.e.=0.03), high-density lipoprotein cholesterol (rG=0.28, s.e.=0.05), smoking (rG=0.40, s.e.=0.06) and various anthropometric traits (for example, overweight, rG=−0.19, s.e.=0.05). This study replicates the association between alcohol consumption and alcohol metabolizing genes and KLB, and identifies novel gene associations that should be the focus of future studies investigating the neurobiology of alcohol consumption. PMID:28937693
Genome-wide association study of colorectal cancer in Hispanics
Schmit, Stephanie L.; Schumacher, Fredrick R.; Edlund, Christopher K.; Conti, David V.; Ihenacho, Ugonna; Wan, Peggy; Van Den Berg, David; Casey, Graham; Fortini, Barbara K.; Lenz, Heinz-Josef; Tusié-Luna, Teresa; Aguilar-Salinas, Carlos A.; Moreno-Macías, Hortensia; Huerta-Chagoya, Alicia; Ordóñez-Sánchez, María Luisa; Rodríguez-Guillén, Rosario; Cruz-Bautista, Ivette; Rodríguez-Torres, Maribel; Muñóz-Hernández, Linda Liliana; Arellano-Campos, Olimpia; Gómez, Donají; Alvirde, Ulices; González-Villalpando, Clicerio; González-Villalpando, María Elena; Le Marchand, Loic; Haiman, Christopher A.; Figueiredo, Jane C.
2016-01-01
Genome-wide association studies (GWAS) have identified 58 susceptibility alleles across 37 regions associated with the risk of colorectal cancer (CRC) with P < 5×10−8. Most studies have been conducted in non-Hispanic whites and East Asians; however, the generalizability of these findings and the potential for ethnic-specific risk variation in Hispanic and Latino (HL) individuals have been largely understudied. We describe the first GWAS of common genetic variation contributing to CRC risk in HL (1611 CRC cases and 4330 controls). We also examine known susceptibility alleles and implement imputation-based fine-mapping to identify potential ethnicity-specific association signals in known risk regions. We discovered 17 variants across 4 independent regions that merit further investigation due to suggestive CRC associations (P < 1×10−6) at 1p34.3 (rs7528276; odds ratio (OR) = 1.86 [95% confidence interval (CI): 1.47–2.36]; P = 2.5×10−7), 2q23.3 (rs1367374; OR = 1.37 [95% CI: 1.21–1.55]; P = 4.0×10−7), 14q24.2 (rs143046984; OR = 1.65 [95% CI: 1.36–2.01]; P = 4.1×10−7) and 16q12.2 (rs142319636; OR = 1.69 [95% CI: 1.37–2.08]; P = 7.8×10−7). Among the 57 previously published CRC susceptibility alleles with minor allele frequency ≥1%, 76.5% of SNPs had a consistent direction of effect and 19 (33.3%) were nominally statistically significant (P < 0.05). Further, rs185423955 and rs60892987 were identified as novel secondary susceptibility variants at 3q26.2 (P = 5.3×10−5) and 11q12.2 (P = 6.8×10−5), respectively. Our findings demonstrate the importance of fine mapping in HL. These results are informative for variant prioritization in functional studies and future risk prediction modeling in minority populations. PMID:27207650
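The odds ratios and confidence intervals quoted in this abstract can be cross-checked against the quoted P values. The following is a minimal sketch, assuming a Wald test on the log odds ratio and a 95% CI that is symmetric on the log scale; the abstract does not say which test was used, and the function name `wald_from_or_ci` is illustrative only.

```python
import math

def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover the standard error, z statistic and two-sided P value
    from an odds ratio and its 95% confidence interval.

    Assumption: the CI was built as exp(log(OR) +/- z_crit * SE).
    """
    log_or = math.log(odds_ratio)
    # The CI spans 2 * z_crit standard errors on the log scale.
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = log_or / se
    # Two-sided tail probability of a standard normal.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return se, z, p

# Values quoted for rs1367374 at 2q23.3.
se, z, p = wald_from_or_ci(1.37, 1.21, 1.55)
print(f"SE = {se:.4f}, z = {z:.2f}, P = {p:.1e}")
```

For rs1367374 this yields a P value of the same order of magnitude as the quoted 4.0×10−7; small differences are expected because the published OR and CI are rounded.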
Gong, Xian; Zhang, Chao; Yiliyasi·Aisa, Yiliyasi·Aisa; Shi, Ying; Yang, Xue-wei; NuersimanguliAosiman, NuersimanguliAosiman; Guan, Ya-qun; Xu, Shu-hua
2016-06-20
Over the last decade, a large number of candidate susceptibility genes for type 2 diabetes mellitus (T2DM) have been reported by genome-wide association studies (GWAS). Understanding the genetic diversity of these candidate genes among worldwide populations not only helps to elucidate the genetic mechanism of T2DM, but also provides guidance for further studies of T2DM pathogenesis in specific populations. In this study, we identified 170 genes or genomic regions associated with T2DM by searching GWAS databases and the related literature. We next analyzed the genetic diversity of these genes (or genomic regions) among present-day human populations by curating the 1000 Genomes Project phase 1 dataset covering 14 worldwide populations. We further compared the characteristics of T2DM genes in different populations. In terms of overall pattern, no significant differences in genetic diversity were observed between the T2DM candidate genes and the non-T2DM genes among the 14 worldwide populations. However, some genes, such as IL20RA, RNMTL1-NXN, NOTCH2, ADRA2A-BTBD7P2, TBC1D4, RBM38-HMGB1P1, UBE2E2, and PPARD, show considerable differentiation between populations. In particular, IL20RA (FST=0.1521) displays the greatest population difference, which is mainly contributed by the difference between Africans and non-Africans. Moreover, we revealed genetic differences between East Asians and Europeans in some candidate genes such as DGKB-AGMO (FST=0.173) and JAZF1 (FST=0.182). Our results indicate that some T2DM candidate susceptibility genes harbor highly differentiated variants between populations. These analyses, although preliminary, should advance our understanding of population differences in susceptibility to T2DM and provide a useful reference that future studies can rely on.
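The FST values quoted above measure how strongly allele frequencies differ between populations. The abstract does not state which estimator was used, so the following is only a sketch of the classical Wright definition, FST = (HT − HS) / HT, for one biallelic variant in two equally weighted populations. The function name and the example frequencies are hypothetical.

```python
def wright_fst(p1, p2):
    """Wright's FST for a biallelic variant in two equally weighted
    populations, given the frequency of one allele in each.

    HS: mean expected heterozygosity within populations.
    HT: expected heterozygosity of the pooled population.
    """
    p_bar = (p1 + p2) / 2.0
    h_t = 2.0 * p_bar * (1.0 - p_bar)
    h_s = (2.0 * p1 * (1.0 - p1) + 2.0 * p2 * (1.0 - p2)) / 2.0
    if h_t == 0.0:
        # The variant is fixed for the same allele in both populations.
        return 0.0
    return (h_t - h_s) / h_t

# Hypothetical allele frequencies, not taken from the study.
fst = wright_fst(0.10, 0.50)
print(f"FST = {fst:.3f}")
```

Identical frequencies give FST = 0, and the value rises toward 1 as the two populations approach fixation for different alleles. Sample-size-corrected estimators would give somewhat different numbers on real data.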
Matsuda, Fumio; Nakabayashi, Ryo; Yang, Zhigang; Okazaki, Yozo; Yonemaru, Jun-ichi; Ebana, Kaworu; Yano, Masahiro; Saito, Kazuki
2015-01-01
Plants produce structurally diverse secondary (specialized) metabolites to increase their fitness for survival under adverse environments. Several bioactive compounds for new drugs have been identified through screening of plant extracts. In this study, genome-wide association studies (GWAS) were conducted to investigate the genetic architecture behind the natural variation of rice secondary metabolites. GWAS using the metabolome data of 175 rice accessions successfully identified 323 associations among 143 single nucleotide polymorphisms (SNPs) and 89 metabolites. The data analysis highlighted that levels of many metabolites are tightly associated with a small number of strong quantitative trait loci (QTLs). The tight association may be a mechanism generating strains with distinct metabolic composition through the crossing of two different strains. The results indicate that one plant species produces more diverse phytochemicals than previously expected, and plants still contain many useful compounds for human applications. PMID:25267402
Fan, Yun; Zhou, Gaofeng; Shabala, Sergey; Chen, Zhong-Hua; Cai, Shengguan; Li, Chengdao; Zhou, Meixue
2016-01-01
Salinity stress is one of the most severe abiotic stresses that affect agricultural production. Genome wide association study (GWAS) has been widely used to detect genetic variations in extensive natural accessions with more recombination and higher resolution. In this study, 206 barley accessions collected worldwide were genotyped with 408 Diversity Arrays Technology (DArT) markers and evaluated for salinity stress tolerance using salinity tolerance score – a reliable trait developed in our previous work. GWAS for salinity tolerance was conducted through a general linkage model and a mixed linkage model based on population structure and kinship. A total of 24 significant marker-trait associations were identified. A QTL on 4H with the nearest marker of bPb-9668 was consistently detected by all methods. This QTL has not been reported before and merits further confirmation in bi-parental populations. PMID:27446173
Genetic polymorphism and chronic obstructive pulmonary disease.
Yuan, Cunhua; Chang, De; Lu, Guangming; Deng, Xiaowei
2017-01-01
Chronic obstructive pulmonary disease (COPD) is a common chronic disease, and its morbidity and mortality are increasing. Many studies have tried to explain the pathogenesis of COPD in terms of genetic susceptibility and to identify susceptibility factors for COPD, which could play a role in early prevention, early detection and early treatment. It is well known that COPD is an inflammatory disease characterized by incompletely reversible airflow limitation, in which genes interact with the environment. In recent years, many studies have demonstrated a correlation between gene polymorphisms and COPD. However, there is less research on the relationship between COPD and genome-wide association study (GWAS) findings, epigenetics and apoptosis. In this paper, we comprehensively summarize the relationship between genes and COPD from four aspects: GWAS, gene polymorphism, epigenetics and apoptosis.
Gemenetzi, M; Yang, Y; Lotery, A J
2012-01-01
Glaucoma is a common, complex, heterogenous disease and it constitutes the major cause of irreversible blindness worldwide. Primary open-angle glaucoma (POAG) is the most common type of glaucoma in all populations. Most of the molecular mechanisms leading to POAG development are still unknown. Gene mutations in various populations have been identified by genetic studies and a genetic basis for glaucoma pathogenesis has been established. Linkage analysis and association studies are genetic approaches in the investigation of the genetic basis of POAG. Genome-wide association studies (GWAS) are more powerful compared with linkage analysis in discovering genes of small effect that might contribute to the development of the disease. POAG links to at least 20 genetic loci, but only 2 genes identified in these loci, myocilin and optineurin, are considered as well-established glaucoma-causing genes, whereas the role of other loci, genes, and variants implicated in the development of POAG remains controversial. Gene mutations associated with POAG result in retinal ganglion cell death, which is the common outcome of pathogenetic mechanisms in glaucoma. In future, if the sensitivity and specificity of genotyping increases, it may be possible to screen individuals routinely for disease susceptibility. This review is an update on the latest progress of genetic studies associated with POAG. It emphasizes the correlation of recent achievements in genetics with glaucoma pathophysiology, glaucoma treatment perspectives, and the possibility of future prevention of irreversible visual loss caused by the disease. PMID:22173078
Zhao, Huiying; Nyholt, Dale R; Yang, Yuanhao; Wang, Jihua; Yang, Yuedong
2017-06-14
Genome-wide association studies (GWAS) have successfully identified single variants associated with diseases. To increase the power of GWAS, gene-based and pathway-based tests are commonly employed to detect more risk factors. However, the gene- and pathway-based association tests may be biased towards genes or pathways containing a large number of single-nucleotide polymorphisms (SNPs) with small P-values caused by high linkage disequilibrium (LD) correlations. To address such bias, numerous pathway-based methods have been developed. Here we propose a novel method, DGAT-path, to divide all SNPs assigned to genes in each pathway into LD blocks, and to sum the chi-square statistics of LD blocks for assessing the significance of the pathway by permutation tests. The method proved robust, with a type I error rate more than 1.6 times lower than that of other methods. Meanwhile, the method displays a higher power and is not biased by the pathway size. The applications to the GWAS summary statistics for schizophrenia and breast cancer indicate that the detected top pathways contain more genes close to associated SNPs than those detected by other methods. As a result, the method identified 17 and 12 significant pathways containing 20 and 21 novel associated genes, respectively, for the two diseases. The method is available online at http://sparks-lab.org/server/DGAT-path.
Yuan, Zhongshang; Liu, Hong; Zhang, Xiaoshuai; Li, Fangyu; Zhao, Jinghua; Zhang, Furen; Xue, Fuzhong
2013-01-01
Currently, the genetic variants identified by genome-wide association studies (GWAS) generally account for only a small proportion of the total heritability of complex diseases. One crucial reason is the underutilization of gene-gene joint effects commonly encountered in GWAS, which include their main effects and co-association. However, gene-gene co-association is often vaguely placed within the framework of gene-gene interaction. From the causal graph perspective, we elucidate in detail the concept and rationale of gene-gene co-association as well as its relationship with traditional gene-gene interaction, and propose two simple statistics based on the Fisher r-to-z transformation to detect it. Three series of simulations further highlight that gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, due not only to the traditional interaction under the nearly independent condition but also to the correlation between the two genes. The proposed statistics are more powerful than logistic regression in various situations, are not affected by linkage disequilibrium, and have an acceptable false positive rate as long as a reasonable GWAS data analysis roadmap is strictly followed. Furthermore, an application to gene pathway analysis associated with leprosy confirms in practice that our proposed gene-gene co-association concept and the corresponding statistics are strongly in line with reality. PMID:23923021
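The abstract does not give the form of the two proposed statistics. As a rough illustration of the underlying idea only, the sketch below uses the textbook Fisher r-to-z test for a difference between two independent correlations, here the correlation between two genes in cases versus controls. The function name and all input values are hypothetical, and this is not claimed to be the authors' statistic.

```python
import math

def fisher_r_to_z_diff(r_case, n_case, r_ctrl, n_ctrl):
    """Compare two independent Pearson correlations with the
    Fisher r-to-z transformation.

    atanh(r) is approximately normal with variance 1 / (n - 3),
    so the difference of the transformed values can be tested
    against a standard normal distribution.
    """
    z_case = math.atanh(r_case)
    z_ctrl = math.atanh(r_ctrl)
    se = math.sqrt(1.0 / (n_case - 3) + 1.0 / (n_ctrl - 3))
    z = (z_case - z_ctrl) / se
    # Two-sided P value from the standard normal tail.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p

# Hypothetical gene-gene correlations in 500 cases and 500 controls.
z, p = fisher_r_to_z_diff(0.30, 500, 0.10, 500)
print(f"z = {z:.2f}, P = {p:.1e}")
```

A large absolute z indicates that the two genes are correlated differently in cases and controls, which is the kind of signal a co-association statistic is meant to capture.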
Copy number variants in patients with short stature
van Duyvenvoorde, Hermine A; Lui, Julian C; Kant, Sarina G; Oostdijk, Wilma; Gijsbers, Antoinet CJ; Hoffer, Mariëtte JV; Karperien, Marcel; Walenkamp, Marie JE; Noordam, Cees; Voorhoeve, Paul G; Mericq, Verónica; Pereira, Alberto M; Claahsen-van de Grinten, Hedi L; van Gool, Sandy A; Breuning, Martijn H; Losekoot, Monique; Baron, Jeffrey; Ruivenkamp, Claudia AL; Wit, Jan M
2014-01-01
Height is a highly heritable and classic polygenic trait. Recent genome-wide association studies (GWAS) have revealed that at least 180 genetic variants influence adult height. However, these variants explain only about 10% of the phenotypic variation in height. Genetic analysis of short individuals can lead to the discovery of novel rare gene defects with a large effect on growth. In an effort to identify novel genes associated with short stature, genome-wide analysis for copy number variants (CNVs), using single-nucleotide polymorphism arrays, in 162 patients (149 families) with short stature was performed. Segregation analysis was performed if possible, and genes in CNVs were compared with information from GWAS, gene expression in rodents' growth plates and published information. CNVs were detected in 40 families. In six families, a known cause of short stature was found (SHOX deletion or duplication, IGF1R deletion), in two combined with a de novo potentially pathogenic CNV. Thirty-three families had one or more potentially pathogenic CNVs (n=40). In 24 of these families, segregation analysis could be performed, identifying three de novo CNVs and nine CNVs segregating with short stature. Four were located near loci associated with height in GWAS (ADAMTS17, TULP4, PRKG2/BMP3 and PAPPA). Besides six CNVs known to be causative for short stature, 40 CNVs with possible pathogenicity were identified. Segregation studies and bioinformatics analysis suggested various potential candidate genes. PMID:24065112
Lack of association between arterial stiffness and genetic variants by genome-wide association scan.
Park, Sungha; Lee, Ji-Young; Kim, Byeong-Keuk; Lee, Sang-Hak; Chang, Hyuk-Jae; Choi, DongHoon; Jang, Yangsoo
2015-01-01
Arterial stiffness is an independent predictor of cardiovascular disease risk. However, whether genetic risk variants are associated with arterial stiffness measures, such as pulse-wave velocity (PWV), is largely unknown. Therefore, we performed a genome-wide association study (GWAS) to identify single-nucleotide polymorphisms (SNPs) associated with PWV in a Korean population. Study participants consisted of 402 patients in the Yonsei cardiovascular genome center cohort. Arterial stiffness was measured as brachial-ankle pulse-wave velocity (baPWV). Genotyping was performed in 402 subjects with the Axiom Genome-Wide ASI 1 Array Plate containing more than 600,000 SNP markers. The findings were tested for replication in independent subjects from a community-based cohort of 1206 individuals, using a Taqman assay to include two candidate SNPs. Associations with PWV were evaluated using an additive genetic model that included age, gender, systolic blood pressure and diastolic blood pressure as covariates. GWAS and replication analyses were conducted using the measured genotype method implemented in PLINK and SAS. We observed two candidate SNPs associated with baPWV in GWAS: rs7271920 (p = 7.15 × 10(-9)) and rs10125157 (p = 8.25 × 10(-7)). However, neither of these was significant in the replication cohort. In summary, we did not identify any common genetic variants associated with baPWV in cardiovascular patients.
Natural variation reveals that OsSAP16 controls low-temperature germination in rice.
Wang, Xiang; Zou, Baohong; Shao, Qiaolin; Cui, Yongmei; Lu, Shan; Zhang, Yan; Huang, Quansheng; Huang, Ji; Hua, Jian
2018-01-23
Low temperature affects seed germination in plants, and low-temperature germination (LTG) is an important agronomic trait. Natural variation of LTG has been reported in rice, but the molecular basis for this variation is largely unknown. Here we report the phenotypic analysis of LTG in 187 rice natural accessions and a genome-wide association study (GWAS) of LTG in this collection. A total of 53 quantitative trait loci (QTLs) were found to be associated with LTG, of which 20 were located in previously reported QTLs. We further identified Stress-Associated Protein 16 (OsSAP16), coding for a zinc-finger domain protein, as a causal gene for one of the major LTG QTLs. Loss of OsSAP16 function reduces germination while greater expression of OsSAP16 enhances germination at low temperature. In addition, accessions with extremely high and low LTG values have correspondingly high and low OsSAP16 expression at low temperatures, suggesting that variation in expression of the OsSAP16 gene contributes to LTG variation. As the first case of identification of an LTG gene through GWAS, this study indicates that GWAS of natural accessions is an effective strategy in genetically dissecting LTG processes and gaining molecular understanding of low-temperature response and germination. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Multispecies, Integrative GWAS for Focal Segmental Glomerulosclerosis
2017-09-01
is a frequent cause of end-stage renal disease (ESRD). We investigated the genetic basis of FSGS and recruited a heterogeneous population of...understanding the complex genetic mechanisms of FSGS. Subject terms: FSGS, MCD, GWAS, CNV...disease (MCD). Using a variety of statistical and genetic approaches, including genome wide association analysis and rare copy number variations (CNVs
Demirkan, A; Lahti, J; Direk, N; Viktorin, A; Lunetta, K L; Terracciano, A; Nalls, M A; Tanaka, T; Hek, K; Fornage, M; Wellmann, J; Cornelis, M C; Ollila, H M; Yu, L; Smith, J A; Pilling, L C; Isaacs, A; Palotie, A; Zhuang, W V; Zonderman, A; Faul, J D; Sutin, A; Meirelles, O; Mulas, A; Hofman, A; Uitterlinden, A; Rivadeneira, F; Perola, M; Zhao, W; Salomaa, V; Yaffe, K; Luik, A I; Liu, Y; Ding, J; Lichtenstein, P; Landén, M; Widen, E; Weir, D R; Llewellyn, D J; Murray, A; Kardia, S L R; Eriksson, J G; Koenen, K; Magnusson, P K E; Ferrucci, L; Mosley, T H; Cucca, F; Oostra, B A; Bennett, D A; Paunio, T; Berger, K; Harris, T B; Pedersen, N L; Murabito, J M; Tiemeier, H; van Duijn, C M; Räikkönen, K
2016-06-01
Major depressive disorder (MDD) is moderately heritable; however, genome-wide association studies (GWAS) for MDD, as well as for related continuous outcomes, have not shown consistent results. Attempts to elucidate the genetic basis of MDD may be hindered by heterogeneity in diagnosis. The Center for Epidemiological Studies Depression (CES-D) scale provides a widely used tool for measuring depressive symptoms clustered in four different domains, which can be combined into a total score but can also be analysed as separate symptom domains. We performed a meta-analysis of GWAS of the CES-D symptom clusters. We recruited 12 cohorts with the 20- or 10-item CES-D scale (32 528 persons). One single nucleotide polymorphism (SNP), rs713224, located near the brain-expressed melatonin receptor (MTNR1A) gene, was associated with the somatic complaints domain of depression symptoms, with borderline genome-wide significance (P_discovery = 3.82 × 10^-8). The SNP was analysed in an additional five cohorts comprising the replication sample (6813 persons). However, the association was not consistent among the replication sample (P_discovery+replication = 1.10 × 10^-6), with evidence of heterogeneity. Despite the effort to harmonize the phenotypes across cohorts and participants, our study is still underpowered to detect a consistent association for depression, even by means of symptom classification. Rather, the SNP-based heritability and co-heritability estimates suggest that only a very minor part of the variation could be captured by GWAS, which explains the sparse findings.
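The abstract does not say which meta-analysis model was applied. One common way to combine per-cohort GWAS estimates for a single SNP and to quantify the kind of heterogeneity mentioned above is an inverse-variance fixed-effect meta-analysis with Cochran's Q. The sketch below assumes that approach; the function name and the per-cohort values are hypothetical.

```python
import math

def fixed_effect_meta(betas, ses):
    """Inverse-variance fixed-effect meta-analysis for one SNP.

    Returns the pooled effect, its standard error, Cochran's Q
    and the I^2 heterogeneity statistic (as a fraction).
    """
    weights = [1.0 / (se * se) for se in ses]
    w_sum = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / w_sum
    se = math.sqrt(1.0 / w_sum)
    # Cochran's Q: weighted squared deviations from the pooled effect.
    q = sum(w * (b - beta) ** 2 for w, b in zip(weights, betas))
    df = len(betas) - 1
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0
    return beta, se, q, i2

# Hypothetical per-cohort effect sizes and standard errors.
beta, se, q, i2 = fixed_effect_meta([0.10, 0.20], [0.05, 0.05])
print(f"beta = {beta:.3f}, SE = {se:.4f}, Q = {q:.2f}, I2 = {i2:.2f}")
```

A pooled effect that weakens when replication cohorts are added, together with a high Q or I², matches the pattern the authors describe for rs713224.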
Velders, Fleur P; Kuningas, Maris; Kumari, Meena; Dekker, Marieke J; Uitterlinden, Andre G; Kirschbaum, Clemens; Hek, Karin; Hofman, Albert; Verhulst, Frank C; Kivimaki, Mika; Van Duijn, Cornelia M; Walker, Brian R; Tiemeier, Henning
2011-08-01
Depressive patients often have altered cortisol secretion, but few studies have investigated genetic variants in relation to both cortisol secretion and depression. To identify genes related to both these conditions, we: (1) tested the association of single nucleotide polymorphisms (SNPs) in hypothalamic-pituitary-adrenal-axis (HPA-axis) candidate genes with a summary measure of total cortisol secretion during the day (cortisol(AUC)), (2) performed a genome wide association study (GWAS) of cortisol(AUC), and (3) tested the association of identified cortisol-related SNPs with depressive symptoms. We analyzed data on candidate SNPs for the HPA-axis, genome-wide scans, cortisol secretion (n=1711) and depressive symptoms (the Centre for Epidemiology Studies Depression Scale, CES-D) (n=2928) in elderly persons of the Rotterdam Study. We used data from the Whitehall II study (n=2836) to replicate the GWAS findings. Of the 1456 SNPs in 33 candidate genes, minor alleles of 4 SNPs (rs9470080, rs9394309, rs7748266 and rs1360780) in the FKBP5 gene were associated with a decreased cortisol(AUC) (p<1×10(-4) after correction for multiple testing using permutations). These SNPs were also associated with an increased risk of depressive symptoms (rs9470080: OR 1.19 (95%CI 1.0; 1.4)). The GWAS for cortisol yielded 2 SNPs with p-values of 1×10(-06) (rs8062512, rs2252459), but these associations could not be replicated. These results suggest that variation in the FKBP5 gene is associated with both cortisol(AUC) and the likelihood of depressive symptoms. Copyright © 2011 Elsevier Ltd. All rights reserved.
Low, Siew-Kee; Chung, Suyoun; Takahashi, Atsushi; Zembutsu, Hitoshi; Mushiroda, Taisei; Kubo, Michiaki; Nakamura, Yusuke
2013-08-01
Chemotherapeutic agents are notoriously known to have a narrow therapeutic range that often results in life-threatening toxicity. Hence, it is clinically important to identify the patients who are at high risk for severe toxicity to certain chemotherapy through a pharmacogenomics approach. In this study, we carried out multiple genome-wide association studies (GWAS) of 13 122 cancer patients who received different chemotherapy regimens, including cyclophosphamide- and platinum-based (cisplatin and carboplatin), anthracycline-based (doxorubicin and epirubicin), and antimetabolite-based (5-fluorouracil and gemcitabine) treatment, antimicrotubule agents (paclitaxel and docetaxel), and topoisomerase inhibitors (camptothecin and etoposide), as well as combination therapy with paclitaxel and carboplatin, to identify genetic variants that are associated with the risk of severe neutropenia/leucopenia in the Japanese population. In addition, we used a weighted genetic risk scoring system to evaluate the cumulative effects of the suggestive genetic variants identified from GWAS in order to predict the risk levels of individuals who carry multiple risk alleles. Although we failed to identify genetic variants that surpassed the genome-wide significance level (P < 5.0 × 10(-8) ) through GWAS, probably due to insufficient statistical power and complex clinical features, we were able to shortlist some of the suggestive associated loci. The current study is at the relatively preliminary stage, but does highlight the complexity and problematic issues associated with retrospective pharmacogenomics studies. However, we hope that verification of these genetic variants through local and international collaborations could improve the clinical outcome for cancer patients. © 2013 Japanese Cancer Association.
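A weighted genetic risk score of the kind mentioned above is typically the sum of risk-allele counts, each multiplied by a per-variant weight such as the log odds ratio from the GWAS. The abstract does not specify the weighting, so the sketch below assumes log-odds-ratio weights. The SNP identifiers, odds ratios and function name are hypothetical.

```python
import math

def weighted_grs(dosages, weights):
    """Weighted genetic risk score for one individual.

    dosages: risk-allele counts (0, 1 or 2) keyed by SNP id.
    weights: per-SNP weights (here log odds ratios) keyed by SNP id.
    """
    missing = set(weights) - set(dosages)
    if missing:
        raise ValueError(f"missing genotypes for: {sorted(missing)}")
    return sum(weights[snp] * dosages[snp] for snp in weights)

# Hypothetical weights derived from odds ratios of 1.5 and 1.2.
weights = {"snp_a": math.log(1.5), "snp_b": math.log(1.2)}
patient = {"snp_a": 2, "snp_b": 1}
score = weighted_grs(patient, weights)
print(f"weighted GRS = {score:.3f}")
```

Individuals can then be grouped by score to compare the observed frequency of severe neutropenia or leucopenia across risk strata.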
Laufey Amundadottir Presents NIH Director’s Seminar
Dr. Laufey Amundadottir presented a lecture titled “From germline genetics to function: Making sense of genome-wide association studies (GWAS) for pancreatic cancer risk” for the prestigious NIH Director’s Seminar Series.
Therapeutic approaches for celiac disease
Plugis, Nicholas M.; Khosla, Chaitan
2015-01-01
Celiac disease is a common, lifelong autoimmune disorder for which dietary control is the only accepted form of therapy. A strict gluten-free diet is burdensome to patients and can be limited in efficacy, indicating there is an unmet need for novel therapeutic approaches to supplement or supplant dietary therapy. Many molecular events required for disease pathogenesis have been recently characterized and inspire most current and emerging drug-discovery efforts. Genome-wide association studies (GWAS) confirm the importance of human leukocyte antigen genes in our pathogenic model and identify a number of new risk loci in this complex disease. Here, we review the status of both emerging and potential therapeutic strategies in the context of disease pathophysiology. We conclude with a discussion of how genes identified during GWAS and follow-up studies that enhance susceptibility may offer insight into developing novel therapies. PMID:26060114
Painter, Jodie N; O'Mara, Tracy A; Morris, Andrew P; Cheng, Timothy H T; Gorman, Maggie; Martin, Lynn; Hodson, Shirley; Jones, Angela; Martin, Nicholas G; Gordon, Scott; Henders, Anjali K; Attia, John; McEvoy, Mark; Holliday, Elizabeth G; Scott, Rodney J; Webb, Penelope M; Fasching, Peter A; Beckmann, Matthias W; Ekici, Arif B; Hein, Alexander; Rübner, Matthias; Hall, Per; Czene, Kamila; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Lambrechts, Diether; Amant, Frederic; Annibali, Daniela; Depreeuw, Jeroen; Vanderstichele, Adriaan; Goode, Ellen L; Cunningham, Julie M; Dowdy, Sean C; Winham, Stacey J; Trovik, Jone; Hoivik, Erling; Werner, Henrica M J; Krakstad, Camilla; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Tham, Emma; Mints, Miriam; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Pharoah, Paul D P; Dunning, Alison M; Dennis, Joe; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Zondervan, Krina T; Nyholt, Dale R; MacGregor, Stuart; Montgomery, Grant W; Tomlinson, Ian; Easton, Douglas F; Thompson, Deborah J; Spurdle, Amanda B
2018-05-01
Epidemiological, biological, and molecular data suggest links between endometriosis and endometrial cancer, with recent epidemiological studies providing evidence for an association between a previous diagnosis of endometriosis and risk of endometrial cancer. We used genetic data as an alternative approach to investigate shared biological etiology of these two diseases. Genetic correlation analysis of summary-level statistics from genomewide association studies (GWAS) using LD Score regression revealed moderate but significant genetic correlation (rg = 0.23, P = 9.3 × 10(-3)), and SNP effect concordance analysis provided evidence for significant SNP pleiotropy (P = 6.0 × 10(-3)) and concordance in effect direction (P = 2.0 × 10(-3)) between the two diseases. Cross-disease GWAS meta-analysis highlighted 13 distinct loci associated at P ≤ 10(-5) with both endometriosis and endometrial cancer, with one locus (SNP rs2475335) located within PTPRD associated at a genomewide significant level (P = 4.9 × 10(-8), OR = 1.11, 95% CI = 1.07-1.15). PTPRD acts in the STAT3 pathway, which has been implicated in both endometriosis and endometrial cancer. This study demonstrates the value of cross-disease genetic analysis to support epidemiological observations and to identify biological pathways of relevance to multiple diseases. © 2018 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
How rare bone diseases have informed our knowledge of complex diseases.
Johnson, Mark L
2016-01-01
Rare bone diseases, generally defined as monogenic traits with either autosomal recessive or dominant patterns of inheritance, have provided a rich database of genes and associated pathways over the past 2-3 decades. The molecular genetic dissection of these bone diseases has yielded some major surprises in terms of the causal genes and/or involved pathways. The discovery of genes/pathways involved in diseases such as osteopetrosis, osteosclerosis, osteogenesis imperfecta and many other rare bone diseases has accelerated our understanding of complex traits. Importantly, these discoveries have either provided direct validation for a specific gene embedded in a group of genes within an interval identified through a complex trait genome-wide association study (GWAS) or, based upon the pathway associated with a monogenic trait gene, provided a means to prioritize a large number of genes for functional validation studies. In some instances GWAS have yielded candidate genes that fall within linkage intervals associated with monogenic traits and resulted in the identification of causal mutations in those rare diseases. Driving all of this discovery is a complement of technologies such as genome sequencing, bioinformatics and advanced statistical analysis methods that have accelerated genetic dissection and greatly reduced the cost. Thus, rare bone disorders in partnership with GWAS have brought us to the brink of a new era of personalized genomic medicine in which the prevention and management of complex diseases will be driven by the molecular understanding of each individual's contributing genetic risks for disease.
Martinón-Torres, Federico; Png, Eileen; Khor, Chiea Chuen; Davila, Sonia; Wright, Victoria J; Sim, Kar Seng; Vega, Ana; Fachal, Laura; Inwald, David; Nadel, Simon; Carrol, Enitan D; Martinón-Torres, Nazareth; Alonso, Sonia Marcos; Carracedo, Angel; Morteruel, Elvira; López-Bayón, Julio; Torre, Andrés Concha; Monge, Cristina Calvo; de Aguilar, Pilar Azcón González; Torné, Elisabeth Esteban; Martínez-Padilla, María Del Carmen; Martinón-Sánchez, José María; Levin, Michael; Hibberd, Martin L; Salas, Antonio
2016-11-02
Meningococcal disease (MD) remains an important infectious cause of life-threatening infection in both industrialized and resource-poor countries. Genetic factors influence both occurrence and severity of presentation, but the genes responsible are largely unknown. We performed a genome-wide association study (GWAS) examining 5,440,063 SNPs in 422 Spanish MD patients and 910 controls. We then performed a meta-analysis of the Spanish GWAS with GWAS data from the United Kingdom (combined cohorts: 897 cases and 5,613 controls; 4,898,259 SNPs). The meta-analysis identified strong evidence of association (P-value ≤ 5 × 10(-8)) in 20 variants located at the CFH gene. SNP rs193053835 showed the most significant protective effect (Odds Ratio (OR) = 0.62, 95% confidence interval (C.I.) = 0.52-0.73; P-value = 9.62 × 10(-9)). Five other variants had been previously reported to be associated with susceptibility to MD, including the missense SNP rs1065489 (OR = 0.64, 95% C.I. = 0.55-0.76, P-value = 3.25 × 10(-8)). Theoretical predictions point to a functional effect of rs1065489, which may be directly responsible for protection against MD. Our study confirms the association of CFH with susceptibility to MD and strengthens the importance of this link in understanding pathogenesis of the disease.
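A reported odds ratio and its confidence interval can be cross-checked against the accompanying P-value. The sketch below is illustrative only and is not the authors' method: it assumes a Wald test on the log odds ratio with a symmetric interval on the log scale, and the function name z_and_p_from_or_ci is hypothetical. Because published ORs and interval limits are rounded, the recovered P-value should only be expected to agree with the reported one to roughly an order of magnitude.

```python
from math import log
from statistics import NormalDist


def z_and_p_from_or_ci(odds_ratio, ci_low, ci_high, level=0.95):
    """Approximate Wald z-score and two-sided p-value from an OR and its CI.

    Assumption: the interval is log(OR) +/- z_crit * SE, so the standard
    error can be read back from the width of the interval on the log scale.
    """
    nd = NormalDist()
    z_crit = nd.inv_cdf(0.5 + level / 2)           # about 1.96 for a 95% CI
    se = (log(ci_high) - log(ci_low)) / (2 * z_crit)
    z = log(odds_ratio) / se
    p = 2 * nd.cdf(-abs(z))                        # two-sided p-value
    return z, p


if __name__ == "__main__":
    # Values for rs193053835 as reported in the abstract above.
    z, p = z_and_p_from_or_ci(0.62, 0.52, 0.73)
    print(f"z = {z:.2f}, approximate p = {p:.2e}")
```

A negative z-score corresponds to a protective effect (OR below 1), which matches the direction described for the CFH variants.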
Nutrigenetics and nutrigenomics of atherosclerosis.
Merched, Aksam J; Chan, Lawrence
2013-06-01
The latest genome-wide association studies (GWAS) have re-energized our effort to understand the genetic basis of atherosclerotic cardiovascular disease. Although the knowledge generated by GWAS has confirmed that mediators of inflammation and perturbed lipid metabolism are major players in cardiovascular disease (CVD) development, much of individual disease heritability remains unexplained by the variants identified through GWAS. Moreover, results from interventions that aim at the pharmaceutical modification of lipid parameters fall short of expectation. These elusive treatment goals based on heritability studies highlight a key supportive, and perhaps even primary, role of nutritional therapy to achieve better health outcomes. Nonetheless, effective and specific interventions for CVD prevention using principles of "personalized" nutrition require a better knowledge of gene-diet interactions, an area that remains poorly explored. Dietary fatty acids such as omega-3 polyunsaturated fatty acids (PUFAs) are an excellent example of a widely studied "environment" that interacts with the genetic makeup in relation to CVD. A thorough exploration of the nutrigenomics and nutrigenetics of omega-3 PUFAs is key to understanding the etiology, and developing effective preventive measures. In this review, we will summarize the current state of knowledge of genetic interactions with omega-3 PUFAs in modulating lipid metabolism and inflammation, and defining health outcomes. Nutrigenetics and nutrigenomics are still in their infancy with respect to CVD prediction and therapy. Integration of the progress in the omics, including metabolomics, lipidomics, transcriptomics, and proteomics, coupled with advances in nutrigenomic and nutrigenetic research will move us towards personalized medicine as the ultimate paradigm of responsible clinical practice.
Design considerations for genetic linkage and association studies.
Nsengimana, Jérémie; Bishop, D Timothy
2012-01-01
This chapter describes the main issues that genetic epidemiologists usually consider in the design of linkage and association studies. For linkage, we briefly consider the situation of rare, highly penetrant alleles showing a disease pattern consistent with Mendelian inheritance investigated through parametric methods in large pedigrees or with autozygosity mapping in inbred families, and we then turn our focus to the most common design, affected sibling pairs, of more relevance for common, complex diseases. Theoretical and more practical power and sample size calculations are provided as a function of the strength of the genetic effect being investigated. We also discuss the impact of other determinants of statistical power such as disease heterogeneity, pedigree, and genotyping errors, as well as the effect of the type and density of genetic markers. Linkage studies should be as large as possible to have sufficient power in relation to the expected genetic effect size. Segregation analysis, a formal statistical technique to describe the underlying genetic susceptibility, may assist in the estimation of the relevant parameters to apply, for instance. However, segregation analyses estimate the total genetic component rather than a single-locus effect. Locus heterogeneity should be considered when power is estimated and at the analysis stage, i.e. assuming a smaller locus effect than the total genetic component from segregation studies. Disease heterogeneity should be minimised by considering subtypes if they are well defined or by otherwise collecting known sources of heterogeneity and adjusting for them as covariates; the power will depend upon the relationship between the disease subtype and the underlying genotypes. Ultimately, identifying susceptibility alleles of modest effects (e.g. RR ≤ 1.5) requires a number of families that seems unfeasible in a single study.
Meta-analysis and data pooling between different research groups can provide a sizeable study, but both approaches require an even higher level of vigilance about locus and disease heterogeneity when data come from different populations. All necessary steps should be taken to minimise pedigree and genotyping errors at the study design stage as they are, for the most part, due to human factors. A two-stage design is more cost-effective than one stage when using short tandem repeats (STRs). However, dense single-nucleotide polymorphism (SNP) arrays offer a more robust alternative, and due to their lower cost per unit, the total cost of studies using SNPs may in the future become comparable to that of studies using STRs in one or two stages. For association studies, we consider the popular case-control design for dichotomous phenotypes, and we provide power and sample size calculations for one-stage and multistage designs. For candidate genes, guidelines are given on the prioritisation of genetic variants, and for genome-wide association studies (GWAS), the issue of choosing an appropriate SNP array is discussed. A warning is issued regarding the danger of designing an underpowered replication study following an initial GWAS. The risk of finding spurious association due to population stratification, cryptic relatedness, and differential bias is underlined. GWAS have a high power to detect common variants of high or moderate effect. For weaker effects (e.g. relative risk < 1.2), the power is greatly reduced, particularly for recessive loci. While sample sizes of 10,000 or 20,000 cases are not beyond reach for most common diseases, only meta-analyses and data pooling can allow attaining a study size of this magnitude for many other diseases. It is acknowledged that detecting the effects from rare alleles (i.e. frequency < 5%) is not feasible in GWAS, and it is expected that novel methods and technology, such as next-generation resequencing, will fill this gap.
At the current stage, the choice of which GWAS SNP array to use does not influence the power in populations of European ancestry. A multistage design reduces the study cost but has less power than the standard one-stage design. If one opts for a multistage design, the power can be improved by jointly analysing the data from different stages for the SNPs they share. The estimates of locus contribution to disease risk from genome-wide scans are often biased, and relying on them might result in an underpowered replication study. Population structure has so far caused fewer spurious associations than initially feared, thanks to systematic ethnicity matching and application of standard quality control measures. Differential bias could be a more serious threat and must be minimised by strictly controlling all the aspects of DNA acquisition, storage, and processing.
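The dependence of power on effect size and sample size described in this chapter abstract can be sketched numerically. The example below is a minimal approximation and not the chapter's own calculation: it assumes a one-stage case-control design, an allelic (two-proportion) z-test, a multiplicative model in which the allelic odds ratio stands in for the relative risk, and a genome-wide threshold of alpha = 5e-8. The function names and the example allele frequency of 0.3 are hypothetical choices made for illustration.

```python
from statistics import NormalDist


def case_allele_freq(p_control, odds_ratio):
    """Risk-allele frequency in cases implied by an allelic odds ratio."""
    odds = odds_ratio * p_control / (1 - p_control)
    return odds / (1 + odds)


def allelic_test_power(n_cases, n_controls, p_control, odds_ratio, alpha=5e-8):
    """Approximate power of a two-sided allelic z-test (normal approximation).

    Each individual contributes two alleles, hence the factor 2 in the
    variance terms.
    """
    nd = NormalDist()
    p_case = case_allele_freq(p_control, odds_ratio)
    se = (p_case * (1 - p_case) / (2 * n_cases)
          + p_control * (1 - p_control) / (2 * n_controls)) ** 0.5
    ncp = abs(p_case - p_control) / se             # expected |z| under H1
    z_crit = nd.inv_cdf(1 - alpha / 2)
    return nd.cdf(ncp - z_crit) + nd.cdf(-ncp - z_crit)


if __name__ == "__main__":
    for n in (2000, 10000, 20000):
        for odds_ratio in (1.2, 1.5):
            power = allelic_test_power(n, n, 0.3, odds_ratio)
            print(f"N = {n:>5} per group, OR = {odds_ratio}: power = {power:.3f}")
```

Under these assumptions the loop illustrates the qualitative point made in the abstract: weak effects need sample sizes in the tens of thousands to be detected at genome-wide significance, while moderate effects are detectable in much smaller studies.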
Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma.
Law, Matthew H; Bishop, D Timothy; Lee, Jeffrey E; Brossard, Myriam; Martin, Nicholas G; Moses, Eric K; Song, Fengju; Barrett, Jennifer H; Kumar, Rajiv; Easton, Douglas F; Pharoah, Paul D P; Swerdlow, Anthony J; Kypreou, Katerina P; Taylor, John C; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A; Andresen, Per A; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M; Dębniak, Tadeusz; Duffy, David L; Elder, David E; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M; Goldstein, Alisa M; Gruis, Nelleke A; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A; Chen, Wei V; Landi, Maria Teresa; Lang, Julie; Lathrop, G Mark; Lubiński, Jan; Mackie, Rona M; Mann, Graham J; Molven, Anders; Montgomery, Grant W; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A; Radford-Smith, Graham L; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C; Craig, Jamie E; Schadendorf, Dirk; Simms, Lisa A; Burdon, Kathryn P; Nyholt, Dale R; Pooley, Karen A; Orr, Nick; Stratigos, Alexander J; Cust, Anne E; Ward, Sarah V; Hayward, Nicholas K; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M; Bishop, Julia A Newton; Demenais, Florence; Amos, Christopher I; MacGregor, Stuart; Iles, Mark M
2015-09-01
Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10(-8)), as did 2 previously reported but unreplicated loci and all 13 established loci. Newly associated SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes in the associated regions, including one involved in telomere biology.
Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma
Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.
2015-01-01
Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10(-8)), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428
A Genome-Wide Association Study of Chronic Obstructive Pulmonary Disease in Hispanics
Chen, Wei; Brehm, John M.; Manichaikul, Ani; Cho, Michael H.; Boutaoui, Nadia; Yan, Qi; Burkart, Kristin M.; Enright, Paul L.; Rotter, Jerome I.; Petersen, Hans; Leng, Shuguang; Obeidat, Ma’en; Bossé, Yohan; Brandsma, Corry-Anke; Hao, Ke; Rich, Stephen S.; Powell, Rhea; Avila, Lydiana; Soto-Quiros, Manuel; Silverman, Edwin K.; Tesfaigzi, Yohannes; Barr, R. Graham
2015-01-01
Rationale: Genome-wide association studies (GWAS) of chronic obstructive pulmonary disease (COPD) have identified disease-susceptibility loci, mostly in subjects of European descent. Objectives: We hypothesized that by studying Hispanic populations we would be able to identify unique loci that contribute to COPD pathogenesis in Hispanics but remain undetected in GWAS of non-Hispanic populations. Methods: We conducted a metaanalysis of two GWAS of COPD in independent cohorts of Hispanics in Costa Rica and the United States (Multi-Ethnic Study of Atherosclerosis [MESA]). We performed a replication study of the top single-nucleotide polymorphisms in an independent Hispanic cohort in New Mexico (the Lovelace Smokers Cohort). We also attempted to replicate prior findings from genome-wide studies in non-Hispanic populations in Hispanic cohorts. Measurements and Main Results: We found no genome-wide significant association with COPD in our metaanalysis of Costa Rica and MESA. After combining the top results from this metaanalysis with those from our replication study in the Lovelace Smokers Cohort, we identified two single-nucleotide polymorphisms approaching genome-wide significance for an association with COPD. The first (rs858249, combined P value = 6.1 × 10(-8)) is near the genes KLHL7 and NUPL2 on chromosome 7. The second (rs286499, combined P value = 8.4 × 10(-8)) is located in an intron of DLG2. The two most significant single-nucleotide polymorphisms in FAM13A from a previous genome-wide study in non-Hispanics were associated with COPD in Hispanics. Conclusions: We have identified two novel loci (in or near the genes KLHL7/NUPL2 and DLG2) that may play a role in COPD pathogenesis in Hispanic populations. PMID:25584925
A genome-wide association study of chronic obstructive pulmonary disease in Hispanics.
Chen, Wei; Brehm, John M; Manichaikul, Ani; Cho, Michael H; Boutaoui, Nadia; Yan, Qi; Burkart, Kristin M; Enright, Paul L; Rotter, Jerome I; Petersen, Hans; Leng, Shuguang; Obeidat, Ma'en; Bossé, Yohan; Brandsma, Corry-Anke; Hao, Ke; Rich, Stephen S; Powell, Rhea; Avila, Lydiana; Soto-Quiros, Manuel; Silverman, Edwin K; Tesfaigzi, Yohannes; Barr, R Graham; Celedón, Juan C
2015-03-01
Genome-wide association studies (GWAS) of chronic obstructive pulmonary disease (COPD) have identified disease-susceptibility loci, mostly in subjects of European descent. We hypothesized that by studying Hispanic populations we would be able to identify unique loci that contribute to COPD pathogenesis in Hispanics but remain undetected in GWAS of non-Hispanic populations. We conducted a metaanalysis of two GWAS of COPD in independent cohorts of Hispanics in Costa Rica and the United States (Multi-Ethnic Study of Atherosclerosis [MESA]). We performed a replication study of the top single-nucleotide polymorphisms in an independent Hispanic cohort in New Mexico (the Lovelace Smokers Cohort). We also attempted to replicate prior findings from genome-wide studies in non-Hispanic populations in Hispanic cohorts. We found no genome-wide significant association with COPD in our metaanalysis of Costa Rica and MESA. After combining the top results from this metaanalysis with those from our replication study in the Lovelace Smokers Cohort, we identified two single-nucleotide polymorphisms approaching genome-wide significance for an association with COPD. The first (rs858249, combined P value = 6.1 × 10(-8)) is near the genes KLHL7 and NUPL2 on chromosome 7. The second (rs286499, combined P value = 8.4 × 10(-8)) is located in an intron of DLG2. The two most significant single-nucleotide polymorphisms in FAM13A from a previous genome-wide study in non-Hispanics were associated with COPD in Hispanics. We have identified two novel loci (in or near the genes KLHL7/NUPL2 and DLG2) that may play a role in COPD pathogenesis in Hispanic populations.
Replicability and Robustness of GWAS for Behavioral Traits
Rietveld, Cornelius A.; Conley, Dalton; Eriksson, Nicholas; Esko, Tõnu; Medland, Sarah E.; Vinkhuyzen, Anna A.E.; Yang, Jian; Boardman, Jason D.; Chabris, Christopher F.; Dawes, Christopher T.; Domingue, Benjamin W.; Hinds, David A.; Johannesson, Magnus; Kiefer, Amy K.; Laibson, David; Magnusson, Patrik K. E.; Mountain, Joanna L.; Oskarsson, Sven; Rostapshova, Olga; Teumer, Alexander; Tung, Joyce Y.; Visscher, Peter M.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.
2015-01-01
A recent genome-wide association study (GWAS) of educational attainment identified three single-nucleotide polymorphisms (SNPs) that, despite their small effect sizes (each R(2) ≈ 0.02%), reached genome-wide significance (p < 5 × 10(-8)) in a large discovery sample and replicated in an independent sample (p < 0.05). The study also reported associations between educational attainment and indices of SNPs called “polygenic scores.” We evaluate the robustness of these findings. Study 1 finds that all three SNPs replicate in another large (N = 34,428) independent sample. We also find that the scores remain predictive (R(2) ≈ 2%) with stringent controls for stratification (Study 2) and in new within-family analyses (Study 3). Our results show that large and therefore well-powered GWASs can identify replicable genetic associations with behavioral traits. The small effect sizes of individual SNPs are likely to be a major contributing explanation for the striking contrast between our results and the disappointing replication record of most candidate gene studies. PMID:25287667
Won, Sungho; Mattheisen, Manuel; Castaldi, Peter J.; Cho, Michael H.; Rutten, Erica; Hardin, Megan; Yip, Wai‐Ki; Rennard, Stephen I.; Lomas, David A.; Wouters, Emiel F.M.; Agusti, Alvar; Casaburi, Richard; Lange, Christoph P.; O'Connor, George; Hersh, Craig P.; Silverman, Edwin K.
2017-01-01
Background: There have been a number of candidate gene association studies of cancer cachexia‐related traits, but no genome‐wide association study (GWAS) has been published to date. Cachexia presents in patients with a number of complex traits, including both cancer and COPD. The objective of the current investigation was to search for a shared genetic aetiology for change in body mass index (ΔBMI) among cancer and COPD by using GWAS data in the Framingham Heart Study. Methods: A linear mixed effects model accounting for age, sex, and change in smoking status was used to calculate ΔBMI in participants over 40 years of age with three consecutive BMI time points (n = 4162). Four GWAS of ΔBMI using generalized estimating equations were performed among 1085 participants with a cancer diagnosis, 204 with gastrointestinal (GI) cancer, 112 with lung cancer, and 237 with COPD to test for association with 418 365 single‐nucleotide polymorphisms (SNPs). Results: Two SNPs reached a level of genome‐wide significance (P < 5 × 10(-8)) with ΔBMI: (i) rs41526344 within the CNTN4 gene, among COPD cases (β = 0.13, P = 4.3 × 10(-8)); and (ii) rs4751240 in the gene Dedicator of Cytokinesis 1 (DOCK1) among GI cancer cases (β = 0.10, P = 1.9 × 10(-8)). The DOCK1 SNP association replicated in the ΔBMI GWAS among COPD cases (meta‐analysis β = 0.10, meta‐analysis P = 9.3 × 10(-10)). The DOCK1 gene codes for the dedicator of cytokinesis 1 protein, which has a role in myoblast fusion. Conclusions: In sum, one statistically significant common variant in the DOCK1 gene was associated with ΔBMI in GI cancer and COPD cases, providing support for at least partially shared aetiology of ΔBMI in complex diseases. PMID:28044437
Analysis of the Influence of microRNAs in Lithium Response in Bipolar Disorder.
Reinbold, Céline S; Forstner, Andreas J; Hecker, Julian; Fullerton, Janice M; Hoffmann, Per; Hou, Liping; Heilbronner, Urs; Degenhardt, Franziska; Adli, Mazda; Akiyama, Kazufumi; Akula, Nirmala; Ardau, Raffaella; Arias, Bárbara; Backlund, Lena; Benabarre, Antonio; Bengesser, Susanne; Bhattacharjee, Abesh K; Biernacka, Joanna M; Birner, Armin; Marie-Claire, Cynthia; Cervantes, Pablo; Chen, Guo-Bo; Chen, Hsi-Chung; Chillotti, Caterina; Clark, Scott R; Colom, Francesc; Cousins, David A; Cruceanu, Cristiana; Czerski, Piotr M; Dayer, Alexandre; Étain, Bruno; Falkai, Peter; Frisén, Louise; Gard, Sébastien; Garnham, Julie S; Goes, Fernando S; Grof, Paul; Gruber, Oliver; Hashimoto, Ryota; Hauser, Joanna; Herms, Stefan; Jamain, Stéphane; Jiménez, Esther; Kahn, Jean-Pierre; Kassem, Layla; Kittel-Schneider, Sarah; Kliwicki, Sebastian; König, Barbara; Kusumi, Ichiro; Lackner, Nina; Laje, Gonzalo; Landén, Mikael; Lavebratt, Catharina; Leboyer, Marion; Leckband, Susan G; López Jaramillo, Carlos A; MacQueen, Glenda; Manchia, Mirko; Martinsson, Lina; Mattheisen, Manuel; McCarthy, Michael J; McElroy, Susan L; Mitjans, Marina; Mondimore, Francis M; Monteleone, Palmiero; Nievergelt, Caroline M; Ösby, Urban; Ozaki, Norio; Perlis, Roy H; Pfennig, Andrea; Reich-Erkelenz, Daniela; Rouleau, Guy A; Schofield, Peter R; Schubert, K Oliver; Schweizer, Barbara W; Seemüller, Florian; Severino, Giovanni; Shekhtman, Tatyana; Shilling, Paul D; Shimoda, Kazutaka; Simhandl, Christian; Slaney, Claire M; Smoller, Jordan W; Squassina, Alessio; Stamm, Thomas J; Stopkova, Pavla; Tighe, Sarah K; Tortorella, Alfonso; Turecki, Gustavo; Volkert, Julia; Witt, Stephanie H; Wright, Adam J; Young, L Trevor; Zandi, Peter P; Potash, James B; DePaulo, J Raymond; Bauer, Michael; Reininghaus, Eva; Novák, Tomáš; Aubry, Jean-Michel; Maj, Mario; Baune, Bernhard T; Mitchell, Philip B; Vieta, Eduard; Frye, Mark A; Rybakowski, Janusz K; Kuo, Po-Hsiu; Kato, Tadafumi; Grigoroiu-Serbanescu, Maria; Reif, Andreas; Del Zompo, 
Maria; Bellivier, Frank; Schalling, Martin; Wray, Naomi R; Kelsoe, John R; Alda, Martin; McMahon, Francis J; Schulze, Thomas G; Rietschel, Marcella; Nöthen, Markus M; Cichon, Sven
2018-01-01
Bipolar disorder (BD) is a common, highly heritable neuropsychiatric disease characterized by recurrent episodes of mania and depression. Lithium is the best-established long-term treatment for BD, even though individual response is highly variable. Evidence suggests that some of this variability has a genetic basis. This is supported by the largest genome-wide association study (GWAS) of lithium response to date conducted by the International Consortium on Lithium Genetics (ConLiGen). Recently, we performed the first genome-wide analysis of the involvement of miRNAs in BD and identified nine BD-associated miRNAs. However, it is unknown whether these miRNAs are also associated with lithium response in BD. In the present study, we therefore tested whether common variants at these nine candidate miRNAs contribute to the variance in lithium response in BD. Furthermore, we systematically analyzed whether any other miRNA in the genome is implicated in the response to lithium. For this purpose, we performed gene-based tests for all known miRNA coding genes in the ConLiGen GWAS dataset (n = 2,563 patients) using a set-based testing approach adapted from the versatile gene-based test for GWAS (VEGAS2). In the candidate approach, miR-499a showed a nominally significant association with lithium response, providing some evidence for involvement in both development and treatment of BD. In the genome-wide miRNA analysis, 71 miRNAs showed nominally significant associations with the dichotomous phenotype and 106 with the continuous trait for treatment response. A total of 15 miRNAs revealed nominal significance in both phenotypes with miR-633 showing the strongest association with the continuous trait (p = 9.80E-04) and miR-607 with the dichotomous phenotype (p = 5.79E-04). No association between miRNAs and treatment response to lithium in BD in either of the tested conditions withstood multiple testing correction.
Given the limited power of our study, the investigation of miRNAs in larger GWAS samples of BD and lithium response is warranted.
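The statement that no association withstood multiple testing correction can be made concrete with a small calculation. The abstract does not say which correction was applied or how many miRNA genes were tested, so the sketch below assumes a Bonferroni-style correction at a family-wise alpha of 0.05 and asks the reverse question: how many independent tests could a given nominal p-value tolerate before it stops being significant? Both function names are hypothetical.

```python
import math


def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


def max_tests_surviving(p_value, alpha=0.05):
    """Largest number of tests for which p_value still passes Bonferroni."""
    return math.floor(alpha / p_value)


if __name__ == "__main__":
    # Strongest nominal p-values quoted in the abstract above.
    for label, p in (("miR-607, dichotomous", 5.79e-4),
                     ("miR-633, continuous", 9.80e-4)):
        n_max = max_tests_surviving(p)
        print(f"{label}: p = {p:.2e} survives Bonferroni only if "
              f"at most {n_max} tests are performed")
```

Under this assumption, any genome-wide scan covering more than a few dozen miRNA genes would leave even the strongest nominal signals below the corrected threshold, which is consistent with the authors' call for larger samples.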
McDonald, Merry-Lynn Noelle; Won, Sungho; Mattheisen, Manuel; Castaldi, Peter J; Cho, Michael H; Rutten, Erica; Hardin, Megan; Yip, Wai-Ki; Rennard, Stephen I; Lomas, David A; Wouters, Emiel F M; Agusti, Alvar; Casaburi, Richard; Lange, Christoph P; O'Connor, George; Hersh, Craig P; Silverman, Edwin K
2017-06-01
There have been a number of candidate gene association studies of cancer cachexia-related traits, but no genome-wide association study (GWAS) has been published to date. Cachexia presents in patients with a number of complex traits, including both cancer and COPD. The objective of the current investigation was to search for a shared genetic aetiology for change in body mass index (ΔBMI) among cancer and COPD by using GWAS data in the Framingham Heart Study. A linear mixed effects model accounting for age, sex, and change in smoking status was used to calculate ΔBMI in participants over 40 years of age with three consecutive BMI time points (n = 4162). Four GWAS of ΔBMI using generalized estimating equations were performed among 1085 participants with a cancer diagnosis, 204 with gastrointestinal (GI) cancer, 112 with lung cancer, and 237 with COPD to test for association with 418 365 single-nucleotide polymorphisms (SNPs). Two SNPs reached a level of genome-wide significance (P < 5 × 10(-8)) with ΔBMI: (i) rs41526344 within the CNTN4 gene, among COPD cases (β = 0.13, P = 4.3 × 10(-8)); and (ii) rs4751240 in the gene Dedicator of Cytokinesis 1 (DOCK1) among GI cancer cases (β = 0.10, P = 1.9 × 10(-8)). The DOCK1 SNP association replicated in the ΔBMI GWAS among COPD cases (meta-analysis β = 0.10, meta-analysis P = 9.3 × 10(-10)). The DOCK1 gene codes for the dedicator of cytokinesis 1 protein, which has a role in myoblast fusion. In sum, one statistically significant common variant in the DOCK1 gene was associated with ΔBMI in GI cancer and COPD cases, providing support for at least partially shared aetiology of ΔBMI in complex diseases. © 2017 The Authors. Journal of Cachexia, Sarcopenia and Muscle published by John Wiley & Sons Ltd on behalf of the Society on Sarcopenia, Cachexia and Wasting Disorders.
An alternative covariance estimator to investigate genetic heterogeneity in populations.
Heslot, Nicolas; Jannink, Jean-Luc
2015-11-26
For genomic prediction and genome-wide association studies (GWAS) using mixed models, covariance between individuals is estimated using molecular markers. Based on the properties of mixed models, using available molecular data for prediction is optimal if this covariance is known. Under this assumption, adding individuals to the analysis should never be detrimental. However, some empirical studies showed that increasing training population size decreased prediction accuracy. Recently, results from theoretical models indicated that even if marker density is high and the genetic architecture of traits is controlled by many loci with small additive effects, the covariance between individuals, which depends on relationships at causal loci, is not always well estimated by the whole-genome kinship. We propose an alternative covariance estimator named K-kernel, to account for potential genetic heterogeneity between populations that is characterized by a lack of genetic correlation, and to limit the information flow between a priori unknown populations in a trait-specific manner. This is similar to a multi-trait model and parameters are estimated by REML and, in extreme cases, it can allow for an independent genetic architecture between populations. As such, K-kernel is useful to study the problem of the design of training populations. K-kernel was compared to other covariance estimators or kernels to examine its fit to the data, cross-validated accuracy and suitability for GWAS on several datasets. It provides a significantly better fit to the data than the genomic best linear unbiased prediction model and, in some cases it performs better than other kernels such as the Gaussian kernel, as shown by an empirical null distribution. In GWAS simulations, alternative kernels control type I errors as well as or better than the classical whole-genome kinship and increase statistical power. No or small gains were observed in cross-validated prediction accuracy. 
This alternative covariance estimator can be used to gain insight into trait-specific genetic heterogeneity by identifying relevant sub-populations that lack genetic correlation between them. Genetic correlation can be 0 between identified sub-populations by performing automatic selection of relevant sets of individuals to be included in the training population. It may also increase statistical power in GWAS.
An, Ping; Straka, Robert J; Pollin, Toni I; Feitosa, Mary F; Wojczynski, Mary K; Daw, E Warwick; O'Connell, Jeffrey R; Gibson, Quince; Ryan, Kathleen A; Hopkins, Paul N; Tsai, Michael Y; Lai, Chao-Qiang; Province, Michael A; Ordovas, Jose M; Shuldiner, Alan R; Arnett, Donna K; Borecki, Ingrid B
2014-07-01
Non-high-density lipoprotein cholesterol (NHDL) is an independent and superior predictor of CVD risk as compared to low-density lipoprotein alone. It represents a spectrum of atherogenic lipid fractions with possibly a distinct genomic signature. We performed genome-wide association studies (GWAS) to identify loci influencing baseline NHDL and its postprandial lipemic (PPL) response. We carried out GWAS in 4,241 participants of European descent. Our discovery cohort included 928 subjects from the Genetics of Lipid-Lowering Drugs and Diet Network Study. Our replication cohorts included 3,313 subjects from the Heredity and Phenotype Intervention Heart Study and Family Heart Study. A linear mixed model using the kinship matrix was used for association tests. The best association signal was found in a tri-genic region at RHOQ-PIGF-CRIPT for baseline NHDL (lead SNP rs6544903, discovery p = 7e-7, MAF = 2%; validation p = 6e-4 at 0.1 kb upstream neighboring SNP rs3768725, and 5e-4 at 0.7 kb downstream neighboring SNP rs6733143, MAF = 10%). The lead and neighboring SNPs were not perfect surrogate proxies for each other (D' = 1, r^2 = 0.003) but they seemed to be partially dependent (likelihood ratio test p = 0.04). Other suggestive loci (discovery p < 1e-6) included LOC100419812 and LOC100288337 for baseline NHDL, and LOC100420502 and CDH13 for NHDL PPL response that were not replicated (p > 0.01). The current and first GWAS of NHDL yielded an interesting common variant in RHOQ-PIGF-CRIPT influencing baseline NHDL levels. Another common variant in CDH13 for NHDL response to dietary high-fat intake challenge was also suggested. Further validations for both loci from large independent studies, especially interventional studies, are warranted.
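The abstract above reports D' = 1 together with a very small r-squared between the lead and neighboring SNPs. The following Python sketch shows how such a combination can arise from allele frequencies alone. It is an illustration under stated assumptions, not the authors' calculation: it plugs in the two reported minor allele frequencies (2% and 10%) and assumes the two minor alleles never occur on the same haplotype, which the abstract does not state.

```python
def ld_stats(p_a, p_b, hap_ab):
    """Return (|D'|, r^2) for two biallelic SNPs.

    p_a, p_b: minor allele frequencies at the two SNPs.
    hap_ab:   frequency of the haplotype carrying both minor alleles
              (an assumed input; the abstract does not report it).
    """
    d = hap_ab - p_a * p_b  # raw disequilibrium coefficient D
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d_prime, r2


# Assumption: the two minor alleles are in complete repulsion (hap_ab = 0).
d_prime, r2 = ld_stats(0.02, 0.10, 0.0)
print(f"|D'| = {d_prime:.2f}, r^2 = {r2:.4f}")
```

Under these assumptions |D'| stays at 1 while r-squared is only a few thousandths, because r-squared is bounded by how different the two allele frequencies are.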
Lu, Yingchang; Justice, Anne E.; Mudgal, Poorva; Liu, Ching-Ti; Young, Kristin; Feitosa, Mary F.; Rand, Kristin; Dimitrov, Latchezar; Duan, Qing; Guo, Xiuqing; Lange, Leslie A.; Nalls, Michael A.; Okut, Hayrettin; Tayo, Bamidele O.; Vedantam, Sailaja; Bradfield, Jonathan P.; Chen, Guanjie; Chesi, Alessandra; Irvin, Marguerite R.; Padhukasahasram, Badri; Zheng, Wei; Allison, Matthew A.; Ambrosone, Christine B.; Bandera, Elisa V.; Berndt, Sonja I.; Blot, William J.; Bottinger, Erwin P.; Carpten, John; Chanock, Stephen J.; Chen, Yii-Der Ida; Conti, David V.; Cooper, Richard S.; Fornage, Myriam; Freedman, Barry I.; Garcia, Melissa; Goodman, Phyllis J.; Hsu, Yu-Han H.; Hu, Jennifer; Huff, Chad D.; Ingles, Sue A.; John, Esther M.; Kittles, Rick; Klein, Eric; Li, Jin; McKnight, Barbara; Nayak, Uma; Nemesure, Barbara; Olshan, Andrew; Salako, Babatunde; Sanderson, Maureen; Shao, Yaming; Siscovick, David S.; Stanford, Janet L.; Strom, Sara S.; Witte, John S.; Yao, Jie; Zhu, Xiaofeng; Ziegler, Regina G.; Zonderman, Alan B.; Ambs, Stefan; Cushman, Mary; Faul, Jessica D.; Hakonarson, Hakon; Levin, Albert M.; Nathanson, Katherine L.; Weir, David R.; Zhi, Degui; Arnett, Donna K.; Kardia, Sharon L. R.; Oloapde, Olufunmilayo I.; Rao, D. C.; Williams, L. Keoki; Becker, Diane M.; Borecki, Ingrid B.; Evans, Michele K.; Harris, Tamara B.; Hirschhorn, Joel N.; Psaty, Bruce M.; Wilson, James G.; Bowden, Donald W.; Cupples, L. Adrienne; Haiman, Christopher A.; Loos, Ruth J. F.; North, Kari E.
2017-01-01
Genome-wide association studies (GWAS) have identified >300 loci associated with measures of adiposity including body mass index (BMI) and waist-to-hip ratio (adjusted for BMI, WHRadjBMI), but few have been identified through screening of the African ancestry genomes. We performed large scale meta-analyses and replications in up to 52,895 individuals for BMI and up to 23,095 individuals for WHRadjBMI from the African Ancestry Anthropometry Genetics Consortium (AAAGC) using 1000 Genomes phase 1 imputed GWAS to improve coverage of both common and low frequency variants in the low linkage disequilibrium African ancestry genomes. In the sex-combined analyses, we identified one novel locus (TCF7L2/HABP2) for WHRadjBMI and eight previously established loci at P < 5×10^-8: seven for BMI, and one for WHRadjBMI in African ancestry individuals. An additional novel locus (SPRYD7/DLEU2) was identified for WHRadjBMI when combined with European GWAS. In the sex-stratified analyses, we identified three novel loci for BMI (INTS10/LPL and MLC1 in men, IRX4/IRX2 in women) and four for WHRadjBMI (SSX2IP, CASC8, PDE3B and ZDHHC1/HSD11B2 in women) in individuals of African ancestry or both African and European ancestry. For four of the novel variants, the minor allele frequency was low (<5%). In the trans-ethnic fine mapping of 47 BMI loci and 27 WHRadjBMI loci that were locus-wide significant (P < 0.05 adjusted for effective number of variants per locus) from the African ancestry sex-combined and sex-stratified analyses, 26 BMI loci and 17 WHRadjBMI loci contained ≤ 20 variants in the credible sets that jointly account for 99% posterior probability of driving the associations. The lead variants in 13 of these loci had a high probability of being causal.
As compared to our previous HapMap imputed GWAS for BMI and WHRadjBMI including up to 71,412 and 27,350 African ancestry individuals, respectively, our results suggest that 1000 Genomes imputation showed modest improvement in identifying GWAS loci including low frequency variants. Trans-ethnic meta-analyses further improved fine mapping of putative causal variants in loci shared between the African and European ancestry populations. PMID:28430825
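The fine-mapping result above is phrased in terms of credible sets that jointly account for 99% of the posterior probability. A minimal Python sketch of how such a set can be assembled from per-variant posterior probabilities follows. The variant names and probabilities are hypothetical, and the abstract does not describe how the posteriors themselves were computed, so that step is taken as given here.

```python
def credible_set(posteriors, coverage=0.99):
    """Smallest set of variants whose posterior probabilities sum to >= coverage.

    posteriors: dict mapping variant id -> posterior probability of driving
                the association (normalised here in case they do not sum to 1).
    """
    total = sum(posteriors.values())
    ranked = sorted(posteriors.items(), key=lambda kv: kv[1], reverse=True)
    chosen, cumulative = [], 0.0
    for variant, prob in ranked:
        chosen.append(variant)
        cumulative += prob / total
        if cumulative >= coverage:
            break
    return chosen


# Hypothetical locus with four variants (illustrative values only).
locus = {"v1": 0.90, "v2": 0.07, "v3": 0.025, "v4": 0.005}
cs99 = credible_set(locus)
print(cs99)
```

A smaller credible set means the association signal is concentrated on fewer candidate variants, which is the sense in which loci with 20 or fewer variants are highlighted in the abstract.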
Khabirova, Eleonora; Moloney, Aileen; Marciniak, Stefan J; Williams, Julie; Lomas, David A; Oliver, Stephen G; Favrin, Giorgio; Sattelle, David B; Crowther, Damian C
2014-01-01
The human Aβ peptide causes progressive paralysis when expressed in the muscles of the nematode worm, C. elegans. We have exploited this model of Aβ toxicity by carrying out an RNAi screen to identify genes whose reduced expression modifies the severity of this locomotor phenotype. Our initial finding was that none of the human orthologues of these worm genes is identical with the genome-wide significant GWAS genes reported to date (the "white zone"); moreover there was no identity between worm screen hits and the longer list of GWAS genes which included those with borderline levels of significance (the "grey zone"). This indicates that Aβ toxicity should not be considered as equivalent to sporadic AD. To increase the sensitivity of our analysis, we then considered the physical interactors (+1 interactome) of the products of the genes in both the worm and the white+grey zone lists. When we consider these worm and GWAS gene lists we find that 4 of the 60 worm genes have a +1 interactome overlap that is larger than expected by chance. Two of these genes form a chaperonin complex, the third is closely associated with this complex and the fourth gene codes for actin, the major substrate of the same chaperonin.
Association of prediabetes-associated single nucleotide polymorphisms with microalbuminuria
Choi, Jong Wook; Moon, Shinje; Jang, Eun Jung; Lee, Chang Hwa; Park, Joon-Sung
2017-01-01
Increased glycemic exposure, even below the diagnostic criteria for diabetes mellitus, is crucial in the pathogenesis of diabetic microvascular complications represented by microalbuminuria. Nonetheless, there is limited evidence regarding which single nucleotide polymorphisms (SNPs) are associated with prediabetes and whether genetic predisposition to prediabetes is related to microalbuminuria, especially in the general population. Our objective was to answer these questions. We conducted a genomewide association study (GWAS) separately on two population-based cohorts, Ansung and Ansan, in the Korean Genome and Epidemiology Study (KoGES). The initial GWAS was carried out on the Ansung cohort, followed by a replication study on the Ansan cohort. A total of 5682 native Korean participants without a significant medical illness were classified into either control group (n = 3153) or prediabetic group (n = 2529). In the GWAS, we identified two susceptibility loci associated with prediabetes, one at 17p15.3-p15.1 in the GCK gene and another at 7p15.1 in YKT6. When variations in GCK and YKT6 were used as a model of prediabetes, this genetically determined prediabetes increased microalbuminuria. Multiple logistic regression analyses revealed that fasting glucose concentration in plasma and SNP rs2908289 in GCK were associated with microalbuminuria, and adjustment for age, gender, smoking history, systolic blood pressure, waist circumference, and serum triglyceride levels did not attenuate this association. Our results suggest that prediabetes and the associated SNPs may predispose to microalbuminuria before the diagnosis of diabetes mellitus. Further studies are needed to explore the details of the physiological and molecular mechanisms underlying this genetic association. PMID:28158221
2011-01-01
Background: Genome-wide association studies (GWAS) have identified new candidate genes for the occurrence of acute coronary syndrome (ACS), but possible effects of such genes on survival following ACS have yet to be investigated. Methods: We examined 95 polymorphisms in 69 distinct gene regions identified in a GWAS for premature myocardial infarction for their association with post-ACS mortality among 811 whites recruited from university-affiliated hospitals in Kansas City, Missouri. We then sought replication of a positive genetic association in a large, racially diverse cohort of myocardial infarction patients (N = 2284) using Kaplan-Meier survival analyses and Cox regression to adjust for relevant covariates. Finally, we investigated the apparent association further in 6086 additional coronary artery disease patients. Results: After Cox adjustment for other ACS risk factors, of 95 SNPs tested in 811 whites only the association with the rs6922269 in MTHFD1L was statistically significant, with a 2.6-fold mortality hazard (P = 0.007). The recessive A/A genotype was of borderline significance in an age- and race-adjusted analysis of the entire combined cohort (N = 3095; P = 0.052), but this finding was not confirmed in independent cohorts (N = 6086). Conclusions: We found no support for the hypothesis that the GWAS-identified variants in this study substantially alter the probability of post-ACS survival. Large-scale, collaborative, genome-wide studies may be required in order to detect genetic variants that are robustly associated with survival in patients with coronary artery disease. PMID:21957892
Zhang, J; Feng, J-Y; Ni, Y-L; Wen, Y-J; Niu, Y; Tamba, C L; Yue, C; Song, Q; Zhang, Y-M
2017-06-01
Multilocus genome-wide association studies (GWAS) have become the state-of-the-art procedure to identify quantitative trait nucleotides (QTNs) associated with complex traits. However, implementation of multilocus models in GWAS is still difficult. In this study, we integrated least angle regression with empirical Bayes to perform multilocus GWAS under polygenic background control. We used an algorithm of model transformation that whitened the covariance matrix of the polygenic matrix K and environmental noise. Markers on one chromosome were included simultaneously in a multilocus model and least angle regression was used to select the most potentially associated single-nucleotide polymorphisms (SNPs), whereas the markers on the other chromosomes were used to calculate the kinship matrix as polygenic background control. The SNPs selected in the multilocus model were then tested for association with the trait by empirical Bayes and a likelihood ratio test. We herein refer to this method as pLARmEB (polygenic-background-control-based least angle regression plus empirical Bayes). Results from simulation studies showed that pLARmEB was more powerful in QTN detection and more accurate in QTN effect estimation, had a lower false positive rate and required less computing time than the Bayesian hierarchical generalized linear model, efficient mixed model association (EMMA) and least angle regression plus empirical Bayes. pLARmEB, the multilocus random-SNP-effect mixed linear model and the fast multilocus random-SNP-effect EMMA methods had almost equal power of QTN detection in simulation experiments. However, only pLARmEB identified 48 previously reported genes for 7 flowering time-related traits in Arabidopsis thaliana.
CGEMS identifies common inherited genetic variations associated with a number of cancers, including breast and prostate. Data from these genome-wide association studies (GWAS) are available through the Division of Cancer Epidemiology & Genetics website.
A review on neuroimaging studies of genetic and environmental influences on early brain development.
Gao, Wei; Grewen, Karen; Knickmeyer, Rebecca C; Qiu, Anqi; Salzwedel, Andrew; Lin, Weili; Gilmore, John H
2018-04-16
The past decades witnessed a surge of interest in neuroimaging study of normal and abnormal early brain development. Structural and functional studies of normal early brain development revealed massive structural maturation as well as sequential, coordinated, and hierarchical emergence of functional networks during the infancy period, providing a great foundation for the investigation of abnormal early brain development mechanisms. Indeed, studies of altered brain development associated with either genetic or environmental risks emerged and thrived. In this paper, we will review selected studies of genetic and environmental risks that have been relatively more extensively investigated: familial risks, candidate risk genes, and genome-wide association studies (GWAS) on the genetic side; maternal mood disorders and prenatal drug exposures on the environmental side. Emerging studies on environment-gene interactions will also be reviewed. Our goal was not to perform an exhaustive review of all studies in the field but to leverage some representative ones to summarize the current state, point out potential limitations, and elicit discussions on important future directions. Copyright © 2018 Elsevier Inc. All rights reserved.
Capomaccio, Stefano; Milanesi, Marco; Bomba, Lorenzo; Cappelli, Katia; Nicolazzi, Ezequiel L; Williams, John L; Ajmone-Marsan, Paolo; Stefanon, Bruno
2015-08-01
Genome-wide association studies (GWAS) have been widely applied to disentangle the genetic basis of complex traits. In cattle breeds, classical GWAS approaches with medium-density marker panels are far from conclusive, especially for complex traits. This is due to the intrinsic limitations of GWAS and the assumptions that are made to step from the association signals to the functional variations. Here, we applied a gene-based strategy to prioritize genotype-phenotype associations found for milk production and quality traits with classical approaches in three Italian dairy cattle breeds with different sample sizes (Italian Brown n = 745; Italian Holstein n = 2058; Italian Simmental n = 477). Although classical regression on single markers revealed only a single genome-wide significant genotype-phenotype association, for Italian Holstein, the gene-based approach identified specific genes in each breed that are associated with milk physiology and mammary gland development. As no standard method has yet been established to step from variation to functional units (i.e., genes), the strategy proposed here may contribute to revealing new genes that play significant roles in complex traits, such as those investigated here, amplifying low association signals using a gene-centric approach. © 2015 Stichting International Foundation for Animal Genetics.
The influence of polygenic risk for bipolar disorder on neural activation assessed using fMRI
Whalley, H C; Papmeyer, M; Sprooten, E; Romaniuk, L; Blackwood, D H; Glahn, D C; Hall, J; Lawrie, S M; Sussmann, Je; McIntosh, A M
2012-01-01
Genome-wide association studies (GWAS) have demonstrated a significant polygenic contribution to bipolar disorder (BD) where disease risk is determined by the summation of many alleles of small individual magnitude. Modelling polygenic risk scores may be a powerful way of identifying disrupted brain regions whose genetic architecture is related to that of BD. We determined the extent to which common genetic variation underlying risk to BD affected neural activation during an executive processing/language task in individuals at familial risk of BD and healthy controls. Polygenic risk scores were calculated for each individual based on GWAS data from the Psychiatric GWAS Consortium Bipolar Disorder Working Group (PGC-BD) of over 16 000 subjects. The familial group had a significantly higher polygene score than the control group (P=0.04). There were no significant group by polygene interaction effects in terms of association with brain activation. However, we did find that an increasing polygenic risk allele load for BD was associated with increased activation in limbic regions previously implicated in BD, including the anterior cingulate cortex and amygdala, across both groups. The findings suggest that this novel polygenic approach to examine brain-imaging data may be a useful means of identifying genetically mediated traits mechanistically linked to the aetiology of BD. PMID:22760554
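The abstract above describes polygenic risk as the summation of many alleles of small individual magnitude. A minimal Python sketch of that idea is a weighted sum of risk-allele counts. The SNP identifiers and weights below are hypothetical, and the weighting scheme (per-SNP effect sizes such as log odds ratios from a discovery GWAS) is an assumption for illustration rather than the exact scoring procedure used in the study.

```python
from typing import Mapping


def polygenic_score(dosages: Mapping[str, float],
                    weights: Mapping[str, float]) -> float:
    """Weighted sum of risk-allele dosages (0, 1 or 2 copies per SNP).

    Only SNPs present in both mappings contribute to the score.
    """
    shared = dosages.keys() & weights.keys()
    return sum(weights[snp] * dosages[snp] for snp in shared)


# Hypothetical discovery-GWAS weights and one individual's genotypes.
weights = {"snp_a": 0.10, "snp_b": -0.05, "snp_c": 0.20}
person = {"snp_a": 2, "snp_b": 1, "snp_c": 0}
score = polygenic_score(person, weights)
print(round(score, 3))
```

Scores computed this way for each individual can then be compared between groups or used as a regressor against an imaging phenotype, which is the kind of analysis the abstract reports.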
Lu, Qiongshi; Li, Boyang; Ou, Derek; Erlendsdottir, Margret; Powles, Ryan L; Jiang, Tony; Hu, Yiming; Chang, David; Jin, Chentian; Dai, Wei; He, Qidu; Liu, Zefeng; Mukherjee, Shubhabrata; Crane, Paul K; Zhao, Hongyu
2017-12-07
Despite the success of large-scale genome-wide association studies (GWASs) on complex traits, our understanding of their genetic architecture is far from complete. Jointly modeling multiple traits' genetic profiles has provided insights into the shared genetic basis of many complex traits. However, large-scale inference sets a high bar for both statistical power and biological interpretability. Here we introduce a principled framework to estimate annotation-stratified genetic covariance between traits using GWAS summary statistics. Through theoretical and numerical analyses, we demonstrate that our method provides accurate covariance estimates, thereby enabling researchers to dissect both the shared and distinct genetic architecture across traits to better understand their etiologies. Among 50 complex traits with publicly accessible GWAS summary statistics (N total ≈ 4.5 million), we identified more than 170 pairs with statistically significant genetic covariance. In particular, we found strong genetic covariance between late-onset Alzheimer disease (LOAD) and amyotrophic lateral sclerosis (ALS), two major neurodegenerative diseases, in single-nucleotide polymorphisms (SNPs) with high minor allele frequencies and in SNPs located in the predicted functional genome. Joint analysis of LOAD, ALS, and other traits highlights LOAD's correlation with cognitive traits and hints at an autoimmune component for ALS. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Plaza-Izurieta, Leticia; Castellanos-Rubio, Ainara; Irastorza, Iñaki; Fernández-Jimenez, Nora; Gutierrez, Galder; Bilbao, Jose Ramon
2011-07-01
Recent genome wide association studies (GWAS) on coeliac disease (CD) have identified risk loci harbouring genes that fit the accepted pathogenic model and are considered aetiological candidates. Using Taqman single nucleotide polymorphism (SNP) and expression assays, the study genotyped 11 SNPs tagging eight GWAS regions (1q31, 2q11-2q12, 3p21, 3q25-3q26, 3q28, 4q27, 6q25 and 12q24) in a Spanish cohort of 1094 CD patients and 540 controls, and performed expression analyses of candidate genes (RGS1, IL18R1/IL18RAP, CCR3, IL12A/SCHIP1, LPP, IL2/IL21-KIAA1109, TAGAP, and SH2B3) in intestinal mucosa from 29 CD children and eight controls. Polymorphisms in 1q31, 2q11-2q12, and 3q25 showed association in our cohort, and also 3q28 and 4q27 when combined with a previous study. Expression levels of IL12A, IL18RAP, IL21, KIAA1109, LPP, SCHIP1, and SH2B3 were affected by disease status, but the correlation between genotype and mRNA levels was observed only in IL12A, LPP, SCHIP1, and SH2B3. Expression differences between treated CD patients and controls along with SNP expression associations suggest a possible primary role for these four genes and their variants in pathogenesis. The lack of SNP effect in the remaining genes is probably a consequence of arbitrary candidate gene selection within association signals that are not based on functional studies.
Ryan, Joanne; Artero, Sylvaine; Carrière, Isabelle; Maller, Jerome J; Meslin, Chantal; Ritchie, Karen; Ancelin, Marie-Laure
2016-01-01
A number of genome-wide association studies (GWAS) have investigated risk factors for major depressive disorder (MDD); however, there has been little attempt to replicate these findings in population-based studies of depressive symptoms. Variants within three genes, BICC1, PCLO and GRM7, were selected for replication in our study based on the following criteria: they were identified in a prior MDD GWAS study; a subsequent study found evidence that they influenced depression risk; and there is a solid biological basis for a role in depression. We first investigated whether these variants were associated with depressive symptoms in our population-based cohort of 929 elderly individuals (238 with clinical depressive symptoms and 691 controls), and second whether they were associated with structural brain alterations. A number of nominally significant associations were identified, but none reached Bonferroni-corrected significance levels. Common SNPs in BICC1 and PCLO were associated with a 50% and 30% decreased risk of depression, respectively. PCLO rs2522833 was also associated with the volume of grey matter (p=1.6×10^-3), and to a lesser extent with hippocampal volume and white matter lesions. Among depressed individuals rs9870680 (GRM7) was associated with the volume of grey and white matter (p=10^-4 and 8.3×10^-3, respectively). Our results provide some support for the involvement of BICC1 and PCLO in late-life depressive disorders and preliminary evidence that these genetic variants may also influence brain structural volumes. However, effect sizes remain modest and associations did not reach corrected significance levels. Further large imaging studies are needed to confirm our findings. Copyright © 2015 Elsevier B.V. and ECNP. All rights reserved.
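The distinction drawn above between nominally significant and Bonferroni-corrected results can be sketched in a few lines of Python. The p-values and test labels below are hypothetical; the abstract does not state how many tests were corrected for, so the family size used here is purely illustrative.

```python
def bonferroni(p_values, alpha=0.05):
    """Return the Bonferroni threshold and a per-test significance flag.

    p_values: dict mapping test label -> uncorrected p-value.
    The threshold is alpha divided by the number of tests in the family.
    """
    threshold = alpha / len(p_values)
    flags = {name: p < threshold for name, p in p_values.items()}
    return threshold, flags


# Hypothetical family of four association tests.
p_values = {"variant_1": 0.02, "variant_2": 0.03,
            "variant_3": 0.0004, "variant_4": 0.20}
threshold, corrected = bonferroni(p_values)
nominal = {name: p < 0.05 for name, p in p_values.items()}
print(threshold, nominal, corrected)
```

In this toy family three tests are nominally significant at 0.05 but only one survives correction, which mirrors the pattern of nominal but uncorrected associations described in the abstract.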
Poirier, Julia G; Brennan, Paul; McKay, James D; Spitz, Margaret R; Bickeböller, Heike; Risch, Angela; Liu, Geoffrey; Le Marchand, Loic; Tworoger, Shelley; McLaughlin, John; Rosenberger, Albert; Heinrich, Joachim; Brüske, Irene; Muley, Thomas; Henderson, Brian E; Wilkens, Lynne R; Zong, Xuchen; Li, Yafang; Hao, Ke; Timens, Wim; Bossé, Yohan; Sin, Don D; Obeidat, Ma'en; Amos, Christopher I; Hung, Rayjean J
2015-03-01
Lung cancer is the leading cause of cancer death worldwide. Although several genetic variants associated with lung cancer have been identified in the past, stringent selection criteria of genome-wide association studies (GWAS) can lead to missed variants. The objective of this study was to uncover missed variants by using the known association between lung cancer and first-degree family history of lung cancer to enrich the variant prioritization for lung cancer susceptibility regions. In this two-stage GWAS study, we first selected a list of variants associated with both lung cancer and family history of lung cancer in four GWAS (3,953 cases, 4,730 controls), then replicated our findings for 30 variants in a meta-analysis of four additional studies (7,510 cases, 7,476 controls). The top ranked genetic variant rs12415204 in chr10q23.33 encoding FFAR4 in the Discovery set was validated in the Replication set with an overall OR of 1.09 (95% CI=1.04, 1.14, P=1.63×10^-4). When combining the two stages of the study, the strongest association was found in rs1158970 at chr4p15.2 encoding KCNIP4 with an OR of 0.89 (95% CI=0.85, 0.94, P=9.64×10^-6). We performed a stratified analysis of rs12415204 and rs1158970 across all eight studies by age, gender, smoking status, and histology, and found consistent results across strata. Four of the 30 replicated variants act as expression quantitative trait loci (eQTL) sites in 1,111 nontumor lung tissues and meet the genome-wide 10% FDR threshold. © 2015 Wiley Periodicals, Inc.
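The odds ratios above are reported with 95% confidence intervals and P values. As a rough consistency check, a Wald-type z statistic and two-sided P value can be approximated from an odds ratio and its interval. The Python sketch below assumes the interval is symmetric on the log-odds scale (a standard Wald interval), which the abstract does not state. Because the published bounds are rounded to two decimals, the result should only be expected to agree with the reported P value in order of magnitude.

```python
import math


def p_from_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Approximate standard error, z and two-sided P from an OR and its 95% CI.

    Assumes a Wald interval: log(OR) +/- z_crit * SE.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return se, z, p


# Replication-stage estimate for rs12415204 as given in the abstract.
se, z, p = p_from_ci(1.09, 1.04, 1.14)
print(f"SE = {se:.4f}, z = {z:.2f}, P = {p:.1e}")
```

The approximation lands on the order of 10^-4, in line with the reported replication P value given the rounding of the interval bounds.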