Science.gov

Sample records for genome-wide mammalian consistency-based

  1. Mammalian NET-seq analysis defines nascent RNA profiles and associated RNA processing genome-wide

    PubMed Central

    Nojima, Takayuki; Gomes, Tomás; Carmo-Fonseca, Maria; Proudfoot, Nicholas J

    2016-01-01

    The transcription cycle of RNA polymerase II (Pol II) correlates with changes to the phosphorylation state of its large subunit C-terminal domain (CTD). We recently developed Native Elongation Transcript sequencing using mammalian cells (mNET-seq), which generates single-nucleotide–resolution genome-wide profiles of nascent RNA and co-transcriptional RNA processing that are associated with different CTD phosphorylation states. Here we provide a detailed protocol for mNET-seq. First, Pol II elongation complexes are isolated with specific phospho-CTD antibodies from chromatin solubilized by micrococcal nuclease digestion. Next, RNA derived from within the Pol II complex is size fractionated and Illumina sequenced. using mNET-seq, we have previously shown that Pol II pauses at both ends of protein-coding genes but with different CTD phosphorylation patterns, and we have also detected phosphorylation at serine 5 (Ser5-P) CTD-specific splicing intermediates and Pol II accumulation over co-transcriptionally spliced exons. With moderate biochemical and bioinformatic skills, mNET-seq can be completed in ~6 d, not including sequencing and data analysis. PMID:26844429

  2. PReMod: a database of genome-wide mammalian cis-regulatory module predictions.

    PubMed

    Ferretti, Vincent; Poitras, Christian; Bergeron, Dominique; Coulombe, Benoit; Robert, François; Blanchette, Mathieu

    2007-01-01

    We describe PReMod, a new database of genome-wide cis-regulatory module (CRM) predictions for both the human and the mouse genomes. The prediction algorithm, described previously in Blanchette et al. (2006) Genome Res., 16, 656-668, exploits the fact that many known CRMs are made of clusters of phylogenetically conserved and repeated transcription factors (TF) binding sites. Contrary to other existing databases, PReMod is not restricted to modules located proximal to genes, but in fact mostly contains distal predicted CRMs (pCRMs). Through its web interface, PReMod allows users to (i) identify pCRMs around a gene of interest; (ii) identify pCRMs that have binding sites for a given TF (or a set of TFs) or (iii) download the entire dataset for local analyses. Queries can also be refined by filtering for specific chromosomal regions, for specific regions relative to genes or for the presence of CpG islands. The output includes information about the binding sites predicted within the selected pCRMs, and a graphical display of their distribution within the pCRMs. It also provides a visual depiction of the chromosomal context of the selected pCRMs in terms of neighboring pCRMs and genes, all of which are linked to the UCSC Genome Browser and the NCBI. PReMod: http://genomequebec.mcgill.ca/PReMod.

  3. Detecting DNA double-stranded breaks in mammalian genomes by linear amplification-mediated high-throughput genome-wide translocation sequencing.

    PubMed

    Hu, Jiazhi; Meyers, Robin M; Dong, Junchao; Panchakshari, Rohit A; Alt, Frederick W; Frock, Richard L

    2016-05-01

    Unbiased, high-throughput assays for detecting and quantifying DNA double-stranded breaks (DSBs) across the genome in mammalian cells will facilitate basic studies of the mechanisms that generate and repair endogenous DSBs. They will also enable more applied studies, such as those to evaluate the on- and off-target activities of engineered nucleases. Here we describe a linear amplification-mediated high-throughput genome-wide sequencing (LAM-HTGTS) method for the detection of genome-wide 'prey' DSBs via their translocation in cultured mammalian cells to a fixed 'bait' DSB. Bait-prey junctions are cloned directly from isolated genomic DNA using LAM-PCR and unidirectionally ligated to bridge adapters; subsequent PCR steps amplify the single-stranded DNA junction library in preparation for Illumina Miseq paired-end sequencing. A custom bioinformatics pipeline identifies prey sequences that contribute to junctions and maps them across the genome. LAM-HTGTS differs from related approaches because it detects a wide range of broken end structures with nucleotide-level resolution. Familiarity with nucleic acid methods and next-generation sequencing analysis is necessary for library generation and data interpretation. LAM-HTGTS assays are sensitive, reproducible, relatively inexpensive, scalable and straightforward to implement with a turnaround time of <1 week.

  4. Genome Wide Association Studies

    NASA Astrophysics Data System (ADS)

    Sebastiani, Paola; Solovieff, Nadia

    The availability of high throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS). These studies generate massive amount of genetic data that challenge investigators with issues related to data management, statistical analysis of large data sets, visualization, and annotation of results. We will review the common approach to analysis of GWAS data and then discuss options to learn more from these data.

  5. A genome-wide screen identifies a single Β-defensin gene cluster in the chicken: implications for the origin and evolution of mammalian defensins

    SciTech Connect

    Xiao, Yanjing; Hughes, Austin L.; Ando, Junko; Matsuda, Yoichi; Cheng, Jan-Fang; Skinner-Noble, Donald; Zhang, Guolong

    2004-08-13

    Defensins comprise a large family of cationic antimicrobial peptides that are characterized by the presence of a conserved cysteine-rich defensin motif. Based on the spacing pattern of cysteines, these defensins are broadly divided into five groups, namely plant, invertebrate, {alpha}-, {beta}-, and {theta}-defensins, with the last three groups being mostly found in mammalian species. However, the evolutionary relationships among these five groups of defensins remain controversial.

  6. Profiling genome-wide DNA methylation.

    PubMed

    Yong, Wai-Shin; Hsu, Fei-Man; Chen, Pao-Yang

    2016-01-01

    DNA methylation is an epigenetic modification that plays an important role in regulating gene expression and therefore a broad range of biological processes and diseases. DNA methylation is tissue-specific, dynamic, sequence-context-dependent and trans-generationally heritable, and these complex patterns of methylation highlight the significance of profiling DNA methylation to answer biological questions. In this review, we surveyed major methylation assays, along with comparisons and biological examples, to provide an overview of DNA methylation profiling techniques. The advances in microarray and sequencing technologies make genome-wide profiling possible at a single-nucleotide or even a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, genomic region coverage, and bioinformatics analysis, and selecting a feasible method requires knowledge of these methods. We first introduce the biological background of DNA methylation and its pattern in plants, animals and fungi. We present an overview of major experimental approaches to profiling genome-wide DNA methylation and hydroxymethylation and then extend to the single-cell methylome. To evaluate these methods, we outline their strengths and weaknesses and perform comparisons across the different platforms. Due to the increasing need to compute high-throughput epigenomic data, we interrogate the computational pipeline for bisulfite sequencing data and also discuss the concept of identifying differentially methylated regions (DMRs). This review summarizes the experimental and computational concepts for profiling genome-wide DNA methylation, followed by biological examples. Overall, this review provides researchers useful guidance for the selection of a profiling method suited to specific research questions.

  7. Genome-Wide Approaches to Schizophrenia

    PubMed Central

    Duan, Jubao; Sanders, Alan R.; Gejman, Pablo V.

    2010-01-01

    Schizophrenia (SZ) is a common and severe psychiatric disorder with both environmental and genetic risk factors, and a high heritability. After over 20 years of molecular genetics research, new molecular strategies, primarily genome-wide association studies (GWAS), have generated major tangible progress. This new data provides evidence for: 1) A number of chromosomal regions with common polymorphisms showing genome-wide association with SZ (the major histocompatibility complex, MHC, region at 6p22-p21; 18q21.2; and 2q32.1). The associated alleles present small odds ratios (the odds of a risk variant being present in cases versus controls) and suggest causative involvement of gene regulatory mechanisms in SZ. 2) Polygenic inheritance. 3) Involvement of rare (<1%) and large (>100kb) copy number variants (CNVs). 4) A genetic overlap of SZ with autism and with bipolar disorder (BP) challenging the classical clinical classifications. Most new SZ findings (chromosomal regions and genes) have generated new biological leads. These new findings, however, still need to be translated into a better understanding of the underlying biology and into causal mechanisms. Furthermore, a considerable amount of heritability still remains unexplained (missing heritability). Deep resequencing for rare variants and system biology approaches (e.g., integrating DNA sequence and functional data) are expected to further improve our understanding of the genetic architecture of SZ and its underlying biology. PMID:20433910

  8. Genome-Wide Association Studies of Cancer

    PubMed Central

    Stadler, Zsofia K.; Thom, Peter; Robson, Mark E.; Weitzel, Jeffrey N.; Kauff, Noah D.; Hurley, Karen E.; Devlin, Vincent; Gold, Bert; Klein, Robert J.; Offit, Kenneth

    2010-01-01

    Knowledge of the inherited risk for cancer is an important component of preventive oncology. In addition to well-established syndromes of cancer predisposition, much remains to be discovered about the genetic variation underlying susceptibility to common malignancies. Increased knowledge about the human genome and advances in genotyping technology have made possible genome-wide association studies (GWAS) of human diseases. These studies have identified many important regions of genetic variation associated with an increased risk for human traits and diseases including cancer. Understanding the principles, major findings, and limitations of GWAS is becoming increasingly important for oncologists as dissemination of genomic risk tests directly to consumers is already occurring through commercial companies. GWAS have contributed to our understanding of the genetic basis of cancer and will shed light on biologic pathways and possible new strategies for targeted prevention. To date, however, the clinical utility of GWAS-derived risk markers remains limited. PMID:20585100

  9. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  10. Genome Wide Methylome Alterations in Lung Cancer.

    PubMed

    Mullapudi, Nandita; Ye, Bin; Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D; Spivack, Simon D

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)-non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents.

  11. Genome Wide Methylome Alterations in Lung Cancer

    PubMed Central

    Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K.; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D.; Spivack, Simon D.

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)–non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  12. Genome wide selection in Citrus breeding.

    PubMed

    Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A

    2016-10-17

    Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq(TM) (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.

  13. Genome-Wide Views of Chromatin Structure

    PubMed Central

    Rando, Oliver J.; Chang, Howard Y.

    2010-01-01

    Eukaryotic genomes are packaged into a nucleoprotein complex known as chromatin, which affects most processes that occur on DNA. Along with genetic and biochemical studies of resident chromatin proteins and their modifying enzymes, mapping of chromatin structure in vivo is one of the main pillars in our understanding of how chromatin relates to cellular processes. In this review, we discuss the use of genomic technologies to characterize chromatin structure in vivo, with a focus on data from budding yeast and humans. The picture emerging from these studies is the detailed chromatin structure of a typical gene, where the typical behavior gives insight into the mechanisms and deep rules that establish chromatin structure. Important deviation from the archetype is also observed, usually as a consequence of unique regulatory mechanisms at special genomic loci. Chromatin structure shows substantial conservation from yeast to humans, but mammalian chromatin has additional layers of complexity that likely relate to the requirements of multicellularity such as the need to establish faithful gene regulatory mechanisms for cell differentiation. PMID:19317649

  14. Adjusted P values for genome-wide scans.

    PubMed Central

    Lystig, Theodore C

    2003-01-01

    Genome-wide scans for quantitative trait loci (QTL) have traditionally been summarized with plots of logarithm of odds (LOD) scores. A valuable modification is to supplement such plots with an additional vertical axis displaying quantiles of adjusted P values and labeling local maxima of the LOD scores with location-specific adjusted P values. This provides a visible gradation of genome-wide significance for the LOD score curve, instead of the stark dichotomy that a single threshold yields. Adjusted P values give genome-wide significance of individual LOD scores and are obtained through a straightforward modification of the familiar algorithm for generating permutation-based thresholds. PMID:12930772

  15. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    MedlinePlus

    ... historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A team of international researchers supported by the ... when they divide and grow uncontrollably, develop into melanoma. Also, MITF activity is known to be amplified ...

  16. A novel statistic for genome-wide interaction analysis.

    PubMed

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  17. A genome-wide CRISPR screen in primary immune cells to dissect regulatory networks

    PubMed Central

    Parnas, Oren; Jovanovic, Marko; Eisenhaure, Thomas M.; Herbst, Rebecca H.; Dixit, Atray; Ye, Chun Jimmie; Przybylski, Dariusz; Platt, Randall J.; Tirosh, Itay; Sanjana, Neville E.; Shalem, Ophir; Satija, Rahul; Raychowdhury, Raktima; Mertins, Philipp; Carr, Steven A.; Zhang, Feng; Hacohen, Nir; Regev, Aviv

    2015-01-01

    Finding the components of cellular circuits and determining their functions systematically remains a major challenge in mammalian cells. Here, we introduced genome-wide pooled CRISPR-Cas9 libraries into dendritic cells (DCs) to identify genes that control the induction of tumor necrosis factor (Tnf) by bacterial lipopolysaccharide (LPS), a key process in the host response to pathogens, mediated by the Tlr4 pathway. We found many of the known regulators of Tlr4 signaling, as well as dozens of previously unknown candidates that we validated. By measuring protein markers and mRNA profiles in DCs that are deficient in the known or candidate genes, we classified the genes into three functional modules with distinct effects on the canonical responses to LPS, and highlighted functions for the PAF complex and oligosaccharyltransferase (OST) complex. Our findings uncover new facets of innate immune circuits in primary cells, and provide a genetic approach for dissection of mammalian cell circuits. PMID:26189680

  18. Genome-wide association studies in maize: praise and stargaze

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide association study (GWAS) has appeared as a widespread strategy in decoding genotype-phenotype associations in many species thanks to technical advances in next-generation sequencing (NGS) applications. Maize is an ideal crop for GWAS and significant progress has been made in the last dec...

  19. A super powerful method for genome wide association study

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population struct...

  20. Genome-wide association study identifies five new schizophrenia loci

    PubMed Central

    2012-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10−11) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10−9), ANK3 (rs10994359, P = 2.5 × 10−8) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10−9). PMID:21926974

  1. Genome-wide association mapping of soybean aphid resistance traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Soybean aphid is the most damaging insect pest of soybean in the Upper Midwest and is primarily controlled by insecticides. Soybean aphid resistance (i.e., Rag genes) has been documented in some soybean lines at chromosomes 6, 7, 13, and 16, but more sources of resistance are needed. Genome-wide ass...

  2. Analysis of Heritability Using Genome-Wide Data.

    PubMed

    Hall, Jacob B; Bush, William S

    2016-10-11

    Most analyses of genome-wide association data consider each variant independently without considering or adjusting for the genetic background present in the rest of the genome. New approaches to genome analysis use representations of genomic sharing to better account for confounding factors like population stratification or to directly approximate heritability through the estimated sharing of individuals in a dataset. These approaches use mixed linear models, which relate genotypic sharing to phenotypic sharing, and rely on the efficient computation of genetic sharing among individuals in a dataset. This unit describes the principles and practical application of mixed models for the analysis of genome-wide association study data. © 2016 by John Wiley & Sons, Inc.

  3. Genome-wide scans for loci under selection in humans.

    PubMed

    Ronald, James; Akey, Joshua M

    2005-06-01

    Natural selection, which can be defined as the differential contribution of genetic variants to future generations, is the driving force of Darwinian evolution. Identifying regions of the human genome that have been targets of natural selection is an important step in clarifying human evolutionary history and understanding how genetic variation results in phenotypic diversity, it may also facilitate the search for complex disease genes. Technological advances in high-throughput DNA sequencing and single nucleotide polymorphism genotyping have enabled several genome-wide scans of natural selection to be undertaken. Here, some of the observations that are beginning to emerge from these studies will be reviewed, including evidence for geographically restricted selective pressures (ie local adaptation) and a relationship between genes subject to natural selection and human disease. In addition, the paper will highlight several important problems that need to be addressed in future genome-wide studies of natural selection.

  4. Genome-wide functional analysis in Candida albicans.

    PubMed

    Motaung, Thabiso E; Ells, Ruan; Pohl, Carolina H; Albertyn, Jacobus; Tsilo, Toi J

    2017-02-08

    Candida albicans is an important etiological agent of superficial and life-threatening infections in individuals with compromised immune systems. To date, we know of several overlapping genetic networks that govern virulence attributes in this fungal pathogen. Classical use of deletion mutants has led to the discovery of numerous virulence factors over the years, and genome-wide functional analysis has propelled gene discovery at an even faster pace. Indeed, a number of recent studies using large-scale genetic screens followed by genome-wide functional analysis has allowed for the unbiased discovery of many new genes involved in C. albicans biology. Here we share our perspectives on the role of these studies in analyzing fundamental aspects of C. albicans virulence properties.

  5. Genome-wide patterns of selection in 230 ancient Eurasians.

    PubMed

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Guerra, Manuel A Rojo; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2015-12-24

    Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 bc, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 bc, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height.

  6. Genome-wide patterns of selection in 230 ancient Eurasians

    PubMed Central

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  7. Genome-wide association studies of obesity and metabolic syndrome.

    PubMed

    Fall, Tove; Ingelsson, Erik

    2014-01-25

    Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was in 2007 mapped to a gene with for the time unknown function. This gene, now known as fat mass and obesity associated (FTO) has been repeatedly replicated in several ethnicities and is affecting obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years.

  8. Genome-wide association study of paliperidone efficacy

    PubMed Central

    Wineinger, Nathan E.; Fu, Dong-Jing; Libiger, Ondrej; Alphs, Larry; Savitz, Adam; Gopal, Srihari; Cohen, Nadine; Schork, Nicholas J.

    2017-01-01

    Objective Clinical response to the atypical antipsychotic paliperidone is known to vary among schizophrenic patients. We carried out a genome-wide association study to identify common genetic variants predictive of paliperidone efficacy. Methods We leveraged a collection of 1390 samples from individuals of European ancestry enrolled in 12 clinical studies investigating the efficacy of the extended-release tablet paliperidone ER (n1=490) and the once-monthly injection paliperidone palmitate (n2=550 and n3=350). We carried out a genome-wide association study using a general linear model (GLM) analysis on three separate cohorts, followed by meta-analysis and using a mixed linear model analysis on all samples. The variations in response explained by each single nucleotide polymorphism (h2SNP) were estimated. Results No SNP passed genome-wide significance in the GLM-based analyses with suggestive signals from rs56240334 [P=7.97×10−8 for change in the Clinical Global Impression Scale-Severity (CGI-S); P=8.72×10−7 for change in the total Positive and Negative Syndrome Scale (PANSS)] in the intron of ADCK1. The mixed linear model-based association P-values for rs56240334 were consistent with the results from GLM-based analyses and the association with change in CGI-S (P=4.26×10−8) reached genome-wide significance (i.e. P<5×10−8). We also found suggestive evidence for a polygenic contribution toward paliperidone treatment response with estimates of heritability, h2SNP, ranging from 0.31 to 0.43 for change in the total PANSS score, the PANSS positive Marder factor score, and CGI-S. Conclusion Genetic variations in the ADCK1 gene may differentially predict paliperidone efficacy in schizophrenic patients. However, this finding should be replicated in additional samples. PMID:27846195

  9. Genome-Wide Profiling of Alternative Translation Initiation Sites.

    PubMed

    Gao, Xiangwei; Wan, Ji; Qian, Shu-Bing

    2016-01-01

    Regulation of translation initiation is a central control point in protein synthesis. Variations of start codon selection contribute to protein diversity and complexity. Systemic mapping of start codon positions and precise measurement of the corresponding initiation rate would transform our understanding of translational control. Here we describe a ribosome profiling approach that enables identification of translation initiation sites on a genome-wide scale. By capturing initiating ribosomes using lactimidomycin, this approach permits qualitative and quantitative analysis of alternative translation initiation.

  10. Genome-Wide Association Studies and Liver Disease

    PubMed Central

    Speliotes, Elizabeth K.

    2016-01-01

    Sequencing of the human genome has opened up many opportunities to learn about our own genetic susceptibilities to disease. In this Foreword to this issue of Seminars in Liver Disease, I provide some required background to understanding genome-wide association analyses in general, including a list of terms (Table 1) often used in such studies. Five areas of particular significance are then reviewed in detail in the articles that follow. PMID:26676811

  11. Genome-Wide Association Study of Polymorphisms Predisposing to Bronchiolitis

    PubMed Central

    Pasanen, Anu; Karjalainen, Minna K.; Bont, Louis; Piippo-Savolainen, Eija; Ruotsalainen, Marja; Goksör, Emma; Kumawat, Kuldeep; Hodemaekers, Hennie; Nuolivirta, Kirsi; Jartti, Tuomas; Wennergren, Göran; Hallman, Mikko; Rämet, Mika; Korppi, Matti

    2017-01-01

    Bronchiolitis is a major cause of hospitalization among infants. Severe bronchiolitis is associated with later asthma, suggesting a common genetic predisposition. Genetic background of bronchiolitis is not well characterized. To identify polymorphisms associated with bronchiolitis, we conducted a genome-wide association study (GWAS) in which 5,300,000 single nucleotide polymorphisms (SNPs) were tested for association in a Finnish–Swedish population of 217 children hospitalized for bronchiolitis and 778 controls. The most promising SNPs (n = 77) were genotyped in a Dutch replication population of 416 cases and 432 controls. Finally, we used a set of 202 Finnish bronchiolitis cases to further investigate candidate SNPs. We did not detect genome-wide significant associations, but several suggestive association signals (p < 10−5) were observed in the GWAS. In the replication population, three SNPs were nominally associated (p < 0.05). Of them, rs269094 was an expression quantitative trait locus (eQTL) for KCND3, previously shown to be associated with occupational asthma. In the additional set of Finnish cases, the association for another SNP (rs9591920) within a noncoding RNA locus was further strengthened. Our results provide a first genome-wide examination of the genetics underlying bronchiolitis. These preliminary findings require further validation in a larger sample size. PMID:28139761

  12. Genome-wide DNA polymorphism analyses using VariScan

    PubMed Central

    Hutter, Stephan; Vilella, Albert J; Rozas, Julio

    2006-01-01

    Background DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data. PMID:16968531

  13. Voxelwise genome-wide association study (vGWAS).

    PubMed

    Stein, Jason L; Hua, Xue; Lee, Suh; Ho, April J; Leow, Alex D; Toga, Arthur W; Saykin, Andrew J; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J; Craig, David W; Gerber, Jill D; Allen, April N; Corneveaux, Jason J; Dechairo, Bryan M; Potkin, Steven G; Weiner, Michael W; Thompson, Paul

    2010-11-15

    The structure of the human brain is highly heritable, and is thought to be influenced by many common genetic variants, many of which are currently unknown. Recent advances in neuroimaging and genetics have allowed collection of both highly detailed structural brain scans and genome-wide genotype information. This wealth of information presents a new opportunity to find the genes influencing brain structure. Here we explore the relation between 448,293 single nucleotide polymorphisms in each of 31,622 voxels of the entire brain across 740 elderly subjects (mean age+/-s.d.: 75.52+/-6.82 years; 438 male) including subjects with Alzheimer's disease, Mild Cognitive Impairment, and healthy elderly controls from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used tensor-based morphometry to measure individual differences in brain structure at the voxel level relative to a study-specific template based on healthy elderly subjects. We then conducted a genome-wide association at each voxel to identify genetic variants of interest. By studying only the most associated variant at each voxel, we developed a novel method to address the multiple comparisons problem and computational burden associated with the unprecedented amount of data. No variant survived the strict significance criterion, but several genes worthy of further exploration were identified, including CSMD2 and CADPS2. These genes have high relevance to brain structure. This is the first voxelwise genome wide association study to our knowledge, and offers a novel method to discover genetic influences on brain structure.

  14. Genome-Wide Estimates of Heritability for Social Demographic Outcomes

    PubMed Central

    Domingue, Benjamin W.; Wedow, Robbee; Conley, Dalton; McQueen, Matt; Hoffmann, Thomas J.; Boardman, Jason D.

    2016-01-01

    An increasing number of studies that are widely used in the demographic research community have collected genome-wide data from their respondents. It is therefore important that demographers have a proper understanding of some of the methodological tools needed to analyze such data. Our paper details the underlying methodology behind one of the most common techniques for analyzing genome-wide data, Genome-Wide Complex Trait Analysis (GCTA). GCTA models provide heritability estimates for health, health behaviors, or indicators of attainment using data from unrelated persons.. Our goal is to describe this model, to highlight the utility of the model for biodemographic research, and to demonstrate the performance of this approach under modifications of the underlying assumptions. The first set of modifications involves changing the nature of the genetic data used to compute genetic similarities between individuals (the genetic relationship matrix). We then explore the sensitivity of the model to heteroscedastic errors. In general, GCTA estimates are robust to the modifications proposed here but we also highlight potential limitations of GCTA estimates. PMID:27050030

  15. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep

    PubMed Central

    Mousel, Michelle R.; Reynolds, James O.; White, Stephen N.

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10-5) were identified including markers in or near PIK3CB (P = 2.22x10-6; additive model), KCNB1 (P = 2.93x10-6; dominance model), ZC3H12C (P = 3.25x10-6; genotypic model), JPH1 (P = 4.68x20-6; genotypic model), and MYO3B (P = 5.74x10-6; recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection. PMID:26098909

  16. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep.

    PubMed

    Mousel, Michelle R; Reynolds, James O; White, Stephen N

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10(-5)) were identified including markers in or near PIK3CB (P = 2.22x10(-6); additive model), KCNB1 (P = 2.93x10(-6); dominance model), ZC3H12C (P = 3.25x10(-6); genotypic model), JPH1 (P = 4.68x20(-6); genotypic model), and MYO3B (P = 5.74x10(-6); recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection.

  17. Genome-wide analysis of a Wnt1-regulated transcriptional network implicates neurodegenerative pathways.

    PubMed

    Wexler, Eric M; Rosen, Ezra; Lu, Daning; Osborn, Gregory E; Martin, Elizabeth; Raybould, Helen; Geschwind, Daniel H

    2011-10-04

    Wnt proteins are critical to mammalian brain development and function. The canonical Wnt signaling pathway involves the stabilization and nuclear translocation of β-catenin; however, Wnt also signals through alternative, noncanonical pathways. To gain a systems-level, genome-wide view of Wnt signaling, we analyzed Wnt1-stimulated changes in gene expression by transcriptional microarray analysis in cultured human neural progenitor (hNP) cells at multiple time points over a 72-hour time course. We observed a widespread oscillatory-like pattern of changes in gene expression, involving components of both the canonical and the noncanonical Wnt signaling pathways. A higher-order, systems-level analysis that combined independent component analysis, waveform analysis, and mutual information-based network construction revealed effects on pathways related to cell death and neurodegenerative disease. Wnt effectors were tightly clustered with presenilin1 (PSEN1) and granulin (GRN), which cause dominantly inherited forms of Alzheimer's disease and frontotemporal dementia (FTD), respectively. We further explored a potential link between Wnt1 and GRN and found that Wnt1 decreased GRN expression by hNPs. Conversely, GRN knockdown increased WNT1 expression, demonstrating that Wnt and GRN reciprocally regulate each other. Finally, we provided in vivo validation of the in vitro findings by analyzing gene expression data from individuals with FTD. These unbiased and genome-wide analyses provide evidence for a connection between Wnt signaling and the transcriptional regulation of neurodegenerative disease genes.

  18. Genome-Wide Association Study of Metabolic Syndrome in Koreans

    PubMed Central

    Jeong, Seok Won; Chung, Myungguen; Park, Soo-Jung; Cho, Seong Beom

    2014-01-01

    Metabolic syndrome (METS) is a disorder of energy utilization and storage and increases the risk of developing cardiovascular disease and diabetes. To identify the genetic risk factors of METS, we carried out a genome-wide association study (GWAS) for 2,657 cases and 5,917 controls in Korean populations. As a result, we could identify 2 single nucleotide polymorphisms (SNPs) with genome-wide significance level p-values (<5 × 10-8), 8 SNPs with genome-wide suggestive p-values (5 × 10-8 ≤ p < 1 × 10-5), and 2 SNPs of more functional variants with borderline p-values (5 × 10-5 ≤ p < 1 × 10-4). On the other hand, the multiple correction criteria of conventional GWASs exclude false-positive loci, but simultaneously, they discard many true-positive loci. To reconsider the discarded true-positive loci, we attempted to include the functional variants (nonsynonymous SNPs [nsSNPs] and expression quantitative trait loci [eQTL]) among the top 5,000 SNPs based on the proportion of phenotypic variance explained by genotypic variance. In total, 159 eQTLs and 18 nsSNPs were presented in the top 5,000 SNPs. Although they should be replicated in other independent populations, 6 eQTLs and 2 nsSNP loci were located in the molecular pathways of LPL, APOA5, and CHRM2, which were the significant or suggestive loci in the METS GWAS. Conclusively, our approach using the conventional GWAS, reconsidering functional variants and pathway-based interpretation, suggests a useful method to understand the GWAS results of complex traits and can be expanded in other genomewide association studies. PMID:25705157

  19. Genome-Wide Approaches to Drosophila Heart Development

    PubMed Central

    Frasch, Manfred

    2016-01-01

    The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi) reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level. PMID:27294102

  20. Genome-Wide Association of Heroin Dependence in Han Chinese

    PubMed Central

    Coleman, Jonathan R. I.; Ducci, Francesca; Aliev, Fazil; Newhouse, Stephen J.; Liu, Xiehe; Ma, Xiaohong; Wang, Yingcheng; Collier, David A.; Asherson, Philip; Li, Tao; Breen, Gerome

    2016-01-01

    Drug addiction is a costly and recurring healthcare problem, necessitating a need to understand risk factors and mechanisms of addiction, and to identify new biomarkers. To date, genome-wide association studies (GWAS) for heroin addiction have been limited; moreover they have been restricted to examining samples of European and African-American origin due to difficulty of recruiting samples from other populations. This is the first study to test a Han Chinese population; we performed a GWAS on a homogeneous sample of 370 Han Chinese subjects diagnosed with heroin dependence using the DSM-IV criteria and 134 ethnically matched controls. Analysis using the diagnostic criteria of heroin dependence yielded suggestive evidence for association between variants in the genes CCDC42 (coiled coil domain 42; p = 2.8x10-7) and BRSK2 (BR serine/threonine 2; p = 4.110−6). In addition, we found evidence for risk variants within the ARHGEF10 (Rho guanine nucleotide exchange factor 10) gene on chromosome 8 and variants in a region on chromosome 20q13, which is gene-poor but has a concentration of mRNAs and predicted miRNAs. Gene-based association analysis identified genome-wide significant association between variants in CCDC42 and heroin addiction. Additionally, when we investigated shared risk variants between heroin addiction and risk of other addiction-related and psychiatric phenotypes using polygenic risk scores, we found a suggestive relationship with variants predicting tobacco addiction, and a significant relationship with variants predicting schizophrenia. Our genome wide association study of heroin dependence provides data in a novel sample, with functionally plausible results and evidence of genetic data of value to the field. PMID:27936112

  1. Genome-wide Association Study of Obsessive-Compulsive Disorder

    PubMed Central

    Stewart, S Evelyn; Yu, Dongmei; Scharf, Jeremiah M; Neale, Benjamin M; Fagerness, Jesen A; Mathews, Carol A; Arnold, Paul D; Evans, Patrick D; Gamazon, Eric R; Osiecki, Lisa; McGrath, Lauren; Haddad, Stephen; Crane, Jacquelyn; Hezel, Dianne; Illman, Cornelia; Mayerfeld, Catherine; Konkashbaev, Anuar; Liu, Chunyu; Pluzhnikov, Anna; Tikhomirov, Anna; Edlund, Christopher K; Rauch, Scott L; Moessner, Rainald; Falkai, Peter; Maier, Wolfgang; Ruhrmann, Stephan; Grabe, Hans-Jörgen; Lennertz, Leonard; Wagner, Michael; Bellodi, Laura; Cavallini, Maria Cristina; Richter, Margaret A; Cook, Edwin H; Kennedy, James L; Rosenberg, David; Stein, Dan J; Hemmings, Sian MJ; Lochner, Christine; Azzam, Amin; Chavira, Denise A; Fournier, Eduardo; Garrido, Helena; Sheppard, Brooke; Umaña, Paul; Murphy, Dennis L; Wendland, Jens R; Veenstra-VanderWeele, Jeremy; Denys, Damiaan; Blom, Rianne; Deforce, Dieter; Van Nieuwerburgh, Filip; Westenberg, Herman GM; Walitza, Susanne; Egberts, Karin; Renner, Tobias; Miguel, Euripedes Constantino; Cappi, Carolina; Hounie, Ana G; Conceição do Rosário, Maria; Sampaio, Aline S; Vallada, Homero; Nicolini, Humberto; Lanzagorta, Nuria; Camarena, Beatriz; Delorme, Richard; Leboyer, Marion; Pato, Carlos N; Pato, Michele T; Voyiaziakis, Emanuel; Heutink, Peter; Cath, Danielle C; Posthuma, Danielle; Smit, Jan H; Samuels, Jack; Bienvenu, O Joseph; Cullen, Bernadette; Fyer, Abby J; Grados, Marco A; Greenberg, Benjamin D; McCracken, James T; Riddle, Mark A; Wang, Ying; Coric, Vladimir; Leckman, James F; Bloch, Michael; Pittenger, Christopher; Eapen, Valsamma; Black, Donald W; Ophoff, Roel A; Strengman, Eric; Cusi, Daniele; Turiel, Maurizio; Frau, Francesca; Macciardi, Fabio; Gibbs, J Raphael; Cookson, Mark R; Singleton, Andrew; Hardy, John; Crenshaw, Andrew T; Parkin, Melissa A; Mirel, Daniel B; Conti, David V; Purcell, Shaun; Nestadt, Gerald; Hanna, Gregory L; Jenike, Michael A; Knowles, James A; Cox, Nancy; Pauls, David L

    2014-01-01

    Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1,465 cases, 5,557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9,657 X-chromosome SNPs. Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two p-values were located within DLGAP1 (p=2.49×10-6 and p=3.44×10-6), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a p-value=3.84 × 10-8. However, when trios were meta-analyzed with the combined case-control samples, the p-value for this variant was 3.62×10-5, losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation-QTLs (p<0.001) and frontal lobe eQTLs (p=0.001) was observed within the top-ranked SNPs (p<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921

  2. Genome-wide association study of parity in Bangladeshi women.

    PubMed

    Aschebrook-Kilfoy, Briseis; Argos, Maria; Pierce, Brandon L; Tong, Lin; Jasmine, Farzana; Roy, Shantanu; Parvez, Faruque; Ahmed, Alauddin; Islam, Tariqul; Kibriya, Muhammad G; Ahsan, Habibul

    2015-01-01

    Human fertility is a complex trait determined by gene-environment interactions in which genetic factors represent a significant component. To better understand inter-individual variability in fertility, we performed one of the first genome-wide association studies (GWAS) of common fertility phenotypes, lifetime number of pregnancies and number of children in a developing country population. The fertility phenotype data and DNA samples were obtained at baseline recruitment from individuals participating in a large prospective cohort study in Bangladesh. GWAS analyses of fertility phenotypes were conducted among 1,686 married women. One SNP on chromosome 4 was non-significantly associated with number of children at P <10(-7) and number of pregnancies at P <10(-6). This SNP is located in a region without a gene within 1 Mb. One SNP on chromosome 6 was non-significantly associated with extreme number of children at P <10(-6). The closest gene to this SNP is HDGFL1, a hepatoma-derived growth factor. When we excluded hormonal contraceptive users, a SNP on chromosome 5 was non-significantly associated at P <10(-5) for number of children and number of pregnancies. This SNP is located near C5orf64, an open reading frame, and ZSWIM6, a zinc ion binding gene. We also estimated the heritability of these phenotypes from our genotype data using GCTA (Genome-wide Complex Trait Analysis) for number of children (hg2 = 0.149, SE = 0.24, p-value = 0.265) and number of pregnancies (hg2 = 0.007, SE = 0.22, p-value = 0.487). Our genome-wide association study and heritability estimates of number of pregnancies and number of children in Bangladesh did not confer strong evidence of common variants for parity variation. However, our results suggest that future studies may want to consider the role of 3 notable SNPs in their analysis.

  3. Validating, augmenting and refining genome-wide association signals.

    PubMed

    Ioannidis, John P A; Thomas, Gilles; Daly, Mark J

    2009-05-01

    Studies using genome-wide platforms have yielded an unprecedented number of promising signals of association between genomic variants and human traits. This Review addresses the steps required to validate, augment and refine such signals to identify underlying causal variants for well-defined phenotypes. These steps include: large-scale exact replication across both similar and diverse populations; fine mapping and resequencing; determination of the most informative markers and multiple independent informative loci; incorporation of functional information; and improved phenotype mapping of the implicated genetic effects. Even in cases for which replication proves that an effect exists, confident localization of the causal variant often remains elusive.

  4. Genome-wide association studies and contribution to cardiovascular physiology

    PubMed Central

    Munroe, Patricia B.

    2015-01-01

    The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology. PMID:26106147

  5. Genome-wide approaches to defining macrophage identity and function

    PubMed Central

    Fonseca, Gregory J; Seidman, Jason S; Glass, Christopher K

    2016-01-01

    Macrophages play essential roles in the response to injury and infection and contribute to the development and/or homeostasis of the various tissues they reside in. Conversely, macrophages also influence the pathogenesis of metabolic, neurodegenerative, and neoplastic diseases. Mechanisms that contribute to the phenotypic diversity of macrophages in health and disease remain poorly understood. Here we review the recent application of genome-wide approaches to characterize the transcriptomes and epigenetic landscapes of tissue-resident macrophages. These studies are beginning to provide insights into how distinct tissue environments are interpreted by transcriptional regulatory elements to drive specialized programs of gene expression. PMID:28087927

  6. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    PubMed

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  7. [New insight of genome-wide association study (GWAS)].

    PubMed

    Hotta, Kikuko

    2013-02-01

    The number of obese patients is increasing in Japan, due to the westernization of lifestyle. Obesity, especially visceral fat obesity, is important for the development of metabolic syndrome. Genetic factors are important for the development of obesity as well as environmental factors. Importance of genetic factors of fat distribution is also reported. Recent genome-wide association studies (GWASs) have revealed the obesity and fat distribution-related polymorphisms. GWAS will highlight a better understanding of the underlying molecular mechanisms in the regulation of obesity and distribution of body fat.

  8. [Genome-wide association study for adolescent idiopathic scoliosis].

    PubMed

    Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro

    2016-04-01

    Adolescent idiopathic scoliosis(AIS)is a polygenic disease. Genome-wide association studies(GWASs)have been performed for a lot of polygenic diseases. For AIS, we conducted GWAS and identified the first AIS locus near LBX1. After the discovery, we have extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS.

  9. Genome-wide mapping of DNA strand breaks.

    PubMed

    Leduc, Frédéric; Faucher, David; Bikond Nkoma, Geneviève; Grégoire, Marie-Chantal; Arguin, Mélina; Wellinger, Raymund J; Boissonneault, Guylain

    2011-02-25

    Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP), uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL) to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.

  10. Genome-wide analysis of differential RNA editing in epilepsy

    PubMed Central

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R.; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M.; Petretto, Enrico; Johnson, Michael R.

    2017-01-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine–temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including “neuron projection” and “seizures.” Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures. PMID:28250018

  11. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    PubMed

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  12. Genome-wide analysis of differential RNA editing in epilepsy.

    PubMed

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M; Petretto, Enrico; Johnson, Michael R

    2017-03-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine-temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including "neuron projection" and "seizures." Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures.

  13. Genome-wide association study of Tourette Syndrome

    PubMed Central

    Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.

    2012-01-01

    Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  14. A Genome-Wide Association Study of Aging

    PubMed Central

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W.; Garcia, Melissa E.; Kaplan, Robert C.; Kumari, Meena; Lunetta, Kathryn L.; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J.; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J.; Biffar, Reiner; Buchman, Aron S.; Boerwinkle, Eric; Couper, David; De Jager, Philip L.; Evans, Denis A.; Harris, Tamara B.; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P.; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J.; Lohman, Kurt K.; Lutsey, Pamela L.; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M.; Reiman, Eric M.; Rotter, Jerome I.; Seshadri, Sudha; Shardell, Michelle D.; Smith, Albert V.; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M. Carola; Bandinelli, Stefania; Baumeister, Sebastian E.; Bennett, David A.; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M.; Newman, Anne B.; Tiemeier, Henning; Franceschini, Nora

    2011-01-01

    Human longevity and healthy aging show moderate heritability (20–50%). We conducted a meta-analysis of genome-wide association studies from nine studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for two outcomes: a) all-cause mortality and b) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10−8). We found fourteen independent SNPs that predicted risk of death, and eight SNPs that predicted event-free survival (p < 10−5). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer’s disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. PMID:21782286

  15. Genome-wide identification of hypoxia-induced enhancer regions

    PubMed Central

    Preston, Jessica L.; Randel, Melissa A.; Johnson, Eric A.

    2015-01-01

    Here we present a genome-wide method for de novo identification of enhancer regions. This approach enables massively parallel empirical investigation of DNA sequences that mediate transcriptional activation and provides a platform for discovery of regulatory modules capable of driving context-specific gene expression. The method links fragmented genomic DNA to the transcription of randomer molecule identifiers and measures the functional enhancer activity of the library by massively parallel sequencing. We transfected a Drosophila melanogaster library into S2 cells in normoxia and hypoxia, and assayed 4,599,881 genomic DNA fragments in parallel. The locations of the enhancer regions strongly correlate with genes up-regulated after hypoxia and previously described enhancers. Novel enhancer regions were identified and integrated with RNAseq data and transcription factor motifs to describe the hypoxic response on a genome-wide basis as a complex regulatory network involving multiple stress-response pathways. This work provides a novel method for high-throughput assay of enhancer activity and the genome-scale identification of 31 hypoxia-activated enhancers in Drosophila. PMID:26713262

  16. Genome-wide association interaction analysis for Alzheimer's disease

    PubMed Central

    Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel

    2015-01-01

    We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data combining strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). Particularly, in the exhaustive genome-wide epistasis screening we identified AD-associated interacting SNPs-pair from chromosome 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis free screening approach. PMID:24958192

  17. A genome-wide association study of aging.

    PubMed

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora

    2011-11-01

    Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity.

  18. A Pooled Genome-Wide Association Study of Asperger Syndrome

    PubMed Central

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  19. Genome-wide association study of antisocial personality disorder

    PubMed Central

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  20. High-resolution genome-wide mapping of histone modifications.

    PubMed

    Roh, Tae-young; Ngau, Wing Chi; Cui, Kairong; Landsman, David; Zhao, Keji

    2004-08-01

    The expression patterns of eukaryotic genomes are controlled by their chromatin structure, consisting of nucleosome subunits in which DNA of approximately 146 bp is wrapped around a core of 8 histone molecules. Post-translational histone modifications play an essential role in modifying chromatin structure. Here we apply a combination of SAGE and chromatin immunoprecipitation (ChIP) protocols to determine the distribution of hyperacetylated histones H3 and H4 in the Saccharomyces cerevisiae genome. We call this approach genome-wide mapping technique (GMAT). Using GMAT, we find that the highest acetylation levels are detected in the 5' end of a gene's coding region, but not in the promoter. Furthermore, we show that the histone acetyltransferase, GCN5p, regulates H3 acetylation in the promoter and 5' end of the coding regions. These findings indicate that GMAT should find valuable applications in mapping target sites of chromatin-modifying enzymes.

  1. Microfluidics for genome-wide studies involving next generation sequencing

    PubMed Central

    Murphy, Travis W.; Lu, Chang

    2017-01-01

    Next-generation sequencing (NGS) has revolutionized how molecular biology studies are conducted. Its decreasing cost and increasing throughput permit profiling of genomic, transcriptomic, and epigenomic features for a wide range of applications. Microfluidics has been proven to be highly complementary to NGS technology with its unique capabilities for handling small volumes of samples and providing platforms for automation, integration, and multiplexing. In this article, we review recent progress on applying microfluidics to facilitate genome-wide studies. We emphasize on several technical aspects of NGS and how they benefit from coupling with microfluidic technology. We also summarize recent efforts on developing microfluidic technology for genomic, transcriptomic, and epigenomic studies, with emphasis on single cell analysis. We envision rapid growth in these directions, driven by the needs for testing scarce primary cell samples from patients in the context of precision medicine.

  2. Implications of genome-wide association studies in cancer therapeutics.

    PubMed

    Patel, Jai N; McLeod, Howard L; Innocenti, Federico

    2013-09-01

    Genome wide association studies (GWAS) provide an agnostic approach to identifying potential genetic variants associated with disease susceptibility, prognosis of survival and/or predictive of drug response. Although these techniques are costly and interpretation of study results is challenging, they do allow for a more unbiased interrogation of the entire genome, resulting in the discovery of novel genes and understanding of novel biological associations. This review will focus on the implications of GWAS in cancer therapy, in particular germ-line mutations, including findings from major GWAS which have identified predictive genetic loci for clinical outcome and/or toxicity. Lessons and challenges in cancer GWAS are also discussed, including the need for functional analysis and replication, as well as future perspectives for biological and clinical utility. Given the large heterogeneity in response to cancer therapeutics, novel methods of identifying mechanisms and biology of variable drug response and ultimately treatment individualization will be indispensable.

  3. [Genome-wide associations for cigarette smoking behavior].

    PubMed

    Strauss, Ewa

    2013-01-01

    Diseases related to tobacco smoking are the second leading cause of death in the world. Despite increasing evidence of genetic determination, the susceptibility genes and loci underlying various aspects of smoking behavior are largely unknown. Genome-wide association studies (GWASs) provided a new conceptual framework in the search for variants underlying common traits/disorders. A massive scan of the genome and a "hypothesis-free" approach enable discovery of new aspects of genetics of complex traits. In this paper the results of GWASs and GWAS meta-analyzes of cigarette smoking behavior and nicotine dependence are reviewed with the particular attention to smoking cessation success and the replacement therapy. The results of these studies are discussed in the context of the results of the candidate gene association studies. Studies on the role of the genomic regions, identified in GWASs, in the development of smoking-related diseases are also discussed.

  4. Genome-wide measurement of RNA folding energies.

    PubMed

    Wan, Yue; Qu, Kun; Ouyang, Zhengqing; Kertesz, Michael; Li, Jun; Tibshirani, Robert; Makino, Debora L; Nutter, Robert C; Segal, Eran; Chang, Howard Y

    2012-10-26

    RNA structural transitions are important in the function and regulation of RNAs. Here, we reveal a layer of transcriptome organization in the form of RNA folding energies. By probing yeast RNA structures at different temperatures, we obtained relative melting temperatures (Tm) for RNA structures in over 4000 transcripts. Specific signatures of RNA Tm demarcated the polarity of mRNA open reading frames and highlighted numerous candidate regulatory RNA motifs in 3' untranslated regions. RNA Tm distinguished noncoding versus coding RNAs and identified mRNAs with distinct cellular functions. We identified thousands of putative RNA thermometers, and their presence is predictive of the pattern of RNA decay in vivo during heat shock. The exosome complex recognizes unpaired bases during heat shock to degrade these RNAs, coupling intrinsic structural stabilities to gene regulation. Thus, genome-wide structural dynamics of RNA can parse functional elements of the transcriptome and reveal diverse biological insights.

  5. Genome-wide studies of telomere biology in budding yeast

    PubMed Central

    Harari, Yaniv; Kupiec, Martin

    2014-01-01

    Telomeres are specialized DNA-protein structures at the ends of eukaryotic chromosomes. Telomeres are essential for chromosomal stability and integrity, as they prevent chromosome ends from being recognized as double strand breaks. In rapidly proliferating cells, telomeric DNA is synthesized by the enzyme telomerase, which copies a short template sequence within its own RNA moiety, thus helping to solve the “end-replication problem”, in which information is lost at the ends of chromosomes with each DNA replication cycle. The basic mechanisms of telomere length, structure and function maintenance are conserved among eukaryotes. Studies in the yeast Saccharomyces cerevisiae have been instrumental in deciphering the basic aspects of telomere biology. In the last decade, technical advances, such as the availability of mutant collections, have allowed carrying out systematic genome-wide screens for mutants affecting various aspects of telomere biology. In this review we summarize these efforts, and the insights that this Systems Biology approach has produced so far.

  6. Ultrafast laser nanosurgery in microfluidics for genome-wide screenings

    PubMed Central

    Ben-Yakar, Adela; Bourgeois, Frederic

    2009-01-01

    Summary The use of ultrafast laser pulses in surgery has allowed for unprecedented precision with minimal collateral damage to surrounding tissues. For these reasons, ultrafast laser nanosurgery, as an injury model, has gained tremendous momentum in experimental biology ranging from in-vitro manipulations of subcellular structures to in-vivo studies in whole living organisms. For example, femtosecond laser nanosurgery on such model organism as the nematode Caenorhabditis elegans (C. elegans) has opened new opportunities for in-vivo nerve regeneration studies. Meanwhile, the development of novel microfluidic devices has brought the control in experimental environment to the level required for precise nanosurgery in various animal models. Merging microfluidics and laser nanosurgery has recently improved the specificities and increased the speed of laser surgeries enabling fast genome-wide screenings that can more readily decode the genetic map of various biological processes. PMID:19278850

  7. Genome-wide association studies in pharmacogenomics of antidepressants.

    PubMed

    Lin, Eugene; Lane, Hsien-Yuan

    2015-01-01

    Major depressive disorder (MDD) is one of the most common psychiatric disorders worldwide. Doctors must prescribe antidepressants based on educated guesses due to the fact that it is unmanageable to predict the effectiveness of any particular antidepressant in an individual patient. With the recent advent of scientific research, the genome-wide association study (GWAS) is extensively employed to analyze hundreds of thousands of single nucleotide polymorphisms by high-throughput genotyping technologies. In addition to the candidate-gene approach, the GWAS approach has recently been utilized to investigate the determinants of antidepressant response to therapy. In this study, we reviewed GWAS studies, their limitations and future directions with respect to the pharmacogenomics of antidepressants in MDD.

  8. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Slof-Op t Landt, Margarita CT; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O’Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2013-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10−7) in SOX2OT and rs17030795 (P=5.84×10−6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10−6) between CUL3 and FAM124B and rs1886797 (P=8.05×10−6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P= 4×10−6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:21079607

  9. A genome-wide association study of anorexia nervosa.

    PubMed

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-10-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.

  10. Genome-Wide Association Studies for Comb Traits in Chickens

    PubMed Central

    Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61–0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  11. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Landt, Margarita CT Slof-Op t; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2015-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  12. Genome-Wide Expression Profiling of Complex Regional Pain Syndrome

    PubMed Central

    Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung

    2013-01-01

    Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and p<0.05). Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1), matrix metalloproteinase 9 (MMP9), alanine aminopeptidase N (ANPEP), l-histidine decarboxylase (HDC), granulocyte colony-stimulating factor 3 receptor (G-CSF3R), and signal transducer and activator of transcription 3 (STAT3) genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10−4). The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504

  13. Genome-wide association and genomic selection in animal breeding.

    PubMed

    Hayes, Ben; Goddard, Mike

    2010-11-01

    Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.

  14. Genome-Wide Association Study of Meiotic Recombination Phenotypes

    PubMed Central

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G.; Sherman, Stephanie L.; Feingold, Eleanor

    2016-01-01

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9. By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2. This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events. PMID:27733454

  15. Genome-Wide Association Study of Meiotic Recombination Phenotypes.

    PubMed

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G; Sherman, Stephanie L; Feingold, Eleanor

    2016-12-07

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9 By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2 This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events.

  16. Genome-wide discovery of loci influencing chemotherapy cytotoxicity.

    PubMed

    Watters, James W; Kraja, Aldi; Meucci, Melissa A; Province, Michael A; McLeod, Howard L

    2004-08-10

    Little is known about the heritability of chemotherapy activity or the identity of genes that may enable the individualization of cancer chemotherapy. Although numerous genes are likely to influence chemotherapy response, current candidate gene-based pharmacogenetics approaches require a priori knowledge and the selection of a small number of candidate genes for hypothesis testing. In this study, an ex vivo familial genetics strategy using lymphoblastoid cells derived from Centre d'Etude du Polymorphisme Humain reference pedigrees was used to discover genetic determinants of chemotherapy cytotoxicity. Cytotoxicity to the mechanistically distinct chemotherapy agents 5-fluorouracil and docetaxel were shown to be heritable traits, with heritability values ranging from 0.26 to 0.65 for 5-fluorouracil and 0.21 to 0.70 for docetaxel, varying with dose. Genome-wide linkage analysis was also used to map a quantitative trait locus influencing the cellular effects of 5-fluorouracil to chromosome 9q13-q22 [logarithm of odds (LOD) = 3.44], and two quantitative trait loci influencing the cellular effects of docetaxel to chromosomes 5q11-21 (LOD = 2.21) and 9q13-q22 (LOD = 2.73). Finally, 5-fluorouracil and docetaxel were shown to cause apoptotic cell death involving caspase-3 cleavage in Centre d'Etude du Polymorphisme Humain lymphoblastoid cells. This study identifies genomic regions likely to harbor genes important for chemotherapy cytotoxicity using genome-wide linkage analysis in human pedigrees and provides a widely applicable strategy for pharmacogenomic discovery without the requirement for a priori candidate gene selection.

  17. Genome-wide association study of atypical psychosis.

    PubMed

    Kanazawa, Tetsufumi; Ikeda, Masashi; Glatt, Stephen J; Tsutsumi, Atsushi; Kikuyama, Hiroki; Kawamura, Yoshiya; Nishida, Nao; Miyagawa, Taku; Hashimoto, Ryota; Takeda, Masatoshi; Sasaki, Tsukasa; Tokunaga, Katsushi; Koh, Jun; Iwata, Nakao; Yoneda, Hiroshi

    2013-10-01

    Atypical psychosis with a periodic course of exacerbation and features of major psychiatric disorders [schizophrenia (SZ) and bipolar disorder (BD)] has a long history in clinical psychiatry in Japan. Based upon the new criteria of atypical psychosis, a Genome-Wide Association Study (GWAS) was conducted to identify the risk gene or variants. The relationships between atypical psychosis, SZ and BD were then assessed using independent GWAS data. Forty-seven patients with solid criteria of atypical psychosis and 882 normal controls (NCs) were scanned using an Affymetrics 6.0 chip. GWAS SZ data (560 SZ cases and 548 NCs) and GWAS BD (107 cases with BD type 1 and 107 NCs) were compared using gene-based analysis. The most significant SNPs were detected around the CHN2/CPVL genes (rs245914, P = 1.6 × 10(-7)) , COL21A1 gene (rs12196860, P = 2.45 × 10(-7) ), and PYGL/TRIM9 genes (rs1959536, P = 7.73 × 10(-7) ), although none of the single-nucleotide polymorphisms exhibited genome-wide significance (P = 5 × 10(-8) ). One of the highest peaks was detected on the major histocompatibility complex region, where large SZ GWASs have previously disclosed an association. The gene-based analysis suggested significant enrichment between SZ and atypical psychosis (P = 0.01), but not BD. This study provides clues about the types of patient whose diagnosis lies between SZ and BD. Studies with larger samples are required to determine the causal variant.

  18. Genome-Wide Binding Patterns of Thyroid Hormone Receptor Beta

    PubMed Central

    Ayers, Stephen; Switnicki, Michal Piotr; Angajala, Anusha; Lammel, Jan; Arumanayagam, Anithachristy S.; Webb, Paul

    2014-01-01

    Thyroid hormone (TH) receptors (TRs) play central roles in metabolism and are major targets for pharmaceutical intervention. Presently, however, there is limited information about genome wide localizations of TR binding sites. Thus, complexities of TR genomic distribution and links between TRβ binding events and gene regulation are not fully appreciated. Here, we employ a BioChIP approach to capture TR genome-wide binding events in a liver cell line (HepG2). Like other NRs, TRβ appears widely distributed throughout the genome. Nevertheless, there is striking enrichment of TRβ binding sites immediately 5′ and 3′ of transcribed genes and TRβ can be detected near 50% of T3 induced genes. In contrast, no significant enrichment of TRβ is seen at negatively regulated genes or genes that respond to unliganded TRs in this system. Canonical TRE half-sites are present in more than 90% of TRβ peaks and classical TREs are also greatly enriched, but individual TRE organization appears highly variable with diverse half-site orientation and spacing. There is also significant enrichment of binding sites for TR associated transcription factors, including AP-1 and CTCF, near TR peaks. We conclude that T3-dependent gene induction commonly involves proximal TRβ binding events but that far-distant binding events are needed for T3 induction of some genes and that distinct, indirect, mechanisms are often at play in negative regulation and unliganded TR actions. Better understanding of genomic context of TR binding sites will help us determine why TR regulates genes in different ways and determine possibilities for selective modulation of TR action. PMID:24558356

  19. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  20. Genome-Wide Analysis of Human Metapneumovirus Evolution

    PubMed Central

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  1. Genome-wide association study of circulating retinol levels.

    PubMed

    Mondul, Alison M; Yu, Kai; Wheeler, William; Zhang, Hong; Weinstein, Stephanie J; Major, Jacqueline M; Cornelis, Marilyn C; Männistö, Satu; Hazra, Aditi; Hsing, Ann W; Jacobs, Kevin B; Eliassen, Heather; Tanaka, Toshiko; Reding, Douglas J; Hendrickson, Sara; Ferrucci, Luigi; Virtamo, Jarmo; Hunter, David J; Chanock, Stephen J; Kraft, Peter; Albanes, Demetrius

    2011-12-01

    Retinol is one of the most biologically active forms of vitamin A and is hypothesized to influence a wide range of human diseases including asthma, cardiovascular disease, infectious diseases and cancer. We conducted a genome-wide association study of 5006 Caucasian individuals drawn from two cohorts of men: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention (ATBC) Study and the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. We identified two independent single-nucleotide polymorphisms associated with circulating retinol levels, which are located near the transthyretin (TTR) and retinol binding protein 4 (RBP4) genes which encode major carrier proteins of retinol: rs1667255 (P =2.30× 10(-17)) and rs10882272 (P =6.04× 10(-12)). We replicated the association with rs10882272 in RBP4 in independent samples from the Nurses' Health Study and the Invecchiare in Chianti Study (InCHIANTI) that included 3792 women and 504 men (P =9.49× 10(-5)), but found no association for retinol with rs1667255 in TTR among women, thus suggesting evidence for gender dimorphism (P-interaction=1.31× 10(-5)). Discovery of common genetic variants associated with serum retinol levels may provide further insight into the contribution of retinol and other vitamin A compounds to the development of cancer and other complex diseases.

  2. Genome-wide analysis links NFATC2 with asparaginase hypersensitivity

    PubMed Central

    Fernandez, Christian A.; Smith, Colton; Yang, Wenjian; Mullighan, Charles G.; Qu, Chunxu; Larsen, Eric; Bowman, W. Paul; Liu, Chengcheng; Ramsey, Laura B.; Chang, Tamara; Karol, Seth E.; Loh, Mignon L.; Raetz, Elizabeth A.; Winick, Naomi J.; Hunger, Stephen P.; Carroll, William L.; Jeha, Sima; Pui, Ching-Hon; Evans, William E.; Devidas, Meenakshi

    2015-01-01

    Asparaginase is used to treat acute lymphoblastic leukemia (ALL); however, hypersensitivity reactions can lead to suboptimal asparaginase exposure. Our objective was to use a genome-wide approach to identify loci associated with asparaginase hypersensitivity in children with ALL enrolled on St. Jude Children’s Research Hospital (SJCRH) protocols Total XIIIA (n = 154), Total XV (n = 498), and Total XVI (n = 271), or Children’s Oncology Group protocols POG 9906 (n = 222) and AALL0232 (n = 2163). Germline DNA was genotyped using the Affymetrix 500K, Affymetrix 6.0, or the Illumina Exome BeadChip array. In multivariate logistic regression, the intronic rs6021191 variant in nuclear factor of activated T cells 2 (NFATC2) had the strongest association with hypersensitivity (P = 4.1 × 10−8; odds ratio [OR] = 3.11). RNA-seq data available from 65 SJCRH ALL tumor samples and 52 Yoruba HapMap samples showed that samples carrying the rs6021191 variant had higher NFATC2 expression compared with noncarriers (P = 1.1 × 10−3 and 0.03, respectively). The top ranked nonsynonymous polymorphism was rs17885382 in HLA-DRB1 (P = 3.2 × 10−6; OR = 1.63), which is in near complete linkage disequilibrium with the HLA-DRB1*07:01 allele we previously observed in a candidate gene study. The strongest risk factors for asparaginase allergy are variants within genes regulating the immune response. PMID:25987655

  3. Reconstructing Roma History from Genome-Wide Data

    PubMed Central

    Moorjani, Priya; Patterson, Nick; Loh, Po-Ru; Lipson, Mark; Kisfali, Péter; Melegh, Bela I.; Bonin, Michael; Kádaši, Ľudevít; Rieß, Olaf; Berger, Bonnie; Reich, David; Melegh, Béla

    2013-01-01

    The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000–1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs). We estimate that the Roma harbor about 80% West Eurasian ancestry–derived from a combination of European and South Asian sources–and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe. PMID:23516520

  4. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use.

  5. Identification of differential translation in genome wide studies.

    PubMed

    Larsson, Ola; Sonenberg, Nahum; Nadon, Robert

    2010-12-14

    Regulation of gene expression through translational control is a fundamental mechanism implicated in many biological processes ranging from memory formation to innate immunity and whose dysregulation contributes to human diseases. Genome wide analyses of translational control strive to identify differential translation independent of cytosolic mRNA levels. For this reason, most studies measure genes' translation levels as log ratios (translation levels divided by corresponding cytosolic mRNA levels obtained in parallel). Counterintuitively, arising from a mathematical necessity, these log ratios tend to be highly correlated with the cytosolic mRNA levels. Accordingly, they do not effectively correct for cytosolic mRNA level and generate substantial numbers of biological false positives and false negatives. We show that analysis of partial variance, which produces estimates of translational activity that are independent of cytosolic mRNA levels, is a superior alternative. When combined with a variance shrinkage method for estimating error variance, analysis of partial variance has the additional benefit of having greater statistical power and identifying fewer genes as translationally regulated resulting merely from unrealistically low variance estimates rather than from large changes in translational activity. In contrast to log ratios, this formal analytical approach estimates translation effects in a statistically rigorous manner, eliminates the need for inefficient and error-prone heuristics, and produces results that agree with biological function. The method is applicable to datasets obtained from both the commonly used polysome microarray method and the sequencing-based ribosome profiling method.

  6. Genome-Wide Analysis of Human Metapneumovirus Evolution.

    PubMed

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs.

  7. Genome-wide DNA methylation profile in mungbean

    PubMed Central

    Kang, Yang Jae; Bae, Ahra; Shim, Sangrea; Lee, Taeyoung; Lee, Jayern; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha

    2017-01-01

    DNA methylation on cytosine residues is known to affect gene expression and is potentially responsible for the phenotypic variations among different crop cultivars. Here, we present the whole-genome DNA methylation profiles and assess the potential effects of single nucleotide polymorphisms (SNPs) for two mungbean cultivars, Sunhwanogdu (VC1973A) and Kyunggijaerae#5 (V2984). By measuring the DNA methylation levels in leaf tissue with the bisulfite sequencing (BSseq) approach, we show both the frequencies of the various types of DNA methylation and the distribution of weighted gene methylation levels. SNPs that cause nucleotide changes from/to CHH – where C is cytosine and H is any other nucleotide – were found to affect DNA methylation status in VC1973A and V2984. In order to better understand the correlation between gene expression and DNA methylation levels, we surveyed gene expression in leaf tissues of VC1973A and V2984 using RNAseq. Transcript expressions of paralogous genes were controlled by DNA methylation within the VC1973A genome. Moreover, genes that were differentially expressed between the two cultivars showed distinct DNA methylation patterns. Our mungbean genome-wide methylation profiles will be valuable resources for understanding the phenotypic variations between different cultivars, as well as for molecular breeding. PMID:28084412

  8. Genome-Wide Specific Selection in Three Domestic Sheep Breeds

    PubMed Central

    Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin

    2015-01-01

    Background Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. Results We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Conclusions Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding. PMID:26083354

  9. A synergistic DNA logic predicts genome-wide chromatin accessibility

    PubMed Central

    Hashimoto, Tatsunori; Sherwood, Richard I.; Kang, Daniel D.; Rajagopal, Nisha; Barkal, Amira A.; Zeng, Haoyang; Emons, Bart J.M.; Srinivasan, Sharanya; Jaakkola, Tommi; Gifford, David K.

    2016-01-01

    Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that a SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types. PMID:27456004

  10. Genome-Wide Identification of KANADI1 Target Genes

    PubMed Central

    Ott, Felix; Weigel, Detlef; Bowman, John L.; Heisler, Marcus G.; Wenkel, Stephan

    2013-01-01

    Plant organ development and polarity establishment is mediated by the action of several transcription factors. Among these, the KANADI (KAN) subclade of the GARP protein family plays important roles in polarity-associated processes during embryo, shoot and root patterning. In this study, we have identified a set of potential direct target genes of KAN1 through a combination of chromatin immunoprecipitation/DNA sequencing (ChIP-Seq) and genome-wide transcriptional profiling using tiling arrays. Target genes are over-represented for genes involved in the regulation of organ development as well as in the response to auxin. KAN1 affects directly the expression of several genes previously shown to be important in the establishment of polarity during lateral organ and vascular tissue development. We also show that KAN1 controls through its target genes auxin effects on organ development at different levels: transport and its regulation, and signaling. In addition, KAN1 regulates genes involved in the response to abscisic acid, jasmonic acid, brassinosteroids, ethylene, cytokinins and gibberellins. The role of KAN1 in organ polarity is antagonized by HD-ZIPIII transcription factors, including REVOLUTA (REV). A comparison of their target genes reveals that the REV/KAN1 module acts in organ patterning through opposite regulation of shared targets. Evidence of mutual repression between closely related family members is also shown. PMID:24155946

  11. Genome-wide Association Studies for Osteoporosis: A 2013 Update

    PubMed Central

    Liu, Yong-Jun; Zhang, Lei; Papasian, Christopher J.

    2014-01-01

    In the past few years, the bone field has witnessed great advances in genome-wide association studies (GWASs) of osteoporosis, with a number of promising genes identified. In particular, meta-analysis of GWASs, aimed at increasing the power of studies by combining the results from different study populations, have led to the identification of novel associations that would not otherwise have been identified in individual GWASs. Recently, the first whole genome sequencing study for osteoporosis and fractures was published, reporting a novel rare nonsense mutation. This review summarizes the important and representative findings published by December 2013. Comments are made on the notable findings and representative studies for their potential influence and implications on our present understanding of the genetics of osteoporosis. Potential limitations of GWASs and their meta-analyses are evaluated, with an emphasis on understanding the reasons for inconsistent results between different studies and clarification of misinterpretation of GWAS meta-analysis results. Implications and challenges of GWAS are also discussed, including the need for multi- and inter-disciplinary studies. PMID:25006567

  12. A genome wide dosage suppressor network reveals genomic robustness

    PubMed Central

    Patra, Biranchi; Kon, Yoshiko; Yadav, Gitanjali; Sevold, Anthony W.; Frumkin, Jesse P.; Vallabhajosyula, Ravishankar R.; Hintze, Arend; Østman, Bjørn; Schossau, Jory; Bhan, Ashish; Marzolf, Bruz; Tamashiro, Jenna K.; Kaur, Amardeep; Baliga, Nitin S.; Grayhack, Elizabeth J.; Adami, Christoph; Galas, David J.; Raval, Alpan; Phizicky, Eric M.; Ray, Animesh

    2017-01-01

    Genomic robustness is the extent to which an organism has evolved to withstand the effects of deleterious mutations. We explored the extent of genomic robustness in budding yeast by genome wide dosage suppressor analysis of 53 conditional lethal mutations in cell division cycle and RNA synthesis related genes, revealing 660 suppressor interactions of which 642 are novel. This collection has several distinctive features, including high co-occurrence of mutant-suppressor pairs within protein modules, highly correlated functions between the pairs and higher diversity of functions among the co-suppressors than previously observed. Dosage suppression of essential genes encoding RNA polymerase subunits and chromosome cohesion complex suggests a surprising degree of functional plasticity of macromolecular complexes, and the existence of numerous degenerate pathways for circumventing the effects of potentially lethal mutations. These results imply that organisms and cancer are likely able to exploit the genomic robustness properties, due the persistence of cryptic gene and pathway functions, to generate variation and adapt to selective pressures. PMID:27899637

  13. Genome-Wide Discriminatory Information Patterns of Cytosine DNA Methylation

    PubMed Central

    Sanchez, Robersy; Mackenzie, Sally A.

    2016-01-01

    Cytosine DNA methylation (CDM) is a highly abundant, heritable but reversible chemical modification to the genome. Herein, a machine learning approach was applied to analyze the accumulation of epigenetic marks in methylomes of 152 ecotypes and 85 silencing mutants of Arabidopsis thaliana. In an information-thermodynamics framework, two measurements were used: (1) the amount of information gained/lost with the CDM changes IR and (2) the uncertainty of not observing a SNP LCR. We hypothesize that epigenetic marks are chromosomal footprints accounting for different ontogenetic and phylogenetic histories of individual populations. A machine learning approach is proposed to verify this hypothesis. Results support the hypothesis by the existence of discriminatory information (DI) patterns of CDM able to discriminate between individuals and between individual subpopulations. The statistical analyses revealed a strong association between the topologies of the structured population of Arabidopsis ecotypes based on IR and on LCR, respectively. A statistical-physical relationship between IR and LCR was also found. Results to date imply that the genome-wide distribution of CDM changes is not only part of the biological signal created by the methylation regulatory machinery, but ensures the stability of the DNA molecule, preserving the integrity of the genetic message under continuous stress from thermal fluctuations in the cell environment. PMID:27322251

  14. Genome-wide profiling of forum domains in Drosophila melanogaster

    PubMed Central

    Tchurikov, Nickolai A.; Kretova, Olga V.; Sosin, Dmitri V.; Zykov, Ivan A.; Zhimulev, Igor F.; Kravatsky, Yuri V.

    2011-01-01

    Forum domains are stretches of chromosomal DNA that are excised from eukaryotic chromosomes during their spontaneous non-random fragmentation. Most forum domains are 50–200 kb in length. We mapped forum domain termini using FISH on polytene chromosomes and we performed genome-wide mapping using a Drosophila melanogaster genomic tiling microarray consisting of overlapping 3 kb fragments. We found that forum termini very often correspond to regions of intercalary heterochromatin and regions of late replication in polytene chromosomes. We found that forum domains contain clusters of several or many genes. The largest forum domains correspond to the main clusters of homeotic genes inside BX-C and ANTP-C, cluster of histone genes and clusters of piRNAs. PRE/TRE and transcription factor binding sites often reside inside domains and do not overlap with forum domain termini. We also found that about 20% of forum domain termini correspond to small chromosomal regions where Ago1, Ago2, small RNAs and repressive chromatin structures are detected. Our results indicate that forum domains correspond to big multi-gene chromosomal units, some of which could be coordinately expressed. The data on the global mapping of forum domains revealed a strong correlation between fragmentation sites in chromosomes, particular sets of mobile elements and regions of intercalary heterochromatin. PMID:21247882

  15. Genome-wide significant risk associations for mucinous ovarian carcinoma

    PubMed Central

    Kelemen, Linda E.; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; M. Lee, Janet; Seo, Ji-Heui; Phelan, Catherine M.; Beesley, Jonathan; Chen, Xiaoqin; Spindler, Tassja J.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chen, Y. Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Engelholm, Svend Aage; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moes-Sosnowska, Joanna; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wlodzimierz, Sawicki; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Freedman, Matthew L.; Chenevix-Trench, Georgia; Pharoah, Paul D.; Gayther, Simon A.; Berchuck, Andrew

    2015-01-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas (OC) but not for mucinous ovarian carcinomas (MOC). Genotypes from OC cases and controls were imputed into the 1000 Genomes Project reference panel. Analysis of 1,644 MOC cases and 21,693 controls identified three novel risk associations: rs752590 at 2q13 (P = 3.3 × 10−8), rs711830 at 2q31.1 (P = 7.5 × 10−12) and rs688187 at 19q13.2 (P = 6.8 × 10−13). Expression Quantitative Trait Locus (eQTL) analysis in ovarian and colorectal tumors (which are histologically similar to MOC) identified significant eQTL associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10−4, FDR = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors, and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790

  16. Genome-Wide Analysis of Polyadenylation Events in Schmidtea mediterranea

    PubMed Central

    Lakshmanan, Vairavan; Bansal, Dhiru; Kulkarni, Jahnavi; Poduval, Deepak; Krishna, Srikar; Sasidharan, Vidyanand; Anand, Praveen; Seshasayee, Aswin; Palakodeti, Dasaradhi

    2016-01-01

    In eukaryotes, 3′ untranslated regions (UTRs) play important roles in regulating posttranscriptional gene expression. The 3′UTR is defined by regulated cleavage/polyadenylation of the pre-mRNA. The advent of next-generation sequencing technology has now enabled us to identify these events on a genome-wide scale. In this study, we used poly(A)-position profiling by sequencing (3P-Seq) to capture all poly(A) sites across the genome of the freshwater planarian, Schmidtea mediterranea, an ideal model system for exploring the process of regeneration and stem cell function. We identified the 3′UTRs for ∼14,000 transcripts and thus improved the existing gene annotations. We found 97 transcripts, which are polyadenylated within an internal exon, resulting in the shrinking of the ORF and loss of a predicted protein domain. Around 40% of the transcripts in planaria were alternatively polyadenylated (ApA), resulting either in an altered 3′UTR or a change in coding sequence. We identified specific ApA transcript isoforms that were subjected to miRNA mediated gene regulation using degradome sequencing. In this study, we also confirmed a tissue-specific expression pattern for alternate polyadenylated transcripts. The insights from this study highlight the potential role of ApA in regulating the gene expression essential for planarian regeneration. PMID:27489207

  17. Genome-wide association study of aggressive behaviour in chicken

    PubMed Central

    Li, Zhenhui; Zheng, Ming; Abdalla, Bahareldin Ali; Zhang, Zhe; Xu, Zhenqiang; Ye, Qiao; Xu, Haiping; Luo, Wei; Nie, Qinghua; Zhang, Xiquan

    2016-01-01

    In the poultry industry, aggressive behaviour is a large animal welfare issue all over the world. To date, little is known about the underlying genetics of the aggressive behaviour. Here, we performed a genome-wide association study (GWAS) to explore the genetic mechanism associated with aggressive behaviour in chickens. The GWAS results showed that a total of 33 SNPs were associated with aggressive behaviour traits (P < 4.6E-6). rs312463697 on chromosome 4 was significantly associated with aggression (P = 2.10905E-07), and it was in the intron region of the sortilin-related VPS10 domain containing receptor 2 (SORCS2) gene. In addition, biological function analysis of the nearest 26 genes around the significant SNPs was performed with Ingenuity Pathway Analysis. An interaction network contained 17 genes was obtained and SORCS2 was involved in this network, interacted with nerve growth factor (NGF), nerve growth factor receptor (NGFR), dopa decarboxylase (L-dopa) and dopamine. After knockdown of SORCS2, the mRNA levels of NGF, L-dopa and dopamine receptor genes DRD1, DRD2, DRD3 and DRD4 were significantly decreased (P < 0.05). In summary, our data indicated that SORCS2 might play an important role in chicken aggressive behaviour through the regulation of dopaminergic pathways and NGF. PMID:27485826

  18. Weighted SNP set analysis in genome-wide association study.

    PubMed

    Dai, Hui; Zhao, Yang; Qian, Cheng; Cai, Min; Zhang, Ruyang; Chu, Minjie; Dai, Juncheng; Hu, Zhibin; Shen, Hongbing; Chen, Feng

    2013-01-01

    Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA.

  19. Genome-Wide Association Mapping for Phenotypic Plasticity in Rice.

    PubMed

    Kikuchi, Shinji; Bheemanahalli, Raju; Jagadish, Krishna S V; Kumagai, Etsushi; Masuya, Yusuke; Kuroda, Eiki; Raghavan, Chitra; Dingkuhn, Michael; Abe, Akira; Shimono, Hiroyuki

    2017-03-31

    Phenotypic plasticity of plants in response to environmental changes is important for adapting to changing climate. Less attention has been paid to exploring the advantages of phenotypic plasticity in resource-rich environments to enhance the productivity of agricultural crops. Here, we examined genetic variation in phenotypic plasticity in indica rice (Oryza sativa L.) across two diverse panels: (i) a Phenomics of Rice Adaptation and Yield (PRAY) population comprising 301 accessions and (ii) a Multi-parent-Advanced-Generation-Inter-Cross (MAGIC) indica population comprising 151 accessions. Altered planting density was used as a proxy for elevated atmospheric CO2 response. Low planting density significantly increased panicle weight per plant compared with normal density, and the magnitude of the increase ranged from 1.10 to 2.78 times among accessions for the PRAY population and from 1.05 to 2.45 times for the MAGIC population. Genome-wide-association studies revealed three Environmental Responsiveness (ER) candidate alleles (qER1-3) that were associated with relative response of panicle weight to low density. Two of these alleles were tested in 13 genotypes to clarify their biomass responses during vegetative growth under elevated CO2 in Japan. Our study provides evidence for polymorphisms that control rice phenotypic plasticity in environments that are rich in resources such as light and CO2 .

  20. A genome-wide association study in multiple system atrophy

    PubMed Central

    Sailer, Anna; Nalls, Michael A.; Schulte, Claudia; Federoff, Monica; Price, T. Ryan; Lees, Andrew; Ross, Owen A.; Dickson, Dennis W.; Mok, Kin; Mencacci, Niccolo E.; Schottlaender, Lucia; Chelban, Viorica; Ling, Helen; O'Sullivan, Sean S.; Wood, Nicholas W.; Traynor, Bryan J.; Ferrucci, Luigi; Federoff, Howard J.; Mhyre, Timothy R.; Morris, Huw R.; Deuschl, Günther; Quinn, Niall; Widner, Hakan; Albanese, Alberto; Infante, Jon; Bhatia, Kailash P.; Poewe, Werner; Oertel, Wolfgang; Höglinger, Günter U.; Wüllner, Ullrich; Goldwurm, Stefano; Pellecchia, Maria Teresa; Ferreira, Joaquim; Tolosa, Eduardo; Bloem, Bastiaan R.; Rascol, Olivier; Meissner, Wassilios G.; Hardy, John A.; Revesz, Tamas; Holton, Janice L.; Gasser, Thomas; Wenning, Gregor K.; Singleton, Andrew B.

    2016-01-01

    Objective: To identify genetic variants that play a role in the pathogenesis of multiple system atrophy (MSA), we undertook a genome-wide association study (GWAS). Methods: We performed a GWAS with >5 million genotyped and imputed single nucleotide polymorphisms (SNPs) in 918 patients with MSA of European ancestry and 3,864 controls. MSA cases were collected from North American and European centers, one third of which were neuropathologically confirmed. Results: We found no significant loci after stringent multiple testing correction. A number of regions emerged as potentially interesting for follow-up at p < 1 × 10−6, including SNPs in the genes FBXO47, ELOVL7, EDN1, and MAPT. Contrary to previous reports, we found no association of the genes SNCA and COQ2 with MSA. Conclusions: We present a GWAS in MSA. We have identified several potentially interesting gene loci, including the MAPT locus, whose significance will have to be evaluated in a larger sample set. Common genetic variation in SNCA and COQ2 does not seem to be associated with MSA. In the future, additional samples of well-characterized patients with MSA will need to be collected to perform a larger MSA GWAS, but this initial study forms the basis for these next steps. PMID:27629089

  1. Genome-wide transcriptome analysis of human epidermal melanocytes

    PubMed Central

    Haltaufderhyde, Kirk D.; Oancea, Elena

    2015-01-01

    Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175

  2. Genome-wide profiling of forum domains in Drosophila melanogaster.

    PubMed

    Tchurikov, Nickolai A; Kretova, Olga V; Sosin, Dmitri V; Zykov, Ivan A; Zhimulev, Igor F; Kravatsky, Yuri V

    2011-05-01

    Forum domains are stretches of chromosomal DNA that are excised from eukaryotic chromosomes during their spontaneous non-random fragmentation. Most forum domains are 50-200 kb in length. We mapped forum domain termini using FISH on polytene chromosomes and we performed genome-wide mapping using a Drosophila melanogaster genomic tiling microarray consisting of overlapping 3 kb fragments. We found that forum termini very often correspond to regions of intercalary heterochromatin and regions of late replication in polytene chromosomes. We found that forum domains contain clusters of several or many genes. The largest forum domains correspond to the main clusters of homeotic genes inside BX-C and ANTP-C, cluster of histone genes and clusters of piRNAs. PRE/TRE and transcription factor binding sites often reside inside domains and do not overlap with forum domain termini. We also found that about 20% of forum domain termini correspond to small chromosomal regions where Ago1, Ago2, small RNAs and repressive chromatin structures are detected. Our results indicate that forum domains correspond to big multi-gene chromosomal units, some of which could be coordinately expressed. The data on the global mapping of forum domains revealed a strong correlation between fragmentation sites in chromosomes, particular sets of mobile elements and regions of intercalary heterochromatin.

  3. Genome-Wide Analysis of DNA Methylation in Human Amnion

    PubMed Central

    Kim, Jinsil; Pitlick, Mitchell M.; Christine, Paul J.; Schaefer, Amanda R.; Saleme, Cesar; Comas, Belén; Cosentino, Viviana; Gadow, Enrique; Murray, Jeffrey C.

    2013-01-01

    The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor) and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR) gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3) gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies. PMID:23533356

  4. Mosaic paternal genome-wide uniparental isodisomy with down syndrome.

    PubMed

    Darcy, Diana; Atwal, Paldeep Singh; Angell, Cathy; Gadi, Inder; Wallerstein, Robert

    2015-10-01

    We report on a 6-month-old girl with two apparent cell lines; one with trisomy 21, and the other with paternal genome-wide uniparental isodisomy (GWUPiD), identified using single nucleotide polymorphism (SNP) based microarray and microsatellite analysis of polymorphic loci. The patient has Beckwith-Wiedemann syndrome (BWS) due to paternal uniparental disomy (UPD) at chromosome location 11p15 (UPD 11p15), which was confirmed through methylation analysis. Hyperinsulinemic hypoglycemia is present, which is associated with paternal UPD 11p15.5; and she likely has medullary nephrocalcinosis, which is associated with paternal UPD 20, although this was not biochemically confirmed. Angelman syndrome (AS) analysis was negative but this testing is not completely informative; she has no specific features of AS. Clinical features of this patient include: dysmorphic features consistent with trisomy 21, tetralogy of Fallot, hemihypertrophy, swirled skin hyperpigmentation, hepatoblastoma, and Wilms tumor. Her karyotype is 47,XX,+21[19]/46,XX[4], and microarray results suggest that the cell line with trisomy 21 is biparentally inherited and represents 40-50% of the genomic material in the tested specimen. The difference in the level of cytogenetically detected mosaicism versus the level of mosaicism observed via microarray analysis is likely caused by differences in the test methodologies. While a handful of cases of mosaic paternal GWUPiD have been reported, this patient is the only reported case that also involves trisomy 21. Other GWUPiD patients have presented with features associated with multiple imprinted regions, as does our patient.

  5. Multicentric Genome-Wide Association Study for Primary Spontaneous Pneumothorax

    PubMed Central

    Abrantes, Patrícia; Francisco, Vânia; Teixeira, Gilberto; Monteiro, Marta; Neves, João; Norte, Ana; Robalo Cordeiro, Carlos; Moura e Sá, João; Reis, Ernestina; Santos, Patrícia; Oliveira, Manuela; Sousa, Susana; Fradinho, Marta; Malheiro, Filipa; Negrão, Luís

    2016-01-01

    Despite elevated incidence and recurrence rates for Primary Spontaneous Pneumothorax (PSP), little is known about its etiology, and the genetics of idiopathic PSP remains unexplored. To identify genetic variants contributing to sporadic PSP risk, we conducted the first PSP genome-wide association study. Two replicate pools of 92 Portuguese PSP cases and of 129 age- and sex-matched controls were allelotyped in triplicate on the Affymetrix Human SNP Array 6.0 arrays. Markers passing quality control were ranked by relative allele score difference between cases and controls (|RASdiff|), by a novel cluster method and by a combined Z-test. 101 single nucleotide polymorphisms (SNPs) were selected using these three approaches for technical validation by individual genotyping in the discovery dataset. 87 out of 94 successfully tested SNPs were nominally associated in the discovery dataset. Replication of the 87 technically validated SNPs was then carried out in an independent replication dataset of 100 Portuguese cases and 425 controls. The intergenic rs4733649 SNP in chromosome 8 (between LINC00824 and LINC00977) was associated with PSP in the discovery (P = 4.07E-03, ORC[95% CI] = 1.88[1.22–2.89]), replication (P = 1.50E-02, ORC[95% CI] = 1.50[1.08–2.09]) and combined datasets (P = 8.61E-05, ORC[95% CI] = 1.65[1.29–2.13]). This study identified for the first time one genetic risk factor for sporadic PSP, but future studies are warranted to further confirm this finding in other populations and uncover its functional role in PSP pathogenesis. PMID:27203581

  6. Genome-wide methylation analyses in glioblastoma multiforme.

    PubMed

    Lai, Rose K; Chen, Yanwen; Guan, Xiaowei; Nousome, Darryl; Sharma, Charu; Canoll, Peter; Bruce, Jeffrey; Sloan, Andrew E; Cortes, Etty; Vonsattel, Jean-Paul; Su, Tao; Delgado-Cruzata, Lissette; Gurvich, Irina; Santella, Regina M; Ostrom, Quinn; Lee, Annette; Gregersen, Peter; Barnholtz-Sloan, Jill

    2014-01-01

    Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM). Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina) were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1), 5 methyl-deoxycytidine (5m-dC) and 5 hydroxylmethyl-deoxycytidine (5hm-dC) in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes) that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal.

  7. Genome-Wide Architecture of Disease Resistance Genes in Lettuce.

    PubMed

    Christopoulou, Marilena; Wo, Sebastian Reyes-Chin; Kozik, Alex; McHale, Leah K; Truco, Maria-Jose; Wroblewski, Tadeusz; Michelmore, Richard W

    2015-10-08

    Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12 were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied particularly of the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes.

  8. Genome-Wide Association Study of Schizophrenia in Japanese Population

    PubMed Central

    Yamada, Kazuo; Iwayama, Yoshimi; Hattori, Eiji; Iwamoto, Kazuya; Toyota, Tomoko; Ohnishi, Tetsuo; Ohba, Hisako; Maekawa, Motoko; Kato, Tadafumi; Yoshikawa, Takeo

    2011-01-01

    Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS) is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs) in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions). The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila)-like 2] gene located on 9p21.3 (p = 0.00087). In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals) of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026). The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology. PMID:21674006

  9. Genome-wide characteristics of de novo mutations in autism

    PubMed Central

    Yuen, Ryan K C; Merico, Daniele; Cao, Hongzhi; Pellecchia, Giovanna; Alipanahi, Babak; Thiruvahindrapuram, Bhooma; Tong, Xin; Sun, Yuhui; Cao, Dandan; Zhang, Tao; Wu, Xueli; Jin, Xin; Zhou, Ze; Liu, Xiaomin; Nalpathamkalam, Thomas; Walker, Susan; Howe, Jennifer L.; Wang, Zhuozhi; MacDonald, Jeffrey R.; Chan, Ada; D’Abate, Lia; Deneault, Eric; Siu, Michelle T.; Tammimies, Kristiina; Uddin, Mohammed; Zarrei, Mehdi; Wang, Mingbang; Li, Yingrui; Wang, Jun; Wang, Jian; Yang, Huanming; Bookman, Matt; Bingham, Jonathan; Gross, Samuel S.; Loy, Dion; Pletcher, Mathew; Marshall, Christian R.; Anagnostou, Evdokia; Zwaigenbaum, Lonnie; Weksberg, Rosanna; Fernandez, Bridget A; Roberts, Wendy; Szatmari, Peter; Glazer, David; Frey, Brendan J.; Ring, Robert H.; Xu, Xun; Scherer, Stephen W.

    2016-01-01

    De novo mutations (DNMs) are important in Autism Spectrum Disorder (ASD), but so far analyses have mainly been on the ~1.5% of the genome encoding genes. Here, we performed whole genome sequencing (WGS) of 200 ASD parent-child trios and characterized germline and somatic DNMs. We confirmed that the majority of germline DNMs (75.6%) originated from the father, and these increased significantly with paternal age only (p=4.2×10−10). However, when clustered DNMs (those within 20kb) were found in ASD, not only did they mostly originate from the mother (p=7.7×10−13), but they could also be found adjacent to de novo copy number variations (CNVs) where the mutation rate was significantly elevated (p=2.4×10−24). By comparing DNMs detected in controls, we found a significant enrichment of predicted damaging DNMs in ASD cases (p=8.0×10−9; OR=1.84), of which 15.6% (p=4.3×10−3) and 22.5% (p=7.0×10−5) were in the non-coding or genic non-coding, respectively. The non-coding elements most enriched for DNM were untranslated regions of genes, boundaries involved in exon-skipping and DNase I hypersensitive regions. Using microarrays and a novel outlier detection test, we also found aberrant methylation profiles in 2/185 (1.1%) of ASD cases. These same individuals carried independently identified DNMs in the ASD risk- and epigenetic- genes DNMT3A and ADNP. Our data begins to characterize different genome-wide DNMs, and highlight the contribution of non-coding variants, to the etiology of ASD. PMID:27525107

  10. Genome-Wide Association Studies of Multiple Keratinocyte Cancers

    PubMed Central

    Verkouteren, Joris A. C.; Hofman, Albert; Uitterlinden, André G.; Kraft, Peter; Turman, Constance; Han, Jiali; Cho, Eunyoung; Murabito, Joanne M.; Levy, Daniel; Qureshi, Abrar A.; Nijsten, Tamar

    2017-01-01

    There is strong evidence for a role of environmental risk factors involved in susceptibility to develop multiple keratinocyte cancers (mKCs), but whether genes are also involved in mKCs susceptibility has not been thoroughly investigated. We investigated whether single nucleotide polymorphisms (SNPs) are associated with susceptibility for mKCs. A genome-wide association study (GWAS) of 1,666 cases with mKCs and 1,950 cases with single KC (sKCs; controls) from Harvard cohorts (the Nurses' Health Study [NHS], NHS II, and the Health Professionals Follow-Up Study) and the Framingham Heart Study was carried-out using over 8 million SNPs (stage-1). We sought to replicate the most significant statistical associations (p-value≤ 5.5x10-6) in an independent cohort of 574 mKCs and 872 sKCs from the Rotterdam Study. In the discovery stage, 40 SNPs with suggestive associations (p-value ≤5.5x10-6) were identified, with eight independent SNPs tagging all 40 SNPs. The most significant SNP was located at chromosome 9 (rs7468390; p-value = 3.92x10-7). In stage-2, none of these SNPs replicated and only two of them were associated with mKCs in the same direction in the combined meta-analysis. We tested the associations for 19 previously reported basal cell carcinoma-related SNPs (candidate gene association analysis), and found that rs1805007 (MC1R locus) was significantly associated with risk of mKCs (p-value = 2.80x10-4). Although the suggestive SNPs with susceptibility for mKCs were not replicated, we found that previously identified BCC variants may also be associated with mKC, which the most significant association (rs1805007) located at the MC1R gene. PMID:28081215

  11. Genome-wide association study of sleep in Drosophila melanogaster

    PubMed Central

    2013-01-01

    Background Sleep is a highly conserved behavior, yet its duration and pattern vary extensively among species and between individuals within species. The genetic basis of natural variation in sleep remains unknown. Results We used the Drosophila Genetic Reference Panel (DGRP) to perform a genome-wide association (GWA) study of sleep in D. melanogaster. We identified candidate single nucleotide polymorphisms (SNPs) associated with differences in the mean as well as the environmental sensitivity of sleep traits; these SNPs typically had sex-specific or sex-biased effects, and were generally located in non-coding regions. The majority of SNPs (80.3%) affecting sleep were at low frequency and had moderately large effects. Additive models incorporating multiple SNPs explained as much as 55% of the genetic variance for sleep in males and females. Many of these loci are known to interact physically and/or genetically, enabling us to place them in candidate genetic networks. We confirmed the role of seven novel loci on sleep using insertional mutagenesis and RNA interference. Conclusions We identified many SNPs in novel loci that are potentially associated with natural variation in sleep, as well as SNPs within genes previously known to affect Drosophila sleep. Several of the candidate genes have human homologues that were identified in studies of human sleep, suggesting that genes affecting variation in sleep are conserved across species. Our discovery of genetic variants that influence environmental sensitivity to sleep may have a wider application to all GWA studies, because individuals with highly plastic genotypes will not have consistent phenotypes. PMID:23617951

  12. Genome-wide SNP typing reveals signatures of population history.

    PubMed

    Hughes, Austin L; Welch, Robert; Puri, Vinita; Matthews, Casey; Haque, Kashif; Chanock, Stephen J; Yeager, Meredith

    2008-07-01

    Single-nucleotide polymorphism (SNP) arrays have become a popular technology for disease-association studies, but they also have potential for studying the genetic differentiation of human populations. Application of the Affymetrix GeneChip Human Mapping 500K Array Set to a population of 102 individuals representing the major ethnic groups in the United States (African, Asian, European, and Hispanic) revealed patterns of gene diversity and genetic distance that reflected population history. We analyzed allelic frequencies at 388,654 autosomal SNP sites that showed some variation in our study population and 10% or fewer missing values. Despite the small size (23-31 individuals) of each subpopulation, there were no fixed differences at any site between any two subpopulations. As expected from the African origin of modern humans, greater gene diversity was seen in Africans than in either Asians or Europeans, and the genetic distance between the Asian and the European populations was significantly lower than that between either of these two populations and Africans. Principal components analysis applied to a correlation matrix among individuals was able to separate completely the major continental groups of humans (Africans, Asians, and Europeans), while Hispanics overlapped all three of these groups. Genes containing two or more markers with extraordinarily high genetic distance between subpopulations were identified as candidate genes for health differences between subpopulations. The results show that, even with modest sample sizes, genome-wide SNP genotyping technologies have great promise for capturing signatures of gene frequency difference between human subpopulations, with applications in areas as diverse as forensics and the study of ethnic health disparities.

  13. Assessing statistical significance in multivariable genome wide association analysis

    PubMed Central

    Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter

    2016-01-01

    Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: peter.buehlmann@stat.math.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677

  14. Technical note: Computing strategies in genome-wide selection.

    PubMed

    Legarra, A; Misztal, I

    2008-01-01

    Genome-wide genetic evaluation might involve the computation of BLUP-like estimations, potentially including thousands of covariates (i.e., single-nucleotide polymorphism markers) for each record. This implies dense Henderson's mixed-model equations and considerable computing resources in time and storage, even for a few thousand records. Possible computing options include the type of storage and the solving algorithm. This work evaluated several computing options, including half-stored Cholesky decomposition, Gauss-Seidel, and 3 matrix-free strategies: Gauss-Seidel, Gauss-Seidel with residuals update, and preconditioned conjugate gradients. Matrix-free Gauss-Seidel with residuals update adjusts the residuals after computing the solution for each effect. This avoids adjusting the left-hand side of the equations by all other effects at every step of the algorithm and saves considerable computing time. Any Gauss-Seidel algorithm can easily be extended for variance component estimation by Markov chain-Monte Carlo. Let m and n be the number of records and markers, respectively. Computing time for Cholesky decomposition is proportional to n3. Computing times per round are proportional to mn2 in matrix-free Gauss-Seidel, to n2 for half-stored Gauss-Seidel, and to n and m for the rest of the algorithms. Algorithms were tested on a real mouse data set, which included 1,928 records and 10,946 single-nucleotide polymorphism markers. Computing times were in the order of a few minutes for Gauss-Seidel with residuals update and preconditioned conjugate gradients, more than 1 h for half-stored Gauss-Seidel, 2 h for Cholesky decomposition, and 4 d for matrix-free Gauss-Seidel. Preconditioned conjugate gradients was the fastest. Gauss-Seidel with residuals update would be the method of choice for variance component estimation as well as solving.

  15. Genome-wide analysis highlights genetic dilution in Algerian sheep.

    PubMed

    Gaouar, S B S; Lafri, M; Djaout, A; El-Bouyahiaoui, R; Bouri, A; Bouchatal, A; Maftah, A; Ciani, E; Da Silva, A B

    2017-03-01

    Algeria represents a reservoir of genetic diversity with local sheep breeds adapted to a large range of environments and showing specific features necessary to deal with harsh conditions. This remarkable diversity results from the traditional management of dryland by pastoralists over centuries. Most of these breeds are poorly productive, and the economic pressure leads farmers to realize anarchic cross-breeding (that is, not carried out in the framework of selection plans) with the hope to increase animal's conformation. In this study, eight of the nine local Algerian sheep breeds (D'men, Hamra, Ouled-Djellal, Rembi, Sidaoun, Tazegzawt, Berber and Barbarine) were investigated for the first time by genome-wide single-nucleotide polymorphism genotyping. At an international scale, Algerian sheep occupied an original position shaped by relations with African and European (particularly Italian) breeds. The strong genetic proximity with Caribbean and Brazilian breeds confirmed that the genetic make-up of these American breeds was largely influenced by the Atlantic slave trade. At a national scale, an alarming genetic dilution of the Berber (a primitive breed) and the Rembi was observed, as a consequence of uncontrolled mating practices with Ouled-Djellal. A similar, though less pronounced, phenomenon was also detected for the Barbarine, another ancestral breed. Genetic originality appeared to be better preserved in Tazegzawt, Hamra, D'men and Sidaoun. These breeds should be given high priority in the establishment of conservation plans to halt their progressive loss. For Berber and Barbarine that also occur in the bordering neighbor countries, urgent concerted transnational actions are needed.

  16. Heritability and genome-wide linkage scan of subjective happiness.

    PubMed

    Bartels, Meike; Saviouk, Viatcheslav; de Moor, Marleen H M; Willemsen, Gonneke; van Beijsterveldt, Toos C E M; Hottenga, Jouke-Jan; de Geus, Eco J C; Boomsma, Dorret I

    2010-04-01

    Causes of individual differences in happiness, as assessed with the Subjective Happiness Scale, are investigated in a large of sample twins and siblings from the Netherlands Twin Register. Over 12,000 twins and siblings, average age 24.7 years (range 12 to 88), took part in the study. A genetic model with an age by sex design was fitted to the data with structural equation modeling in Mx. The heritability of happiness was estimated at 22% for males and 41% in females. No effect of age was observed. To identify the genomic regions contributing to this heritability, a genome-wide linkage study for happiness was conducted in sibling pairs. A subsample of 1157 offspring from 441 families was genotyped with an average of 371 micro-satellite markers per individual. Phenotype and genotype data were analyzed in MERLIN with multipoint variance component linkage analysis and age and sex as covariates. A linkage signal (logarithm of odds score 2.73, empirical p value 0.095) was obtained at the end of the long arm of chromosome 19 for marker D19S254 at 110 cM. A second suggestive linkage peak was found at the short arm of chromosome 1 (LOD of 2.37) at 153 cM, marker D1S534 (empirical p value of .209). These two regions of interest are not overlapping with the regions found for contrasting phenotypes (such as depression, which is negatively associated with happiness). Further linkage and future association studies are warranted.

  17. Genome-wide analysis of condensin binding in Caenorhabditis elegans

    PubMed Central

    2013-01-01

    Background Condensins are multi-subunit protein complexes that are essential for chromosome condensation during mitosis and meiosis, and play key roles in transcription regulation during interphase. Metazoans contain two condensins, I and II, which perform different functions and localize to different chromosomal regions. Caenorhabditis elegans contains a third condensin, IDC, that is targeted to and represses transcription of the X chromosome for dosage compensation. Results To understand condensin binding and function, we performed ChIP-seq analysis of C. elegans condensins in mixed developmental stage embryos, which contain predominantly interphase nuclei. Condensins bind to a subset of active promoters, tRNA genes and putative enhancers. Expression analysis in kle-2-mutant larvae suggests that the primary effect of condensin II on transcription is repression. A DNA sequence motif, GCGC, is enriched at condensin II binding sites. A sequence extension of this core motif, AGGG, creates the condensin IDC motif. In addition to differences in recruitment that result in X-enrichment of condensin IDC and condensin II binding to all chromosomes, we provide evidence for a shared recruitment mechanism, as condensin IDC recruiter SDC-2 also recruits condensin II to the condensin IDC recruitment sites on the X. In addition, we found that condensin sites overlap extensively with the cohesin loader SCC-2, and that SDC-2 also recruits SCC-2 to the condensin IDC recruitment sites. Conclusions Our results provide the first genome-wide view of metazoan condensin II binding in interphase, define putative recruitment motifs, and illustrate shared loading mechanisms for condensin IDC and condensin II. PMID:24125077

  18. Genome-wide metabolic (re-) annotation of Kluyveromyces lactis

    PubMed Central

    2012-01-01

    Background Even before having its genome sequence published in 2004, Kluyveromyces lactis had long been considered a model organism for studies in genetics and physiology. Research on Kluyveromyces lactis is quite advanced and this yeast species is one of the few with which it is possible to perform formal genetic analysis. Nevertheless, until now, no complete metabolic functional annotation has been performed to the proteins encoded in the Kluyveromyces lactis genome. Results In this work, a new metabolic genome-wide functional re-annotation of the proteins encoded in the Kluyveromyces lactis genome was performed, resulting in the annotation of 1759 genes with metabolic functions, and the development of a methodology supported by merlin (software developed in-house). The new annotation includes novelties, such as the assignment of transporter superfamily numbers to genes identified as transporter proteins. Thus, the genes annotated with metabolic functions could be exclusively enzymatic (1410 genes), transporter proteins encoding genes (301 genes) or have both metabolic activities (48 genes). The new annotation produced by this work largely surpassed the Kluyveromyces lactis currently available annotations. A comparison with KEGG’s annotation revealed a match with 844 (~90%) of the genes annotated by KEGG, while adding 850 new gene annotations. Moreover, there are 32 genes with annotations different from KEGG. Conclusions The methodology developed throughout this work can be used to re-annotate any yeast or, with a little tweak of the reference organism, the proteins encoded in any sequenced genome. The new annotation provided by this study offers basic knowledge which might be useful for the scientific community working on this model yeast, because new functions have been identified for the so-called metabolic genes. Furthermore, it served as the basis for the reconstruction of a compartmentalized, genome-scale metabolic model of Kluyveromyces lactis, which is

  19. A genome-wide DNA methylation study in azoospermia.

    PubMed

    Ferfouri, F; Boitrelle, F; Ghout, I; Albert, M; Molina Gomes, D; Wainer, R; Bailly, M; Selva, J; Vialard, F

    2013-11-01

    The objective of this study was to assess genome-wide DNA methylation in testicular tissue from azoospermic patients. A total of 94 azoospermic patients were recruited and classified into three groups: 29 patients presented obstructive azoospermia (OA), 26 displayed non-obstructive azoospermia (NOA) and successful retrieval of spermatozoa by testicular sperm extraction (TESE+) and 39 displayed NOA and failure to retrieve spermatozoa by TESE (TESE-). An Illumina Infinium Human Methylation27 BeadChip DNA methylation array was used to establish a testicular DNA methylation pattern for each type of azoospermic patient. The OA and NOA groups were compared in terms of the relative M-value (the log2 ratio between methylated and non-methylated probe intensities) for each CpG site. We observed significantly different DNA methylation profiles for the NOA and OA groups, with differences at over 9000 of the 27 578 CpG sites; 212 CpG sites had a relative M-value >3. The results highlighted 14 testis-specific genes. Patient clustering with respect to these 212 CpG sites corresponded closely to the clinical classification. The DNA methylation patterns showed that in the NOA group, 78 of the 212 CpG sites were hypomethylated and 134 were hypermethylated (relative to the OA group). On the basis of these DNA methylation profiles, azoospermic patients could be classified as OA or NOA by considering the 212 CpG sites with the greatest methylation differences. Furthermore, we identified genes that may provide insight into the mechanism of idiopathic NOA.

  20. Genome-wide examination of myoblast cell cycle withdrawal duringdifferentiation

    SciTech Connect

    Shen, Xun; Collier, John Michael; Hlaing, Myint; Zhang, Leanne; Delshad, Elizabeth H.; Bristow, James; Bernstein, Harold S.

    2002-12-02

    Skeletal and cardiac myocytes cease division within weeks of birth. Although skeletal muscle retains limited capacity for regeneration through recruitment of satellite cells, resident populations of adult myocardial stem cells have not been identified. Because cell cycle withdrawal accompanies myocyte differentiation, we hypothesized that C2C12 cells, a mouse myoblast cell line previously used to characterize myocyte differentiation, also would provide a model for studying cell cycle withdrawal during differentiation. C2C12 cells were differentiated in culture medium containing horse serum and harvested at various time points to characterize the expression profiles of known cell cycle and myogenic regulatory factors by immunoblot analysis. BrdU incorporation decreased dramatically in confluent cultures 48 hr after addition of horse serum, as cells started to form myotubes. This finding was preceded by up-regulation of MyoD, followed by myogenin, and activation of Bcl-2. Cyclin D1 was expressed in proliferating cultures and became undetectable in cultures containing 40 percent fused myotubes, as levels of p21(WAF1/Cip1) increased and alpha-actin became detectable. Because C2C12 myoblasts withdraw from the cell cycle during myocyte differentiation following a course that recapitulates this process in vivo, we performed a genome-wide screen to identify other gene products involved in this process. Using microarrays containing approximately 10,000 minimally redundant mouse sequences that map to the UniGene database of the National Center for Biotechnology Information, we compared gene expression profiles between proliferating, differentiating, and differentiated C2C12 cells and verified candidate genes demonstrating differential expression by RT-PCR. Cluster analysis of differentially expressed genes revealed groups of gene products involved in cell cycle withdrawal, muscle differentiation, and apoptosis. In addition, we identified several genes, including DDAH2 and Ly

  1. The anti-CMS technique for genome-wide mapping of 5-hydroxymethylcytosine.

    PubMed

    Huang, Yun; Pastor, William A; Zepeda-Martínez, Jorge A; Rao, Anjana

    2012-10-01

    5-Hydroxymethylcytosine (5hmC) is a recently discovered base in the mammalian genome, produced upon oxidation of 5-methylcytosine (5mC) in a process catalyzed by TET proteins. The biological functions of 5hmC and further oxidation products of 5mC are under intense investigation, as they are likely intermediates in DNA demethylation pathways. Here we describe a novel protocol to profile 5hmC at a genome-wide scale. This approach is based on sodium bisulfite-mediated conversion of 5hmC to cytosine-5-methylenesulfonate (CMS); CMS-containing DNA fragments are then immunoprecipitated using a CMS-specific antiserum. The anti-CMS technique is highly specific with a low background, and is much less dependent on 5hmC density than anti-5hmC immunoprecipitation (IP). Moreover, it does not enrich for CA and CT repeats, as noted for 5hmC DNA IP using antibodies to 5hmC. The anti-CMS protocol takes 3 d to complete.

  2. A Genome-Wide Association Study Identifies Multiple Regions Associated with Head Size in Catfish

    PubMed Central

    Geng, Xin; Liu, Shikai; Yao, Jun; Bao, Lisui; Zhang, Jiaren; Li, Chao; Wang, Ruijia; Sha, Jin; Zeng, Peng; Zhi, Degui; Liu, Zhanjiang

    2016-01-01

    Skull morphology is fundamental to evolution and the biological adaptation of species to their environments. With aquaculture fish species, head size is also important for economic reasons because it has a direct impact on fillet yield. However, little is known about the underlying genetic basis of head size. Catfish is the primary aquaculture species in the United States. In this study, we performed a genome-wide association study using the catfish 250K SNP array with backcross hybrid catfish to map the QTL for head size (head length, head width, and head depth). One significantly associated region on linkage group (LG) 7 was identified for head length. In addition, LGs 7, 9, and 16 contain suggestively associated regions for head length. For head width, significantly associated regions were found on LG9, and additional suggestively associated regions were identified on LGs 5 and 7. No region was found associated with head depth. Head size genetic loci were mapped in catfish to genomic regions with candidate genes involved in bone development. Comparative analysis indicated that homologs of several candidate genes are also involved in skull morphology in various other species ranging from amphibian to mammalian species, suggesting possible evolutionary conservation of those genes in the control of skull morphologies. PMID:27558670

  3. Carcinogens induce genome-wide loss of heterozygosity in normal stem cells without persistent chromosomal instability.

    PubMed

    Donahue, Sarah L; Lin, Qing; Cao, Shang; Ruley, H Earl

    2006-08-01

    Widespread losses of heterozygosity (LOH) in human cancer have been thought to result from chromosomal instability caused by mutations affecting DNA repair/genome maintenance. However, the origin of LOH in most tumors is unknown. The present study examined the ability of carcinogenic agents to induce LOH at 53 sites throughout the genome of normal diploid mouse ES cells. Brief exposures to nontoxic levels of methylnitrosourea, diepoxybutane, mitomycin C, hydroxyurea, doxorubicin, and UV light stimulated LOH at all loci at frequencies ranging from 1-8 x 10(-3) per cell (10-123 times higher than in untreated cells). This greatly exceeds the frequencies at which these agents have been reported to induce point mutations and is comparable to the rates of LOH observed in ES cells lacking the gene responsible for Bloom syndrome, an inherited DNA repair defect that results in greatly increased risk of cancer. These results suggest that LOH contributes significantly to the carcinogenicity of a variety of mutagens and raises the possibility that genome-wide LOH observed in some human cancers may reflect prior exposure to genotoxic agents rather than a state of chromosomal instability during the carcinogenic process. Finally, as a practical matter, chemically induced LOH is expected to enhance the recovery of homozygous recessive mutants from phenotype-based genetic screens in mammalian cells.

  4. Carcinogens induce genome-wide loss of heterozygosity in normal stem cells without persistent chromosomal instability

    PubMed Central

    Donahue, Sarah L.; Lin, Qing; Cao, Shang; Ruley, H. Earl

    2006-01-01

    Widespread losses of heterozygosity (LOH) in human cancer have been thought to result from chromosomal instability caused by mutations affecting DNA repair/genome maintenance. However, the origin of LOH in most tumors is unknown. The present study examined the ability of carcinogenic agents to induce LOH at 53 sites throughout the genome of normal diploid mouse ES cells. Brief exposures to nontoxic levels of methylnitrosourea, diepoxybutane, mitomycin C, hydroxyurea, doxorubicin, and UV light stimulated LOH at all loci at frequencies ranging from 1–8 × 10−3 per cell (10–123 times higher than in untreated cells). This greatly exceeds the frequencies at which these agents have been reported to induce point mutations and is comparable to the rates of LOH observed in ES cells lacking the gene responsible for Bloom syndrome, an inherited DNA repair defect that results in greatly increased risk of cancer. These results suggest that LOH contributes significantly to the carcinogenicity of a variety of mutagens and raises the possibility that genome-wide LOH observed in some human cancers may reflect prior exposure to genotoxic agents rather than a state of chromosomal instability during the carcinogenic process. Finally, as a practical matter, chemically induced LOH is expected to enhance the recovery of homozygous recessive mutants from phenotype-based genetic screens in mammalian cells. PMID:16868089

  5. Genome-Wide Analysis of Self-Renewal in Drosophila Neural Stem Cells by Transgenic RNAi

    PubMed Central

    Neumüller, Ralph A.; Richter, Constance; Fischer, Anja; Novatchkova, Maria; Neumüller, Klaus G.; Knoblich, Juergen A.

    2011-01-01

    Summary The balance between stem cell self-renewal and differentiation is precisely controlled to ensure tissue homeostasis and prevent tumorigenesis. Here we use genome-wide transgenic RNAi to identify 620 genes potentially involved in controlling this balance in Drosophila neuroblasts. We quantify all phenotypes and derive measurements for proliferation, lineage, cell size, and cell shape. We identify a set of transcriptional regulators essential for self-renewal and use hierarchical clustering and integration with interaction data to create functional networks for the control of neuroblast self-renewal and differentiation. Our data identify key roles for the chromatin remodeling Brm complex, the spliceosome, and the TRiC/CCT-complex and show that the alternatively spliced transcription factor Lola and the transcriptional elongation factors Ssrp and Barc control self-renewal in neuroblast lineages. As our data are strongly enriched for genes highly expressed in murine neural stem cells, they are likely to provide valuable insights into mammalian stem cell biology as well. PMID:21549331

  6. Genome-wide nucleosome specificity and function of chromatin remodellers in ES cells

    PubMed Central

    de Dieuleveult, Maud; Yen, Kuangyu; Hmitou, Isabelle; Depaux, Arnaud; Boussouar, Fayçal; Dargham, Daria Bou; Jounier, Sylvie; Humbertclaude, Hélène; Ribierre, Florence; Baulard, Céline; Farrell, Nina P.; Park, Bongsoo; Keime, Céline; Carrière, Lucie; Berlivet, Soizick; Gut, Marta; Gut, Ivo; Werner, Michel; Deleuze, Jean-François; Olaso, Robert; Aude, Jean-Christophe; Chantalat, Sophie; Pugh, B. Franklin; Gérard, Matthieu

    2015-01-01

    Summary ATP-dependent chromatin remodellers allow access to DNA for transcription factors and the general transcription machinery, but whether mammalian chromatin remodellers1–3 target specific nucleosomes to regulate transcription is unclear. Here, we present genome-wide remodeller-nucleosome interaction profiles for Chd1, Chd2, Chd4, Chd6, Chd8, Chd9, Brg1 and Ep400 in mouse embryonic stem (ES) cells. These remodellers bind one or both full nucleosomes that flank MNase-defined nucleosome-free promoter regions (NFRs), where they separate divergent transcription. Surprisingly, large CpG-rich NFRs that extend downstream of annotated transcriptional start sites (TSSs) are nevertheless chromatinized with non-nucleosomal or subnucleosomal histone variants (H3.3 and H2A.Z) and modifications (H3K4me3 and H3K27ac). RNA polymerase (pol) II therefore navigates hundreds of bp of altered chromatin in the sense direction before encountering an MNase-resistant nucleosome at the 3′ end of the NFR. Transcriptome analysis upon remodeller depletion reveals reciprocal mechanisms of transcriptional regulation by remodellers. Whereas at active genes individual remodellers play either positive or negative roles via altering nucleosome stability, at polycomb-enriched bivalent genes the same remodellers act in an opposite manner. These findings indicate that remodellers target specific nucleosomes at the edge of NFRs, where they regulate ES cell transcriptional programs. PMID:26814966

  7. Genome-wide nucleosome specificity and function of chromatin remodellers in ES cells.

    PubMed

    de Dieuleveult, Maud; Yen, Kuangyu; Hmitou, Isabelle; Depaux, Arnaud; Boussouar, Fayçal; Bou Dargham, Daria; Jounier, Sylvie; Humbertclaude, Hélène; Ribierre, Florence; Baulard, Céline; Farrell, Nina P; Park, Bongsoo; Keime, Céline; Carrière, Lucie; Berlivet, Soizick; Gut, Marta; Gut, Ivo; Werner, Michel; Deleuze, Jean-François; Olaso, Robert; Aude, Jean-Christophe; Chantalat, Sophie; Pugh, B Franklin; Gérard, Matthieu

    2016-02-04

    ATP-dependent chromatin remodellers allow access to DNA for transcription factors and the general transcription machinery, but whether mammalian chromatin remodellers target specific nucleosomes to regulate transcription is unclear. Here we present genome-wide remodeller-nucleosome interaction profiles for the chromatin remodellers Chd1, Chd2, Chd4, Chd6, Chd8, Chd9, Brg1 and Ep400 in mouse embryonic stem (ES) cells. These remodellers bind one or both full nucleosomes that flank micrococcal nuclease (MNase)-defined nucleosome-free promoter regions (NFRs), where they separate divergent transcription. Surprisingly, large CpG-rich NFRs that extend downstream of annotated transcriptional start sites are nevertheless bound by non-nucleosomal or subnucleosomal histone variants (H3.3 and H2A.Z) and marked by H3K4me3 and H3K27ac modifications. RNA polymerase II therefore navigates hundreds of base pairs of altered chromatin in the sense direction before encountering an MNase-resistant nucleosome at the 3' end of the NFR. Transcriptome analysis after remodeller depletion reveals reciprocal mechanisms of transcriptional regulation by remodellers. Whereas at active genes individual remodellers have either positive or negative roles via altering nucleosome stability, at polycomb-enriched bivalent genes the same remodellers act in an opposite manner. These findings indicate that remodellers target specific nucleosomes at the edge of NFRs, where they regulate ES cell transcriptional programs.

  8. Genome-Wide Analysis of Promoters: Clustering by Alignment and Analysis of Regular Patterns

    PubMed Central

    Pettinato, Lucia; Calistri, Elisa; Di Patti, Francesca; Livi, Roberto; Luccioli, Stefano

    2014-01-01

    In this paper we perform a genome-wide analysis of H. sapiens promoters. To this aim, we developed and combined two mathematical methods that allow us to (i) classify promoters into groups characterized by specific global structural features, and (ii) recover, in full generality, any regular sequence in the different classes of promoters. One of the main findings of this analysis is that H. sapiens promoters can be classified into three main groups. Two of them are distinguished by the prevalence of weak or strong nucleotides and are characterized by short compositionally biased sequences, while the most frequent regular sequences in the third group are strongly correlated with transposons. Taking advantage of the generality of these mathematical procedures, we have compared the promoter database of H. sapiens with those of other species. We have found that the above-mentioned features characterize also the evolutionary content appearing in mammalian promoters, at variance with ancestral species in the phylogenetic tree, that exhibit a definitely lower level of differentiation among promoters. PMID:24465517

  9. A Genome-wide CRISPR Screen in Toxoplasma Identifies Essential Apicomplexan Genes.

    PubMed

    Sidik, Saima M; Huet, Diego; Ganesan, Suresh M; Huynh, My-Hang; Wang, Tim; Nasamu, Armiyaw S; Thiru, Prathapan; Saeij, Jeroen P J; Carruthers, Vern B; Niles, Jacquin C; Lourido, Sebastian

    2016-09-08

    Apicomplexan parasites are leading causes of human and livestock diseases such as malaria and toxoplasmosis, yet most of their genes remain uncharacterized. Here, we present the first genome-wide genetic screen of an apicomplexan. We adapted CRISPR/Cas9 to assess the contribution of each gene from the parasite Toxoplasma gondii during infection of human fibroblasts. Our analysis defines ∼200 previously uncharacterized, fitness-conferring genes unique to the phylum, from which 16 were investigated, revealing essential functions during infection of human cells. Secondary screens identify as an invasion factor the claudin-like apicomplexan microneme protein (CLAMP), which resembles mammalian tight-junction proteins and localizes to secretory organelles, making it critical to the initiation of infection. CLAMP is present throughout sequenced apicomplexan genomes and is essential during the asexual stages of the malaria parasite Plasmodium falciparum. These results provide broad-based functional information on T. gondii genes and will facilitate future approaches to expand the horizon of antiparasitic interventions.

  10. Genome-wide quantitative assessment of variation in DNA methylation patterns

    PubMed Central

    Xie, Hehuang; Wang, Min; de Andrade, Alexandre; de F. Bonaldo, Maria; Galat, Vasil; Arndt, Kelly; Rajaram, Veena; Goldman, Stewart; Tomita, Tadanori; Soares, Marcelo B.

    2011-01-01

    Genomic DNA methylation contributes substantively to transcriptional regulations that underlie mammalian development and cellular differentiation. Much effort has been made to decipher the molecular mechanisms governing the establishment and maintenance of DNA methylation patterns. However, little is known about genome-wide variation of DNA methylation patterns. In this study, we introduced the concept of methylation entropy, a measure of the randomness of DNA methylation patterns in a cell population, and exploited it to assess the variability in DNA methylation patterns of Alu repeats and promoters. A few interesting observations were made: (i) within a cell population, methylation entropy varies among genomic loci; (ii) among cell populations, the methylation entropies of most genomic loci remain constant; (iii) compared to normal tissue controls, some tumors exhibit greater methylation entropies; (iv) Alu elements with high methylation entropy are associated with high GC content but depletion of CpG dinucleotides and (v) Alu elements in the intronic regions or far from CpG islands are associated with low methylation entropy. We further identified 12 putative allelic-specific methylated genomic loci, including four Alu elements and eight promoters. Lastly, using subcloned normal fibroblast cells, we demonstrated the highly variable methylation patterns are resulted from low fidelity of DNA methylation inheritance. PMID:21278160

  11. Genome-wide maps of nuclear lamina interactions in single human cells

    PubMed Central

    Kind, Jop; Pagie, Ludo; de Vries, Sandra S.; Nahidiazar, Leila; Dey, Siddharth S.; Bienko, Magda; Zhan, Ye; Lajoie, Bryan; de Graaf, Carolyn A.; Amendola, Mario; Fudenberg, Geoffrey; Imakaev, Maxim; Mirny, Leonid A.; Jalink, Kees; Dekker, Job; van Oudenaarden, Alexander; van Steensel, Bas

    2015-01-01

    Summary Mammalian interphase chromosomes interact with the nuclear lamina (NL) through hundreds of large Lamina Associated Domains (LADs). We report a method to map NL contacts genome-wide in single human cells. Analysis of nearly 400 maps reveals a core architecture of gene-poor LADs that contact the NL with high cell-to-cell consistency, interspersed by LADs with more variable NL interactions. The variable contacts tend to be cell-type specific and are more sensitive to changes in genome ploidy than the consistent contacts. Single-cell maps indicate that NL contacts involve multivalent interactions over hundreds of kilobases. Moreover, we observe extensive intra-chromosomal coordination of NL contacts, even over tens of megabases. Such coordinated loci exhibit preferential interactions as detected by Hi-C. Finally, consistency of NL contacts is inversely linked to gene activity in single cells, and correlates positively with the heterochromatic histone modification H3K9me3. These results highlight fundamental principles of single cell chromatin organization. PMID:26365489

  12. Genome-Wide RNAi Screening to Dissect the TGF-β Signal Transduction Pathway.

    PubMed

    Chen, Xiaochu; Xu, Lan

    2016-01-01

    The transforming growth factor-β (TGF-β) family of cytokines figures prominently in regulation of embryonic development and adult tissue homeostasis from Drosophila to mammals. Genetic defects affecting TGF-β signaling underlie developmental disorders and diseases such as cancer in human. Therefore, delineating the molecular mechanism by which TGF-β regulates cell biology is critical for understanding normal biology and disease mechanisms. Forward genetic screens in model organisms and biochemical approaches in mammalian tissue culture were instrumental in initial characterization of the TGF-β signal transduction pathway. With complete sequence information of the genomes and the advent of RNA interference (RNAi) technology, genome-wide RNAi screening emerged as a powerful functional genomics approach to systematically delineate molecular components of signal transduction pathways. Here, we describe a protocol for image-based whole-genome RNAi screening aimed at identifying molecules required for TGF-β signaling into the nucleus. Using this protocol we examined >90 % of annotated Drosophila open reading frames (ORF) individually and successfully uncovered several novel factors serving critical roles in the TGF-β pathway. Thus cell-based high-throughput functional genomics can uncover new mechanistic insights on signaling pathways beyond what the classical genetics had revealed.

  13. Novel skin phenotypes revealed by a genome-wide mouse reverse genetic screen

    PubMed Central

    Liakath-Ali, Kifayathullah; Vancollie, Valerie E.; Heath, Emma; Smedley, Damian P.; Estabel, Jeanne; Sunter, David; DiTommaso, Tia; White, Jacqueline K.; Ramirez-Solis, Ramiro; Smyth, Ian; Steel, Karen P.; Watt, Fiona M.

    2014-01-01

    Permanent stop-and-shop large-scale mouse mutant resources provide an excellent platform to decipher tissue phenogenomics. Here we analyse skin from 538 knockout mouse mutants generated by the Sanger Institute Mouse Genetics Project. We optimize immunolabelling of tail epidermal wholemounts to allow systematic annotation of hair follicle, sebaceous gland and interfollicular epidermal abnormalities using ontology terms from the Mammalian Phenotype Ontology. Of the 50 mutants with an epidermal phenotype, 9 map to human genetic conditions with skin abnormalities. Some mutant genes are expressed in the skin, whereas others are not, indicating systemic effects. One phenotype is affected by diet and several are incompletely penetrant. In-depth analysis of three mutants, Krt76, Myo5a (a model of human Griscelli syndrome) and Mysm1, provides validation of the screen. Our study is the first large-scale genome-wide tissue phenotype screen from the International Knockout Mouse Consortium and provides an open access resource for the scientific community. PMID:24721909

  14. Phenotype prediction based on genome-wide DNA methylation data

    PubMed Central

    2014-01-01

    Background DNA methylation (DNAm) has important regulatory roles in many biological processes and diseases. It is the only epigenetic mark with a clear mechanism of mitotic inheritance and the only one easily available on a genome scale. Aberrant cytosine-phosphate-guanine (CpG) methylation has been discussed in the context of disease aetiology, especially cancer. CpG hypermethylation of promoter regions is often associated with silencing of tumour suppressor genes and hypomethylation with activation of oncogenes. Supervised principal component analysis (SPCA) is a popular machine learning method. However, in a recent application to phenotype prediction from DNAm data SPCA was inferior to the specific method EVORA. Results We present Model-Selection-SPCA (MS-SPCA), an enhanced version of SPCA. MS-SPCA applies several models that perform well in the training data to the test data and selects the very best models for final prediction based on parameters of the test data. We have applied MS-SPCA for phenotype prediction from genome-wide DNAm data. CpGs used for prediction are selected based on the quantification of three features of their methylation (average methylation difference, methylation variation difference and methylation-age-correlation). We analysed four independent case–control datasets that correspond to different stages of cervical cancer: (i) cases currently cytologically normal, but will later develop neoplastic transformations, (ii, iii) cases showing neoplastic transformations and (iv) cases with confirmed cancer. The first dataset was split into several smaller case–control datasets (samples either Human Papilloma Virus (HPV) positive or negative). We demonstrate that cytology normal HPV+ and HPV- samples contain DNAm patterns which are associated with later neoplastic transformations. We present evidence that DNAm patterns exist in cytology normal HPV- samples that (i) predispose to neoplastic transformations after HPV infection and (ii

  15. A Genome-wide Pleiotropy Scan for Prostate Cancer Risk

    PubMed Central

    Panagiotou, Orestis A; Travis, Ruth C; Campa, Daniele; Berndt, Sonja I.; Lindstrom, Sara; Kraft, Peter; Schumacher, Fredrick R.; Siddiq, Afshan; Papatheodorou, Stefania I.; Stanford, Janet L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie J.; Diver, W. Ryan; Gapstur, Susan M.; Stevens, Victoria L.; Boeing, Heiner; Bueno-de-Mesquita, H. Bas; Gurrea, Aurelio Barricarte; Kaaks, Rudolf; Khaw, Kay-Tee; Krogh, Vittorio; Overvad, Kim; Riboli, Elio; Trichopoulos, Dimitrios; Giovannucci, Edward; Stampfer, Meir; Haiman, Christopher; Henderson, Brian; Le Marchand, Loic; Gaziano, J. Michael; Hunter, DavidJ.; Koutros, Stella; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Wacholder, Sholom; Key, Timothy J.; Tsilidis, Konstantinos K

    2014-01-01

    Background No single-nucleotide polymorphisms (SNPs) specific for aggressive prostate cancer have been identified in genome-wide association studies (GWAS). Objective To test if SNPs associated with other traits may also affect the risk of aggressive prostate cancer. Design, setting, and participants SNPs implicated in any phenotype other than prostate cancer (p ≤ 10−7) were identified through the catalog of published GWAS and tested in 2891 aggressive prostate cancer cases and 4592 controls from the Breast and Prostate Cancer Cohort Consortium (BPC3). The 40 most significant SNPs were followed up in 4872 aggressive prostate cancer cases and 24 534 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. Outcome measurements and statistical analysis Odds ratios (ORs) and 95% confidence intervals (CIs) for aggressive prostate cancer were estimated. Results and limitations A total of 4666 SNPs were evaluated by the BPC3. Two signals were seen in regions already reported for prostate cancer risk. rs7014346 at 8q24.21 was marginally associated with aggressive prostate cancer in the BPC3 trial (p = 1.6 × 10-6), whereas after meta-analysis by PRACTICAL the summary OR was 1.21 (95%CI 1.16–1.27; p = 3.22 × 10−18). rs9900242 at 17q24.3 was also marginally associated with aggressive disease in the meta-analysis (OR 0.90, 95% CI 0.86–0.94; p = 2.5 × 10−6). Neither of these SNPs remained statistically significant when conditioning on correlated known prostate cancer SNPs. The meta-analysis by BPC3 and PRACTICAL identified a third promising signal, marked by rs16844874 at 2q34, independent of known prostate cancer loci (OR 1.12,95% CI 1.06–1.19; p = 4.67 × 10−5); it has been shown that SNPs correlated with this signal affect glycine concentrations. The main limitation is the heterogeneity in the definition of aggressive prostate cancer between BPC3 and PRACTICAL. Conclusions We did

  16. Nightshift work and genome-wide DNA methylation.

    PubMed

    Bhatti, Parveen; Zhang, Yuzheng; Song, Xiaoling; Makar, Karen W; Sather, Cassandra L; Kelsey, Karl T; Houseman, E Andres; Wang, Pei

    2015-02-01

    The negative health effects of shift work, including carcinogenesis, may be mediated by changes in DNA methylation, particularly in the circadian genes. Using the Infinium HumanMethylation450 Bead Array (Illumina, San Diego, CA), we compared genome-wide methylation between 65 actively working dayshift workers and 59 actively working nightshift workers in the healthcare industry. A total of 473 800 loci, including 391 loci across the 12 core circadian genes, were analyzed to identify methylation markers associated with shift work status using linear regression models adjusted for gender, age, body mass index, race, smoking status and leukocyte cell profile as measured by flow cytometry. Analyses at the level of gene, CpG island and gene region were also conducted. To account for multiple comparisons, we controlled the false discovery rate (FDR ≤0.05). Significant differences between nightshift and dayshift workers were found at 16 135 of 473 800 loci, across 3769 of 20 164 genes, across 7173 of 22 721 CpG islands and across 5508 of 51 843 gene regions. For each significant loci, gene, CpG island or gene region, average methylation was consistently found to be decreased among nightshift workers compared to dayshift workers. Twenty-one loci located in the circadian genes were also found to be significantly hypomethylated among nightshift workers. The largest differences were observed for three loci located in the gene body of PER3. A total of nine significant loci were found in the CSNK1E gene, most of which were located in a CpG island and near the transcription start site of the gene. Methylation changes in these circadian genes may lead to altered expression of these genes which has been associated with cancer in previous studies. Gene ontology enrichment analysis revealed that among the significantly hypomethylated genes, processes related to host defense and immunity were represented. Our results indicate that the health effects of shift work may be

  17. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  18. Genome-wide screening and identification of antigens for rickettsial vaccine development

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The capacity to identify immunogens for vaccine development by genome-wide screening has been markedly enhanced by the availability of complete microbial genome sequences coupled to rapid proteomic and bioinformatic analysis. Critical to this genome-wide screening is in vivo testing in the context o...

  19. Case-Control Genome-Wide Association Study of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J. L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schafer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus additional genome-wide association studies (GWAS) are needed. Method: We used case-control analyses of 896 cases…

  20. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2010-01-01

    Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…

  1. More heritability probably captured by psoriasis genome-wide association study in Han Chinese.

    PubMed

    Jiang, Long; Liu, Lu; Cheng, Yuyan; Lin, Yan; Shen, Changbing; Zhu, Caihong; Yang, Sen; Yin, Xianyong; Zhang, Xuejun

    2015-11-15

    Missing heritability is a common problem in genome-wide association studies in complex diseases/traits. To quantify the unbiased heritability estimate, we applied the phenotype correlation-genotype correlation regression in psoriasis genome-wide association data in Han Chinese which comprises 1139 cases and 1132 controls. We estimated that 45.7% heritability of psoriasis in Han Chinese were captured by common variants (s.e.=12.5%), which reinforced that the majority of psoriasis heritability can be covered by common variants in genome-wide association data (68.2%). The results provided evidence that the heritability covered by psoriasis genome-wide genotyping data was probably underestimated in previous restricted maximum likelihood method. Our study highlights the broad role of common variants in the etiology of psoriasis and sheds light on the possibility to identify more common variants of small effect by increasing the sample size in psoriasis genome-wide association studies.

  2. No Genome-Wide Protein Sequence Convergence for Echolocation

    PubMed Central

    Zou, Zhengting; Zhang, Jianzhi

    2015-01-01

    Toothed whales and two groups of bats independently acquired echolocation, the ability to locate and identify objects by reflected sound. Echolocation requires physiologically complex and coordinated vocal, auditory, and neural functions, but the molecular basis of the capacity for echolocation is not well understood. A recent study suggested that convergent amino acid substitutions widespread in the proteins of echolocators underlay the convergent origins of mammalian echolocation. Here, we show that genomic signatures of molecular convergence between echolocating lineages are generally no stronger than those between echolocating and comparable nonecholocating lineages. The same is true for the group of 29 hearing-related proteins claimed to be enriched with molecular convergence. Reexamining the previous selection test reveals several flaws and invalidates the asserted evidence for adaptive convergence. Together, these findings indicate that the reported genomic signatures of convergence largely reflect the background level of sequence convergence unrelated to the origins of echolocation. PMID:25631925

  3. No genome-wide protein sequence convergence for echolocation.

    PubMed

    Zou, Zhengting; Zhang, Jianzhi

    2015-05-01

    Toothed whales and two groups of bats independently acquired echolocation, the ability to locate and identify objects by reflected sound. Echolocation requires physiologically complex and coordinated vocal, auditory, and neural functions, but the molecular basis of the capacity for echolocation is not well understood. A recent study suggested that convergent amino acid substitutions widespread in the proteins of echolocators underlay the convergent origins of mammalian echolocation. Here, we show that genomic signatures of molecular convergence between echolocating lineages are generally no stronger than those between echolocating and comparable nonecholocating lineages. The same is true for the group of 29 hearing-related proteins claimed to be enriched with molecular convergence. Reexamining the previous selection test reveals several flaws and invalidates the asserted evidence for adaptive convergence. Together, these findings indicate that the reported genomic signatures of convergence largely reflect the background level of sequence convergence unrelated to the origins of echolocation.

  4. Genome-wide gene expression and DNA methylation differences in abnormally cloned and normally natural mating piglets.

    PubMed

    Zou, C; Fu, Y; Li, C; Liu, H; Li, G; Li, J; Zhang, H; Wu, Y; Li, C

    2016-08-01

    Many studies have proved that DNA methylation can regulate gene expression and further affect skeletal muscle growth and development of pig, whereas the mechanisms of how DNA methylation or gene expression alteration ultimately lead to phenotypical differences between the cloned and natural mating pigs remain elusive. This study aimed to investigate genome-wide gene expression and DNA methylation differences between abnormally cloned and normally natural mating piglets and identify molecular markers related to skeletal muscle growth and development in pig. The DNA methylation and genome-wide gene expression in the two groups of piglets were analysed through methylated DNA immunoprecipitation binding high-throughput sequencing and RNA sequencing respectively. We detected 1493 differentially expressed genes between the two groups, of which 382 genes were also differentially methylated. The results of the integrative analysis between DNA methylation and gene expression revealed that the DNA methylation levels showed a significantly negative and monotonic correlation with gene expression levels around the transcription start site of genes. By contrast, no notable monotonic correlation was observed in other regions. Furthermore, we identified some interesting genes and signalling pathways (e.g. myosin, heavy chain 7 and mammalian target of rapamycin) which possibly play essential roles in skeletal muscle growth and development. The results of this study provide insights into the relationship of DNA methylation with gene expression in newborn piglets and into the mechanisms in abnormally cloned animals through somatic cell nuclear transfer.

  5. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis

    DTIC Science & Technology

    2011-09-01

    were down-selected and successfully genotyped for whole genome (WG) single nucleotide polymorphism (SNP) markers by means of the Affymetrix Canine...SUBJECT TERMS Military working dog genome-wide association study genetic marker intelligence... marker , intelligence, Canine Intelligence Testing Protocol, classification technique, clustering analysis Technical Report: September 2011 2

  6. Pervasive, genome-wide positive selection leading to functional divergence in the bacterial genus Campylobacter

    PubMed Central

    Lefébure, Tristan; Stanhope, Michael J.

    2009-01-01

    An open question in bacterial genomics is the role that adaptive evolution of the core genome plays in diversification and adaptation of bacterial species, and how this might differ between groups of bacteria occupying different environmental circumstances. The genus Campylobacter encompasses several important human and animal enteric pathogens, with genome sequence data available for eight species. We estimate the Campylobacter core genome at 647 genes, with 92.5% of the nonrecombinant core genome loci under positive selection in at least one lineage and the same gene frequently under positive selection in multiple lineages. Tests are provided that reject recombination, saturation, and variation in codon usage bias as factors contributing to this high level of selection. We suggest this genome-wide adaptive evolution may result from a Red Queen macroevolutionary dynamic, in which species are involved in competition for resources within the mammalian and/or vertebrate gastrointestinal tract. Much reduced levels of positive selection evident in Streptococcus, as reported by the authors in an earlier work, may be a consequence of these taxa inhabiting less species-rich habitats, and more unique niches. Despite many common loci under positive selection in multiple Campylobacter lineages, we found no evidence for molecular adaptive convergence at the level of the same or adjacent codons, or even protein domains. Taken collectively, these results describe the diversification of a bacterial genus that involves pervasive natural selection pressure across virtually the entire genome, with this adaptation occurring in different ways in different lineages, despite the species tendency toward a common gastrointestinal habitat. PMID:19304960

  7. Impact of high predation risk on genome-wide hippocampal gene expression in snowshoe hares.

    PubMed

    Lavergne, Sophia G; McGowan, Patrick O; Krebs, Charles J; Boonstra, Rudy

    2014-11-01

    The population dynamics of snowshoe hares (Lepus americanus) are fundamental to the ecosystem dynamics of Canada's boreal forest. During the 8- to 11-year population cycle, hare densities can fluctuate up to 40-fold. Predators in this system (lynx, coyotes, great-horned owls) affect population numbers not only through direct mortality but also through sublethal effects. The chronic stress hypothesis posits that high predation risk during the decline severely stresses hares, leading to greater stress responses, heightened ability to mobilize cortisol and energy, and a poorer body condition. These effects may result in, or be mediated by, differential gene expression. We used an oligonucleotide microarray designed for a closely-related species, the European rabbit (Oryctolagus cuniculus), to characterize differences in genome-wide hippocampal RNA transcript abundance in wild hares from the Yukon during peak and decline phases of a single cycle. A total of 106 genes were differentially regulated between phases. Array results were validated with quantitative real-time PCR, and mammalian protein sequence similarity was used to infer gene function. In comparison to hares from the peak, decline phase hares showed increased expression of genes involved in metabolic processes and hormone response, and decreased expression of immune response and blood cell formation genes. We found evidence for predation risk effects on the expression of genes whose putative functions correspond with physiological impacts known to be induced by predation risk in snowshoe hares. This study shows, for the first time, a link between changes in demography and alterations in neural RNA transcript abundance in a natural population.

  8. Genome-Wide Inference of Ancestral Recombination Graphs

    PubMed Central

    Rasmussen, Matthew D.; Hubisz, Melissa J.; Gronau, Ilan; Siepel, Adam

    2014-01-01

    The complex correlation structure of a collection of orthologous DNA sequences is uniquely captured by the “ancestral recombination graph” (ARG), a complete record of coalescence and recombination events in the history of the sample. However, existing methods for ARG inference are computationally intensive, highly approximate, or limited to small numbers of sequences, and, as a consequence, explicit ARG inference is rarely used in applied population genomics. Here, we introduce a new algorithm for ARG inference that is efficient enough to apply to dozens of complete mammalian genomes. The key idea of our approach is to sample an ARG of chromosomes conditional on an ARG of chromosomes, an operation we call “threading.” Using techniques based on hidden Markov models, we can perform this threading operation exactly, up to the assumptions of the sequentially Markov coalescent and a discretization of time. An extension allows for threading of subtrees instead of individual sequences. Repeated application of these threading operations results in highly efficient Markov chain Monte Carlo samplers for ARGs. We have implemented these methods in a computer program called ARGweaver. Experiments with simulated data indicate that ARGweaver converges rapidly to the posterior distribution over ARGs and is effective in recovering various features of the ARG for dozens of sequences generated under realistic parameters for human populations. In applications of ARGweaver to 54 human genome sequences from Complete Genomics, we find clear signatures of natural selection, including regions of unusually ancient ancestry associated with balancing selection and reductions in allele age in sites under directional selection. The patterns we observe near protein-coding genes are consistent with a primary influence from background selection rather than hitchhiking, although we cannot rule out a contribution from recurrent selective sweeps. PMID:24831947

  9. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Development of Advanced Classification Algorithm for Genome-Wide Single Nucleotide Polymorphism (SNP) Data Analysis

    DTIC Science & Technology

    2011-04-01

    distribution unlimited. QC – quality control QTL – quantitative trait loci SNP – single nucleotide polymorphism TE – Tris + EDTA TBE – Tris + Boric Acid + EDTA WGSA – whole genome sampling assay ...canine intelligence testing protocol EDTA – ethylenediaminetetraacetic acid GWAS – genome-wide association study LD – linkage disequilibrium MWD

  10. Genome-wide differentiation of various melon horticultural groups for use in genome wide association study for fruit firmness and construction of a high resolution genetic map

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We generated 13,789 single nucleotide plymorphism (SNP) markers from 97 melon accessions using genotyping by sequencing and anchored them to chromosomes to understand genome-wide fixation index between various melon morphotypes and linkage disequilibrium (LD) decay for inodorus and cantalupensis, th...

  11. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identi...

  12. Efficient multivariate linear mixed model algorithms for genome-wide association studies.

    PubMed

    Zhou, Xiang; Stephens, Matthew

    2014-04-01

    Multivariate linear mixed models (mvLMMs) are powerful tools for testing associations between single-nucleotide polymorphisms and multiple correlated phenotypes while controlling for population stratification in genome-wide association studies. We present efficient algorithms in the genome-wide efficient mixed model association (GEMMA) software for fitting mvLMMs and computing likelihood ratio tests. These algorithms offer improved computation speed, power and P-value calibration over existing methods, and can deal with more than two phenotypes.

  13. Genome-wide association mapping in plants exemplified for root growth in Arabidopsis thaliana.

    PubMed

    Slovak, Radka; Göschl, Christian; Seren, Ümit; Busch, Wolfgang

    2015-01-01

    Genome-wide association (GWA) mapping is a powerful technique to address the molecular basis of genotype to phenotype relationships and to map regulators of biological processes. This chapter presents a protocol for genome-wide association mapping in Arabidopsis thaliana using the user-friendly internet application GWAPP, and provides a specific protocol for acquiring root trait data suitable for GWA studies using the semi-automated, high-throughput phenotyping pipeline BRAT for early root growth.

  14. Reproduction and In-Depth Evaluation of Genome-Wide Association Studies and Genome-Wide Meta-analyses Using Summary Statistics

    PubMed Central

    Niu, Yao-Fang; Ye, Chengyin; He, Ji; Han, Fang; Guo, Long-Biao; Zheng, Hou-Feng; Chen, Guo-Bo

    2017-01-01

    In line with open-source genetics, we report a novel linear regression technique for genome-wide association studies (GWAS), called Open GWAS algoriTHm (OATH). When individual-level data are not available, OATH can not only completely reproduce reported results from an experimental model, but also recover underreported results from other alternative models with a different combination of nuisance parameters using naïve summary statistics (NSS). OATH can also reliably evaluate all reported results in-depth (e.g., p-value variance analysis), as demonstrated for 42 Arabidopsis phenotypes under three magnesium (Mg) conditions. In addition, OATH can be used for consortium-driven genome-wide association meta-analyses (GWAMA), and can greatly improve the flexibility of GWAMA. A prototype of OATH is available in the Genetic Analysis Repository (https://github.com/gc5k/GEAR). PMID:28122950

  15. Genome-wide association study for the level of serum electrolytes in Italian Large White pigs.

    PubMed

    Bovo, S; Schiavo, G; Mazzoni, G; Dall'Olio, S; Galimberti, G; Calò, D G; Scotti, E; Bertolini, F; Buttazzoni, L; Samorè, A B; Fontanesi, L

    2016-10-01

    Calcium, magnesium and phosphorus are essential electrolytes involved in a large number of biological processes. Imbalance of these minerals in blood may indicate clinically relevant conditions and are important in inferring acute or chronic pathologies in humans and animals. In this work, we carried out a genome-wide association study (GWAS) for the level of these three electrolytes in the serum of 843 performance-tested Italian Large White pigs. All pigs were genotyped with the Illumina PorcineSNP60 BeadChip, and GWAS was carried out using genome-wide efficient mixed-model association. For the level of Ca(2+) , eight single nucleotide polymorphisms (SNPs) were significant, considering a false discovery rate (FDR) < 0.05, and another eight were above the moderate association threshold (Pnominal value  < 5.00E-05). These SNPs are distributed in four porcine chromosomes (SSC): SSC8, SSC11, SSC12 and SSC13. In particular, a few putative different signals of association detected on SSC13 and one on SSC12 were in genes or close to genes involved in calcium metabolism (P2RY1, RAP2B, SLC9A9, C3orf58, TSC22D2, PLCH1 and CACNB1). Only one SNP (on SSC7) and six SNPs (on SSC2 and SSC7) showed moderate association with the level of magnesium and phosphorus respectively. The association signals for these two latter minerals might identify genes not known thus far for playing a role in their biological functions and regulations. In conclusion, our GWAS contributed to increased knowledge on the role that calcium, magnesium and phosphorus may play in the genetically determined physiological mechanisms affecting the natural variability of mineral levels in mammalian blood.

  16. An Efficient Resampling Method for Assessing Genome-Wide Statistical Significance in Mapping Quantitative Trait Loci

    PubMed Central

    Zou, Fei; Fine, Jason P.; Hu, Jianhua; Lin, D. Y.

    2004-01-01

    Assessing genome-wide statistical significance is an important and difficult problem in multipoint linkage analysis. Due to multiple tests on the same genome, the usual pointwise significance level based on the chi-square approximation is inappropriate. Permutation is widely used to determine genome-wide significance. Theoretical approximations are available for simple experimental crosses. In this article, we propose a resampling procedure to assess the significance of genome-wide QTL mapping for experimental crosses. The proposed method is computationally much less intensive than the permutation procedure (in the order of 102 or higher) and is applicable to complex breeding designs and sophisticated genetic models that cannot be handled by the permutation and theoretical methods. The usefulness of the proposed method is demonstrated through simulation studies and an application to a Drosophila backcross. PMID:15611194

  17. Genome-wide efficient mixed-model analysis for association studies.

    PubMed

    Zhou, Xiang; Stephens, Matthew

    2012-06-17

    Linear mixed models have attracted considerable attention recently as a powerful and effective tool for accounting for population stratification and relatedness in genetic association tests. However, existing methods for exact computation of standard test statistics are computationally impractical for even moderate-sized genome-wide association studies. To address this issue, several approximate methods have been proposed. Here, we present an efficient exact method, which we refer to as genome-wide efficient mixed-model association (GEMMA), that makes approximations unnecessary in many contexts. This method is approximately n times faster than the widely used exact method known as efficient mixed-model association (EMMA), where n is the sample size, making exact genome-wide association analysis computationally practical for large numbers of individuals.

  18. Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs

    PubMed Central

    Krapohl, E; Plomin, R

    2016-01-01

    One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES. PMID:25754083

  19. Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs.

    PubMed

    Krapohl, E; Plomin, R

    2016-03-01

    One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES.

  20. High quality genome-wide genotyping from archived dried blood spots without DNA amplification.

    PubMed

    St Julien, Krystal R; Jelliffe-Pawlowski, Laura L; Shaw, Gary M; Stevenson, David K; O'Brodovich, Hugh M; Krasnow, Mark A

    2013-01-01

    Spots of blood are routinely collected from newborn babies onto filter paper called Guthrie cards and used to screen for metabolic and genetic disorders. The archived dried blood spots are an important and precious resource for genomic research. Whole genome amplification of dried blood spot DNA has been used to provide DNA for genome-wide SNP genotyping. Here we describe a 96 well format procedure to extract DNA from a portion of a dried blood spot that provides sufficient unamplified genomic DNA for genome-wide single nucleotide polymorphism (SNP) genotyping. We show that SNP genotyping of the unamplified DNA is more robust than genotyping amplified dried blood spot DNA, is comparable in cost, and can be done with thousands of samples. This procedure can be used for genome-wide association studies and other large-scale genomic analyses that require robust, high-accuracy genotyping of dried blood spot DNA.

  1. Control selection options for genome-wide association studies in cohorts.

    PubMed

    Wacholder, Sholom; Rotunno, Melissa

    2009-03-01

    Investigators planning studies within cohorts have many options for choosing an efficient sampling design for genome-wide association and other molecular epidemiology studies. Consideration of person-year and proportional hazards analyses of full cohorts may add further insight into ramifications of different designs. Empirical evidence from genome-wide association studies can supplement intuition and simulations in comparing properties of various case-control designs within cohorts. Additional theoretical and empirical work, justification of sampling choice in publications, and consideration of context and scientific aims can improve designs and, thereby, increase the scientific value and cost effectiveness of future studies.

  2. Constitutional mosaic genome-wide uniparental disomy due to diploidisation: an unusual cancer-predisposing mechanism.

    PubMed

    Romanelli, Valeria; Nevado, Julián; Fraga, Mario; Trujillo, Alex Martín; Mori, Maria Ángeles; Fernández, Luis; Pérez de Nanclares, Guiomar; Martínez-Glez, Víctor; Pita, Guillermo; Meneses, Heloisa; Gracia, Ricardo; García-Miñaur, Sixto; García de Miguel, Purificación; Lecumberri, Beatriz; Rodríguez, José Ignacio; González Neira, Anna; Monk, David; Lapunzina, Pablo

    2011-03-01

    Molecular studies in a patient with Beckwith-Wiedemann syndrome phenotype who developed two different tumours revealed an unexpected observation of almost complete loss of heterozygosity of all chromosomes. It is shown, by means of numerous molecular methods, that the absence of maternal contribution in somatic cells is due to high-degree (∼ 85%) genome-wide paternal uniparental disomy (UPD). The observations indicate that the genome-wide UPD results from diploidisation, and have important implications for genetic counselling and tumour surveillance for the growing number of UPD associated imprinting disorders.

  3. Genome-wide approaches (GWA) in oral and craniofacial diseases research

    PubMed Central

    Kim, H; Gordon, S; Dionne, R

    2012-01-01

    Underlying molecular genetic mechanisms of diseases can be deciphered with unbiased strategies using recently developed technologies enabling genome-wide scale investigations. These technologies have been applied in scanning for genetic variations, gene expression profiles, and epigenetic changes for oral and craniofacial diseases. However, these approaches as applied to oral and craniofacial conditions are in the initial stages, and challenges remain to be overcome, including analysis of high throughput data and their interpretation. Here, we review methodology and studies using genome-wide approaches in oral and craniofacial diseases and suggest future directions. PMID:22913301

  4. genipe: an automated genome-wide imputation pipeline with automatic reporting and statistical tools.

    PubMed

    Lemieux Perreault, Louis-Philippe; Legault, Marc-André; Asselin, Géraldine; Dubé, Marie-Pierre

    2016-12-01

    Genotype imputation is now commonly performed following genome-wide genotyping experiments. Imputation increases the density of analyzed genotypes in the dataset, enabling fine-mapping across the genome. However, the process of imputation using the most recent publicly available reference datasets can require considerable computation power and the management of hundreds of large intermediate files. We have developed genipe, a complete genome-wide imputation pipeline which includes automatic reporting, imputed data indexing and management, and a suite of statistical tests for imputed data commonly used in genetic epidemiology (Sequence Kernel Association Test, Cox proportional hazards for survival analysis, and linear mixed models for repeated measurements in longitudinal studies).

  5. Genome-Wide Association Mapping of Root Traits in the Context of Plant Hormone Research.

    PubMed

    Ristova, Daniela; Busch, Wolfgang

    2017-01-01

    Genome-wide association (GWA) mapping is a powerful method for the identification of alleles that underlie quantitative traits. It enables one to understand how genetic variation translates into phenotypic variation. In particular, plant hormone signaling pathways play a key role in shaping phenotypes. This chapter presents a protocol for genome-wide association mapping of root traits of Arabidopsis thaliana in the context of hormone research. We describe a specific protocol for acquiring primary and lateral root trait data that is appropriate for GWA studies using FIJI (ImageJ), and subsequent GWA mapping using a user-friendly Internet application.

  6. Genome-wide DNA methylation analysis using massively parallel sequencing technologies.

    PubMed

    Suzuki, Masako; Greally, John M

    2013-01-01

    "Epigenetics" refers to a heritable change in transcriptional status without alteration in the primary nucleotide sequence. Epigenetics provides an extra layer of transcriptional control and plays a crucial role in normal development, as well as in pathological conditions. DNA methylation is one of the best known and well-studied epigenetic modifications. Genome-wide DNA methylation profiling has become recognized as a biologically and clinically important epigenomic assay. In this review, we discuss the strengths and weaknesses of the protocols for genome-wide DNA methylation profiling using massively parallel sequencing (MPS) techniques. We will also describe recently discovered DNA modifications, and the protocols to detect these modifications.

  7. Genome-Wide Association Study in African-Americans with Systemic Lupus Erythematosus

    DTIC Science & Technology

    2013-09-01

    Americans with Systemic Lupus Erythematosus PRINCIPAL INVESTIGATOR: John Harley, M.D., Ph.D...September 2012 – 31 August 2013 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Genome-Wide Association Study in African-Americans with Systemic Lupus ...SUPPLEMENTARY NOTES 14. ABSTRACT Systemic lupus erythematosus ( lupus ) is a potentially deadly systemic autoimmune disease that disproportionately

  8. Genome wide association analysis for seedling response traits to thermal stress in sorghum germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The sorghum association panel exhibited extensive variation for seedling traits under cold and heat stress. Genome-wide analyses identified thirty single nucleotide polymorphisms (SNPs) that were strongly associated with traits measured at seedling stage under cold stress and tagged genes that act a...

  9. Genome-Wide Association Study of Receptive Language Ability of 12-Year-Olds

    PubMed Central

    Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert

    2014-01-01

    Purpose Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a genome-wide association approach. Method The authors administered 4 Internet-based measures of receptive language (vocabulary, semantics, syntax, and pragmatics) to a sample of 2,329 twelve-year-olds for whom DNA and genome-wide genotyping were available. Nearly 700,000 single-nucleotide polymorphisms (SNPs) and 1 million imputed SNPs were included in a genome-wide association analysis of receptive language composite scores. Results No SNP associations met the demanding criterion of genome-wide significance that corrects for multiple testing across the genome (p < 5 × 10–8). The strongest SNP association did not replicate in an additional sample of 2,639 twelve-year-olds. Conclusions These results indicate that individual differences in receptive language ability in the general population do not reflect common genetic variants that account for more than 3% of the phenotypic variance. The search for genetic variants associated with language skill will require larger samples and additional methods to identify and functionally characterize the full spectrum of risk variants. PMID:24687471

  10. Genome-wide association study of maize identifies genes affecting leaf architecture

    Technology Transfer Automated Retrieval System (TEKTRAN)

    U.S. maize yield has increased eightfold in the past 80 years with half of the improvement attributed to genetics. Changes in maize leaf angle and size provided a basis for more efficient light capture as plant densities increased. Through a genome wide association study (GWAS) of the maize nested a...

  11. A population structure and genome-wide association analysis on the USDA soybean germplasm collection

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genotype-phenotype associations within the soybean (Glycine max) germplasm collection could provide valuable information on the frequency and distribution of alleles affecting economically important traits. Here we performed a genome-wide association study (GWAS) for seed protein and oil content in ...

  12. Mixed linear model approach adapted for genome-wide association studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mixed linear model (MLM) methods have proven useful in controlling for population structure and relatedness within genome-wide association studies. However, MLM-based methods can be computationally challenging for large datasets. We report a compression approach, called ‘compressed MLM,’ that decrea...

  13. Genome-wide association study of swine farrowing traits. Part II: Bayesian analysis of marker data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Reproductive efficiency has a great impact on the economic success of pork production. Number born alive (NBA) and average piglet birth weight (ABW) contribute greatly to reproductive efficiency. To better understand the underlying genetics of birth traits, a genome wide association study (GWAS) w...

  14. Genome-wide significant predictors of metabolites in the one-carbon metabolism pathway

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Low plasma B-vitamin levels and elevated homocysteine have been associated with cancer, cardiovascular disease, and neurodegenerative disorders. Common variants in FUT2 on chromosome 19q13 were associated with plasma vitamin B12 levels among women in a genome-wide association study (GWAS) in the Nur...

  15. Software engineering the mixed model for genome-wide association studies on large samples

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample siz...

  16. Implementing meta-analysis from genome-wide association studies for pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality plays an important role in the meat processing industry, thus different methodologies have been implemented to elucidate the genetic architecture of traits affecting meat quality. One of the most common and widely used approaches is to perform genome-wide association (GWA) studies. Howe...

  17. A genome-wide regulatory framework identifies maize Pericarp Color1 (P1) controlled genes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    P1 encodes an R2R3-MYB transcription factor responsible for the accumulation of insecticidal flavones in maize silks and red phlobaphene pigments in pericarps and other floral tissues. Using genome-wide expression analyses (RNA-Seq) in pericarps and silks of plants with contrasting P1 alleles combin...

  18. snpGeneSets: An R Package for Genome-Wide Study Annotation

    PubMed Central

    Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian

    2016-01-01

    Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048

  19. snpGeneSets: An R Package for Genome-Wide Study Annotation.

    PubMed

    Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian

    2016-12-07

    Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/.

  20. Genome-wide association mapping of partial resistance to Aphanomyces euteiches in pea

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide association mapping has recently emerged as a valuable approach to refine genetic basis of polygenic resistance to plant diseases, which are increasingly used in integrated strategies for durable crop protection. Aphanomyces euteiches is a soil borne pathogen of pea and other legumes wor...

  1. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

    Technology Transfer Automated Retrieval System (TEKTRAN)

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...

  2. Signatures of positive selection in East African Shorthorn Zebu: a genome-wide SNP analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...

  3. Genome-wide Association Analysis of Kernel Weight in Hard Winter Wheat

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Wheat kernel weight is an important and heritable component of wheat grain yield and a key predictor of flour extraction. Genome-wide association analysis was conducted to identify genomic regions associated with kernel weight and kernel weight environmental response in 8 trials of 299 hard winter ...

  4. Genome-Wide Association Study of Intelligence: Additive Effects of Novel Brain Expressed Genes

    ERIC Educational Resources Information Center

    Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.

    2012-01-01

    Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…

  5. Genome-Wide Association Study of Receptive Language Ability of 12-Year-Olds

    ERIC Educational Resources Information Center

    Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert

    2014-01-01

    Purpose: Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a "genome-wide association" approach. Method: The…

  6. Genome-wide association of meat quality traits and tenderness in swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality has a large impact on consumer preference and perception of eating quality. A genome-wide association was performed for pork quality traits [intramuscular fat (IMF)], slice shear force (SSF), color attributes, purge, cooking loss, and pH] from 531 to 1,237 records on barrows and gilts o...

  7. Implementing Meta-analysis for genome-wide association studies of pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality is a critical concern in the meat industry. Implementation of genome-wide association studies (GWA) allows identification of genomic regions that explain a substantial portion of the variation of relevant traits. It is also important to determine the consistency of results of GWA across...

  8. Meta-analysis of genome-wide association from genomic prediction models

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...

  9. Methods for meta-analysis of genome-wide association studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. For increasing N, results from different GWA can be combined in a meta-analysis (MA-...

  10. Meta-analysis of genome wide association studies for pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Given the importance of pork quality in the meat processing industry, genome-wide association studies were performed for eight meat quality traits and also, a meta-analysis (MA) of GWA was implemented combining independent results from pig populations. Data from three pig datasets (USMARC, Commercia...

  11. Genome-wide association as a means to understanding the mammary gland

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Next-generation sequencing and related technologies have facilitated the creation of enormous public databases that catalogue genomic variation. These databases have facilitated a variety of approaches to discover new genes that regulate normal biology as well as disease. Genome wide association (...

  12. A genome-wide association study platform built on iPlant cyber-infrastructure

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We demonstrated a flexible Genome-Wide Association (GWA) Study (GWAS) platform built upon the iPlant Collaborative Cyber-infrastructure. The platform supports big data management, sharing, and large scale study of both genotype and phenotype data on clusters. End users can add their own analysis too...

  13. Enhancing genomic prediction with genome-wide association studies in multiparental maize populations

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide association mapping using dense marker sets has identified some nucleotide variants affecting complex traits which have been validated with fine-mapping and functional analysis. Many sequence variants associated with complex traits in maize have small effects and low repeatability, howev...

  14. A genome-wide SNP panel for genetic diversity, mapping and breeding studies in rice

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide SNP resource was developed for rice using the GoldenGate assay and used to genotype 400 landrace accessions of O. sativa. SNPs were originally discovered using Perlegen re-sequencing technology in 20 diverse landraces of O. sativa as part of OryzaSNP project (http://irfgc.irri.org). An...

  15. A Genome-Wide Scan for Breast Cancer Risk Haplotypes among African American Women

    PubMed Central

    Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.

    2013-01-01

    Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962

  16. Genome-wide association analysis for drought tolerance and associated traits in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for drought tolerance and related traits in a Middle American diversity panel comprised of 96 common bean (Phaseolus vulgaris) genotypes. The panel grown under irrigated and rainfed conditions and single n...

  17. Genome-wide association study of agronomic traits in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) using a global Andean diversity panel (ADP) of 237 genotypes of common bean, Phaseolus vulgaris was conducted to gain insight into the genetic architecture of several agronomic traits controlling phenology, biomass, yield components and seed yield. The panel wa...

  18. Genome-wide CNV analysis reveals variants associated with growth traits in Bos indicus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Apart from single nucleotide polymorphism (SNP), copy number variation (CNV) is another important type of genetic variation, which may affect growth traits and play key roles for the production of beef cattle. To date, no genome-wide association study (GWAS) for CNV and body traits in be...

  19. CNV-based genome wide association study reveals additional variants contributing to meat quality in swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality is important both to the meat processing industry and consumers’ purchasing attitudes. Copy number variation (CNV) is a burgeoning kind of variant that may influence meat quality. Herein, a genome-wide association study (GWAS) was performed between CNVs and meat quality traits in swine....

  20. Genome wide search for variation associated with micronutrient density of developing rice grains

    Technology Transfer Automated Retrieval System (TEKTRAN)

    "Omic" tools are rapidly being employed to delineate the biological framework controlling phenotypes of interest in crop species. An advanced understanding of the genetic basis for quantitative trait variation has been made possible through genome wide association studies (GWAS) that make use of gen...

  1. Genome-wide association analysis of symbiotic nitrogen fixation in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for symbiotic nitrogen fixation (SNF) and related traits in the Andean diversity panel (ADP) comprised of 259 common bean (Phaseolus vulgaris) genotypes. The ADP was evaluated for SNF and related traits in...

  2. Gene-Environment Interactions in Genome-Wide Association Studies: Current Approaches and New Directions

    PubMed Central

    Winham, Stacey J; Biernacka, Joanna M.

    2013-01-01

    Background Complex psychiatric traits have long been thought to be the result of a combination of genetic and environmental factors, and gene-environment interactions are thought to play a crucial role in behavioral phenotypes and the susceptibility and progression of psychiatric disorders. Candidate gene studies to investigate hypothesized gene-environment interactions are now fairly common in human genetic research, and with the shift towards genome-wide association studies, genome-wide gene-environment interaction studies are beginning to emerge. Methods We summarize the basic ideas behind gene-environment interaction, and provide an overview of possible study designs and traditional analysis methods in the context of genome-wide analysis. We then discuss novel approaches beyond the traditional strategy of analyzing the interaction between the environmental factor and each polymorphism individually. Results Two-step filtering approaches that reduce the number of polymorphisms tested for interactions can substantially increase the power of genome-wide gene-environment studies. New analytical methods including data-mining approaches, and gene-level and pathway-level analyses, also have the capacity to improve our understanding of how complex genetic and environmental factors interact to influence psychological and psychiatric traits. Such methods, however, have not yet been utilized much in behavioral and mental health research. Conclusions Although methods to investigate gene-environment interactions are available, there is a need for further development and extension of these methods to identify gene-environment interactions in the context of genome-wide association studies. These novel approaches need to be applied in studies of psychology and psychiatry. PMID:23808649

  3. A genome-wide approach to children's aggressive behavior: The EAGLE consortium.

    PubMed

    Pappa, Irene; St Pourcain, Beate; Benke, Kelly; Cavadino, Alana; Hakulinen, Christian; Nivard, Michel G; Nolte, Ilja M; Tiesler, Carla M T; Bakermans-Kranenburg, Marian J; Davies, Gareth E; Evans, David M; Geoffroy, Marie-Claude; Grallert, Harald; Groen-Blokhuis, Maria M; Hudziak, James J; Kemp, John P; Keltikangas-Järvinen, Liisa; McMahon, George; Mileva-Seitz, Viara R; Motazedi, Ehsan; Power, Christine; Raitakari, Olli T; Ring, Susan M; Rivadeneira, Fernando; Rodriguez, Alina; Scheet, Paul A; Seppälä, Ilkka; Snieder, Harold; Standl, Marie; Thiering, Elisabeth; Timpson, Nicholas J; Veenstra, René; Velders, Fleur P; Whitehouse, Andrew J O; Smith, George Davey; Heinrich, Joachim; Hypponen, Elina; Lehtimäki, Terho; Middeldorp, Christel M; Oldehinkel, Albertine J; Pennell, Craig E; Boomsma, Dorret I; Tiemeier, Henning

    2016-07-01

    Individual differences in aggressive behavior emerge in early childhood and predict persisting behavioral problems and disorders. Studies of antisocial and severe aggression in adulthood indicate substantial underlying biology. However, little attention has been given to genome-wide approaches of aggressive behavior in children. We analyzed data from nine population-based studies and assessed aggressive behavior using well-validated parent-reported questionnaires. This is the largest sample exploring children's aggressive behavior to date (N = 18,988), with measures in two developmental stages (N = 15,668 early childhood and N = 16,311 middle childhood/early adolescence). First, we estimated the additive genetic variance of children's aggressive behavior based on genome-wide SNP information, using genome-wide complex trait analysis (GCTA). Second, genetic associations within each study were assessed using a quasi-Poisson regression approach, capturing the highly right-skewed distribution of aggressive behavior. Third, we performed meta-analyses of genome-wide associations for both the total age-mixed sample and the two developmental stages. Finally, we performed a gene-based test using the summary statistics of the total sample. GCTA quantified variance tagged by common SNPs (10-54%). The meta-analysis of the total sample identified one region in chromosome 2 (2p12) at near genome-wide significance (top SNP rs11126630, P = 5.30 × 10(-8) ). The separate meta-analyses of the two developmental stages revealed suggestive evidence of association at the same locus. The gene-based analysis indicated association of variation within AVPR1A with aggressive behavior. We conclude that common variants at 2p12 show suggestive evidence for association with childhood aggression. Replication of these initial findings is needed, and further studies should clarify its biological meaning. © 2015 Wiley Periodicals, Inc.

  4. Genome-wide meta-analysis of cerebral white matter hyperintensities in patients with stroke

    PubMed Central

    Zhang, Cathy R.; Adib-Samii, Poneh; Devan, William J.; Parsons, Owen E.; Lanfranconi, Silvia; Gregory, Sarah; Cloonan, Lisa; Falcone, Guido J.; Radmanesh, Farid; Fitzpatrick, Kaitlin; Kanakis, Allison; Barrick, Thomas R.; Moynihan, Barry; Lewis, Cathryn M.; Boncoraglio, Giorgio B.; Lemmens, Robin; Thijs, Vincent; Sudlow, Cathie; Wardlaw, Joanna; Rothwell, Peter M.; Meschia, James F.; Worrall, Bradford B.; Levi, Christopher; Bevan, Steve; Furie, Karen L.; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.; Rost, Natalia

    2016-01-01

    Objective: For 3,670 stroke patients from the United Kingdom, United States, Australia, Belgium, and Italy, we performed a genome-wide meta-analysis of white matter hyperintensity volumes (WMHV) on data imputed to the 1000 Genomes reference dataset to provide insights into disease mechanisms. Methods: We first sought to identify genetic associations with white matter hyperintensities in a stroke population, and then examined whether genetic loci previously linked to WMHV in community populations are also associated in stroke patients. Having established that genetic associations are shared between the 2 populations, we performed a meta-analysis testing which associations with WMHV in stroke-free populations are associated overall when combined with stroke populations. Results: There were no associations at genome-wide significance with WMHV in stroke patients. All previously reported genome-wide significant associations with WMHV in community populations shared direction of effect in stroke patients. In a meta-analysis of the genome-wide significant and suggestive loci (p < 5 × 10−6) from community populations (15 single nucleotide polymorphisms in total) and from stroke patients, 6 independent loci were associated with WMHV in both populations. Four of these are novel associations at the genome-wide level (rs72934505 [NBEAL1], p = 2.2 × 10−8; rs941898 [EVL], p = 4.0 × 10−8; rs962888 [C1QL1], p = 1.1 × 10−8; rs9515201 [COL4A2], p = 6.9 × 10−9). Conclusions: Genetic associations with WMHV are shared in otherwise healthy individuals and patients with stroke, indicating common genetic susceptibility in cerebral small vessel disease. PMID:26674333

  5. A genome-wide search for type 2 diabetes susceptibility genes in Utah Caucasians.

    PubMed

    Elbein, S C; Hoffman, M D; Teng, K; Leppert, M F; Hasstedt, S J

    1999-05-01

    Considerable evidence supports a major inherited component of type 2 diabetes. We initially conducted a genome-wide scan with 440 microsatellite markers at 10-cM intervals in 19 multigenerational families of Northern European ancestry with at least two diabetic siblings. Initial two-point analyses of these families directed marker typing of 23 additional families. Subsequently, all available marker data on the total of 42 families were analyzed using both parametric and nonparametric multipoint methods to test for linkage to type 2 diabetes. One locus on chromosome 1q21-1q23 met genome-wide criteria for significant linkage under a model of recessive inheritance with a common diabetes allele (logarithm of odds [LOD] = 4.295). Both pedigree-based nonparametric linkage (NPL) analysis and affected sib pair (MAPMAKER/SIBS) nonparametric methods also showed the highest genome-wide scores at this region, near markers CRP and APOA2, but failed to meet levels of genome-wide significance. The risk of type 2 diabetes to siblings of a diabetic person when compared with the population (lambdaS) was estimated from MAPMAKER/SIBS to be 2.8 in these 42 families. Simulation studies using study data confirmed a genome-wide significance level of P<0.05 (95% CI 0.005-0.0466). However, analysis of 20 similarly ascertained but smaller families failed to confirm this linkage. The LOD score with 50% heterogeneity for all 62 families considered together was only 2.25, with an estimated lambdaS of 1.87. Our data suggest a novel diabetes susceptibility locus near APOA2 on chromosome 1 in a region with many transcribed genes.

  6. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population

    PubMed Central

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene–environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10−8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621

  7. Genome-wide identification of Fas/CD95 alternative splicing regulators reveals links with iron homeostasis.

    PubMed

    Tejedor, J Ramón; Papasaikas, Panagiotis; Valcárcel, Juan

    2015-01-08

    Alternative splicing of Fas/CD95 exon 6 generates either a membrane-bound receptor that promotes, or a soluble isoform that inhibits, apoptosis. Using an automatized genome-wide siRNA screening for alternative splicing regulators of endogenous transcripts in mammalian cells, we identified 200 genes whose knockdown modulates the ratio between Fas/CD95 isoforms. These include classical splicing regulators; core spliceosome components; and factors implicated in transcription and chromatin remodeling, RNA transport, intracellular signaling, and metabolic control. Coherent effects of genes involved in iron homeostasis and pharmacological modulation of iron levels revealed a link between intracellular iron and Fas/CD95 exon 6 inclusion. A splicing regulatory network linked iron levels with reduced activity of the Zinc-finger-containing splicing regulator SRSF7, and in vivo and in vitro assays revealed that iron inhibits SRSF7 RNA binding. Our results uncover numerous links between cellular pathways and RNA processing and a mechanism by which iron homeostasis can influence alternative splicing.

  8. A genome-wide screen identifies conserved protein hubs required for cadherin-mediated cell–cell adhesion

    PubMed Central

    Toret, Christopher P.; D’Ambrosio, Michael V.; Vale, Ronald D.; Simon, Michael A.

    2014-01-01

    Cadherins and associated catenins provide an important structural interface between neighboring cells, the actin cytoskeleton, and intracellular signaling pathways in a variety of cell types throughout the Metazoa. However, the full inventory of the proteins and pathways required for cadherin-mediated adhesion has not been established. To this end, we completed a genome-wide (∼14,000 genes) ribonucleic acid interference (RNAi) screen that targeted Ca2+-dependent adhesion in DE-cadherin–expressing Drosophila melanogaster S2 cells in suspension culture. This novel screen eliminated Ca2+-independent cell–cell adhesion, integrin-based adhesion, cell spreading, and cell migration. We identified 17 interconnected regulatory hubs, based on protein functions and protein–protein interactions that regulate the levels of the core cadherin–catenin complex and coordinate cadherin-mediated cell–cell adhesion. Representative proteins from these hubs were analyzed further in Drosophila oogenesis, using targeted germline RNAi, and adhesion was analyzed in Madin–Darby canine kidney mammalian epithelial cell–cell adhesion. These experiments reveal roles for a diversity of cellular pathways that are required for cadherin function in Metazoa, including cytoskeleton organization, cell–substrate interactions, and nuclear and cytoplasmic signaling. PMID:24446484

  9. A genome-wide RNA interference screen in Drosophila melanogaster cells for new components of the Hh signaling pathway.

    PubMed

    Nybakken, Kent; Vokes, Steven A; Lin, Ting-Yi; McMahon, Andrew P; Perrimon, Norbert

    2005-12-01

    Members of the Hedgehog (Hh) family of signaling proteins are powerful regulators of developmental processes in many organisms and have been implicated in many human disease states. Here we report the results of a genome-wide RNA interference screen in Drosophila melanogaster cells for new components of the Hh signaling pathway. The screen identified hundreds of potential new regulators of Hh signaling, including many large protein complexes with pleiotropic effects, such as the coat protein complex I (COPI) complex, the ribosome and the proteasome. We identified the multimeric protein phosphatase 2A (PP2A) and two new kinases, the D. melanogaster orthologs of the vertebrate PITSLRE and cyclin-dependent kinase-9 (CDK9) kinases, as Hh regulators. We also identified a large group of constitutive and alternative splicing factors, two nucleoporins involved in mRNA export and several RNA-regulatory proteins as potent regulators of Hh signal transduction, indicating that splicing regulation and mRNA transport have a previously unrecognized role in Hh signaling. Finally, we showed that several of these genes have conserved roles in mammalian Hh signaling.

  10. Genome-wide association of multiple complex traits in outbred mice by ultra-low-coverage sequencing.

    PubMed

    Nicod, Jérôme; Davies, Robert W; Cai, Na; Hassett, Carl; Goodstadt, Leo; Cosgrove, Cormac; Yee, Benjamin K; Lionikaite, Vikte; McIntyre, Rebecca E; Remme, Carol Ann; Lodder, Elisabeth M; Gregory, Jennifer S; Hough, Tertius; Joynson, Russell; Phelps, Hayley; Nell, Barbara; Rowe, Clare; Wood, Joe; Walling, Alison; Bopp, Nasrin; Bhomra, Amarjit; Hernandez-Pliego, Polinka; Callebert, Jacques; Aspden, Richard M; Talbot, Nick P; Robbins, Peter A; Harrison, Mark; Fray, Martin; Launay, Jean-Marie; Pinto, Yigal M; Blizard, David A; Bezzina, Connie R; Adams, David J; Franken, Paul; Weaver, Tom; Wells, Sara; Brown, Steve D M; Potter, Paul K; Klenerman, Paul; Lionikas, Arimantas; Mott, Richard; Flint, Jonathan

    2016-08-01

    Two bottlenecks impeding the genetic analysis of complex traits in rodents are access to mapping populations able to deliver gene-level mapping resolution and the need for population-specific genotyping arrays and haplotype reference panels. Here we combine low-coverage (0.15×) sequencing with a new method to impute the ancestral haplotype space in 1,887 commercially available outbred mice. We mapped 156 unique quantitative trait loci for 92 phenotypes at a 5% false discovery rate. Gene-level mapping resolution was achieved at about one-fifth of the loci, implicating Unc13c and Pgc1a at loci for the quality of sleep, Adarb2 for home cage activity, Rtkn2 for intensity of reaction to startle, Bmp2 for wound healing, Il15 and Id2 for several T cell measures and Prkca for bone mineral content. These findings have implications for diverse areas of mammalian biology and demonstrate how genome-wide association studies can be extended via low-coverage sequencing to species with highly recombinant outbred populations.

  11. ChEA: transcription factor regulation inferred from integrating genome-wide ChIP-X experiments

    PubMed Central

    Lachmann, Alexander; Xu, Huilei; Krishnan, Jayanth; Berger, Seth I.; Mazloom, Amin R.; Ma'ayan, Avi

    2010-01-01

    Motivation: Experiments such as ChIP-chip, ChIP-seq, ChIP-PET and DamID (the four methods referred herein as ChIP-X) are used to profile the binding of transcription factors to DNA at a genome-wide scale. Such experiments provide hundreds to thousands of potential binding sites for a given transcription factor in proximity to gene coding regions. Results: In order to integrate data from such studies and utilize it for further biological discovery, we collected interactions from such experiments to construct a mammalian ChIP-X database. The database contains 189 933 interactions, manually extracted from 87 publications, describing the binding of 92 transcription factors to 31 932 target genes. We used the database to analyze mRNA expression data where we perform gene-list enrichment analysis using the ChIP-X database as the prior biological knowledge gene-list library. The system is delivered as a web-based interactive application called ChIP Enrichment Analysis (ChEA). With ChEA, users can input lists of mammalian gene symbols for which the program computes over-representation of transcription factor targets from the ChIP-X database. The ChEA database allowed us to reconstruct an initial network of transcription factors connected based on shared overlapping targets and binding site proximity. To demonstrate the utility of ChEA we present three case studies. We show how by combining the Connectivity Map (CMAP) with ChEA, we can rank pairs of compounds to be used to target specific transcription factor activity in cancer cells. Availability: The ChEA software and ChIP-X database is freely available online at: http://amp.pharm.mssm.edu/lib/chea.jsp Contact: avi.maayan@mssm.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20709693

  12. A guide to genome-wide association analysis and post-analytic interrogation.

    PubMed

    Reed, Eric; Nunez, Sara; Kulp, David; Qian, Jing; Reilly, Muredach P; Foulkes, Andrea S

    2015-12-10

    This tutorial is a learning resource that outlines the basic process and provides specific software tools for implementing a complete genome-wide association analysis. Approaches to post-analytic visualization and interrogation of potentially novel findings are also presented. Applications are illustrated using the free and open-source R statistical computing and graphics software environment, Bioconductor software for bioinformatics and the UCSC Genome Browser. Complete genome-wide association data on 1401 individuals across 861,473 typed single nucleotide polymorphisms from the PennCATH study of coronary artery disease are used for illustration. All data and code, as well as additional instructional resources, are publicly available through the Open Resources in Statistical Genomics project: http://www.stat-gen.org.

  13. Quality control and conduct of genome-wide association meta-analyses.

    PubMed

    Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Mägi, Reedik; Ferreira, Teresa; Fall, Tove; Graff, Mariaelisa; Justice, Anne E; Luan, Jian'an; Gustafsson, Stefan; Randall, Joshua C; Vedantam, Sailaja; Workalemahu, Tsegaselassie; Kilpeläinen, Tuomas O; Scherag, André; Esko, Tonu; Kutalik, Zoltán; Heid, Iris M; Loos, Ruth J F

    2014-05-01

    Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable size to the GIANT Consortium, this protocol takes a minimum of about 10 months to complete.

  14. High-resolution, genome-wide mapping of chromatin modifications by GMAT.

    PubMed

    Roh, Tae-Young; Zhao, Keji

    2008-01-01

    One major postgenomic challenge is to characterize the epigenomes that control genome functions. The epigenomes are mainly defined by the specific association of nonhistone proteins with chromatin and the covalent modifications of chromatin, including DNA methylation and posttranslational histone modifications. The in vivo protein-binding and chromatin-modification patterns can be revealed by the chromatin immunoprecipitation assay (ChIP). By combining the ChIP assays and the serial analysis of gene expression (SAGE) protocols, we have developed an unbiased and high-resolution genome-wide mapping technique (GMAT) to determine the genome-wide protein-targeting and chromatin-modification patterns. GMAT has been successfully applied to mapping the target sites of the histone acetyltransferase, Gcn5p, in yeast and to the discovery of the histone acetylation islands as an epigenetic mark for functional regulatory elements in the human genome.

  15. Quality control and conduct of genome-wide association meta-analyses

    PubMed Central

    Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Mägi, Reedik; Ferreira, Teresa; Fall, Tove; Graff, Mariaelisa; Justice, Anne E; Luan, Jian'an; Gustafsson, Stefan; Randall, Joshua C; Vedantam, Sailaja; Workalemahu, Tsegaselassie; Kilpeläinen, Tuomas O; Scherag, André; Esko, Tonu; Kutalik, Zoltán; Heid, Iris M; Loos, Ruth JF

    2014-01-01

    Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for [1] organizational aspects of GWAMAs, and for [2] QC at the study file level, the meta-level across studies, and the meta-analysis output level. Real–world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for use of a powerful and flexible software package called EasyQC. For consortia of comparable size to the GIANT consortium, the present protocol takes a minimum of about 10 months to complete. PMID:24762786

  16. Genome-wide analysis identifies 12 loci influencing human reproductive behavior.

    PubMed

    Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J; Tropf, Felix C; Shen, Xia; Wilson, James F; Chasman, Daniel I; Nolte, Ilja M; Tragante, Vinicius; van der Laan, Sander W; Perry, John R B; Kong, Augustine; Ahluwalia, Tarunveer S; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J; Gieger, Christian; Gunderson, Erica P; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F; McMahon, George; Meddens, S Fleur W; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A; Monnereau, Claire; van der Most, Peter J; Myhre, Ronny; Nalls, Mike A; Nutile, Teresa; Kalafati, Ioanna Panagiota; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B; Rich-Edwards, Janet; Rietveld, Cornelius A; Robino, Antonietta; Rose, Lynda M; Rueedi, Rico; Ryan, Kathleen A; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A; Stolk, Lisette; Streeten, Elizabeth; Tönjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I; Buring, Julie E; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R; Cucca, Francesco; Toniolo, Daniela; Davey-Smith, George; Deary, Ian J; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M; de Geus, Eco J C; Eriksson, Johan G; Evans, Denis A; Faul, Jessica D; Sala, Cinzia Felicita; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J F; de Haan, Hugoline G; Haerting, Johannes; Harris, Tamara B; Heath, Andrew C; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hyppönen, Elina; Jacobsson, Bo; Jaddoe, Vincent W V; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L R; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William G; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia M; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K E; Martin, Nicholas G; McGue, Matt; McQuillan, Ruth; Medland, Sarah E; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Traglia, Michela; Milani, Lili; Mitchell, Paul; Montgomery, Grant W; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda W J H; Perola, Markus; Peyser, Patricia A; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M; Ring, Susan M; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D; Starr, John M; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A; Thorsteinsdottir, Unnur; Thurik, A Roy; Timpson, Nicholas J; Tung, Joyce Y; Uitterlinden, André G; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G; Wang, Jie Jin; Wareham, Nicholas J; Weir, David R; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F; Zondervan, Krina T; Stefansson, Kari; Krueger, Robert F; Lee, James J; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C

    2016-12-01

    The genetic architecture of human reproductive behavior-age at first birth (AFB) and number of children ever born (NEB)-has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the underlying mechanisms of AFB and NEB are poorly understood. We report a large genome-wide association study of both sexes including 251,151 individuals for AFB and 343,072 individuals for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study and 4 additional loci associated in a gene-based effort. These loci harbor genes that are likely to have a role, either directly or by affecting non-local gene expression, in human reproduction and infertility, thereby increasing understanding of these complex traits.

  17. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast.

    PubMed

    Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-03-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages.

  18. Overlapping chromatin-remodeling systems collaborate genome wide at dynamic chromatin transitions.

    PubMed

    Morris, Stephanie A; Baek, Songjoon; Sung, Myong-Hee; John, Sam; Wiench, Malgorzata; Johnson, Thomas A; Schiltz, R Louis; Hager, Gordon L

    2014-01-01

    ATP-dependent chromatin remodeling is an essential process required for the dynamic organization of chromatin structure. Here we describe the genome-wide location and activity of three remodeler proteins with diverse physiological functions in the mouse genome: Brg1, Chd4 and Snf2h. The localization patterns of all three proteins substantially overlap with one another and with regions of accessible chromatin. Furthermore, using inducible mutant variants, we demonstrate that the catalytic activity of these proteins contributes to the remodeling of chromatin genome wide and that each of these remodelers can independently regulate chromatin reorganization at distinct sites. Many regions require the activity of more than one remodeler to regulate accessibility. These findings provide a dynamic view of chromatin organization and highlight the differential contributions of remodelers to chromatin maintenance in higher eukaryotes.

  19. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality

    PubMed Central

    Kamvar, Zhian N.; Brooks, Jonah C.; Grünwald, Niklaus J.

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860

  20. Genome-wide association analyses of quantitative traits: the GAW16 experience.

    PubMed

    Ghosh, Saurabh

    2009-01-01

    The group that formed on the theme of genome-wide association analyses of quantitative traits (Group 2) in the Genetic Analysis Workshop 16 comprised eight sets of investigators. Three data sets were available: one on autoantibodies related to rheumatoid arthritis provided by the North American Rheumatoid Arthritis Consortium; the second on anthropometric, lipid, and biochemical measures provided by the Framingham Heart Study (FHS); and the third a simulated data set modeled after FHS. The different investigators in the group addressed a large set of statistical challenges and applied a wide spectrum of association methods in analyzing quantitative traits at the genome-wide level. While some previously reported genes were validated, some novel chromosomal regions provided significant evidence of association in multiple contributions in the group. In this report, we discuss the different strategies explored by the different investigators with the common goal of improving the power to detect association.

  1. Genome-wide association analyses identify 13 new susceptibility loci for generalized vitiligo

    PubMed Central

    Jin, Ying; Birlea, Stanca A; Fain, Pamela R; Ferrara, Tracey M; Ben, Songtao; Riccardi, Sheri L; Cole, Joanne B; Gowan, Katherine; Holland, Paulene J; Bennett, Dorothy C; Luiten, Rosalie M; Wolkerstorfer, Albert; van der Veen, JP Wietze; Hartmann, Anke; Eichner, Saskia; Schuler, Gerold; van Geel, Nanja; Lambert, Jo; Kemp, E Helen; Gawkrodger, David J; Weetman, Anthony P; Taïeb, Alain; Jouary, Thomas; Ezzedine, Khaled; Wallace, Margaret R; McCormack, Wayne T; Picardo, Mauro; Leone, Giovanni; Overbeck, Andreas; Silverberg, Nanette B; Spritz, Richard A

    2012-01-01

    In previous linkage and genome-wide association studies we identified 17 susceptibility loci for generalized vitiligo. By a second genome-wide association study, meta-analysis, and independent replication study, we have now identified 13 additional vitiligo-associated loci, including OCA2-HERC2, a region of 16q24.3 containing MC1R, a region of chromosome 11q21 near TYR, several immunoregulatory loci including IFIH1, CD80, CLNK, BACH2, SLA, CASP7, CD44, IKZF4, SH2B3, and a region of 22q13.2 where the causal gene remains uncertain. Functional pathway analysis shows that most vitiligo susceptibility loci encode immunoregulatory proteins or melanocyte components that likely mediate immune targeting and genetic relationships among vitiligo, malignant melanoma, and normal variation of eye, skin, and hair color. PMID:22561518

  2. Genome-wide Association Study Identifies Two Susceptibility Loci for Osteosarcoma

    PubMed Central

    Savage, Sharon A.; Mirabello, Lisa; Wang, Zhaoming; Gastier-Foster, Julie M.; Gorlick, Richard; Khanna, Chand; Flanagan, Adrienne M.; Tirabosco, Roberto; Andrulis, Irene L.; Wunder, Jay S.; Gokgoz, Nalan; Patiño-Garcia, Ana; Sierrasesúmaga, Luis; Lecanda, Fernando; Kurucu, Nilgün; Ilhan, Inci Ergurhan; Sari, Neriman; Serra, Massimo; Hattinger, Claudia; Picci, Piero; Spector, Logan; Barkauskas, Donald A.; Marina, Neyssa; de Toledo, Silvia Regina Caminada; Petrilli, Antonio S.; Amary, Maria Fernanda; Halai, Dina; Thomas, David M.; Douglass, Chester; Meltzer, Paul S.; Jacobs, Kevin; Chung, Charles C.; Berndt, Sonja I.; Purdue, Mark P.; Caporaso, Neil E.; Tucker, Margaret; Rothman, Nathaniel; Landi, Maria Teresa; Silverman, Debra T.; Kraft, Peter; Hunter, David J.; Malats, Nuria; Kogevinas, Manolis; Wacholder, Sholom; Troisi, Rebecca; Helman, Lee; Fraumeni, Joseph F.; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.

    2013-01-01

    Osteosarcoma is the most common primary bone malignancy of adolescents and young adults. In order to better understand the genetic etiology of osteosarcoma, we performed a multi-stage genome-wide association study (GWAS) consisting of 941 cases and 3,291 cancer-free adult controls of European ancestry. Two loci achieved genome-wide significance: rs1906953 at 6p21.3, in the glutamate receptor metabotropic 4 [GRM4] gene (P = 8.1 ×10-9), and rs7591996 and rs10208273 in a gene desert on 2p25.2 (P = 1.0 ×10-8 and 2.9 ×10-7). These two susceptibility loci warrant further exploration to uncover the biological mechanisms underlying susceptibility to osteosarcoma. PMID:23727862

  3. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma.

    PubMed

    Law, Matthew H; Bishop, D Timothy; Lee, Jeffrey E; Brossard, Myriam; Martin, Nicholas G; Moses, Eric K; Song, Fengju; Barrett, Jennifer H; Kumar, Rajiv; Easton, Douglas F; Pharoah, Paul D P; Swerdlow, Anthony J; Kypreou, Katerina P; Taylor, John C; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A; Andresen, Per A; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M; Dȩbniak, Tadeusz; Duffy, David L; Elder, David E; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M; Goldstein, Alisa M; Gruis, Nelleke A; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A; Chen, Wei V; Landi, Maria Teresa; Lang, Julie; Lathrop, G Mark; Lubiński, Jan; Mackie, Rona M; Mann, Graham J; Molven, Anders; Montgomery, Grant W; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A; Radford-Smith, Graham L; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C; Craig, Jamie E; Schadendorf, Dirk; Simms, Lisa A; Burdon, Kathryn P; Nyholt, Dale R; Pooley, Karen A; Orr, Nick; Stratigos, Alexander J; Cust, Anne E; Ward, Sarah V; Hayward, Nicholas K; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M; Bishop, Julia A Newton; Demenais, Florence; Amos, Christopher I; MacGregor, Stuart; Iles, Mark M

    2015-09-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10(-8)), as did 2 previously reported but unreplicated loci and all 13 established loci. Newly associated SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes in the associated regions, including one involved in telomere biology.

  4. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma

    PubMed Central

    Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.

    2015-01-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5×10–8), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428

  5. Genome-wide genetic homogeneity between sexes and populations for human height and body mass index.

    PubMed

    Yang, Jian; Bakshi, Andrew; Zhu, Zhihong; Hemani, Gibran; Vinkhuyzen, Anna A E; Nolte, Ilja M; van Vliet-Ostaptchouk, Jana V; Snieder, Harold; Esko, Tonu; Milani, Lili; Mägi, Reedik; Metspalu, Andres; Hamsten, Anders; Magnusson, Patrik K E; Pedersen, Nancy L; Ingelsson, Erik; Visscher, Peter M

    2015-12-20

    Sex-specific genetic effects have been proposed to be an important source of variation for human complex traits. Here we use two distinct genome-wide methods to estimate the autosomal genetic correlation (rg) between men and women for human height and body mass index (BMI), using individual-level (n = ∼44 000) and summary-level (n = ∼133 000) data from genome-wide association studies. Results are consistent and show that the between-sex genetic correlation is not significantly different from unity for both traits. In contrast, we find evidence of genetic heterogeneity between sexes for waist-hip ratio (rg = ∼0.7) and between populations for BMI (rg = ∼0.9 between Europe and the USA) but not for height. The lack of evidence for substantial genetic heterogeneity for body size is consistent with empirical findings across traits and species.

  6. Genome-wide analysis of microRNA and mRNA expression signatures in cancer

    PubMed Central

    Li, Ming-hui; Fu, Sheng-bo; Xiao, Hua-sheng

    2015-01-01

    Cancer is an extremely diverse and complex disease that results from various genetic and epigenetic changes such as DNA copy-number variations, mutations, and aberrant mRNA and/or protein expression caused by abnormal transcriptional regulation. The expression profiles of certain microRNAs (miRNAs) and messenger RNAs (mRNAs) are closely related to cancer progression stages. In the past few decades, DNA microarray and next-generation sequencing techniques have been widely applied to identify miRNA and mRNA signatures for cancers on a genome-wide scale and have provided meaningful insights into cancer diagnosis, prognosis and personalized medicine. In this review, we summarize the progress in genome-wide analysis of miRNAs and mRNAs as cancer biomarkers, highlighting their diagnostic and prognostic roles. PMID:26299954

  7. Genome-wide association study identifies 14 novel risk alleles associated with basal cell carcinoma

    PubMed Central

    Chahal, Harvind S.; Wu, Wenting; Ransohoff, Katherine J.; Yang, Lingyao; Hedlin, Haley; Desai, Manisha; Lin, Yuan; Dai, Hong-Ji; Qureshi, Abrar A.; Li, Wen-Qing; Kraft, Peter; Hinds, David A.; Tang, Jean Y.; Han, Jiali; Sarin, Kavita Y.

    2016-01-01

    Basal cell carcinoma (BCC) is the most common cancer worldwide with an annual incidence of 2.8 million cases in the United States alone. Previous studies have demonstrated an association between 21 distinct genetic loci and BCC risk. Here, we report the results of a two-stage genome-wide association study of BCC, totalling 17,187 cases and 287,054 controls. We confirm 17 previously reported loci and identify 14 new susceptibility loci reaching genome-wide significance (P<5 × 10−8, logistic regression). These newly associated SNPs lie within predicted keratinocyte regulatory elements and in expression quantitative trait loci; furthermore, we identify candidate genes and non-coding RNAs involved in telomere maintenance, immune regulation and tumour progression, providing deeper insight into the pathogenesis of BCC. PMID:27539887

  8. Genome-wide association analysis identifies six new loci associated with forced vital capacity.

    PubMed

    Loth, Daan W; Soler Artigas, María; Gharib, Sina A; Wain, Louise V; Franceschini, Nora; Koch, Beate; Pottinger, Tess D; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P; James, Alan L; Huffman, Jennifer E; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K; Fall, Tove; Viñuela, Ana; Launer, Lenore J; Loehr, Laura R; Fornage, Myriam; Li, Guo; Wilk, Jemma B; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B; North, Kari E; Rudnicka, Alicja R; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F; Hastie, Nicholas D; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A; Pietiläinen, Kirsi H; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M; Wojczynski, Mary; Pouta, Anneli; Johansson, Asa; Wild, Sarah H; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G; Eiriksdottir, Gudny; Morrison, Alanna C; Rotter, Jerome I; Gao, Wei; Postma, Dirkje S; White, Wendy B; Rich, Stephen S; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J; Psaty, Bruce M; Lohman, Kurt; Burchard, Esteban G; Uitterlinden, André G; Garcia, Melissa; Joubert, Bonnie R; McArdle, Wendy L; Musk, A Bill; Hansel, Nadia; Heckbert, Susan R; Zgaga, Lina; van Meurs, Joyce B J; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah L; Zhao, Jing Hua; Rantanen, Taina; O'Connor, George T; Ripatti, Samuli; Scott, Rodney J; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C; Starr, John M; Wijmenga, Cisca; Minster, Ryan L; Lederer, David J; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P; Gläser, Sven; Hammond, Christopher J; Burkart, Kristin M; Beilby, John; Kritchevsky, Stephen B; Gudnason, Vilmundur; Hancock, Dana B; Williams, O Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F; Wjst, Matthias; Kim, Woo Jin; Porteous, David J; Scotland, Generation; Smith, Blair H; Viljanen, Anne; Heliövaara, Markku; Attia, John R; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J; Boezen, H Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F; Lind, Lars; Stricker, Bruno H; Teumer, Alexander; Spector, Timothy D; Melén, Erik; Peters, Marjolein J; Lange, Leslie A; Barr, R Graham; Bracke, Ken R; Verhamme, Fien M; Sung, Joohon; Hiemstra, Pieter S; Cassano, Patricia A; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P; Brusselle, Guy G; Tobin, Martin D; London, Stephanie J

    2014-07-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10(-8)) with FVC in or near EFEMP1, BMP6, MIR129-2-HSD17B12, PRDM11, WWOX and KCNJ2. Two loci previously associated with spirometric measures (GSTCD and PTCH1) were related to FVC. Newly implicated regions were followed up in samples from African-American, Korean, Chinese and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and the pathogenesis of restrictive lung disease.

  9. Genome-wide association analysis identifies six new loci associated with forced vital capacity

    PubMed Central

    Loth, Daan W.; Artigas, María Soler; Gharib, Sina A.; Wain, Louise V.; Franceschini, Nora; Koch, Beate; Pottinger, Tess; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P.; James, Alan L.; Huffman, Jennifer E.; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J.; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M.; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K.; Fall, Tove; Viňuela, Ana; Launer, Lenore J.; Loehr, Laura R.; Fornage, Myriam; Li, Guo; Wilk, Jemma B.; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B.; North, Kari E.; Rudnicka, Alicja R.; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F.; Hastie, Nicholas D.; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A.; Pietiläinen, Kirsi H.; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G.; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M.; Wojczynski, Mary; Pouta, Anneli; Johansson, Åsa; Wild, Sarah H.; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G.; Eiriksdottir, Gudny; Morrison, Alanna C.; Rotter, Jerome I.; Gao, Wei; Postma, Dirkje S.; White, Wendy B.; Rich, Stephen S.; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J.; Psaty, Bruce M.; Lohman, Kurt; Burchard, Esteban G.; Uitterlinden, André G.; Garcia, Melissa; Joubert, Bonnie R.; McArdle, Wendy L.; Musk, A. Bill; Hansel, Nadia; Heckbert, Susan R.; Zgaga, Lina; van Meurs, Joyce B.J.; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah; Zhao, Jing Hua; Rantanen, Taina; O’Connor, George T.; Ripatti, Samuli; Scott, Rodney J.; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C.; Starr, John M.; Wijmenga, Cisca; Minster, Ryan L.; Lederer, David J.; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P.; Gläser, Sven; Hammond, Christopher J.; Burkart, Kristin M.; Beilby, John; Kritchevsky, Stephen B.; Gudnason, Vilmundur; Hancock, Dana B.; Williams, O. Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F.; Wjst, Matthias; Kim, Woo Jin; Porteous, David J.; Scotland, Generation; Smith, Blair H.; Viljanen, Anne; Heliövaara, Markku; Attia, John R.; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J.; Boezen, H. Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F.; Lind, Lars; Stricker, Bruno H.; Teumer, Alexander; Spector, Timothy D.; Melén, Erik; Peters, Marjolein J.; Lange, Leslie A.; Barr, R. Graham; Bracke, Ken R.; Verhamme, Fien M.; Sung, Joohon; Hiemstra, Pieter S.; Cassano, Patricia A.; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P.; Brusselle, Guy G.; Tobin, Martin D.; London, Stephanie J.

    2014-01-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10−8) with FVC in or near EFEMP1, BMP6, MIR-129-2/HSD17B12, PRDM11, WWOX, and KCNJ2. Two (GSTCD and PTCH1) loci previously associated with spirometric measures were related to FVC. Newly implicated regions were followed-up in samples of African American, Korean, Chinese, and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and pathogenesis of restrictive lung disease. PMID:24929828

  10. Genome-wide approaches for identifying genetic risk factors for osteoporosis

    PubMed Central

    2013-01-01

    Osteoporosis, the most common type of bone disease worldwide, is clinically characterized by low bone mineral density (BMD) and increased susceptibility to fracture. Multiple genetic and environmental factors and gene-environment interactions have been implicated in its pathogenesis. Osteoporosis has strong genetic determination, with the heritability of BMD estimated to be as high as 60%. More than 80 genes or genetic variants have been implicated in risk of osteoporosis by hypothesis-free genome-wide studies. However, these genes or genetic variants can only explain a small portion of BMD variation, suggesting that many other genes or genetic variants underlying osteoporosis risk await discovery. Here, we review recent progress in genome-wide studies of osteoporosis and discuss their implications for medicine and the major challenges in the field. PMID:23731620

  11. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast

    PubMed Central

    Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-01-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095

  12. Genome-wide association study for semen quality traits in German Warmblood stallions.

    PubMed

    Gottschalk, Maren; Metzger, Julia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar

    2016-08-01

    We performed a genome-wide association study for semen quality traits in 139 German Warmblood stallions. Stallions were genotyped using the Illumina equine SNP50 Beadchip. Traits analysed were de-regressed estimated breeding values (EBVs) for gel-free volume, sperm concentration, total number of sperm, progressive motility and the total number of progressively motile sperm. The GWAS revealed 29 SNPs on 12 different chromosomes as genome-wide significantly associated with semen quality traits. For ten genomic regions we could retrieve candidate genes influencing stallion fertility. Among the candidate genes, we could find the genes encoding cysteine-rich secretory proteins (CRISP1, CRISP2 and CRISP3). This was the first GWAS in horses performed for semen quality traits.

  13. Genome wide linkage disequilibrium in Chinese asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm: implications for domestication history and genome wide association studies.

    PubMed

    Xu, P; Wu, X; Wang, B; Luo, J; Liu, Y; Ehlers, J D; Close, T J; Roberts, P A; Lu, Z; Wang, S; Li, G

    2012-07-01

    Association mapping of important traits of crop plants relies on first understanding the extent and patterns of linkage disequilibrium (LD) in the particular germplasm being investigated. We characterize here the genetic diversity, population structure and genome wide LD patterns in a set of asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm from China. A diverse collection of 99 asparagus bean and normal cowpea accessions were genotyped with 1127 expressed sequence tag-derived single nucleotide polymorphism markers (SNPs). The proportion of polymorphic SNPs across the collection was relatively low (39%), with an average number of SNPs per locus of 1.33. Bayesian population structure analysis indicated two subdivisions within the collection sampled that generally represented the 'standard vegetable' type (subgroup SV) and the 'non-standard vegetable' type (subgroup NSV), respectively. Level of LD (r(2)) was higher and extent of LD persisted longer in subgroup SV than in subgroup NSV, whereas LD decayed rapidly (0-2 cM) in both subgroups. LD decay distance varied among chromosomes, with the longest (≈ 5 cM) five times longer than the shortest (≈ 1 cM). Partitioning of LD variance into within- and between-subgroup components coupled with comparative LD decay analysis suggested that linkage group 5, 7 and 10 may have undergone the most intensive epistatic selection toward traits favorable for vegetable use. This work provides a first population genetic insight into domestication history of asparagus bean and demonstrates the feasibility of mapping complex traits by genome wide association study in asparagus bean using a currently available cowpea SNPs marker platform.

  14. Technologies for Genome-Wide Identification of Stat5 Regulated Genes

    DTIC Science & Technology

    2003-01-01

    preinfected with adenovirus carrying either wild type or dominant-negative Stat5, were harvested and RNA was isolated as described in the Materials and...maintain tight control of signal transduction pathways, both for rapid induction and cessation of signaling. The protein CIS1 was isolated independently...differential suppression of Stat5 regulated transcripts and large-scale gene chip analysis. As a result of this work, rapid progress in genome-wide

  15. GW-SEM: A Statistical Package to Conduct Genome-Wide Structural Equation Modeling.

    PubMed

    Verhulst, Brad; Maes, Hermine H; Neale, Michael C

    2017-03-15

    Improving the accuracy of phenotyping through the use of advanced psychometric tools will increase the power to find significant associations with genetic variants and expand the range of possible hypotheses that can be tested on a genome-wide scale. Multivariate methods, such as structural equation modeling (SEM), are valuable in the phenotypic analysis of psychiatric and substance use phenotypes, but these methods have not been integrated into standard genome-wide association analyses because fitting a SEM at each single nucleotide polymorphism (SNP) along the genome was hitherto considered to be too computationally demanding. By developing a method that can efficiently fit SEMs, it is possible to expand the set of models that can be tested. This is particularly necessary in psychiatric and behavioral genetics, where the statistical methods are often handicapped by phenotypes with large components of stochastic variance. Due to the enormous amount of data that genome-wide scans produce, the statistical methods used to analyze the data are relatively elementary and do not directly correspond with the rich theoretical development, and lack the potential to test more complex hypotheses about the measurement of, and interaction between, comorbid traits. In this paper, we present a method to test the association of a SNP with multiple phenotypes or a latent construct on a genome-wide basis using a diagonally weighted least squares (DWLS) estimator for four common SEMs: a one-factor model, a one-factor residuals model, a two-factor model, and a latent growth model. We demonstrate that the DWLS parameters and p-values strongly correspond with the more traditional full information maximum likelihood parameters and p-values. We also present the timing of simulations and power analyses and a comparison with and existing multivariate GWAS software package.

  16. Identification of genetic causes of congenital neurodevelopmental disorders using genome wide molecular technologies

    PubMed Central

    Eglė, Preikšaitienė; Laima, Ambrozaitytė; Živilė, Maldžienė; Aušra, Morkūnienė,; Loreta, Cimbalistienė; Tautvydas, Rančelis; Algirdas, Utkus; Vaidutis, Kučinskas

    2016-01-01

    Background. Intellectual disability affects about 1–2% of the general population worldwide, and this is the leading socio-economic problem of health care. The evaluation of the genetic causes of intellectual disability is challenging because these conditions are genetically heterogeneous with many different genetic alterations resulting in clinically indistinguishable phenotypes. Genome wide molecular technologies are effective in a research setting for establishing the new genetic basis of a disease. We describe the first Lithuanian experience in genome-wide CNV detection and whole exome sequencing, presenting the results obtained in the research project UNIGENE. Materials and methods. The patients with developmental delay/intellectual disability have been investigated (n = 66). Diagnostic screening was performed using array-CGH technology. FISH and real time-PCR were used for the confirmation of gene-dose imbalances and investigation of parental samples. Whole exome sequencing using the next generation high throughput NGS technique was used to sequence the samples of 12 selected families. Results. 14 out of 66 patients had pathogenic copy number variants, and one patient had novel likely pathogenic aberration (microdeletion at 4p15.2). Twelve families have been processed for whole exome sequencing. Two identified sequence variants could be classified as pathogenic (in MECP2, CREBBP genes). The other families had several candidate intellectual disability gene variants that are of unclear clinical significance and must be further investigated for possible effect on the molecular pathways of intellectual disability. Conclusions. The genetic heterogeneity of intellectual disability requires genome wide approaches, including detection of chromosomal aberrations by chromosomal microarrays and whole exome sequencing capable of uncovering single gene mutations. This study demonstrates the benefits and challenges that accompany the use of genome wide molecular

  17. Genome-wide gene-environment interaction analysis for asbestos exposure in lung cancer susceptibility.

    PubMed

    Wei, Sheng; Wang, Li-E; McHugh, Michelle K; Han, Younghun; Xiong, Momiao; Amos, Christopher I; Spitz, Margaret R; Wei, Qingyi Wei

    2012-08-01

    Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene-environment interactions. To determine gene-asbestos interactions in lung cancer risk, we conducted genome-wide gene-environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17×10(-6), which did not reach the genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90×10(-5)). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene-asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.

  18. A Genome-Wide Investigation of Autozygosity and Breast Cancer Risk

    DTIC Science & Technology

    2011-07-01

    likely to harbor cancer-related genes 6 Early-onset breast cancer GWAS • 3,203 non- Hispanic white participants – 1,647cases , 1,556 controls – From BCFR...cases than in controls, using logistic regression methods. Using genome-wide SNP data (525,000 SNPs) on 1,647 non- Hispanic white, early-onset...specific genomic locations, suggesting these regions harbor important cancer genes (10, 11). Homozygosity mapping is a natural extension of large

  19. Sniffing out significant “Pee values”: genome wide association study of asparagus anosmia

    PubMed Central

    Markt, Sarah C; Nuttall, Elizabeth; Turman, Constance; Sinnott, Jennifer; Rimm, Eric B; Ecsedy, Ethan; Unger, Robert H; Fall, Katja; Finn, Stephen; Jensen, Majken K; Rider, Jennifer R; Kraft, Peter

    2016-01-01

    Objective To determine the inherited factors associated with the ability to smell asparagus metabolites in urine. Design Genome wide association study. Setting Nurses’ Health Study and Health Professionals Follow-up Study cohorts. Participants 6909 men and women of European-American descent with available genetic data from genome wide association studies. Main outcome measure Participants were characterized as asparagus smellers if they strongly agreed with the prompt “after eating asparagus, you notice a strong characteristic odor in your urine,” and anosmic if otherwise. We calculated per-allele estimates of asparagus anosmia for about nine million single nucleotide polymorphisms using logistic regression. P values <5×10-8 were considered as genome wide significant. Results 58.0% of men (n=1449/2500) and 61.5% of women (n=2712/4409) had anosmia. 871 single nucleotide polymorphisms reached genome wide significance for asparagus anosmia, all in a region on chromosome 1 (1q44: 248139851-248595299) containing multiple genes in the olfactory receptor 2 (OR2) family. Conditional analyses revealed three independent markers associated with asparagus anosmia: rs13373863, rs71538191, and rs6689553. Conclusion A large proportion of people have asparagus anosmia. Genetic variation near multiple olfactory receptor genes is associated with the ability of an individual to smell the metabolites of asparagus in urine. Future replication studies are necessary before considering targeted therapies to help anosmic people discover what they are missing. PMID:27965198

  20. Meta-analyses of genome-wide linkage scans of anxiety-related phenotypes

    PubMed Central

    Webb, Bradley T; Guo, An-Yuan; Maher, Brion S; Zhao, Zhongming; van den Oord, Edwin J; Kendler, Kenneth S; Riley, Brien P; Gillespie, Nathan A; Prescott, Carol A; Middeldorp, Christel M; Willemsen, Gonneke; de Geus, Eco JC; Hottenga, Jouke-Jan; Boomsma, Dorret I; Slagboom, Eline P; Wray, Naomi R; Montgomery, Grant W; Martin, Nicholas G; Wright, Margie J; Heath, Andrew C; Madden, Pamela A; Gelernter, Joel; Knowles, James A; Hamilton, Steven P; Weissman, Myrna M; Fyer, Abby J; Huezo-Diaz, Patricia; McGuffin, Peter; Farmer, Anne; Craig, Ian W; Lewis, Cathryn; Sham, Pak; Crowe, Raymond R; Flint, Jonathan; Hettema, John M

    2012-01-01

    Genetic factors underlying trait neuroticism, reflecting a tendency towards negative affective states, may overlap genetic susceptibility for anxiety disorders and help explain the extensive comorbidity amongst internalizing disorders. Genome-wide linkage (GWL) data from several studies of neuroticism and anxiety disorders have been published, providing an opportunity to test such hypotheses and identify genomic regions that harbor genes common to these phenotypes. In all, 11 independent GWL studies of either neuroticism (n=8) or anxiety disorders (n=3) were collected, which comprised of 5341 families with 15 529 individuals. The rank-based genome scan meta-analysis (GSMA) approach was used to analyze each trait separately and combined, and global correlations between results were examined. False discovery rate (FDR) analysis was performed to test for enrichment of significant effects. Using 10 cM intervals, bins nominally significant for both GSMA statistics, PSR and POR, were found on chromosomes 9, 11, 12, and 14 for neuroticism and on chromosomes 1, 5, 15, and 16 for anxiety disorders. Genome-wide, the results for the two phenotypes were significantly correlated, and a combined analysis identified additional nominally significant bins. Although none reached genome-wide significance, an excess of significant PSRP-values were observed, with 12 bins falling under a FDR threshold of 0.50. As demonstrated by our identification of multiple, consistent signals across the genome, meta-analytically combining existing GWL data is a valuable approach to narrowing down regions relevant for anxiety-related phenotypes. This may prove useful for prioritizing emerging genome-wide association data for anxiety disorders. PMID:22473089

  1. Genome wide expression profiling of angiogenic signaling and the Heisenberg uncertainty principle.

    PubMed

    Huber, Peter E; Hauser, Kai; Abdollahi, Amir

    2004-11-01

    Genome wide DNA expression profiling coupled with antibody array experiments using endostatin to probe the angiogenic signaling network in human endothelial cells were performed. The results reveal constraints on the measuring process that are of a similar kind as those implied by the uncertainty principle of quantum mechanics as described by Werner Heisenberg. We describe this analogy and argue for its heuristic utility in the conceptualization of angiogenesis as an important step in tumor formation.

  2. Genome-wide (over)view on the actions of vitamin D

    PubMed Central

    Carlberg, Carsten

    2014-01-01

    For a global understanding of the physiological impact of the nuclear hormone 1α,25-dihydroxyvitamin D3 (1,25(OH)2D3) the analysis of the genome-wide locations of its high affinity receptor, the transcription factor vitamin D receptor (VDR), is essential. Chromatin immunoprecipitation sequencing (ChIP-seq) in GM10855 and GM10861 lymphoblastoid cells, undifferentiated and lipopolysaccharide-differentiated THP-1 monocytes, LS180 colorectal cancer cells and LX2 hepatic stellate cells revealed between 1000 and 13,000 VDR-specific genomic binding sites. The harmonized analysis of these ChIP-seq datasets indicates that the mechanistic basis for the action of the VDR is independent of the cell type. Formaldehyde-assisted isolation of regulatory elements sequencing (FAIRE-seq) data highlight accessible chromatin regions, which are under control of 1,25(OH)2D3. In addition, public data, such as from the ENCODE project, allow to relate the genome-wide actions of VDR and 1,25(OH)2D3 to those of other proteins within the nucleus. For example, locations of the insulator protein CTCF suggest a segregation of the human genome into chromatin domains, of which more than 1000 contain at least one VDR binding site. The integration of all these genome-wide data facilitates the identification of the most important VDR binding sites and associated primary 1,25(OH)2D3 target genes. Expression changes of these key genes can serve as biomarkers for the actions of vitamin D3 and its metabolites in different tissues and cell types of human individuals. Analysis of primary tissues obtained from vitamin D3 intervention studies using such markers indicated a large inter-individual variation for the efficiency of vitamin D3 supplementation. In conclusion, a genome-wide (over)view on the genomic locations of VDR provides a broader basis for addressing vitamin D's role in health and disease. PMID:24808867

  3. Common genetic variation and survival after colorectal cancer diagnosis: a genome-wide analysis

    PubMed Central

    Phipps, Amanda I.; Passarelli, Michael N.; Chan, Andrew T.; Harrison, Tabitha A.; Jeon, Jihyoun; Hutter, Carolyn M.; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cheadle, Jeremy P.; Curtis, Keith R.; Duggan, David; Fisher, David; Fuchs, Charles S.; Gala, Manish; Giovannucci, Edward L.; Hayes, Richard B.; Hoffmeister, Michael; Hsu, Li; Jacobs, Eric J.; Jansen, Lina; Kaplan, Richard; Kap, Elisabeth J.; Maughan, Timothy S.; Potter, John D.; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; West, Hannah; White, Emily; Peters, Ulrike; Newcomb, Polly A.

    2016-01-01

    Genome-wide association studies have identified several germline single nucleotide polymorphisms (SNPs) significantly associated with colorectal cancer (CRC) incidence. Common germline genetic variation may also be related to CRC survival. We used a discovery-based approach to identify SNPs related to survival outcomes after CRC diagnosis. Genome-wide genotyping arrays were conducted for 3494 individuals with invasive CRC enrolled in six prospective cohort studies (median study-specific follow-up = 4.2–8.1 years). In pooled analyses, we used Cox regression to assess SNP-specific associations with CRC-specific and overall survival, with additional analyses stratified by stage at diagnosis. Top findings were followed-up in independent studies. A P value threshold of P < 5×10−8 in analyses combining discovery and follow-up studies was required for genome-wide significance. Among individuals with distant-metastatic CRC, several SNPs at 6p12.1, nearest the ELOVL5 gene, were statistically significantly associated with poorer survival, with the strongest associations noted for rs209489 [hazard ratio (HR) = 1.8, P = 7.6×10−10 and HR = 1.8, P = 3.7×10−9 for CRC-specific and overall survival, respectively). No SNPs were statistically significantly associated with survival among all cases combined or in cases without distant-metastases. SNPs in 6p12.1/ELOVL5 were associated with survival outcomes in individuals with distant-metastatic CRC, and merit further follow-up for functional significance. Findings from this genome-wide association study highlight the potential importance of genetic variation in CRC prognosis and provide clues to genomic regions of potential interest. PMID:26586795

  4. Cooperative Genome-Wide Analysis Shows Increased Homozygosity in Early Onset Parkinson's Disease

    PubMed Central

    Nalls, Michael A.; Martinez, Maria; Schulte, Claudia; Holmans, Peter; Gasser, Thomas; Hardy, John; Singleton, Andrew B.; Wood, Nicholas W.; Brice, Alexis; Heutink, Peter; Williams, Nigel; Morris, Huw R.

    2012-01-01

    Parkinson's disease (PD) occurs in both familial and sporadic forms, and both monogenic and complex genetic factors have been identified. Early onset PD (EOPD) is particularly associated with autosomal recessive (AR) mutations, and three genes, PARK2, PARK7 and PINK1, have been found to carry mutations leading to AR disease. Since mutations in these genes account for less than 10% of EOPD patients, we hypothesized that further recessive genetic factors are involved in this disorder, which may appear in extended runs of homozygosity. We carried out genome wide SNP genotyping to look for extended runs of homozygosity (ROHs) in 1,445 EOPD cases and 6,987 controls. Logistic regression analyses showed an increased level of genomic homozygosity in EOPD cases compared to controls. These differences are larger for ROH of 9 Mb and above, where there is a more than three-fold increase in the proportion of cases carrying a ROH. These differences are not explained by occult recessive mutations at existing loci. Controlling for genome wide homozygosity in logistic regression analyses increased the differences between cases and controls, indicating that in EOPD cases ROHs do not simply relate to genome wide measures of inbreeding. Homozygosity at a locus on chromosome19p13.3 was identified as being more common in EOPD cases as compared to controls. Sequencing analysis of genes and predicted transcripts within this locus failed to identify a novel mutation causing EOPD in our cohort. There is an increased rate of genome wide homozygosity in EOPD, as measured by an increase in ROHs. These ROHs are a signature of inbreeding and do not necessarily harbour disease-causing genetic variants. Although there might be other regions of interest apart from chromosome 19p13.3, we lack the power to detect them with this analysis. PMID:22427796

  5. Genome-wide Association and Functional Studies Identify a Role for IGFBP3 in Hip Osteoarthritis

    PubMed Central

    Evans, Daniel S.; Cailotto, Frederic; Parimi, Neeta; Valdes, Ana M.; Castaño-Betancourt, Martha C.; Liu, Youfang; Kaplan, Robert C.; Bidlingmaier, Martin; Vasan, Ramachandran S.; Teumer, Alexander; Tranah, Gregory J.; Nevitt, Michael C.; Cummings, Steven R.; Orwoll, Eric S.; Barrett-Connor, Elizabeth; Renner, Jordan B.; Jordan, Joanne M.; Doherty, Michael; Doherty, Sally A.; Uitterlinden, Andre G.; van Meurs, Joyce B.J.; Spector, Tim D.; Lories, Rik J.; Lane, Nancy E.

    2015-01-01

    Objectives To identify genetic associations with hip osteoarthritis (HOA), we performed a meta-analysis of genome-wide association studies (GWAS) of HOA. Methods The GWAS meta-analysis included approximately 2.5 million imputed HapMap single nucleotide polymorphisms (SNPs). HOA cases and controls defined radiographically and by total hip replacement were selected from the Osteoporotic Fractures in Men (MrOS) Study and the Study of Osteoporotic Fractures (SOF) (654 cases and 4697 controls, combined). Replication of genome-wide significant SNP associations (P-value ≤ 5x10−8) was examined in five studies (3243 cases and 6891 controls, combined). Functional studies were performed using in vitro models of chondrogenesis and osteogenesis. Results The A allele of rs788748, located 65 kb upstream of the IGFBP3 gene, was associated with lower HOA odds at the genome-wide significance level in the discovery stage (OR = 0.71, P-value = 2x10−8). The association replicated in five studies (OR = 0.92, P-value = 0.020), but the joint analysis of discovery and replication results was not genome-wide significant (P-value = 1x10−6). In separate study populations, the rs788748 A allele was also associated with lower circulating IGFBP3 protein levels (P-value = 4x10−13), suggesting that this SNP or a variant in linkage disequilibrium (LD) could be an IGFBP3 regulatory variant. Results from functional studies were consistent with association results. Chondrocyte hypertrophy, a deleterious event in OA pathogenesis, was largely prevented upon IGFBP3 knockdown in chondrocytes. Furthermore, IGFBP3 overexpression induced cartilage catabolism and osteogenic differentiation. Conclusions Results from GWAS and functional studies provided suggestive links between IGFBP3 and HOA. PMID:24928840

  6. HIV Genome-Wide Protein Associations: a Review of 30 Years of Research.

    PubMed

    Li, Guangdi; De Clercq, Erik

    2016-09-01

    The HIV genome encodes a small number of viral proteins (i.e., 16), invariably establishing cooperative associations among HIV proteins and between HIV and host proteins, to invade host cells and hijack their internal machineries. As a known example, the HIV envelope glycoprotein GP120 is closely associated with GP41 for viral entry. From a genome-wide perspective, a hypothesis can be worked out to determine whether 16 HIV proteins could develop 120 possible pairwise associations either by physical interactions or by functional associations mediated via HIV or host molecules. Here, we present the first systematic review of experimental evidence on HIV genome-wide protein associations using a large body of publications accumulated over the past 3 decades. Of 120 possible pairwise associations between 16 HIV proteins, at least 34 physical interactions and 17 functional associations have been identified. To achieve efficient viral replication and infection, HIV protein associations play essential roles (e.g., cleavage, inhibition, and activation) during the HIV life cycle. In either a dispensable or an indispensable manner, each HIV protein collaborates with another viral protein to accomplish specific activities that precisely take place at the proper stages of the HIV life cycle. In addition, HIV genome-wide protein associations have an impact on anti-HIV inhibitors due to the extensive cross talk between drug-inhibited proteins and other HIV proteins. Overall, this study presents for the first time a comprehensive overview of HIV genome-wide protein associations, highlighting meticulous collaborations between all viral proteins during the HIV life cycle.

  7. Genome-Wide Association Study Identifies Novel Loci Associated With Diisocyanate-Induced Occupational Asthma.

    PubMed

    Yucesoy, Berran; Kaufman, Kenneth M; Lummus, Zana L; Weirauch, Matthew T; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B; Bernstein, David I

    2015-07-01

    Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10(-14)). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10(-9) and rs2514805, p = 1.22 × 10(-8), respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10(-6)). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies.

  8. Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction.

    PubMed

    Xu, Yonghui; Min, Huaqing; Wu, Qingyao; Song, Hengjie; Ye, Bicui

    2017-02-06

    Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods.

  9. Multi-instance multi-label distance metric learning for genome-wide protein function prediction.

    PubMed

    Xu, Yonghui; Min, Huaqing; Song, Hengjie; Wu, Qingyao

    2016-08-01

    Multi-instance multi-label (MIML) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with not only multiple instances but also multiple class labels. To find an appropriate MIML learning method for genome-wide protein function prediction, many studies in the literature attempted to optimize objective functions in which dissimilarity between instances is measured using the Euclidean distance. But in many real applications, Euclidean distance may be unable to capture the intrinsic similarity/dissimilarity in feature space and label space. Unlike other previous approaches, in this paper, we propose to learn a multi-instance multi-label distance metric learning framework (MIMLDML) for genome-wide protein function prediction. Specifically, we learn a Mahalanobis distance to preserve and utilize the intrinsic geometric information of both feature space and label space for MIML learning. In addition, we try to deal with the sparsely labeled data by giving weight to the labeled data. Extensive experiments on seven real-world organisms covering the biological three-domain system (i.e., archaea, bacteria, and eukaryote; Woese et al., 1990) show that the MIMLDML algorithm is superior to most state-of-the-art MIML learning algorithms.

  10. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    PubMed

    Wang, Yu; Li, Wei; Xia, Yingying; Wang, Chongzhi; Tang, Y Tom; Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2014-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.

  11. A genome-wide association study of sleep habits and insomnia.

    PubMed

    Byrne, Enda M; Gehrman, Philip R; Medland, Sarah E; Nyholt, Dale R; Heath, Andrew C; Madden, Pamela A F; Hickie, Ian B; Van Duijn, Cornelia M; Henders, Anjali K; Montgomery, Grant W; Martin, Nicholas G; Wray, Naomi R

    2013-07-01

    Several aspects of sleep behavior such as timing, duration and quality have been demonstrated to be heritable. To identify common variants that influence sleep traits in the population, we conducted a genome-wide association study of six sleep phenotypes assessed by questionnaire in a sample of 2,323 individuals from the Australian Twin Registry. Genotyping was performed on the Illumina 317, 370, and 610K arrays and the SNPs in common between platforms were used to impute non-genotyped SNPs. We tested for association with more than 2,000,000 common polymorphisms across the genome. While no SNPs reached the genome-wide significance threshold, we identified a number of associations in plausible candidate genes. Most notably, a group of SNPs in the third intron of the CACNA1C gene ranked as most significant in the analysis of sleep latency (P = 1.3 × 10⁻⁶). We attempted to replicate this association in an independent sample from the Chronogen Consortium (n = 2,034), but found no evidence of association (P = 0.73). We have identified several other suggestive associations that await replication in an independent sample. We did not replicate the results from previous genome-wide analyses of self-reported sleep phenotypes after correction for multiple testing.

  12. Genome-wide association analysis of age at onset and psychotic symptoms in bipolar disorder.

    PubMed

    Belmonte Mahon, Pamela; Pirooznia, Mehdi; Goes, Fernando S; Seifuddin, Fayaz; Steele, Jo; Lee, Phil Hyoun; Huang, Jie; Hamshere, Marian L; Depaulo, J Raymond; Kelsoe, John R; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W; Craddock, Nick; Schulze, Thomas G; McMahon, Francis J; Potash, James B; Zandi, Peter P

    2011-04-01

    Genome-wide association studies (GWAS) have identified several susceptibility loci for bipolar disorder (BP), most notably ANK3. However, most of the inherited risk for BP remains unexplained. One reason for the limited success may be the genetic heterogeneity of BP. Clinical sub-phenotypes of BP may identify more etiologically homogeneous subsets of patients, which can be studied with increased power to detect genetic variation. Here, we report on a mega-analysis of two widely studied sub-phenotypes of BP, age at onset and psychotic symptoms, which are familial and clinically significant. We combined data from three GWAS: NIMH Bipolar Disorder Genetic Association Information Network (GAIN-BP), NIMH Bipolar Disorder Genome Study (BiGS), and a German sample. The combined sample consisted of 2,836 BP cases with information on sub-phenotypes and 2,744 controls. Imputation was performed, resulting in 2.3 million SNPs available for analysis. No SNP reached genome-wide significance for either sub-phenotype. In addition, no SNP reached genome-wide significance in a meta-analysis with an independent replication sample. We had 80% power to detect associations with a common SNP at an OR of 1.6 for psychotic symptoms and a mean difference of 1.8 years in age at onset. Age at onset and psychotic symptoms in BP may be influenced by many genes of smaller effect sizes or other variants not measured well by SNP arrays, such as rare alleles.

  13. Genome-wide association study for wool production traits in a Chinese Merino sheep population.

    PubMed

    Wang, Zhipeng; Zhang, Hui; Yang, Hua; Wang, Shouzhi; Rong, Enguang; Pei, Wenyu; Li, Hui; Wang, Ning

    2014-01-01

    Genome-wide association studies (GWAS) provide a powerful approach for identifying quantitative trait loci without prior knowledge of location or function. To identify loci associated with wool production traits, we performed a genome-wide association study on a total of 765 Chinese Merino sheep (JunKen type) genotyped with 50 K single nucleotide polymorphisms (SNPs). In the present study, five wool production traits were examined: fiber diameter, fiber diameter coefficient of variation, fineness dispersion, staple length and crimp. We detected 28 genome-wide significant SNPs for fiber diameter, fiber diameter coefficient of variation, fineness dispersion, and crimp trait in the Chinese Merino sheep. About 43% of the significant SNP markers were located within known or predicted genes, including YWHAZ, KRTCAP3, TSPEAR, PIK3R4, KIF16B, PTPN3, GPRC5A, DDX47, TCF9, TPTE2, EPHA5 and NBEA genes. Our results not only confirm the results of previous reports, but also provide a suite of novel SNP markers and candidate genes associated with wool traits. Our findings will be useful for exploring the genetic control of wool traits in sheep.

  14. Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction

    PubMed Central

    Xu, Yonghui; Min, Huaqing; Wu, Qingyao; Song, Hengjie; Ye, Bicui

    2017-01-01

    Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods. PMID:28165495

  15. Genome-wide association study in essential tremor identifies three new loci

    PubMed Central

    Müller, Stefanie H.; Girard, Simon L.; Hopfner, Franziska; Merner, Nancy D.; Bourassa, Cynthia V.; Lorenz, Delia; Clark, Lorraine N.; Tittmann, Lukas; Soto-Ortolaza, Alexandra I.; Klebe, Stephan; Hallett, Mark; Schneider, Susanne A.; Hodgkinson, Colin A.; Lieb, Wolfgang; Wszolek, Zbigniew K.; Pendziwiat, Manuela; Lorenzo-Betancor, Oswaldo; Poewe, Werner; Ortega-Cubero, Sara; Seppi, Klaus; Rajput, Alex; Hussl, Anna; Rajput, Ali H.; Berg, Daniela; Dion, Patrick A.; Wurster, Isabel; Shulman, Joshua M.; Srulijes, Karin; Haubenberger, Dietrich; Pastor, Pau; Vilariño-Güell, Carles; Postuma, Ronald B.; Bernard, Geneviève; Ladwig, Karl-Heinz; Dupré, Nicolas; Jankovic, Joseph; Strauch, Konstantin; Panisset, Michel; Winkelmann, Juliane; Testa, Claudia M.; Reischl, Eva; Zeuner, Kirsten E.; Ross, Owen A.; Arzberger, Thomas; Chouinard, Sylvain; Deuschl, Günther; Louis, Elan D.; Kuhlenbäumer, Gregor

    2016-01-01

    We conducted a genome-wide association study of essential tremor, a common movement disorder characterized mainly by a postural and kinetic tremor of the upper extremities. Twin and family history studies show a high heritability for essential tremor. The molecular genetic determinants of essential tremor are unknown. We included 2807 patients and 6441 controls of European descent in our two-stage genome-wide association study. The 59 most significantly disease-associated markers of the discovery stage were genotyped in the replication stage. After Bonferroni correction two markers, one (rs10937625) located in the serine/threonine kinase STK32B and one (rs17590046) in the transcriptional coactivator PPARGC1A were associated with essential tremor. Three markers (rs12764057, rs10822974, rs7903491) in the cell-adhesion molecule CTNNA3 were significant in the combined analysis of both stages. The expression of STK32B was increased in the cerebellar cortex of patients and expression quantitative trait loci database mining showed association between the protective minor allele of rs10937625 and reduced expression in cerebellar cortex. We found no expression differences related to disease status or marker genotype for the other two genes. Replication of two lead single nucleotide polymorphisms of previous small genome-wide association studies (rs3794087 in SLC1A2, rs9652490 in LINGO1) did not confirm the association with essential tremor. PMID:27797806

  16. Genome Wide Allele Frequency Fingerprints (GWAFFs) of Populations via Genotyping by Sequencing

    PubMed Central

    Byrne, Stephen; Czaban, Adrian; Studer, Bruno; Panitz, Frank; Bendixen, Christian; Asp, Torben

    2013-01-01

    Genotyping-by-Sequencing (GBS) is an excellent tool for characterising genetic variation between plant genomes. To date, its use has been reported only for genotyping of single individuals. However, there are many applications where resolving allele frequencies within populations on a genome-wide scale would be very powerful, examples include the breeding of outbreeding species, varietal protection in outbreeding species, monitoring changes in population allele frequencies. This motivated us to test the potential to use GBS to evaluate allele frequencies within populations. Perennial ryegrass is an outbreeding species, and breeding programs are based upon selection on populations. We tested two restriction enzymes for their efficiency in complexity reduction of the perennial ryegrass genome. The resulting profiles have been termed Genome Wide Allele Frequency Fingerprints (GWAFFs), and we have shown how these fingerprints can be used to distinguish between plant populations. Even at current costs and throughput, using sequencing to directly evaluate populations on a genome-wide scale is viable. GWAFFs should find many applications, from varietal development in outbreeding species right through to playing a role in protecting plant breeders’ rights. PMID:23469194

  17. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions

    PubMed Central

    Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2015-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information. PMID:25919136

  18. A Genome-Wide Association Study Identifies Genetic Variants Associated with Mathematics Ability

    PubMed Central

    Chen, Huan; Gu, Xiao-hong; Zhou, Yuxi; Ge, Zeng; Wang, Bin; Siok, Wai Ting; Wang, Guoqing; Huen, Michael; Jiang, Yuyang; Tan, Li-Hai; Sun, Yimin

    2017-01-01

    Mathematics ability is a complex cognitive trait with polygenic heritability. Genome-wide association study (GWAS) has been an effective approach to investigate genetic components underlying mathematic ability. Although previous studies reported several candidate genetic variants, none of them exceeded genome-wide significant threshold in general populations. Herein, we performed GWAS in Chinese elementary school students to identify potential genetic variants associated with mathematics ability. The discovery stage included 494 and 504 individuals from two independent cohorts respectively. The replication stage included another cohort of 599 individuals. In total, 28 of 81 candidate SNPs that met validation criteria were further replicated. Combined meta-analysis of three cohorts identified four SNPs (rs1012694, rs11743006, rs17778739 and rs17777541) of SPOCK1 gene showing association with mathematics ability (minimum p value 5.67 × 10−10, maximum β −2.43). The SPOCK1 gene is located on chromosome 5q31.2 and encodes a highly conserved glycoprotein testican-1 which was associated with tumor progression and prognosis as well as neurogenesis. This is the first study to report genome-wide significant association of individual SNPs with mathematics ability in general populations. Our preliminary results further supported the role of SPOCK1 during neurodevelopment. The genetic complexities underlying mathematics ability might contribute to explain the basis of human cognition and intelligence at genetic level. PMID:28155865

  19. A GENOME-WIDE LINKAGE AND ASSOCIATION SCAN REVEALS NOVEL LOCI FOR AUTISM

    PubMed Central

    Weiss, Lauren A.; Arking, Dan E.

    2009-01-01

    Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673

  20. Cell-Type-Specific Genome-wide Expression Profiling after Laser Capture Microdissection of Living Tissue

    SciTech Connect

    Marchetti, F; Manohar, C F

    2005-02-09

    The purpose of this technical feasibility study was to develop and evaluate robust microgenomic tools for investigations of genome-wide expression of very small numbers of cells isolated from whole tissue sections. Tissues contain large numbers of cell-types that play varied roles in organ function and responses to endogenous and exogenous toxicants whether bacterial, viral, chemical or radiation. Expression studies of whole tissue biopsy are severely limited because heterogeneous cell-types result in an averaging of molecular signals masking subtle but important changes in gene expression in any one cell type(s) or group of cells. Accurate gene expression analysis requires the study of specific cell types in their tissue environment but without contamination from surrounding cells. Laser capture microdissection (LCM) is a new technology to isolate morphologically distinct cells from tissue sections. Alternative methods are available for isolating single cells but not yet for their reliable genome-wide expression analyses. The tasks of this feasibility project were to: (1) Develop efficient protocols for laser capture microdissection of cells from tissues identified by antibody label, or morphological stain. (2) Develop reproducible gene-transcript analyses techniques for single cell-types and determine the numbers of cells needed for reliable genome-wide analyses. (3) Validate the technology for epithelial and endothelial cells isolated from the gastrointestinal tract of mice.

  1. NSD1 mutations generate a genome-wide DNA methylation signature.

    PubMed

    Choufani, S; Cytrynbaum, C; Chung, B H Y; Turinsky, A L; Grafodatskaya, D; Chen, Y A; Cohen, A S A; Dupuis, L; Butcher, D T; Siu, M T; Luk, H M; Lo, I F M; Lam, S T S; Caluseriu, O; Stavropoulos, D J; Reardon, W; Mendoza-Londono, R; Brudno, M; Gibson, W T; Chitayat, D; Weksberg, R

    2015-12-22

    Sotos syndrome (SS) represents an important human model system for the study of epigenetic regulation; it is an overgrowth/intellectual disability syndrome caused by mutations in a histone methyltransferase, NSD1. As layered epigenetic modifications are often interdependent, we propose that pathogenic NSD1 mutations have a genome-wide impact on the most stable epigenetic mark, DNA methylation (DNAm). By interrogating DNAm in SS patients, we identify a genome-wide, highly significant NSD1(+/-)-specific signature that differentiates pathogenic NSD1 mutations from controls, benign NSD1 variants and the clinically overlapping Weaver syndrome. Validation studies of independent cohorts of SS and controls assigned 100% of these samples correctly. This highly specific and sensitive NSD1(+/-) signature encompasses genes that function in cellular morphogenesis and neuronal differentiation, reflecting cardinal features of the SS phenotype. The identification of SS-specific genome-wide DNAm alterations will facilitate both the elucidation of the molecular pathophysiology of SS and the development of improved diagnostic testing.

  2. On the analysis of a repeated measure design in genome-wide association analysis.

    PubMed

    Lee, Young; Park, Suyeon; Moon, Sanghoon; Lee, Juyoung; Elston, Robert C; Lee, Woojoo; Won, Sungho

    2014-11-28

    Longitudinal data enables detecting the effect of aging/time, and as a repeated measures design is statistically more efficient compared to cross-sectional data if the correlations between repeated measurements are not large. In particular, when genotyping cost is more expensive than phenotyping cost, the collection of longitudinal data can be an efficient strategy for genetic association analysis. However, in spite of these advantages, genome-wide association studies (GWAS) with longitudinal data have rarely been analyzed taking this into account. In this report, we calculate the required sample size to achieve 80% power at the genome-wide significance level for both longitudinal and cross-sectional data, and compare their statistical efficiency. Furthermore, we analyzed the GWAS of eight phenotypes with three observations on each individual in the Korean Association Resource (KARE). A linear mixed model allowing for the correlations between observations for each individual was applied to analyze the longitudinal data, and linear regression was used to analyze the first observation on each individual as cross-sectional data. We found 12 novel genome-wide significant disease susceptibility loci that were then confirmed in the Health Examination cohort, as well as some significant interactions between age/sex and SNPs.

  3. Replicability and robustness of genome-wide-association studies for behavioral traits.

    PubMed

    Rietveld, Cornelius A; Conley, Dalton; Eriksson, Nicholas; Esko, Tõnu; Medland, Sarah E; Vinkhuyzen, Anna A E; Yang, Jian; Boardman, Jason D; Chabris, Christopher F; Dawes, Christopher T; Domingue, Benjamin W; Hinds, David A; Johannesson, Magnus; Kiefer, Amy K; Laibson, David; Magnusson, Patrik K E; Mountain, Joanna L; Oskarsson, Sven; Rostapshova, Olga; Teumer, Alexander; Tung, Joyce Y; Visscher, Peter M; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D

    2014-11-01

    A recent genome-wide-association study of educational attainment identified three single-nucleotide polymorphisms (SNPs) whose associations, despite their small effect sizes (each R (2) ≈ 0.02%), reached genome-wide significance (p < 5 × 10(-8)) in a large discovery sample and were replicated in an independent sample (p < .05). The study also reported associations between educational attainment and indices of SNPs called "polygenic scores." In three studies, we evaluated the robustness of these findings. Study 1 showed that the associations with all three SNPs were replicated in another large (N = 34,428) independent sample. We also found that the scores remained predictive (R (2) ≈ 2%) in regressions with stringent controls for stratification (Study 2) and in new within-family analyses (Study 3). Our results show that large and therefore well-powered genome-wide-association studies can identify replicable genetic associations with behavioral traits. The small effect sizes of individual SNPs are likely to be a major contributing factor explaining the striking contrast between our results and the disappointing replication record of most candidate-gene studies.

  4. Five endometrial cancer risk loci identified through genome-wide association analysis.

    PubMed

    Cheng, Timothy H T; Thompson, Deborah J; O'Mara, Tracy A; Painter, Jodie N; Glubb, Dylan M; Flach, Susanne; Lewis, Annabelle; French, Juliet D; Freeman-Mills, Luke; Church, David; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Webb, Penelope M; Attia, John; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Henders, Anjali K; Martin, Nicholas G; Montgomery, Grant W; Nyholt, Dale R; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Dennis, Joe; Fasching, Peter A; Beckmann, Matthias W; Hein, Alexander; Ekici, Arif B; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Amant, Frederic; Schrauwen, Stefanie; Zhao, Hui; Lambrechts, Diether; Depreeuw, Jeroen; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica M J; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Liu, Tao; Mints, Miriam; Tham, Emma; Li, Mulin Jun; Yip, Shun H; Wang, Junwen; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Dunlop, Malcolm; Houlston, Richard; Palles, Claire; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Cunningham, Julie M; Pharoah, Paul D P; Dunning, Alison M; Edwards, Stacey L; Easton, Douglas F; Tomlinson, Ian; Spurdle, Amanda B

    2016-06-01

    We conducted a meta-analysis of three endometrial cancer genome-wide association studies (GWAS) and two follow-up phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five new risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1, near SIVA1). We also found a second independent 8q24.21 signal (rs17232730). Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r(2) = 0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103[T] allele that is protective in endometrial cancer suppressed gene expression in vitro, suggesting that regulation of the expression of KLF5, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer.

  5. Genome-wide analysis of long-term evolutionary domestication in Drosophila melanogaster

    PubMed Central

    Phillips, Mark A.; Long, Anthony D.; Greenspan, Zachary S.; Greer, Lee F.; Burke, Molly K.; Villeponteau, Bryant; Matsagas, Kennedy C.; Rizza, Cristina L.; Mueller, Laurence D.; Rose, Michael R.

    2016-01-01

    Experimental evolutionary genomics now allows biologists to test fundamental theories concerning the genetic basis of adaptation. We have conducted one of the longest laboratory evolution experiments with any sexually-reproducing metazoan, Drosophila melanogaster. We used next-generation resequencing data from this experiment to examine genome-wide patterns of genetic variation over an evolutionary time-scale that approaches 1,000 generations. We also compared measures of variation within and differentiation between our populations to simulations based on a variety of evolutionary scenarios. Our analysis yielded no clear evidence of hard selective sweeps, whereby natural selection acts to increase the frequency of a newly-arising mutation in a population until it becomes fixed. We do find evidence for selection acting on standing genetic variation, as independent replicate populations exhibit similar population-genetic dynamics, without obvious fixation of candidate alleles under selection. A hidden-Markov model test for selection also found widespread evidence for selection. We found more genetic variation genome-wide, and less differentiation between replicate populations genome-wide, than arose in any of our simulated evolutionary scenarios. PMID:28004838

  6. Novel Loci Associated with Usual Sleep Duration: The CHARGE Consortium Genome-Wide Association Study

    PubMed Central

    Gottlieb, Daniel J.; Hek, Karin; Chen, Ting-hsu; Watson, Nathaniel F.; Eiriksdottir, Gudny; Byrne, Enda M.; Cornelis, Marilyn; Warby, Simon C.; Bandinelli, Stefania; Cherkas, Lynn; Evans, Daniel S.; Grabe, Hans J.; Lahti, Jari; Li, Man; Lehtimäki, Terho; Lumley, Thomas; Marciante, Kristin D.; Pérusse, Louis; Psaty, Bruce M.; Robbins, John; Tranah, Gregory J.; Vink, Jacqueline M.; Wilk, Jemma B.; Stafford, Jeanette M.; Bellis, Claire; Biffar, Reiner; Bouchard, Claude; Cade, Brian; Curhan, Gary C.; Eriksson, Johan G.; Ewert, Ralf; Ferrucci, Luigi; Fülöp, Tibor; Gehrman, Philip R.; Goodloe, Robert; Harris, Tamara B.; Heath, Andrew C.; Hernandez, Dena; Hofman, Albert; Hottenga, Jouke-Jan; Hunter, David J.; Jensen, Majken K.; Johnson, Andrew D.; Kähönen, Mika; Kao, Linda; Kraft, Peter; Larkin, Emma K.; Lauderdale, Diane S.; Luik, Annemarie I.; Medici, Marco; Montgomery, Grant W.; Palotie, Aarno; Patel, Sanjay R.; Pistis, Giorgio; Porcu, Eleonora; Quaye, Lydia; Raitakari, Olli; Redline, Susan; Rimm, Eric B.; Rotter, Jerome I.; Smith, Albert V.; Spector, Tim D.; Teumer, Alexander; Uitterlinden, André G.; Vohl, Marie-Claude; Widen, Elisabeth; Willemsen, Gonneke; Young, Terry; Zhang, Xiaoling; Liu, Yongmei; Blangero, John; Boomsma, Dorret I.; Gudnason, Vilmundur; Hu, Frank; Mangino, Massimo; Martin, Nicholas G.; O’Connor, George T.; Stone, Katie L.; Tanaka, Toshiko; Viikari, Jorma; Gharib, Sina A.; Punjabi, Naresh M.; Räikkönen, Katri; Völzke, Henry; Mignot, Emmanuel; Tiemeier, Henning

    2015-01-01

    Usual sleep duration is a heritable trait correlated with psychiatric morbidity, cardiometabolic disease and mortality, although little is known about the genetic variants influencing this trait. A genome-wide association study of usual sleep duration was conducted using 18 population-based cohorts totaling 47,180 individuals of European ancestry. Genome-wide significant association was identified at two loci. The strongest is located on chromosome 2, in an intergenic region 35–80 kb upstream from the thyroid-specific transcription factor PAX8 (lowest p=1.1 ×10−9). This finding was replicated in an African-American sample of 4771 individuals (lowest p=9.3 × 10−4). The strongest combined association was at rs1823125 (p=1.5 × 10−10, minor allele frequency 0.26 in the discovery sample, 0.12 in the replication sample), with each copy of the minor allele associated with a sleep duration 3.1 minutes longer per night. The alleles associated with longer sleep duration were associated in previous genome-wide association studies with a more favorable metabolic profile and a lower risk of attention deficit hyperactivity disorder. Understanding the mechanisms underlying these associations may help elucidate biological mechanisms influencing sleep duration and its association with psychiatric, metabolic and cardiovascular disease. PMID:25469926

  7. Genome-wide association study in essential tremor identifies three new loci.

    PubMed

    Müller, Stefanie H; Girard, Simon L; Hopfner, Franziska; Merner, Nancy D; Bourassa, Cynthia V; Lorenz, Delia; Clark, Lorraine N; Tittmann, Lukas; Soto-Ortolaza, Alexandra I; Klebe, Stephan; Hallett, Mark; Schneider, Susanne A; Hodgkinson, Colin A; Lieb, Wolfgang; Wszolek, Zbigniew K; Pendziwiat, Manuela; Lorenzo-Betancor, Oswaldo; Poewe, Werner; Ortega-Cubero, Sara; Seppi, Klaus; Rajput, Alex; Hussl, Anna; Rajput, Ali H; Berg, Daniela; Dion, Patrick A; Wurster, Isabel; Shulman, Joshua M; Srulijes, Karin; Haubenberger, Dietrich; Pastor, Pau; Vilariño-Güell, Carles; Postuma, Ronald B; Bernard, Geneviève; Ladwig, Karl-Heinz; Dupré, Nicolas; Jankovic, Joseph; Strauch, Konstantin; Panisset, Michel; Winkelmann, Juliane; Testa, Claudia M; Reischl, Eva; Zeuner, Kirsten E; Ross, Owen A; Arzberger, Thomas; Chouinard, Sylvain; Deuschl, Günther; Louis, Elan D; Kuhlenbäumer, Gregor; Rouleau, Guy A

    2016-12-01

    We conducted a genome-wide association study of essential tremor, a common movement disorder characterized mainly by a postural and kinetic tremor of the upper extremities. Twin and family history studies show a high heritability for essential tremor. The molecular genetic determinants of essential tremor are unknown. We included 2807 patients and 6441 controls of European descent in our two-stage genome-wide association study. The 59 most significantly disease-associated markers of the discovery stage were genotyped in the replication stage. After Bonferroni correction two markers, one (rs10937625) located in the serine/threonine kinase STK32B and one (rs17590046) in the transcriptional coactivator PPARGC1A were associated with essential tremor. Three markers (rs12764057, rs10822974, rs7903491) in the cell-adhesion molecule CTNNA3 were significant in the combined analysis of both stages. The expression of STK32B was increased in the cerebellar cortex of patients and expression quantitative trait loci database mining showed association between the protective minor allele of rs10937625 and reduced expression in cerebellar cortex. We found no expression differences related to disease status or marker genotype for the other two genes. Replication of two lead single nucleotide polymorphisms of previous small genome-wide association studies (rs3794087 in SLC1A2, rs9652490 in LINGO1) did not confirm the association with essential tremor.

  8. Empirical estimation of genome-wide significance thresholds based on the 1000 Genomes Project data set

    PubMed Central

    Kanai, Masahiro; Tanaka, Toshihiro; Okada, Yukinori

    2016-01-01

    To assess the statistical significance of associations between variants and traits, genome-wide association studies (GWAS) should employ an appropriate threshold that accounts for the massive burden of multiple testing in the study. Although most studies in the current literature commonly set a genome-wide significance threshold at the level of P=5.0 × 10−8, the adequacy of this value for respective populations has not been fully investigated. To empirically estimate thresholds for different ancestral populations, we conducted GWAS simulations using the 1000 Genomes Phase 3 data set for Africans (AFR), Europeans (EUR), Admixed Americans (AMR), East Asians (EAS) and South Asians (SAS). The estimated empirical genome-wide significance thresholds were Psig=3.24 × 10−8 (AFR), 9.26 × 10−8 (EUR), 1.83 × 10−7 (AMR), 1.61 × 10−7 (EAS) and 9.46 × 10−8 (SAS). We additionally conducted trans-ethnic meta-analyses across all populations (ALL) and all populations except for AFR (ΔAFR), which yielded Psig=3.25 × 10−8 (ALL) and 4.20 × 10−8 (ΔAFR). Our results indicate that the current threshold (P=5.0 × 10−8) is overly stringent for all ancestral populations except for Africans; however, we should employ a more stringent threshold when conducting a meta-analysis, regardless of the presence of African samples. PMID:27305981

  9. Empirical estimation of genome-wide significance thresholds based on the 1000 Genomes Project data set.

    PubMed

    Kanai, Masahiro; Tanaka, Toshihiro; Okada, Yukinori

    2016-10-01

    To assess the statistical significance of associations between variants and traits, genome-wide association studies (GWAS) should employ an appropriate threshold that accounts for the massive burden of multiple testing in the study. Although most studies in the current literature commonly set a genome-wide significance threshold at the level of P=5.0 × 10(-8), the adequacy of this value for respective populations has not been fully investigated. To empirically estimate thresholds for different ancestral populations, we conducted GWAS simulations using the 1000 Genomes Phase 3 data set for Africans (AFR), Europeans (EUR), Admixed Americans (AMR), East Asians (EAS) and South Asians (SAS). The estimated empirical genome-wide significance thresholds were Psig=3.24 × 10(-8) (AFR), 9.26 × 10(-8) (EUR), 1.83 × 10(-7) (AMR), 1.61 × 10(-7) (EAS) and 9.46 × 10(-8) (SAS). We additionally conducted trans-ethnic meta-analyses across all populations (ALL) and all populations except for AFR (ΔAFR), which yielded Psig=3.25 × 10(-8) (ALL) and 4.20 × 10(-8) (ΔAFR). Our results indicate that the current threshold (P=5.0 × 10(-8)) is overly stringent for all ancestral populations except for Africans; however, we should employ a more stringent threshold when conducting a meta-analysis, regardless of the presence of African samples.

  10. Genome-wide association study identifies multiple loci associated with bladder cancer risk

    PubMed Central

    Figueroa, Jonine D.; Ye, Yuanqing; Siddiq, Afshan; Garcia-Closas, Montserrat; Chatterjee, Nilanjan; Prokunina-Olsson, Ludmila; Cortessis, Victoria K.; Kooperberg, Charles; Cussenot, Olivier; Benhamou, Simone; Prescott, Jennifer; Porru, Stefano; Dinney, Colin P.; Malats, Núria; Baris, Dalsu; Purdue, Mark; Jacobs, Eric J.; Albanes, Demetrius; Wang, Zhaoming; Deng, Xiang; Chung, Charles C.; Tang, Wei; Bas Bueno-de-Mesquita, H.; Trichopoulos, Dimitrios; Ljungberg, Börje; Clavel-Chapelon, Françoise; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth; Tjønneland, Anne; Brenan, Paul; Chang-Claude, Jenny; Riboli, Elio; Conti, David; Gago-Dominguez, Manuela; Stern, Mariana C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Hohensee, Chancellor; Rodabough, Rebecca; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Chen, Constance; De Vivo, Immaculata; Giovannucci, Edward; Hunter, David J.; Kraft, Peter; Lindstrom, Sara; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Kamat, Ashish M.; Lerner, Seth P.; Barton Grossman, H.; Lin, Jie; Gu, Jian; Pu, Xia; Hutchinson, Amy; Burdette, Laurie; Wheeler, William; Kogevinas, Manolis; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Lloreta, Josep; Schwenn, Molly; Karagas, Margaret R.; Johnson, Alison; Schned, Alan; Armenti, Karla R.; Hosain, G.M.; Andriole, Gerald; Grubb, Robert; Black, Amanda; Ryan Diver, W.; Gapstur, Susan M.; Weinstein, Stephanie J.; Virtamo, Jarmo; Haiman, Chris A.; Landi, Maria T.; Caporaso, Neil; Fraumeni, Joseph F.; Vineis, Paolo; Wu, Xifeng; Silverman, Debra T.; Chanock, Stephen; Rothman, Nathaniel

    2014-01-01

    Candidate gene and genome-wide association studies (GWAS) have identified 11 independent susceptibility loci associated with bladder cancer risk. To discover additional risk variants, we conducted a new GWAS of 2422 bladder cancer cases and 5751 controls, followed by a meta-analysis with two independently published bladder cancer GWAS, resulting in a combined analysis of 6911 cases and 11 814 controls of European descent. TaqMan genotyping of 13 promising single nucleotide polymorphisms with P < 1 × 10−5 was pursued in a follow-up set of 801 cases and 1307 controls. Two new loci achieved genome-wide statistical significance: rs10936599 on 3q26.2 (P = 4.53 × 10−9) and rs907611 on 11p15.5 (P = 4.11 × 10−8). Two notable loci were also identified that approached genome-wide statistical significance: rs6104690 on 20p12.2 (P = 7.13 × 10−7) and rs4510656 on 6p22.3 (P = 6.98 × 10−7); these require further studies for confirmation. In conclusion, our study has identified new susceptibility alleles for bladder cancer risk that require fine-mapping and laboratory investigation, which could further understanding into the biological underpinnings of bladder carcinogenesis. PMID:24163127

  11. Genome-wide signatures of male-mediated migration shaping the Indian gene pool.

    PubMed

    ArunKumar, GaneshPrasad; Tatarinova, Tatiana V; Duty, Jeff; Rollo, Debra; Syama, Adhikarla; Arun, Varatharajan Santhakumari; Kavitha, Valampuri John; Triska, Petr; Greenspan, Bennett; Wells, R Spencer; Pitchappan, Ramasamy

    2015-09-01

    Multiple questions relating to contributions of cultural and demographical factors in the process of human geographical dispersal remain largely unanswered. India, a land of early human settlement and the resulting diversity is a good place to look for some of the answers. In this study, we explored the genetic structure of India using a diverse panel of 78 males genotyped using the GenoChip. Their genome-wide single-nucleotide polymorphism (SNP) diversity was examined in the context of various covariates that influence Indian gene pool. Admixture analysis of genome-wide SNP data showed high proportion of the Southwest Asian component in all of the Indian samples. Hierarchical clustering based on admixture proportions revealed seven distinct clusters correlating to geographical and linguistic affiliations. Convex hull overlay of Y-chromosomal haplogroups on the genome-wide SNP principal component analysis brought out distinct non-overlapping polygons of F*-M89, H*-M69, L1-M27, O2a-M95 and O3a3c1-M117, suggesting a male-mediated migration and expansion of the Indian gene pool. Lack of similar correlation with mitochondrial DNA clades indicated a shared genetic ancestry of females. We suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India.

  12. Genome-Wide Pathway Association Studies of Multiple Correlated Quantitative Phenotypes Using Principle Component Analyses

    PubMed Central

    Zhang, Feng; Guo, Xiong; Wu, Shixun; Han, Jing; Liu, Yongjun; Shen, Hui; Deng, Hong-Wen

    2012-01-01

    Genome-wide pathway association studies provide novel insight into the biological mechanism underlying complex diseases. Current pathway association studies primarily focus on single important disease phenotype, which is sometimes insufficient to characterize the clinical manifestations of complex diseases. We present a multi-phenotypes pathway association study(MPPAS) approach using principle component analysis(PCA). In our approach, PCA is first applied to multiple correlated quantitative phenotypes for extracting a set of orthogonal phenotypic components. The extracted phenotypic components are then used for pathway association analysis instead of original quantitative phenotypes. Four statistics were proposed for PCA-based MPPAS in this study. Simulations using the real data from the HapMap project were conducted to evaluate the power and type I error rates of PCA-based MPPAS under various scenarios considering sample sizes, additive and interactive genetic effects. A real genome-wide association study data set of bone mineral density (BMD) at hip and spine were also analyzed by PCA-based MPPAS. Simulation studies illustrated the performance of PCA-based MPPAS for identifying the causal pathways underlying complex diseases. Genome-wide MPPAS of BMD detected associations between BMD and KENNY_CTNNB1_TARGETS_UP as well as LONGEVITYPATHWAY pathways in this study. We aim to provide a applicable MPPAS approach, which may help to gain deep understanding the potential biological mechanism of association results for complex diseases. PMID:23285279

  13. Genome-wide association study in Chinese identifies novel loci for blood pressure and hypertension

    PubMed Central

    Lu, Xiangfeng; Wang, Laiyuan; Lin, Xu; Huang, Jianfeng; Charles Gu, C.; He, Meian; Shen, Hongbing; He, Jiang; Zhu, Jingwen; Li, Huaixing; Hixson, James E.; Wu, Tangchun; Dai, Juncheng; Lu, Ling; Shen, Chong; Chen, Shufeng; He, Lin; Mo, Zengnan; Hao, Yongchen; Mo, Xingbo; Yang, Xueli; Li, Jianxin; Cao, Jie; Chen, Jichun; Fan, Zhongjie; Li, Ying; Zhao, Liancheng; Li, Hongfan; Lu, Fanghong; Yao, Cailiang; Yu, Lin; Xu, Lihua; Mu, Jianjun; Wu, Xianping; Deng, Ying; Hu, Dongsheng; Zhang, Weidong; Ji, Xu; Guo, Dongshuang; Guo, Zhirong; Zhou, Zhengyuan; Yang, Zili; Wang, Renping; Yang, Jun; Zhou, Xiaoyang; Yan, Weili; Sun, Ningling; Gao, Pingjin; Gu, Dongfeng

    2015-01-01

    Hypertension is a common disorder and the leading risk factor for cardiovascular disease and premature deaths worldwide. Genome-wide association studies (GWASs) in the European population have identified multiple chromosomal regions associated with blood pressure, and the identified loci altogether explain only a small fraction of the variance for blood pressure. The differences in environmental exposures and genetic background between Chinese and European populations might suggest potential different pathways of blood pressure regulation. To identify novel genetic variants affecting blood pressure variation, we conducted a meta-analysis of GWASs of blood pressure and hypertension in 11 816 subjects followed by replication studies including 69 146 additional individuals. We identified genome-wide significant (P < 5.0 × 10−8) associations with blood pressure, which included variants at three new loci (CACNA1D, CYP21A2, and MED13L) and a newly discovered variant near SLC4A7. We also replicated 14 previously reported loci, 8 (CASZ1, MOV10, FGF5, CYP17A1, SOX6, ATP2B1, ALDH2, and JAG1) at genome-wide significance, and 6 (FIGN, ULK4, GUCY1A3, HFE, TBX3-TBX5, and TBX3) at a suggestive level of P = 1.81 × 10−3 to 5.16 × 10−8. These findings provide new mechanistic insights into the regulation of blood pressure and potential targets for treatments. PMID:25249183

  14. A twin study of breastfeeding with a preliminary genome wide association scan

    PubMed Central

    Colodro-Conde, L.; Zhu, G.; Power, R. A.; Henders, A.; Heath, A.C.; Madden, P.A.F.; Montgomery, G.W.; Medland, S. E.; Ordoñana, J.R.; Martin, N.G.

    2015-01-01

    Breastfeeding has been an important survival trait during human history, though it has long been recognised that individuals differ in their exact breastfeeding behaviour. Here our aims were, first, to explore to what extent genetic and environmental influences contributed to the individual differences in breastfeeding behaviour; second, to detect possible genetic variants related to breastfeeding; and lastly, to test if the genetic variants associated with breastfeeding have been previously found to be related with breast size. Data were collected from a large community-based cohort of Australian twins, with 3,364 women for the twin modelling analyses and 1,521 of them included in the genome wide association study. Monozygotic twin correlations (rMZ = .52, 95% CI .46 – .57) were larger than dizygotic twin correlations (rDZ = .35, 95% CI .25 – .43) and the best-fitting model was the one composed by additive genetics and unique environmental factors, explaining 53% and 47% of the variance in breastfeeding behaviour, respectively. No breastfeeding-related genetic variants reached genome-wide significance. The polygenic risk score analyses showed no significant results, suggesting breast size does not influence breastfeeding. This study confers a replication of a previous one exploring the sources of variance of breastfeeding and, to our knowledge, is the first one to conduct a Genome-Wide Association Study on breastfeeding and look at the overlap with variants for breast size. PMID:25475840

  15. Transcription, epigenetics and ameliorative strategies in Huntington's Disease: a genome-wide perspective.

    PubMed

    Valor, Luis M

    2015-02-01

    Transcriptional dysregulation in Huntington's disease (HD) is an early event that shapes the brain transcriptome by both the depletion and ectopic activation of gene products that eventually affect survival and neuronal functions. Disruption in the activity of gene expression regulators, such as transcription factors, chromatin-remodeling proteins, and noncoding RNAs, accounts for the expression changes observed in multiple animal and cellular models of HD and in samples from patients. Here, I review the recent advances in the study of HD transcriptional dysregulation and its causes to finally discuss the possible implications in ameliorative strategies from a genome-wide perspective. To date, the use of genome-wide approaches, predominantly based on microarray platforms, has been successful in providing an extensive catalog of differentially regulated genes, including biomarkers aimed at monitoring the progress of the pathology. Although still incipient, the introduction of combined next-generation sequencing techniques is enhancing our comprehension of the mechanisms underlying altered transcriptional dysregulation in HD by providing the first genomic landscapes associated with epigenetics and the occupancy of transcription factors. In addition, the use of genome-wide approaches is becoming more and more necessary to evaluate the efficacy and safety of ameliorative strategies and to identify novel mechanisms of amelioration that may help in the improvement of current preclinical therapeutics. Finally, the major conclusions obtained from HD transcriptomics studies have the potential to be extrapolated to other neurodegenerative disorders.

  16. Unraveling the Genetic Etiology of Adult Antisocial Behavior: A Genome-Wide Association Study

    PubMed Central

    Tielbeek, Jorim J.; Medland, Sarah E.; Benyamin, Beben; Byrne, Enda M.; Heath, Andrew C.; Madden, Pamela A. F.; Martin, Nicholas G.; Wray, Naomi R.; Verweij, Karin J. H.

    2012-01-01

    Crime poses a major burden for society. The heterogeneous nature of criminal behavior makes it difficult to unravel its causes. Relatively little research has been conducted on the genetic influences of criminal behavior. The few twin and adoption studies that have been undertaken suggest that about half of the variance in antisocial behavior can be explained by genetic factors. In order to identify the specific common genetic variants underlying this behavior, we conduct the first genome-wide association study (GWAS) on adult antisocial behavior. Our sample comprised a community sample of 4816 individuals who had completed a self-report questionnaire. No genetic polymorphisms reached genome-wide significance for association with adult antisocial behavior. In addition, none of the traditional candidate genes can be confirmed in our study. While not genome-wide significant, the gene with the strongest association (p-value = 8.7×10−5) was DYRK1A, a gene previously related to abnormal brain development and mental retardation. Future studies should use larger, more homogeneous samples to disentangle the etiology of antisocial behavior. Biosocial criminological research allows a more empirically grounded understanding of criminal behavior, which could ultimately inform and improve current treatment strategies. PMID:23077488

  17. Meta-analysis of sex-specific genome-wide association studies.

    PubMed

    Magi, Reedik; Lindgren, Cecilia M; Morris, Andrew P

    2010-12-01

    Despite the success of genome-wide association studies, much of the genetic contribution to complex human traits is still unexplained. One potential source of genetic variation that may contribute to this "missing heritability" is that which differs in magnitude and/or direction between males and females, which could result from sexual dimorphism in gene expression. Such sex-differentiated effects are common in model organisms, and are becoming increasingly evident in human complex traits through large-scale male- and female-specific meta-analyses. In this article, we review the methodology for meta-analysis of sex-specific genome-wide association studies, and propose a sex-differentiated test of association with quantitative or dichotomous traits, which allows for heterogeneity of allelic effects between males and females. We perform detailed simulations to compare the power of the proposed sex-differentiated meta-analysis with the more traditional "sex-combined" approach, which is ambivalent to gender. The results of this study highlight only a small loss in power for the sex-differentiated meta-analysis when the allelic effects of the causal variant are the same in males and females. However, over a range of models of heterogeneity in allelic effects between genders, our sex-differentiated meta-analysis strategy offers substantial gains in power, and thus has the potential to discover novel loci contributing effects to complex human traits with existing genome-wide association data.

  18. The impact of recent alcohol use on genome wide DNA methylation signatures.

    PubMed

    Philibert, Robert A; Plume, Jeffrey M; Gibbons, Frederick X; Brody, Gene H; Beach, Steven R H

    2012-01-01

    Chronic alcohol intake is associated with a wide variety of adverse health outcomes including depression, diabetes, and heart disease. Unfortunately, the molecular mechanisms through which these effects are conveyed are not clearly understood. To examine the potential role of epigenetic factors in this process, we examined the relationship of recent alcohol intake to genome wide methylation patterns using the Illumina 450 Methylation Bead Chip and lymphoblast DNA derived from 165 female subjects participating in the Iowa Adoption Studies. We found that the pattern of alcohol use over the 6-months immediately prior to phlebotomy was associated with, severity-dependent changes in the degree of genome wide methylation that preferentially hypermethylate the central portion of CpG islands with methylation at cg05600126, a probe in ABR, and the 5' untranslated region of BLCAP attaining genome wide significance in two point and sliding window analyses of probe methylation data, respectively. We conclude that recent alcohol use is associated with widespread changes in DNA methylation in women and that further study to confirm these findings and determine their relationship to somatic function are in order.

  19. Genome-Wide Meta-Analysis of Longitudinal Alcohol Consumption Across Youth and Early Adulthood.

    PubMed

    Adkins, Daniel E; Clark, Shaunna L; Copeland, William E; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A; Silberg, Judy; Brown, Tyson H; Fergusson, David M; Horwood, L John; Eaves, Lindon; van den Oord, Edwin J C G; Sullivan, Patrick F; Costello, E J

    2015-08-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse, and dependence increasing across adolescence and peaking in early adulthood. Here, we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Consumption-repeated measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and six others met our 'suggestive' criterion (FDR<0.2). Genome-wide significant associations were highly biological plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significant enriched associations to intuitive biological mechanisms, including neurotransmission, xenobiotic pharmacodynamics, and nuclear hormone receptors (NHR). These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies.

  20. Genome-wide detection of DNA double-stranded breaks induced by engineered nucleases.

    PubMed

    Frock, Richard L; Hu, Jiazhi; Meyers, Robin M; Ho, Yu-Jui; Kii, Erina; Alt, Frederick W

    2015-02-01

    Although great progress has been made in the characterization of the off-target effects of engineered nucleases, sensitive and unbiased genome-wide methods for the detection of off-target cleavage events and potential collateral damage are still lacking. Here we describe a linear amplification-mediated modification of a previously published high-throughput, genome-wide, translocation sequencing (HTGTS) method that robustly detects DNA double-stranded breaks (DSBs) generated by engineered nucleases across the human genome based on their translocation to other endogenous or ectopic DSBs. HTGTS with different Cas9:sgRNA or TALEN nucleases revealed off-target hotspot numbers for given nucleases that ranged from a few or none to dozens or more, and extended the number of known off-targets for certain previously characterized nucleases more than tenfold. We also identified translocations between bona fide nuclease targets on homologous chromosomes, an undesired collateral effect that has not been described previously. Finally, HTGTS confirmed that the Cas9D10A paired nickase approach suppresses off-target cleavage genome-wide.

  1. Quality control and quality assurance in genotypic data for genome-wide association studies

    PubMed Central

    Laurie, Cathy C.; Doheny, Kimberly F.; Mirel, Daniel B.; Pugh, Elizabeth W.; Bierut, Laura J.; Bhangale, Tushar; Boehm, Frederick; Caporaso, Neil E.; Cornelis, Marilyn C.; Edenberg, Howard J.; Gabriel, Stacy B.; Harris, Emily L.; Hu, Frank B.; Jacobs, Kevin; Kraft, Peter; Landi, Maria Teresa; Lumley, Thomas; Manolio, Teri A.; McHugh, Caitlin; Painter, Ian; Paschall, Justin; Rice, John P.; Rice, Kenneth M.; Zheng, Xiuwen; Weir, Bruce S.

    2011-01-01

    Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies. This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium (HWE) test p-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis (PCA) to SNP selection. The methods are illustrated with examples from the ‘Gene Environment Association Studies’ (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of genome-wide association studies. PMID:20718045

  2. Hematopoietic Transcriptional Mechanisms: From Locus-Specific to Genome-Wide Vantage Points

    PubMed Central

    DeVilbiss, Andrew W.; Sanalkumar, Rajendran; Johnson, Kirby D.; Keles, Sunduz; Bresnick, Emery H.

    2014-01-01

    Hematopoiesis is an exquisitely regulated process in which stem cells in the developing embryo and the adult generate progenitor cells that give rise to all blood lineages. Master regulatory transcription factors control hematopoiesis by integrating signals from the microenvironment and dynamically establishing and maintaining genetic networks. One of the most rudimentary aspects of cell type-specific transcription factor function, how they occupy a highly restricted cohort of cis-elements in chromatin remains poorly understood. Transformative technological advances involving the coupling of next-generation DNA sequencing technology with the chromatin immunoprecipitation assay (ChIP-seq) have enabled genome-wide mapping of factor occupancy patterns. However, formidable problems remain, notably ChIP-seq analysis yields hundreds to thousands of chromatin sites occupied by a given transcription factor, and only a fraction of the sites appear to be endowed with critical, non-redundant function. It has become en vogue to map transcription factor occupancy patterns genome-wide, while utilizing powerful statistical tools to establish correlations to inform biology and mechanisms. With the advent of revolutionary genome editing technologies, one can now reach beyond correlations to conduct definitive hypothesis testing. This review will focus on key discoveries that have emerged during the path from single loci to genome-wide analyses, specifically in the context of hematopoietic transcriptional mechanisms. PMID:24816274

  3. Estimating genome-wide gene networks using nonparametric Bayesian network models on massively parallel computers.

    PubMed

    Tamada, Yoshinori; Imoto, Seiya; Araki, Hiromitsu; Nagasaki, Masao; Print, Cristin; Charnock-Jones, D Stephen; Miyano, Satoru

    2011-01-01

    We present a novel algorithm to estimate genome-wide gene networks consisting of more than 20,000 genes from gene expression data using nonparametric Bayesian networks. Due to the difficulty of learning Bayesian network structures, existing algorithms cannot be applied to more than a few thousand genes. Our algorithm overcomes this limitation by repeatedly estimating subnetworks in parallel for genes selected by neighbor node sampling. Through numerical simulation, we confirmed that our algorithm outperformed a heuristic algorithm in a shorter time. We applied our algorithm to microarray data from human umbilical vein endothelial cells (HUVECs) treated with siRNAs, to construct a human genome-wide gene network, which we compared to a small gene network estimated for the genes extracted using a traditional bioinformatics method. The results showed that our genome-wide gene network contains many features of the small network, as well as others that could not be captured during the small network estimation. The results also revealed master-regulator genes that are not in the small network but that control many of the genes in the small network. These analyses were impossible to realize without our proposed algorithm.

  4. Genome-wide association study identifies novel loci predisposing to cutaneous melanoma†

    PubMed Central

    Amos, Christopher I.; Wang, Li-E; Lee, Jeffrey E.; Gershenwald, Jeffrey E.; Chen, Wei V.; Fang, Shenying; Kosoy, Roman; Zhang, Mingfeng; Qureshi, Abrar A.; Vattathil, Selina; Schacherer, Christopher W.; Gardner, Julie M.; Wang, Yuling; Tim Bishop, D.; Barrett, Jennifer H.; MacGregor, Stuart; Hayward, Nicholas K.; Martin, Nicholas G.; Duffy, David L.; Mann, Graham J.; Cust, Anne; Hopper, John; Brown, Kevin M.; Grimm, Elizabeth A.; Xu, Yaji; Han, Younghun; Jing, Kaiyan; McHugh, Caitlin; Laurie, Cathy C.; Doheny, Kim F.; Pugh, Elizabeth W.; Seldin, Michael F.; Han, Jiali; Wei, Qingyi

    2011-01-01

    We performed a multistage genome-wide association study of melanoma. In a discovery cohort of 1804 melanoma cases and 1026 controls, we identified loci at chromosomes 15q13.1 (HERC2/OCA2 region) and 16q24.3 (MC1R) regions that reached genome-wide significance within this study and also found strong evidence for genetic effects on susceptibility to melanoma from markers on chromosome 9p21.3 in the p16/ARF region and on chromosome 1q21.3 (ARNT/LASS2/ANXA9 region). The most significant single-nucleotide polymorphisms (SNPs) in the 15q13.1 locus (rs1129038 and rs12913832) lie within a genomic region that has profound effects on eye and skin color; notably, 50% of variability in eye color is associated with variation in the SNP rs12913832. Because eye and skin colors vary across European populations, we further evaluated the associations of the significant SNPs after carefully adjusting for European substructure. We also evaluated the top 10 most significant SNPs by using data from three other genome-wide scans. Additional in silico data provided replication of the findings from the most significant region on chromosome 1q21.3 rs7412746 (P = 6 × 10−10). Together, these data identified several candidate genes for additional studies to identify causal variants predisposing to increased risk for developing melanoma. PMID:21926416

  5. Host cell factors in HIV replication: meta-analysis of genome-wide studies.

    PubMed

    Bushman, Frederic D; Malani, Nirav; Fernandes, Jason; D'Orso, Iván; Cagney, Gerard; Diamond, Tracy L; Zhou, Honglin; Hazuda, Daria J; Espeseth, Amy S; König, Renate; Bandyopadhyay, Sourav; Ideker, Trey; Goff, Stephen P; Krogan, Nevan J; Frankel, Alan D; Young, John A T; Chanda, Sumit K

    2009-05-01

    We have analyzed host cell genes linked to HIV replication that were identified in nine genome-wide studies, including three independent siRNA screens. Overlaps among the siRNA screens were very modest (<7% for any pairwise combination), and similarly, only modest overlaps were seen in pairwise comparisons with other types of genome-wide studies. Combining all genes from the genome-wide studies together with genes reported in the literature to affect HIV yields 2,410 protein-coding genes, or fully 9.5% of all human genes (though of course some of these are false positive calls). Here we report an "encyclopedia" of all overlaps between studies (available at http://www.hostpathogen.org), which yielded a more extensively corroborated set of host factors assisting HIV replication. We used these genes to calculate refined networks that specify cellular subsystems recruited by HIV to assist in replication, and present additional analysis specifying host cell genes that are attractive as potential therapeutic targets.

  6. Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia

    PubMed Central

    Ripke, Stephan; O'Dushlaine, Colm; Chambert, Kimberly; Moran, Jennifer L; Kähler, Anna K; Akterin, Susanne; Bergen, Sarah; Collins, Ann L; Crowley, James J; Fromer, Menachem; Kim, Yunjung; Lee, Sang Hong; Magnusson, Patrik KE; Sanchez, Nick; Stahl, Eli A; Williams, Stephanie; Wray, Naomi R; Xia, Kai; Bettella, Francesco; Børglum, Anders D; Bulik-Sullivan, Brendan K; Cormican, Paul; Craddock, Nick; de Leeuw, Christiaan; Durmishi, Naser; Gill, Michael; Golimbet, Vera; Hamshere, Marian L; Holmans, Peter; Hougaard, David M; Kendler, Kenneth S; Lin, Kuang; Morris, Derek W; Mors, Ole; Mortensen, Preben B; Neale, Benjamin M; O'Neill, Francis A; Owen, Michael J; Milovancevic, MilicaPejovic; Posthuma, Danielle; Powell, John; Richards, Alexander L; Riley, Brien P; Ruderfer, Douglas; Rujescu, Dan; Sigurdsson, Engilbert; Silagadze, Teimuraz; Smit, August B; Stefansson, Hreinn; Steinberg, Stacy; Suvisaari, Jaana; Tosato, Sarah; Verhage, Matthijs; Walters, James T; Bramon, Elvira; Corvin, Aiden P; O'Donovan, Michael C; Stefansson, Kari; Scolnick, Edward; Purcell, Shaun; McCarroll, Steve; Sklar, Pamela; Hultman, Christina M; Sullivan, Patrick F

    2013-01-01

    Schizophrenia is a heritable disorder with substantial public health impact. We conducted a multi-stage genome-wide association study (GWAS) for schizophrenia beginning with a Swedish national sample (5,001 cases, 6,243 controls) followed by meta-analysis with prior schizophrenia GWAS (8,832 cases, 12,067 controls) and finally by replication of SNPs in 168 genomic regions in independent samples (7,413 cases, 19,762 controls, and 581 trios). In total, 22 regions met genome-wide significance (14 novel and one previously implicated in bipolar disorder). The results strongly implicate calcium signaling in the etiology of schizophrenia, and include genome-wide significant results for CACNA1C and CACNB2 whose protein products interact. We estimate that ∼8,300 independent and predominantly common SNPs contribute to risk for schizophrenia and that these collectively account for most of its heritability. Common genetic variation plays an important role in the etiology of schizophrenia, and larger studies will allow more detailed understanding of this devastating disorder. PMID:23974872

  7. A Genome-Wide Association Study of a Biomarker of Nicotine Metabolism

    PubMed Central

    Loukola, Anu; Buchwald, Jadwiga; Gupta, Richa; Palviainen, Teemu; Hällfors, Jenni; Tikkanen, Emmi; Korhonen, Tellervo; Ollikainen, Miina; Sarin, Antti-Pekka; Ripatti, Samuli; Lehtimäki, Terho; Raitakari, Olli; Salomaa, Veikko; Rose, Richard J.; Tyndale, Rachel F.; Kaprio, Jaakko

    2015-01-01

    Individuals with fast nicotine metabolism typically smoke more and thus have a greater risk for smoking-induced diseases. Further, the efficacy of smoking cessation pharmacotherapy is dependent on the rate of nicotine metabolism. Our objective was to use nicotine metabolite ratio (NMR), an established biomarker of nicotine metabolism rate, in a genome-wide association study (GWAS) to identify novel genetic variants influencing nicotine metabolism. A heritability estimate of 0.81 (95% CI 0.70–0.88) was obtained for NMR using monozygotic and dizygotic twins of the FinnTwin cohort. We performed a GWAS in cotinine-verified current smokers of three Finnish cohorts (FinnTwin, Young Finns Study, FINRISK2007), followed by a meta-analysis of 1518 subjects, and annotated the genome-wide significant SNPs with methylation quantitative loci (meQTL) analyses. We detected association on 19q13 with 719 SNPs exceeding genome-wide significance within a 4.2 Mb region. The strongest evidence for association emerged for CYP2A6 (min p = 5.77E-86, in intron 4), the main metabolic enzyme for nicotine. Other interesting genes with genome-wide significant signals included CYP2B6, CYP2A7, EGLN2, and NUMBL. Conditional analyses revealed three independent signals on 19q13, all located within or in the immediate vicinity of CYP2A6. A genetic risk score constructed using the independent signals showed association with smoking quantity (p = 0.0019) in two independent Finnish samples. Our meQTL results showed that methylation values of 16 CpG sites within the region are affected by genotypes of the genome-wide significant SNPs, and according to causal inference test, for some of the SNPs the effect on NMR is mediated through methylation. To our knowledge, this is the first GWAS on NMR. Our results enclose three independent novel signals on 19q13.2. The detected CYP2A6 variants explain a strikingly large fraction of variance (up to 31%) in NMR in these study samples. Further, we provide evidence

  8. Population Stratification in the Context of Diverse Epidemiologic Surveys Sans Genome-Wide Data

    PubMed Central

    Oetjens, Matthew T.; Brown-Gentry, Kristin; Goodloe, Robert; Dilks, Holli H.; Crawford, Dana C.

    2016-01-01

    Population stratification or confounding by genetic ancestry is a potential cause of false associations in genetic association studies. Estimation of and adjustment for genetic ancestry has become common practice thanks in part to the availability of ancestry informative markers on genome-wide association study (GWAS) arrays. While array data is now widespread, these data are not ubiquitous as several large epidemiologic and clinic-based studies lack genome-wide data. One such large epidemiologic-based study lacking genome-wide data accessible to investigators is the National Health and Nutrition Examination Surveys (NHANES), population-based cross-sectional surveys of Americans linked to demographic, health, and lifestyle data conducted by the Centers for Disease Control and Prevention. DNA samples (n = 14,998) were extracted from biospecimens from consented NHANES participants between 1991–1994 (NHANES III, phase 2) and 1999–2002 and represent three major self-identified racial/ethnic groups: non-Hispanic whites (n = 6,634), non-Hispanic blacks (n = 3,458), and Mexican Americans (n = 3,950). We as the Epidemiologic Architecture for Genes Linked to Environment study genotyped candidate gene and GWAS-identified index variants in NHANES as part of the larger Population Architecture using Genomics and Epidemiology I study for collaborative genetic association studies. To enable basic quality control such as estimation of genetic ancestry to control for population stratification in NHANES san genome-wide data, we outline here strategies that use limited genetic data to identify the markers optimal for characterizing genetic ancestry. From among 411 and 295 autosomal SNPs available in NHANES III and NHANES 1999–2002, we demonstrate that markers with ancestry information can be identified to estimate global ancestry. Despite limited resolution, global genetic ancestry is highly correlated with self-identified race for the majority of participants, although less so

  9. Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies

    PubMed Central

    2014-01-01

    Summary Background The epilepsies are a clinically heterogeneous group of neurological disorders. Despite strong evidence for heritability, genome-wide association studies have had little success in identification of risk loci associated with epilepsy, probably because of relatively small sample sizes and insufficient power. We aimed to identify risk loci through meta-analyses of genome-wide association studies for all epilepsy and the two largest clinical subtypes (genetic generalised epilepsy and focal epilepsy). Methods We combined genome-wide association data from 12 cohorts of individuals with epilepsy and controls from population-based datasets. Controls were ethnically matched with cases. We phenotyped individuals with epilepsy into categories of genetic generalised epilepsy, focal epilepsy, or unclassified epilepsy. After standardised filtering for quality control and imputation to account for different genotyping platforms across sites, investigators at each site conducted a linear mixed-model association analysis for each dataset. Combining summary statistics, we conducted fixed-effects meta-analyses of all epilepsy, focal epilepsy, and genetic generalised epilepsy. We set the genome-wide significance threshold at p<1·66 × 10−8. Findings We included 8696 cases and 26 157 controls in our analysis. Meta-analysis of the all-epilepsy cohort identified loci at 2q24.3 (p=8·71 × 10−10), implicating SCN1A, and at 4p15.1 (p=5·44 × 10−9), harbouring PCDH7, which encodes a protocadherin molecule not previously implicated in epilepsy. For the cohort of genetic generalised epilepsy, we noted a single signal at 2p16.1 (p=9·99 × 10−9), implicating VRK2 or FANCL. No single nucleotide polymorphism achieved genome-wide significance for focal epilepsy. Interpretation This meta-analysis describes a new locus not previously implicated in epilepsy and provides further evidence about the genetic architecture of these disorders, with the

  10. A Genome-Wide Association Study of a Biomarker of Nicotine Metabolism.

    PubMed

    Loukola, Anu; Buchwald, Jadwiga; Gupta, Richa; Palviainen, Teemu; Hällfors, Jenni; Tikkanen, Emmi; Korhonen, Tellervo; Ollikainen, Miina; Sarin, Antti-Pekka; Ripatti, Samuli; Lehtimäki, Terho; Raitakari, Olli; Salomaa, Veikko; Rose, Richard J; Tyndale, Rachel F; Kaprio, Jaakko

    2015-01-01

    Individuals with fast nicotine metabolism typically smoke more and thus have a greater risk for smoking-induced diseases. Further, the efficacy of smoking cessation pharmacotherapy is dependent on the rate of nicotine metabolism. Our objective was to use nicotine metabolite ratio (NMR), an established biomarker of nicotine metabolism rate, in a genome-wide association study (GWAS) to identify novel genetic variants influencing nicotine metabolism. A heritability estimate of 0.81 (95% CI 0.70-0.88) was obtained for NMR using monozygotic and dizygotic twins of the FinnTwin cohort. We performed a GWAS in cotinine-verified current smokers of three Finnish cohorts (FinnTwin, Young Finns Study, FINRISK2007), followed by a meta-analysis of 1518 subjects, and annotated the genome-wide significant SNPs with methylation quantitative loci (meQTL) analyses. We detected association on 19q13 with 719 SNPs exceeding genome-wide significance within a 4.2 Mb region. The strongest evidence for association emerged for CYP2A6 (min p = 5.77E-86, in intron 4), the main metabolic enzyme for nicotine. Other interesting genes with genome-wide significant signals included CYP2B6, CYP2A7, EGLN2, and NUMBL. Conditional analyses revealed three independent signals on 19q13, all located within or in the immediate vicinity of CYP2A6. A genetic risk score constructed using the independent signals showed association with smoking quantity (p = 0.0019) in two independent Finnish samples. Our meQTL results showed that methylation values of 16 CpG sites within the region are affected by genotypes of the genome-wide significant SNPs, and according to causal inference test, for some of the SNPs the effect on NMR is mediated through methylation. To our knowledge, this is the first GWAS on NMR. Our results enclose three independent novel signals on 19q13.2. The detected CYP2A6 variants explain a strikingly large fraction of variance (up to 31%) in NMR in these study samples. Further, we provide evidence

  11. DamID-seq: Genome-wide Mapping of Protein-DNA Interactions by High Throughput Sequencing of Adenine-methylated DNA Fragments.

    PubMed

    Wu, Feinan; Olson, Brennan G; Yao, Jie

    2016-01-27

    The DNA adenine methyltransferase identification (DamID) assay is a powerful method to detect protein-DNA interactions both locally and genome-wide. It is an alternative approach to chromatin immunoprecipitation (ChIP). An expressed fusion protein consisting of the protein of interest and the E. coli DNA adenine methyltransferase can methylate the adenine base in GATC motifs near the sites of protein-DNA interactions. Adenine-methylated DNA fragments can then be specifically amplified and detected. The original DamID assay detects the genomic locations of methylated DNA fragments by hybridization to DNA microarrays, which is limited by the availability of microarrays and the density of predetermined probes. In this paper, we report the detailed protocol of integrating high throughput DNA sequencing into DamID (DamID-seq). The large number of short reads generated from DamID-seq enables detecting and localizing protein-DNA interactions genome-wide with high precision and sensitivity. We have used the DamID-seq assay to study genome-nuclear lamina (NL) interactions in mammalian cells, and have noticed that DamID-seq provides a high resolution and a wide dynamic range in detecting genome-NL interactions. The DamID-seq approach enables probing NL associations within gene structures and allows comparing genome-NL interaction maps with other functional genomic data, such as ChIP-seq and RNA-seq.

  12. Genome-wide siRNA screen of genes regulating the LPS-induced NF-κB and TNF-α responses in mouse macrophages

    PubMed Central

    Li, Ning; Katz, Samuel; Dutta, Bhaskar; Benet, Zachary L.; Sun, Jing; Fraser, Iain D.C.

    2017-01-01

    The mammalian innate immune system senses many bacterial stimuli through the toll-like receptor (TLR) family. Activation of the TLR4 receptor by bacterial lipopolysaccharide (LPS) is the most widely studied TLR pathway due to its central role in host responses to gram-negative bacterial infection and its contribution to endotoxemia and sepsis. Here we describe a genome-wide siRNA screen to identify genes regulating the mouse macrophage TNF-α and NF-κB responses to LPS. We include a secondary validation screen conducted with six independent siRNAs per gene to facilitate removal of off-target screen hits. We also provide microarray data from the same LPS-treated macrophage cells to facilitate downstream data analysis. These data provide a resource for analyzing gene function in the predominant pathway driving inflammatory signaling and cytokine expression in mouse macrophages. PMID:28248925

  13. Genome-wide functional analysis of CREB/long-term memory-dependent transcription reveals distinct basal and memory gene expression programs.

    PubMed

    Lakhina, Vanisha; Arey, Rachel N; Kaletsky, Rachel; Kauffman, Amanda; Stein, Geneva; Keyes, William; Xu, Daniel; Murphy, Coleen T

    2015-01-21

    Induced CREB activity is a hallmark of long-term memory, but the full repertoire of CREB transcriptional targets required specifically for memory is not known in any system. To obtain a more complete picture of the mechanisms involved in memory, we combined memory training with genome-wide transcriptional analysis of C. elegans CREB mutants. This approach identified 757 significant CREB/memory-induced targets and confirmed the involvement of known memory genes from other organisms, but also suggested new mechanisms and novel components that may be conserved through mammals. CREB mediates distinct basal and memory transcriptional programs at least partially through spatial restriction of CREB activity: basal targets are regulated primarily in nonneuronal tissues, while memory targets are enriched for neuronal expression, emanating from CREB activity in AIM neurons. This suite of novel memory-associated genes will provide a platform for the discovery of orthologous mammalian long-term memory components.

  14. Genome-Wide Association of the Laboratory-Based Nicotine Metabolite Ratio in Three Ancestries

    PubMed Central

    Baurley, James W.; Edlund, Christopher K.; Pardamean, Carissa I.; Conti, David V.; Krasnow, Ruth; Javitz, Harold S.; Hops, Hyman; Swan, Gary E.; Benowitz, Neal L.

    2016-01-01

    Introduction: Metabolic enzyme variation and other patient and environmental characteristics influence smoking behaviors, treatment success, and risk of related disease. Population-specific variation in metabolic genes contributes to challenges in developing and optimizing pharmacogenetic interventions. We applied a custom genome-wide genotyping array for addiction research (Smokescreen), to three laboratory-based studies of nicotine metabolism with oral or venous administration of labeled nicotine and cotinine, to model nicotine metabolism in multiple populations. The trans-3′-hydroxycotinine/cotinine ratio, the nicotine metabolite ratio (NMR), was the nicotine metabolism measure analyzed. Methods: Three hundred twelve individuals of self-identified European, African, and Asian American ancestry were genotyped and included in ancestry-specific genome-wide association scans (GWAS) and a meta-GWAS analysis of the NMR. We modeled natural-log transformed NMR with covariates: principal components of genetic ancestry, age, sex, body mass index, and smoking status. Results: African and Asian American NMRs were statistically significantly (P values ≤ 5E-5) lower than European American NMRs. Meta-GWAS analysis identified 36 genome-wide significant variants over a 43 kilobase pair region at CYP2A6 with minimum P = 2.46E-18 at rs12459249, proximal to CYP2A6. Additional minima were located in intron 4 (rs56113850, P = 6.61E-18) and in the CYP2A6-CYP2A7 intergenic region (rs34226463, P = 1.45E-12). Most (34/36) genome-wide significant variants suggested reduced CYP2A6 activity; functional mechanisms were identified and tested in knowledge-bases. Conditional analysis resulted in intergenic variants of possible interest (P values < 5E-5). Conclusions: This meta-GWAS of the NMR identifies CYP2A6 variants, replicates the top-ranked single nucleotide polymorphism from a recent Finnish meta-GWAS of the NMR, identifies functional mechanisms, and provides pan

  15. Genome Wide Association for Addiction: Replicated Results and Comparisons of Two Analytic Approaches

    PubMed Central

    Drgon, Tomas; Zhang, Ping-Wu; Johnson, Catherine; Walther, Donna; Hess, Judith; Nino, Michelle; Uhl, George R.

    2010-01-01

    Background Vulnerabilities to dependence on addictive substances are substantially heritable complex disorders whose underlying genetic architecture is likely to be polygenic, with modest contributions from variants in many individual genes. “Nontemplate” genome wide association (GWA) approaches can identity groups of chromosomal regions and genes that, taken together, are much more likely to contain allelic variants that alter vulnerability to substance dependence than expected by chance. Methodology/Principal Findings We report pooled “nontemplate” genome-wide association studies of two independent samples of substance dependent vs control research volunteers (n = 1620), one European-American and the other African-American using 1 million SNP (single nucleotide polymorphism) Affymetrix genotyping arrays. We assess convergence between results from these two samples using two related methods that seek clustering of nominally-positive results and assess significance levels with Monte Carlo and permutation approaches. Both “converge then cluster” and “cluster then converge” analyses document convergence between the results obtained from these two independent datasets in ways that are virtually never found by chance. The genes identified in this fashion are also identified by individually-genotyped dbGAP data that compare allele frequencies in cocaine dependent vs control individuals. Conclusions/Significance These overlapping results identify small chromosomal regions that are also identified by genome wide data from studies of other relevant samples to extents much greater than chance. These chromosomal regions contain more genes related to “cell adhesion” processes than expected by chance. They also contain a number of genes that encode potential targets for anti-addiction pharmacotherapeutics. “Nontemplate” GWA approaches that seek chromosomal regions in which nominally-positive associations are found in multiple independent samples are

  16. A novel bayesian graphical model for genome-wide multi-SNP association mapping.

    PubMed

    Zhang, Yu

    2012-01-01

    Most disease association mapping algorithms are based on hypothesis testing procedures that test one variant at a time. Those methods lose power when the disease mutations are jointly tagged by multiple variants, or when gene-gene interaction exist. Nearby variants are also correlated, for which procedures ignoring the dependence between variants will inevitably produce redundant results. With a large number of variants genotyped in current genome-wide disease association studies, simultaneous multivariant association mapping algorithms are strongly desired. We present a novel Bayesian method for automatic detection of multivariant joint association in genome-wide case-control studies. Our method has improved power and specificity over existing tools. We fit a joint probabilistic model to the entire data and identify disease variants simultaneously. The method dynamically accounts for the strong linkage disequilibrium (LD) between variants. As a result, only the primary disease variants will be identified, with all secondary associations due to LD effects filtered out. Our method better pinpoints the disease variants with improved resolution. The method is also computationally efficient for genome-wide studies. When applied to a real data set of inflammatory bowel disease (IBD) containing 401,473 variants in 4,720 individuals, our method detected all previously reported IBD loci in the same data, and recovered two missed loci. We further detected two novel interchromosome interactions. The first is between STAT3 and PARD6G, and the second is between DLG5 and an intergenic region at 5p14. We further validated the two interactions in an independent study.

  17. Genome-wide Meta-analysis on the Sense of Smell Among US Older Adults.

    PubMed

    Dong, Jing; Yang, Jingyun; Tranah, Greg; Franceschini, Nora; Parimi, Neeta; Alkorta-Aranburu, Gorka; Xu, Zongli; Alonso, Alvaro; Cummings, Steven R; Fornage, Myriam; Huang, Xuemei; Kritchevsky, Stephen; Liu, Yongmei; London, Stephanie; Niu, Liang; Wilson, Robert S; De Jager, Philip L; Yu, Lei; Singleton, Andrew B; Harris, Tamara; Mosley, Thomas H; Pinto, Jayant M; Bennett, David A; Chen, Honglei

    2015-11-01

    Olfactory dysfunction is common among older adults and affects their safety, nutrition, quality of life, and mortality. More importantly, the decreased sense of smell is an early symptom of neurodegenerative diseases such as Parkinson disease (PD) and Alzheimer disease. However, the genetic determinants for the sense of smell have been poorly investigated. We here performed the first genome-wide meta-analysis on the sense of smell among 6252 US older adults of European descent from the Atherosclerosis Risk in Communities (ARIC) study, the Health, Aging, and Body Composition (Health ABC) study, and the Religious Orders Study and the Rush Memory and Aging Project (ROS/MAP). Genome-wide association study analysis was performed first by individual cohorts and then meta-analyzed using fixed-effect models with inverse variance weights. Although no SNPs reached genome-wide statistical significance, we identified 13 loci with suggestive evidence for an association with the sense of smell (Pmeta < 1 × 10). Of these, 2 SNPs at chromosome 17q21.31 (rs199443 in NSF, P = 3.02 × 10; and rs2732614 in KIAA1267-LRRC37A, P = 6.65 × 10) exhibited cis effects on the expression of microtubule-associated protein tau (MAPT, 17q21.31) in 447 frontal-cortex samples obtained postmortem and profiled by RNA-seq (P < 1 × 10). Gene-based and pathway-enrichment analyses further implicated MAPT in regulating the sense of smell in older adults. Similar results were obtained after excluding participants who reported a physician-diagnosed PD or use of PD medications. In conclusion, we provide preliminary evidence that the MAPT locus may play a role in regulating the sense of smell in older adults and therefore offer a potential genetic link between poor sense of smell and major neurodegenerative diseases.

  18. Genome-wide landscape of liver X receptor chromatin binding and gene regulation in human macrophages

    PubMed Central

    2012-01-01

    Background The liver X receptors (LXRs) are oxysterol sensing nuclear receptors with multiple effects on metabolism and immune cells. However, the complete genome-wide cistrome of LXR in cells of human origin has not yet been provided. Results We performed ChIP-seq in phorbol myristate acetate-differentiated THP-1 cells (macrophage-type) after stimulation with the potent synthetic LXR ligand T0901317 (T09). Microarray gene expression analysis was performed in the same cellular model. We identified 1357 genome-wide LXR locations (FDR < 1%), of which 526 were observed after T09 treatment. De novo analysis of LXR binding sequences identified a DR4-type element as the major motif. On mRNA level T09 up-regulated 1258 genes and repressed 455 genes. Our results show that LXR actions are focused on 112 genomic regions that contain up to 11 T09 target genes per region under the control of highly stringent LXR binding sites with individual constellations for each region. We could confirm that LXR controls lipid metabolism and transport and observed a strong association with apoptosis-related functions. Conclusions This first report on genome-wide binding of LXR in a human cell line provides new insights into the transcriptional network of LXR and its target genes with their link to physiological processes, such as apoptosis. The gene expression microarray and sequence data have been submitted collectively to the NCBI Gene Expression Omnibus http://www.ncbi.nlm.nih.gov/geo under accession number GSE28319. PMID:22292898

  19. Genome-wide analysis of zygotic linkage disequilibrium and its components in crossbred cattle

    PubMed Central

    2012-01-01

    Background Linkage disequilibrium (LD) between genes at linked or independent loci can occur at gametic and zygotic levels known asgametic LD and zygotic LD, respectively. Gametic LD is well known for its roles in fine-scale mapping of quantitative trait loci, genomic selection and evolutionary inference. The less-well studied is the zygotic LD and its components that can be also estimated directly from the unphased SNPs. Results This study was set up to investigate the genome-wide extent and patterns of zygotic LD and its components in a crossbred cattle population using the genomic data from the Illumina BovineSNP50 beadchip. The animal population arose from repeated crossbreeding of multiple breeds and selection for growth and cow reproduction. The study showed that similar genomic structures in gametic and zygotic LD were observed, with zygotic LD decaying faster than gametic LD over marker distance. The trigenic and quadrigenic disequilibria were generally two- to three-fold smaller than the usual digenic disequilibria (gametic or composite LD). There was less power of testing for these high-order genic disequilibria than for the digenic disequilibria. The power estimates decreased with the marker distance between markers though the decay trend is more obvious for the digenic disequilibria than for high-order disequilibria. Conclusions This study is the first major genome-wide survey of all non-allelic associations between pairs of SNPs in a cattle population. Such analysis allows us to assess the relative importance of gametic LD vs. all other non-allelic genic LDs regardless of whether or not the population is in HWE. The observed predominance of digenic LD (gametic or composite LD) coupled with insignificant high-order trigenic and quadrigenic disequilibria supports the current intensive focus on the use of high-density SNP markers for genome-wide association studies and genomic selection activities in the cattle population. PMID:22827586

  20. Genome-Wide Association Study Reveals Multiple Loci Influencing Normal Human Facial Morphology

    PubMed Central

    Raffensperger, Zachary D.; Heike, Carrie L.; Cunningham, Michael L.; Hecht, Jacqueline T.; Kau, Chung How; Moreno, Lina M.; Wehby, George L.; Murray, Jeffrey C.; Laurie, Cecelia A.; Laurie, Cathy C.; Santorico, Stephanie; Klein, Ophir; Feingold, Eleanor; Hallgrimsson, Benedikt; Spritz, Richard A.; Marazita, Mary L.; Weinberg, Seth M.

    2016-01-01

    Numerous lines of evidence point to a genetic basis for facial morphology in humans, yet little is known about how specific genetic variants relate to the phenotypic expression of many common facial features. We conducted genome-wide association meta-analyses of 20 quantitative facial measurements derived from the 3D surface images of 3118 healthy individuals of European ancestry belonging to two US cohorts. Analyses were performed on just under one million genotyped SNPs (Illumina OmniExpress+Exome v1.2 array) imputed to the 1000 Genomes reference panel (Phase 3). We observed genome-wide significant associations (p < 5 x 10−8) for cranial base width at 14q21.1 and 20q12, intercanthal width at 1p13.3 and Xq13.2, nasal width at 20p11.22, nasal ala length at 14q11.2, and upper facial depth at 11q22.1. Several genes in the associated regions are known to play roles in craniofacial development or in syndromes affecting the face: MAFB, PAX9, MIPOL1, ALX3, HDAC8, and PAX1. We also tested genotype-phenotype associations reported in two previous genome-wide studies and found evidence of replication for nasal ala length and SNPs in CACNA2D3 and PRDM16. These results provide further evidence that common variants in regions harboring genes of known craniofacial function contribute to normal variation in human facial features. Improved understanding of the genes associated with facial morphology in healthy individuals can provide insights into the pathways and mechanisms controlling normal and abnormal facial morphogenesis. PMID:27560520

  1. Genome-wide single nucleotide polymorphisms reveal population history and adaptive divergence in wild guppies.

    PubMed

    Willing, Eva-Maria; Bentzen, Paul; van Oosterhout, Cock; Hoffmann, Margarete; Cable, Joanne; Breden, Felix; Weigel, Detlef; Dreyer, Christine

    2010-03-01

    Adaptation of guppies (Poecilia reticulata) to contrasting upland and lowland habitats has been extensively studied with respect to behaviour, morphology and life history traits. Yet population history has not been studied at the whole-genome level. Although single nucleotide polymorphisms (SNPs) are the most abundant form of variation in many genomes and consequently very informative for a genome-wide picture of standing natural variation in populations, genome-wide SNP data are rarely available for wild vertebrates. Here we use genetically mapped SNP markers to comprehensively survey genetic variation within and among naturally occurring guppy populations from a wide geographic range in Trinidad and Venezuela. Results from three different clustering methods, Neighbor-net, principal component analysis (PCA) and Bayesian analysis show that the population substructure agrees with geographic separation and largely with previously hypothesized patterns of historical colonization. Within major drainages (Caroni, Oropouche and Northern), populations are genetically similar, but those in different geographic regions are highly divergent from one another, with some indications of ancient shared polymorphisms. Clear genomic signatures of a previous introduction experiment were seen, and we detected additional potential admixture events. Headwater populations were significantly less heterozygous than downstream populations. Pairwise F(ST) values revealed marked differences in allele frequencies among populations from different regions, and also among populations within the same region. F(ST) outlier methods indicated some regions of the genome as being under directional selection. Overall, this study demonstrates the power of a genome-wide SNP data set to inform for studies on natural variation, adaptation and evolution of wild populations.

  2. Genome-wide association study of personality traits in the long life family study.

    PubMed

    Bae, Harold T; Sebastiani, Paola; Sun, Jenny X; Andersen, Stacy L; Daw, E Warwick; Terracciano, Antonio; Ferrucci, Luigi; Perls, Thomas T

    2013-01-01

    Personality traits have been shown to be associated with longevity and healthy aging. In order to discover novel genetic modifiers associated with personality traits as related with longevity, we performed a genome-wide association study (GWAS) on personality factors assessed by NEO-five-factor inventory in individuals enrolled in the Long Life Family Study (LLFS), a study of 583 families (N up to 4595) with clustering for longevity in the United States and Denmark. Three SNPs, in almost perfect LD, associated with agreeableness reached genome-wide significance (p < 10(-8)) and replicated in an additional sample of 1279 LLFS subjects, although one (rs9650241) failed to replicate and the other two were not available in two independent replication cohorts, the Baltimore Longitudinal Study of Aging and the New England Centenarian Study. Based on 10,000,000 permutations, the empirical p-value of 2 × 10(-7) was observed for the genome-wide significant SNPs. Seventeen SNPs that reached marginal statistical significance in the two previous GWASs (p-value <10(-4) and 10(-5)), were also marginally significantly associated in this study (p-value <0.05), although none of the associations passed the Bonferroni correction. In addition, we tested age-by-SNP interactions and found some significant associations. Since scores of personality traits in LLFS subjects change in the oldest ages, and genetic factors outweigh environmental factors to achieve extreme ages, these age-by-SNP interactions could be a proxy for complex gene-gene interactions affecting personality traits and longevity.

  3. Genome-wide evidence for speciation with gene flow in Heliconius butterflies

    PubMed Central

    Martin, Simon H.; Dasmahapatra, Kanchon K.; Nadeau, Nicola J.; Salazar, Camilo; Walters, James R.; Simpson, Fraser; Blaxter, Mark; Manica, Andrea; Mallet, James; Jiggins, Chris D.

    2013-01-01

    Most speciation events probably occur gradually, without complete and immediate reproductive isolation, but the full extent of gene flow between diverging species has rarely been characterized on a genome-wide scale. Documenting the extent and timing of admixture between diverging species can clarify the role of geographic isolation in speciation. Here we use new methodology to quantify admixture at different stages of divergence in Heliconius butterflies, based on whole-genome sequences of 31 individuals. Comparisons between sympatric and allopatric populations of H. melpomene, H. cydno, and H. timareta revealed a genome-wide trend of increased shared variation in sympatry, indicative of pervasive interspecific gene flow. Up to 40% of 100-kb genomic windows clustered by geography rather than by species, demonstrating that a very substantial fraction of the genome has been shared between sympatric species. Analyses of genetic variation shared over different time intervals suggested that admixture between these species has continued since early in speciation. Alleles shared between species during recent time intervals displayed higher levels of linkage disequilibrium than those shared over longer time intervals, suggesting that this admixture took place at multiple points during divergence and is probably ongoing. The signal of admixture was significantly reduced around loci controlling divergent wing patterns, as well as throughout the Z chromosome, consistent with strong selection for Müllerian mimicry and with known Z-linked hybrid incompatibility. Overall these results show that species divergence can occur in the face of persistent and genome-wide admixture over long periods of time. PMID:24045163

  4. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    DOE PAGES

    Bendall, Matthew L.; Stevens, Sarah L.R.; Chan, Leong-Keat; ...

    2016-01-08

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Using a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of genemore » gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. Furthermore, these patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model’ of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Finally, evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment.« less

  5. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity.

    PubMed

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P; Nir, Talia M; Toga, Arthur W; Jack, Clifford R; Saykin, Andrew J; Green, Robert C; Weiner, Michael W; Medland, Sarah E; Montgomery, Grant W; Hansell, Narelle K; McMahon, Katie L; de Zubicaray, Greig I; Martin, Nicholas G; Wright, Margaret J; Thompson, Paul M

    2013-03-19

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer's disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain's connectivity pattern, allowing us to discover genetic variants that affect the human brain's wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer's disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases.

  6. Genome-wide estimates of coancestry and inbreeding in a closed herd of ancient Iberian pigs.

    PubMed

    Saura, María; Fernández, Almudena; Rodríguez, M Carmen; Toro, Miguel A; Barragán, Carmen; Fernández, Ana I; Villanueva, Beatriz

    2013-01-01

    Maintaining genetic variation and controlling the increase in inbreeding are crucial requirements in animal conservation programs. The most widely accepted strategy for achieving these objectives is to maximize the effective population size by minimizing the global coancestry obtained from a particular pedigree. However, for most natural or captive populations genealogical information is absent. In this situation, microsatellites have been traditionally the markers of choice to characterize genetic variation, and several estimators of genealogical coefficients have been developed using marker data, with unsatisfactory results. The development of high-throughput genotyping techniques states the necessity of reviewing the paradigm that genealogical coancestry is the best parameter for measuring genetic diversity. In this study, the Illumina PorcineSNP60 BeadChip was used to obtain genome-wide estimates of rates of coancestry and inbreeding and effective population size for an ancient strain of Iberian pigs that is now in serious danger of extinction and for which very accurate genealogical information is available (the Guadyerbas strain). Genome-wide estimates were compared with those obtained from microsatellite and from pedigree data. Estimates of coancestry and inbreeding computed from the SNP chip were strongly correlated with genealogical estimates and these correlations were substantially higher than those between microsatellite and genealogical coefficients. Also, molecular coancestry computed from SNP information was a better predictor of genealogical coancestry than coancestry computed from microsatellites. Rates of change in coancestry and inbreeding and effective population size estimated from molecular data were very similar to those estimated from genealogical data. However, estimates of effective population size obtained from changes in coancestry or inbreeding differed. Our results indicate that genome-wide information represents a useful alternative

  7. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    PubMed Central

    Bendall, Matthew L; Stevens, Sarah LR; Chan, Leong-Keat; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Froula, Jeff; Kang, Dongwan; Tringe, Susannah G; Bertilsson, Stefan; Moran, Mary A; Shade, Ashley; Newton, Ryan J; McMahon, Katherine D; Malmstrom, Rex R

    2016-01-01

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Here, from a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. These patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model' of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment. PMID:26744812

  8. Power comparison of admixture mapping and direct association analysis in genome-wide association studies.

    PubMed

    Qin, Huaizhen; Zhu, Xiaofeng

    2012-04-01

    When dense markers are available, one can interrogate almost every common variant across the genome via imputation and single nucleotide polymorphism (SNP) test, which has become a routine in current genome-wide association studies (GWASs). As a complement, admixture mapping exploits the long-range linkage disequilibrium (LD) generated by admixture between genetically distinct ancestral populations. It is then questionable whether admixture mapping analysis is still necessary in detecting the disease associated variants in admixed populations. We argue that admixture mapping is able to reduce the burden of massive comparisons in GWASs; it therefore can be a powerful tool to locate the disease variants with substantial allele frequency differences between ancestral populations. In this report we studied a two-stage approach, where candidate regions are defined by conducting admixture mapping at stage 1, and single SNP association tests are followed at stage 2 within the candidate regions defined at stage 1. We first established the genome-wide significance levels corresponding to the criteria to define the candidate regions at stage 1 by simulations. We next compared the power of the two-stage approach with direct association analysis. Our simulations suggest that the two-stage approach can be more powerful than the standard genome-wide association analysis when the allele frequency difference of a causal variant in ancestral populations, is larger than 0.4. Our conclusion is consistent with a theoretical prediction by Risch and Tang ([2006] Am J Hum Genet 79:S254). Surprisingly, our study also suggests that power can be improved when we use less strict criteria to define the candidate regions at stage 1.

  9. Identification of genome-wide variations among three elite restorer lines for hybrid-rice.

    PubMed

    Li, Shuangcheng; Wang, Shiquan; Deng, Qiming; Zheng, Aiping; Zhu, Jun; Liu, Huainian; Wang, Lingxia; Gao, Fengyan; Zou, Ting; Huang, Bin; Cao, Xuemei; Xu, Lizhi; Yu, Chuang; Ai, Peng; Li, Ping

    2012-01-01

    Rice restorer lines play an important role in three-line hybrid rice production. Previous research based on molecular tagging has suggested that the restorer lines used widely today have narrow genetic backgrounds. However, patterns of genetic variation at a genome-wide scale in these restorer lines remain largely unknown. The present study performed re-sequencing and genome-wide variation analysis of three important representative restorer lines, namely, IR24, MH63, and SH527, using the Solexa sequencing technology. With the genomic sequence of the Indica cultivar 9311 as the reference, the following genetic features were identified: 267,383 single-nucleotide polymorphisms (SNPs), 52,847 insertion/deletion polymorphisms (InDels), and 3,286 structural variations (SVs) in the genome of IR24; 288,764 SNPs, 59,658 InDels, and 3,226 SVs in MH63; and 259,862 SNPs, 55,500 InDels, and 3,127 SVs in SH527. Variations between samples were also determined by comparative analysis of authentic collections of SNPs, InDels, and SVs, and were functionally annotated. Furthermore, variations in several important genes were also surveyed by alignment analysis in these lines. Our results suggest that genetic variations among these lines, although far lower than those reported in the landrace population, are greater than expected, indicating a complicated genetic basis for the phenotypic diversity of the restorer lines. Identification of genome-wide variation and pattern analysis among the restorer lines will facilitate future genetic studies and the molecular improvement of hybrid rice.

  10. Genome-Wide Association Study in Immunocompetent Patients with Delayed Hypersensitivity to Sulfonamide Antimicrobials

    PubMed Central

    Motsinger-Reif, Alison; Dickey, Allison; Yale, Steven; Trepanier, Lauren A.

    2016-01-01

    Background Hypersensitivity (HS) reactions to sulfonamide antibiotics occur uncommonly, but with potentially severe clinical manifestations. A familial predisposition to sulfonamide HS is suspected, but robust predictive genetic risk factors have yet to be identified. Strongly linked genetic polymorphisms have been used clinically as screening tests for other HS reactions prior to administration of high-risk drugs. Objective The purpose of this study was to evaluate for genetic risk of sulfonamide HS in the immunocompetent population using genome-wide association. Methods Ninety-one patients with symptoms after trimethoprim-sulfamethoxazole (TMP-SMX) attributable to “probable” drug HS based on medical record review and the Naranjo Adverse Drug Reaction Probability Scale, and 184 age- and sex-matched patients who tolerated a therapeutic course of TMP-SMX, were included in a genome-wide association study using both common and rare variant techniques. Additionally, two subgroups of HS patients with a more refined clinical phenotype (fever and rash; or fever, rash and eosinophilia) were evaluated separately. Results For the full dataset, no single nucleotide polymorphisms were suggestive of or reached genome-wide significance in the common variant analysis, nor was any genetic locus significant in the rare variant analysis. A single, possible gene locus association (COL12A1) was identified in the rare variant analysis for patients with both fever and rash, but the sample size was very small in this subgroup (n = 16), and this may be a false positive finding. No other significant associations were found for the subgroups. Conclusions No convincing genetic risk factors for sulfonamide HS were identified in this population. These negative findings may be due to challenges in accurately confirming the phenotype in exanthematous drug eruptions, or to unidentified gene-environment interactions influencing sulfonamide HS. PMID:27272151

  11. Longitudinal Genome-Wide Association of Cardiovascular Disease Risk Factors in the Bogalusa Heart Study

    PubMed Central

    Smith, Erin N.; Chen, Wei; Kähönen, Mika; Kettunen, Johannes; Lehtimäki, Terho; Peltonen, Leena; Raitakari, Olli T.; Salem, Rany M.; Schork, Nicholas J.; Shaw, Marian; Srinivasan, Sathanur R.; Topol, Eric J.; Viikari, Jorma S.; Berenson, Gerald S.; Murray, Sarah S.

    2010-01-01

    Cardiovascular disease (CVD) is the leading cause of death worldwide. Recent genome-wide association (GWA) studies have pinpointed many loci associated with CVD risk factors in adults. It is unclear, however, if these loci predict trait levels at all ages, if they are associated with how a trait develops over time, or if they could be used to screen individuals who are pre-symptomatic to provide the opportunity for preventive measures before disease onset. We completed a genome-wide association study on participants in the longitudinal Bogalusa Heart Study (BHS) and have characterized the association between genetic factors and the development of CVD risk factors from childhood to adulthood. We report 7 genome-wide significant associations involving CVD risk factors, two of which have been previously reported. Top regions were tested for replication in the Young Finns Study (YF) and two associations strongly replicated: rs247616 in CETP with HDL levels (combined P = 9.7×10−24), and rs445925 at APOE with LDL levels (combined P = 8.7×10−19). We show that SNPs previously identified in adult cross-sectional studies tend to show age-independent effects in the BHS with effect sizes consistent with previous reports. Previously identified variants were associated with adult trait levels above and beyond those seen in childhood; however, variants with time-dependent effects were also promising predictors. This is the first GWA study to evaluate the role of common genetic variants in the development of CVD risk factors in children as they advance through adulthood and highlights the utility of using longitudinal studies to identify genetic predictors of adult traits in children. PMID:20838585

  12. Genome-wide transcriptional profiling reveals molecular signatures of secondary xylem differentiation in Populus tomentosa.

    PubMed

    Yang, X H; Li, X G; Li, B L; Zhang, D Q

    2014-11-11

    Wood formation occurs via cell division, primary cell wall and secondary wall formation, and programmed cell death in the vascular cambium. Transcriptional profiling of secondary xylem differentiation is essential for understanding the molecular mechanisms underlying wood formation. Differential gene expression in secondary xylem differentiation of Populus has been previously investigated using cDNA microarray analysis. However, little is known about the molecular mechanisms from a genome-wide perspective. In this study, the Affymetrix poplar genome chips containing 61,413 probes were used to investigate the changes in the transcriptome during secondary xylem differentiation in Chinese white poplar (Populus tomentosa). Two xylem tissues (newly formed and lignified) were sampled for genome-wide transcriptional profiling. In total, 6843 genes (~11%) were identified with differential expression in the two xylem tissues. Many genes involved in cell division, primary wall modification, and cellulose synthesis were preferentially expressed in the newly formed xylem. In contrast, many genes, including 4-coumarate:cinnamate-4-hydroxylase (C4H), 4-coumarate:CoA ligase (4CL), cinnamyl alcohol dehydrogenase (CAD), and caffeoyl CoA 3-O-methyltransferase (CCoAOMT), associated with lignin biosynthesis were more transcribed in the lignified xylem. The two xylem tissues also showed differential expression of genes related to various hormones; thus, the secondary xylem differentiation could be regulated by hormone signaling. Furthermore, many transcription factor genes were preferentially expressed in the lignified xylem, suggesting that wood lignification involves extensive transcription regulation. The genome-wide transcriptional profiling of secondary xylem differentiation could provide additional insights into the molecular basis of wood formation in poplar species.

  13. Inference of gene regulatory networks from genome-wide knockout fitness data

    PubMed Central

    Wang, Liming; Wang, Xiaodong; Arkin, Adam P.; Samoilov, Michael S.

    2013-01-01

    Motivation: Genome-wide fitness is an emerging type of high-throughput biological data generated for individual organisms by creating libraries of knockouts, subjecting them to broad ranges of environmental conditions, and measuring the resulting clone-specific fitnesses. Since fitness is an organism-scale measure of gene regulatory network behaviour, it may offer certain advantages when insights into such phenotypical and functional features are of primary interest over individual gene expression. Previous works have shown that genome-wide fitness data can be used to uncover novel gene regulatory interactions, when compared with results of more conventional gene expression analysis. Yet, to date, few algorithms have been proposed for systematically using genome-wide mutant fitness data for gene regulatory network inference. Results: In this article, we describe a model and propose an inference algorithm for using fitness data from knockout libraries to identify underlying gene regulatory networks. Unlike most prior methods, the presented approach captures not only structural, but also dynamical and non-linear nature of biomolecular systems involved. A state–space model with non-linear basis is used for dynamically describing gene regulatory networks. Network structure is then elucidated by estimating unknown model parameters. Unscented Kalman filter is used to cope with the non-linearities introduced in the model, which also enables the algorithm to run in on-line mode for practical use. Here, we demonstrate that the algorithm provides satisfying results for both synthetic data as well as empirical measurements of GAL network in yeast Saccharomyces cerevisiae and TyrR–LiuR network in bacteria Shewanella oneidensis. Availability: MATLAB code and datasets are available to download at http://www.duke.edu/∼lw174/Fitness.zip and http://genomics.lbl.gov/supplemental/fitness-bioinf/ Contact: wangx@ee.columbia.edu or mssamoilov@lbl.gov Supplementary information

  14. Genome-wide disruption of 5-hydroxymethylcytosine in a mouse model of autism

    PubMed Central

    Papale, Ligia A.; Zhang, Qi; Li, Sisi; Chen, Kailei; Keleş, Sündüz; Alisch, Reid S.

    2015-01-01

    The autism spectrum disorders (ASD) comprise a broad group of behaviorally related neurodevelopmental disorders affecting as many as 1 in 68 children. The hallmarks of ASD consist of impaired social and communication interactions, pronounced repetitive behaviors and restricted patterns of interests. Family, twin and epidemiological studies suggest a polygenetic and epistatic susceptibility model involving the interaction of many genes; however, the etiology of ASD is likely to be complex and include both epigenetic and environmental factors. 5-hydroxymethylcytosine (5hmC) is a novel environmentally sensitive DNA modification that is highly enriched in post-mitotic neurons and is associated with active transcription of neuronal genes. Here, we used an established chemical labeling and affinity purification method coupled with high-throughput sequencing technology to generate a genome-wide profile of striatal 5hmC in an autism mouse model (Cntnap2−/− mice) and found that at 9 weeks of age the Cntnap2−/− mice have a genome-wide disruption in 5hmC, primarily in genic regions and repetitive elements. Annotation of differentially hydroxymethylated regions (DhMRs) to genes revealed a significant overlap with known ASD genes (e.g. Nrxn1 and Reln) that carried an enrichment of neuronal ontological functions, including axonogenesis and neuron projection morphogenesis. Finally, sequence motif predictions identified associations with transcription factors that have a high correlation with important genes in neuronal developmental and functional pathways. Together, our data implicate a role for 5hmC-mediated epigenetic modulation in the pathogenesis of autism and represent a critical step toward understanding the genome-wide molecular consequence of the Cntnap2 mutation, which results in an autism-like phenotype. PMID:26423458

  15. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity

    PubMed Central

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P.; Nir, Talia M.; Toga, Arthur W.; Jack, Clifford R.; Saykin, Andrew J.; Green, Robert C.; Weiner, Michael W.; Medland, Sarah E.; Montgomery, Grant W.; Hansell, Narelle K.; McMahon, Katie L.; de Zubicaray, Greig I.; Martin, Nicholas G.; Wright, Margaret J.; Thompson, Paul M.; Weiner, Michael; Aisen, Paul; Weiner, Michael; Aisen, Paul; Petersen, Ronald; Jack, Clifford R.; Jagust, William; Trojanowski, John Q.; Toga, Arthur W.; Beckett, Laurel; Green, Robert C.; Saykin, Andrew J.; Morris, John; Liu, Enchi; Green, Robert C.; Montine, Tom; Petersen, Ronald; Aisen, Paul; Gamst, Anthony; Thomas, Ronald G.; Donohue, Michael; Walter, Sarah; Gessert, Devon; Sather, Tamie; Beckett, Laurel; Harvey, Danielle; Gamst, Anthony; Donohue, Michael; Kornak, John; Jack, Clifford R.; Dale, Anders; Bernstein, Matthew; Felmlee, Joel; Fox, Nick; Thompson, Paul; Schuff, Norbert; Alexander, Gene; DeCarli, Charles; Jagust, William; Bandy, Dan; Koeppe, Robert A.; Foster, Norm; Reiman, Eric M.; Chen, Kewei; Mathis, Chet; Morris, John; Cairns, Nigel J.; Taylor-Reinwald, Lisa; Trojanowki, J.Q.; Shaw, Les; Lee, Virginia M.Y.; Korecka, Magdalena; Toga, Arthur W.; Crawford, Karen; Neu, Scott; Saykin, Andrew J.; Foroud, Tatiana M.; Potkin, Steven; Shen, Li; Khachaturian, Zaven; Frank, Richard; Snyder, Peter J.; Molchan, Susan; Kaye, Jeffrey; Quinn, Joseph; Lind, Betty; Dolen, Sara; Schneider, Lon S.; Pawluczyk, Sonia; Spann, Bryan M.; Brewer, James; Vanderswag, Helen; Heidebrink, Judith L.; Lord, Joanne L.; Petersen, Ronald; Johnson, Kris; Doody, Rachelle S.; Villanueva-Meyer, Javier; Chowdhury, Munir; Stern, Yaakov; Honig, Lawrence S.; Bell, Karen L.; Morris, John C.; Ances, Beau; Carroll, Maria; Leon, Sue; Mintun, Mark A.; Schneider, Stacy; Marson, Daniel; Griffith, Randall; Clark, David; Grossman, Hillel; Mitsis, Effie; Romirowsky, Aliza; deToledo-Morrell, Leyla; Shah, Raj C.; Duara, Ranjan; Varon, Daniel; Roberts, Peggy; Albert, Marilyn; Onyike, Chiadi; Kielb, Stephanie; Rusinek, Henry; de Leon, Mony J.; Glodzik, Lidia; De Santi, Susan; Doraiswamy, P. Murali; Petrella, Jeffrey R.; Coleman, R. Edward; Arnold, Steven E.; Karlawish, Jason H.; Wolk, David; Smith, Charles D.; Jicha, Greg; Hardy, Peter; Lopez, Oscar L.; Oakley, MaryAnn; Simpson, Donna M.; Porsteinsson, Anton P.; Goldstein, Bonnie S.; Martin, Kim; Makino, Kelly M.; Ismail, M. Saleem; Brand, Connie; Mulnard, Ruth A.; Thai, Gaby; Mc-Adams-Ortiz, Catherine; Womack, Kyle; Mathews, Dana; Quiceno, Mary; Diaz-Arrastia, Ramon; King, Richard; Weiner, Myron; Martin-Cook, Kristen; DeVous, Michael; Levey, Allan I.; Lah, James J.; Cellar, Janet S.; Burns, Jeffrey M.; Anderson, Heather S.; Swerdlow, Russell H.; Apostolova, Liana; Lu, Po H.; Bartzokis, George; Silverman, Daniel H.S.; Graff-Radford, Neill R.; Parfitt, Francine; Johnson, Heather; Farlow, Martin R.; Hake, Ann Marie; Matthews, Brandy R.; Herring, Scott; van Dyck, Christopher H.; Carson, Richard E.; MacAvoy, Martha G.; Chertkow, Howard; Bergman, Howard; Hosein, Chris; Black, Sandra; Stefanovic, Bojana; Caldwell, Curtis; Hsiung, Ging-Yuek Robin; Feldman, Howard; Mudge, Benita; Assaly, Michele; Kertesz, Andrew; Rogers, John; Trost, Dick; Bernick, Charles; Munic, Donna; Kerwin, Diana; Mesulam, Marek-Marsel; Lipowski, Kristina; Wu, Chuang-Kuo; Johnson, Nancy; Sadowsky, Carl; Martinez, Walter; Villena, Teresa; Turner, Raymond Scott; Johnson, Kathleen; Reynolds, Brigid; Sperling, Reisa A.; Johnson, Keith A.; Marshall, Gad; Frey, Meghan; Yesavage, Jerome; Taylor, Joy L.; Lane, Barton; Rosen, Allyson; Tinklenberg, Jared; Sabbagh, Marwan; Belden, Christine; Jacobson, Sandra; Kowall, Neil; Killiany, Ronald; Budson, Andrew E.; Norbash, Alexander; Johnson, Patricia Lynn; Obisesan, Thomas O.; Wolday, Saba; Bwayo, Salome K.; Lerner, Alan; Hudson, Leon; Ogrocki, Paula; Fletcher, Evan; Carmichael, Owen; Olichney, John; DeCarli, Charles; Kittur, Smita; Borrie, Michael; Lee, T.-Y.; Bartha, Rob; Johnson, Sterling; Asthana, Sanjay; Carlsson, Cynthia M.; Potkin, Steven G.; Preda, Adrian; Nguyen, Dana; Tariot, Pierre; Fleisher, Adam; Reeder, Stephanie; Bates, Vernice; Capote, Horacio; Rainka, Michelle; Scharre, Douglas W.; Kataki, Maria; Zimmerman, Earl A.; Celmins, Dzintra; Brown, Alice D.; Pearlson, Godfrey D.; Blank, Karen; Anderson, Karen; Saykin, Andrew J.; Santulli, Robert B.; Schwartz, Eben S.; Sink, Kaycee M.; Williamson, Jeff D.; Garg, Pradeep; Watkins, Franklin; Ott, Brian R.; Querfurth, Henry; Tremont, Geoffrey; Salloway, Stephen; Malloy, Paul; Correia, Stephen; Rosen, Howard J.; Miller, Bruce L.; Mintzer, Jacobo; Longmire, Crystal Flynn; Spicer, Kenneth; Finger, Elizabeth; Rachinsky, Irina; Rogers, John; Kertesz, Andrew; Drost, Dick

    2013-01-01

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer’s disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain’s connectivity pattern, allowing us to discover genetic variants that affect the human brain’s wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer’s disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases. PMID:23471985

  16. A Pilot Genome-Wide Association Study Identifies Potential Metabolic Pathways Involved in Tinnitus

    PubMed Central

    Gilles, Annick; Van Camp, Guy; Van de Heyning, Paul; Fransen, Erik

    2017-01-01

    Tinnitus, the perception of an auditory phantom sound in the form of ringing, buzzing, roaring, or hissing in the absence of an external sound source, is perceived by ~15% of the population and 2.5% experiences a severely bothersome tinnitus. The contribution of genes on the development of tinnitus is still under debate. The current manuscript reports a pilot Genome Wide Association Study (GWAS) into tinnitus, in a small cohort of 167 independent tinnitus subjects, and 749 non-tinnitus controls, who were collected as part of a cross-sectional study. After genotyping, imputation, and quality checking, the association between the tinnitus phenotype and 4,000,000 single-nucleotide polymorphisms (SNPs) was tested followed by gene set enrichment analysis. None of the SNPs reached the threshold for genome-wide significance (p < 5.0e–8), with the most significant SNPs, situated outside coding genes, reaching a p-value of 3.4e–7. By using the Genetic Analysis of Complex Traits (GACT) software, the percentage of the variance explained by all SNPs in the GWAS was estimated to be 3.2%, indicating that additive genetic effects explain only a small fraction of the tinnitus phenotype. Despite the lack of genome-wide significant SNPs, which is, at least in part, due to the limited sample size of the current study, evidence was found for a genetic involvement in tinnitus. Gene set enrichment analysis showed several metabolic pathways to be significantly enriched with SNPs having a low p-value in the GWAS. These pathways are involved in oxidative stress, endoplasmatic reticulum (ER) stress, and serotonin reception mediated signaling. These results are a promising basis for further research into the genetic basis of tinnitus, including GWAS with larger sample sizes and considering tinnitus subtypes for which a greater genetic contribution is more likely. PMID:28303087

  17. Disruptive selection without genome-wide evolution across a migratory divide.

    PubMed

    von Rönn, Jan A C; Shafer, Aaron B A; Wolf, Jochen B W

    2016-06-01

    Transcontinental migration is a fascinating example of how animals can respond to climatic oscillation. Yet, quantitative data on fitness components are scarce, and the resulting population genetic consequences are poorly understood. Migratory divides, hybrid zones with a transition in migratory behaviour, provide a natural setting to investigate the micro-evolutionary dynamics induced by migration under sympatric conditions. Here, we studied the effects of migratory programme on survival, trait evolution and genome-wide patterns of population differentiation in a migratory divide of European barn swallows. We sampled a total of 824 individuals from both allopatric European populations wintering in central and southern Africa, respectively, along with two mixed populations from within the migratory divide. While most morphological characters varied by latitude consistent with Bergmann's rule, wing length co-varied with distance to wintering grounds. Survival data collected during a 5-year period provided strong evidence that this covariance is repeatedly generated by disruptive selection against intermediate phenotypes. Yet, selection-induced divergence did not translate into genome-wide genetic differentiation as assessed by microsatellites, mtDNA and >20 000 genome-wide SNP markers; nor did we find evidence of local genomic selection between migratory types. Among breeding populations, a single outlier locus mapped to the BUB1 gene with a role in mitotic and meiotic organization. Overall, this study provides evidence for an adaptive response to variation in migration behaviour continuously eroded by gene flow under current conditions of nonassortative mating. It supports the theoretical prediction that population differentiation is difficult to achieve under conditions of gene flow despite measurable disruptive selection.

  18. Genome-wide association scan in psoriasis: new insights into chronic inflammatory disease.

    PubMed

    Schrodi, Steven J

    2008-09-01

    Evaluation of: Liu Y, Helms C, Liao W et al. A genome-wide association study of psoriasis and psoriatic arthritis identifies new disease loci. PLoS Genet. 4, e1000041 (2008). Genome-wide association scans have delivered on their promise of revealing susceptibility polymorphisms underlying common diseases. This comprehensive psoriasis study by Liu and colleagues reports confirmation of previously identified genes (HLA-C, IL12B and IL23R), identifies several novel psoriasis loci and is the first to report psoriatic arthritis association on a genome-wide scale. Along with other recent studies, this work gives further evidence that IL-23-mediated signaling is a key component of both psoriasis and psoriatic arthritis pathogenesis. Importantly, this study provides evidence of a single-nucleotide polymorphism (SNP), 35 kb upstream of HLA-C, which is stronger than Cw 0602 - the variant traditionally attributed to the MHC-linked psoriasis-susceptibility effect. Within this region, the authors also discovered an independent SNP with very strong predisposing effects. SNPs in the COG6 region and the USP8-TNFAIP8l3 region are among the novel psoriasis associations reported. In addition, a region showing linkage on chromosome 1q demonstrated association in the epidermal differentiation complex. Four SNPs over a 439-kb region on chromosome 4q27, where KIAA1109, ADAD1 and two cytokine-encoding genes (IL2 and IL21) reside, exhibit intriguing correlation with psoriatic arthritis, although the signal strength is moderate. These results, while still preliminary, may substantially expand our knowledge of psoriasis and psoriatic arthritis genetics, opening new avenues of chronic inflammatory disease research.

  19. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    SciTech Connect

    Bendall, Matthew L.; Stevens, Sarah L.R.; Chan, Leong-Keat; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Froula, Jeff; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary A.; Shade, Ashley; Newton, Ryan J.; McMahon, Katherine D.; Malmstrom, Rex R.

    2016-01-08

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Using a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. Furthermore, these patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model’ of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Finally, evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment.

  20. A Genome-wide Association Study Identifies LIPA as a Susceptibility Gene for Coronary Artery Disease

    PubMed Central

    Wild, Philipp S; Zeller, Tanja; Schillert, Arne; Szymczak, Silke; Sinning, Christoph R; Deiseroth, Arne; Schnabel, Renate B; Lubos, Edith; Keller, Till; Eleftheriadis, Medea S; Bickel, Christoph; Rupprecht, Hans J; Wilde, Sandra; Rossmann, Heidi; Diemert, Patrick; Cupples, L Adrienne; Perret, Claire; Erdmann, Jeanette; Stark, Klaus; Kleber, Marcus E; Epstein, Stephen E; Voight, Benjamin F; Kuulasmaa, Kari; Li, Mingyao; Schäfer, Arne S; Klopp, Norman; Braund, Peter S; Sager, Hendrik B; Demissie, Serkalem; Proust, Carole; König, Inke R; Wichmann, Heinz-Erich; Reinhard, Wibke; Hoffmann, Michael M; Virtamo, Jarmo; Burnett, Mary Susan; Siscovick, David; Wiklund, Per Gunnar; Qu, Liming; El Mokthari, Nour Eddine; Thompson, John R; Peters, Annette; Smith, Albert V; Yon, Emmanuelle; Baumert, Jens; Hengstenberg, Christian; März, Winfried; Amouyel, Philippe; Devaney, Joseph; Schwartz, Stephen M; Saarela, Olli; Mehta, Nehal N; Rubin, Diana; Silander, Kaisa; Hall, Alistair S; Ferrieres, Jean; Harris, Tamara B; Melander, Olle; Kee, Frank; Hakonarson, Hakon; Schrezenmeir, Juergen; Gudnason, Vilmundur; Elosua, Roberto; Arveiler, Dominique; Evans, Alun; Rader, Daniel J; Illig, Thomas; Schreiber, Stefan; Bis, Joshua C; Altshuler, David; Kavousi, Maryam; Witteman, Jaqueline CM; Uitterlinden, Andre G; Hofman, Albert; Folsom, Aaron R; Barbalic, Maja; Boerwinkle, Eric; Kathiresan, Sekar; Reilly, Muredach P; O'Donnell, Christopher J; Samani, Nilesh J; Schunkert, Heribert; Cambien, Francois; Lackner, Karl J; Tiret, Laurence; Salomaa, Veikko; Munzel, Thomas; Ziegler, Andreas; Blankenberg, Stefan

    2011-01-01

    Background eQTL analyses are important to improve the understanding of genetic association results. Here, we performed a genome-wide association and global gene expression study to identify functionally relevant variants affecting the risk of coronary artery disease (CAD). Methods and Results In a genome-wide association analysis of 2,078 CAD cases and 2,953 controls, we identified 950 single nucleotide polymorphisms (SNPs) that were associated with CAD at P<10-3. Subsequent in silico and wet-lab replication stages and a final meta-analysis of 21,428 CAD cases and 38,361 controls revealed a novel association signal at chromosome 10q23.31 within the LIPA (Lysosomal Acid Lipase A) gene (P=3.7×10-8; OR 1.1; 95% CI: 1.07-1.14). The association of this locus with global gene expression was assessed by genome-wide expression analyses in the monocyte transcriptome of 1,494 individuals. The results showed a strong association of this locus with expression of the LIPA transcript (P=1.3×10-96). An assessment of LIPA SNPs and transcript with cardiovascular phenotypes revealed an association of LIPA transcript levels with impaired endothelial function (P=4.4×10-3). Conclusions The use of data on genetic variants and the addition of data on global monocytic gene expression led to the identification of the novel functional CAD susceptibility locus LIPA, located on chromosome 10q23.31. The respective eSNPs associated with CAD strongly affect LIPA gene expression level, which itself was related to endothelial dysfunction, a precursor of CAD. PMID:21606135

  1. Genome-wide association study in German patients with attention deficit/hyperactivity disorder.

    PubMed

    Hinney, Anke; Scherag, André; Jarick, Ivonne; Albayrak, Özgür; Pütter, Carolin; Pechlivanis, Sonali; Dauvermann, Maria R; Beck, Sebastian; Weber, Heike; Scherag, Susann; Nguyen, Trang T; Volckmar, Anna-Lena; Knoll, Nadja; Faraone, Stephen V; Neale, Benjamin M; Franke, Barbara; Cichon, Sven; Hoffmann, Per; Nöthen, Markus M; Schreiber, Stefan; Jöckel, Karl-Heinz; Wichmann, H-Erich; Freitag, Christine; Lempp, Thomas; Meyer, Jobst; Gilsbach, Susanne; Herpertz-Dahlmann, Beate; Sinzig, Judith; Lehmkuhl, Gerd; Renner, Tobias J; Warnke, Andreas; Romanos, Marcel; Lesch, Klaus-Peter; Reif, Andreas; Schimmelmann, Benno G; Hebebrand, Johannes

    2011-12-01

    The heritability of attention deficit hyperactivity disorder (ADHD) is approximately 0.8. Despite several larger scale attempts, genome-wide association studies (GWAS) have not led to the identification of significant results. We performed a GWAS based on 495 German young patients with ADHD (according to DSM-IV criteria; Human660W-Quadv1; Illumina, San Diego, CA) and on 1,300 population-based adult controls (HumanHap550v3; Illumina). Some genes neighboring the single nucleotide polymorphisms (SNPs) with the lowest P-values (best P-value: 8.38 × 10(-7)) have potential relevance for ADHD (e.g., glutamate receptor, metabotropic 5 gene, GRM5). After quality control, the 30 independent SNPs with the lowest P-values (P-values ≤ 7.57 × 10(-5) ) were chosen for confirmation. Genotyping of these SNPs in up to 320 independent German families comprising at least one child with ADHD revealed directionally consistent effect-size point estimates for 19 (10 not consistent) of the SNPs. In silico analyses of the 30 SNPs in the largest meta-analysis so far (2,064 trios, 896 cases, and 2,455 controls) revealed directionally consistent effect-size point estimates for 16 SNPs (11 not consistent). None of the combined analyses revealed a genome-wide significant result. SNPs in previously described autosomal candidate genes did not show significantly lower P-values compared to SNPs within random sets of genes of the same size. We did not find genome-wide significant results in a GWAS of German children with ADHD compared to controls. The second best SNP is located in an intron of GRM5, a gene located within a recently described region with an infrequent copy number variation in patients with ADHD.

  2. Utilizing twins as controls for non-twin case-materials in genome wide association studies.

    PubMed

    Ganna, Andrea; Ortega-Alonso, Alfredo; Havulinna, Aki; Salomaa, Veikko; Kaprio, Jaakko; Pedersen, Nancy L; Sullivan, Patrick F; Ingelsson, Erik; Hultman, Christina M; Magnusson, Patrik K E

    2013-01-01

    Twin registries around the globe have collected DNA samples from large numbers of monozygotic and dizygotic twins. The twin sample collections are frequently used as controls in disease-specific studies together with non-twins. This approach is unbiased under the hypothesis that twins and singletons are comparable in terms of allele frequencies; i.e. there are no genetic variants associated with being a twin per se. To test this hypothesis we performed a genome-wide association study comparing the allele frequency of 572,352 single nucleotide polymorphisms (SNPs) in 1,413 monozygotic (MZ) and 5,451 dizygotic (DZ) twins with 3,720 healthy singletons. Twins and singletons have been genotyped using the same platform. SNPs showing association with being a twin at P-value < 1 × 10(-5) were selected for replication analysis in 1,492 twins (463 MZ and 1,029 DZ) and 1,880 singletons from Finland. No SNPs reached genome-wide significance (P-value < 5 × 10(-8)) in the main analysis combining MZ and DZ twins. In a secondary analysis including only DZ twins two SNPs (rs2033541 close to ADAMTSL1 and rs4149283 close to ABCA1) were genome-wide significant after meta-analysis with the Finnish population. The estimated proportion of variance on the liability scale explained by all SNPs was 0.08 (P-value=0.003) when MZ and DZ were considered together and smaller for MZ (0.06, P-value=0.10) compared to DZ (0.09, P-value=0.003) when analyzed separately. In conclusion, twins and singletons can be used in genetic studies together with general population samples without introducing large bias. Further research is needed to explore genetic variances associated with DZ twinning.

  3. A GENOME WIDE ASSOCIATION STUDY FOR DIABETIC NEPHROPATHY GENES IN AFRICAN AMERICANS

    PubMed Central

    McDonough, Caitrin W.; Palmer, Nicholette D.; Hicks, Pamela J.; Roh, Bong H.; An, S. Sandy; Cooke, Jessica N.; Hester, Jessica M.; Wing, Maria R.; Bostrom, Meredith A.; Rudock, Megan E.; Lewis, Joshua P.; Talbert, Matthew E.; Blevins, Rebecca A.; Lu, Lingyi; Ng, Maggie C.Y.; Sale, Michele M.; Divers, Jasmin; Langefeld, Carl D.; Freedman, Barry I.; Bowden, Donald W.

    2011-01-01

    A genome-wide association study was performed using the Affymetrix 6.0 chip to identify genes associated with diabetic nephropathy in African Americans. Association analysis was performed adjusting for admixture in 965 type 2 diabetic African American patients with end-stage renal disease (ESRD) and in 1029 African Americans without type 2 diabetes or kidney disease as controls. The top 724 single nucleotide polymorphisms (SNPs) with evidence of association to diabetic nephropathy were then genotyped in a replication sample of an additional 709 type 2 diabetes-ESRD patients and 690 controls. SNPs with evidence of association in both the original and replication studies were tested in additional African American cohorts consisting of 1246 patients with type 2 diabetes without kidney disease and 1216 with non-diabetic ESRD to differentiate candidate loci for type 2 diabetes-ESRD, type 2 diabetes, and/or all-cause ESRD. Twenty-five SNPs were significantly associated with type 2 diabetes-ESRD in the genome-wide association and initial replication. Although genome-wide significance with type 2 diabetes was not found for any of these 25 SNPs, several genes, including RPS12, LIMK2, and SFI1 are strong candidates for diabetic nephropathy. A combined analysis of all 2890 patients with ESRD showed significant association SNPs in LIMK2 and SFI1 suggesting that they also contribute to all-cause ESRD. Thus, our results suggest that multiple loci underlie susceptibility to kidney disease in African Americans with type 2 diabetes and some may also contribute to all-cause ESRD. PMID:21150874

  4. A genome-wide association study of osteochondritis dissecans in the Thoroughbred.

    PubMed

    Corbin, Laura J; Blott, Sarah C; Swinburne, June E; Sibbons, Charlene; Fox-Clipsham, Laura Y; Helwegen, Maud; Parkin, Tim D H; Newton, J Richard; Bramlage, Lawrence R; McIlwraith, C Wayne; Bishop, Stephen C; Woolliams, John A; Vaudin, Mark

    2012-04-01

    Osteochondrosis is a developmental orthopaedic disease that occurs in horses, other livestock species, companion animal species, and humans. The principal aim of this study was to identify quantitative trait loci (QTL) associated with osteochondritis dissecans (OCD) in the Thoroughbred using a genome-wide association study. A secondary objective was to test the effect of previously identified QTL in the current population. Over 300 horses, classified as cases or controls according to clinical findings, were genotyped for the Illumina Equine SNP50 BeadChip. An animal model was first implemented in order to adjust each horse's phenotypic status for average relatedness among horses and other potentially confounding factors which were present in the data. The genome-wide association test was then conducted on the residuals from the animal model. A single SNP on chromosome 3 was found to be associated with OCD at a genome-wide level of significance, as determined by permutation. According to the current sequence annotation, the SNP is located in an intergenic region of the genome. The effects of 24 SNPs, representing QTL previously identified in a sample of Hanoverian Warmblood horses, were tested directly in the animal model. When fitted alongside the significant SNP on ECA3, two of these SNPs were found to be associated with OCD. Confirmation of the putative QTL identified on ECA3 requires validation in an independent sample. The results of this study suggest that a significant challenge faced by equine researchers is the generation of sufficiently large data sets to effectively study complex diseases such as osteochondrosis.

  5. Using genome-wide complex trait analysis to quantify ‘missing heritability’ in Parkinson's disease

    PubMed Central

    Keller, Margaux F.; Saad, Mohamad; Bras, Jose; Bettella, Francesco; Nicolaou, Nayia; Simón-Sánchez, Javier; Mittag, Florian; Büchel, Finja; Sharma, Manu; Gibbs, J. Raphael; Schulte, Claudia; Moskvina, Valentina; Durr, Alexandra; Holmans, Peter; Kilarski, Laura L.; Guerreiro, Rita; Hernandez, Dena G.; Brice, Alexis; Ylikotila, Pauli; Stefánsson, Hreinn; Majamaa, Kari; Morris, Huw R.; Williams, Nigel; Gasser, Thomas; Heutink, Peter; Wood, Nicholas W.; Hardy, John; Martinez, Maria; Singleton, Andrew B.; Nalls, Michael A.

    2012-01-01

    Genome-wide association studies (GWASs) have been successful at identifying single-nucleotide polymorphisms (SNPs) highly associated with common traits; however, a great deal of the heritable variation associated with common traits remains unaccounted for within the genome. Genome-wide complex trait analysis (GCTA) is a statistical method that applies a linear mixed model to estimate phenotypic variance of complex traits explained by genome-wide SNPs, including those not associated with the trait in a GWAS. We applied GCTA to 8 cohorts containing 7096 case and 19 455 control individuals of European ancestry in order to examine the missing heritability present in Parkinson's disease (PD). We meta-analyzed our initial results to produce robust heritability estimates for PD types across cohorts. Our results identify 27% (95% CI 17–38, P = 8.08E − 08) phenotypic variance associated with all types of PD, 15% (95% CI −0.2 to 33, P = 0.09) phenotypic variance associated with early-onset PD and 31% (95% CI 17–44, P = 1.34E − 05) phenotypic variance associated with late-onset PD. This is a substantial increase from the genetic variance identified by top GWAS hits alone (between 3 and 5%) and indicates there are substantially more risk loci to be identified. Our results suggest that although GWASs are a useful tool in identifying the most common variants associated with complex disease, a great deal of common variants of small effect remain to be discovered. PMID:22892372

  6. Meta-analysis of genome-wide association studies of anxiety disorders

    PubMed Central

    Otowa, Takeshi; Hek, Karin; Lee, Minyoung; Byrne, Enda M.; Mirza, Saira S.; Nivard, Michel G.; Bigdeli, Timothy; Aggen, Steven H.; Adkins, Daniel; Wolen, Aaron; Fanous, Ayman; Keller, Matthew C.; Castelao, Enrique; Kutalik, Zoltan; Van der Auwera, Sandra; Homuth, Georg; Nauck, Matthias; Teumer, Alexander; Milaneschi, Yuri; Hottenga, Jouke-Jan; Direk, Nese; Hofman, Albert; Uitterlinden, Andre; Mulder, Cornelis L.; Henders, Anjali K.; Medland, Sarah E.; Gordon, Scott; Heath, Andrew C.; Madden, Pamela A.F.; Pergadia, Michelle; van der Most, Peter J.; Nolte, Ilja M.; van Oort, Floor V.A.; Hartman, Catharina A.; Oldehinkel, Albertine J.; Preisig, Martin; Grabe, Hans Jörgen; Middeldorp, Christel M.; Penninx, Brenda WJH; Boomsma, Dorret; Martin, Nicholas G.; Montgomery, Grant; Maher, Brion S.; van den Oord, Edwin J.; Wray, Naomi R.; Tiemeier, Henning; Hettema, John M.

    2015-01-01

    Anxiety disorders, namely generalized anxiety disorder, panic disorder, and phobias, are common, etiologically complex conditions with a partially genetic basis. Despite differing on diagnostic definitions based upon clinical presentation, anxiety disorders likely represent various expressions of an underlying common diathesis of abnormal regulation of basic threat-response systems. We conducted genome-wide association analyses in nine samples of European ancestry from seven large, independent studies. To identify genetic variants contributing to genetic susceptibility shared across interview-generated DSM-based anxiety disorders, we applied two phenotypic approaches: (1) comparisons between categorical anxiety disorder cases and super-normal controls, and (2) quantitative phenotypic factor scores derived from a multivariate analysis combining information across the clinical phenotypes. We used logistic and linear regression, respectively, to analyze the association between these phenotypes and genome-wide single nucleotide polymorphisms. Meta-analysis for each phenotype combined results across the nine samples for over 18 000 unrelated individuals. Each meta-analysis identified a different genome-wide significant region, with the following markers showing the strongest association: for case-control contrasts, rs1709393 located in an uncharacterized non-coding RNA locus on chromosomal band 3q12.3 (P=1.65×10−8); for factor scores, rs1067327 within CAMKMT encoding the calmodulin-lysine N-methyltransferase on chromosomal band 2p21 (P=2.86×10−9). Independent replication and further exploration of these findings are needed to more fully understand the role of these variants in risk and expression of anxiety disorders. PMID:26754954

  7. Genome wide chromatin occupancy of mrhl RNA and its role in gene regulation in mouse spermatogonial cells

    PubMed Central

    Akhade, Vijay Suresh; Arun, Gayatri; Donakonda, Sainitin; Satyanarayana Rao, Manchanahalli R

    2014-01-01

    Mrhl RNA is a nuclear lncRNA encoded in the mouse genome and negatively regulates Wnt signaling in spermatogonial cells through p68/Ddx5 RNA helicase. Mrhl RNA is present in the chromatin fraction of mouse spermatogonial Gc1-Spg cells and genome wide chromatin occupancy of mrhl RNA by ChOP (Chromatin oligo affinity precipitation) technique identified 1370 statistically significant genomic loci. Among these, genes at 37 genomic loci also showed altered expression pattern upon mrhl RNA down regulation which are referred to as GRPAM (Genes Regulated by Physical Association of Mrhl RNA). p68 interacted with mrhl RNA in chromatin at these GRPAM loci. p68 silencing drastically reduced mrhl RNA occupancy at 27 GRPAM loci and also perturbed the expression of GRPAM suggesting a role for p68 mediated mrhl RNA occupancy in regulating GRPAM expression. Wnt3a ligand treatment of Gc1-Spg cells down regulated mrhl RNA expression and also perturbed expression of these 27 GRPAM genes that included genes regulating Wnt signaling pathway and spermatogenesis, one of them being Sox8, a developmentally important transcription factor. We also identified interacting proteins of mrhl RNA associated chromatin fraction which included Pc4, a chromatin organizer protein and hnRNP A/B and hnRNP A2/B1 which have been shown to be associated with lincRNA-Cox2 function in gene regulation. Our findings in the Gc1-Spg cell line also correlate with the results from analysis of mouse testicular tissue which further highlights the in vivo physiological significance of mrhl RNA in the context of gene regulation during mammalian spermatogenesis. PMID:25584904

  8. Identification of Drosophila Zfh2 as a mediator of hypercapnic immune regulation by a genome-wide RNAi screen

    PubMed Central

    Kwon, Yong-Jae; Hu, Jennifer A.; Krupinski, Thomas; Casalino-Matsuda, S. Marina; Sporn, Peter H. S.; Sznajder, Jacob I.; Beitel, Greg J.

    2015-01-01

    Hypercapnia, elevated partial pressure of carbon dioxide (PCO2) in blood and tissue, develops in many patients with chronic severe obstructive pulmonary disease and other advanced lung disorders. Patients with advanced disease frequently develop bacterial lung infections, and hypercapnia is a risk factor for mortality in such individuals. We previously demonstrated that hypercapnia suppresses induction of NF-κB-regulated innate immune response genes required for host defense in human, mouse and Drosophila cells, and increases mortality from bacterial infections in both mice and Drosophila. However, the molecular mediator(s) of hypercapnic immune suppression are undefined. Here, we report a genome-wide RNAi screen in Drosophila S2* cells stimulated with bacterial peptidoglycan (PGN). The screen identified 16 genes with human orthologs whose knockdown reduced hypercapnic suppression of the gene encoding the antimicrobial peptide (AMPs) Diptericin (Dipt), but did not increase Dipt mRNA levels in air. In vivo tests of one of the strongest screen hits, Zfh2 (mammalian orthologs ZFHX3/ATBF1 and ZFHX4), demonstrate that reducing zfh2 function using a mutation or RNAi improves survival of flies exposed to elevated CO2 and infected with S. aureus. Tissue-specific knockdown of zfh2 in the fat body, the major immune and metabolic organ of the fly, mitigates hypercapnia-induced reductions in Dipt and other AMPs and improves resistance of CO2-exposed flies to infection. Zfh2 mutations also partially rescue hypercapnia-induced delays in egg hatching, suggesting that Zfh2's role in mediating responses to hypercapnia extends beyond the immune system. Together, these results identify Zfh2 as the first in vivo mediator of hypercapnic immune suppression. PMID:26643480

  9. Genome-wide scan for linkage to schizophrenia in a Spanish-origin cohort from Costa Rica.

    PubMed

    DeLisi, Lynn E; Mesen, Andrea; Rodriguez, Carlos; Bertheau, Arturo; LaPrade, Beatrice; Llach, Michelle; Riondet, Silvina; Razi, Kamran; Relja, Margaret; Byerley, William; Sherrington, Robin

    2002-07-08

    Genetic isolates have been useful cohorts in which to search for genes underlying disorders of unknown pathology. One such cohort is thought to exist in the Central Valley of Costa Rica surrounding the city of San Jose. Previous investigators identified a rare dominant gene for hereditary deafness in this population, and a suggestive linkage of severe bipolar psychosis has been reported in another study. Ninety-nine families with at least one pair of siblings affected with schizophrenia or a schizophrenia-spectrum diagnosis had clinical evaluations and DNA collected for genotyping. The Marshfield Medical Research Foundation (NHLBI) Mammalian Genotyping Service performed all genotyping using 404 short-tandem repeat polymorphic markers (STRPs) spaced on average 10 cM apart. Data were analyzed using the nonparametric program, GeneHunterPlus. The population structure was investigated using the STRUCT program. No region was found with genome-wide significance for linkage. Using a phenotype of schizophrenia plus schizoaffective disorder, the highest maximum likelihood score (MLS) observed was 1.78 (P < 0.004) at 176.6 cM from pter on chromosome 5q, an area previously implicated by some other groups. In addition, five regions on chromosomes 1p, 2p, 2q, 14p, and 8p had MLSs above 1.0. All other regions produced scores below 1.0. Population genetic analysis reveals no evidence for population substructure, for admixture with other populations, such as Amerindians, or for inbreeding in the parental generation. The latter casts some doubt on this population being an isolate, although there was evidence of inbreeding among the offspring.

  10. Principal components analysis corrects for stratification in genome-wide association studies.

    PubMed

    Price, Alkes L; Patterson, Nick J; Plenge, Robert M; Weinblatt, Michael E; Shadick, Nancy A; Reich, David

    2006-08-01

    Population stratification--allele frequency differences between cases and controls due to systematic ancestry differences-can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers.

  11. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data.

    PubMed

    Liu, Zhi-Ping

    2015-02-01

    Transcriptional regulation plays vital roles in many fundamental biological processes. Reverse engineering of genome-wide regulatory networks from high-throughput transcriptomic data provides a promising way to characterize the global scenario of regulatory relationships between regulators and their targets. In this review, we summarize and categorize the main frameworks and methods currently available for inferring transcriptional regulatory networks from microarray gene expression profiling data. We overview each of strategies and introduce representative methods respectively. Their assumptions, advantages, shortcomings, and possible improvements and extensions are also clarified and commented.

  12. MAVEN: a tool for visualization and functional analysis of genome-wide association results

    PubMed Central

    Narayanan, Kanchana; Li, Jing

    2010-01-01

    Summary: We describe the features and implementation of a web application tool named MAVEN—for Management, Analysis, Visualization and rEsults shariNg of genome-wide association data using cutting edge technologies. Main capabilities include user data uploading and management, queries using a variety of criteria, visualization of results, interactive selections and seamless integration of users' data with databases at the National Center for Biotechnology Information (NCBI) for functional annotations of single nucleotide polymorphisms (SNPs) and genes. Availability: http://cbc.case.edu/maven Contact: jingli@case.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19933166

  13. Genome-wide association studies and genetic architecture of common human diseases.

    PubMed

    Montgomery, Grant W

    2011-06-03

    Genome-wide association scans provide the first successful method to identify genetic variation contributing to risk for common complex disease. Progress in identifying genes associated with melanoma show complex relationships between genes for pigmentation and the development of melanoma. Novel risk loci account for only a small fraction of the genetic variation contributing to this and many other diseases. Large meta-analyses find additional variants, but there is current debate about the contribution of common polymorphisms, rare polymorphisms or mutations to disease risk.

  14. Pharmacogenetic meta-analysis of genome-wide association studies of LDL cholesterol response to statins.

    PubMed

    Postmus, Iris; Trompet, Stella; Deshmukh, Harshal A; Barnes, Michael R; Li, Xiaohui; Warren, Helen R; Chasman, Daniel I; Zhou, Kaixin; Arsenault, Benoit J; Donnelly, Louise A; Wiggins, Kerri L; Avery, Christy L; Griffin, Paula; Feng, QiPing; Taylor, Kent D; Li, Guo; Evans, Daniel S; Smith, Albert V; de Keyser, Catherine E; Johnson, Andrew D; de Craen, Anton J M; Stott, David J; Buckley, Brendan M; Ford, Ian; Westendorp, Rudi G J; Slagboom, P Eline; Sattar, Naveed; Munroe, Patricia B; Sever, Peter; Poulter, Neil; Stanton, Alice; Shields, Denis C; O'Brien, Eoin; Shaw-Hawkins, Sue; Chen, Y-D Ida; Nickerson, Deborah A; Smith, Joshua D; Dubé, Marie Pierre; Boekholdt, S Matthijs; Hovingh, G Kees; Kastelein, John J P; McKeigue, Paul M; Betteridge, John; Neil, Andrew; Durrington, Paul N; Doney, Alex; Carr, Fiona; Morris, Andrew; McCarthy, Mark I; Groop, Leif; Ahlqvist, Emma; Bis, Joshua C; Rice, Kenneth; Smith, Nicholas L; Lumley, Thomas; Whitsel, Eric A; Stürmer, Til; Boerwinkle, Eric; Ngwa, Julius S; O'Donnell, Christopher J; Vasan, Ramachandran S; Wei, Wei-Qi; Wilke, Russell A; Liu, Ching-Ti; Sun, Fangui; Guo, Xiuqing; Heckbert, Susan R; Post, Wendy; Sotoodehnia, Nona; Arnold, Alice M; Stafford, Jeanette M; Ding, Jingzhong; Herrington, David M; Kritchevsky, Stephen B; Eiriksdottir, Gudny; Launer, Leonore J; Harris, Tamara B; Chu, Audrey Y; Giulianini, Franco; MacFadyen, Jean G; Barratt, Bryan J; Nyberg, Fredrik; Stricker, Bruno H; Uitterlinden, André G; Hofman, Albert; Rivadeneira, Fernando; Emilsson, Valur; Franco, Oscar H; Ridker, Paul M; Gudnason, Vilmundur; Liu, Yongmei; Denny, Joshua C; Ballantyne, Christie M; Rotter, Jerome I; Adrienne Cupples, L; Psaty, Bruce M; Palmer, Colin N A; Tardif, Jean-Claude; Colhoun, Helen M; Hitman, Graham; Krauss, Ronald M; Wouter Jukema, J; Caulfield, Mark J

    2014-10-28

    Statins effectively lower LDL cholesterol levels in large studies and the observed interindividual response variability may be partially explained by genetic variation. Here we perform a pharmacogenetic meta-analysis of genome-wide association studies (GWAS) in studies addressing the LDL cholesterol response to statins, including up to 18,596 statin-treated subjects. We validate the most promising signals in a further 22,318 statin recipients and identify two loci, SORT1/CELSR2/PSRC1 and SLCO1B1, not previously identified in GWAS. Moreover, we confirm the previously described associations with APOE and LPA. Our findings advance the understanding of the pharmacogenetic architecture of statin response.

  15. Genome Wide Examination of Allelic Loss in Lobular and Ductal Breast Cancer

    DTIC Science & Technology

    2004-07-01

    assay with data from array comparative genomic hybridization (CGH) on the same tumors. We find almost complete concordance with LOH as defined by the...Facility for assis- HuSNP assay on PEP material may be an acceptable tance in the hybridization and analysis of the HuSNP arrays, approach to genome-wide...Levine D, it is still a low-density map, with an average of one SNP Rabinovitch P, Reid B: 17p (p53) allelic losses, 4N (G2/ tetraploid ) site per 8.5 Mb in

  16. Genome-wide meta-analyses identify three loci associated with primary biliary cirrhosis.

    PubMed

    Liu, Xiangdong; Invernizzi, Pietro; Lu, Yue; Kosoy, Roman; Lu, Yan; Bianchi, Ilaria; Podda, Mauro; Xu, Chun; Xie, Gang; Macciardi, Fabio; Selmi, Carlo; Lupoli, Sara; Shigeta, Russell; Ransom, Michael; Lleo, Ana; Lee, Annette T; Mason, Andrew L; Myers, Robert P; Peltekian, Kevork M; Ghent, Cameron N; Bernuzzi, Francesca; Zuin, Massimo; Rosina, Floriano; Borghesio, Elisabetta; Floreani, Annarosa; Lazzari, Roberta; Niro, Grazia; Andriulli, Angelo; Muratori, Luigi; Muratori, Paolo; Almasio, Piero L; Andreone, Pietro; Margotti, Marzia; Brunetto, Maurizia; Coco, Barbara; Alvaro, Domenico; Bragazzi, Maria C; Marra, Fabio; Pisano, Alessandro; Rigamonti, Cristina; Colombo, Massimo; Marzioni, Marco; Benedetti, Antonio; Fabris, Luca; Strazzabosco, Mario; Portincasa, Piero; Palmieri, Vincenzo O; Tiribelli, Claudio; Croce, Lory; Bruno, Savino; Rossi, Sonia; Vinci, Maria; Prisco, Cleofe; Mattalia, Alberto; Toniutto, Pierluigi; Picciotto, Antonio; Galli, Andrea; Ferrari, Carlo; Colombo, Silvia; Casella, Giovanni; Morini, Lorenzo; Caporaso, Nicola; Colli, Agostino; Spinzi, Giancarlo; Montanari, Renzo; Gregersen, Peter K; Heathcote, E Jenny; Hirschfield, Gideon M; Siminovitch, Katherine A; Amos, Christopher I; Gershwin, M Eric; Seldin, Michael F

    2010-08-01

    A genome-wide association screen for primary biliary cirrhosis risk alleles was performed in an Italian cohort. The results from the Italian cohort replicated IL12A and IL12RB associations, and a combined meta-analysis using a Canadian dataset identified newly associated loci at SPIB (P = 7.9 x 10(-11), odds ratio (OR) = 1.46), IRF5-TNPO3 (P = 2.8 x 10(-10), OR = 1.63) and 17q12-21 (P = 1.7 x 10(-10), OR = 1.38).

  17. Attenuation of genome-wide 5-methylcytosine level is an epigenetic feature of cutaneous malignant melanomas.

    PubMed

    Micevic, Goran; Theodosakis, Nicholas; Taube, Janis M; Bosenberg, Marcus W; Rodić, Nemanja

    2017-04-01

    Epigenetic modification of DNA, namely covalent changes of cytosine residues, plays a key role in the maintenance of inactive chromatin regions, both in health and in disease. In the vast majority of malignant melanomas, the most notable known epigenetic abnormality is the attenuation of 5-hydroxymethylcytosine (5-hmC) residues. However, it remains unknown whether a decrease in 5-hmC represents a primary defect of melanoma cancer epigenome or whether it is secondary to the loss of 5-methylcytosine (5-mC), a chemical substrate for 5-hmC. Here, we evaluated 5-mC levels in a spectrum of melanocytic proliferations. To study the epigenetic features of melanocytic nuclei, we began by measuring 5-mC levels in histologic specimens semiquantitatively by immunohistochemistry. We next treated established melanoma cell lines with S-adenosyl methionine (SAM), a universal methyl group donor, in an effort to cause changes in 5-mC levels. We detected a marked reduction in 5-mC levels in both primary and metastatic melanomas compared with 5-mC levels in benign melanocytic nevi. We also empirically induced changes in 5-mC in melanoma cell lines by incubation with SAM. To our surprise, we observed a significant cytoreductive effect of SAM on all melanoma cell lines examined. At subcytotoxic levels, SAM treatment is accompanied by a genome-wide increase in 5-mC. Moreover, we recorded a dose-dependent increase in genome-wide 5-mC levels in melanoma cell lines following SAM treatment. Taken together, we report that genome-wide attenuation of 5-mC is a hallmark of malignant melanomas. We propose that genome-wide attenuation of 5-mC is not merely an epiphenomenon as it is required for melanoma cell growth, albeit by an as of yet undetermined mechanism. Given its potential benefit in slowing down the growth of melanoma cells, SAM should be studied further to determine its role in epigenome modulation.

  18. Inferring Where and When Replication Initiates from Genome-Wide Replication Timing Data

    NASA Astrophysics Data System (ADS)

    Baker, A.; Audit, B.; Yang, S. C.-H.; Bechhoefer, J.; Arneodo, A.

    2012-06-01

    Based on an analogy between DNA replication and one dimensional nucleation-and-growth processes, various attempts to infer the local initiation rate I(x,t) of DNA replication origins from replication timing data have been developed in the framework of phase transition kinetics theories. These works have all used curve-fit strategies to estimate I(x,t) from genome-wide replication timing data. Here, we show how to invert analytically the Kolmogorov-Johnson-Mehl-Avrami model and extract I(x,t) directly. Tests on both simulated and experimental budding-yeast data confirm the location and firing-time distribution of replication origins.

  19. Genomic Scans across Three Eucalypts Suggest that Adaptation to Aridity is a Genome-Wide Phenomenon

    PubMed Central

    Potts, Brad M.; McLean, Elizabeth H.; Collins, Lesley; Holland, Barbara R.; Prober, Suzanne M.; Stock, William D.; Vaillancourt, René E.; Byrne, Margaret

    2017-01-01

    Widespread species spanning strong environmental (e.g., climatic) gradients frequently display morphological and physiological adaptations to local conditions. Some adaptations are common to different species that occupy similar environments. However, the genomic architecture underlying such convergent traits may not be the same between species. Using genomic data from previous studies of three widespread eucalypt species that grow along rainfall gradients in southern Australia, our probabilistic approach provides evidence that adaptation to aridity is a genome-wide phenomenon, likely to involve multiple and diverse genes, gene families and regulatory regions that affect a multitude of complex genetic and biochemical processes. PMID:28391293

  20. Genetic Control of Canine Leishmaniasis: Genome-Wide Association Study and Genomic Selection Analysis

    PubMed Central

    Quilez, Javier; Martínez, Verónica; Woolliams, John A.; Sanchez, Armand; Pong-Wong, Ricardo; Kennedy, Lorna J.; Quinnell, Rupert J.; Ollier, William E. R.; Roura, Xavier; Ferrer, Lluís; Altet, Laura; Francino, Olga

    2012-01-01

    Background The current disease model for leishmaniasis suggests that only a proportion of infected individuals develop clinical disease, while others are asymptomatically infected due to immune control of infection. The factors that determine whether individuals progress to clinical disease following Leishmania infection are unclear, although previous studies suggest a role for host genetics. Our hypothesis was that canine leishmaniasis is a complex disease with multiple loci responsible for the progression of the disease from Leishmania infection. Methodology/Principal Findings Genome-wide association and genomic selection approaches were applied to a population-based case-control dataset of 219 dogs from a single breed (Boxer) genotyped for ∼170,000 SNPs. Firstly, we aimed to identify individual disease loci; secondly, we quantified the genetic component of the observed phenotypic variance; and thirdly, we tested whether genome-wide SNP data could accurately predict the disease. Conclusions/Significance We estimated that a substantial proportion of the genome is affecting the trait and that its heritability could be as high as 60%. Using the genome-wide association approach, the strongest associations were on chromosomes 1, 4 and 20, although none of these were statistically significant at a genome-wide level and after correcting for genetic stratification and lifestyle. Amongst these associations, chromosome 4: 61.2–76.9 Mb maps to a locus that has previously been associated with host susceptibility to human and murine leishmaniasis, and genomic selection estimated markers in this region to have the greatest effect on the phenotype. We therefore propose these regions as candidates for replication studies. An important finding of this study was the significant predictive value from using the genomic information. We found that the phenotype could be predicted with an accuracy of ∼0.29 in new samples and that the affection status was correctly predicted in 60

  1. Genome-wide re-sequencing of multidrug-resistant Mycobacterium leprae Airaku-3.

    PubMed

    Singh, P; Benjak, A; Carat, S; Kai, M; Busso, P; Avanzi, C; Paniz-Mondolfi, A; Peter, C; Harshman, K; Rougemont, J; Matsuoka, M; Cole, S T

    2014-10-01

    Genotyping and molecular characterization of drug resistance mechanisms in Mycobacterium leprae enables disease transmission and drug resistance trends to be monitored. In the present study, we performed genome-wide analysis of Airaku-3, a multidrug-resistant strain with an unknown mechanism of resistance to rifampicin. We identified 12 unique non-synonymous single-nucleotide polymorphisms (SNPs) including two in the transporter-encoding ctpC and ctpI genes. In addition, two SNPs were found that improve the resolution of SNP-based genotyping, particularly for Venezuelan and South East Asian strains of M. leprae.

  2. Genome-wide meta-analyses identify three loci associated with primary biliary cirrhosis

    PubMed Central

    Liu, Xiangdong; Invernizzi, Pietro; Lu, Yue; Kosoy, Roman; Lu, Yan; Bianchi, Ilaria; Podda, Mauro; Xu, Chun; Xie, Gang; Macciardi, Fabio; Selmi, Carlo; Lupoli, Sara; Shigeta, Russell; Ransom, Michael; Lleo, Ana; Lee, Annette T; Mason, Andrew L; Myers, Robert P; Peltekian, Kevork M; Ghent, Cameron N; Bernuzzi, Francesca; Zuin, Massimo; Rosina, Floriano; Borghesio, Elisabetta; Floreani, Annarosa; Lazzari, Roberta; Niro, Grazia; Andriulli, Angelo; Muratori, Luigi; Muratori, Paolo; Almasio, Piero L; Andreone, Pietro; Margotti, Marzia; Brunetto, Maurizia; Coco, Barbara; Alvaro, Domenico; Bragazzi, Maria C; Marra, Fabio; Pisano, Alessandro; Rigamonti, Cristina; Colombo, Massimo; Marzioni, Marco; Benedetti, Antonio; Fabris, Luca; Strazzabosco, Mario; Portincasa, Piero; Palmieri, Vincenzo O; Tiribelli, Claudio; Croce, Lory; Bruno, Savino; Rossi, Sonia; Vinci, Maria; Prisco, Cleofe; Mattalia, Alberto; Toniutto, Pierluigi; Picciotto, Antonio; Galli, Andrea; Ferrari, Carlo; Colombo, Silvia; Casella, Giovanni; Morini, Lorenzo; Caporaso, Nicola; Colli, Agostino; Spinzi, Giancarlo; Montanari, Renzo; Gregersen, Peter K; Heathcote, E Jenny; Hirschfield, Gideon M; Siminovitch, Katherine A; Amos, Christopher I; Gershwin, M Eric; Seldin, Michael F

    2011-01-01

    A genome-wide association screen for primary biliary cirrhosis risk alleles was performed in an Italian cohort. The results from the Italian cohort replicated IL12A and IL12RB associations, and a combined meta-analysis using a Canadian dataset identified newly associated loci at SPIB (P = 7.9 × 10–11, odds ratio (OR) = 1.46), IRF5-TNPO3 (P = 2.8 × 10–10, OR = 1.63) and 17q12-21 (P = 1.7 × 10–10, OR = 1.38). PMID:20639880

  3. Genome-wide linkage analysis of blood pressure under locus heterogeneity

    PubMed Central

    Yang, Xinqun; Wang, Kai; Huang, Jian; Vieland, Veronica J

    2003-01-01

    We describe a method for mapping quantitative trait loci that allows for locus heterogeneity. A genome-wide linkage analysis of blood pressure was performed using sib-pair data from the Framingham Heart Study. Evidence of linkage was found on four markers (GATA89G08, GATA23D06, GATA14E09, and 049xd2) at a significance level of 0.01. Two of them (GATA14E09 and 049xd2) seem to overlap with linkage signals reported previously, while the other two are not linked to any known signals. PMID:14975146

  4. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data

    PubMed Central

    Liu, Zhi-Ping

    2015-01-01

    Transcriptional regulation plays vital roles in many fundamental biological processes. Reverse engineering of genome-wide regulatory networks from high-throughput transcriptomic data provides a promising way to characterize the global scenario of regulatory relationships between regulators and their targets. In this review, we summarize and categorize the main frameworks and methods currently available for inferring transcriptional regulatory networks from microarray gene expression profiling data. We overview each of strategies and introduce representative methods respectively. Their assumptions, advantages, shortcomings, and possible improvements and extensions are also clarified and commented. PMID:25937810

  5. Genome-Wide Association Analysis of Blood Biomarkers in Chronic Obstructive Pulmonary Disease

    PubMed Central

    Kim, Deog Kyeom; Cho, Michael H.; Hersh, Craig P.; Lomas, David A.; Miller, Bruce E.; Kong, Xiangyang; Bakke, Per; Gulsvik, Amund; Agustí, Alvar; Wouters, Emiel; Celli, Bartolome; Coxson, Harvey; Vestbo, Jørgen; MacNee, William; Yates, Julie C.; Rennard, Stephen; Litonjua, Augusto; Qiu, Weiliang; Beaty, Terri H.; Crapo, James D.; Riley, John H.; Tal-Singer, Ruth

    2012-01-01

    Rationale: A genome-wide association study (GWAS) for circulating chronic obstructive pulmonary disease (COPD) biomarkers could identify genetic determinants of biomarker levels and COPD susceptibility. Objectives: To identify genetic variants of circulating protein biomarkers and novel genetic determinants of COPD. Methods: GWAS was performed for two pneumoproteins, Clara cell secretory protein (CC16) and surfactant protein D (SP-D), and five systemic inflammatory markers (C-reactive protein, fibrinogen, IL-6, IL-8, and tumor necrosis factor-α) in 1,951 subjects with COPD. For genome-wide significant single nucleotide polymorphisms (SNPs) (P < 1 × 10−8), association with COPD susceptibility was tested in 2,939 cases with COPD and 1,380 smoking control subjects. The association of candidate SNPs with mRNA expression in induced sputum was also elucidated. Measurements and Main Results: Genome-wide significant susceptibility loci affecting biomarker levels were found only for the two pneumoproteins. Two discrete loci affecting CC16, one region near the CC16 coding gene (SCGB1A1) on chromosome 11 and another locus approximately 25 Mb away from SCGB1A1, were identified, whereas multiple SNPs on chromosomes 6 and 16, in addition to SNPs near SFTPD, had genome-wide significant associations with SP-D levels. Several SNPs affecting circulating CC16 levels were significantly associated with sputum mRNA expression of SCGB1A1 (P = 0.009–0.03). Several SNPs highly associated with CC16 or SP-D levels were nominally associated with COPD in a collaborative GWAS (P = 0.001–0.049), although these COPD associations were not replicated in two additional cohorts. Conclusions: Distant genetic loci and biomarker-coding genes affect circulating levels of COPD-related pneumoproteins. A subset of these protein quantitative trait loci may influence their gene expression in the lung and/or COPD susceptibility. Clinical trial registered with www.clinicaltrials.gov (NCT 00292552). PMID

  6. Cellular RNA helicases and HIV-1: insights from genome-wide, proteomic, and molecular studies.

    PubMed

    Chen, Chia-Yen; Liu, Xiang; Boris-Lawrie, Kathleen; Sharma, Amit; Jeang, Kuan-Teh

    2013-02-01

    RNA helicases are ubiquitous in plants and animals and function in many cellular processes. Retroviruses, such as human immunodeficiency virus (HIV-1), encode no RNA helicases in their genomes and utilize host cellular RNA helicases at various stages of their life cycle. Here, we briefly summarize the roles RNA helicases play in HIV-1 replication that have been identified recently, in part, through genome-wide screenings, proteomics, and molecular studies. Some of these helicases augment virus propagation while others apparently participate in antiviral defenses against viral replication.

  7. [Genome-wide association study(GWAS) and genetic risk of prostate cancer].

    PubMed

    Nakagawa, Hidewaki; Akamatsu, Shusuke; Takata, Ryo

    2016-01-01

    It is evident that genetic factors play critical roles in prostate cancer development. GWAS (genome-wide association studies) in multiple ethnic groups have been identifying more than 100 loci or genes which was significantly associated with prostate cancer susceptibility. They include several loci at 8q24, prostate-specific gene, inflammation gene, and metabolism-related genes. Risk prediction for prostate cancer by combining multiple SNPs is still primitive and not sufficiently accurate for clinical use, but this model could have a potential to affect clinical decision when it is applied to patients with gray-zone PSA or very high risk of prostate cancer.

  8. Insights into RNA structure and function from genome-wide studies.

    PubMed

    Mortimer, Stefanie A; Kidwell, Mary Anne; Doudna, Jennifer A

    2014-07-01

    A comprehensive understanding of RNA structure will provide fundamental insights into the cellular function of both coding and non-coding RNAs. Although many RNA structures have been analysed by traditional biophysical and biochemical methods, the low-throughput nature of these approaches has prevented investigation of the vast majority of cellular transcripts. Triggered by advances in sequencing technology, genome-wide approaches for probing the transcriptome are beginning to reveal how RNA structure affects each step of protein expression and RNA stability. In this Review, we discuss the emerging relationships between RNA structure and the regulation of gene expression.

  9. Genome-wide genetic screening with chemically mutagenized haploid embryonic stem cells.

    PubMed

    Forment, Josep V; Herzog, Mareike; Coates, Julia; Konopka, Tomasz; Gapp, Bianca V; Nijman, Sebastian M; Adams, David J; Keane, Thomas M; Jackson, Stephen P

    2017-01-01

    In model organisms, classical genetic screening via random mutagenesis provides key insights into the molecular bases of genetic interactions, helping to define synthetic lethality, synthetic viability and drug-resistance mechanisms. The limited genetic tractability of diploid mammalian cells, however, precludes this approach. Here, we demonstrate the feasibility of classical genetic screening in mammalian systems by using haploid cells, chemical mutagenesis and next-generation sequencing, providing a new tool to explore mammalian genetic interactions.

  10. Genome-wide Association Studies from the Cancer Genetic Markers of Susceptibility (CGEMS) Initiative | Office of Cancer Genomics

    Cancer.gov

    CGEMS identifies common inherited genetic variations associated with a number of cancers, including breast and prostate. Data from these genome-wide association studies (GWAS) are available through the Division of Cancer Epidemiology & Genetics website.

  11. Genome-Wide Association for Nicotine Dependence and Smoking Cessation Success in NIH Research Volunteers

    PubMed Central

    Drgon, Tomas; Montoya, Ivan; Johnson, Catherine; Liu, Qing-Rong; Walther, Donna; Hamer, Dean; Uhl, George R

    2009-01-01

    Phenotypes related to both nicotine dependence and ability to successfully quit smoking display substantial heritabilities in classical and molecular genetic studies. Twin studies suggest that some genetic components for dependence overlap with genetic components of ability to quit, but that many components do not overlap. Initial genome-wide association (GWA) studies have demonstrated haplotypes that distinguish nicotine-dependent from nondependent smokers. These haplotypes overlap partially with those that distinguish individuals who successfully quit smoking from those who were not able to quit smoking in clinical trials for smoking cessation. We now report novel genome-wide association results from National Institutes of Health research volunteers who reported smoking histories, symptoms of nicotine dependence, and ability to successfully quit smoking outside the context of a clinical trial. These results buttress data from several prior GWA studies. The data from these volunteers support the idea that previously reported studies of genes associated with smoking cessation success in clinical trial participants may also apply to smokers who are more or less able to initiate and sustain abstinence outside of clinical trial settings. PMID:19009022

  12. Quantifying the heritability of glioma using genome-wide complex trait analysis

    PubMed Central

    Kinnersley, Ben; Mitchell, Jonathan S.; Gousias, Konstantinos; Schramm, Johannes; Idbaih, Ahmed; Labussière, Marianne; Marie, Yannick; Rahimian, Amithys; Wichmann, H.-Erich; Schreiber, Stefan; Hoang-Xuan, Khe; Delattre, Jean-Yves; Nöthen, Markus M.; Mokhtari, Karima; Lathrop, Mark; Bondy, Melissa; Simon, Matthias; Sanson, Marc; Houlston, Richard S.

    2015-01-01

    Genome-wide association studies (GWAS) have successfully identified a number of common single-nucleotide polymorphisms (SNPs) influencing glioma risk. While these SNPs only explain a small proportion of the genetic risk it is unclear how much is left to be detected by other, yet to be identified, common SNPs. Therefore, we applied Genome-Wide Complex Trait Analysis (GCTA) to three GWAS datasets totalling 3,373 cases and 4,571 controls and performed a meta-analysis to estimate the heritability of glioma. Our results identify heritability estimates of 25% (95% CI: 20–31%, P = 1.15 × 10−17) for all forms of glioma - 26% (95% CI: 17–35%, P = 1.05 × 10−8) for glioblastoma multiforme (GBM) and 25% (95% CI: 17–32%, P = 1.26 × 10−10) for non-GBM tumors. This is a substantial increase from the genetic variance identified by the currently identified GWAS risk loci (~6% of common heritability), indicating that most of the heritable risk attributable to common genetic variants remains to be identified. PMID:26625949

  13. Genome-wide genetic diversity, population structure and admixture analysis in African and Asian cattle breeds.

    PubMed

    Edea, Z; Bhuiyan, M S A; Dessie, T; Rothschild, M F; Dadi, H; Kim, K S

    2015-02-01

    Knowledge about genetic diversity and population structure is useful for designing effective strategies to improve the production, management and conservation of farm animal genetic resources. Here, we present a comprehensive genome-wide analysis of genetic diversity, population structure and admixture based on 244 animals sampled from 10 cattle populations in Asia and Africa and genotyped for 69,903 autosomal single-nucleotide polymorphisms (SNPs) mainly derived from the indicine breed. Principal component analysis, STRUCTURE and distance analysis from high-density SNP data clearly revealed that the largest genetic difference occurred between the two domestic lineages (taurine and indicine), whereas Ethiopian cattle populations represent a mosaic of the humped zebu and taurine. Estimation of the genetic influence of zebu and taurine revealed that Ethiopian cattle were characterized by considerable levels of introgression from South Asian zebu, whereas Bangladeshi populations shared very low taurine ancestry. The relationships among Ethiopian cattle populations reflect their history of origin and admixture rather than phenotype-based distinctions. The high within-individual genetic variability observed in Ethiopian cattle represents an untapped opportunity for adaptation to changing environments and for implementation of within-breed genetic improvement schemes. Our results provide a basis for future applications of genome-wide SNP data to exploit the unique genetic makeup of indigenous cattle breeds and to facilitate their improvement and conservation.

  14. Active chromatin domains are defined by acetylation islands revealed by genome-wide mapping.

    PubMed

    Roh, Tae-Young; Cuddapah, Suresh; Zhao, Keji

    2005-03-01

    The identity and developmental potential of a human cell is specified by its epigenome that is largely defined by patterns of chromatin modifications including histone acetylation. Here we report high-resolution genome-wide mapping of diacetylation of histone H3 at Lys 9 and Lys 14 in resting and activated human T cells by genome-wide mapping technique (GMAT). Our data show that high levels of the H3 acetylation are detected in gene-rich regions. The chromatin accessibility and gene expression of a genetic domain is correlated with hyperacetylation of promoters and other regulatory elements but not with generally elevated acetylation of the entire domain. Islands of acetylation are identified in the intergenic and transcribed regions. The locations of the 46,813 acetylation islands identified in this study are significantly correlated with conserved noncoding sequences (CNSs) and many of them are colocalized with known regulatory elements in T cells. TCR signaling induces 4045 new acetylation loci that may mediate the global chromatin remodeling and gene activation. We propose that the acetylation islands are epigenetic marks that allow prediction of functional regulatory elements.

  15. Active chromatin domains are defined by acetylation islands revealed by genome-wide mapping

    PubMed Central

    Roh, Tae-Young; Cuddapah, Suresh; Zhao, Keji

    2005-01-01

    The identity and developmental potential of a human cell is specified by its epigenome that is largely defined by patterns of chromatin modifications including histone acetylation. Here we report high-resolution genome-wide mapping of diacetylation of histone H3 at Lys 9 and Lys 14 in resting and activated human T cells by genome-wide mapping technique (GMAT). Our data show that high levels of the H3 acetylation are detected in gene-rich regions. The chromatin accessibility and gene expression of a genetic domain is correlated with hyperacetylation of promoters and other regulatory elements but not with generally elevated acetylation of the entire domain. Islands of acetylation are identified in the intergenic and transcribed regions. The locations of the 46,813 acetylation islands identified in this study are significantly correlated with conserved noncoding sequences (CNSs) and many of them are colocalized with known regulatory elements in T cells. TCR signaling induces 4045 new acetylation loci that may mediate the global chromatin remodeling and gene activation. We propose that the acetylation islands are epigenetic marks that allow prediction of functional regulatory elements. PMID:15706033

  16. Genome-wide association studies for multiple diseases of the German Shepherd Dog

    PubMed Central

    Tsai, Kate L.; Noorai, Rooksana E.; Starr-Moss, Alison N.; Quignon, Pascale; Rinz, Caitlin J.; Ostrander, Elaine A.; Steiner, Jörg M.; Murphy, Keith E.

    2012-01-01

    The German Shepherd Dog (GSD) is a popular working and companion breed for which over 50 hereditary diseases have been documented. Herein, SNP profiles for 197 GSDs were generated using the Affymetrix v2 canine SNP array for a genome-wide association study to identify loci associated with four diseases: pituitary dwarfism, degenerative myelopathy (DM), congenital megaesophagus (ME), and pancreatic acinar atrophy (PAA). A locus on Chr 9 is strongly associated with pituitary dwarfism and is proximal to a plausible candidate gene, LHX3. Results for DM confirm a major locus encompassing SOD1, in which an associated point mutation was previously identified, but do not suggest modifier loci. Several SNPs on Chr 12 are associated with ME and a 4.7 Mb haplotype block is present in affected dogs. Analysis of additional ME cases for a SNP within the haplotype provides further support for this association. Results for PAA indicate more complex genetic underpinnings. Several regions on multiple chromosomes reach genome-wide significance. However, no major locus is apparent and only two associated haplotype blocks, on Chrs 7 and 12 are observed. These data suggest that PAA may be governed by multiple loci with small effects, or it may be a heterogeneous disorder. PMID:22105877

  17. Quantification and genome-wide mapping of DNA double-strand breaks.

    PubMed

    Grégoire, Marie-Chantal; Massonneau, Julien; Leduc, Frédéric; Arguin, Mélina; Brazeau, Marc-André; Boissonneault, Guylain

    2016-12-01

    DNA double-strand breaks (DSBs) represent a major threat to the genetic integrity of the cell. Knowing both their genome-wide distribution and number is important for a better assessment of genotoxicity at a molecular level. Available methods may have underestimated the extent of DSBs as they are based on markers specific to those undergoing active repair or may not be adapted for the large diversity of naturally occurring DNA ends. We have established conditions for an efficient first step of DNA nick and gap repair (NGR) allowing specific determination of DSBs by end labeling with terminal transferase. We used DNA extracted from HeLa cells harboring an I-SceI cassette to induce a targeted nick or DSB and demonstrated by immunocapture of 3'-OH that a prior step of NGR allows specific determination of loci-specific or genome wide DSBs. This method can be applied to the global determination of DSBs using radioactive end labeling and can find several applications aimed at understanding the distribution and kinetics of DSBs formation and repair.

  18. Genome-wide association analyses in East Asians identify new susceptibility loci for colorectal cancer

    PubMed Central

    Jia, Wei-Hua; Zhang, Ben; Matsuo, Keitaro; Shin, Aesun; Xiang, Yong-Bing; Jee, Sun Ha; Kim, Dong-Hyun; Ren, Zefang; Cai, Qiuyin; Long, Jirong; Shi, Jiajun; Wen, Wanqing; Yang, Gong; Delahanty, Ryan J.; Ji, Bu-Tian; Pan, Zhi-Zhong; Matsuda, Fumihiko; Gao, Yu-Tang; Oh, Jae Hwan; Ahn, Yoon-Ok; Park, Eun Jung; Li, Hong-Lan; Park, Ji Won; Jo, Jaeseong; Jeong, Jin-Young; Hosono, Satoyo; Casey, Graham; Peters, Ulrike; Shu, Xiao-Ou; Zeng, Yi-Xin; Zheng, Wei

    2013-01-01

    To identify novel genetic factors for colorectal cancer (CRC), we conducted a genome-wide association study in East Asians. By analyzing genome-wide data in 2,098 cases and 5,749 controls, we selected 64 promising SNPs for replication in an independent set of samples including up to 5,358 cases and 5,922 controls. We identified four SNPs with a P-value of 8.58 × 10−7 to 3.77 × 10−10 in the combined analysis of all East Asian samples. Three of the four SNPs were replicated in a study conducted among 26,060 European descendants with a combined P-value of 1.22 × 10−10 for rs647161 (5q31.1), 6.64 × 10−9 for rs2423279 (20p12.3), and 3.06 × 10−8 for rs10774214 (12p13.32 near the CCND2 gene), respectively, derived from the meta-analysis of data from both East Asian and European populations. This study identified three new CRC susceptibility loci and provides additional insight into the genetics and biology of CRC. PMID:23263487

  19. Genome-wide identification and analysis of the MADS-box gene family in apple.

    PubMed

    Tian, Yi; Dong, Qinglong; Ji, Zhirui; Chi, Fumei; Cong, Peihua; Zhou, Zongshan

    2015-01-25

    The MADS-box gene family is one of the most widely studied families in plants and has diverse developmental roles in flower pattern formation, gametophyte cell division and fruit differentiation. Although the genome-wide analysis of this family has been performed in some species, little is known regarding MADS-box genes in apple (Malus domestica). In this study, 146 MADS-box genes were identified in the apple genome and were phylogenetically clustered into six subgroups (MIKC(c), MIKC*, Mα, Mβ, Mγ and Mδ) with the MADS-box genes from Arabidopsis and rice. The predicted apple MADS-box genes were distributed across all 17 chromosomes at different densities. Additionally, the MADS-box domain, exon length, gene structure and motif compositions of the apple MADS-box genes were analysed. Moreover, the expression of all of the apple MADS-box genes was analysed in the root, stem, leaf, flower tissues and five stages of fruit development. All of the apple MADS-box genes, with the exception of some genes in each group, were expressed in at least one of the tissues tested, which indicates that the MADS-box genes are involved in various aspects of the physiological and developmental processes of the apple. To the best of our knowledge, this report describes the first genome-wide analysis of the apple MADS-box gene family, and the results should provide valuable information for understanding the classification, cloning and putative functions of this family.

  20. Discovery and validation of sub-threshold genome-wide association study loci using epigenomic signatures

    PubMed Central

    Wang, Xinchen; Tucker, Nathan R; Rizki, Gizem; Mills, Robert; Krijger, Peter HL; de Wit, Elzo; Subramanian, Vidya; Bartell, Eric; Nguyen, Xinh-Xinh; Ye, Jiangchuan; Leyton-Mange, Jordan; Dolmatova, Elena V; van der Harst, Pim; de Laat, Wouter; Ellinor, Patrick T; Newton-Cheh, Christopher; Milan, David J; Kellis, Manolis; Boyer, Laurie A

    2016-01-01

    Genetic variants identified by genome-wide association studies explain only a modest proportion of heritability, suggesting that meaningful associations lie 'hidden' below current thresholds. Here, we integrate information from association studies with epigenomic maps to demonstrate that enhancers significantly overlap known loci associated with the cardiac QT interval and QRS duration. We apply functional criteria to identify loci associated with QT interval that do not meet genome-wide significance and are missed by existing studies. We demonstrate that these 'sub-threshold' signals represent novel loci, and that epigenomic maps are effective at discriminating true biological signals from noise. We experimentally validate the molecular, gene-regulatory, cellular and organismal phenotypes of these sub-threshold loci, demonstrating that most sub-threshold loci have regulatory consequences and that genetic perturbation of nearby genes causes cardiac phenotypes in mouse. Our work provides a general approach for improving the detection of novel loci associated with complex human traits. DOI: http://dx.doi.org/10.7554/eLife.10557.001 PMID:27162171

  1. Genome Wide Analysis of Fatty Acid Desaturation and Its Response to Temperature1[OPEN

    PubMed Central

    Menard, Guillaume N.; Moreno, Jose Martin; Bryant, Fiona M.; Munoz-Azcarate, Olaya; Hassani-Pak, Keywan; Kurup, Smita

    2017-01-01

    Plants modify the polyunsaturated fatty acid content of their membrane and storage lipids in order to adapt to changes in temperature. In developing seeds, this response is largely controlled by the activities of the microsomal ω-6 and ω-3 fatty acid desaturases, FAD2 and FAD3. Although temperature regulation of desaturation has been studied at the molecular and biochemical levels, the genetic control of this trait is poorly understood. Here, we have characterized the response of Arabidopsis (Arabidopsis thaliana) seed lipids to variation in ambient temperature and found that heat inhibits both ω-6 and ω-3 desaturation in phosphatidylcholine, leading to a proportional change in triacylglycerol composition. Analysis of the 19 parental accessions of the multiparent advanced generation intercross (MAGIC) population showed that significant natural variation exists in the temperature responsiveness of ω-6 desaturation. A combination of quantitative trait locus (QTL) analysis and genome-wide association studies (GWAS) using the MAGIC population suggests that ω-6 desaturation is largely controlled by cis-acting sequence variants in the FAD2 5′ untranslated region intron that determine the expression level of the gene. However, the temperature responsiveness of ω-6 desaturation is controlled by a separate QTL on chromosome 2. The identity of this locus is unknown, but genome-wide association studies identified potentially causal sequence variants within ∼40 genes in an ∼450-kb region of the QTL. PMID:28108698

  2. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development.

    PubMed

    Jiang, Zhihua; Wang, Hongyang; Michal, Jennifer J; Zhou, Xiang; Liu, Bang; Woods, Leah C Solberg; Fuchs, Rita A

    2016-01-01

    Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals.

  3. Potential assessment of genome-wide association study and genomic selection in Japanese pear Pyrus pyrifolia.

    PubMed

    Iwata, Hiroyoshi; Hayashi, Takeshi; Terakami, Shingo; Takada, Norio; Sawamura, Yutaka; Yamamoto, Toshiya

    2013-03-01

    Although the potential of marker-assisted selection (MAS) in fruit tree breeding has been reported, bi-parental QTL mapping before MAS has hindered the introduction of MAS to fruit tree breeding programs. Genome-wide association studies (GWAS) are an alternative to bi-parental QTL mapping in long-lived perennials. Selection based on genomic predictions of breeding values (genomic selection: GS) is another alternative for MAS. This study examined the potential of GWAS and GS in pear breeding with 76 Japanese pear cultivars to detect significant associations of 162 markers with nine agronomic traits. We applied multilocus Bayesian models accounting for ordinal categorical phenotypes for GWAS and GS model training. Significant associations were detected at harvest time, black spot resistance and the number of spurs and two of the associations were closely linked to known loci. Genome-wide predictions for GS were accurate at the highest level (0.75) in harvest time, at medium levels (0.38-0.61) in resistance to black spot, firmness of flesh, fruit shape in longitudinal section, fruit size, acid content and number of spurs and at low levels (<0.2) in all soluble solid content and vigor of tree. Results suggest the potential of GWAS and GS for use in future breeding programs in Japanese pear.

  4. Genome-wide characterization of microsatellites in Triticeae species: abundance, distribution and evolution

    PubMed Central

    Deng, Pingchuan; Wang, Meng; Feng, Kewei; Cui, Licao; Tong, Wei; Song, Weining; Nie, Xiaojun

    2016-01-01

    Microsatellites are an important constituent of plant genome and distributed across entire genome. In this study, genome-wide analysis of microsatellites in 8 Triticeae species and 9 model plants revealed that microsatellite characteristics were similar among the Triticeae species. Furthermore, genome-wide microsatellite markers were designed in wheat and then used to analyze the evolutionary relationship of wheat and other Triticeae species. Results displayed that Aegilops tauschii was found to be the closest species to Triticum aestivum, followed by Triticum urartu, Triticum turgidum and Aegilops speltoides, while Triticum monococcum, Aegilops sharonensis and Hordeum vulgare showed a relatively lower PCR amplification effectivity. Additionally, a significantly higher PCR amplification effectivity was found in chromosomes at the same subgenome than its homoeologous when these markers were subjected to search against different chromosomes in wheat. After a rigorous screening process, a total of 20,666 markers showed high amplification and polymorphic potential in wheat and its relatives, which were integrated with the public available wheat markers and then anchored to the genome of wheat (CS). This study not only provided the useful resource for SSR markers development in Triticeae species, but also shed light on the evolution of polyploid wheat from the perspective of microsatellites. PMID:27561724

  5. Genome-wide association study identifies 12 new susceptibility loci for primary biliary cirrhosis.

    PubMed

    Mells, George F; Floyd, James A B; Morley, Katherine I; Cordell, Heather J; Franklin, Christopher S; Shin, So-Youn; Heneghan, Michael A; Neuberger, James M; Donaldson, Peter T; Day, Darren B; Ducker, Samantha J; Muriithi, Agnes W; Wheater, Elizabeth F; Hammond, Christopher J; Dawwas, Muhammad F; Jones, David E; Peltonen, Leena; Alexander, Graeme J; Sandford, Richard N; Anderson, Carl A

    2011-03-13

    In addition to the HLA locus, six genetic risk factors for primary biliary cirrhosis (PBC) have been identified in recent genome-wide association studies (GWAS). To identify additional loci, we carried out a GWAS using 1,840 cases from the UK PBC Consortium and 5,163 UK population controls as part of the Wellcome Trust Case Control Consortium 3 (WTCCC3). We followed up 28 loci in an additional UK cohort of 620 PBC cases and 2,514 population controls. We identified 12 new susceptibility loci (at a genome-wide significance level of P < 5 × 10⁻⁸) and replicated all previously associated loci. We identified three further new loci in a meta-analysis of data from our study and previously published GWAS results. New candidate genes include STAT4, DENND1B, CD80, IL7R, CXCR5, TNFRSF1A, CLEC16A and NFKB1. This study has considerably expanded our knowledge of the genetic architecture of PBC.

  6. Genome-wide association study reveals two new risk loci for bipolar disorder.

    PubMed

    Mühleisen, Thomas W; Leber, Markus; Schulze, Thomas G; Strohmaier, Jana; Degenhardt, Franziska; Treutlein, Jens; Mattheisen, Manuel; Forstner, Andreas J; Schumacher, Johannes; Breuer, René; Meier, Sandra; Herms, Stefan; Hoffmann, Per; Lacour, André; Witt, Stephanie H; Reif, Andreas; Müller-Myhsok, Bertram; Lucae, Susanne; Maier, Wolfgang; Schwarz, Markus; Vedder, Helmut; Kammerer-Ciernioch, Jutta; Pfennig, Andrea; Bauer, Michael; Hautzinger, Martin; Moebus, Susanne; Priebe, Lutz; Czerski, Piotr M; Hauser, Joanna; Lissowska, Jolanta; Szeszenia-Dabrowska, Neonila; Brennan, Paul; McKay, James D; Wright, Adam; Mitchell, Philip B; Fullerton, Janice M; Schofield, Peter R; Montgomery, Grant W; Medland, Sarah E; Gordon, Scott D; Martin, Nicholas G; Krasnow, Valery; Chuchalin, Alexander; Babadjanova, Gulja; Pantelejeva, Galina; Abramova, Lilia I; Tiganov, Alexander S; Polonikov, Alexey; Khusnutdinova, Elza; Alda, Martin; Grof, Paul; Rouleau, Guy A; Turecki, Gustavo; Laprise, Catherine; Rivas, Fabio; Mayoral, Fermin; Kogevinas, Manolis; Grigoroiu-Serbanescu, Maria; Propping, Peter; Becker, Tim; Rietschel, Marcella; Nöthen, Markus M; Cichon, Sven

    2014-03-11

    Bipolar disorder (BD) is a common and highly heritable mental illness and genome-wide association studies (GWAS) have robustly identified the first common genetic variants involved in disease aetiology. The data also provide strong evidence for the presence of multiple additional risk loci, each contributing a relatively small effect to BD susceptibility. Large samples are necessary to detect these risk loci. Here we present results from the largest BD GWAS to date by investigating 2.3 million single-nucleotide polymorphisms (SNPs) in a sample of 24,025 patients and controls. We detect 56 genome-wide significant SNPs in five chromosomal regions including previously reported risk loci ANK3, ODZ4 and TRANK1, as well as the risk locus ADCY2 (5p15.31) and a region between MIR2113 and POU3F2 (6q16.1). ADCY2 is a key enzyme in cAMP signalling and our finding provides new insights into the biological mechanisms involved in the development of BD.

  7. Genome-Wide Association of Copy Number Polymorphisms and Kidney Function

    PubMed Central

    Li, Man; Carey, Jacob; Cristiano, Stephen; Susztak, Katalin; Coresh, Josef; Boerwinkle, Eric; Beaty, Terri H.; Köttgen, Anna; Scharpf, Robert B.

    2017-01-01

    Genome-wide association studies (GWAS) using single nucleotide polymorphisms (SNPs) have identified more than 50 loci associated with estimated glomerular filtration rate (eGFR), a measure of kidney function. However, significant SNPs account for a small proportion of eGFR variability. Other forms of genetic variation have not been comprehensively evaluated for association with eGFR. In this study, we assess whether changes in germline DNA copy number are associated with GFR estimated from serum creatinine, eGFRcrea. We used hidden Markov models (HMMs) to identify copy number polymorphic regions (CNPs) from high-throughput SNP arrays for 2,514 African (AA) and 8,645 European ancestry (EA) participants in the Atherosclerosis Risk in Communities (ARIC) study. Separately for the EA and AA cohorts, we used Bayesian Gaussian mixture models to estimate copy number at regions identified by the HMM or previously reported in the HapMap Project. We identified 312 and 464 autosomal CNPs among individuals of EA and AA, respectively. Multivariate models adjusted for SNP-derived covariates of population structure identified one CNP in the EA cohort near genome-wide statistical significance (Bonferroni-adjusted p = 0.067) located on chromosome 5 (876–880kb). Overall, our findings suggest a limited role of CNPs in explaining eGFR variability. PMID:28135296

  8. Mycobacterium tuberculosis genome-wide screen exposes multiple CD8+ T cell epitopes

    PubMed Central

    Hammond, A S; Klein, M R; Corrah, T; Fox, A; Jaye, A; McAdam, K P; Brookes, R H

    2005-01-01

    Mounting evidence suggests human leucocyte antigen (HLA) class I-restricted CD8+ T cells play a role in protective immunity against tuberculosis yet relatively few epitopes specific for the causative organism, Mycobacterium tuberculosis, are reported. Here a total genome-wide screen of M. tuberculosis was used to identify putative HLA-B*3501 T cell epitopes. Of 479 predicted epitopes, 13 with the highest score were synthesized and used to restimulate lymphocytes from naturally exposed HLA-B*3501 healthy individuals in cultured and ex vivo enzyme-linked immunospot (ELISPOT) assays for interferon (IFN)-γ. All 13 peptides elicited a response that varied considerably between individuals. For three peptides CD8+ T cell lines were expanded and four of the 13 were recognized permissively through the HLA-B7 supertype family. Although further testing is required we show the genome-wide screen to be feasible for the identification of unknown mycobacterial antigens involved in immunity against natural infection. While the mechanisms of protective immunity against M. tuberculosis infection remain unclear, conventional class I-restricted CD8+ T cell responses appear to be widespread throughout the genome. PMID:15762882

  9. Genome-wide genetic investigation of serological measures of common infections

    PubMed Central

    Rubicz, Rohina; Yolken, Robert; Drigalenko, Eugene; Carless, Melanie A; Dyer, Thomas D; Kent Jr, Jack; Curran, Joanne E; Johnson, Matthew P; Cole, Shelley A; Fowler, Sharon P; Arya, Rector; Puppala, Sobha; Almasy, Laura; Moses, Eric K; Kraig, Ellen; Duggirala, Ravindranath; Blangero, John; Leach, Charles T; Göring, Harald HH

    2015-01-01

    Populations and individuals differ in susceptibility to infections because of a number of factors, including host genetic variation. We previously demonstrated that differences in antibody titer, which reflect infection history, are significantly heritable. Here we attempt to identify the genetic factors influencing variation in these serological phenotypes. Blood samples from >1300 Mexican Americans were quantified for IgG antibody level against 12 common infections, selected on the basis of their reported role in cardiovascular disease risk: Chlamydia pneumoniae; Helicobacter pylori; Toxoplasma gondii; cytomegalovirus; herpes simplex I virus; herpes simplex II virus; human herpesvirus 6 (HHV6); human herpesvirus 8 (HHV8); varicella zoster virus; hepatitis A virus (HAV); influenza A virus; and influenza B virus. Pathogen-specific quantitative antibody levels were analyzed, as were three measures of pathogen burden. Genome-wide linkage and joint linkage and association analyses were performed using ~1 million SNPs. Significant linkage (lod scores >3.0) was obtained for HHV6 (on chromosome 7), HHV8 (on chromosome 6), and HAV (on chromosome 13). SNP rs4812712 on chromosome 20 was significantly associated with C. pneumoniae (P=5.3 × 10−8). However, no genome-wide significant loci were obtained for the other investigated antibodies. We conclude that it is possible to localize host genetic factors influencing some of these antibody traits, but that further larger-scale investigations will be required to elucidate the genetic mechanisms contributing to variation in antibody levels. PMID:25758998

  10. The genetics of loneliness: linking evolutionary theory to genome-wide genetics, epigenetics, and social science.

    PubMed

    Goossens, Luc; van Roekel, Eeske; Verhagen, Maaike; Cacioppo, John T; Cacioppo, Stephanie; Maes, Marlies; Boomsma, Dorret I

    2015-03-01

    As a complex trait, loneliness is likely to be influenced by the interplay of numerous genetic and environmental factors. Studies in behavioral genetics indicate that loneliness has a sizable degree of heritability. Candidate-gene and gene-expression studies have pointed to several genes related to neurotransmitters and the immune system. The notion that these genes are related to loneliness is compatible with the basic tenets of the evolutionary theory of loneliness. Research on gene-environment interactions indicates that social-environmental factors (e.g., low social support) may have a more pronounced effect and lead to higher levels of loneliness if individuals carry the sensitive variant of these candidate genes. Currently, there is no extant research on loneliness based on genome-wide association studies, gene-environment-interaction studies, or studies in epigenetics. Such studies would allow researchers to identify networks of genes that contribute to loneliness. The contribution of genetics to loneliness research will become stronger when genome-wide genetics and epigenetics are integrated and used along with well-established methods in psychology to analyze the complex process of gene-environment interplay.

  11. Six decades of vitiligo genetics: genome-wide studies provide insights into autoimmune pathogenesis.

    PubMed

    Spritz, Richard A

    2012-02-01

    Generalized vitiligo (GV) is a complex disease in which patchy depigmentation results from autoimmune loss of melanocytes from affected regions. Genetic analyses of GV span six decades, with the goal of understanding biological mechanisms and elucidating pathways that underlie the disease. The earliest studies attempted to describe the mode of inheritance and genetic epidemiology. Early genetic association studies of biological candidate genes resulted in some successes, principally HLA and PTPN22, but in hindsight many such reports now seem to be false-positives. Later, genome-wide linkage studies of multiplex GV families identified NLRP1 and XBP1, which appear to be valid GV susceptibility genes that control key aspects of immune regulation. Recently, the application of genome-wide association studies to analysis of GV has produced a rich yield of validated GV susceptibility genes that encode components of biological pathways reaching from immune cells to the melanocyte. These genes and pathways provide insights into underlying pathogenetic mechanisms and possible triggers of GV, establish relationships to other autoimmune diseases, and may provide clues to potential new approaches to GV treatment and perhaps even prevention. These results thus validate the hopes and efforts of the early investigators who first attempted to comprehend the genetic basis of vitiligo.

  12. Genome-Wide Association Study of Down Syndrome-Associated Atrioventricular Septal Defects

    PubMed Central

    Ramachandran, Dhanya; Zeng, Zhen; Locke, Adam E.; Mulle, Jennifer G.; Bean, Lora J.H.; Rosser, Tracie C.; Dooley, Kenneth J.; Cua, Clifford L.; Capone, George T.; Reeves, Roger H.; Maslen, Cheryl L.; Cutler, David J.; Feingold, Eleanor; Sherman, Stephanie L.; Zwick, Michael E.

    2015-01-01

    The goal of this study was to identify the contribution of common genetic variants to Down syndrome−associated atrioventricular septal defect, a severe heart abnormality. Compared with the euploid population, infants with Down syndrome, or trisomy 21, have a 2000-fold increased risk of presenting with atrioventricular septal defects. The cause of this increased risk remains elusive. Here we present data from the largest heart study conducted to date on a trisomic background by using a carefully characterized collection of individuals from extreme ends of the phenotypic spectrum. We performed a genome-wide association study using logistic regression analysis on 452 individuals with Down syndrome, consisting of 210 cases with complete atrioventricular septal defects and 242 controls with structurally normal hearts. No individual variant achieved genome-wide significance. We identified four disomic regions (1p36.3, 5p15.31, 8q22.3, and 17q22) and two trisomic regions on chromosome 21 (around PDXK and KCNJ6 genes) that merit further investigation in large replication studies. Our data show that a few common genetic variants of large effect size (odds ratio >2.0) do not account for the elevated risk of Down syndrome−associated atrioventricular septal defects. Instead, multiple variants of low-to-moderate effect sizes may contribute to this elevated risk, highlighting the complex genetic architecture of atrioventricular septal defects even in the highly susceptible Down syndrome population. PMID:26194203

  13. Genome-Wide Association for Sensitivity to Chronic Oxidative Stress in Drosophila melanogaster

    PubMed Central

    Jordan, Katherine W.; Craver, Kyle L.; Magwire, Michael M.; Cubilla, Carmen E.; Mackay, Trudy F. C.; Anholt, Robert R. H.

    2012-01-01

    Reactive oxygen species (ROS) are a common byproduct of mitochondrial energy metabolism, and can also be induced by exogenous sources, including UV light, radiation, and environmental toxins. ROS generation is essential for maintaining homeostasis by triggering cellular signaling pathways and host defense mechanisms. However, an imbalance of ROS induces oxidative stress and cellular death and is associated with human disease, including age-related locomotor impairment. To identify genes affecting sensitivity and resistance to ROS-induced locomotor decline, we assessed locomotion of aged flies of the sequenced, wild-derived lines from the Drosophila melanogaster Genetics Reference Panel on standard medium and following chronic exposure to medium supplemented with 3 mM menadione sodium bisulfite (MSB). We found substantial genetic variation in sensitivity to oxidative stress with respect to locomotor phenotypes. We performed genome-wide association analyses to identify candidate genes associated with variation in sensitivity to ROS-induced decline in locomotor performance, and confirmed the effects for 13 of 16 mutations tested in these candidate genes. Candidate genes associated with variation in sensitivity to MSB-induced oxidative stress form networks of genes involved in neural development, immunity, and signal transduction. Many of these genes have human orthologs, highlighting the utility of genome-wide association in Drosophila for studying complex human disease. PMID:22715409

  14. Genome-wide Comparative Analysis of Atopic Dermatitis and Psoriasis Gives Insight into Opposing Genetic Mechanisms

    PubMed Central

    Baurecht, Hansjörg; Hotze, Melanie; Brand, Stephan; Büning, Carsten; Cormican, Paul; Corvin, Aiden; Ellinghaus, David; Ellinghaus, Eva; Esparza-Gordillo, Jorge; Fölster-Holst, Regina; Franke, Andre; Gieger, Christian; Hubner, Norbert; Illig, Thomas; Irvine, Alan D.; Kabesch, Michael; Lee, Young A.E.; Lieb, Wolfgang; Marenholz, Ingo; McLean, W.H. Irwin; Morris, Derek W.; Mrowietz, Ulrich; Nair, Rajan; Nöthen, Markus M.; Novak, Natalija; O’Regan, Grainne M.; Schreiber, Stefan; Smith, Catherine; Strauch, Konstantin; Stuart, Philip E.; Trembath, Richard; Tsoi, Lam C.; Weichenthal, Michael; Barker, Jonathan; Elder, James T.; Weidinger, Stephan; Cordell, Heather J.; Brown, Sara J.

    2015-01-01

    Atopic dermatitis and psoriasis are the two most common immune-mediated inflammatory disorders affecting the skin. Genome-wide studies demonstrate a high degree of genetic overlap, but these diseases have mutually exclusive clinical phenotypes and opposing immune mechanisms. Despite their prevalence, atopic dermatitis and psoriasis very rarely co-occur within one individual. By utilizing genome-wide association study and ImmunoChip data from >19,000 individuals and methodologies developed from meta-analysis, we have identified opposing risk alleles at shared loci as well as independent disease-specific loci within the epidermal differentiation complex (chromosome 1q21.3), the Th2 locus control region (chromosome 5q31.1), and the major histocompatibility complex (chromosome 6p21–22). We further identified previously unreported pleiotropic alleles with opposing effects on atopic dermatitis and psoriasis risk in PRKRA and ANXA6/TNIP1. In contrast, there was no evidence for shared loci with effects operating in the same direction on both diseases. Our results show that atopic dermatitis and psoriasis have distinct genetic mechanisms with opposing effects in shared pathways influencing epidermal differentiation and immune response. The statistical analysis methods developed in the conduct of this study have produced additional insight from previously published data sets. The approach is likely to be applicable to the investigation of the genetic basis of other complex traits with overlapping and distinct clinical features. PMID:25574825

  15. Insect herbivory elicits genome-wide alternative splicing responses in Nicotiana attenuata.

    PubMed

    Ling, Zhihao; Zhou, Wenwu; Baldwin, Ian T; Xu, Shuqing

    2015-10-01

    Changes in gene expression and alternative splicing (AS) are involved in many responses to abiotic and biotic stresses in eukaryotic organisms. In response to attack and oviposition by insect herbivores, plants elicit rapid changes in gene expression which are essential for the activation of plant defenses; however, the herbivory-induced changes in AS remain unstudied. Using mRNA sequencing, we performed a genome-wide analysis on tobacco hornworm (Manduca sexta) feeding-induced AS in both leaves and roots of Nicotiana attenuata. Feeding by M. sexta for 5 h reduced total AS events by 7.3% in leaves but increased them in roots by 8.0% and significantly changed AS patterns in leaves and roots of existing AS genes. Feeding by M. sexta also resulted in increased (in roots) and decreased (in leaves) transcript levels of the serine/arginine-rich (SR) proteins that are involved in the AS machinery of plants and induced changes in SR gene expression that were jasmonic acid (JA)-independent in leaves but JA-dependent in roots. Changes in AS and gene expression elicited by M. sexta feeding were regulated independently in both tissues. This study provides genome-wide evidence that insect herbivory induces changes not only in the levels of gene expression but also in their splicing, which might contribute to defense against and/or tolerance of herbivory.

  16. Gene-based and pathway-based genome-wide association study of alcohol dependence

    PubMed Central

    ZUO, Lingjun; ZHANG, Clarence K.; SAYWARD, Frederick G.; CHEUNG, Kei-Hoi; WANG, Kesheng; KRYSTAL, John H.; ZHAO, Hongyu; LUO, Xingguang

    2015-01-01

    Background The organization of risk genes within signaling pathways may provide clues about the converging neurobiological effects of risk genes for alcohol dependence. Aim Identify risk genes and risk gene pathways for alcohol dependence. Methods We conducted a pathway-based genome-wide association study (GWAS) of alcohol dependence using a gene-set-rich analytic approach. Approximately one million genetic markers were tested in the discovery sample which included 1409 European-American (EA) alcohol dependent individuals and 1518 EA healthy comparison subjects. An additional 681 African-American (AA) cases and 508 AA healthy subjects served as the replication sample. Results We identified several genome-wide replicable risk genes and risk pathways that were significantly associated with alcohol dependence. After applying the Bonferroni correction for multiple testing, the ‘cellextracellular matrix interactions’ pathway (p<2.0E-4 in EAs) and the PXN gene (which encodes paxillin) (p=3.9E-7 in EAs) within this pathway were the most promising risk factors for alcohol dependence. There were also two nominally replicable pathways enriched in alcohol dependence-related genes in both EAs (0.015≤p≤0.035) and AAs (0.025≤p≤0.050): the ‘Na+/Cl- dependent neurotransmitter transporters’ pathway and the ‘other glycan degradation’ pathway. Conclusion These findings provide new evidence highlighting several genes and biological signaling processes that may be related to the risk for alcohol dependence. PMID:26120261

  17. Identifying Pleiotropic Genes in Genome-Wide Association Studies for Multivariate Phenotypes with Mixed Measurement Scales

    PubMed Central

    Williams, L. Keoki; Buu, Anne

    2017-01-01

    We propose a multivariate genome-wide association test for mixed continuous, binary, and ordinal phenotypes. A latent response model is used to estimate the correlation between phenotypes with different measurement scales so that the empirical distribution of the Fisher’s combination statistic under the null hypothesis is estimated efficiently. The simulation study shows that our proposed correlation estimation methods have high levels of accuracy. More importantly, our approach conservatively estimates the variance of the test statistic so that the type I error rate is controlled. The simulation also shows that the proposed test maintains the power at the level very close to that of the ideal analysis based on known latent phenotypes while controlling the type I error. In contrast, conventional approaches–dichotomizing all observed phenotypes or treating them as continuous variables–could either reduce the power or employ a linear regression model unfit for the data. Furthermore, the statistical analysis on the database of the Study of Addiction: Genetics and Environment (SAGE) demonstrates that conducting a multivariate test on multiple phenotypes can increase the power of identifying markers that may not be, otherwise, chosen using marginal tests. The proposed method also offers a new approach to analyzing the Fagerström Test for Nicotine Dependence as multivariate phenotypes in genome-wide association studies. PMID:28081206

  18. Genome-wide association study identifies multiple susceptibility loci for diffuse large B cell lymphoma.

    PubMed

    Cerhan, James R; Berndt, Sonja I; Vijai, Joseph; Ghesquières, Hervé; McKay, James; Wang, Sophia S; Wang, Zhaoming; Yeager, Meredith; Conde, Lucia; de Bakker, Paul I W; Nieters, Alexandra; Cox, David; Burdett, Laurie; Monnereau, Alain; Flowers, Christopher R; De Roos, Anneclaire J; Brooks-Wilson, Angela R; Lan, Qing; Severi, Gianluca; Melbye, Mads; Gu, Jian; Jackson, Rebecca D; Kane, Eleanor; Teras, Lauren R; Purdue, Mark P; Vajdic, Claire M; Spinelli, John J; Giles, Graham G; Albanes, Demetrius; Kelly, Rachel S; Zucca, Mariagrazia; Bertrand, Kimberly A; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Hutchinson, Amy; Zhi, Degui; Habermann, Thomas M; Link, Brian K; Novak, Anne J; Dogan, Ahmet; Asmann, Yan W; Liebow, Mark; Thompson, Carrie A; Ansell, Stephen M; Witzig, Thomas E; Weiner, George J; Veron, Amelie S; Zelenika, Diana; Tilly, Hervé; Haioun, Corinne; Molina, Thierry Jo; Hjalgrim, Henrik; Glimelius, Bengt; Adami, Hans-Olov; Bracci, Paige M; Riby, Jacques; Smith, Martyn T; Holly, Elizabeth A; Cozen, Wendy; Hartge, Patricia; Morton, Lindsay M; Severson, Richard K; Tinker, Lesley F; North, Kari E; Becker, Nikolaus; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; Staines, Anthony; Lightfoot, Tracy; Crouch, Simon; Smith, Alex; Roman, Eve; Diver, W Ryan; Offit, Kenneth; Zelenetz, Andrew; Klein, Robert J; Villano, Danylo J; Zheng, Tongzhang; Zhang, Yawei; Holford, Theodore R; Kricker, Anne; Turner, Jenny; Southey, Melissa C; Clavel, Jacqueline; Virtamo, Jarmo; Weinstein, Stephanie; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Trichopoulos, Dimitrios; Vermeulen, Roel C H; Boeing, Heiner; Tjonneland, Anne; Angelucci, Emanuele; Di Lollo, Simonetta; Rais, Marco; Birmann, Brenda M; Laden, Francine; Giovannucci, Edward; Kraft, Peter; Huang, Jinyan; Ma, Baoshan; Ye, Yuanqing; Chiu, Brian C H; Sampson, Joshua; Liang, Liming; Park, Ju-Hyun; Chung, Charles C; Weisenburger, Dennis D; Chatterjee, Nilanjan; Fraumeni, Joseph F; Slager, Susan L; Wu, Xifeng; de Sanjose, Silvia; Smedby, Karin E; Salles, Gilles; Skibola, Christine F; Rothman, Nathaniel; Chanock, Stephen J

    2014-11-01

    Diffuse large B cell lymphoma (DLBCL) is the most common lymphoma subtype and is clinically aggressive. To identify genetic susceptibility loci for DLBCL, we conducted a meta-analysis of 3 new genome-wide association studies (GWAS) and 1 previous scan, totaling 3,857 cases and 7,666 controls of European ancestry, with additional genotyping of 9 promising SNPs in 1,359 cases and 4,557 controls. In our multi-stage analysis, five independent SNPs in four loci achieved genome-wide significance marked by rs116446171 at 6p25.3 (EXOC2; P = 2.33 × 10(-21)), rs2523607 at 6p21.33 (HLA-B; P = 2.40 × 10(-10)), rs79480871 at 2p23.3 (NCOA1; P = 4.23 × 10(-8)) and two independent SNPs, rs13255292 and rs4733601, at 8q24.21 (PVT1; P = 9.98 × 10(-13) and 3.63 × 10(-11), respectively). These data provide substantial new evidence for genetic susceptibility to this B cell malignancy and point to pathways involved in immune recognition and immune function in the pathogenesis of DLBCL.

  19. Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome.

    PubMed

    Pedersen, Jakob Skou; Valen, Eivind; Velazquez, Amhed M Vargas; Parker, Brian J; Rasmussen, Morten; Lindgreen, Stinus; Lilje, Berit; Tobin, Desmond J; Kelly, Theresa K; Vang, Søren; Andersson, Robin; Jones, Peter A; Hoover, Cindi A; Tikhonov, Alexei; Prokhortchouk, Egor; Rubin, Edward M; Sandelin, Albin; Gilbert, M Thomas P; Krogh, Anders; Willerslev, Eske; Orlando, Ludovic

    2014-03-01

    Epigenetic information is available from contemporary organisms, but is difficult to track back in evolutionary time. Here, we show that genome-wide epigenetic information can be gathered directly from next-generation sequence reads of DNA isolated from ancient remains. Using the genome sequence data generated from hair shafts of a 4000-yr-old Paleo-Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery of the expected signals at promoter regions, exon/intron boundaries, and CTCF sites. The top-scoring nucleosome calls revealed distinct DNA positioning biases, attesting to nucleotide-level accuracy. The ancient methylation levels exhibited high conservation over time, clustering closely with modern hair tissues. Using ancient methylation information, we estimated the age at death of the Saqqaq individual and illustrate how epigenetic information can be used to infer ancient gene expression. Similar epigenetic signatures were found in other fossil material, such as 110,000- to 130,000-yr-old bones, supporting the contention that ancient epigenomic information can be reconstructed from a deep past. Our findings lay the foundation for extracting epigenomic information from ancient samples, allowing shifts in epialleles to be tracked through evolutionary time, as well as providing an original window into modern epigenomics.

  20. Genome-wide antisense transcription drives mRNA processing in bacteria

    PubMed Central

    Lasa, Iñigo; Toledo-Arana, Alejandro; Dobin, Alexander; Villanueva, Maite; de los Mozos, Igor Ruiz; Vergara-Irigaray, Marta; Segura, Víctor; Fagegaltier, Delphine; Penadés, José R.; Valle, Jaione; Solano, Cristina; Gingeras, Thomas R.

    2011-01-01

    RNA deep sequencing technologies are revealing unexpected levels of complexity in bacterial transcriptomes with the discovery of abundant noncoding RNAs, antisense RNAs, long 5′ and 3′ untranslated regions, and alternative operon structures. Here, by applying deep RNA sequencing to both the long and short RNA fractions (<50 nucleotides) obtained from the major human pathogen Staphylococcus aureus, we have detected a collection of short RNAs that is generated genome-wide through the digestion of overlapping sense/antisense transcripts by RNase III endoribonuclease. At least 75% of sense RNAs from annotated genes are subject to this mechanism of antisense processing. Removal of RNase III activity reduces the amount of short RNAs and is accompanied by the accumulation of discrete antisense transcripts. These results suggest the production of pervasive but hidden antisense transcription used to process sense transcripts by means of creating double-stranded substrates. This process of RNase III-mediated digestion of overlapping transcripts can be observed in several evolutionarily diverse Gram-positive bacteria and is capable of providing a unique genome-wide posttranscriptional mechanism to adjust mRNA levels. PMID:22123973

  1. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    SciTech Connect

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMcahon, Katherine D.; Mamlstrom, Rex R.

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ecotype model? of diversification, but not previously observed in natural populations.

  2. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    SciTech Connect

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  3. Meta-analysis of genome-wide association studies identifies 10 loci influencing allergic sensitization

    PubMed Central

    Granell, Raquel; Strachan, David P; Alves, Alexessander Couto; Linneberg, Allan; Curtin, John A; Warrington, Nicole M; Standl, Marie; Kerkhof, Marjan; Jonsdottir, Ingileif; Bukvic, Blazenka K; Kaakinen, Marika; Sleimann, Patrick; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Schramm, Katharina; Baltic, Svetlana; Kreiner-Møller, Eskil; Simpson, Angela; St Pourcain, Beate; Coin, Lachlan; Hui, Jennie; Walters, Eugene H; Tiesler, Carla M T; Duffy, David L; Jones, Graham; Ring, Susan M; McArdle, Wendy L; Price, Loren; Robertson, Colin F; Pekkanen, Juha; Tang, Clara S; Thiering, Elisabeth; Montgomery, Grant W; Hartikainen, Anna-Liisa; Dharmage, Shyamali C; Husemoen, Lise L; Herder, Christian; Kemp, John P; Elliot, Paul; James, Alan; Waldenberger, Melanie; Abramson, Michael J; Fairfax, Benjamin P; Knight, Julian C; Gupta, Ramneek; Thompson, Philip J; Holt, Patrick; Sly, Peter; Hirschhorn, Joel N; Blekic, Mario; Weidinger, Stephan; Hakonarsson, Hakon; Stefansson, Kari; Heinrich, Joachim; Postma, Dirkje S; Custovic, Adnan; Pennell, Craig E; Jarvelin, Marjo-Riitta; Koppelman, Gerard H; Timpson, Nicholas; Ferreira, Manuel A; Bisgaard, Hans; Henderson, A John

    2016-01-01

    Allergen-specific IgE (allergic sensitization) plays a central role in the pathogenesis of allergic disease. We performed the first large-scale genome wide association study (GWAS) of allergic sensitization in 5,789 affected individuals and 10,056 controls and followed up the top SNP from 26 loci in 6,114 affected individuals and 9,920 controls. We increased the number of susceptibility loci with genome-wide significant association to allergic sensitization from three to 10, including SNPs in or near TLR6, C11orf30, STAT6, SLC25A46, HLA-DQB1, IL1RL1, LPP, MYC, IL2 and HLA-B. All the top-SNPs were associated with allergic symptoms in an independent study. Risk variants at these 10 loci were estimated to account for at least 25% of allergic sensitization and allergic rhinitis. Understanding the molecular mechanisms underlying these associations may provide novel insight into the etiology of allergic disease. PMID:23817571

  4. Genome-wide association study identifies six new loci influencing pulse pressure and mean arterial pressure.

    PubMed

    Wain, Louise V; Verwoert, Germaine C; O'Reilly, Paul F; Shi, Gang; Johnson, Toby; Johnson, Andrew D; Bochud, Murielle; Rice, Kenneth M; Henneman, Peter; Smith, Albert V; Ehret, Georg B; Amin, Najaf; Larson, Martin G; Mooser, Vincent; Hadley, David; Dörr, Marcus; Bis, Joshua C; Aspelund, Thor; Esko, Tõnu; Janssens, A Cecile J W; Zhao, Jing Hua; Heath, Simon; Laan, Maris; Fu, Jingyuan; Pistis, Giorgio; Luan, Jian'an; Arora, Pankaj; Lucas, Gavin; Pirastu, Nicola; Pichler, Irene; Jackson, Anne U; Webster, Rebecca J; Zhang, Feng; Peden, John F; Schmidt, Helena; Tanaka, Toshiko; Campbell, Harry; Igl, Wilmar; Milaneschi, Yuri; Hottenga, Jouke-Jan; Vitart, Veronique; Chasman, Daniel I; Trompet, Stella; Bragg-Gresham, Jennifer L; Alizadeh, Behrooz Z; Chambers, John C; Guo, Xiuqing; Lehtimäki, Terho; Kühnel, Brigitte; Lopez, Lorna M; Polašek, Ozren; Boban, Mladen; Nelson, Christopher P; Morrison, Alanna C; Pihur, Vasyl; Ganesh, Santhi K; Hofman, Albert; Kundu, Suman; Mattace-Raso, Francesco U S; Rivadeneira, Fernando; Sijbrands, Eric J G; Uitterlinden, Andre G; Hwang, Shih-Jen; Vasan, Ramachandran S; Wang, Thomas J; Bergmann, Sven; Vollenweider, Peter; Waeber, Gérard; Laitinen, Jaana; Pouta, Anneli; Zitting, Paavo; McArdle, Wendy L; Kroemer, Heyo K; Völker, Uwe; Völzke, Henry; Glazer, Nicole L; Taylor, Kent D; Harris, Tamara B; Alavere, Helene; Haller, Toomas; Keis, Aime; Tammesoo, Mari-Liis; Aulchenko, Yurii; Barroso, Inês; Khaw, Kay-Tee; Galan, Pilar; Hercberg, Serge; Lathrop, Mark; Eyheramendy, Susana; Org, Elin; Sõber, Siim; Lu, Xiaowen; Nolte, Ilja M; Penninx, Brenda W; Corre, Tanguy; Masciullo, Corrado; Sala, Cinzia; Groop, Leif; Voight, Benjamin F; Melander, Olle; O'Donnell, Christopher J; Salomaa, Veikko; d'Adamo, Adamo Pio; Fabretto, Antonella; Faletra, Flavio; Ulivi, Sheila; Del Greco, Fabiola M; Facheris, Maurizio; Collins, Francis S; Bergman, Richard N; Beilby, John P; Hung, Joseph; Musk, A William; Mangino, Massimo; Shin, So-Youn; Soranzo, Nicole; Watkins, Hugh; Goel, Anuj; Hamsten, Anders; Gider, Pierre; Loitfelder, Marisa; Zeginigg, Marion; Hernandez, Dena; Najjar, Samer S; Navarro, Pau; Wild, Sarah H; Corsi, Anna Maria; Singleton, Andrew; de Geus, Eco J C; Willemsen, Gonneke; Parker, Alex N; Rose, Lynda M; Buckley, Brendan; Stott, David; Orru, Marco; Uda, Manuela; van der Klauw, Melanie M; Zhang, Weihua; Li, Xinzhong; Scott, James; Chen, Yii-Der Ida; Burke, Gregory L; Kähönen, Mika; Viikari, Jorma; Döring, Angela; Meitinger, Thomas; Davies, Gail; Starr, John M; Emilsson, Valur; Plump, Andrew; Lindeman, Jan H; Hoen, Peter A C 't; König, Inke R; Felix, Janine F; Clarke, Robert; Hopewell, Jemma C; Ongen, Halit; Breteler, Monique; Debette, Stéphanie; Destefano, Anita L; Fornage, Myriam; Mitchell, Gary F; Smith, Nicholas L; Holm, Hilma; Stefansson, Kari; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Samani, Nilesh J; Preuss, Michael; Rudan, Igor; Hayward, Caroline; Deary, Ian J; Wichmann, H-Erich; Raitakari, Olli T; Palmas, Walter; Kooner, Jaspal S; Stolk, Ronald P; Jukema, J Wouter; Wright, Alan F; Boomsma, Dorret I; Bandinelli, Stefania; Gyllensten, Ulf B; Wilson, James F; Ferrucci, Luigi; Schmidt, Reinhold; Farrall, Martin; Spector, Tim D; Palmer, Lyle J; Tuomilehto, Jaakko; Pfeufer, Arne; Gasparini, Paolo; Siscovick, David; Altshuler, David; Loos, Ruth J F; Toniolo, Daniela; Snieder, Harold; Gieger, Christian; Meneton, Pierre; Wareham, Nicholas J; Oostra, Ben A; Metspalu, Andres; Launer, Lenore; Rettig, Rainer; Strachan, David P; Beckmann, Jacques S; Witteman, Jacqueline C M; Erdmann, Jeanette; van Dijk, Ko Willems; Boerwinkle, Eric; Boehnke, Michael; Ridker, Paul M; Jarvelin, Marjo-Riitta; Chakravarti, Aravinda; Abecasis, Goncalo R; Gudnason, Vilmundur; Newton-Cheh, Christopher; Levy, Daniel; Munroe, Patricia B; Psaty, Bruce M; Caulfield, Mark J; Rao, Dabeeru C; Tobin, Martin D; Elliott, Paul; van Duijn, Cornelia M

    2011-09-11

    Numerous genetic loci have been associated with systolic blood pressure (SBP) and diastolic blood pressure (DBP) in Europeans. We now report genome-wide association studies of pulse pressure (PP) and mean arterial pressure (MAP). In discovery (N = 74,064) and follow-up studies (N = 48,607), we identified at genome-wide significance (P = 2.7 × 10(-8) to P = 2.3 × 10(-13)) four new PP loci (at 4q12 near CHIC2, 7q22.3 near PIK3CG, 8q24.12 in NOV and 11q24.3 near ADAMTS8), two new MAP loci (3p21.31 in MAP4 and 10q25.3 near ADRB1) and one locus associated with both of these traits (2q24.3 near FIGN) that has also recently been associated with SBP in east Asians. For three of the new PP loci, the estimated effect for SBP was opposite of that for DBP, in contrast to the majority of common SBP- and DBP-associated variants, which show concordant effects on both traits. These findings suggest new genetic pathways underlying blood pressure variation, some of which may differentially influence SBP and DBP.

  5. A high-density SNP genome-wide linkage scan in a large autism extended pedigree.

    PubMed

    Allen-Brady, K; Miller, J; Matsunami, N; Stevens, J; Block, H; Farley, M; Krasny, L; Pingree, C; Lainhart, J; Leppert, M; McMahon, W M; Coon, H

    2009-06-01

    We performed a high-density, single nucleotide polymorphism (SNP), genome-wide scan on a six-generation pedigree from Utah with seven affected males, diagnosed with autism spectrum disorder. Using a two-stage linkage design, we first performed a nonparametric analysis on the entire genome using a 10K SNP chip to identify potential regions of interest. To confirm potentially interesting regions, we eliminated SNPs in high linkage disequilibrium (LD) using a principal components analysis (PCA) method and repeated the linkage results. Three regions met genome-wide significance criteria after controlling for LD: 3q13.2-q13.31 (nonparametric linkage (NPL), 5.58), 3q26.31-q27.3 (NPL, 4.85) and 20q11.21-q13.12 (NPL, 5.56). Two regions met suggestive criteria for significance 7p14.1-p11.22 (NPL, 3.18) and 9p24.3 (NPL, 3.44). All five chromosomal regions are consistent with other published findings. Haplotype sharing results showed that five of the affected subjects shared more than a single chromosomal region of interest with other affected subjects. Although no common autism susceptibility genes were found for all seven autism cases, these results suggest that multiple genetic loci within these regions may contribute to the autism phenotype in this family, and further follow-up of these chromosomal regions is warranted.

  6. CONAN: copy number variation analysis software for genome-wide association studies

    PubMed Central

    2010-01-01

    Background Genome-wide association studies (GWAS) based on single nucleotide polymorphisms (SNPs) revolutionized our perception of the genetic regulation of complex traits and diseases. Copy number variations (CNVs) promise to shed additional light on the genetic basis of monogenic as well as complex diseases and phenotypes. Indeed, the number of detected associations between CNVs and certain phenotypes are constantly increasing. However, while several software packages support the determination of CNVs from SNP chip data, the downstream statistical inference of CNV-phenotype associations is still subject to complicated and inefficient in-house solutions, thus strongly limiting the performance of GWAS based on CNVs. Results CONAN is a freely available client-server software solution which provides an intuitive graphical user interface for categorizing, analyzing and associating CNVs with phenotypes. Moreover, CONAN assists the evaluation process by visualizing detected associations via Manhattan plots in order to enable a rapid identification of genome-wide significant CNV regions. Various file formats including the information on CNVs in population samples are supported as input data. Conclusions CONAN facilitates the performance of GWAS based on CNVs and the visual analysis of calculated results. CONAN provides a rapid, valid and straightforward software solution to identify genetic variation underlying the 'missing' heritability for complex traits that remains unexplained by recent GWAS. The freely available software can be downloaded at http://genepi-conan.i-med.ac.at. PMID:20546565

  7. Identification of loci affecting teat number by genome-wide association studies on three pig populations

    PubMed Central

    Tang, Jianhong; Zhang, Zhiyan; Yang, Bin; Guo, Yuanmei; Ai, Huashui; Long, Yi; Su, Ying; Cui, Leilei; Zhou, Liyu; Wang, Xiaopeng; Zhang, Hui; Wang, Chengbin; Ren, Jun; Huang, Lusheng; Ding, Nengshui

    2017-01-01

    Objective Three genome-wide association studies (GWAS) and a meta-analysis of GWAS were conducted to explore the genetic mechanisms underlying variation in pig teat number. Methods We performed three GWAS and a meta-analysis for teat number on three pig populations, including a White Duroc×Erhualian F2 resource population (n = 1,743), a Chinese Erhualian pig population (n = 320) and a Chinese Sutai pig population (n = 383). Results We detected 24 single nucleotide polymorphisms (SNPs) that surpassed the genome-wide significant level on Sus Scrofa chromosomes (SSC) 1, 7, and 12 in the F2 resource population, corresponding to four loci for pig teat number. We highlighted vertnin (VRTN) and lysine demethylase 6B (KDM6B) as two interesting candidate genes at the loci on SSC7 and SSC12. No significant associated SNPs were identified in the meta-analysis of GWAS. Conclusion The results verified the complex genetic architecture of pig teat number. The causative variants for teat number may be different in the three populations PMID:27165028

  8. Genome-wide association study identifies three novel loci for type 2 diabetes.

    PubMed

    Hara, Kazuo; Fujita, Hayato; Johnson, Todd A; Yamauchi, Toshimasa; Yasuda, Kazuki; Horikoshi, Momoko; Peng, Chen; Hu, Cheng; Ma, Ronald C W; Imamura, Minako; Iwata, Minoru; Tsunoda, Tatsuhiko; Morizono, Takashi; Shojima, Nobuhiro; So, Wing Yee; Leung, Ting Fan; Kwan, Patrick; Zhang, Rong; Wang, Jie; Yu, Weihui; Maegawa, Hiroshi; Hirose, Hiroshi; Kaku, Kohei; Ito, Chikako; Watada, Hirotaka; Tanaka, Yasushi; Tobe, Kazuyuki; Kashiwagi, Atsunori; Kawamori, Ryuzo; Jia, Weiping; Chan, Juliana C N; Teo, Yik Ying; Shyong, Tai E; Kamatani, Naoyuki; Kubo, Michiaki; Maeda, Shiro; Kadowaki, Takashi

    2014-01-01

    Although over 60 loci for type 2 diabetes (T2D) have been identified, there still remains a large genetic component to be clarified. To explore unidentified loci for T2D, we performed a genome-wide association study (GWAS) of 6 209 637 single-nucleotide polymorphisms (SNPs), which were directly genotyped or imputed using East Asian references from the 1000 Genomes Project (June 2011 release) in 5976 Japanese patients with T2D and 20 829 nondiabetic individuals. Nineteen unreported loci were selected and taken forward to follow-up analyses. Combined discovery and follow-up analyses (30 392 cases and 34 814 controls) identified three new loci with genome-wide significance, which were MIR129-LEP [rs791595; risk allele = A; risk allele frequency (RAF) = 0.080; P = 2.55 × 10(-13); odds ratio (OR) = 1.17], GPSM1 [rs11787792; risk allele = A; RAF = 0.874; P = 1.74 × 10(-10); OR = 1.15] and SLC16A13 (rs312457; risk allele = G; RAF = 0.078; P = 7.69 × 10(-13); OR = 1.20). This study demonstrates that GWASs based on the imputation of genotypes using modern reference haplotypes such as that from the 1000 Genomes Project data can assist in identification of new loci for common diseases.

  9. Genome-wide association study identifies ZFHX1B as a susceptibility locus for severe myopia.

    PubMed

    Khor, Chiea Chuen; Miyake, Masahiro; Chen, Li Jia; Shi, Yi; Barathi, Veluchamy A; Qiao, Fan; Nakata, Isao; Yamashiro, Kenji; Zhou, Xin; Tam, Pancy O S; Cheng, Ching-Yu; Tai, E Shyong; Vithana, Eranga N; Aung, Tin; Teo, Yik-Ying; Wong, Tien-Yin; Moriyama, Muka; Ohno-Matsui, Kyoko; Mochizuki, Manabu; Matsuda, Fumihiko; Yong, Rita Y Y; Yap, Eric P H; Yang, Zhenglin; Pang, Chi Pui; Saw, Seang-Mei; Yoshimura, Nagahisa

    2013-12-20

    Severe myopia (defined as spherical equivalent < -6.0 D) is a predominant problem in Asian countries, resulting in substantial morbidity. We performed a meta-analysis of four genome-wide association studies (GWAS), all of East Asian descent totaling 1603 cases and 3427 controls. Two single nucleotide polymorphisms (SNPs) (rs13382811 from ZFHX1B [encoding for ZEB2] and rs6469937 from SNTB1) showed highly suggestive evidence of association with disease (P < 1 × 10(-7)) and were brought forward for replication analysis in a further 1241 severe myopia cases and 3559 controls from a further three independent sample collections. Significant evidence of replication was observed, and both SNP markers surpassed the formal threshold for genome-wide significance upon meta-analysis of both discovery and replication stages (P = 5.79 × 10(-10), per-allele odds ratio (OR) = 1.26 for rs13382811 and P = 2.01 × 10(-9), per-allele OR = 0.79 for rs6469937). The observation at SNTB1 is confirmatory of a very recent GWAS on severe myopia. Both genes were expressed in the human retina, sclera, as well as the retinal pigmented epithelium. In an experimental mouse model for myopia, we observed significant alterations to gene and protein expression in the retina and sclera of the unilateral induced myopic eyes for Zfhx1b and Sntb1. These new data advance our understanding of the molecular pathogenesis of severe myopia.

  10. Genome-wide association and linkage analyses localize a progressive retinal atrophy locus in Persian cats.

    PubMed

    Alhaddad, Hasan; Gandolfi, Barbara; Grahn, Robert A; Rah, Hyung-Chul; Peterson, Carlyn B; Maggs, David J; Good, Kathryn L; Pedersen, Niels C; Lyons, Leslie A

    2014-08-01

    Hereditary eye diseases of animals serve as excellent models of human ocular disorders and assist in the development of gene and drug therapies for inherited forms of blindness. Several primary hereditary eye conditions affecting various ocular tissues and having different rates of progression have been documented in domestic cats. Gene therapy for canine retinopathies has been successful, thus the cat could be a gene therapy candidate for other forms of retinal degenerations. The current study investigates a hereditary, autosomal recessive, retinal degeneration specific to Persian cats. A multi-generational pedigree segregating for this progressive retinal atrophy was genotyped using a 63 K SNP array and analyzed via genome-wide linkage and association methods. A multi-point parametric linkage analysis localized the blindness phenotype to a ~1.75 Mb region with significant LOD scores (Z ≈ 14, θ = 0.00) on cat chromosome E1. Genome-wide TDT, sib-TDT, and case-control analyses also consistently supported significant association within the same region on chromosome E1, which is homologous to human chromosome 17. Using haplotype analysis, a ~1.3 Mb region was identified as highly associated for progressive retinal atrophy in Persian cats. Several candidate genes within the region are reasonable candidates as a potential causative gene and should be considered for molecular analyses.

  11. Genome-wide scans of genetic variants for psychophysiological endophenotypes: A methodological overview

    PubMed Central

    IACONO, WILLIAM. G.; MALONE, STEPHEN. M.; VAIDYANATHAN, UMA; VRIEZE, SCOTT I.

    2014-01-01

    This article provides an introductory overview of the investigative strategy employed to evaluate the genetic basis of 17 endophenotypes examined as part of a 20-year data collection effort from the Minnesota Center for Twin and Family Research. Included are characterization of the study samples, descriptive statistics for key properties of the psychophysiological measures, and rationale behind the steps taken in the molecular genetic study design. The statistical approach included (a) biometric analysis of twin and family data, (b) heritability analysis using 527,829 single nucleotide polymorphisms (SNPs), (c) genome-wide association analysis of these SNPs and 17,601 autosomal genes, (d) follow-up analyses of candidate SNPs and genes hypothesized to have an association with each endophenotype, (e) rare variant analysis of nonsynonymous SNPs in the exome, and (f) whole genome sequencing association analysis using 27 million genetic variants. These methods were used in the accompanying empirical articles comprising this special issue, Genome-Wide Scans of Genetic Variants for Psychophysiological Endophenotypes. PMID:25387703

  12. Genome-wide mapping of IBD segments in an Ashkenazi PD cohort identifies associated haplotypes.

    PubMed

    Vacic, Vladimir; Ozelius, Laurie J; Clark, Lorraine N; Bar-Shira, Anat; Gana-Weisz, Mali; Gurevich, Tanya; Gusev, Alexander; Kedmi, Merav; Kenny, Eimear E; Liu, Xinmin; Mejia-Santana, Helen; Mirelman, Anat; Raymond, Deborah; Saunders-Pullman, Rachel; Desnick, Robert J; Atzmon, Gil; Burns, Edward R; Ostrer, Harry; Hakonarson, Hakon; Bergman, Aviv; Barzilai, Nir; Darvasi, Ariel; Peter, Inga; Guha, Saurav; Lencz, Todd; Giladi, Nir; Marder, Karen; Pe'er, Itsik; Bressman, Susan B; Orr-Urtreger, Avi

    2014-09-01

    The recent series of large genome-wide association studies in European and Japanese cohorts established that Parkinson disease (PD) has a substantial genetic component. To further investigate the genetic landscape of PD, we performed a genome-wide scan in the largest to date Ashkenazi Jewish cohort of 1130 Parkinson patients and 2611 pooled controls. Motivated by the reduced disease allele heterogeneity and a high degree of identical-by-descent (IBD) haplotype sharing in this founder population, we conducted a haplotype association study based on mapping of shared IBD segments. We observed significant haplotype association signals at three previously implicated Parkinson loci: LRRK2 (OR = 12.05, P = 1.23 × 10(-56)), MAPT (OR = 0.62, P = 1.78 × 10(-11)) and GBA (multiple distinct haplotypes, OR > 8.28, P = 1.13 × 10(-11) and OR = 2.50, P = 1.22 × 10(-9)). In addition, we identified a novel association signal on chr2q14.3 coming from a rare haplotype (OR = 22.58, P = 1.21 × 10(-10)) and replicated it in a secondary cohort of 306 Ashkenazi PD cases and 2583 controls. Our results highlight the power of our haplotype association method, particularly useful in studies of founder populations, and reaffirm the benefits of studying complex diseases in Ashkenazi Jewish cohorts.

  13. Genome-Wide Analysis of Homeobox Gene Family in Legumes: Identification, Gene Duplication and Expression Profiling

    PubMed Central

    Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development. PMID:25745864

  14. Genome-wide microsatellite characterization and marker development in the sequenced Brassica crop species.

    PubMed

    Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2014-02-01

    Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species.

  15. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development

    PubMed Central

    Jiang, Zhihua; Wang, Hongyang; Michal, Jennifer J.; Zhou, Xiang; Liu, Bang; Woods, Leah C. Solberg; Fuchs, Rita A.

    2016-01-01

    Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals. PMID:26722221

  16. Genome-wide alterations of the DNA replication program during tumor progression

    NASA Astrophysics Data System (ADS)

    Arneodo, A.; Goldar, A.; Argoul, F.; Hyrien, O.; Audit, B.

    2016-08-01

    Oncogenic stress is a major driving force in the early stages of cancer development. Recent experimental findings reveal that, in precancerous lesions and cancers, activated oncogenes may induce stalling and dissociation of DNA replication forks resulting in DNA damage. Replication timing is emerging as an important epigenetic feature that recapitulates several genomic, epigenetic and functional specificities of even closely related cell types. There is increasing evidence that chromosome rearrangements, the hallmark of many cancer genomes, are intimately associated with the DNA replication program and that epigenetic replication timing changes often precede chromosomic rearrangements. The recent development of a novel methodology to map replication fork polarity using deep sequencing of Okazaki fragments has provided new and complementary genome-wide replication profiling data. We review the results of a wavelet-based multi-scale analysis of genomic and epigenetic data including replication profiles along human chromosomes. These results provide new insight into the spatio-temporal replication program and its dynamics during differentiation. Here our goal is to bring to cancer research, the experimental protocols and computational methodologies for replication program profiling, and also the modeling of the spatio-temporal replication program. To illustrate our purpose, we report very preliminary results obtained for the chronic myelogeneous leukemia, the archetype model of cancer. Finally, we discuss promising perspectives on using genome-wide DNA replication profiling as a novel efficient tool for cancer diagnosis, prognosis and personalized treatment.

  17. Five endometrial cancer risk loci identified through genome-wide association analysis

    PubMed Central

    O’Mara, Tracy A; Painter, Jodie N; Glubb, Dylan M; Flach, Susanne; Lewis, Annabelle; French, Juliet D; Freeman-Mills, Luke; Church, David; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Webb, Penelope M; Attia, John; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Henders, Anjali K; Martin, Nicholas G; Montgomery, Grant W; Nyholt, Dale R; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Dennis, Joe; Fasching, Peter A; Beckmann, Matthias W; Hein, Alexander; Ekici, Arif B; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Amant, Frederic; Schrauwen, Stefanie; Zhao, Hui; Lambrechts, Diether; Depreeuw, Jeroen; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica MJ; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Liu, Tao; Mints, Miriam; Tham, Emma; Consortium, CHIBCHA; Jun Li, Mulin; Yip, Shun H; Wang, Junwen; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Dunlop, Malcolm; Houlston, Richard; Palles, Claire; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Cunningham, Julie M; Pharoah, Paul D P; Dunning, Alison M; Edwards, Stacey L; Easton, Douglas F; Tomlinson, Ian; Spurdle, Amanda B

    2016-01-01

    We conducted a meta-analysis of three endometrial cancer GWAS and two replication phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five novel risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1 near SIVA1). A second independent 8q24.21 signal (rs17232730) was found. Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r2=0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103-T endometrial cancer protective allele suppressed gene expression in vitro suggesting that regulation of KLF5 expression, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer. PMID:27135401

  18. Genetic Diversity in the Modern Horse Illustrated from Genome-Wide SNP Data

    PubMed Central

    Petersen, Jessica L.; Mickelson, James R.; Cothran, E. Gus; Andersson, Lisa S.; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M.; Borges, Alexandre S.; Brama, Pieter; da Câmara Machado, Artur; Distl, Ottmar; Felicetti, Michela; Fox-Clipsham, Laura; Graves, Kathryn T.; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A.; Mikko, Sofia; Orr, Nicholas; Penedo, M. Cecilia T; Piercy, Richard J.; Raekallio, Marja; Rieder, Stefan; Røed, Knut H.; Silvestrelli, Maurizio; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; M. Wade, Claire; McCue, Molly E.

    2013-01-01

    Horses were domesticated from the Eurasian steppes 5,000–6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds). This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. FST calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection. PMID:23383025

  19. DECIDE: a Decision Support Tool to Facilitate Parents' Choices Regarding Genome-Wide Sequencing.

    PubMed

    Birch, Patricia; Adam, S; Bansback, N; Coe, R R; Hicklin, J; Lehman, A; Li, K C; Friedman, J M

    2016-12-01

    We describe the rationale, development, and usability testing for an integrated e-learning tool and decision aid for parents facing decisions about genome-wide sequencing (GWS) for their children with a suspected genetic condition. The online tool, DECIDE, is designed to provide decision-support and to promote high quality decisions about undergoing GWS with or without return of optional incidental finding results. DECIDE works by integrating educational material with decision aids. Users may tailor their learning by controlling both the amount of information and its format - text and diagrams and/or short videos. The decision aid guides users to weigh the importance of various relevant factors in their own lives and circumstances. After considering the pros and cons of GWS and return of incidental findings, DECIDE summarizes the user's responses and apparent preferred choices. In a usability study of 16 parents who had already chosen GWS after conventional genetic counselling, all participants found DECIDE to be helpful. Many would have been satisfied to use it alone to guide their GWS decisions, but most would prefer to have the option of consulting a health care professional as well to aid their decision. Further testing is necessary to establish the effectiveness of using DECIDE as an adjunct to or instead of conventional pre-test genetic counselling for clinical genome-wide sequencing.

  20. Genome-Wide Divergence in the West-African Malaria Vector Anopheles melas.

    PubMed

    Deitz, Kevin C; Athrey, Giridhar A; Jawara, Musa; Overgaard, Hans J; Matias, Abrahan; Slotman, Michel A

    2016-09-08

    Anopheles melas is a member of the recently diverged An. gambiae species complex, a model for speciation studies, and is a locally important malaria vector along the West-African coast where it breeds in brackish water. A recent population genetic study of An. melas revealed species-level genetic differentiation between three population clusters. An. melas West extends from The Gambia to the village of Tiko, Cameroon. The other mainland cluster, An. melas South, extends from the southern Cameroonian village of Ipono to Angola. Bioko Island, Equatorial Guinea An. melas populations are genetically isolated from mainland populations. To examine how genetic differentiation between these An. melas forms is distributed across their genomes, we conducted a genome-wide analysis of genetic differentiation and selection using whole genome sequencing data of pooled individuals (Pool-seq) from a representative population of each cluster. The An. melas forms exhibit high levels of genetic differentiation throughout their genomes, including the presence of numerous fixed differences between clusters. Although the level of divergence between the clusters is on a par with that of other species within the An. gambiae complex, patterns of genome-wide divergence and diversity do not provide evidence for the presence of pre- and/or postmating isolating mechanisms in the form of speciation islands. These results are consistent with an allopatric divergence process with little or no introgression.

  1. Genome-wide comparative analysis of atopic dermatitis and psoriasis gives insight into opposing genetic mechanisms.

    PubMed

    Baurecht, Hansjörg; Hotze, Melanie; Brand, Stephan; Büning, Carsten; Cormican, Paul; Corvin, Aiden; Ellinghaus, David; Ellinghaus, Eva; Esparza-Gordillo, Jorge; Fölster-Holst, Regina; Franke, Andre; Gieger, Christian; Hubner, Norbert; Illig, Thomas; Irvine, Alan D; Kabesch, Michael; Lee, Young A E; Lieb, Wolfgang; Marenholz, Ingo; McLean, W H Irwin; Morris, Derek W; Mrowietz, Ulrich; Nair, Rajan; Nöthen, Markus M; Novak, Natalija; O'Regan, Grainne M; Schreiber, Stefan; Smith, Catherine; Strauch, Konstantin; Stuart, Philip E; Trembath, Richard; Tsoi, Lam C; Weichenthal, Michael; Barker, Jonathan; Elder, James T; Weidinger, Stephan; Cordell, Heather J; Brown, Sara J

    2015-01-08

    Atopic dermatitis and psoriasis are the two most common immune-mediated inflammatory disorders affecting the skin. Genome-wide studies demonstrate a high degree of genetic overlap, but these diseases have mutually exclusive clinical phenotypes and opposing immune mechanisms. Despite their prevalence, atopic dermatitis and psoriasis very rarely co-occur within one individual. By utilizing genome-wide association study and ImmunoChip data from >19,000 individuals and methodologies developed from meta-analysis, we have identified opposing risk alleles at shared loci as well as independent disease-specific loci within the epidermal differentiation complex (chromosome 1q21.3), the Th2 locus control region (chromosome 5q31.1), and the major histocompatibility complex (chromosome 6p21-22). We further identified previously unreported pleiotropic alleles with opposing effects on atopic dermatitis and psoriasis risk in PRKRA and ANXA6/TNIP1. In contrast, there was no evidence for shared loci with effects operating in the same direction on both diseases. Our results show that atopic dermatitis and psoriasis have distinct genetic mechanisms with opposing effects in shared pathways influencing epidermal differentiation and immune response. The statistical analysis methods developed in the conduct of this study have produced additional insight from previously published data sets. The approach is likely to be applicable to the investigation of the genetic basis of other complex traits with overlapping and distinct clinical features.

  2. Genome-wide association analysis demonstrates the highly polygenic character of age-related hearing impairment

    PubMed Central

    Fransen, Erik; Bonneux, Sarah; Corneveaux, Jason J; Schrauwen, Isabelle; Di Berardino, Federica; White, Cory H; Ohmen, Jeffrey D; Van de Heyning, Paul; Ambrosetti, Umberto; Huentelman, Matthew J; Van Camp, Guy; Friedman, Rick A

    2015-01-01

    We performed a genome-wide association study (GWAS) to identify the genes responsible for age-related hearing impairment (ARHI), the most common form of hearing impairment in the elderly. Analysis of common variants, with and without adjustment for stratification and environmental covariates, rare variants and interactions, as well as gene-set enrichment analysis, showed no variants with genome-wide significance. No evidence for replication of any previously reported genes was found. A study of the genetic architecture indicates for the first time that ARHI is highly polygenic in nature, with probably no major genes involved. The phenotype depends on the aggregated effect of a large number of SNPs, of which the individual effects are undetectable in a modestly powered GWAS. We estimated that 22% of the variance in our data set can be explained by the collective effect of all genotyped SNPs. A score analysis showed a modest enrichment in causative SNPs among the SNPs with a P-value below 0.01. PMID:24939585

  3. Genome-wide analyses identify common variants associated with macular telangiectasia type 2.

    PubMed

    Scerri, Thomas S; Quaglieri, Anna; Cai, Carolyn; Zernant, Jana; Matsunami, Nori; Baird, Lisa; Scheppke, Lea; Bonelli, Roberto; Yannuzzi, Lawrence A; Friedlander, Martin; Egan, Catherine A; Fruttiger, Marcus; Leppert, Mark; Allikmets, Rando; Bahlo, Melanie

    2017-02-27

    Idiopathic juxtafoveal retinal telangiectasis type 2 (macular telangiectasia type 2; MacTel) is a rare neurovascular degenerative retinal disease. To identify genetic susceptibility loci for MacTel, we performed a genome-wide association study (GWAS) with 476 cases and 1,733 controls of European ancestry. Genome-wide significant associations (P < 5 × 10(-8)) were identified at three independent loci (rs73171800 at 5q14.3, P = 7.74 × 10(-17); rs715 at 2q34, P = 9.97 × 10(-14); rs477992 at 1p12, P = 2.60 × 10(-12)) and then replicated (P < 0.01) in an independent cohort of 172 cases and 1,134 controls. The 5q14.3 locus is known to associate with variation in retinal vascular diameter, and the 2q34 and 1p12 loci have been implicated in the glycine/serine metabolic pathway. We subsequently found significant differences in blood serum levels of glycine (P = 4.04 × 10(-6)) and serine (P = 2.48 × 10(-4)) between MacTel cases and controls.

  4. A Genome-Wide CRISPR Library for High-Throughput Genetic Screening in Drosophila Cells

    PubMed Central

    Bassett, Andrew R.; Kong, Lesheng; Liu, Ji-Long

    2015-01-01

    The simplicity of the CRISPR/Cas9 system of genome engineering has opened up the possibility of performing genome-wide targeted mutagenesis in cell lines, enabling screening for cellular phenotypes resulting from genetic aberrations. Drosophila cells have proven to be highly effective in identifying genes involved in cellular processes through similar screens using partial knockdown by RNAi. This is in part due to the lower degree of redundancy between genes in this organism, whilst still maintaining highly conserved gene networks and orthologs of many human disease-causing genes. The ability of CRISPR to generate genetic loss of function mutations not only increases the magnitude of any effect over currently employed RNAi techniques, but allows analysis over longer periods of time which can be critical for certain phenotypes. In this study, we have designed and built a genome-wide CRISPR library covering 13,501 genes, among which 8989 genes are targeted by three or more independent single guide RNAs (sgRNAs). Moreover, we describe strategies to monitor the population of guide RNAs by high throughput sequencing (HTS). We hope that this library will provide an invaluable resource for the community to screen loss of function mutations for cellular phenotypes, and as a source of guide RNA designs for future studies. PMID:26165496

  5. Conjunctival fibrosis and the innate barriers to Chlamydia trachomatis intracellular infection: a genome wide association study.

    PubMed

    Roberts, Chrissy h; Franklin, Christopher S; Makalo, Pateh; Joof, Hassan; Sarr, Isatou; Mahdi, Olaimatu S; Sillah, Ansumana; Bah, Momodou; Payne, Felicity; Jeffreys, Anna E; Bottomley, William; Natividad, Angels; Molina-Gonzalez, Sandra; Burr, Sarah E; Preston, Mark; Kwiatkowski, Dominic; Rockett, Kirk A; Clark, Taane G; Burton, Matthew J; Mabey, David C W; Bailey, Robin; Barroso, Inês; Holland, Martin J

    2015-11-30

    Chlamydia trachomatis causes both trachoma and sexually transmitted infections. These diseases have similar pathology and potentially similar genetic predisposing factors. We aimed to identify polymorphisms and pathways associated with pathological sequelae of ocular Chlamydia trachomatis infections in The Gambia. We report a discovery phase genome-wide association study (GWAS) of scarring trachoma (1090 cases, 1531 controls) that identified 27 SNPs with strong, but not genome-wide significant, association with disease (5 × 10(-6) > P > 5 × 10(-8)). The most strongly associated SNP (rs111513399, P = 5.38 × 10(-7)) fell within a gene (PREX2) with homology to factors known to facilitate chlamydial entry to the host cell. Pathway analysis of GWAS data was significantly enriched for mitotic cell cycle processes (P = 0.001), the immune response (P = 0.00001) and for multiple cell surface receptor signalling pathways. New analyses of published transcriptome data sets from Gambia, Tanzania and Ethiopia also revealed that the same cell cycle and immune response pathways were enriched at the transcriptional level in various disease states. Although unconfirmed, the data suggest that genetic associations with chlamydial scarring disease may be focussed on processes relating to the immune response, the host cell cycle and cell surface receptor signalling.

  6. Conjunctival fibrosis and the innate barriers to Chlamydia trachomatis intracellular infection: a genome wide association study

    PubMed Central

    Roberts, Chrissy h.; Franklin, Christopher S.; Makalo, Pateh; Joof, Hassan; Sarr, Isatou; Mahdi, Olaimatu S.; Sillah, Ansumana; Bah, Momodou; Payne, Felicity; Jeffreys, Anna E.; Bottomley, William; Natividad, Angels; Molina-Gonzalez, Sandra; Burr, Sarah E.; Preston, Mark; Kwiatkowski, Dominic; Rockett, Kirk A.; Clark, Taane G.; Burton, Matthew J.; Mabey, David C. W.; Bailey, Robin; Barroso, Inês; Holland, Martin J.

    2015-01-01

    Chlamydia trachomatis causes both trachoma and sexually transmitted infections. These diseases have similar pathology and potentially similar genetic predisposing factors. We aimed to identify polymorphisms and pathways associated with pathological sequelae of ocular Chlamydia trachomatis infections in The Gambia. We report a discovery phase genome-wide association study (GWAS) of scarring trachoma (1090 cases, 1531 controls) that identified 27 SNPs with strong, but not genome-wide significant, association with disease (5 × 10−6 > P > 5 × 10−8). The most strongly associated SNP (rs111513399, P = 5.38 × 10−7) fell within a gene (PREX2) with homology to factors known to facilitate chlamydial entry to the host cell. Pathway analysis of GWAS data was significantly enriched for mitotic cell cycle processes (P = 0.001), the immune response (P = 0.00001) and for multiple cell surface receptor signalling pathways. New analyses of published transcriptome data sets from Gambia, Tanzania and Ethiopia also revealed that the same cell cycle and immune response pathways were enriched at the transcriptional level in various disease states. Although unconfirmed, the data suggest that genetic associations with chlamydial scarring disease may be focussed on processes relating to the immune response, the host cell cycle and cell surface receptor signalling. PMID:26616738

  7. A powerful test of independent assortment that determines genome-wide significance quickly and accurately

    PubMed Central

    Stewart, W C L; Hager, V R

    2016-01-01

    In the analysis of DNA sequences on related individuals, most methods strive to incorporate as much information as possible, with little or no attention paid to the issue of statistical significance. For example, a modern workstation can easily handle the computations needed to perform a large-scale genome-wide inheritance-by-descent (IBD) scan, but accurate assessment of the significance of that scan is often hindered by inaccurate approximations and computationally intensive simulation. To address these issues, we developed gLOD—a test of co-segregation that, for large samples, models chromosome-specific IBD statistics as a collection of stationary Gaussian processes. With this simple model, the parametric bootstrap yields an accurate and rapid assessment of significance—the genome-wide corrected P-value. Furthermore, we show that (i) under the null hypothesis, the limiting distribution of the gLOD is the standard Gumbel distribution; (ii) our parametric bootstrap simulator is approximately 40 000 times faster than gene-dropping methods, and it is more powerful than methods that approximate the adjusted P-value; and, (iii) the gLOD has the same statistical power as the widely used maximum Kong and Cox LOD. Thus, our approach gives researchers the ability to determine quickly and accurately the significance of most large-scale IBD scans, which may contain multiple traits, thousands of families and tens of thousands of DNA sequences. PMID:27245422

  8. The complex genetics of gait speed: genome-wide meta-analysis approach

    PubMed Central

    Lunetta, Kathryn L.; Smith, Jennifer A.; Eicher, John D.; Vered, Rotem; Deelen, Joris; Arnold, Alice M.; Buchman, Aron S.; Tanaka, Toshiko; Faul, Jessica D.; Nethander, Maria; Fornage, Myriam; Adams, Hieab H.; Matteini, Amy M.; Callisaya, Michele L.; Smith, Albert V.; Yu, Lei; De Jager, Philip L.; Evans, Denis A.; Gudnason, Vilmundur; Hofman, Albert; Pattie, Alison; Corley, Janie; Launer, Lenore J.; Knopman, Davis S.; Parimi, Neeta; Turner, Stephen T.; Bandinelli, Stefania; Beekman, Marian; Gutman, Danielle; Sharvit, Lital; Mooijaart, Simon P.; Liewald, David C.; Houwing-Duistermaat, Jeanine J.; Ohlsson, Claes; Moed, Matthijs; Verlinden, Vincent J.; Mellström, Dan; van der Geest, Jos N.; Karlsson, Magnus; Hernandez, Dena; McWhirter, Rebekah; Liu, Yongmei; Thomson, Russell; Tranah, Gregory J.; Uitterlinden, Andre G.; Weir, David R.; Zhao, Wei; Starr, John M.; Johnson, Andrew D.; Ikram, M. Arfan; Bennett, David A.; Cummings, Steven R.; Deary, Ian J.; Harris, Tamara B.; Kardia, Sharon L. R.; Mosley, Thomas H.; Srikanth, Velandai K.; Windham, Beverly G.; Newman, Ann B.; Walston, Jeremy D.; Davies, Gail; Evans, Daniel S.; Slagboom, Eline P.; Ferrucci, Luigi; Kiel, Douglas P.; Murabito, Joanne M.; Atzmon, Gil

    2017-01-01

    Emerging evidence suggests that the basis for variation in late-life mobility is attributable, in part, to genetic factors, which may become increasingly important with age. Our objective was to systematically assess the contribution of genetic variation to gait speed in older individuals. We conducted a meta-analysis of gait speed GWASs in 31,478 older adults from 17 cohorts of the CHARGE consortium, and validated our results in 2,588 older adults from 4 independent studies. We followed our initial discoveries with network and eQTL analysis of candidate signals in tissues. The meta-analysis resulted in a list of 536 suggestive genome wide significant SNPs in or near 69 genes. Further interrogation with Pathway Analysis placed gait speed as a polygenic complex trait in five major networks. Subsequent eQTL analysis revealed several SNPs significantly associated with the expression of PRSS16, WDSUB1 and PTPRT, which in addition to the meta-analysis and pathway suggested that genetic effects on gait speed may occur through synaptic function and neuronal development pathways. No genome-wide significant signals for gait speed were identified from this moderately large sample of older adults, suggesting that more refined physical function phenotypes will be needed to identify the genetic basis of gait speed in aging. PMID:28077804

  9. A Genome-Wide Survey of Date Palm Cultivars Supports Two Major Subpopulations in Phoenix dactylifera.

    PubMed

    Mathew, Lisa S; Seidel, Michael A; George, Binu; Mathew, Sweety; Spannagl, Manuel; Haberer, Georg; Torres, Maria F; Al-Dous, Eman K; Al-Azwani, Eman K; Diboun, Ilhem; Krueger, Robert R; Mayer, Klaus F X; Mohamoud, Yasmin Ali; Suhre, Karsten; Malek, Joel A

    2015-05-08

    The date palm (Phoenix dactylifera L.) is one of the oldest cultivated trees and is intimately tied to the history of human civilization. There are hundreds of commercial cultivars with distinct fruit shapes, colors, and sizes growing mainly in arid lands from the west of North Africa to India. The origin of date palm domestication is still uncertain, and few studies have attempted to document genetic diversity across multiple regions. We conducted genotyping-by-sequencing on 70 female cultivar samples from across the date palm-growing regions, including four Phoenix species as the outgroup. Here, for the first time, we generate genome-wide genotyping data for 13,000-65,000 SNPs in a diverse set of date palm fruit and leaf samples. Our analysis provides the first genome-wide evidence confirming recent findings that the date palm cultivars segregate into two main regions of shared genetic background from North Africa and the Arabian Gulf. We identify genomic regions with high densities of geographically segregating SNPs and also observe higher levels of allele fixation on the recently described X-chromosome than on the autosomes. Our results fit a model with two centers of earliest cultivation including date palms autochthonous to North Africa. These results adjust our understanding of human agriculture history and will provide the foundation for more directed functional studies and a better understanding of genetic diversity in date palm.

  10. Genome-Wide Divergence in the West-African Malaria Vector Anopheles melas

    PubMed Central

    Deitz, Kevin C.; Athrey, Giridhar A.; Jawara, Musa; Overgaard, Hans J.; Matias, Abrahan; Slotman, Michel A.

    2016-01-01

    Anopheles melas is a member of the recently diverged An. gambiae species complex, a model for speciation studies, and is a locally important malaria vector along the West-African coast where it breeds in brackish water. A recent population genetic study of An. melas revealed species-level genetic differentiation between three population clusters. An. melas West extends from The Gambia to the village of Tiko, Cameroon. The other mainland cluster, An. melas South, extends from the southern Cameroonian village of Ipono to Angola. Bioko Island, Equatorial Guinea An. melas populations are genetically isolated from mainland populations. To examine how genetic differentiation between these An. melas forms is distributed across their genomes, we conducted a genome-wide analysis of genetic differentiation and selection using whole genome sequencing data of pooled individuals (Pool-seq) from a representative population of each cluster. The An. melas forms exhibit high levels of genetic differentiation throughout their genomes, including the presence of numerous fixed differences between clusters. Although the level of divergence between the clusters is on a par with that of other species within the An. gambiae complex, patterns of genome-wide divergence and diversity do not provide evidence for the presence of pre- and/or postmating isolating mechanisms in the form of speciation islands. These results are consistent with an allopatric divergence process with little or no introgression. PMID:27466271

  11. Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome

    PubMed Central

    Pedersen, Jakob Skou; Valen, Eivind; Velazquez, Amhed M. Vargas; Parker, Brian J.; Rasmussen, Morten; Lindgreen, Stinus; Lilje, Berit; Tobin, Desmond J.; Kelly, Theresa K.; Vang, Søren; Andersson, Robin; Jones, Peter A.; Hoover, Cindi A.; Tikhonov, Alexei; Prokhortchouk, Egor; Rubin, Edward M.; Sandelin, Albin; Gilbert, M. Thomas P.; Krogh, Anders; Willerslev, Eske; Orlando, Ludovic

    2014-01-01

    Epigenetic information is available from contemporary organisms, but is difficult to track back in evolutionary time. Here, we show that genome-wide epigenetic information can be gathered directly from next-generation sequence reads of DNA isolated from ancient remains. Using the genome sequence data generated from hair shafts of a 4000-yr-old Paleo-Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery of the expected signals at promoter regions, exon/intron boundaries, and CTCF sites. The top-scoring nucleosome calls revealed distinct DNA positioning biases, attesting to nucleotide-level accuracy. The ancient methylation levels exhibited high conservation over time, clustering closely with modern hair tissues. Using ancient methylation information, we estimated the age at death of the Saqqaq individual and illustrate how epigenetic information can be used to infer ancient gene expression. Similar epigenetic signatures were found in other fossil material, such as 110,000- to 130,000-yr-old bones, supporting the contention that ancient epigenomic information can be reconstructed from a deep past. Our findings lay the foundation for extracting epigenomic information from ancient samples, allowing shifts in epialleles to be tracked through evolutionary time, as well as providing an original window into modern epigenomics. PMID:24299735

  12. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    PubMed

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  13. [Analysis of population stratification using random SNPs in genome-wide association studies].

    PubMed

    Cao, Zong-Fu; Ma, Chuan-Xiang; Wang, Lei; Cai, Bin

    2010-09-01

    Since population genetic STRUCTURE can increase false-positive rate in genome-wide association studies (GWAS) for complex diseases, the effect of population stratification should be taken into account in GWAS. However, the effect of randomly selected SNPs in population stratification analysis is underdetermined. In this study, based on the genotype data generated on Genome-Wide Human SNP Array 6.0 from unrelated individuals of HapMap Phase2, we randomly selected SNPs that were evenly distributed across the whole-genome, and acquired Ancestry Informative Markers (AIMs) by the method of f value and allelic Fisher exact test. F-statistics and STRUCTURE analysis based on the select different sets of SNPs were used to evaluate the effect of distinguishing the populations from HapMap Phase3. We found that randomly selected SNPs that were evenly distributed across the whole-genome were able to be used to identify the population structure. This study further indicated that more than 3 000 randomly selected SNPs that were evenly distributed across the whole-genome were substituted for AIMs in population stratification analysis, when there were no available AIMs for spe-cific populations.

  14. Genome-wide association study of drought-related resistance traits in Aegilops tauschii

    PubMed Central

    Qin, Peng; Lin, Yu; Hu, Yaodong; Liu, Kun; Mao, Shuangshuang; Li, Zhanyi; Wang, Jirui; Liu, Yaxi; Wei, Yuming; Zheng, Youliang

    2016-01-01

    Abstract The D-genome progenitor of wheat (Triticum aestivum), Aegilops tauschii, possesses numerous genes for resistance to abiotic stresses, including drought. Therefore, information on the genetic architecture of A. tauschii can aid the development of drought-resistant wheat varieties. Here, we evaluated 13 traits in 373 A. tauschii accessions grown under normal and polyethylene glycol-simulated drought stress conditions and performed a genome-wide association study using 7,185 single nucleotide polymorphism (SNP) markers. We identified 208 and 28 SNPs associated with all traits using the general linear model and mixed linear model, respectively, while both models detected 25 significant SNPs with genome-wide distribution. Public database searches revealed several candidate/flanking genes related to drought resistance that were grouped into three categories according to the type of encoded protein (enzyme, storage protein, and drought-induced protein). This study provided essential information for SNPs and genes related to drought resistance in A. tauschii and wheat, and represents a foundation for breeding drought-resistant wheat cultivars using marker-assisted selection. PMID:27560650

  15. Genome-Wide Association in Tomato Reveals 44 Candidate Loci for Fruit Metabolic Traits1[W

    PubMed Central

    Sauvage, Christopher; Segura, Vincent; Bauchet, Guillaume; Stevens, Rebecca; Do, Phuc Thi; Nikoloski, Zoran; Fernie, Alisdair R.; Causse, Mathilde

    2014-01-01

    Genome-wide association studies have been successful in identifying genes involved in polygenic traits and are valuable for crop improvement. Tomato (Solanum lycopersicum) is a major crop and is highly appreciated worldwide for its health value. We used a core collection of 163 tomato accessions composed of S. lycopersicum, S. lycopersicum var cerasiforme, and Solanum pimpinellifolium to map loci controlling variation in fruit metabolites. Fruits were phenotyped for a broad range of metabolites, including amino acids, sugars, and ascorbate. In parallel, the accessions were genotyped with 5,995 single-nucleotide polymorphism markers spread over the whole genome. Genome-wide association analysis was conducted on a large set of metabolic traits that were stable over 2 years using a multilocus mixed model as a general method for mapping complex traits in structured populations and applied to tomato. We detected a total of 44 loci that were significantly associated with a total of 19 traits, including sucrose, ascorbate, malate, and citrate levels. These results not only provide a list of candidate loci to be functionally validated but also a powerful analytical approach for finding genetic variants that can be directly used for crop improvement and deciphering the genetic architecture of complex traits. PMID:24894148

  16. A Genome-Wide Association Study for Regulators of Micronucleus Formation in Mice

    PubMed Central

    McIntyre, Rebecca E.; Nicod, Jérôme; Robles-Espinoza, Carla Daniela; Maciejowski, John; Cai, Na; Hill, Jennifer; Verstraten, Ruth; Iyer, Vivek; Rust, Alistair G.; Balmus, Gabriel; Mott, Richard; Flint, Jonathan; Adams, David J.

    2016-01-01

    In mammals the regulation of genomic instability plays a key role in tumor suppression and also controls genome plasticity, which is important for recombination during the processes of immunity and meiosis. Most studies to identify regulators of genomic instability have been performed in cells in culture or in systems that report on gross rearrangements of the genome, yet subtle differences in the level of genomic instability can contribute to whole organism phenotypes such as tumor predisposition. Here we performed a genome-wide association study in a population of 1379 outbred Crl:CFW(SW)-US_P08 mice to dissect the genetic landscape of micronucleus formation, a biomarker of chromosomal breaks, whole chromosome loss, and extranuclear DNA. Variation in micronucleus levels is a complex trait with a genome-wide heritability of 53.1%. We identify seven loci influencing micronucleus formation (false discovery rate <5%), and define candidate genes at each locus. Intriguingly at several loci we find evidence for sexual dimorphism in micronucleus formation, with a locus on chromosome 11 being specific to males. PMID:27233670

  17. A Genome-Wide Association Study for Regulators of Micronucleus Formation in Mice.

    PubMed

    McIntyre, Rebecca E; Nicod, Jérôme; Robles-Espinoza, Carla Daniela; Maciejowski, John; Cai, Na; Hill, Jennifer; Verstraten, Ruth; Iyer, Vivek; Rust, Alistair G; Balmus, Gabriel; Mott, Richard; Flint, Jonathan; Adams, David J

    2016-08-09

    In mammals the regulation of genomic instability plays a key role in tumor suppression and also controls genome plasticity, which is important for recombination during the processes of immunity and meiosis. Most studies to identify regulators of genomic instability have been performed in cells in culture or in systems that report on gross rearrangements of the genome, yet subtle differences in the level of genomic instability can contribute to whole organism phenotypes such as tumor predisposition. Here we performed a genome-wide association study in a population of 1379 outbred Crl:CFW(SW)-US_P08 mice to dissect the genetic landscape of micronucleus formation, a biomarker of chromosomal breaks, whole chromosome loss, and extranuclear DNA. Variation in micronucleus levels is a complex trait with a genome-wide heritability of 53.1%. We identify seven loci influencing micronucleus formation (false discovery rate <5%), and define candidate genes at each locus. Intriguingly at several loci we find evidence for sexual dimorphism in micronucleus formation, with a locus on chromosome 11 being specific to males.

  18. A genome-wide CRISPR library for high-throughput genetic screening in Drosophila cells.

    PubMed

    Bassett, Andrew R; Kong, Lesheng; Liu, Ji-Long

    2015-06-20

    The simplicity of the CRISPR/Cas9 system of genome engineering has opened up the possibility of performing genome-wide targeted mutagenesis in cell lines, enabling screening for cellular phenotypes resulting from genetic aberrations. Drosophila cells have proven to be highly effective in identifying genes involved in cellular processes through similar screens using partial knockdown by RNAi. This is in part due to the lower degree of redundancy between genes in this organism, whilst still maintaining highly conserved gene networks and orthologs of many human disease-causing genes. The ability of CRISPR to generate genetic loss of function mutations not only increases the magnitude of any effect over currently employed RNAi techniques, but allows analysis over longer periods of time which can be critical for certain phenotypes. In this study, we have designed and built a genome-wide CRISPR library covering 13,501 genes, among which 8989 genes are targeted by three or more independent single guide RNAs (sgRNAs). Moreover, we describe strategies to monitor the population of guide RNAs by high throughput sequencing (HTS). We hope that this library will provide an invaluable resource for the community to screen loss of function mutations for cellular phenotypes, and as a source of guide RNA designs for future studies.

  19. Translation elicits a growth rate-dependent, genome-wide, differential protein production in Bacillus subtilis.

    PubMed

    Borkowski, Olivier; Goelzer, Anne; Schaffer, Marc; Calabre, Magali; Mäder, Ulrike; Aymerich, Stéphane; Jules, Matthieu; Fromion, Vincent

    2016-05-17

    Complex regulatory programs control cell adaptation to environmental changes by setting condition-specific proteomes. In balanced growth, bacterial protein abundances depend on the dilution rate, transcript abundances and transcript-specific translation efficiencies. We revisited the current theory claiming the invariance of bacterial translation efficiency. By integrating genome-wide transcriptome datasets and datasets from a library of synthetic gfp-reporter fusions, we demonstrated that translation efficiencies in Bacillus subtilis decreased up to fourfold from slow to fast growth. The translation initiation regions elicited a growth rate-dependent, differential production of proteins without regulators, hence revealing a unique, hard-coded, growth rate-dependent mode of regulation. We combined model-based data analyses of transcript and protein abundances genome-wide and revealed that this global regulation is extensively used in B. subtilis We eventually developed a knowledge-based, three-step translation initiation model, experimentally challenged the model predictions and proposed that a growth rate-dependent drop in free ribosome abundance accounted for the differential protein production.

  20. Genome-Wide Motif Statistics are Shaped by DNA Binding Proteins over Evolutionary Time Scales

    NASA Astrophysics Data System (ADS)

    Qian, Long; Kussell, Edo

    2016-10-01

    The composition of a genome with respect to all possible short DNA motifs impacts the ability of DNA binding proteins to locate and bind their target sites. Since nonfunctional DNA binding can be detrimental to cellular functions and ultimately to organismal fitness, organisms could benefit from reducing the number of nonfunctional DNA binding sites genome wide. Using in vitro measurements of binding affinities for a large collection of DNA binding proteins, in multiple species, we detect a significant global avoidance of weak binding sites in genomes. We demonstrate that the underlying evolutionary process leaves a distinct genomic hallmark in that similar words have correlated frequencies, a signal that we detect in all species across domains of life. We consider the possibility that natural selection against weak binding sites contributes to this process, and using an evolutionary model we show that the strength of selection needed to maintain global word compositions is on the order of point mutation rates. Likewise, we show that evolutionary mechanisms based on interference of protein-DNA binding with replication and mutational repair processes could yield similar results and operate with similar rates. On the basis of these modeling and bioinformatic results, we conclude that genome-wide word compositions have been molded by DNA binding proteins acting through tiny evolutionary steps over time scales spanning millions of generations.

  1. Genome-wide analysis of alternative splicing during human heart development

    NASA Astrophysics Data System (ADS)

    Wang, He; Chen, Yanmei; Li, Xinzhong; Chen, Guojun; Zhong, Lintao; Chen, Gangbing; Liao, Yulin; Liao, Wangjun; Bin, Jianping

    2016-10-01

    Alternative splicing (AS) drives determinative changes during mouse heart development. Recent high-throughput technological advancements have facilitated genome-wide AS, while its analysis in human foetal heart transition to the adult stage has not been reported. Here, we present a high-resolution global analysis of AS transitions between human foetal and adult hearts. RNA-sequencing data showed extensive AS transitions occurred between human foetal and adult hearts, and AS events occurred more frequently in protein-coding genes than in long non-coding RNA (lncRNA). A significant difference of AS patterns was found between foetal and adult hearts. The predicted difference in AS events was further confirmed using quantitative reverse transcription-polymerase chain reaction analysis of human heart samples. Functional foetal-specific AS event analysis showed enrichment associated with cell proliferation-related pathways including cell cycle, whereas adult-specific AS events were associated with protein synthesis. Furthermore, 42.6% of foetal-specific AS events showed significant changes in gene expression levels between foetal and adult hearts. Genes exhibiting both foetal-specific AS and differential expression were highly enriched in cell cycle-associated functions. In conclusion, we provided a genome-wide profiling of AS transitions between foetal and adult hearts and proposed that AS transitions and deferential gene expression may play determinative roles in human heart development.

  2. Heavy metals induce oxidative stress and genome-wide modulation in transcriptome of rice root.

    PubMed

    Dubey, Sonali; Shri, Manju; Misra, Prashant; Lakhwani, Deepika; Bag, Sumit Kumar; Asif, Mehar H; Trivedi, Prabodh Kumar; Tripathi, Rudro Deo; Chakrabarty, Debasis

    2014-06-01

    Industrial growth, ecological disturbances and agricultural practices have contaminated the soil and water with many harmful compounds, including heavy metals. These heavy metals affect growth and development of plants as well as cause severe human health hazards through food chain contamination. In past, studies have been made to identify biochemical and molecular networks associated with heavy metal toxicity and uptake in plants. Studies suggested that most of the physiological and molecular processes affected by different heavy metals are similar to those affected by other abiotic stresses. To identify common and unique responses by different metals, we have studied biochemical and genome-wide modulation in transcriptome of rice (IR-64 cultivar) root after exposure to cadmium (Cd), arsenate [As(V)], lead (Pb) and chromium [Cr(VI)] in hydroponic condition. We observed that root tissue shows variable responses for antioxidant enzyme system for different heavy metals. Genome-wide expression analysis suggests variable number of genes differentially expressed in root in response to As(V), Cd, Pb and Cr(VI) stresses. In addition to unique genes, each heavy metal modulated expression of a large number of common genes. Study also identified cis-acting regions of the promoters which can be determinants for the modulated expression of the genes in response to different heavy metals. Our study advances understanding related to various processes and networks which might be responsible for heavy metal stresses, accumulation and detoxification.

  3. Genome-wide association scan suggests basis for microtia in Awassi sheep.

    PubMed

    Jawasreh, K; Boettcher, P J; Stella, A

    2016-08-01

    Hereditary underdevelopment of the ear, a condition also known as microtia, has been observed in several sheep breeds as well as in humans and other species. Its genetic basis in sheep is unknown. The Awassi sheep, a breed native to southwest Asia, carries this phenotype and was targeted for molecular characterization via a genome-wide association study. DNA samples were collected from sheep in Jordan. Eight affected and 12 normal individuals were genotyped with the Illumina OvineSNP50(®) chip. Multilocus analyses failed to identify any genotypic association. In contrast, a single-locus analysis revealed a statistically significant association (P = 0.012, genome-wide) with a SNP at basepair 34 647 499 on OAR23. This marker is adjacent to the gene encoding transcription factor GATA-6, which has been shown to play a role in many developmental processes, including chondrogenesis. The lack of extended homozygosity in this region suggests a fairly ancient mutation, and the time of occurrence was estimated to be approximately 3000 years ago. Many of the earless sheep breeds may thus share the causative mutation, especially within the subgroup of fat-tailed, wool sheep.

  4. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-02-09

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.

  5. Bortezomib-induced peripheral neuropathy: A genome-wide association study on multiple myeloma patients.

    PubMed

    Campo, Chiara; da Silva Filho, Miguel Inacio; Weinhold, Niels; Mahmoudpour, Seyed Hamidreza; Goldschmidt, Hartmut; Hemminki, Kari; Merz, Maximilian; Försti, Asta

    2017-03-20

    The proteasome-inhibitor bortezomib was introduced into the treatment of multiple myeloma more than a decade ago. It is clinically beneficial, but peripheral neuropathy (PNP) is a side effect that may limit its use in some patients. To examine the possible genetic predisposing factors to PNP, we performed a genome-wide association study on 646 bortezomib-treated German multiple myeloma patients. Our aim was to identify genetic risk variants associated with the development of PNP as a serious side effect of the treatment. We identified 4 new promising loci for bortezomib-induced PNP at 4q34.3 (rs6552496), 5q14.1 (rs12521798), 16q23.3 (rs8060632), and 18q21.2 (rs17748074). Even though the results did not reach genome-wide significance level, they support the idea of previous studies, suggesting a genetic basis for neurotoxicity. The identified single nucleotide polymorphisms map to genes or next to genes involved in the development and function of the nervous system (CDH13, DCC, and TENM3). As possible functional clues, 2 of the variants, rs12521798 and rs17748074, affect enhancer histone marks in the brain. The rs12521798 may also impact expression of THBS4, which affects specific signal trasduction pathways in the nervous system. Further research is needed to clarify the mechanism of action of the identified single nucleotide polymorphisms in the development of drug-induced PNP and to functionally validate our in silico predictions.

  6. A genome-wide association study of gestational diabetes mellitus in Korean women.

    PubMed

    Kwak, Soo Heon; Kim, Sung-Hoon; Cho, Young Min; Go, Min Jin; Cho, Yoon Shin; Choi, Sung Hee; Moon, Min Kyong; Jung, Hye Seung; Shin, Hyoung Doo; Kang, Hyun Min; Cho, Nam H; Lee, In Kyu; Kim, Seong Yeon; Han, Bok-Ghee; Jang, Hak C; Park, Kyong Soo

    2012-02-01

    Knowledge regarding the genetic risk loci for gestational diabetes mellitus (GDM) is still limited. In this study, we performed a two-stage genome-wide association analysis in Korean women. In the stage 1 genome scan, 468 women with GDM and 1,242 nondiabetic control women were compared using 2.19 million genotyped or imputed markers. We selected 11 loci for further genotyping in stage 2 samples of 931 case and 783 control subjects. The joint effect of stage 1 plus stage 2 studies was analyzed by meta-analysis. We also investigated the effect of known type 2 diabetes variants in GDM. Two loci known to be associated with type 2 diabetes had a genome-wide significant association with GDM in the joint analysis. rs7754840, a variant in CDKAL1, had the strongest association with GDM (odds ratio 1.518; P=6.65×10(-16)). A variant near MTNR1B, rs10830962, was also significantly associated with the risk of GDM (1.454; P=2.49×10(-13)). We found that there is an excess of association between known type 2 diabetes variants and GDM above what is expected under the null hypothesis. In conclusion, we have confirmed that genetic variants in CDKAL1 and near MTNR1B are strongly associated with GDM in Korean women. There seems to be a shared genetic basis between GDM and type 2 diabetes.

  7. A genome wide survey of SNP variation reveals the genetic structure of sheep breeds.

    PubMed

    Kijas, James W; Townley, David; Dalrymple, Brian P; Heaton, Michael P; Maddox, Jillian F; McGrath, Annette; Wilson, Peter; Ingersoll, Roxann G; McCulloch, Russell; McWilliam, Sean; Tang, Dave; McEwan, John; Cockett, Noelle; Oddy, V Hutton; Nicholas, Frank W; Raadsma, Herman

    2009-01-01

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identifying the first genome-wide set of SNP for sheep, we report on levels of genetic variability both within and between a diverse sample of ovine populations. Then, using cluster analysis and the partitioning of genetic variation, we demonstrate sheep are characterised by weak phylogeographic structure, overlapping genetic similarity and generally low differentiation which is consistent with their short evolutionary history. The degree of population substructure was, however, sufficient to cluster individuals based on geographic origin and known breed history. Specifically, African and Asian populations clustered separately from breeds of European origin sampled from Australia, New Zealand, Europe and North America. Furthermore, we demonstrate the presence of stratification within some, but not all, ovine breeds. The results emphasize that careful documentation of genetic structure will be an essential prerequisite when mapping the genetic basis of complex traits. Furthermore, the identification of a subset of SNP able to assign individuals into broad groupings demonstrates even a small panel of markers may be suitable for applications such as traceability.

  8. Selecting instruments for Mendelian randomization in the wake of genome-wide association studies

    PubMed Central

    Swerdlow, Daniel I; Kuchenbaecker, Karoline B; Shah, Sonia; Sofat, Reecha; Holmes, Michael V; White, Jon; Mindell, Jennifer S; Kivimaki, Mika; Brunner, Eric J; Whittaker, John C; Casas, Juan P; Hingorani, Aroon D

    2016-01-01

    Mendelian randomization (MR) studies typically assess the pathogenic relevance of environmental exposures or disease biomarkers, using genetic variants that instrument these exposures. The approach is gaining popularity—our systematic review reveals a greater than 10-fold increase in MR studies published between 2004 and 2015. When the MR paradigm was first proposed, few biomarker- or exposure-related genetic variants were known, most having been identified by candidate gene studies. However, genome-wide association studies (GWAS) are now providing a rich source of potential instruments for MR analysis. Many early reviews covering the concept, applications and analytical aspects of the MR technique preceded the surge in GWAS, and thus the question of how best to select instruments for MR studies from the now extensive pool of available variants has received insufficient attention. Here we focus on the most common category of MR studies—those concerning disease biomarkers. We consider how the selection of instruments for MR analysis from GWAS requires consideration of: the assumptions underlying the MR approach; the biology of the biomarker; the genome-wide distribution, frequency and effect size of biomarker-associated variants (the genetic architecture); and the specificity of the genetic associations. Based on this, we develop guidance that may help investigators to plan and readers interpret MR studies. PMID:27342221

  9. Genome-Wide Analysis Identifies Germ-Line Risk Factors Associated with Canine Mammary Tumours

    PubMed Central

    Melin, Malin; Murén, Eva; Gustafson, Ulla; Starkey, Mike; Borge, Kaja Sverdrup; Lingaas, Frode; Saellström, Sara; Rönnberg, Henrik; Lindblad-Toh, Kerstin

    2016-01-01

    Canine mammary tumours (CMT) are the most common neoplasia in unspayed female dogs. CMTs are suitable naturally occurring models for human breast cancer and share many characteristics, indicating that the genetic causes could also be shared. We have performed a genome-wide association study (GWAS) in English Springer Spaniel dogs and identified a genome-wide significant locus on chromosome 11 (praw = 5.6x10-7, pperm = 0.019). The most associated haplotype spans a 446 kb region overlapping the CDK5RAP2 gene. The CDK5RAP2 protein has a function in cell cycle regulation and could potentially have an impact on response to chemotherapy treatment. Two additional loci, both on chromosome 27, were nominally associated (praw = 1.97x10-5 and praw = 8.30x10-6). The three loci explain 28.1±10.0% of the phenotypic variation seen in the cohort, whereas the top ten associated regions account for 38.2±10.8% of the risk. Furthermore, the ten GWAS loci and regions with reduced genetic variability are significantly enriched for snoRNAs and tumour-associated antigen genes, suggesting a role for these genes in CMT development. We have identified several candidate genes associated with canine mammary tumours, including CDK5RAP2. Our findings enable further comparative studies to investigate the genes and pathways in human breast cancer patients. PMID:27158822

  10. Genome-wide association for sensitivity to chronic oxidative stress in Drosophila melanogaster.

    PubMed

    Jordan, Katherine W; Craver, Kyle L; Magwire, Michael M; Cubilla, Carmen E; Mackay, Trudy F C; Anholt, Robert R H

    2012-01-01

    Reactive oxygen species (ROS) are a common byproduct of mitochondrial energy metabolism, and can also be induced by exogenous sources, including UV light, radiation, and environmental toxins. ROS generation is essential for maintaining homeostasis by triggering cellular signaling pathways and host defense mechanisms. However, an imbalance of ROS induces oxidative stress and cellular death and is associated with human disease, including age-related locomotor impairment. To identify genes affecting sensitivity and resistance to ROS-induced locomotor decline, we assessed locomotion of aged flies of the sequenced, wild-derived lines from the Drosophila melanogaster Genetics Reference Panel on standard medium and following chronic exposure to medium supplemented with 3 mM menadione sodium bisulfite (MSB). We found substantial genetic variation in sensitivity to oxidative stress with respect to locomotor phenotypes. We performed genome-wide association analyses to identify candidate genes associated with variation in sensitivity to ROS-induced decline in locomotor performance, and confirmed the effects for 13 of 16 mutations tested in these candidate genes. Candidate genes associated with variation in sensitivity to MSB-induced oxidative stress form networks of genes involved in neural development, immunity, and signal transduction. Many of these genes have human orthologs, highlighting the utility of genome-wide association in Drosophila for studying complex human disease.

  11. Mapping the sensory perception of apple using descriptive sensory evaluation in a genome wide association study

    PubMed Central

    Amyotte, Beatrice; Bowen, Amy J.; Banks, Travis; Rajcan, Istvan; Somers, Daryl J.

    2017-01-01

    Breeding apples is a long-term endeavour and it is imperative that new cultivars are selected to have outstanding consumer appeal. This study has taken the approach of merging sensory science with genome wide association analyses in order to map the human perception of apple flavour and texture onto the apple genome. The goal was to identify genomic associations that could be used in breeding apples for improved fruit quality. A collection of 85 apple cultivars was examined over two years through descriptive sensory evaluation by a trained sensory panel. The trained sensory panel scored randomized sliced samples of each apple cultivar for seventeen taste, flavour and texture attributes using controlled sensory evaluation practices. In addition, the apple collection was subjected to genotyping by sequencing for marker discovery. A genome wide association analysis suggested significant genomic associations for several sensory traits including juiciness, crispness, mealiness and fresh green apple flavour. The findings include previously unreported genomic regions that could be used in apple breeding and suggest that similar sensory association mapping methods could be applied in other plants. PMID:28231290

  12. HITS-CLIP yields genome-wide insights into brain alternative RNA processing

    NASA Astrophysics Data System (ADS)

    Licatalosi, Donny D.; Mele, Aldo; Fak, John J.; Ule, Jernej; Kayikci, Melis; Chi, Sung Wook; Clark, Tyson A.; Schweitzer, Anthony C.; Blume, John E.; Wang, Xuning; Darnell, Jennifer C.; Darnell, Robert B.

    2008-11-01

    Protein-RNA interactions have critical roles in all aspects of gene expression. However, applying biochemical methods to understand such interactions in living tissues has been challenging. Here we develop a genome-wide means of mapping protein-RNA binding sites in vivo, by high-throughput sequencing of RNA isolated by crosslinking immunoprecipitation (HITS-CLIP). HITS-CLIP analysis of the neuron-specific splicing factor Nova revealed extremely reproducible RNA-binding maps in multiple mouse brains. These maps provide genome-wide in vivo biochemical footprints confirming the previous prediction that the position of Nova binding determines the outcome of alternative splicing; moreover, they are sufficiently powerful to predict Nova action de novo. HITS-CLIP revealed a large number of Nova-RNA interactions in 3' untranslated regions, leading to the discovery that Nova regulates alternative polyadenylation in the brain. HITS-CLIP, therefore, provides a robust, unbiased means to identify functional protein-RNA interactions in vivo.

  13. Genome wide features, distribution and correlations of NF-Y binding sites.

    PubMed

    Zambelli, Federico; Pavesi, Giulio

    2016-10-18

    NF-Y is a trimeric transcription factor that binds on DNA the CCAAT-box motif. In this article we reviewed and complemented with additional bioinformatic analysis existing data on genome-wide NF-Y binding characterization in human, reaching the following main conclusions: (1) about half of NF-Y binding sites are located at promoters, about 60-80 base pairs from transcription start sites; NF-Y binding to distal genomic regions takes place at inactive chromatin loci and/or DNA repetitive elements more often than active enhancers; (2) on almost half of its binding sites, regardless of their genomic localization (promoters or distal regions), NF-Y finds on DNA more than one CCAAT-box, and most of those multiple CCAAT binding loci present precise spacing and organization of the elements composing them; (3) there exists a well defined class of transcription factors that show genome-wide co-localization with NF-Y. Some of them lack their canonical binding site in binding regions overlapping with NF-Y, hence hinting at NF-Y mediated recruitment, while others show a precise positioning on DNA of their binding sites with respect to the CCAAT box bound by NF-Y. This article is part of a Special Issue entitled: Nuclear Factor Y in Development and Disease, edited by Prof. Roberto Mantovani.

  14. Genome-wide association study identifies candidate markers for bull fertility in Holstein dairy cattle.

    PubMed

    Peñagaricano, F; Weigel, K A; Khatib, H

    2012-07-01

    The decline in the reproductive efficiency of dairy cattle has become a challenging problem worldwide. Female fertility is now taken into account in breeding goals while generally less attention is given to male fertility. The objective of this study was to perform a genome-wide association study in Holstein bulls to identify genetic variants significantly related to sire conception rate (SCR), a new phenotypic evaluation of bull fertility. The analysis included 1755 sires with SCR data and 38,650 single nucleotide polymorphisms (SNPs) spanning the entire bovine genome. Associations between SNPs and SCR were analyzed using a mixed linear model that included a random polygenic effect and SNP genotype either as a linear covariate or as a categorical variable. A multiple testing correction approach was used to account for the correlation between SNPs because of linkage disequilibrium. After genome-wide correction, eight SNPs showed significant association with SCR. Some of these SNPs are located close to or in the middle of genes with functions related to male fertility, such as the sperm acrosome reaction, chromatin remodeling during the spermatogenesis, and the meiotic process during male germ cell maturation. Some SNPs showed marked dominance effects, which provide more evidence for the relevance of non-additive effects in traits closely related to fitness such as fertility. The results could contribute to the identification of genes and pathways associated with male fertility in dairy cattle.

  15. Genome-Wide Association Study of Down Syndrome-Associated Atrioventricular Septal Defects.

    PubMed

    Ramachandran, Dhanya; Zeng, Zhen; Locke, Adam E; Mulle, Jennifer G; Bean, Lora J H; Rosser, Tracie C; Dooley, Kenneth J; Cua, Clifford L; Capone, George T; Reeves, Roger H; Maslen, Cheryl L; Cutler, David J; Feingold, Eleanor; Sherman, Stephanie L; Zwick, Michael E

    2015-07-20

    The goal of this study was to identify the contribution of common genetic variants to Down syndrome-associated atrioventricular septal defect, a severe heart abnormality. Compared with the euploid population, infants with Down syndrome, or trisomy 21, have a 2000-fold increased risk of presenting with atrioventricular septal defects. The cause of this increased risk remains elusive. Here we present data from the largest heart study conducted to date on a trisomic background by using a carefully characterized collection of individuals from extreme ends of the phenotypic spectrum. We performed a genome-wide association study using logistic regression analysis on 452 individuals with Down syndrome, consisting of 210 cases with complete atrioventricular septal defects and 242 controls with structurally normal hearts. No individual variant achieved genome-wide significance. We identified four disomic regions (1p36.3, 5p15.31, 8q22.3, and 17q22) and two trisomic regions on chromosome 21 (around PDXK and KCNJ6 genes) that merit further investigation in large replication studies. Our data show that a few common genetic variants of large effect size (odds ratio >2.0) do not account for the elevated risk of Down syndrome-associated atrioventricular septal defects. Instead, multiple variants of low-to-moderate effect sizes may contribute to this elevated risk, highlighting the complex genetic architecture of atrioventricular septal defects even in the highly susceptible Down syndrome population.

  16. Genome-wide association studies for fatty acid metabolic traits in five divergent pig populations

    PubMed Central

    Zhang, Wanchang; Bin Yang; Zhang, Junjie; Cui, Leilei; Ma, Junwu; Chen, Congying; Ai, Huashui; Xiao, Shijun; Ren, Jun; Huang, Lusheng

    2016-01-01

    Fatty acid composition profiles are important indicators of meat quality and tasting flavor. Metabolic indices of fatty acids are more authentic to reflect meat nutrition and public acceptance. To investigate the genetic mechanism of fatty acid metabolic indices in pork, we conducted genome-wide association studies (GWAS) for 33 fatty acid metabolic traits in five pig populations. We identified a total of 865 single nucleotide polymorphisms (SNPs), corresponding to 11 genome-wide significant loci on nine chromosomes and 12 suggestive loci on nine chromosomes. Our findings not only confirmed seven previously reported QTL with stronger association strength, but also revealed four novel population-specific loci, showing that investigations on intermediate phenotypes like the metabolic traits of fatty acids can increase the statistical power of GWAS for end-point phenotypes. We proposed a list of candidate genes at the identified loci, including three novel genes (FADS2, SREBF1 and PLA2G7). Further, we constructed the functional networks involving these candidate genes and deduced the potential fatty acid metabolic pathway. These findings advance our understanding of the genetic basis of fatty acid composition in pigs. The results from European hybrid commercial pigs can be immediately transited into breeding practice for beneficial fatty acid composition. PMID:27097669

  17. A genome-wide association study identifies multiple loci for variation in human ear morphology.

    PubMed

    Adhikari, Kaustubh; Reales, Guillermo; Smith, Andrew J P; Konka, Esra; Palmen, Jutta; Quinto-Sanchez, Mirsha; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Fuentes, Macarena; Pizarro, María; Barquera Lozano, Rodrigo; Macín Pérez, Gastón; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M; Bortolini, Maria-Cátira; Canizales-Quinteros, Samuel; Rothhammer, Francisco; Bedoya, Gabriel; Calderón, Rosario; Rosique, Javier; Cheeseman, Michael; Bhutta, Mahmood F; Humphries, Steve E; Gonzalez-José, Rolando; Headon, Denis; Balding, David; Ruiz-Linares, Andrés

    2015-06-24

    Here we report a genome-wide association study for non-pathological pinna morphology in over 5,000 Latin Americans. We find genome-wide significant association at seven genomic regions affecting: lobe size and attachment, folding of antihelix, helix rolling, ear protrusion and antitragus size (linear regression P values 2 × 10(-8) to 3 × 10(-14)). Four traits are associated with a functional variant in the Ectodysplasin A receptor (EDAR) gene, a key regulator of embryonic skin appendage development. We confirm expression of Edar in the developing mouse ear and that Edar-deficient mice have an abnormally shaped pinna. Two traits are associated with SNPs in a region overlapping the T-Box Protein 15 (TBX15) gene, a major determinant of mouse skeletal development. Strongest association in this region is observed for SNP rs17023457 located in an evolutionarily conserved binding site for the transcription factor Cartilage paired-class homeoprotein 1 (CART1), and we confirm that rs17023457 alters in vitro binding of CART1.

  18. A genome-wide association study identifies multiple loci for variation in human ear morphology

    PubMed Central

    Adhikari, Kaustubh; Reales, Guillermo; Smith, Andrew J. P.; Konka, Esra; Palmen, Jutta; Quinto-Sanchez, Mirsha; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Fuentes, Macarena; Pizarro, María; Barquera Lozano, Rodrigo; Macín Pérez, Gastón; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C.; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M.; Bortolini, Maria- Cátira; Canizales-Quinteros, Samuel; Rothhammer, Francisco; Bedoya, Gabriel; Calderón, Rosario; Rosique, Javier; Cheeseman, Michael; Bhutta, Mahmood F.; Humphries, Steve E.; Gonzalez-José, Rolando; Headon, Denis; Balding, David; Ruiz-Linares, Andrés

    2015-01-01

    Here we report a genome-wide association study for non-pathological pinna morphology in over 5,000 Latin Americans. We find genome-wide significant association at seven genomic regions affecting: lobe size and attachment, folding of antihelix, helix rolling, ear protrusion and antitragus size (linear regression P values 2 × 10−8 to 3 × 10−14). Four traits are associated with a functional variant in the Ectodysplasin A receptor (EDAR) gene, a key regulator of embryonic skin appendage development. We confirm expression of Edar in the developing mouse ear and that Edar-deficient mice have an abnormally shaped pinna. Two traits are associated with SNPs in a region overlapping the T-Box Protein 15 (TBX15) gene, a major determinant of mouse skeletal development. Strongest association in this region is observed for SNP rs17023457 located in an evolutionarily conserved binding site for the transcription factor Cartilage paired-class homeoprotein 1 (CART1), and we confirm that rs17023457 alters in vitro binding of CART1. PMID:26105758

  19. Estimating genome-wide heterozygosity: effects of demographic history and marker type

    PubMed Central

    Miller, J M; Malenfant, R M; David, P; Davis, C S; Poissant, J; Hogg, J T; Festa-Bianchet, M; Coltman, D W

    2014-01-01

    Heterozygosity–fitness correlations (HFCs) are often used to link individual genetic variation to differences in fitness. However, most studies examining HFCs find weak or no correlations. Here, we derive broad theoretical predictions about how many loci are needed to adequately measure genomic heterozygosity assuming different levels of identity disequilibrium (ID), a proxy for inbreeding. We then evaluate the expected ability to detect HFCs using an empirical data set of 200 microsatellites and 412 single nucleotide polymorphisms (SNPs) genotyped in two populations of bighorn sheep (Ovis canadensis), with different demographic histories. In both populations, heterozygosity was significantly correlated across marker types, although the strength of the correlation was weaker in a native population compared with one founded via translocation and later supplemented with additional individuals. Despite being bi-allelic, SNPs had similar correlations to genome-wide heterozygosity as microsatellites in both populations. For both marker types, this association became stronger and less variable as more markers were considered. Both populations had significant levels of ID; however, estimates were an order of magnitude lower in the native population. As with heterozygosity, SNPs performed similarly to microsatellites, and precision and accuracy of the estimates of ID increased as more loci were considered. Although dependent on the demographic history of the population considered, these results illustrate that genome-wide heterozygosity, and therefore HFCs, are best measured by a large number of markers, a feat now more realistically accomplished with SNPs than microsatellites. PMID:24149650

  20. Novel loci associated with usual sleep duration: the CHARGE Consortium Genome-Wide Association Study.

    PubMed

    Gottlieb, D J; Hek, K; Chen, T-H; Watson, N F; Eiriksdottir, G; Byrne, E M; Cornelis, M; Warby, S C; Bandinelli, S; Cherkas, L; Evans, D S; Grabe, H J; Lahti, J; Li, M; Lehtimäki, T; Lumley, T; Marciante, K D; Pérusse, L; Psaty, B M; Robbins, J; Tranah, G J; Vink, J M; Wilk, J B; Stafford, J M; Bellis, C; Biffar, R; Bouchard, C; Cade, B; Curhan, G C; Eriksson, J G; Ewert, R; Ferrucci, L; Fülöp, T; Gehrman, P R; Goodloe, R; Harris, T B; Heath, A C; Hernandez, D; Hofman, A; Hottenga, J-J; Hunter, D J; Jensen, M K; Johnson, A D; Kähönen, M; Kao, L; Kraft, P; Larkin, E K; Lauderdale, D S; Luik, A I; Medici, M; Montgomery, G W; Palotie, A; Patel, S R; Pistis, G; Porcu, E; Quaye, L; Raitakari, O; Redline, S; Rimm, E B; Rotter, J I; Smith, A V; Spector, T D; Teumer, A; Uitterlinden, A G; Vohl, M-C; Widen, E; Willemsen, G; Young, T; Zhang, X; Liu, Y; Blangero, J; Boomsma, D I; Gudnason, V; Hu, F; Mangino, M; Martin, N G; O'Connor, G T; Stone, K L; Tanaka, T; Viikari, J; Gharib, S A; Punjabi, N M; Räikkönen, K; Völzke, H; Mignot, E; Tiemeier, H

    2015-10-01

    Usual sleep duration is a heritable trait correlated with psychiatric morbidity, cardiometabolic disease and mortality, although little is known about the genetic variants influencing this trait. A genome-wide association study (GWAS) of usual sleep duration was conducted using 18 population-based cohorts totaling 47 180 individuals of European ancestry. Genome-wide significant association was identified at two loci. The strongest is located on chromosome 2, in an intergenic region 35- to 80-kb upstream from the thyroid-specific transcription factor PAX8 (lowest P=1.1 × 10(-9)). This finding was replicated in an African-American sample of 4771 individuals (lowest P=9.3 × 10(-4)). The strongest combined association was at rs1823125 (P=1.5 × 10(-10), minor allele frequency 0.26 in the discovery sample, 0.12 in the replication sample), with each copy of the minor allele associated with a sleep duration 3.1 min longer per night. The alleles associated with longer sleep duration were associated in previous GWAS with a more favorable metabolic profile and a lower risk of attention deficit hyperactivity disorder. Understanding the mechanisms underlying these associations may help elucidate biological mechanisms influencing sleep duration and its association with psychiatric, metabolic and cardiovascular disease.

  1. Genome-wide association study identifies three novel loci in Fuchs endothelial corneal dystrophy

    PubMed Central

    Afshari, Natalie A.; Igo, Robert P.; Morris, Nathan J.; Stambolian, Dwight; Sharma, Shiwani; Pulagam, V. Lakshmi; Dunn, Steven; Stamler, John F.; Truitt, Barbara J.; Rimmler, Jacqueline; Kuot, Abraham; Croasdale, Christopher R.; Qin, Xuejun; Burdon, Kathryn P.; Riazuddin, S. Amer; Mills, Richard; Klebe, Sonja; Minear, Mollie A.; Zhao, Jiagang; Balajonda, Elmer; Rosenwasser, George O.; Baratz, Keith H; Mootha, V. Vinod; Patel, Sanjay V.; Gregory, Simon G.; Bailey-Wilson, Joan E.; Price, Marianne O.; Price, Francis W.; Craig, Jamie E.; Fingert, John H.; Gottsch, John D.; Aldave, Anthony J.; Klintworth, Gordon K.; Lass, Jonathan H.; Li, Yi-Ju; Iyengar, Sudha K.

    2017-01-01

    The structure of the cornea is vital to its transparency, and dystrophies that disrupt corneal organization are highly heritable. To understand the genetic aetiology of Fuchs endothelial corneal dystrophy (FECD), the most prevalent corneal disorder requiring transplantation, we conducted a genome-wide association study (GWAS) on 1,404 FECD cases and 2,564 controls of European ancestry, followed by replication and meta-analysis, for a total of 2,075 cases and 3,342 controls. We identify three novel loci meeting genome-wide significance (P<5 × 10−8): KANK4 rs79742895, LAMC1 rs3768617 and LINC00970/ATP1B1 rs1200114. We also observe an overwhelming effect of the established TCF4 locus. Interestingly, we detect differential sex-specific association at LAMC1, with greater risk in women, and TCF4, with greater risk in men. Combining GWAS results with biological evidence we expand the knowledge of common FECD loci from one to four, and provide a deeper understanding of the underlying pathogenic basis of FECD. PMID:28358029

  2. Genome-wide association study identifies FCGR2A as a susceptibility locus for Kawasaki disease.

    PubMed

    Khor, Chiea Chuen; Davila, Sonia; Breunis, Willemijn B; Lee, Yi-Ching; Shimizu, Chisato; Wright, Victoria J; Yeung, Rae S M; Tan, Dennis E K; Sim, Kar Seng; Wang, Jie Jin; Wong, Tien Yin; Pang, Junxiong; Mitchell, Paul; Cimaz, Rolando; Dahdah, Nagib; Cheung, Yiu-Fai; Huang, Guo-Ying; Yang, Wanling; Park, In-Sook; Lee, Jong-Keuk; Wu, Jer-Yuarn; Levin, Michael; Burns, Jane C; Burgner, David; Kuijpers, Taco W; Hibberd, Martin L

    2011-11-13

    Kawasaki disease is a systemic vasculitis of unknown etiology, with clinical observations suggesting a substantial genetic contribution to disease susceptibility. We conducted a genome-wide association study and replication analysis in 2,173 individuals with Kawasaki disease and 9,383 controls from five independent sample collections. Two loci exceeded the formal threshold for genome-wide significance. The first locus is a functional polymorphism in the IgG receptor gene FCGR2A (encoding an H131R substitution) (rs1801274; P = 7.35 × 10(-11), odds ratio (OR) = 1.32), with the A allele (coding for histadine) conferring elevated disease risk. The second locus is at 19q13, (P = 2.51 × 10(-9), OR = 1.42 for the rs2233152 SNP near MIA and RAB4B; P = 1.68 × 10(-12), OR = 1.52 for rs28493229 in ITPKC), which confirms previous findings(1). The involvement of the FCGR2A locus may have implications for understanding immune activation in Kawasaki disease pathogenesis and the mechanism of response to intravenous immunoglobulin, the only proven therapy for this disease.

  3. Genome-wide association studies of chronic kidney disease: what have we learned?

    PubMed Central

    O'Seaghdha, Conall M.; Fox, Caroline S.

    2015-01-01

    The past 3 years have witnessed a dramatic expansion in our knowledge of the genetic determinants of estimated glomerular filtration rate (eGFR) and chronic kidney disease (CKD). However, heritability estimates of eGFR indicate that we have only identified a small proportion of the total heritable contribution to the phenotypic variation. The majority of associations reported from genome-wide association studies identify genomic regions of interest and further work will be required to identify the causal variants responsible for a specific phenotype. Progress in this area is likely to stem from the identification of novel risk genotypes, which will offer insight into the pathogenesis of disease and potential novel therapeutic targets. Follow-up studies stimulated by findings from genome-wide association studies of kidney disease are already yielding promising results, such as the identification of an association between urinary uromodulin levels and incident CKD. Although this work is at an early stage, prospects for progress in our understanding of CKD and its treatment look more promising now than at any point in the past. PMID:22143329

  4. Genome-wide association study identifies novel susceptibility loci for cutaneous squamous cell carcinoma

    PubMed Central

    Chahal, Harvind S.; Lin, Yuan; Ransohoff, Katherine J.; Hinds, David A.; Wu, Wenting; Dai, Hong-Ji; Qureshi, Abrar A.; Li, Wen-Qing; Kraft, Peter; Tang, Jean Y.; Han, Jiali; Sarin, Kavita Y.

    2016-01-01

    Cutaneous squamous cell carcinoma represents the second most common cutaneous malignancy, affecting 7–11% of Caucasians in the United States. The genetic determinants of susceptibility to cutaneous squamous cell carcinoma remain largely unknown. Here we report the results of a two-stage genome-wide association study of cutaneous squamous cell carcinoma, totalling 7,404 cases and 292,076 controls. Eleven loci reached genome-wide significance (P<5 × 10−8) including seven previously confirmed pigmentation-related loci: MC1R, ASIP, TYR, SLC45A2, OCA2, IRF4 and BNC2. We identify an additional four susceptibility loci: 11q23.3 CADM1, a metastasis suppressor gene involved in modifying tumour interaction with cell-mediated immunity; 2p22.3; 7p21.1 AHR, the dioxin receptor involved in anti-apoptotic pathways and melanoma progression; and 9q34.3 SEC16A, a putative oncogene with roles in secretion and cellular proliferation. These susceptibility loci provide deeper insight into the pathogenesis of squamous cell carcinoma. PMID:27424798

  5. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

    PubMed Central

    2010-01-01

    Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS") but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq) not to be biological transcription factor binding sites ("empirical TFBS"). We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation. PMID:20875111

  6. Ancestry informative markers for distinguishing between Thai populations based on genome-wide association datasets.

    PubMed

    Vongpaisarnsin, Kornkiat; Listman, Jennifer Beth; Malison, Robert T; Gelernter, Joel

    2015-07-01

    The main purpose of this work was to identify a set of AIMs that stratify the genetic structure and diversity of the Thai population from a high-throughput autosomal genome-wide association study. In this study, more than one million SNPs from the international HapMap database and the Thai depression genome-wide association study have been examined to identify ancestry informative markers (AIMs) that distinguish between Thai populations. An efficient strategy is proposed to identify and characterize such SNPs and to test high-resolution SNP data from international HapMap populations. The best AIMs are identified to stratify the population and to infer genetic ancestry structure. A total of 124 AIMs were clearly clustered geographically across the continent, whereas only 89 AIMs stratified the Thai population from East Asian populations. Finally, a set of 273 AIMs was able to distinguish northern from southern Thai subpopulations. These markers will be of particular value in identifying the ethnic origins in regions where matching by self-reports is unavailable or unreliable, which usually occurs in real forensic cases.

  7. Genome-wide association study identifies 74 loci associated with educational attainment

    PubMed Central

    Okbay, Aysu; Beauchamp, Jonathan P.; Fontana, Mark A.; Lee, James J.; Pers, Tune H.; Rietveld, Cornelius A.; Turley, Patrick; Chen, Guo-Bo; Emilsson, Valur; Meddens, S. Fleur W.; Oskarsson, Sven; Pickrell, Joseph K.; Thom, Kevin; Timshel, Pascal; de Vlaming, Ronald; Abdellaoui, Abdel; Ahluwalia, Tarunveer S.; Bacelis, Jonas; Baumbach, Clemens; Bjornsdottir, Gyda; Brandsma, Johannes H.; Concas, Maria Pina; Derringer, Jaime; Furlotte, Nicholas A.; Galesloot, Tessel E.; Girotto, Giorgia; Gupta, Richa; Hall, Leanne M.; Harris, Sarah E.; Hofer, Edith; Horikoshi, Momoko; Huffman, Jennifer E.; Kaasik, Kadri; Kalafati, Ioanna P.; Karlsson, Robert; Kong, Augustine; Lahti, Jari; van der Lee, Sven J.; de Leeuw, Christiaan; Lind, Penelope A.; Lindgren, Karl-Oskar; Liu, Tian; Mangino, Massimo; Marten, Jonathan; Mihailov, Evelin; Miller, Michael B.; van der Most, Peter J.; Oldmeadow, Christopher; Payton, Antony; Pervjakova, Natalia; Peyrot, Wouter J.; Qian, Yong; Raitakari, Olli; Rueedi, Rico; Salvi, Erika; Schmidt, Börge; Schraut, Katharina E.; Shi, Jianxin; Smith, Albert V.; Poot, Raymond A.; Pourcain, Beate; Teumer, Alexander; Thorleifsson, Gudmar; Verweij, Niek; Vuckovic, Dragana; Wellmann, Juergen; Westra, Harm-Jan; Yang, Jingyun; Zhao, Wei; Zhu, Zhihong; Alizadeh, Behrooz Z.; Amin, Najaf; Bakshi, Andrew; Baumeister, Sebastian E.; Biino, Ginevra; Bønnelykke, Klaus; Boyle, Patricia A.; Campbell, Harry; Cappuccio, Francesco P.; Davies, Gail; De Neve, Jan-Emmanuel; Deloukas, Panos; Demuth, Ilja; Ding, Jun; Eibich, Peter; Eisele, Lewin; Eklund, Niina; Evans68, David M.; Faul, Jessica D.; Feitosa, Mary F.; Forstner, Andreas J.; Gandin, Ilaria; Gunnarsson, Bjarni; Halldórsson, Bjarni V.; Harris, Tamara B.; Heath, Andrew C.; Hocking, Lynne J.; Holliday, Elizabeth G.; Homuth, Georg; Horan, Michael A.; Hottenga, Jouke-Jan; de Jager, Philip L.; Joshi, Peter K.; Jugessur, Astanand; Kaakinen, Marika A.; Kähönen, Mika; Kanoni, Stavroula; Keltigangas-Järvinen, Liisa; Kiemeney, Lambertus A.L.M.; Kolcic, Ivana; Koskinen, Seppo; Kraja, Aldi T.; Kroh, Martin; Kutalik, Zoltan; Latvala, Antti; Launer, Lenore J.; Lebreton, Maël P.; Levinson, Douglas F.; Lichtenstein, Paul; Lichtner, Peter; Liewald, David C.M.; Loukola, Anu; Madden, Pamela A.; Mägi, Reedik; Mäki-Opas, Tomi; Marioni, Riccardo E.; Marques-Vidal, Pedro; Meddens, Gerardus A.; McMahon, George; Meisinger, Christa; Meitinger, Thomas; Milaneschi, Yusplitri; Milani, Lili; Montgomery, Grant W.; Myhre, Ronny; Nelson, Christopher P.; Nyholt, Dale R.; Ollier, William E.R.; Palotie, Aarno; Paternoster, Lavinia; Pedersen, Nancy L.; Petrovic, Katja E.; Porteous, David J.; Räikkönen, Katri; Ring, Susan M.; Robino, Antonietta; Rostapshova, Olga; Rudan, Igor; Rustichini, Aldo; Salomaa, Veikko; Sanders, Alan R.; Sarin, Antti-Pekka; Schmidt, Helena; Scott, Rodney J.; Smith, Blair H.; Smith, Jennifer A.; Staessen, Jan A.; Steinhagen-Thiessen, Elisabeth; Strauch, Konstantin; Terracciano, Antonio; Tobin, Martin D.; Ulivi, Sheila; Vaccargiu, Simona; Quaye, Lydia; van Rooij, Frank J.A.; Venturini, Cristina; Vinkhuyzen, Anna A.E.; Völker, Uwe; Völzke, Henry; Vonk, Judith M.; Vozzi, Diego; Waage, Johannes; Ware, Erin B.; Willemsen, Gonneke; Attia, John R.; Bennett, David A.; Berger, Klaus; Bertram, Lars; Bisgaard, Hans; Boomsma, Dorret I.; Borecki, Ingrid B.; Bultmann, Ute; Chabris, Christopher F.; Cucca, Francesco; Cusi, Daniele; Deary, Ian J.; Dedoussis, George V.; van Duijn, Cornelia M.; Eriksson, Johan G.; Franke, Barbara; Franke, Lude; Gasparini, Paolo; Gejman, Pablo V.; Gieger, Christian; Grabe, Hans-Jörgen; Gratten, Jacob; Groenen, Patrick J.F.; Gudnason, Vilmundur; van der Harst, Pim; Hayward, Caroline; Hinds, David A.; Hoffmann, Wolfgang; Hyppönen, Elina; Iacono, William G.; Jacobsson, Bo; Järvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Kaprio, Jaakko; Kardia, Sharon L.R.; Lehtimäki, Terho; Lehrer, Steven F.; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; Metspalu, Andres; Pendleton, Neil; Penninx, Brenda W.J.H.; Perola, Markus; Pirastu, Nicola; Pirastu, Mario; Polasek, Ozren; Posthuma, Danielle; Power, Christine; Province, Michael A.; Samani, Nilesh J.; Schlessinger, David; Schmidt, Reinhold; Sørensen, Thorkild I.A.; Spector, Tim D.; Stefansson, Kari; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tiemeier, Henning; Tung, Joyce Y.; Uitterlinden, André G.; Vitart, Veronique; Vollenweider, Peter; Weir, David R.; Wilson, James F.; Wright, Alan F.; Conley, Dalton C.; Krueger, Robert F.; Smith, George Davey; Hofman, Albert; Laibson, David I.; Medland, Sarah E.; Meyer, Michelle N.; Yang, Jian; Johannesson, Magnus; Visscher, Peter M.; Esko, Tõnu; Koellinger, Philipp D.; Cesarini, David; Benjamin, Daniel J.

    2016-01-01

    Summary Educational attainment (EA) is strongly influenced by social and other environmental factors, but genetic factors are also estimated to account for at least 20% of the variation across individuals1. We report the results of a genome-wide association study (GWAS) for EA that extends our earlier discovery sample1,2 of 101,069 individuals to 293,723 individuals, and a replication in an independent sample of 111,349 individuals from the UK Biobank. We now identify 74 genome-wide significant loci associated with number of years of schooling completed. Single-nucleotide polymorphisms (SNPs) associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioral phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because EA is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric disease. PMID:27225129

  8. Genome-wide investigation of schizophrenia associated plasma Ndel1 enzyme activity.

    PubMed

    Gadelha, Ary; Coleman, Jonathan; Breen, Gerome; Mazzoti, Diego Robles; Yonamine, Camila M; Pellegrino, Renata; Ota, Vanessa Kiyomi; Belangero, Sintia Iole; Glessner, Joseph; Sleiman, Patrick; Hakonarson, Hakon; Hayashi, Mirian A F; Bressan, Rodrigo A

    2016-04-01

    Ndel1 is a DISC1-interacting oligopeptidase that cleaves in vitro neuropeptides as neurotensin and bradykinin, and which has been associated with both neuronal migration and neurite outgrowth. We previously reported that plasma Ndel1 enzyme activity is lower in patients with schizophrenia (SCZ) compared to healthy controls (HCs). To our knowledge, no previous study has investigated the genetic factors associated with the plasma Ndel1 enzyme activity. In the current analyses, samples from 83 SCZ patients and 92 control subjects that were assayed for plasma Ndel1 enzyme activity were genotyped on Illumina Omni Express arrays. A genetic relationship matrix using genome-wide information was then used for ancestry correction, and association statistics were calculated genome-wide. Ndel1 enzyme activity was significantly lower in patients with SCZ (t=4.9; p<0.001) and was found to be associated with CAMK1D, MAGI2, CCDC25, and GABGR3, at a level of suggestive significance (p<10(-6)), independent of the clinical status. Then, we performed a model to investigate the observed differences for case/control measures. 2 SNPs at region 1p22.2 reached the p<10(-7) level. ZFPM2 and MAD1L1 were the only two genes with more than one hit at 10(-6) order of p value. Therefore, Ndel1 enzyme activity is a complex trait influenced by many different genetic variants that may contribute to SCZ physiopathology.

  9. Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes

    PubMed Central

    Choi, Sungkyoung; Bae, Sunghwan

    2016-01-01

    The success of genome-wide association studies (GWASs) has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which has been a big hurdle in building risk prediction models. Recently, many statistical approaches based on penalized regression have been developed to solve the “large p and small n” problem. In this report, we evaluated the performance of several statistical methods for predicting a binary trait: stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN). We first built a prediction model by combining variable selection and prediction methods for type 2 diabetes using Affymetrix Genome-Wide Human SNP Array 5.0 from the Korean Association Resource project. We assessed the risk prediction performance using area under the receiver operating characteristic curve (AUC) for the internal and external validation datasets. In the internal validation, SLR-LASSO and SLR-EN tended to yield more accurate predictions than other combinations. During the external validation, the SLR-SLR and SLR-EN combinations achieved the highest AUC of 0.726. We propose these combinations as a potentially powerful risk prediction model for type 2 diabetes. PMID:28154504

  10. Genome-Wide DNA Methylation in Mixed Ancestry Individuals with Diabetes and Prediabetes from South Africa

    PubMed Central

    Pheiffer, Carmen; Humphries, Stephen E.; Gamieldien, Junaid; Erasmus, Rajiv T.

    2016-01-01

    Aims. To conduct a genome-wide DNA methylation in individuals with type 2 diabetes, individuals with prediabetes, and control mixed ancestry individuals from South Africa. Methods. We used peripheral blood to perform genome-wide DNA methylation analysis in 3 individuals with screen detected diabetes, 3 individuals with prediabetes, and 3 individuals with normoglycaemia from the Bellville South Community, Cape Town, South Africa, who were age-, gender-, body mass index-, and duration of residency-matched. Methylated DNA immunoprecipitation (MeDIP) was performed by Arraystar Inc. (Rockville, MD, USA). Results. Hypermethylated DMRs were 1160 (81.97%) and 124 (43.20%), respectively, in individuals with diabetes and prediabetes when both were compared to subjects with normoglycaemia. Our data shows that genes related to the immune system, signal transduction, glucose transport, and pancreas development have altered DNA methylation in subjects with prediabetes and diabetes. Pathway analysis based on the functional analysis mapping of genes to KEGG pathways suggested that the linoleic acid metabolism and arachidonic acid metabolism pathways are hypomethylated in prediabetes and diabetes. Conclusions. Our study suggests that epigenetic changes are likely to be an early process that occurs before the onset of overt diabetes. Detailed analysis of DMRs that shows gradual methylation differences from control versus prediabetes to prediabetes versus diabetes in a larger sample size is required to confirm these findings. PMID:27555869

  11. Search and Validation of Short Genome-Wide Biomarkers for Bacterial Biological Phylogenies

    NASA Astrophysics Data System (ADS)

    Garzon, Max H.; Wong, Tit-Yee

    We continue the exploration of DNA-based indexing as a universal coordinate system in DNA spaces to characterize very large groups (families, genera, and even phylla) of organisms on a uniform biomarker reference system, a comprehensive "Atlas of Life", as it is or as it could be on earth. We provide a second confirmation that DNA noncrosshybridizing (nxh) sets can be successfully applied to infer ab-initio phylogenetic trees by providing a method to measure distances among entire genomes indexed by sets of short oligonucleotides selected so as to minimize crosshybridization. These phylogenies are solidly established and well accepted in bacterial biology, albeit done by analyses of relatively small segments of highly conserved rybozomic DNA. Second, it is further demonstrated that DNA indexing does provide novel and principled genome-wide predictions into the phylogenesis of organisms hitherto inaccessible by current methods, such as a prediction of the origin of the Salmonella plasmid 50 as being acquired horizontally, likely from some bacteria somewhat related to Yesinia. We conclude with some discussion about the scalability and potential of this method to develop a comprehensive tree of life based on genome-wide methods.

  12. Yeast genome-wide screen reveals dissimilar sets of host genes affecting replication of RNA viruses

    PubMed Central

    Panavas, Tadas; Serviene, Elena; Brasher, Jeremy; Nagy, Peter D.

    2005-01-01

    Viruses are devastating pathogens of humans, animals, and plants. To further our understanding of how viruses use the resources of infected cells, we systematically tested the yeast single-gene-knockout library for the effect of each host gene on the replication of tomato bushy stunt virus (TBSV), a positive-strand RNA virus of plants. The genome-wide screen identified 96 host genes whose absence either reduced or increased the accumulation of the TBSV replicon. The identified genes are involved in the metabolism of nucleic acids, lipids, proteins, and other compounds and in protein targeting/transport. Comparison with published genome-wide screens reveals that the replication of TBSV and brome mosaic virus (BMV), which belongs to a different supergroup among plus-strand RNA viruses, is affected by vastly different yeast genes. Moreover, a set of yeast genes involved in vacuolar targeting of proteins and vesicle-mediated transport both affected replication of the TBSV replicon and enhanced the cytotoxicity of the Parkinson's disease-related α-synuclein when this protein was expressed in yeast. In addition, a set of host genes involved in ubiquitin-dependent protein catabolism affected both TBSV replication and the cytotoxicity of a mutant huntingtin protein, a candidate agent in Huntington's disease. This finding suggests that virus infection and disease-causing proteins might use or alter similar host pathways and may suggest connections between chronic diseases and prior virus infection. PMID:15883361

  13. Genome-wide association study identifies six new loci influencing pulse pressure and mean arterial pressure

    PubMed Central

    Wain, Louise V; Verwoert, Germaine C; O’Reilly, Paul F; Shi, Gang; Johnson, Toby; Johnson, Andrew D; Bochud, Murielle; Rice, Kenneth M; Henneman, Peter; Smith, Albert V; Ehret, Georg B; Amin, Najaf; Larson, Martin G; Mooser, Vincent; Hadley, David; Dörr, Marc